
CS & Applied Math @ Brown | Reinforcement Learning, Continual Learning, Partial Observability
I am a fourth year undergraduate student concentrated in Computer Science and Applied Mathematics at Brown University. I currently conduct research in the Intelligent Robot Lab (IRL)advised by Prof. George Konidaris. My current research interest lies in continual learning and reinforcement learning in partially observable environments. Outside of research, I enjoy everything about science-fiction—favorites include Interstellar, Westworld, and The Three-Body Problem.
B.S. in Computer Science & Applied Mathematics (Honors)

Naicheng He*, Kaicheng Guo*, Arjun Prakash*, Saket Tiwari, Ruo Yu Tao, Tyrone Serapio, Amy Greenwald, George Konidaris
Under review at ICLR 2026; Accepted at NeurIPS ARLET Workshop 2025
We investigate why deep neural networks suffer from loss of plasticity in deep continual learning, failing to learn new tasks without reinitializing parameters. We show that this failure is preceded by Hessian spectral collapse at new-task initialization, where meaningful curvature directions vanish and gradient descent becomes ineffective. To characterize the necessary condition for successful training, we introduce the notion of τ-trainability and show that current plasticity preserving algorithms can be unified under this framework. Targeting spectral collapse directly, we then discuss the Kronecker factored approximation of the Hessian, which motivates two regularization enhancements: maintaining high effective feature rank and applying L2 penalties. Experiments on continual supervised and reinforcement learning tasks confirm that combining these two regularizers effectively preserves plasticity.

Ruo Yu Tao*, Kaicheng Guo*, Cameron Allen, George Konidaris
Reinforcement Learning Conference (RLC 2025)
Mitigating partial observability is a necessary but challenging task for general reinforcement learning algorithms. To improve an algorithm's ability to mitigate partial observability, researchers need comprehensive benchmarks to gauge progress. Most algorithms tackling partial observability are only evaluated on benchmarks with simple forms of state aliasing, such as feature masking and Gaussian noise. Such benchmarks do not represent the many forms of partial observability seen in real domains, like visual occlusion or unknown opponent intent. We argue that a partially observable benchmark should have two key properties. The first is coverage in its forms of partial observability, to ensure an algorithm's generalizability. The second is a large gap between the performance of agents with more or less state information, all other factors roughly equal. This gap implies that an environment is memory improvable: where performance gains in a domain are from an algorithm's ability to cope with partial observability as opposed to other factors. We introduce best-practice guidelines for empirically benchmarking reinforcement learning under partial observability, as well as the open-source library POBAX: Partially Observable Benchmarks in JAX.

Brown University


Brown University


Brown University
Brown University

Robotics Institute, Carnegie Mellon University
PI: Maxim Likhachev
Brown University
Lecturer: Michael Littman