I am a research scientist at Databricks Mosaic Research working on agents, reinforcement learning, and retrieval.
I graduated summa cum laude from Cornell University with a B.S. in Computer Science where I was fortunate to work with professors Wen Sun, Robert Kleinberg, and Kianté Brantley.
I was also previously a research intern at Databricks Mosaic Research, NVIDIA, and a software engineering intern at DRW.
Outside of research, I enjoy mathematics, art, music, literature, and drone photography. A picture of me can be found here.
News: I received the Cornell Bowers 2026 Computer Science Prize for Academic Excellence! I was also awarded (but declined) the NSF Graduate Research Fellowship.
Publications
See my Google Scholar for the most up-to-date list.
We present a system for training enterprise search agents via reinforcement learning that achieves state-of-the-art performance across a diverse suite of hard-to-verify agentic search tasks. Our work makes four core cont…
Recent work has shown that for particular combinations of base model and training algorithm, *reinforcement learning with random rewards* (RLRR) improves the performance of LLMs on certain math reasoning benchmarks. This…
The controllable generation of diffusion models aims to steer the model to generate samples that optimize some given objective functions. It is desirable for a variety of applications including image generation, molecule…
While originally developed for continuous control problems, Proximal Policy Optimization (PPO) has emerged as the work-horse of a variety of reinforcement learning (RL) applications, including the fine-tuning of generati…
dithered image from my trip to the humble administrator's garden. suzhou, china