I am a fourth-year PhD student at the NYU Center for Data Science, advised by Kyunghyun Cho and Tal Linzen. I am supported by the NSF Graduate Research Fellowship.
I study how to train and adapt large language models; see [Aioli] and [pre-pretraining].
I’m also interested in cognitive science and training dynamics:
- Cognitive science: [On human-scale LMs], [BabyLM]
- Training dynamics: [Latent state models], [Visualization]
Previously, I completed a BSE in computer science at Princeton, where I spent two lovely years working with Karthik Narasimhan and Tom Griffiths. I then spent two years at Yobi AI as its first employee.
In my spare time, I enjoy cooking, running, and playing basketball.