I am a second-year PhD student in Computer Science and Engineering at the University of California, Santa Cruz
, advised by Yang Liu
and Jeffrey Flanigan
. I work on large language model alignment. I am also interested in post-training control and mechanistic interpretability of large language models. I am currently a research intern at Skywork AI
, working on alignment.