I am a Ph.D. student affiliated with Berkeley AI Research (BAIR) at UC Berkeley,
advised by Prof. Kurt Keutzer. Previously, I received a B.A. in Computer Science and Math from Cornell University. During my undergrad years, I was fortunate to work with Prof. Zhiru Zhang, Prof. Vitaly Shmatikov, and Prof. Song Han.
Email: xiuyu [at] berkeley [dot] edu
My current research interests include enhancing the reasoning capabilities of large language models (LLMs) and developing scalable AI agents. More broadly, I work on improving the efficiency of LLMs, vision-language models (VLMs), and diffusion models from both algorithmic and systems perspectives.
A Mamba-2.8B model fine-tuned with DPO. It is one of the most downloaded Mamba models on Hugging Face.