I am a Ph.D. candidate affiliated with Berkeley AI Research (BAIR) at UC Berkeley,
advised by Prof. Kurt Keutzer. Previously, I received a B.A. in Computer Science and Math from Cornell University. During my undergrad years, I was fortunate to work with Prof.
Zhiru Zhang, Prof.
Vitaly Shmatikov, and Prof.
Song Han.
Email: xiuyu [at] berkeley [dot] edu
📧 I am actively seeking full-time member of technical staff and AI researcher/engineer positions in the industry. Feel free to reach out to me via email!
My current research interests are enhancing the reasoning capabilities of large language models (LLMs) and developing scalable AI agents. This pursuit is built on my broader expertise in making generative models more efficient in both training and inference across language and vision.
NVIDIA
Bain Capital Ventures
Salesforce AI Research FutureForum
A Mamba-2.8B model finetuned with DPO. It is one of the most downloaded Mamba models on Hugging Face.