Xiuyu Li

I am a Ph.D. student affiliated with Berkeley AI Research (BAIR) at UC Berkeley, advised by Prof. Kurt Keutzer. Previously, I received a B.A. in Computer Science and Math from Cornell University. During my undergrad years, I was fortunate to work with Prof. Zhiru Zhang, Prof. Vitaly Shmatikov, and Prof. Song Han.

Email: xiuyu [at] berkeley [dot] edu


Research


My recent interests are in enhancing the reasoning capabilities of large language models (LLMs) and developing scalable AI agents. More broadly, my research focuses on improving the efficiency of LLMs, vision-language models (VLMs), and diffusion models from both systems and algorithmic perspectives.

Selected Publications

For the most up-to-date list of publications, please see Google Scholar.
* indicates equal contribution

LLoCO: Learning Long Contexts Offline
Sijun Tan*, Xiuyu Li*, Shishir Patil, Ziyang Wu, Tianjun Zhang, Kurt Keutzer, Joseph E. Gonzalez, Raluca Ada Popa
EMNLP, 2024
[abs]  [paper]  [code]

Q-Diffusion: Quantizing Diffusion Models
Xiuyu Li, Yijiang Liu, Long Lian, Huanrui Yang, Zhen Dong, Daniel Kang, Shanghang Zhang, Kurt Keutzer
ICCV, 2023
[abs]  [paper]  [code]  [website]  [talk]
Integration: NVIDIA TensorRT

SqueezeLLM: Dense-and-Sparse Quantization
Sehoon Kim*, Coleman Hooper*, Amir Gholami*, Zhen Dong, Xiuyu Li, Sheng Shen, Michael W. Mahoney, Kurt Keutzer
ICML, 2024
[abs]  [paper]  [code]
Integration: Intel oneAPI


TorchSparse: Efficient Point Cloud Inference Engine
Haotian Tang*, Zhijian Liu*, Xiuyu Li*, Yujun Lin, Song Han
MLSys, 2022
[abs]  [paper]  [code]  [website]


Talks