About Me

I am a second-year Computer Science Ph.D. student at the University of Wisconsin-Madison. I received dual bachelor's degrees from a joint program of the University of Electronic Science and Technology of China (UESTC) and the University of Glasgow.

My research focuses on the post-training stages of large foundation models, with the aim of making them more efficient and intelligent. Recently, I have become interested in diffusion language models, which offer inherent advantages such as parallel decoding and built-in error correction and could reshape future paradigms of language modeling.

As an impact-driven researcher, I want my work to advance not only methodology but also the ways large foundation models positively transform science, technology, and society.

Projects

A 0.5B-parameter diffusion coding model with a pretraining pipeline, inference code, checkpoints, and an evaluation suite.
Open-Source Project

Preprints

Fred Zhangzhi Peng, Zachary Bezemek, Jarrid Rector-Brooks, Shuibai Zhang, Anru R. Zhang, Michael Bronstein, Avishek Joey Bose, Alexander Tong
ArXiv
Wonjun Kang, Kevin Galim, Seunghyuk Oh, Minjae Lee, Yuchen Zeng, Shuibai Zhang, Coleman Hooper, Yuezhou Hu, Hyung Il Koo, Nam Ik Cho, Kangwook Lee
ArXiv
Yuchen Zeng*, Shuibai Zhang*, Wonjun Kang*, et al., Dimitris Papailiopoulos, Kangwook Lee
ArXiv (* equal contribution)

Papers

Thomas Zeng, Shuibai Zhang, et al., Kannan Ramchandran, Dimitris Papailiopoulos, Kangwook Lee
ICML 2025 Oral Presentation
Yue Yang*, Shuibai Zhang*, Wenqi Shao*, Kaipeng Zhang, Yi Bin, Yu Wang, Ping Luo
ICLR 2025 Oral Presentation (* equal contribution)
Weiyun Wang*, Shuibo Zhang*, Yiming Ren*, Yuchen Duan*, Tiantong Li*, et al., Wenhai Wang
NeurIPS 2024 Poster (* equal contribution)
Linyi Yang*, Shuibai Zhang*, Zhuohao Yu*, et al., Xing Xie, Weizhu Chen, Yue Zhang
ICLR 2024 Poster (* equal contribution)
Linyi Yang*, Shuibai Zhang*, et al., Jindong Wang, Xing Xie, Yue Zhang
ACL 2023 Findings (* equal contribution)

Work Experience

Research Intern
Krafton AI, Madison
Jul 2025 - Sep 2025
Advisor: Jaewoong Cho
Research Assistant
Shanghai AI Laboratory, Shanghai
May 2024 - Aug 2024
Advisor: Wenqi Shao
Research Assistant
WestlakeNLP Lab, Westlake University, Hangzhou
Jun 2023 - Apr 2024
Advisor: Yue Zhang

Teaching Experience

Teaching Assistant - COMP SCI 220 Data Science Programming I
University of Wisconsin-Madison
Spring 2025
Teaching Assistant - COMP SCI 300 Programming II
University of Wisconsin-Madison
Fall 2024