About Me
I am a second-year Computer Science Ph.D. student at the University of Wisconsin-Madison.
I received dual bachelor's degrees from a joint program of the University of Electronic Science and Technology of China (UESTC) and the University of Glasgow.
My research focuses on the post-training stages of large foundation models, with the aim of making them more efficient and intelligent.
Recently, I have become interested in diffusion language models, which offer inherent advantages such as parallel decoding and built-in error correction and may reshape future paradigms of language modeling.
As an impact-driven researcher, I want my work to advance not only methodology but also the ways large foundation models positively transform science, technology, and society.
Projects
A 0.5B diffusion coding model with a pretraining pipeline, inference code, checkpoints, and an evaluation suite.
Blog Post
Papers
Yuchen Zeng*, Shuibai Zhang*, Wonjun Kang*, et al., Dimitris Papailiopoulos, Kangwook Lee
Under Review
* equal contribution
Thomas Zeng, Shuibai Zhang, et al., Kannan Ramchandran, Dimitris Papailiopoulos, Kangwook Lee
ICML 2025 Oral Presentation
Yue Yang*, Shuibai Zhang*, Wenqi Shao*, Kaipeng Zhang, Yi Bin, Yu Wang, Ping Luo
ICLR 2025 Oral Presentation
* equal contribution
Weiyun Wang*, Shuibo Zhang*, Yiming Ren*, Yuchen Duan*, Tiantong Li*, et al., Wenhai Wang
NeurIPS 2024 Poster
* equal contribution
Linyi Yang*, Shuibai Zhang*, Zhuohao Yu*, et al., Xing Xie, Weizhu Chen, Yue Zhang
ICLR 2024 Poster
* equal contribution
Linyi Yang*, Shuibai Zhang*, et al., Jindong Wang, Xing Xie, Yue Zhang
ACL 2023 Findings
* equal contribution
Work Experience
Research Intern
Krafton AI, Madison
Jul 2025 - Present
Research Assistant
Shanghai AI Laboratory, Shanghai
May 2024 - Aug 2024
Research Assistant
WestlakeNLP Lab, Westlake University, Hangzhou
Jun 2023 - Apr 2024
Teaching Experience
Teaching Assistant - COMP SCI 220 Data Science Programming I
University of Wisconsin-Madison
Spring 2025
Teaching Assistant - COMP SCI 300 Programming II
University of Wisconsin-Madison
Fall 2024