About Me

Zelei Cheng

 

Mudd 3303

Department of Computer Science

Northwestern University

Email: zelei.cheng@northwestern.edu

I am a PhD candidate in Computer Science at Northwestern University advised by Professor Xinyu Xing. I earned my master’s degree from Purdue University and my bachelor’s degree from Beijing University of Posts and Telecommunications (BUPT). If you are interested in my research, please do not hesitate to contact me via email!

My research interests include

  • Reinforcement Learning (Explainable RL, In-context RL, RLHF)
  • Large Language Model Safety (Toxicty, Jailbreak, Prompt Injection)
  • AI for Software/System Security

Selected Publications

  • Zelei Cheng, Xian Wu, Jiahao Yu, Xin-Qiang Cai, Shuo Han, Xinyu Xing

Soft-Label Integration for Robust Toxicity Classification

In Proc. of NeurIPS, 2024.

  • Zelei Cheng, Xian Wu, Jiahao Yu, Sabrina Yang, Gang Wang, Xinyu Xing

RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation

In Proc. of ICML, 2024. (Spotlight)

  • Zelei Cheng, Xian Wu, Jiahao Yu, Wenhai Sun, Wenbo Guo, Xinyu Xing

StateMask: Explainable Reinforcement Learning through State Mask

In Proc. of NeurIPS, 2023.