About me

I am a CS PhD student at UIUC, advised by Prof. Tong Zhang and Prof. Huan Zhang. Previously, I earned my bachelor’s and master’s degrees from the Department of Automation at Tsinghua University and CSE, HKUST. My research interests lie in deep reinforcement learning (RL) and the application of RL algorithms to Large Language Models (LLMs). I am furtunate to have been working closely with Prof. Chongjie Zhang (Washington University in St. Louis), Dr. Lei Han (Tencent AI Lab), and Prof. Meng Fang (University of Liverpool).

Currently, I am actively researching ways to improve the robustness and generalization abilities of deep reinforcement learning, while also trying to enhance the trustworthiness of LLMs. Feel free to contact me by email if you are interested in discussing or collaborating with me.

News

  • 🎉 (2024.9) GRM is accepted by NeurIPS 2024!
  • 🎉 (2024.5) Rewards-in-Context (RiC) is accepted by ICML 2024! Thanks to my co-authors!
  • 🎉 (2024.5) GOPlan is accepted by Transactions on Machine Learning Research (TMLR)!
  • 🎉 (2024.1) Robust IQL is accepted by ICLR 2024 as a spotlight paper!

Selected Publications

RL for LLMs

Robust Offline RL

Goal-conditioend RL

Experiences

  • Internship at Tencent AI Lab

  • Internship at Meituan Financial Service Group

Services

Conference Reviewer: ICML (22,24), ICLR (24, 25), NeurIPS (22,23 $\color{red}{\text{Top Reviewer}}$, 24), ICRA (23), AAMAS(24).

Journal Reviewer: IEEE Robotics and Automation Letters (RA-L), IEEE Transactions on Artificial Intelligence (TAI), Machine Learning, Journal of Artificial Intelligence Research.

Teaching Assistant: COMP 4211 Machine Learning, HKUST; COMP 1021 Introduction to Computer Science, HKUST

Others

In my leisure time, I enjoy sports like running, table tennis, and swimming. During my time at Tsinghua University, I was an amateur long-distance runner. In 2019, I completed a half marathon (21.0975 km) in 1 h 30 m and a full marathon (42.195 km) in 3 h 36 m.