About me

I am a CS PhD student at UIUC, advised by Prof. Tong Zhang and Prof. Huan Zhang. Previously, I earned my bachelor’s and master’s degrees from the Department of Automation at Tsinghua University and CSE, HKUST. My current research focuses on agents, trustworthy LLMs/VLMs, and deep reinforcement learning.

Prior to my PhD, I was fortunate to work closely with Prof. Chongjie Zhang (Washington University in St. Louis), Dr. Lei Han (Noitom Robotics, formerly Tencent AI Lab), and Prof. Meng Fang (University of Liverpool).


News

  • 🌟 (2025.10) We released Embodied Reasoning Agent (ERA), a training recipe for VLM-based embodied agents with enhanced reasoning and grounding capability. Explore more on our project page.
  • 🎉 (2025.9) GUI-Actor and ADG are accepted to NeurIPS 2025! MergeBench is accepted to the Datasets & Benchmarks Track! Congrats to all co-authors!
  • 🎉 (2025.8) MiCRo is accepted to the EMNLP 2025 main conference with an award nomination!
  • 🌟 (2025.6) We’ve released GUI-Actor, a novel GUI grounding model that combines an attention-based action head with a grounding verifier. Explore more on our project page!
  • 💻 (2025.5) Starting my internship at Microsoft Research, Redmond.
  • 🎉 (2025.5) Decomposed Reward Models (DRMs) is accepted to ACL 2025.
  • 🎉 (2025.5) EmbodiedBench is accepted to ICML 2025 as an oral paper! Thanks to my co-authors!
  • 🌟 (2025.02) We released EmbodiedBench, a new comprehensive and multifaceted benchmark for multimodal embodied agents. Check out our paper and project page.
  • 🎉 (2025.1) Robust Decision Transformer and DynaMath are accepted by ICLR 2025! Updated versions will be posted soon.
  • 🌟 (2024.10) A dynamic visual math benchmark is out! Check the project page and the DynaMath paper.
  • 🎉 (2024.9) GRM is accepted by NeurIPS 2024! Check out our GRM series here.
  • 🎉 (2024.5) Rewards-in-Context (RiC) is accepted by ICML 2024! Thanks to my co-authors!
  • 🎉 (2024.1) Robust IQL is accepted by ICLR 2024 as a spotlight paper!

Selected Publications

Multimodal GUI Agent and Embodied Agent

Multimodal Math Reasoning

ML for LLMs

Robust Offline RL

Goal-conditioned RL


Experiences

  • Research Intern at Microsoft Research, 2025.

  • Research Intern at Tencent AI Lab and Robotics X Lab, 2020-2022 (Multiple internship terms).

  • ML Intern at Meituan Financial Service Group, 2019.

Services

Conference Reviewer: ICML, ICLR, NeurIPS ($\color{red}{\text{Top Reviewer}}$ in NeurIPS 2023), ACL/ARR, ICRA, AAMAS.

Journal Reviewer: IEEE Robotics and Automation Letters (RA-L), IEEE Transactions on Neural Networks and Learning Systems (TNNLS), IEEE Transactions on Artificial Intelligence (TAI), Machine Learning, Journal of Artificial Intelligence Research.

Teaching Assistant: COMP 4211 Machine Learning, HKUST; COMP 1021 Introduction to Computer Science, HKUST.

Hobbies

In my leisure time, I enjoy sports like running, table tennis, and swimming. During my time at Tsinghua University, I was an amateur long-distance runner. In 2019, I completed a half marathon (21.0975 km) in 1 h 30 min and a full marathon (42.195 km) in 3 h 36 min. However, since starting my PhD I haven’t had time for regular running training, so I’ve let it slide. Hopefully I’ll get a chance to update my record once I graduate 🙂.