About me

I am a CS PhD student at UIUC, advised by Prof. Tong Zhang and Prof. Huan Zhang. Previously, I earned my bachelor's and master's degrees from the Department of Automation at Tsinghua University and CSE, HKUST. My current research focuses on agents, trustworthy LLMs/VLMs, and deep reinforcement learning.

Prior to my PhD, I was fortunate to work closely with Prof. Chongjie Zhang (Washington University in St. Louis), Dr. Lei Han (Tencent AI Lab), and Prof. Meng Fang (University of Liverpool).


News

  • 🌟 (2025.6) We've released GUI-Actor, a novel GUI grounding model that combines an attention-based action head with a grounding verifier. Explore more on our project page!
  • 💻 (2025.5) Starting my internship at Microsoft Research, Redmond.
  • 🎉 (2025.5) Decomposed Reward Models (DRMs) is accepted to ACL 2025.
  • 🎉 (2025.5) EmbodiedBench is accepted to ICML 2025 as an oral paper! Thanks to my co-authors!
  • 🌟 (2025.02) We released EmbodiedBench, a new comprehensive and multifaceted benchmark for multimodal embodied agents. Check out our paper and project page.
  • 🎉 (2025.1) Robust Decision Transformer and DynaMath are accepted by ICLR 2025! New versions will be updated soon.
  • 🌟 (2024.10) A dynamic visual math benchmark is out! Check the project page and the DynaMath paper.
  • 🎉 (2024.9) GRM is accepted by NeurIPS 2024! Check out our GRM series here.
  • 🎉 (2024.5) Rewards-in-Context (RiC) is accepted by ICML 2024! Thanks to my co-authors!
  • 🎉 (2024.1) Robust IQL is accepted by ICLR 2024 as a spotlight paper!

Selected Publications

Multimodal GUI Agent

Multimodal Embodied Agent

Multimodal Math Reasoning

ML for LLMs

Robust Offline RL

Goal-conditioned RL


Experiences

  • Research Intern at Microsoft Research, 2025.

  • Research Intern at Tencent AI Lab and Robotics X Lab, 2020-2022 (Multiple internship terms).

  • ML Intern at Meituan Financial Service Group, 2019.

Services

Conference Reviewer: ICML, ICLR, NeurIPS (Top Reviewer in NeurIPS 2023), ACL/ARR, ICRA, AAMAS.

Journal Reviewer: IEEE Robotics and Automation Letters (RA-L), IEEE Transactions on Artificial Intelligence (TAI), Machine Learning, Journal of Artificial Intelligence Research.

Teaching Assistant: COMP 4211 Machine Learning, HKUST; COMP 1021 Introduction to Computer Science, HKUST.

Hobbies

In my leisure time, I enjoy sports like running, table tennis, and swimming. During my time at Tsinghua University, I was an amateur long-distance runner. In 2019, I completed a half marathon (21.0975 km) in 1 h 30 m and a full marathon (42.195 km) in 3 h 36 m.