About me

I’m a first-year Ph.D. student in the Department of Computer Science and Engineering (CSE) at the Hong Kong University of Science and Technology, supervised by Prof. Tong Zhang. I received my master’s and bachelor’s degrees from the Department of Automation at Tsinghua University. My research interests lie in deep reinforcement learning (RL), especially goal-conditioned RL, offline RL, and model-based RL. I’m also interested in applying RL algorithms to game AI and robotics.

I was fortunate to visit the Machine Intelligence Group at the Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University, working with Prof. Chongjie Zhang. Before that, I was a student intern at Tencent Robotics X Lab (mentored by Prof. Meng Fang and Dr. Lei Han) and Tencent AI Lab (mentored by Lanqing Li).



  • 2022.09 - present, Ph.D. student, Department of Computer Science and Engineering, the Hong Kong University of Science and Technology.
  • 2019.09 - 2022.07, Master, Department of Automation, Tsinghua University.
  • 2015.09 - 2019.07, Bachelor, Department of Automation, Tsinghua University.


  • Tencent Robotics X (Oct. 2020 – Jun. 2021)

    Researched efficient goal-conditioned RL algorithms.

  • Tencent AI Lab (Jun. 2020 – Sep. 2020)

    Researched multi-step HER and offline meta-RL.

  • Meituan Financial Service Group (Jul. 2019 – Sep. 2019)

    Controlled the advertising speed of the Meituan app via a PI-control pacing algorithm.


Conference Reviewer: ICML (2022), NeurIPS (2022).


In my leisure time, I enjoy sports such as running, table tennis, and swimming. I used to be an amateur long-distance runner: I finished a half marathon (21.0975 km) in 1h30min and a marathon (42.195 km) in 3h36min.