Lihe Li 李立和

How to pronounce my name?

Lihe -> lee-huh.
Li -> lee.
You can just call me Lee.

Hi there, thanks for visiting my website! I am a Master student (Sep. 2023 - Now) at the School of Artificial Intelligence at Nanjing University, where I am fortunate to be advised by Prof. Yang Yu and affiliated with the LAMDA Group led by Prof. Zhi-Hua Zhou. Specifically, I am a member of the LAMDA-RL Group, which focuses on reinforcement learning research. Prior to that, I was an undergraduate student at the same school.

Unity makes strength. Currently my research interest is Reinforcement Learning (RL), especially in Multi-agent Reinforcement Learning (MARL) that enables agents efficiently, robustly and safely coordinate with other agents🤖 and even humans👨‍👩‍👧‍👦.

Please feel free to drop me an Email for any form of communication or collaboration!

Email:  lilh [at] lamda [dot] nju [dot] edu [dot] cn  /

  lilhzq76 [at] gmail [dot] com

CV  /  Google Scholar  /  Semantic Scholar  /  DBLP  /  Github  /  Twitter

profile photo

Just a reminder, I am the guy on the left.

🎓 Seeking PhD Opportunities 🎓

I am actively looking for PhD positions starting in Fall 2026.

Feel free to reach out if you have potential opportunities! 📧

News
Selected Publications [ Full List / Google Scholar ]
semdiv LLM-Assisted Semantically Diverse Teammate Generation for Efficient Multi-agent Coordination
Lihe Li, Lei Yuan, Pengsen Liu, Tao Jiang, Yang Yu
The 42rd International Conference on Machine Learning (ICML), 2025
pdf / link / code / blog / poster / bibtex

Instead of discovering novel teammates only at the policy level, we utilize LLMs to propose novel coordination behaviors described in natural language, and then transform them into teammate policies, enhancing teammate diversity and interpretability, eventually learning agents with language comprehension ability and stronger collaboration skills.

core3 Continual Multi-Objective Reinforcement Learning via Reward Model Rehearsal
Lihe Li, Ruotong Chen, Ziqian Zhang, Zhichao Wu, Yi-Chen Li, Cong Guan, Yang Yu, Lei Yuan
The 33rd International Joint Conference on Artificial Intelligence (IJCAI), 2024
pdf / link / code / talk / poster / bibtex

We study the problem of multi-objective reinforcement learning (MORL) with continually evolving learning objectives, and propose CORe3 to enable the MORL agent rapidly learn new objectives and avoid catastrophic forgetting about old objectives lacking reward signals.

macop Learning to Coordinate with Anyone
Lei Yuan, Lihe Li, Ziqian Zhang, Feng Chen, Tianyi Zhang, Cong Guan, Yang Yu, Zhi-Hua Zhou
Proceedings of the Fifth International Conference on Distributed Artificial Intelligence (DAI), Best Paper Award, 2023
pdf / link / code / English talk / Chinese talk / bibtex

We propose Multi-agent Compatible Policy Learning (MACOP), where we adopt an agent-centered teammate generation process that gradually and efficiently generates diverse teammates covering the teammate policy space, and we use continual learning to train the ego agents to coordinate with them and acquire strong coordination ability.

macpro Multi-agent Continual Coordination via Progressive Task Contextualization
Lei Yuan, Lihe Li, Ziqian Zhang, Fuxiang Zhang, Cong Guan, Yang Yu
IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
pdf / link / code / poster / bibtex

We formulate the continual coordination framework and propose MACPro to enable agents to continually coordinate with each other when the dynamic of the training task and the multi-agent system itself changes over time.

survey A Survey of Progress on Cooperative Multi-agent Reinforcement Learning in Open Environment
Lei Yuan, Ziqian Zhang, Lihe Li, Cong Guan, Yang Yu
Science China Information Sciences (SCIS)
pdf in English / pdf in Chinese / link / bibtex

We review multi-agent cooperation from closed environment to open environment settings, and provide prospects for future development and research directions of cooperative MARL in open environments.

Education
Nanjing University
2023.09 - present

Master Student in Computer Science and Technology
Advisor: Prof. Yang Yu
Nanjing University
2019.09 - 2023.07

Undergraduate Student in Artificial Intelligence
Advisor: Prof. Yang Yu
Guangdong Zhaoqing Middle School
2016.09 - 2019.06

High School Education
Awards & Honors
  • BYD Scholarship, 2025 (CNY 20,000).
  • First-Class Academic Scholarship, 2025 (CNY 12,000).
  • National Scholarship, 2024 (CNY 20,000). [link]
  • First-Class Academic Scholarship, 2024 (CNY 10,000).
  • Best Paper Award of The Fifth Distributed Artificial Intelligence Conference (DAI), 2023. [link]
  • Outstanding Graduate & Bachelor's Thesis of Nanjing University, 2023.
  • The Egret Scholarship, 2022 (CNY 10,000).
  • The People's Scholarship, 2021, 2020.
Service
  • Reviewer: NeurIPS (2025), COLM (2025), ICML (2025), ICLR (2025, 2026).
Teaching Assistant
Miscellaneous

Template courtesy: Jon Barron.