Yuan-Ming Li

I am currently a fourth-year Ph.D student at Sun Yat-sen University advised by Prof. Wei-Shi Zheng. Before coming to SYSU, I obtained my B.E. degree at the University of Electronic Science and Technology of China.

🛠️ Employment:
[2025/08—Present] Research Intern, Tongyi Lab, Alibaba Group. Work closely with Qize Yang and Xihan Wei.

My current research interests focus on Action Understanding, Generation, and Multimodal Reasoning. I am always open for research discussions and collaborations. Feel free to contact me.

Email  / Google Scholar  /   Github

profile photo

News

  • [08/11/2025] 1 papers is accepted by AAAI 2026.
  • [06/26/2025] 2 papers are accepted by ICCV 2025.
  • [02/27/2025] 1 paper is accepted by CVPR 2025.
  • [01/28/2025] 1 paper is accepted by ICRA 2025.
  • [07/16/2024] 1 paper is accepted by ACMMM 2024.
  • [07/01/2024] 1 paper is accepted by ECCV 2024.
  • [04/26/2024] 1 paper is accepted by TCSVT 2024.

Publications


‡: Project lead; †: Equal Contributions; *: Corresponding Authors.

IRG-MotionLLM: Interleaving Motion Generation, Assessment and Refinement for Text-to-Motion Generation

Yuan-Ming Li, Qize Yang, Nan Lei, Shenghao Fu, Ling-An Zeng, Jian-Fang Hu Xihan Wei, Wei-Shi Zheng*
preprint


LOVE-R1: Advancing Long Video Understanding with an Adaptive Zoom-in Mechanism via Multi-step Reasoning

Shenghao Fu, Qize Yang, Yuan-Ming Li, Xihan Wei, Xiaohua Xie, Wei-Shi Zheng*
preprint


ViSpeak: Visual Instruction Feedback in Streaming Videos

Shenghao Fu†, Qize Yang†, Yuan-Ming Li, Yi-Xing Peng, Kun-Yu Lin, Xihan Wei, Jian-Fang Hu, Xiaohua Xie, Wei-Shi Zheng*
ICCV, 2025


Modeling Multiple Normal Action Representations for Error Detection in Procedural Tasks

Wei-Jin Huang†, Yuan-Ming Li†, Zhi-Wei Xia, Yu-Ming Tang, Kun-Yu Lin, Jian-Fang Hu, Wei-Shi Zheng*
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025


TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching

Yuan-Ming Li‡†, An-Lan Wang†, Kun-Yu Lin, Yu-Ming Tang, Ling-An Zeng, Wei-Shi Zheng*
AAAI, 2026


Task-Oriented 6-DoF Grasp Pose Detection in Clutters

An-Lan Wang†, Nuo Chen†, Kun-Yu Lin, Yuan-Ming Li, Wei-Shi Zheng*
ICRA, 2025


Loc4Plan: Locating Before Planning for Outdoor Vision and Language Navigation

Hui-Lin Tian, Jing-Ke Meng*, Wei-Shi Zheng, Yuan-Ming Li, Jun-Kai Yan, Yu-Nong Zhang
ACM International Conference on Multimedia (ACMMM), 2024, (Best Paper Nomination)


EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding

Yuan-Ming Li‡†, Wei-Jin Huang†, An-Lan Wang†, Ling-An Zeng, Jing-Ke Meng*, Wei-Shi Zheng*
European Conference on Computer Vision (ECCV), 2024


Continual Action Assessment via Task-Consistent Score-Discriminative Feature Distribution Modeling

Yuan-Ming Li, Ling-An Zeng, Jing-Ke Meng*, Wei-Shi Zheng*
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2024


Academic Service

Conference Reviewer: CVPR, ICCV, ECCV

Journal Reviewer: TPAMI


Thanks to Jon Barron for providing the template.