Yuan-Ming Li
I am currently a third-year Ph.D student at Sun Yat-sen University (SYSU) advised by Prof. Wei-Shi Zheng.
Before coming to SYSU, I obtained my B.E. degree at the University of Electronic Science and Technology of China.
My research interests mainly lie around Computer Vision, with the goal of developing general and generalizable vision systems.
My main research focuses on Video Action Understanding.
In the following years, I will focus on Interpretable Action Assessment and Multimodal Learning. And I am always open for research discussions and collaborations.
Email /
Google Scholar /
Github
|
|
News
|
[06/26/2025] 2 papers are accepted by ICCV 2025.
|
[02/27/2025] 1 paper is accepted by CVPR 2025.
|
[01/28/2025] 1 paper is accepted by ICRA 2025.
|
[07/16/2024] 1 paper is accepted by ACMMM 2024.
|
[07/01/2024] 1 paper is accepted by ECCV 2024.
|
[04/26/2024] 1 paper is accepted by TCSVT 2024.
|
Publications
‡: Project lead; †: Equal Contributions; *: Corresponding Authors.
|
|
LOVE-R1: Advancing Long Video Understanding with an Adaptive Zoom-in Mechanism via Multi-step Reasoning
Shenghao Fu,
Qize Yang,
Yuan-Ming Li,
Xihan Wei,
Xiaohua Xie,
Wei-Shi Zheng*
preprint
|
|
ViSpeak: Visual Instruction Feedback in Streaming Videos
Shenghao Fu†,
Qize Yang†,
Yuan-Ming Li,
Yi-Xing Peng,
Kun-Yu Lin,
Xihan Wei,
Jian-Fang Hu,
Xiaohua Xie,
Wei-Shi Zheng*
ICCV, 2025
|
|
Modeling Multiple Normal Action Representations for Error Detection in Procedural Tasks
Wei-Jin Huang†,
Yuan-Ming Li†,
Zhi-Wei Xia,
Yu-Ming Tang,
Kun-Yu Lin,
Jian-Fang Hu,
Wei-Shi Zheng*
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025
|
|
TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching
Yuan-Ming Li‡†,
An-Lan Wang†,
Kun-Yu Lin,
Yu-Ming Tang,
Ling-An Zeng,
Jian-Fang Hu,
Wei-Shi Zheng*
preprint
|
|
Task-Oriented 6-DoF Grasp Pose Detection in Clutters
An-Lan Wang†,
Nuo Chen†,
Kun-Yu Lin,
Yuan-Ming Li,
Wei-Shi Zheng*
ICRA, 2025
|
|
Loc4Plan: Locating Before Planning for Outdoor Vision and Language Navigation
Hui-Lin Tian,
Jing-Ke Meng*,
Wei-Shi Zheng,
Yuan-Ming Li,
Jun-Kai Yan,
Yu-Nong Zhang
ACM International Conference on Multimedia (ACMMM), 2024, (Best Paper Nomination)
|
|
EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding
Yuan-Ming Li‡†,
Wei-Jin Huang†,
An-Lan Wang†,
Ling-An Zeng,
Jing-Ke Meng*,
Wei-Shi Zheng*
European Conference on Computer Vision (ECCV), 2024
|
|
Continual Action Assessment via Task-Consistent Score-Discriminative Feature Distribution Modeling
Yuan-Ming Li,
Ling-An Zeng,
Jing-Ke Meng*,
Wei-Shi Zheng*
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2024
|
Academic Service
Conference Reviewer: CVPR, ICCV, ECCV
Journal Reviewer: TPAMI
|
|