Yangfu Li

Ph.D. Student • East China Normal University

Yangfu Li

School of Communication & Electronic Engineering, East China Normal University

I am Yangfu Li, a Ph.D. student in Information and Communication Engineering at East China Normal University. I am advised by Prof. Yue Lyu and Prof. Zhi-Qin John Xu (@SJTU). Previously, I obtained my B.Eng. in Electronic Engineering from Fuzhou University. Currently, I focus on multi-modal large language models (MLLMs), especially visually grounded reasoning and visual token pruning, as well as OCR for open-set text recognition, low-quality text recognition, and handwritten spotting.

MLLMs Visually Grounded Reasoning Visual Token Pruning OCR
Profile

About Me

My research centers on multimodal intelligence, with a current emphasis on visually grounded reasoning in MLLMs and efficient visual token processing. I am also interested in OCR, especially open-set text recognition, low-quality text recognition, and handwritten spotting.

Multi-modal Large Language Models (MLLMs)

Visually grounded reasoning, visual token pruning, and multimodal perception-reasoning alignment.

Optical Character Recognition (OCR)

Open-set text recognition, low-quality text recognition, and handwritten spotting.

Background

Education

East China Normal University, Shanghai
Doctor of Philosophy in Information and Communication Engineering
Sept 2024 - Jun 2027 (Expected)
Supervisors: Yue Lyu and Zhi-Qin John Xu (@SJTU) • Research Area: MLLMs, OCR
Fuzhou University, Fuzhou
Bachelor in Electronic Engineering
Research

Internship & Visitor Experience

Ant Group | Inclusion AI (Ming Team), Shanghai
Research Intern
Oct 2025 - Mar 2026
Advised by Ziyuan Huang and Dandan Zheng • Research Area: Post-training on Unified Multimodal Models, Interleaved Text-Image Generation
Xiamen University, Xiamen
Research Visitor
Research Area: Speech Processing, Image Restoration
Selected Work

Publications

Academic Service

Reviewer

CVPR ECCV ICCV ICLR ICML NeurIPS AAAI