I am a third-year Master’s student at CS Department, Fudan University advised by Prof. Weifeng Ge, and an incoming CS Ph.D student at the University of California, Davis, co-advised by Prof. Lifu Huang and Prof. Junshan Zhang. Previously, I received my Bachelor’s Degree in the CS Department, Southeast University, where I worked with Prof. Ding Ding.
My research primarily focuses on Multimodal Large Langauge Models and their broad applications (Visual Question Answering, Video Understanding, Embodied-AI, Unified Image/Video Generation, etc.).
📝 Publications and Preprints

Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
Haibo Wang, Zhiyang Xu, Yu Cheng, Shizhe Diao, Yufan Zhou, Yixin Cao, Qifan Wang, Weifeng Ge, Lifu Huang.
(Preprint)

Haibo Wang, Chenghang Lai, Yixuan Sun, Weifeng Ge.
(ACM MM 2024)

Haibo Wang, Weifeng Ge.
(ECCV 2024)
Adapting Multimodal Large Language Models for Video Question Answering by Capturing Question-critical and Coherent Moments
Haibo Wang, Chenghang Lai, Weifeng Ge. (IEEE TMM 2025)
Yixuan Sun*, Zhangyue Yin*, Haibo Wang, Yan Wang, Xipeng Qiu, Weifeng Ge, Wenqiang Zhang. (CVPR 2024)
Object-Centric Cross-Modal Knowledge Reasoning for Future Event Prediction in Videos
Chenghang Lai, Haibo Wang, Weifeng Ge, Xiangyang Xue. (IEEE TCSVT 2024)
Haibo Wang, Ding Ding, Yuhao Liu, Chi Wang. (CSCWD 2023)
🎖 Honors and Awards
- 2025.03, Shanghai Outstanding Graduates
- 2024.10, National Scholarship
👩💻 Academic Services
- Reviewer: ICLR 2025.
📖 Educations


💻 Experience


