Biography

I am a second-year Master’s student in the CS Department at Fudan University, advised by Prof. Weifeng Ge. Previously, I received my Bachelor’s degree from the CS Department at Southeast University, where I worked with Prof. Ding Ding. My research primarily focuses on vision-language learning (Visual Question Answering, Multimodal Large Language Models, etc.).

I am looking for a Ph.D. position starting in Fall 2025. Feel free to reach out if you are interested :)

News

  • [2024.7] One first-authored paper accepted by ACM MM 2024.
  • [2024.7] One first-authored paper accepted by ECCV 2024.

Publications and Preprints

Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering. [ACM Multimedia 2024] [Paper] [Code]

Haibo Wang, Chenghang Lai, Yixuan Sun, Weifeng Ge.

Q&A Prompts: Discovering Rich Visual Clues through Mining Question-Answer Prompts for VQA requiring Diverse World Knowledge. [ECCV 2024] [Paper] [Code]

Haibo Wang, Weifeng Ge.

Pixel-level Semantic Correspondence through Layout-aware Representation Learning and Multi-scale Matching Integration. [CVPR 2024] [Paper]

Yixuan Sun*, Zhangyue Yin*, Haibo Wang, Yan Wang, Xipeng Qiu, Weifeng Ge, Wenqiang Zhang.

Object-Centric Cross-Modal Knowledge Reasoning for Future Event Prediction in Videos. [IEEE TCSVT 2024] [Paper]

Chenghang Lai, Haibo Wang, Weifeng Ge, Xiangyang Xue.

IVRSandplay: An Immersive Virtual Reality Sandplay System Coupled with Hand Motion Capture and Eye Tracking. [CSCWD 2023] [Paper]

Haibo Wang, Ding Ding, Yuhao Liu, Chi Wang.

Experiences

2024.05 - present, Summer Intern, Virginia Tech, advised by Prof. Lifu Huang.
2022.09 - 2025.06 (expected), Graduate Student, Fudan University, Shanghai.
2018.09 - 2022.06, Undergraduate Student, Southeast University, Nanjing.