📜 Biography
I am a Ph.D. candidate at Beijing Institute of Technology (BIT), advised by Prof. Yuwei Wu and Prof. Yunde Jia. I received my Master’s degree from Beijing Institute of Technology in 2023, also supervised by Prof. Yuwei Wu and Prof. Yunde Jia, and my Bachelor’s degree from Sichuan Normal University in 2018.
My research interests include:
- Vision-and-Language
- Multimodal Large Language Models
- Embodied Intelligence
📖 Education
- 2023.09 - Present,
Ph.D. in Computer Science, Beijing Institute of Technology, Beijing, China.
- 2021.09 - 2023.07,
M.S. in Software Engineering, Beijing Institute of Technology, Beijing, China.
- 2014.09 - 2018.07,
B.S. in Computer Science, Sichuan Normal University, Chengdu, Sichuan, China.
📝 Publications
AAAI 2025

IJCAI 2023

Fast-StrucTexT: An efficient hourglass transformer with modality-guided dynamic token merge for document understanding
- Mingliang Zhai, Yulin Li, Xiameng Qin, Chen Yi, Qunyi Xie, Chengquan Zhang, Kun Yao, Yuwei Wu, Yunde Jia.
- [IJCAI 2023] [paper]
EMNLP 2024

In-Context Compositional Generalization for Large Vision-Language Models
- Chuanhao Li, Chenchen Jing, Zhen Li, Mingliang Zhai, Yuwei Wu, and Yunde Jia.
- [EMNLP 2024] [paper]
ECCV 2024

🎖 Honors and Awards
- 2024.05, Innovation Award in the “Driving with Language” track of the [CVPR 2024 Autonomous Grand Challenge].
- 2023.01, Second Prize in the Multimodal Technology Innovation Competition of the first “Xingzhi Cup” National Artificial Intelligence Innovation Application Competition.
💼 Internship
- 2024.03 - 2024.08, Chongqing Changan Automobile Co., Ltd., Chongqing, China.
- 2022.06 - 2024.03, Baidu Inc., Beijing, China.