"There is no truth, only explanation."
I am currently the last year Ph.D. student supervised by Prof. Tinne Tuytelaars at KU Leuven, Belgium. Before that, I received the B.S. degree from the Beijing University of Posts and Telecommunications in 2017, and I completed my M.S. studies supervised by Prof. Xuming He at ShanghaiTech University in 2020.
My research interests focus on:
Image/Video generation
Vision-Language Understanding, including multitasking and pretraining
Relation-aware visual scene understanding
July. 2023 - Dec. 2023
Oct. 2022 - Feb. 2023
M. Li*, B. Wan*, D. Zhou, Sien Moens, and T. Tuytelaars, "Animate Your Motion: Turning Still Images into Dynamic Videos", Arxiv Preprint, 2024. [pdf][code]
H. Diao, B. Wan, Y. Zhang, X. Jia, H. Lu, and L. Chen, "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory", The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024. [pdf][code]
B. Wan, and T. Tuytelaars, "Exploiting CLIP for Zero-shot HOI Detection Requires Knowledge Distillation at Multiple Levels", WACV, 2024. [pdf]
Lucas Beyer*, Bo Wan*, Gagan Madan*, Filip Pavetic*, Andreas Steiner*, Alexander Kolesnikov, André Susano Pinto, Emanuele Bugliarello, Xiao Wang, Qihang Yu, Liang-Chieh Chen, Xiaohua Zhai*, "A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision", Arxiv Preprint, 2023. [pdf]
B. Wan, Y. Liu, D. Zhou, T. Tuytelaars, and X. He, "Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning", The International Conference on Learning Representations (ICLR) , 2023. [pdf][code]
B. Wan, W. Han, Z. Zheng, and T. Tuytelaars, "Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling", The International Conference on Learning Representations (ICLR) Oral, 2022. [pdf][code]
Y. Liu*, B. Wan*, L. Ma, and X. He, "Relation-aware Instance Refinement for Weakly Supervised Visual Grounding", The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021. [pdf][code]
Y. Liu*, B. Wan*, L. Ma, and X. He, "Learning cross-modal context graph for visual grounding", The AAAI Conference on Artificial Intelligence, 2020. [pdf][code]
B. Wan*, D. Zhou*, Y. Liu, R. Li, and X. He, "Pose-aware multi-level feature network for human object interaction detection", The IEEE/CVF International Conference on Computer Vision (ICCV) Oral, 2019. [pdf][code]
R. Li, S. Zhang, B. Wan, and X. He, "Bipartite graph network with adaptive message passing for unbiased scene graph generation", The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021. [pdf][code]
Q. He, D. Zhou, B. Wan, and X. He, "Single Image 3D Object Estimation with Primitive Graph Networks", The ACM International Conference on Multimedia, 2021. [pdf][code]
Note: * indicates equal contribution.
Jul. 2023 - Dec. 2023
Oct. 2022 - Feb. 2023
Aug. 2020 - Oct. 2020
Mar. 2017 - Jun. 2017