Biography

I am Yimu Wang, a last-year Ph.D. student at UWaterloo. I obtained my master’s degree under the supervision of Prof. Lijun Zhang in the LAMDA Group led by Prof. Zhihua Zhou at Nanjing University. I was honored to spend a wonderful RA time at Tsinghua University with Prof. Jingjing Liu and Prof. Yang Liu and amazing experiences at Amazon, SONY AI, Borealis AI, Tencent Lightspeed & Quantum Studios, Alibaba, Netease Games, and Megvii.

My major research interests are Multi-modal Learning and 3D understanding.

Yimu Wang
PhD Student
University of Waterloo CS

        

News

Publications [Google Scholar]

  1. Lexicographic Lipschitz Bandits: New Algorithms and a Lower Boun
    Bo Xue, Ji Cheng, Fei Liu, Yimu Wang, Lijun Zhang, and Qingfu Zhang
    Journal of Machine Learning Research (JMLR), 2025.

    JMLR 2025

  2. Hawaii: Hierarchical Visual Knowledge Transfer for Efficient Vision-Language Models
    Yimu Wang, Mozhgan Nasr Azadani, Sean Sedwards, Krzysztof Czarnecki
    Annual Conference on Neural Information Processing Systems (NeurIPS), 2025.

    NeurIPS 2025 Arxiv

  3. Survey of Video Diffusion Models: Foundations, Implementations, and Applications
    Yimu Wang, Xuye Liu, Wei Pang, Li Ma, Shuai Yuan, Paul Debevec, Ning Yu
    Transactions on Machine Learning Research (TMLR), 2025.

    TMLR 2025 Arxiv Paper

  4. LEO-MINI: An Efficient Multimodal Large Language Model using Conditional Token Reduction and Mixture of Multi-Modal Experts
    Yimu Wang, Mozhgan Nasr Azadani, Sean Sedwards, Krzysztof Czarnecki
    Empirical Methods in Natural Language Processing (EMNLP), 2025.

    EMNLP 2025 Arxiv

  5. OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection
    A. Chow, E. Riddell, Yimu Wang, S. Sedwards, K. Czarnecki
    International Conference on Computer Vision (ICCV), 2025.

    ICCV 2025 Arxiv

  6. NBDESCRIB: A Dataset for Text Description Generation from Tables and Code in Jupyter Notebooks with Guidelines
    Xuye Liu, Tengfei Ma, Yimu Wang, Fengjie Wang, Jian Zhao
    Annual Meeting of the Association for Computational Linguistics (Findings of ACL), 2025.

    Findings of ACL 2025

  7. ELIOT: Zero-Shot Video-Text Retrieval through Relevance-Boosted Captioning and Structural Information Extractio
    Xuye Liu, Yimu Wang, Jian Zhao
    NAACL Student Research Workshop (SRW of NAACL), 2025.

    SRW of NAACL 2025 Paper

  8. DREAM: Improving Video-Text Retrieval Through Relevance-Based Augmentation Using Large Foundation Models
    Yimu Wang, Shuai Yuan, Bo Xue, Xiangru Jian, Wei Pang, Mushi Wang, Ning Yu
    Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL), 2025.

    NAACL 2025 Paper

  9. AIDE: Improving 3D Open-Vocabulary Semantic Segmentation by Aligned Vision-Language Learning
    Yimu Wang, Krzysztof Czarneck
    IEEE Winter Conference on Applications of Computer Vision (WACV), 2025.

    WACV 2025 Paper

  10. Pretext Training Algorithms for Event Sequence Data
    Yimu Wang, He Zhao, Ruizhi Deng, Frederick Tung, Greg Mori
    Conference on Neural Information Processing Systems Workshop (NeurIPS workshop), 2024.

    NeurIPS Workshop 2024 Paper

  11. Lost Domain Generalization Is a Natural Consequence of Lack of Training Domains
    Yimu Wang, Yihan Wu, Hongyang Zhang
    Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2024.

    AAAI 2024 Paper

  12. Multiobjective Lipschitz Bandits under Lexicographic Ordering
    Bo Xue, Ji Cheng, Fei Liu, Yimu Wang, Qingfu Zhang
    Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2024.

    AAAI 2024 Paper

  13. Efficient Algorithms for Generalized Linear Bandits with Heavy-tailed Rewards
    Bo Xue, Yimu Wang, Yuanyu Wan, Jinfeng Yi, and Lijun Zhang
    Conference on Neural Information Processing Systems (NeurIPS), 2023.

    NeurIPS 2023 Paper

  14. Balance Act: Mitigating Hubness in Cross-Modal Retrieval with Query and Gallery Banks
    Yimu Wang, Xiangru Jian, Bo Xue
    Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP Oral), 2023.

    EMNLP (Oral) 2023 Paper Code

  15. Video-Text Retrieval by Supervised Sparse Multi-Grained Learning
    Yimu Wang, Peng Shi
    Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP), 2023.

    Findings of EMNLP 2023 Paper Code

  16. InvGC: Robust Cross-Modal Retrieval by Inverse Graph Convolution
    Xiangru Jian, Yimu Wang
    Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP), 2023.

    Findings of EMNLP 2023 Paper Code

  17. Cooperation or Competition: Avoiding Player Domination for Multi-target Robustness by Adaptive Budgets
    Yimu Wang, Dinghuai Zhang, Yihan Wu, Heng Huang, Hongyang Zhang
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023.

    CVPR 2023 Paper

  18. Multimodal Federated Learning via Contrastive Representation Ensemble
    Qiying Yu, Yang Liu, Yimu Wang, Ke Xu, Jingjing Liu
    International Conference on Learning Representations (ICLR), 2023.

    ICLR 2023 Paper Code

  19. Deep Unified Cross-Modality Hashing by Pairwise Data Alignment
    Yimu Wang, Bo Xue, Quan Cheng, Yuhui Chen, and Lijun Zhang
    International Joint Conference on Artificial Intelligence (IJCAI), 2021.

    IJCAI 2021 Paper

  20. Searching Privately by Imperceptible Lying: A Novel Private Hashing Method with Differential Privacy
    Yimu Wang, Shiyin Lu, and Lijun Zhang
    ACM International Conference on Multimedia (ACM MM), 2020.

    ACM MM 2020 Paper

  21. Nearly Optimal Regret for Stochastic Linear Bandits with Heavy-Tailed Payoffs
    Bo Xue, Guanghui Wang,Yimu Wang, Lijun Zhang
    International Joint Conference on Artificial Intelligence (IJCAI), 2020.

    IJCAI 2020 Paper

  22. An Adversarial Domain Adaptation Network for Cross-Domain Fine-Grained Recognition
    Yimu Wang, Ren-Jie Song, Xiu-Shen Wei, and Lijun Zhang
    IEEE Winter Conference on Applications of Computer Vision (WACV), 2020.

    WACV 2020 Paper

Educations and Research Experience

Working Experiences

Awards