-->
I am now a Tenure-track Assistant Professor (Ph.D. Supervisor) in Wangxuan Institute of Computer Technology, Peking University, Peking University Boya Young Fellow. Also a member of MIPL Group (led by Prof. Yuxin Peng) at Peking University.
Before joining Peking University, I was a Postdoctoral Researcher in the Visual Geometry Group (VGG) at University of Oxford, supervised by Prof. Andrew Zisserman. I received PhD and MPhil in Advanced Computer Science from University of Cambridge, and B.Eng. in Telecommunication Engineering from Beijing University of Posts and Telecommunications (BUPT).
My research interests include computer vision, natural language processing and machine learning, with an emphasis on how these areas can collaborate best to perform real-world tasks. Below are some of my recent research topic:
We are always actively recruiting postdocs, Prospective graduate students and interns!
Welcome to contact me with your detailed CV! Please read this Note first!
Active Object Detection with Knowledge Aggregation and Distillation from Large Models Dejie Yang, Yang Liu† IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024 [ PDF] [Project Page] [Video] [ Code] [ Bibtex] |
|
Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection Ting Lei, Shaofeng Yin, Yang Liu† IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024 [ PDF] [Project Page] [Video] [ Code] [ Bibtex] |
|
OED: Towards One-stage End-to-End Dynamic Scene Graph Generation Guan Wang, Zhimin Li, Qingchao Chen, Yang Liu† IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024 [ PDF] [Project Page] [Video] [ Code] [ Bibtex] |
|
Diff-BGM: A Diffusion Model for Video Background Music Generation Sizhe Li, Yiming Qin, Minghang Zheng, Xin Jin, Yang Liu† IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024 [ PDF] [Project Page] [Video] [ Code] [ Bibtex] |
|
Training Free Video Temporal Grounding using Large-scale Pre-trained Models Minghang Zheng, Xinhao Cai, Qingchao Chen, Yuxin Peng, Yang Liu† European Conference on Computer Vision (ECCV), 2024 [ PDF] [Project Page] [Video] [ Code] [ Bibtex] |
|
Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection Ting Lei, Shaofeng Yin, Yuxin Peng, Yang Liu† European Conference on Computer Vision (ECCV), 2024 [ PDF] [Project Page] [Video] [ Code] [ Bibtex] |
|
Semantic-Aware Human Object Interaction Image Generation Zhu Xu, Qingchao Chen, Yuxin Peng, Yang Liu† International Conference on Machine Learning (ICML), 2024 [ PDF] [Project Page] [Video] [ Code] [ Bibtex] |
|
TeachText: CrossModal Text-Video Retrieval through Generalized Distillation Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanuc, Hailin Jin, Andrew Zisserman, Yang Liu†, Samuel Albanie Artificial Intelligence Journal (AIJ), 2024 [ PDF] [Project Page] [Code] |
|
3D Vision and Language Pretraining with Large-Scale Synthetic Data Dejie Yang, Zhu Xu, Wentao Mo, Qingchao Chen, Siyuan Huang, Yang Liu† International Joint Conference on Artificial Intelligence (IJCAI), 2024 [ PDF] [Project Page] [Video] [ Code] [ Bibtex] |
|
Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA Wentao Mo, Yang Liu† Conference on Artificial Intelligence (AAAI), 2024 [ PDF] [Project Page] [ Code] [ Bibtex] |
|
Semantic-Guided Novel Category Discovery Weishuai Wang, Ting Lei, Qingchao Chen, Yang Liu† Conference on Artificial Intelligence (AAAI), 2024 [ PDF] [Project Page] [Video] [ Code] [ Bibtex] |
|
Novel Class Discovery in Chest X-Rays via Paired Images and Text Jiaying Zhou, Yang Liu, Qingchao Chen Conference on Artificial Intelligence (AAAI), 2024 [ PDF] [ Bibtex] |
VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges Yuxuan Wang, Cihang Xie, Yang Liu, Zilong Zheng [ PDF] [Project Page] [ Code] |
ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual Grounding Minghang Zheng, Jiahua Zhang, Qingchao Chen, Yuxin Peng, Yang Liu† ACM International Conference on Multimedia (ACM-MM), 2024 [ PDF] [Project Page] [Video] [ Code] [ Bibtex] |
|
RelScene: A Benchmark and baseline for Spatial Relations in text-driven 3D Scene Generation Zhaoda Ye, Xinhan Zheng, Yang Liu, Yuxin Peng ACM International Conference on Multimedia (ACM-MM), 2024 [ PDF] |
|
Efficient Temporal Extrapolation of Multimodal Large Language Models with Temporal Grounding Bridge Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Yang Liu, Zilong Zheng Empirical Methods in Natural Language Processing (EMNLP), 2024 [ PDF] [ Code] [ Bibtex] |
|
MAAN: Memory-Augmented Auto-regressive Network for Text-driven 3D Indoor Scene Generation Zhaoda Ye, Yang Liu, Yuxin Peng IEEE Transactions on Multimedia (TMM), 2024 [ PDF] [ Bibtex] |
|
Evidential Multi-Source-Free Unsupervised Domain Adaptation Jiangbo Pei, Aidong Men, Yang Liu, Xiahai Zhuang, Qingchao Chen IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024 [ PDF] [ Code] [ Bibtex] |
|
Zero-Shot Video Moment Retrieval from Frozen Vision-Language Models Dezhao Luo, Jiabo Huang, Shaogang Gong, Hailin Jin, Yang Liu† Winter Conference on Applications of Computer Vision (WACV), 2024 [ PDF] [ Bibtex] |
|
Masked Retraining Teacher-Student Framework for Domain Adaptive Object Detection Zijing Zhao, Sitong Wei, Qingchao Chen, Dehui Li, Yifan Yang, Yuxin Peng, Yang Liu† International Conference on Computer Vision (ICCV), 2023 [ PDF] [Supplementary Material] [Project Page] [Video] [ Code] [ Bibtex] |
|
Efficient Adaptive Human-Object Interaction Detection with Concept-guided Memory Ting Lei, Fabian Caba, Qingchao Chen, Hailin Ji, Yuxin Peng, Yang Liu† International Conference on Computer Vision (ICCV), 2023 [ PDF] [Supplementary Material] [Project Page] [ Code] [ Bibtex] |
|
Confidence-aware Pseudo-label Learning for Weakly Supervised Visual Grounding Yang Liu,Jiahua Zhang, Qingchao Chen, Yuxin Peng International Conference on Computer Vision (ICCV), 2023 [ PDF] [Project Page] [ Code] [ Bibtex] |
|
Moment Detection in Long Tutorial Videos Ioana Croitoru, Simion-Vlad Bogolin, Samuel Albanie, Yang Liu, Zhaowen Wang, Seunghyun Yoon, Franck Dernoncourt, Hailin Jin, Trung Bui International Conference on Computer Vision (ICCV), 2023 [ PDF] [Supplementary Material] [ Dataset] [ Bibtex] |
|
Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization Minghang Zheng, Shaogang Gong, Hailin Jin, Yuxin Peng, Yang Liu† Annual Meeting of the Association for Computational Linguistics (ACL), 2023 [ PDF] [ Code] [ Bibtex] |
|
Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training Dezhao Luo, Jiabo Huang, Shaogang Gong, Hailin Jin, Yang Liu† IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023 [ PDF] [ Bibtex] |
|
Phrase-level Temporal Relationship Mining for Temporal Sentence Localization Minghang Zheng, Sizhe Li, Qingchao Chen, Yuxin Peng, Yang Liu† Conference on Artificial Intelligence (AAAI), 2023 [ PDF] [ Code] [ Bibtex] |
|
Uncertainty-induced transferability representation for source-free unsupervised domain adaptation Jiangbo Pei, Zhuqing Jiang, Aidong Men, Liang Chen, Yang Liu,Qingchao Chen IEEE Transactions on Image Processing (TIP), 2023 [ PDF] [ Code] [ Bibtex] |
|
IoT-V2E: An Uncertainty-Aware Cross-Modal Hashing Retrieval Between Infrared-Videos and EEGs for Automated Sleep State Analysis Jianan Han , Aidong Men, Yang Liu, Ziming Yao, Shaoxing Zhang, Yan Yan, Qingchao Chen IEEE Internet of Things Journal, 2023 [ PDF] [ Code] [ Bibtex] |
|
Video Activity Localisation with Uncertainties in Temporal Boundary Jiabo Huang, Hailin Jin, Shaogang Gong, Yang Liu† European Conference on Computer Vision (ECCV), 2022 [ PDF] [ Code] |
|
Delving into the Continuous Domain Adaptation Yinsong Xu, Zhuqing Jiang, Aidong Men, Yang Liu,Qingchao Chen ACM International Conference on Multimedia (ACM-MM), 2022 [ PDF] [ Bibtex] |
|
Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning Minghang Zheng, Yanjie Huang, Qingchao Chen, Yuxin Peng, Yang Liu† IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022 [ PDF] [Project Page] [Code] [ Bibtex] |
|
Cross Modal Retrieval with Querybank Normalisation Simion-Vlad Bogolin, Ioana Croitoru, Hailin Jin, Yang Liu†, Samuel Albanie† IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022 [ PDF] [Project Page] [Code] [ Bibtex] |
|
Weakly Supervised Video Moment Localization with Contrastive Negative Sample Mining Minghang Zheng, Yanjie Huang, Qingchao Chen, Yang Liu† Conference on Artificial Intelligence (AAAI), 2022 [ PDF] [Project Page] [Code] [ Bibtex] |
|
TeachText: CrossModal Generalized Distillation for Text-Video Retrieval Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu, Hailin Jin, Andrew Zisserman, Samuel Albanie, Yang Liu† International Conference on Computer Vision (ICCV), 2021 [ PDF] [Project Page] [Code] [ Bibtex] |
|
Cross-Sentence Temporal and Semantic Relations in Video Activity Localisation Jiabo Huang, Yang Liu†, Shaogang Gong, Hailin Jin International Conference on Computer Vision (ICCV), 2021 [ PDF] [ Bibtex] |
|
Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval Yang Liu*, Qingchao Chen*, Samuel Albanie IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 [pdf] [Bibtex] |
|
Mind-the-Gap! Unsupervised Domain Adaptation for Text-Video Retrieval Qingchao Chen*, Yang Liu*, Samuel Albanie Conference on Artificial Intelligence (AAAI), 2021 [pdf] [Bibtex] |
|
QuerYD: A video dataset with high-quality textual and audio narrations Andreea-Maria Oncescu, Joao F. Henriques, Yang Liu, Andrew Zisserman, Samuel Albanie IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021 [pdf] [Project Page ] [Audio Description Service ] [Bibtex] |
|
Amplifying Key Cues for Human-Object-Interaction Detection Yang Liu, Qingchao Chen, and Andrew Zisserman European Conference on Computer Vision (ECCV), 2020 [ PDF] [Supplementary Material] [Project Page] [Video] [ Bibtex] |
|
Structure-Aware Feature Fusion for Unsupervised Domain Adaptation Qingchao Chen*, Yang Liu* Conference on Artificial Intelligence (AAAI), 2020 [PDF] [Bibtex] |
|
Use what you have: Video retrieval using representations from collaborative experts Yang Liu*, Samual Albanie*, Arsha Nagrani, and Andrew Zisserman British Machine Vision Conference (BMVC), 2019 [ PDF] [Project Page] [Code] [Challenge] [Challenge Report] [ Bibtex] |
|
Synthetically Supervised Feature Learning for Scene Text Recognition Yang Liu, Zhaowen Wang, Hailin Jin, and Ian Wassell European Conference on Computer Vision (ECCV), 2018 [ PDF] [ Bibtex] |
|
Multi-Task Adversarial Network for Disentangled Feature Learning Yang Liu, Zhaowen Wang, Hailin Jin, and Ian Wassell IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018 [ PDF] [ Bibtex] |
|
Re-weighted Adversarial Adaptation Network for Unsupervised Domain Adaptation Qingchao Chen*, Yang Liu*, Zhaowen Wang, Ian Wassell and Kevin Chetty IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018 [ PDF] [ Bibtex] |
|
Discriminant Dictionary Learning meets CNN in Scene Recognition Yang Liu, Qingchao Chen, Wei Chen and Ian Wassell Conference on Artificial Intelligence (AAAI), 2018 [PDF] [Bibtex] |
|
Joint fall and aspect angle recognition using fine-grained micro-Doppler classification Qingchao Chen, Matthiew Ritchie, Yang Liu, Kevin Chetty, Karl Woodbridge, IEEE Radar Conference (RadarConf), 2017 [PDF] [DOI] [Bibtex] |
|
Simultaneous Bayesian Sparse Approximation With Structured Sparse Models Wei Chen, David Wipf, Yu Wang, Yang Liu and Ian Wassell IEEE Transactions on Signal Processing (TIP), 2016 [PDF] [DOI] [Bibtex] |
|
Support Discrimination Dictionary Learning for Image Classification Yang Liu, Wei Chen, Qingchao Chen and Ian Wassell European Conference on Computer Vision (ECCV), 2016 [PDF] [DOI] [Bibtex] |
|
A New Face Recognition Algorithm based on Dictionary Learning for a Single Training Sample per Person Yang Liu and Ian Wassell British Machine Vision Conference (BMVC), 2015 [PDF] [DOI] [Bibtex] |
Mentor: Prof. Andrew Zisserman |
|
Mentors: Zhaowen Wang, Hailin Jin |
|
Mentor: Dr. Ian Wassell |
|
|
|
|