List of Selected Publications
For fulltext, code and datasets, please see PLUS website or Arxiv.
Preprints
Conference and Journal
2023
Novel Class Discovery for Long-tailed Recognition, [OpenReview]
Chuyu Zhang, Ruijie Xu, Xuming He
Transactions on Machine Learning Research (TMLR), 2023
Grounded Image Text Matching with Mismatched Relation Reasoning, [Arxiv]
Yu Wu, Yana Wei, Haozhe Wang, Yongfei Liu, Sibei Yang, Xuming He
International Conference on Computer Vision (ICCV), 2023
Class-relation Knowledge Distillation for Novel Class Discovery, [Arxiv]
Peiyan Gu, Chuyu Zhang, Ruiji Xu, Xuming He
International Conference on Computer Vision (ICCV), 2023
MILD: Modeling the Instance Learning Dynamics for Learning with Noisy Labels, [Arxiv]
Chuanyang Hu, Shipeng Yan, Zhitong Gao, Xuming He
International Joint Conference on Artificial Intelligence (IJCAI), 2023
HOICLIP: Efficient Knowledge Transfer for HOI Detection with Visual Linguistic Model, [Arxiv]
Shan Ning*, Longtian Qiu*, Yongfei Liu, Xuming He
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
Modeling Multimodal Aleatoric Uncertainty in Segmentation with Mixture of Stochastic Experts, [Arxiv]
Zhitong Gao, Yucong Chen, Chuyu Zhang, Xuming He
International Conference on Learning Representation (ICLR), 2023
Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning, [Arxiv]
Bo Wan, Yongfei Liu, Desen Zhou, Tinne Tuytelaars, Xuming He
International Conference on Learning Representation (ICLR), 2023
Part-aware Prototypical Graph Network for One-shot Skeleton-based Action Recognition, [Arxiv]
Tailin Chen, Desen Zhou, Jian Wang, Shidong Wang, Qian He, Chuanyang Hu, Errui Ding, Yu Guan, Xuming He
IEEE conference series on Automatic Face and Gesture Recognition (FG), 2023, Best Student Paper
2022
Generative Negative Text Replay for Continual Vision-Language Pretraining, [TBA]
Shipeng Yan, Lanqing Hong, Hang Xu, Jianhua Han, Tinne Tuytelaars, Zhenguo Li, Xuming He
European Conference of Computer Vision (ECCV), 2022
Learning Semantic Correspondence with Sparse Annotations, [Arxiv]
Shuaiyi Huang, Luyu Yang, Bo He, Songyang Zhang, Xuming He, Abhinav Shrivastava
European Conference of Computer Vision (ECCV), 2022
ROI-Constrained Bidding via Curriculum-Guided Bayesian Reinforcement Learning,[Arxiv]
Haozhe Wang, Chao Du, Panyan Fang, Shuo Yuan, Xuming He, Liang Wang, Bo Zheng
ACM SIGKDD, 2022
KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation,
[Arxiv]
Yongfei Liu, Chenfei Wu, Shao-yen Tseng, Vasudev Lal, Xuming He, Nan Duan
Findings of NAACL, 2022
General Incremental Learning with Domain-aware Categorical Representations, [Arxiv]
Jiangwei Xie, Shipeng Yan, Xuming He
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
SGTR: End-to-end Scene Graph Generation with Transformer, [Arxiv]
Rongjie Li, Songyang Zhang, Xuming He
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
FishGym: A High-Performance Physics-Based Simulation Framework for Underwater Robot Learning, [Youtube]
Wenji Liu, Kai Bai, Xuming He, Shuran Song, Changxi Zheng, Xiaopei Liu
IEEE International Conference on Robotics and Automation (ICRA), 2022
Weakly Supervised Nuclei Segmentation via Instance Learning, [Arxiv]
Weizhen Liu, Qian He, Xuming He
IEEE International Symposium on Biomedical Imaging (ISBI), 2022 (Oral)
2021
DeepPhospho accelerates DIA phosphoproteome profiling through in silico library generation, [Web]
Ronghui Lou, Weizhen Liu, Rongjie Li, Shanshan Li, Xuming He, Wenqing Shui
Nature Communications, 12, 6685, 2021
Dynamic Grained Encoder for Vision Transformers,[Web]
Lin Song, Songyang Zhang, Songtao Liu, Zeming Li, Xuming He, Hongbin Sun, Jian Sun, Nanning Zheng
Advances in Neural Information Processing Systems (NeurIPS), 2021
GNeRF: GAN-based Neural Radiance Field without Posed Camera, [Arxiv]
Quan Meng, Anpei Chen, Haimin Luo, Minye Wu, Hao Su, Lan Xu, Xuming He, Jingyi Yu
International Conference on Computer Vision (ICCV), 2021
Single Image 3D Object Estimation with Primitive Graph Networks, [Arxiv]
Qian He, Desen Zhou, Bo Wan, Xuming He
ACM International Conference on Multimedia (MM ’21), 2021
An EM Framework for Online Class Incremental Semantic Segmentation with Dynamic Sampling, [Arxiv]
Shipeng Yan, Jiale Zhou, Jiangwei Xie, Songyang Zhang, Xuming He
ACM International Conference on Multimedia (MM ’21), 2021
Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition, [Arxiv]
Tailin Chen, Desen Zhou, Jian Wang, Shidong Wang, Yu Guan, Errui Ding, Xuming He
ACM International Conference on Multimedia (MM ’21), 2021
Superpixel-guided Iterative Learning from Noisy Labels for Medical Image Segmentation, [Arxiv]
Zhitong Gao*, Shuailin Li*, Xuming He
International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2021
Learning Implicit Temporal Alignment for Few-shot Video Classification, [Arxiv]
Jialei Zhou, Songyang Zhang, Xuming He
International Joint Conference on Artificial Intelligence (IJCAI), 2021
Weakly Supervised Volumetric Segmentation via Self-taught Shape Denoising Model, [OpenReview]
Qian He, Shuailin Li, Xuming He
Medical Imaging with Deep Learning (MIDL), 2021
Relation-aware Instance Refinement for Weakly Supervised Visual Grounding, [Arxiv]
Yongfei Liu, Bo Wan, Lin Ma, Xuming He
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
Distribution Alignment: A Unified Framework for Long-tail Visual Recognition, [Arxiv]
Songyang Zhang, Zeming Li, Shipeng Yan, Xuming He, Jian Sun
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph Generation, [Arxiv]
Rongjie Li, Songyang Zhang, Bo Wan, Xuming He
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
DER: Dynamically Expandable Representation for Class Incremental Learning, [Arxiv]
Shipeng Yan, Jiangwei Xie, Xuming He
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
2020
LGNN: A Context-aware Line Segment Detector, [Arxiv]
Quan Meng, Jiakai Zhang, Qiang Hu, Xuming He, Jingyi Yu
ACM International Conference on Multimedia (MM ’20), 2020
Part-aware prototype Network for Few-shot Semantic Segmentation, [Arxiv]
Yongfei Liu, Xiangyi Zhang, Songyang Zhang, Xuming He
European Conference of Computer Vision (ECCV), 2020
Confidence-aware Adversarial Learning for Self-supervised Semantic Matching, [Arxiv]
Shuaiyi Huang, Qiuyue Wang, Xuming He
Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 2020
Shape-aware Semi-supervised 3D Semantic Segmentation for Medical Images, [Arxiv]
Shuailin Li, Chuyu Zhang, Xuming He
International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2020
Learning Context-aware Task Reasoning for Efficient Meta-reinforcement Learning, [ArXiv]
Haozhe Wang, Jiale Zhou, Xuming He
International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2020
Learning Cross-Modal Context Graph for Visual Grounding, [Arxiv]
Yongfei Liu*, Bo Wan*, Xiaodan Zhu, Xuming He
AAAI Conference on Artificial Intelligence (AAAI), 2020
2019
Dynamic Context Correspondence Network for Semantic Alignment, [ArXiv]
Shuaiyi Huang, Qiuyue Wang, Songyang Zhang, Shipeng Yan, Xuming He
International Conference on Computer Vision (ICCV), 2019
Pose-aware Multi-level Feature Network for Human Object Interaction Detection, [ArXiv]
Bo Wan*, Desen Zhou*, Yongfei Liu, Rongjie Li, Xuming He
International Conference on Computer Vision (ICCV), 2019
LatentGNN: Learning Efficient Non-local Relations for Visual Recognition, [ArXiv]
Songyang Zhang, Shipeng Yan, Xuming He
International Conference on Machine Learning (ICML),2019
A Dual Attention Network with Semantic Embedding for Few-shot Learning, [pdf]
Shipeng Yan*, Songyang Zhang*, Xuming He
AAAI Conference on Artificial Intelligence (AAAI),2019
Learning a Layout Transfer Network for Context Aware Object Detection, [Arxiv]
Tao Wang,Xuming He,Yuanzheng Cai, Guobao Xiao
IEEE Transactions on Intelligent Transportation Systems,PP(99),1-16, 2019.
2018
3D Object Structure Recovery via Semi-supervised Learning on Videos, [pdf]
Qian He, Desen Zhou, Xuming He
British Machine Vision Conference (BMVC),2018
One-shot Action Localization by Learning Sequence Matching Network, [pdf]
Hongtao Yang, Xuming He, Fatih Porikli
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text, [pdf]
Alexander Mathews, Lexing Xie, Xuming He
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
Geometry-aware Deep Network for Single-Image Novel View Synthesis, [pdf]
Miaomiao Liu, Xuming He, Mathieu Salzmann
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
3D Box Proposals from a Single Monocular Image of an Indoor Scene, [pdf]
Wei Zhuo, Mathieu Salzmann, Xuming He, Miaomiao Liu
AAAI Conference on Artificial Intelligence (AAAI),2018
Instance-aware Detailed Action Labeling in Videos, [pdf]
Hongtao Yang, Xuming He, Fatih Porikli
IEEE Winter Conference on Applications of Computer Vision (WACV), 2018
2017
Deep Free-Form Deformation Network for Object-Mask Registration, [pdf]
Haoyang Zhang, Xuming He
International Conference on Computer Vision (ICCV), 2017
Efficient Scene Layout Aware Object Detection for Traffic Surveillance, [pdf]
Tao Wang, Xuming He, Songzhi Su, Yin Guan
IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2017.
Traffic Surveillance Workshop and Challenge Best Paper Award.
Forest Change Detection in Incomplete Satellite Images With Deep Neural Networks, [link]
Salman H. Khan, Xuming He, Fatih Porikli, Mohammed Bennamoun
IEEE Transactions on Geoscience and Remote Sensing, 2017.
Learning deep structured network for weakly supervised change detection, [pdf]
Salman Khan, Xuming He, Fatih Porikli, Ferdous Sohel, Roberto Togneri, Mohammed Bennamoun
International Joint Conference on Artificial Intelligence (IJCAI), 2017
Boundary-aware Instance Segmentation, [pdf]
Zeeshan Hayder, Xuming He, Mathieu Salzmann
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017
Indoor Scene Parsing with Instance Segmentation, Semantic Labeling and Support Relationship Inference, [pdf]
Wei Zhuo, Mathieu Salzmann, Xuming He, Miaomiao Liu
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017
Predicting Salient Face in Multiple-face Videos, [pdf]
Yufan Liu, Songyang Zhang, Mai Xu, Xuming He
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017
Stacked Learning to Search for Scene Labeling, [link]
Feiyang Cheng, Xuming He, Hong Zhang
IEEE Transactions on Image Processing, 2017
Learning Spatial Transforms for Refining Object Segment Proposals, [pdf]
Haoyang Zhang, Xuming He, Fatih Porikli
IEEE Winter Conference on Applications of Computer Vision (WACV), 2017
2016
Learning to Generate Object Proposals with Multi-modal Cues, [pdf]
Haoyang Zhang, Xuming He, Fatih Porikli
Asian Conference on Computer Vision (ACCV), 2016
Object-Aware Dictionary Learning with Deep Features, [pdf]
Yurui Xie, Fatih Porikli, Xuming He
Asian Conference on Computer Vision (ACCV), 2016
Learning Dynamic Hierarchical Models for Anytime Scene Labeling, [pdf] [arXiv]
Buyu Liu, Xuming He
European Conference on Computer Vision (ECCV), 2016
Building Scene Models by Completing and Hallucinating Depth and Semantics, [pdf]
Miaomiao Liu, Xuming He, Mathieu Salzmann
European Conference on Computer Vision (ECCV), 2016
Semantic Context and Depth-aware Object Proposal Generation, [pdf]
Haoyang Zhang, Xuming He, Fatih Porikli, Laurent Kneip
IEEE International Conference on Image Processing (ICIP), 2016
Learning to Co-Generate Object Proposals with a Deep Structured Network, [pdf]
Zeeshan Hayder, Xuming He, Mathieu Salzmann
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016
SentiCap: Generating Image Descriptions with Sentiments, [pdf] [arXiv]
Alexander Mathews, Lexing Xie, Xuming He
AAAI Conference on Artificial Intelligence (AAAI-16), 2016
Contour Completion without Region Segmentation, [link] [pdf]
Yansheng Ming, Hongdong Li, Xuming He
IEEE Transactions on Image Processing, vol. 25, no. 8, pp 3597–3611, 2016
2015
Structural Kernel Learning for Large Scale Multiclass Object Co-Detection, [pdf]
Zeeshan Hayder, Xuming He, Mathieu Salzmann
International Conference on Computer Vision (ICCV), 2015
Studying Object Naming with Online Photos and Caption, [pdf]
Alexander Mathews, Lexing Xie, Xuming He
Multimedia COMMONS, A Workshop at ACM Multimedia, 2015
Multiclass Semantic Video Segmentation with Object-Level Active Inference, [pdf] [suppl zip]
Buyu Liu, Xuming He
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
Indoor Scene Structure Analysis for Single Image Depth Estimation, [pdf]
Wei Zhuo, Mathieu Salzmann, Xuming He, Miaomiao Liu
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
Separating Objects and Clutter in Indoor Scenes, [pdf] [suppl pdf]
Salman H. Khan, Xuming He, Mohammed Bennamoun, Ferdous Sohel, Roberto Togneri
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
Choosing Basic-Level Concept Names using Visual and Language Context, [pdf] [suppl pdf]
Alexander Mathews, Lexing Xie, Xuming He
IEEE Winter Conference on Applications of Computer Vision (WACV), 2015
Multiclass Semantic Video Segmentation with Exemplar-based Object Reasoning, [pdf]
Buyu Liu, Xuming He, Stephen Gould
IEEE Winter Conference on Applications of Computer Vision (WACV), 2015
Motion Segmentation of Truncated Signed Distance Function Based Volumetric Surfaces, [pdf]
Samunda Perera, Nick Barnes, Xuming He, Shahram Izadi, Pushmeet Kohli, Ben Glocker
IEEE Winter Conference on Applications of Computer Vision (WACV), 2015
Robust Face Alignment Under Occlusion via Regional Predictive Power Estimation, [link] [pdf]
Heng Yang, Xuming He, Xuhui Jia, I. Patras
IEEE Transactions on Image Processing, vol.24, no.8, pp 2393–2403, 2015
2014
Object Co-Detection via Efficient Inference in a Fully-Connected CRF, [pdf]
Zeeshan Hayder, Mathieu Salzmann, Xuming He
European Conference on Computer Vision (ECCV), 2014
Superpixel Graph Label Transfer with Learned Distance Metric, [pdf]
Stephen Gould, Jiecheng Zhao, Xuming He, Yuhang Zhang
European Conference on Computer Vision (ECCV), 2014
An Exemplar-based CRF for Multi-instance Object Segmentation, [pdf]
Xuming He, Stephen Gould
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014
Discrete-Continuous Depth Estimation from a Single Image, [pdf]
Miaomiao Liu, Mathieu Salzmann, Xuming He
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014
Joint Semantic and Geometric Segmentation of Videos with a Stage Model, [pdf]
Buyu Liu, Xuming He, Stephen Gould
IEEE Winter Conference on Applications of Computer Vision (WACV), 2014
Data-Driven Street Scene Layout Estimation for Distant Object Detection, [pdf]
Donghao Zhang, Xuming He, Hanxi Li
International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2014
The Effectiveness of Prosthetic Fixation for Recognizing Faces in Natural Scenes, [poster pdf]
Janine Walker, Xuming He, Hanxi Li and Nick Barnes
Annual Meeting of the Association for Research in Vision and Ophthalmology (ARVO), 2014
Winding Number Constrained Contour Detection, [link] [pdf]
Yansheng Ming, Hongdong Li, Xuming He
IEEE Transactions on Image Processing, vol. 24, no. 1, pp 68–79, 2014
Scene Understanding by Labeling Pixels, [link] [pdf]
Stephen Gould, Xuming He
Communications of the ACM, vol. 57, no. 11, pp 68–77, 2014
A New Theoretical Approach to Improving Face Recognition in Disorders of Central Vision: Face caricaturing, [link] [pdf]
Jessica Irons, Elinor McKone, Rachael Dumbleton, Nick Barnes, Xuming He, Jan Provis, Callin Ivanovici, Alisa Kwa
Journal of Vision, 14(2):12. 2014.
2013
Learning Structured Hough Voting for Joint Object Detection and Occlusion Reasoning, [pdf]
Tao Wang, Xuming He, Nick Barnes
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013
Winding Number for Region-Boundary Consistent Salient Contour Extraction, [pdf]
Yansheng Ming, Hongdong Li, Xuming He
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013
Picture Tags and World Knowledge: Learning Tag Relations from Visual Semantic Sources, [pdf]
Lexing Xie, Xuming He
The 21st ACM International Conference on Multimedia (ACM MM), 2013
Symmetry Detection via Contour Grouping, [pdf]
Yansheng Ming, Hongdong Li, Xuming He
IEEE International Conference on Image Processing (ICIP), 2013
Glass Object Segmentation by Label Transfer on Joint Depth and Apearance Manifolds, [pdf]
Tao Wang, Xuming He, Nick Barnes
IEEE International Conference on Image Processing (ICIP), 2013
Tracking Large-scale Video Remix in Real-world Events, [link] [pdf]
Lexing Xie, Apostol Natsev, Xuming He, John Kender, Matthew Hill, John R Smith
IEEE Transactions on Multimedia, 15(6), 2013.
2012
Connected Contours: a Contour Completion Model That Respects Closure-Effect, [pdf]
Yansheng Ming, Hongdong Li, Xuming He
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012
Glass Object Localization by Joint Inference of Boundary and Depth, [pdf]
Tao Wang, Xuming He, Nick Barnes
International Conference on Pattern Recognition (ICPR), 2012
The Role of Vision Processing in Prosthetic Vision,
Nick Barnes, Xuming He, Chris McCarthy, Lachlan Horne, Junae Kim, Adele Scott and Paulette Lieby
Annual International Conference of the Engineering in Medicine and Biology Society (EMBC), 2012
An Face-based Visual Fixation System for Prosthetic Vision,
Xuming He, Junae Kim, Nick Barnes
Annual International Conference of the Engineering in Medicine and Biology Society (EMBC), 2012
Probabilistic Models of Vision and Max-margin Methods, [link] [pdf]
Alan Yuille and Xuming He
Frontiers of Electrical and Electronic Engineering, 7(1), 94-106, 2012.
2011
Laplacian Margin Distribution Boosting for Learning from Sparsely Labeled Data, [pdf]
Tao Wang, Xuming He, Chunhua Shen, Nick Barnes
International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011
Face Detection and Tracking in Video To Facilitate Face Recognition in a Visual Prothesis, [poster pdf]
Xuming He, Chunhua Shen, Nick Barnes
Annual Meeting of the Association for Research in Vision and Ophthalmology (ARVO), 2011
2010
A Unified Model of Short-range and Long-range Motion Perception, [pdf]
Shuang Wu, Xuming He, Hongjing Lu, and Alan Yuille
Annual Conference on Neural Information Processing Systems (NIPS), 2010
2008
Learning Hybrid Models for Image Annotation with Partially Labeled Data, [pdf]
Xuming He, and Richard S. Zemel
Annual Conference on Neural Information Processing Systems (NIPS), 2008
Latent Topic Random Fields: Learning Using a Taxonomy of Labels, [pdf (with Appendix)]
Xuming He, and Richard S. Zemel
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2008
Learning Flexible Features for Conditional Random Fields, [link] [pdf]
Liam Stewart, Xuming He, Richard S. Zemel
IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(8), 1415-1426, 2008.
2007 —
Topological Map Learning from Outdoor Image Sequences, [link] [pdf]
Xuming He, Richard Zemel, and Volodymyr Mnih
Journal of Field Robotics, 23, 1091-1104, 2007.
Learning and Incorporating Top-down Cues in Image Segmentation, [pdf]
Xuming He, Richard Zemel, and Deb Ray
European Conference on Computer Vision (ECCV), 2006.
Learning Landmarks for Localization via Manifolds,
Xuming He, Richard Zemel, and Volodymyr Mnih
NIPS Workshop on Machine Learning Based Robotics in Unstructured Environments, 2005
Multiscale Conditional Random Fields for Image Labelling, [pdf]
Xuming He, Richard Zemel, and Miguel Carreira-Perpinan
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2004
Ph.D. Thesis
Xuming He,
Learning Structured Prediction Models for Image Labeling,
PhD thesis, Department of Computer Science, University of Toronto, 2008
[pdf]
Technical Reports
Budget-aware Few-shot Learning via Graph Convolutional Network, [arXiv]
Shipeng Yan, Songyang Zhang, Xuming He
Arxiv, 2022
Simplifying Sentences with Sequence to Sequence Models, [arXiv]
Alexander Mathews, Lexing Xie, Xuming He
Arxiv, 2018
Weakly Supervised Change Detection in a Pair of Images, [arXiv]
Salman H Khan, Xuming He, Mohammed Bennamoun, Fatih Porikli, Ferdous Sohel, Roberto Togneri
Arxiv, 2016
Semantic-Aware Depth Super-Resolution in Outdoor Scenes, [arXiv]
Miaomiao Liu, Mathieu Salzmann, Xuming He
Arxiv, 2016
Structured Depth Prediction in Challenging Monocular Video Sequences, [arXiv]
Miaomiao Liu, Mathieu Salzmann, Xuming He
Arxiv, 2015
|