publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction
    Songyan Zhang, Yongtao Ge, Jinyuan Tian, Guangkai Xu, Hao Chen, and 2 more authors
    In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
  2. SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting
    Zihui Gao, Jia-Wang Bian, Guosheng Lin, Hao Chen, and Chunhua Shen
    In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
  3. Unified Open-World Segmentation with Multi-Modal Prompts
    Yang Liu, Yufei Yin, Chenchen Jing, Muzhi Zhu, Hao Chen, and 5 more authors
    In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
  4. SIGGRAPH
    gvm.png
    Generative Video Matting
    Yongtao Ge, Kangyang Xie, Guangkai Xu, Li Ke, Mingyu Liu, and 4 more authors
    In Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers, 2025
  5. SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
    Muzhi Zhu, Yuzhuo Tian, Hao Chen, Chunluan Zhou, Qingpei Guo, and 3 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
  6. Revisiting Convolution Architecture in the Realm of DNA Foundation Models
    Yu Bo, Weian Mao, Yanjun Shao, Weiqiang Bai, Peng Ye, and 4 more authors
    In The Thirteenth International Conference on Learning Representations, 2025
  7. PerturboLLaVA: Reducing Multimodal Hallucinations with Perturbative Visual Training
    Cong Chen, Mingyu Liu, Chenchen Jing, Yizhou Zhou, Fengyun Rao, and 3 more authors
    In The Thirteenth International Conference on Learning Representations, 2025
  8. Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions
    Xiaoran Jiao, Weian Mao, Wengong Jin, Peiyuan Yang, Hao Chen, and 1 more author
    In The Thirteenth International Conference on Learning Representations, 2025
  9. MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences
    Canyu Zhao, Mingyu Liu, Wen Wang, Weihua Chen, Fan Wang, and 3 more authors
    In The Thirteenth International Conference on Learning Representations, 2025
  10. Framer: Interactive Frame Interpolation
    Wen Wang, Qiuyu Wang, Kecheng Zheng, Hao Ouyang, Zhekai Chen, and 4 more authors
    In The Thirteenth International Conference on Learning Representations, 2025

2024

  1. 3DV
    LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation
    Weijie Ma, Jingwei Jiang, Yang Yang, Zehui Chen, and Hao Chen
    2024
  2. Generative Active Learning for Long-tailed Instance Segmentation
    Muzhi Zhu, Chengxiang Fan, Hao Chen, Yang Liu, Weian Mao, and 2 more authors
    In Forty-First International Conference on Machine Learning, 2024
  3. Floating Anchor Diffusion Model for Multi-motif Scaffolding
    Ke Liu, Weian Mao, Shuaike Shen, Xiaoran Jiao, Zheng Sun, and 2 more authors
    In Forty-First International Conference on Machine Learning, 2024
  4. Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-Shot Metric Depth and Surface Normal Estimation
    Mu Hu, Wei Yin, Chi Zhang, Zhipeng Cai, Xiaoxiao Long, and 5 more authors
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024
  5. AAAI
    diffcalib.jpg
    DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation
    Xiankang He, Guangkai Xu, Bo Zhang, Hao Chen, Ying Cui, and 1 more author
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2024
  6. A Simple Image Segmentation Framework via In-Context Examples
    Yang Liu, Chenchen Jing, Hengtao Li, Muzhi Zhu, Hao Chen, and 2 more authors
    In , 2024
  7. Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation
    Muzhi Zhu, Yang Liu, Zekai Luo, Chenchen Jing, Hao Chen, and 3 more authors
    Advances in Neural Information Processing Systems, 2024
  8. AAAI
    Revisiting Open-Set Panoptic Segmentation
    Yufei Yin, Hao Chen, Wengang Zhou, Jiajun Deng, Haiming Xu, and 1 more author
    Proceedings of the AAAI Conference on Artificial Intelligence, 2024
  9. AAAI
    Retrieval-Augmented Primitive Representations for Compositional Zero-Shot Learning
    Chenchen Jing, Yukun Li, Hao Chen, and Chunhua Shen
    Proceedings of the AAAI Conference on Artificial Intelligence, 2024
  10. LoRAPrune: Structured Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning
    Mingyang Zhang, Hao Chen, Chunhua Shen, Zhen Yang, Linlin Ou, and 2 more authors
    In Findings of the Association for Computational Linguistics: ACL 2024, Aug 2024
  11. ECCV
    freecompose.png
    FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior
    Zhekai Chen, Wen Wang, Zhen Yang, Zeqing Yuan, Hao Chen, and 1 more author
    In The 17th European Conference on Computer Vision ECCV 2024, Aug 2024
  12. FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
    Ganggui Ding, Canyu Zhao, Wen Wang, Zhen Yang, Zide Liu, and 2 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Aug 2024
  13. DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data
    Chengxiang Fan, Muzhi Zhu, Hao Chen, Yang Liu, Weijia Wu, and 2 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Aug 2024
  14. Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching
    Yang Liu, Muzhi Zhu, Hengtao Li, Hao Chen, Xinlong Wang, and 1 more author
    In The Twelfth International Conference on Learning Representations, Aug 2024
  15. AutoStory: Generating Diverse Storytelling Images with Minimal Human Efforts
    Wen Wang, Canyu Zhao, Hao Chen, Zhekai Chen, Kecheng Zheng, and 1 more author
    International Journal of Computer Vision, Aug 2024
  16. De Novo Protein Design Using Geometric Vector Field Networks
    Weian Mao, Muzhi Zhu, Zheng Sun, Shuaike Shen, Lin Yuanbo Wu, and 2 more authors
    In The Twelfth International Conference on Learning Representations, Aug 2024
  17. What Matters When Repurposing Diffusion Models for General Dense Perception Tasks?
    Guangkai Xu, Yongtao Ge, Mingyu Liu, Chengxiang Fan, Kangyang Xie, and 3 more authors
    In The Thirteenth International Conference on Learning Representations, Aug 2024

2023

  1. DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
    Weijia Wu, Yuzhong Zhao, Hao Chen, Yuchao Gu, Rui Zhao, and 4 more authors
    Advances in Neural Information Processing Systems, Aug 2023
  2. Learning To Fuse Monocular and Multi-View Cues for Multi-Frame Depth Estimation in Dynamic Scenes
    Rui Li, Dong Gong, Wei Yin, Hao Chen, Yu Zhu, and 4 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Aug 2023
  3. Learning Conditional Attributes for Compositional Zero-Shot Learning
    Qingsheng Wang, Lingqiao Liu, Chenchen Jing, Hao Chen, Guoqiang Liang, and 2 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Aug 2023
  4. A Dynamic Feature Interaction Framework for Multi-task Visual Perception
    Yuling Xi, Hao Chen, Ning Wang, Peng Wang, Yanning Zhang, and 2 more authors
    International Journal of Computer Vision, Aug 2023
  5. FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models
    Guangkai Xu, Wei Yin, Hao Chen, Chunhua Shen, Kai Cheng, and 1 more author
    In Proceedings of the IEEE/CVF International Conference on Computer Vision, Aug 2023
  6. Object-Aware Inversion and Reassembly for Image Editing
    Zhen Yang, Ganggui Ding, Wen Wang, Hao Chen, Bohan Zhuang, and 1 more author
    In The Twelfth International Conference on Learning Representations, Aug 2023
  7. CTVIS: Consistent Training for Online Video Instance Segmentation
    Kaining Ying, Qing Zhong, Weian Mao, Zhenhua Wang, Hao Chen, and 5 more authors
    In Proceedings of the IEEE/CVF International Conference on Computer Vision, Aug 2023
  8. Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image
    Wei Yin, Chi Zhang, Hao Chen, Zhipeng Cai, Gang Yu, and 3 more authors
    In Proceedings of the IEEE/CVF International Conference on Computer Vision, Aug 2023
  9. SegPrompt: Boosting Open-World Segmentation via Category-Level Prompt Learning
    Muzhi Zhu, Hengtao Li, Hao Chen, Chengxiang Fan, Weian Mao, and 3 more authors
    In Proceedings of the IEEE/CVF International Conference on Computer Vision, Aug 2023

2022

  1. Instance and panoptic segmentation using conditional convolutions
    Zhi Tian, Bowen Zhang, Hao Chen, and Chunhua Shen
    IEEE Transactions on Pattern Analysis and Machine Intelligence, Aug 2022

2021

  1. Abcnet v2: Adaptive bezier-curve network for real-time end-to-end text spotting
    Yuliang Liu, Chunhua Shen, Lianwen Jin, Tong He, Peng Chen, and 2 more authors
    IEEE Transactions on Pattern Analysis and Machine Intelligence, Aug 2021
  2. Exploring the capacity of an orderless box discretization network for multi-orientation scene text detection
    Yuliang Liu, Tong He, Hao Chen, Xinyu Wang, Canjie Luo, and 3 more authors
    International Journal of Computer Vision, Aug 2021
  3. Boxinst: High-performance instance segmentation with box annotations
    Zhi Tian, Chunhua Shen, Xinlong Wang, and Hao Chen
    In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, Aug 2021
  4. Generic perceptual loss for modeling structured output dependencies
    Yifan Liu, Hao Chen, Yu Chen, Wei Yin, and Chunhua Shen
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Aug 2021

2020

  1. FCOS: A simple and strong anchor-free object detector
    Zhi Tian, Chunhua Shen, Hao Chen, and Tong He
    IEEE transactions on pattern analysis and machine intelligence, Aug 2020
  2. ECCV
    Conditional convolutions for instance segmentation
    Zhi Tian, Chunhua Shen, and Hao Chen
    In European conference on computer vision, Aug 2020
  3. Blendmask: Top-down meets bottom-up for instance segmentation
    Hao Chen, Kunyang Sun, Zhi Tian, Chunhua Shen, Yongming Huang, and 1 more author
    In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, Aug 2020
  4. Abcnet: Real-time scene text spotting with adaptive bezier-curve network
    Yuliang Liu, Hao Chen, Chunhua Shen, Tong He, Lianwen Jin, and 1 more author
    In proceedings of the IEEE/CVF conference on computer vision and pattern recognition, Aug 2020
  5. NAS-FCOS: Fast neural architecture search for object detection
    Ning Wang, Yang Gao, Hao Chen, Peng Wang, Zhi Tian, and 2 more authors
    In proceedings of the IEEE/CVF conference on computer vision and pattern recognition, Aug 2020
  6. Memory-efficient hierarchical neural architecture search for image denoising
    Haokui Zhang, Ying Li, Hao Chen, and Chunhua Shen
    In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, Aug 2020
  7. WACV
    Architecture search of dynamic cells for semantic video segmentation
    Vladimir Nekrasov, Hao Chen, Chunhua Shen, and Ian Reid
    In Proceedings of the ieee/cvf winter conference on applications of computer vision, Aug 2020

2019

  1. FCOS: Fully Convolutional One-Stage Object Detection
    Zhi Tian, Chunhua Shen, Hao Chen, and Tong He
    In Proceedings of the IEEE/CVF international conference on computer vision, Aug 2019
  2. IJCAI
    Light-Weight Hybrid Convolutional Network for Liver Tumor Segmentation.
    Jianpeng Zhang, Yutong Xie, Pingping Zhang, Hao Chen, Yong Xia, and 1 more author
    In IJCAI, Aug 2019
  3. Fast neural architecture search of compact semantic segmentation models via auxiliary cells
    Vladimir Nekrasov, Hao Chen, Chunhua Shen, and Ian Reid
    In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, Aug 2019
  4. Adversarial learning of structure-aware fully convolutional networks for landmark localization
    Yu Chen, Chunhua Shen, Hao Chen, Xiu-Shen Wei, Lingqiao Liu, and 1 more author
    IEEE transactions on pattern analysis and machine intelligence, Aug 2019