Ce Zhang 张策

PhD Student
Carnegie Mellon University
cezhang (at) cs.cmu.edu


About Me

Hi there! I’m Ce Zhang. I am currently a third-year PhD candidate in the Robotics Institute at Carnegie Mellon University (CMU), advised by Prof. Katia Sycara, with an expected graduation of 2028.

Previously, I obtained my M.Sc. in Machine Learning from the Machine Learning Department at CMU, also advised by Prof. Katia Sycara. Before that, I received my B.Eng. in Communication Engineering from Southern University of Science and Technology (SUSTech), where I worked under the supervision of Prof. Zhihai He.

Feel free to reach out if you’re interested in my work or would like to discuss potential collaborations!

Research Interests

I build multi-modal AI systems that are efficient and reliable enough for real-world use.

News

  • [Jan 2026] Our paper "VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models" has been accepted to TMLR.
  • [Mar 2025] I will be joining the Ph.D. in Robotics program at Carnegie Mellon University (CMU) in Fall 2025.
  • [Jan 2025] Our paper "Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models" has been accepted to ICLR 2025.

Publications

    2026

  1. LENS: Adaptive Spatio-Temporal Zooming for Keyframe Sampling in Long-Form Videos ECCV 2026
    Ce Zhang, Jinxi He, Yaqi Xie, Katia Sycara
    European Conference on Computer Vision (ECCV), 2026.
    Malmö, Sweden, September 8–12, 2026
    Code and paper coming soon.

  2. Evolving Contextual Safety in Multi-Modal Large Language Models via Inference-Time Self-Reflective Memory CVPR 2026
    Ce Zhang*, Jinxi He*, Junyi He, Katia Sycara, Yaqi Xie (*Equal contribution)
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026.
    Also at ICLR 2026 Workshop on Lifelong Agents: Learning, Aligning, Evolving.
    Denver, CO, US, June 3-7, 2026

  3. pySpatial: Generating 3D Visual Programs for Zero-Shot Spatial Reasoning ICLR 2026
    Zhanpeng Luo*, Ce Zhang*, Silong Yong, Cunxi Dai, Qianwei Wang, Haoxi Ran, Guanya Shi, Katia Sycara, Yaqi Xie (*Equal contribution)
    International Conference on Learning Representations (ICLR), 2026.
    Rio de Janeiro, Brazil, April 23-27, 2026

  4. VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models TMLR 2026
    Ce Zhang, Kaixin Ma, Tianqing Fang, Wenhao Yu, Hongming Zhang, Zhisong Zhang, Haitao Mi, Dong Yu
    Transactions on Machine Learning Research (TMLR), 2026.
    Also at ICML 2025 Workshop on Efficient Systems for Foundation Models (ES-FoMo III).

  5. WebAggregator: Enhancing Compositional Reasoning Capabilities of Deep Research Agent Foundation Models ACL 2026 (Main)
    Rui Wang, Ce Zhang, Jun-Yu Ma, Jianshu Zhang, Hongru Wang, Yi Chen, Boyang Xue, Tianqing Fang, Zhisong Zhang, Hongming Zhang, Haitao Mi, Dong Yu, Kam-Fai Wong
    Annual Meeting of the Association for Computational Linguistics (ACL), 2026.
    San Diego, CA, US, July 2-7, 2026

    2025

  7. ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models ICCV 2025
    Zifu Wan*, Ce Zhang*, Silong Yong, Martin Ma, Simon Stepputtis, Louis-Philippe Morency, Deva Ramanan, Katia Sycara, Yaqi Xie (*Equal contribution)
    IEEE/CVF International Conference on Computer Vision (ICCV), 2025.
    Honolulu, HI, US, October 19–23, 2025

  8. InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning ACL 2025 (Main)
    Zifu Wan, Yaqi Xie, Ce Zhang, Zhiqiu Lin, Zihan Wang, Simon Stepputtis, Deva Ramanan, Katia Sycara
    Annual Meeting of the Association for Computational Linguistics (ACL), 2025.
    Also at AAAI 2024 Workshop on Public Sector LLMs: Algorithmic and Sociotechnical Design.
    Vienna, Austria, July 27 - August 1, 2025

  9. Spectral-Aware Global Fusion for RGB-Thermal Semantic Segmentation ICIP 2025
    Ce Zhang, Zifu Wan, Simon Stepputtis, Katia Sycara, Yaqi Xie
    International Conference on Image Processing (ICIP), 2025.
    Anchorage, AK, US, September 14–17, 2025
    Selected for a Lecture presentation.

  10. Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models ICLR 2025
    Ce Zhang*, Zifu Wan*, Zhehan Kan, Martin Ma, Simon Stepputtis, Deva Ramanan, Russ Salakhutdinov, Louis-Philippe Morency, Katia Sycara, Yaqi Xie (*Equal contribution)
    International Conference on Learning Representations (ICLR), 2025.
    Also at NeurIPS 2024 Workshop on Responsibly Building the Next Generation of Multimodal Foundational Models.
    Singapore, April 24-28, 2025

  11. Enhancing Vision-Language Few-Shot Adaptation with Negative Learning WACV 2025
    Ce Zhang, Simon Stepputtis, Katia Sycara, Yaqi Xie
    IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025.
    Also at ICLR 2024 Workshop on Mathematical and Empirical Understanding of Foundation Models.
    Tucson, AZ, US, February 28 - March 4, 2025

    2024

  13. Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models NeurIPS 2024
    Ce Zhang, Simon Stepputtis, Katia Sycara, Yaqi Xie
    Conference on Neural Information Processing Systems (NeurIPS), 2024.
    Also at ICML 2024 Workshop on Foundation Models in the Wild.
    Vancouver, Canada, December 10-15, 2024

  14. HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation CVPR 2024
    Ce Zhang, Simon Stepputtis, Joseph Campbell, Katia Sycara, Yaqi Xie
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
    Also at NeurIPS 2023 New Frontiers in Graph Learning Workshop.
    Seattle, WA, US, June 17-21, 2024

  15. Concept-Guided Prompt Learning for Generalization in Vision-Language Models AAAI 2024
    Yi Zhang, Ce Zhang, Ke Yu, Yushun Tang, Zhihai He
    AAAI Conference on Artificial Intelligence (AAAI), 2024.
    Vancouver, Canada, February 22-25, 2024

  16. Learning to Adapt CLIP for Few-Shot Monocular Depth Estimation WACV 2024
    Xueting Hu, Ce Zhang, Yi Zhang, Bowen Hai, Ke Yu, Zhihai He
    IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024.
    Waikoloa, HI, US, January 4-8, 2024

    2023

  18. Critical Sampling for Robust Evolution Operator Learning of Unknown Dynamical Systems IEEE TAI
    Ce Zhang, Kailiang Wu, Zhihai He
    IEEE Transactions on Artificial Intelligence, 2023.
    Also at First Workshop on Out-of-Distribution Generalization in Robotics at CoRL 2023.
    Atlanta, GA, US, November 6-9, 2023

  19. BDC-Adapter: Brownian Distance Covariance for Better Vision-Language Reasoning BMVC 2023
    Yi Zhang*, Ce Zhang*, Zihan Liao, Yushun Tang, Zhihai He (*Equal contribution)
    British Machine Vision Conference (BMVC), 2023.
    Aberdeen, UK, November 20-24, 2023

  20. Neuro-Modulated Hebbian Learning for Fully Test-Time Adaptation CVPR 2023
    Yushun Tang, Ce Zhang, Heng Xu, Shuoshuo Chen, Jie Cheng, Luziwei Leng, Qinghai Guo, Zhihai He
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
    Vancouver, Canada, June 18-22, 2023

  21. Self-Correctable and Adaptable Inference for Generalizable Human Pose Estimation CVPR 2023
    Zhehan Kan, Shuoshuo Chen, Ce Zhang, Yushun Tang, Zhihai He
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
    Vancouver, Canada, June 18-22, 2023

Experiences

Honors and Awards

Services

