Publications

* denotes equal contribution. Full list also on Google Scholar.

2026

  1. Tech Report
    Kimi K2.5: Visual Agentic Intelligence
    Kimi Team
    arXiv preprint arXiv:2602.02276, 2026
  2. Tech Report
    The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence
    MiniMax
    arXiv preprint arXiv:2605.26494, 2026
  3. ICML 2026
    XSkill: Continual Learning from Experience and Skills in Multimodal Agents
    Guanyu Jiang, Zhaochen Su, Xiaoye Qu, and Yi R. (May) Fung
    In International Conference on Machine Learning (ICML). Guanyu Jiang and Zhaochen Su contributed equally , 2026
  4. arXiv
    AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios
    Zhaochen Su, Jincheng Gao, Hangyu Guo, Zhenhua Liu, Lueyang Zhang, Xinyu Geng, Shijue Huang, Peng Xia, Guanyu Jiang, Cheng Wang, Yue Zhang, Yi R. (May) Fung, and Junxian He
    arXiv preprint arXiv:2602.23166, 2026
  5. ICLR 2026
    The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
    Junlong Li, Wenshuo Zhao, Jian Zhao, Weihao Zeng, Haoze Wu, Xiaochen Wang, Rui Ge, Yuxuan Cao, Yuzhen Huang, Wei Liu, Junteng Liu, Zhaochen Su, Yiyang Guo, Fan Zhou, Lueyang Zhang, Juan Michelini, Xingyao Wang, Xiang Yue, Shuyan Zhou, Graham Neubig, and Junxian He
    In International Conference on Learning Representations (ICLR), 2026
  6. ICLR 2026
    GRACE: Generative Representation Learning via Contrastive Policy Optimization
    Jiashuo Sun, Shixuan Liu, Zhaochen Su, Xianrui Zhong, Pengcheng Jiang, Bowen Jin, Peiran Li, Weijia Shi, and Jiawei Han
    In International Conference on Learning Representations (ICLR), 2026
  7. ICLR 2026
    Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning
    Shuang Chen, Yue Guo, Zhaochen Su, Yafu Li, Yulun Wu, Jiacheng Chen, Jiayu Chen, Weijie Wang, Xiaoye Qu, and Yu Cheng
    In International Conference on Learning Representations (ICLR), 2026

2025

  1. arXiv
    Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers
    Zhaochen Su, Peng Xia, Hangyu Guo, Zhenhua Liu, Yan Ma, Xiaoye Qu, Jiaqi Liu, Yanshu Li, Kaide Zeng, Zhengyuan Yang, Linjie Li, Yu Cheng, Heng Ji, Junxian He, and Yi R. Fung
    arXiv preprint arXiv:2506.23918, 2025
  2. arXiv
    OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
    Zhaochen Su, Linjie Li, Mingyang Song, Yunzhuo Hao, Zhengyuan Yang, Jun Zhang, Guanjie Chen, Jiawei Gu, Juntao Li, Xiaoye Qu, and Yu Cheng
    arXiv preprint arXiv:2505.08617, 2025
  3. ACL 2025
    PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models
    Mingyang Song, Zhaochen Su, Xiaoye Qu, Jiawei Zhou, and Yu Cheng
    In Annual Meeting of the Association for Computational Linguistics (ACL), 2025

2024

  1. NeurIPS 2024
    ConflictBank: A Benchmark for Evaluating Knowledge Conflicts in Large Language Models
    Zhaochen Su, Jun Zhang, Xiaoye Qu, Tong Zhu, Yanshu Li, Jiashuo Sun, Juntao Li, Min Zhang, and Yu Cheng
    In Advances in Neural Information Processing Systems (NeurIPS), 2024
  2. COLM 2024
    Timo: Towards Better Temporal Reasoning for Language Models
    Zhaochen Su, Jun Zhang, Tong Zhu, Xiaoye Qu, Juntao Li, Min Zhang, and Yu Cheng
    In Conference on Language Modeling (COLM), 2024
  3. ACL 2024
    Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?
    Zhaochen Su, Juntao Li, Jun Zhang, Tong Zhu, Xiaoye Qu, Pan Zhou, Yan Bowen, Yu Cheng, and Min Zhang
    In Annual Meeting of the Association for Computational Linguistics (ACL), 2024
  4. EMNLP 2024
    SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information
    Jiashuo Sun, Jihai Zhang, Yucheng Zhou, Zhaochen Su, Xiaoye Qu, and Yu Cheng
    In Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

2023

  1. EMNLP 2023
    Efficient Continue Training of Temporal Language Model with Structural Information
    Zhaochen Su, Juntao Li, Zikang Zhang, Zihan Zhou, and Min Zhang
    In Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022

  1. EMNLP 2022
    Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change
    Zhaochen Su, Zecheng Tang, Xinyan Guan, Lijun Wu, Min Zhang, and Juntao Li
    In Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022