Ji Qi   齐济


Postdoc Fellow @ Zhipu & Tsinghua

Bio

I am a Postdoctoral Fellow jointly affiliated with Tsinghua University and Zhipu AI, where I am fortunate to work with Prof. Jie Tang.

Previously, I received my Ph.D. from the Knowledge Engineering Group (KEG), Department of Computer Science and Technology, Tsinghua University in 2025, advised by Prof. Bin Xu and Prof. Juanzi Li.

I was fortunate to be a visiting student at the School of Computing, National University of Singapore, advised by Prof. Tat-Seng Chua.

Currently, I work on the foundations and development of large multimodal models (LMMs).


  • New Research on Large Multimodal Models
    April 2026

    We recently released GLM-5V-Turbo, the first multimodal coding foundation model, built for vision-based coding tasks.

  • New Research on Large Multimodal Models
    July 2025

    We recently released GLM-4.1V and GLM-4.5V, two foundational and powerful large multimodal models.

  • New Research on Multimodal Video Understanding
    April 2025

    We recently released Quicksviewer, an LMM for efficient long video understanding via reinforced compression of video cubes.

Selected Papers


  1. GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
    GLM-V Team
    Preprint
  2. An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes
    Ji Qi, Yuan Yao, Yushi Bai, Bin Xu, Juanzi Li, Zhiyuan Liu, and Tat-Seng Chua
    Preprint
  3. CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning
    Ji Qi, Ming Ding, Weihan Wang, Yushi Bai, Qingsong Lv, Wenyi Hong, Bin Xu, Lei Hou, Juanzi Li, Yuxiao Dong, and Jie Tang
    ICLR 2024
  4. Preserving Knowledge Invariance: Rethinking Robustness Evaluation of Open Information Extraction
    Ji Qi, Chuchun Zhang, Xiaozhi Wang, Kaisheng Zeng, Jifan Yu, Jinxin Liu, Lei Hou, Juanzi Li, and Bin Xu
    EMNLP 2023 Outstanding Paper Award
  5. GOAL: A challenging knowledge-grounded video captioning benchmark for real-time soccer commentary generation
    Ji Qi, Jifan Yu, Teng Tu, Kunyu Gao, Yifan Xu, Xinyu Guan, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li, and Jie Tang
    CIKM 2023
  6. Syntactically robust training on partially-observed data for open information extraction
    Ji Qi, Yuxiang Chen, Lei Hou, Juanzi Li, and Bin Xu
    EMNLP 2022

Service

  • NeurIPS 2022~2025
  • ICML 2022~2025
  • ICLR 2022~2025
  • CVPR 2022~2025
  • ICCV 2022~2025
  • ACL 2022~2025
  • EMNLP 2022~2025