Selected Publications

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows
Qiushi Sun, Zhoumianze Liu, Chang Ma, Zichen Ding, Fangzhi Xu, Zhangyue Yin, Haiteng Zhao, Zhenyu Wu, Kanzhi Cheng, Zhaoyang Liu, Jianing Wang, Qintong Li, Xiangru Tang, Tianbao Xie, Xiachong Feng, Xiang Li, Ben Kao, Wenhai Wang, Biqing Qi, Lingpeng Kong, Zhiyong Wu
[Paper] | [Slides] | [Project] | [Env] | [Code] |

- First to apply computer-using agents to assist scientific exploration
- Dynamic environment & benchmark for realistic scientific workflows
- Comprehensive evaluation of SOTA LLM/VLM agents

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence 
Qiushi Sun, Jingyang Gong, Yang Liu, Qiaosheng Chen, Lei Li, Kai Chen, Qipeng Guo, Ben Kao, Fei Yuan
[Paper] | [Slides] | [Project] | [Code] |

- JanusCoder series: foundational models establishing a unified visual-programmatic interface.
- A versatile data synthesis toolkit for multimodal code intelligence.
- Superior performance on diverse text- and vision-centric tasks.

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
Qiushi Sun*, Kanzhi Cheng*, Zichen Ding*, Chuanyang Jin*, Yian Wang, Fangzhi Xu, Zhenyu Wu, Chengyou Jia, Liheng Chen, Zhoumianze Liu, Ben Kao, Guohao Li, Junxian He, Yu Qiao, Zhiyong Wu
[Paper] | [Slides] | [Project] | [Models & Data] |

- Shift from task-driven to interaction-driven GUI data synthesis
- A manual-free pipeline for constructing diverse GUI agent trajectories
- Great performance on online mobile/web benchmarks

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows
Qiushi Sun*, Mukai Li*, Zhoumianze Liu*, Zhihui Xie*, Fangzhi Xu, Zhangyue Yin, Kanzhi Cheng, Zehao Li, Zichen Ding, Qi Liu, Zhiyong Wu, Zhuosheng Zhang, Ben Kao, Lingpeng Kong
[Paper] | [Slides] | [Project] | [Env] | [Code] |

- MobileRisk-Live & MobileRisk, a dynamic environment and benchmark for realistic mobile agent safety
- OS-Sentinel, a hybrid detection framework combining formal verification with contextual judgment
- Advanced mobile agent safety at both the step-level and trajectory-level

A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond
Qiushi Sun, Zhirui Chen, Fangzhi Xu, Chang Ma, Kanzhi Cheng, Zhangyue Yin, Jianing Wang, Chengcheng Han, Renyu Zhu, Shuai Yuan, Pengcheng Yin, Qipeng Guo, Xipeng Qiu, Xiaoli Li, Fei Yuan, Lingpeng Kong, Xiang Li, Zhiyong Wu
[Paper] | [Slides] | [Project] | [Video] |

Let me walk you through the development of neural code intelligence:
- Follow LMs for code as a thread to trace the field's development
- Explore cross-domain synergies and opportunities
- Present a broad array of promising research avenues

- Preprint. CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning, Zeyi Sun*, Yuhang Cao*, Jianze Liang*, Qiushi Sun*, Ziyu Liu*, Zhixiong Zhang, Yuhang Zang, Xiaoyi Dong, Kai Chen, Dahua Lin, Jiaqi Wang.
- Preprint. OS-MAP: How Far Can Computer Use Agents Go in Breadth and Depth?, Xuetian Chen, Yinghao Chen, Xinfeng Yuan, Lu Chen, Yuekeng Li, Zhoujia Zhang, Yingqian Huang, Leyan Huang, Jiaqing Liang, Tianbao Xie, Zhiyong Wu, Qiushi Sun†, Biqing Qi† and Bowen Zhou.
- DL4C @ NIPS'25. CodeEvo: Interaction-Driven Synthesis of Code-centric Data through Hybrid and Iterative Feedback, Qiushi Sun*, Jingyang Gong*, Lei Li, Qipeng Guo and Fei Yuan.
- ACL 2025. Dynamic and Generalizable Process Reward Modeling, Zhangyue Yin, Qiushi Sun, Zhiyuan Zeng, Qinyuan Cheng, Xipeng Qiu and Xuanjing Huang.
- ICLR 2025 (Spotlight). OS-ATLAS: A Foundation Action Model For Generalist GUI Agents, Zhiyong Wu, Zhenyu Wu, Fangzhi Xu, Yian Wang, Qiushi Sun, Chengyou Jia, Kanzhi Cheng, Zichen Ding, Liheng Chen, Paul Pu Liang and Yu Qiao.
- COLM 2024. Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration, Qiushi Sun, Zhangyue Yin, Xiang Li, Zhiyong Wu, Xipeng Qiu and Lingpeng Kong. [LLMAgents @ ICLR 2024] Slides
- ACL 2024. SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents, Kanzhi Cheng, Qiushi Sun, Yougang Chu, Fangzhi Xu, Yantao Li, Jianbing Zhang and Zhiyong Wu. [LLMAgents @ ICLR 2024]
- COLING 2024. TransCoder: Towards Unified Transferable Code Representation Learning Inspired by Human Skills, Qiushi Sun, Nuo Chen, Jianing Wang, Xiang Li and Ming Gao.
- COLING 2024. Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives, Qiushi Sun, Chengcheng Han, Nuo Chen, Renyu Zhu, Jingyang Gong, Xiang Li and Ming Gao. | 100K RMB Award-winning Solution
- CIKM 2023 (Demo). HugNLP: A Unified and Comprehensive Library for Natural Language Processing, Jianing Wang, Nuo Chen, Qiushi Sun, Wenkang Huang, Chengyu Wang and Ming Gao. | Best Paper Award
- EMNLP 2022. CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure, Nuo Chen*, Qiushi Sun*, Renyu Zhu*, Xiang Li, Xuesong Lu and Ming Gao. Slides | Video

*Denotes equal contribution, † denotes corresponding author. More working drafts / preprints under review will be released later.