📝 Selected Publications

Preprint

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows 🤖🔬

Qiushi Sun, Zhoumianze Liu, Chang Ma, Zichen Ding, Fangzhi Xu, Zhangyue Yin, Haiteng Zhao, Zhenyu Wu, Kanzhi Cheng, Zhaoyang Liu, Jianing Wang, Qintong Li, Xiangru Tang, Tianbao Xie, Xiachong Feng, Xiang Li, Ben Kao, Wenhai Wang, Biqing Qi, Lingpeng Kong, Zhiyong Wu

[Paper] | [Slides] | [Project] | [Env] | [Code]

  • First to apply computer-using agents to assist scientific exploration 🌌
  • Dynamic environment & benchmark for realistic scientific workflows 🌍
  • Comprehensive evaluation of SOTA LLM/VLM agents 🧭
ACL 2025

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis 🔥🔥

Qiushi Sun*, Kanzhi Cheng*, Zichen Ding*, Chuanyang Jin*, Yian Wang, Fangzhi Xu, Zhenyu Wu, Chengyou Jia, Liheng Chen, Zhoumianze Liu, Ben Kao, Guohao Li, Junxian He, Yu Qiao, Zhiyong Wu

[Paper] | [Slides] | [Project] | [Models & Data]

  • Shift from task-driven to interaction-driven GUI data synthesis 🤖
  • A manual-free pipeline for constructing diverse GUI agent trajectories 🧬
  • Great performance on online mobile/web benchmarks 🌟
Survey

A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond 🔥🔥

Qiushi Sun, Zhirui Chen, Fangzhi Xu, Chang Ma, Kanzhi Cheng, Zhangyue Yin, Jianing Wang, Chengcheng Han, Renyu Zhu, Shuai Yuan, Pengcheng Yin, Qipeng Guo, Xipeng Qiu, Xiaoli Li, Fei Yuan, Lingpeng Kong, Xiang Li, Zhiyong Wu

[Paper] | [Slides] | [Project] | [Video]

Let me walk you through the development of neural code intelligence:

  • Follow LMs for code as a thread to trace the field's development 🚀
  • Explore cross-domain synergies and opportunities 🌱
  • Present a broad array of promising research avenues 💡

* denotes equal contribution; ✉ denotes corresponding author. More working drafts and preprints under review will be released later ⌛️