Kunlun Zhu, Hongyi Du, Zhaochen Hong, Xiaocheng Yang, Shuyi Guo, Zhe Wang, Zhenhailong Wang, Cheng Qian, Xiangru Tang, Heng Ji, Jiaxuan You
(2025).
MultiAgentBench: Evaluating the Collaboration and Competition of LLM Agents.
In
ACL ’25.
Cheng Qian, Peixuan Han, Qinyu Luo, Bingxiang He, Xiusi Chen, Yuji Zhang, Hongyi Du, Jiarui Yao, Xiaocheng Yang, Denghui Zhang, Yunzhu Li, Heng Ji
(2024).
EscapeBench: Towards Advancing Creative Intelligence of Language Model Agents.
In
ACL ’25.