Publications

(2025). ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges. arXiv preprint.
(2025). MultiAgentBench: Evaluating the Collaboration and Competition of LLM Agents. In ACL ’25.
(2024). EscapeBench: Towards Advancing Creative Intelligence of Language Model Agents. In ACL ’25.