Enabling Versatile Controls for Video Diffusion Models Paper • 2503.16983 • Published Mar 21, 2025 • 15
PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks Paper • 2503.04065 • Published Mar 6, 2025
MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering Paper • 2601.22859 • Published 22 days ago • 17
MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering Paper • 2601.22859 • Published 22 days ago • 17
ChessArena: A Chess Testbed for Evaluating Strategic Reasoning Capabilities of Large Language Models Paper • 2509.24239 • Published Sep 29, 2025 • 5