Accelerating RL Post-Training Rollouts via System-Integrated Speculative Decoding Paper • 2604.26779 • Published 2 days ago • 3
Unified 4D World Action Modeling from Video Priors with Asynchronous Denoising Paper • 2604.26694 • Published 2 days ago • 4
ClawGym: A Scalable Framework for Building Effective Claw Agents Paper • 2604.26904 • Published 2 days ago • 37
IAM: Identity-Aware Human Motion and Shape Joint Generation Paper • 2604.25164 • Published 3 days ago • 1
Toward Scalable Terminal Task Synthesis via Skill Graphs Paper • 2604.25727 • Published 3 days ago • 7
DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios Paper • 2604.25914 • Published 3 days ago • 37
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company Paper • 2604.22446 • Published 7 days ago • 112
SketchVLM: Vision language models can annotate images to explain thoughts and guide users Paper • 2604.22875 • Published 8 days ago • 29
Zero-to-CAD: Agentic Synthesis of Interpretable CAD Programs at Million-Scale Without Real Data Paper • 2604.24479 • Published 4 days ago • 5
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation Paper • 2604.24763 • Published 4 days ago • 62
Stochastic KV Routing: Enabling Adaptive Depth-Wise Cache Sharing Paper • 2604.22782 • Published 28 days ago • 4
ProEval: Proactive Failure Discovery and Efficient Performance Estimation for Generative AI Evaluation Paper • 2604.23099 • Published 6 days ago • 2