Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters Paper • 2602.10604 • Published 4 days ago • 173
Code2World: A GUI World Model via Renderable Code Generation Paper • 2602.09856 • Published 5 days ago • 186
Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning Paper • 2602.07845 • Published 7 days ago • 67
MOVA: Towards Scalable and Synchronized Video-Audio Generation Paper • 2602.08794 • Published 6 days ago • 149
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos Paper • 2602.06949 • Published 8 days ago • 30
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models Paper • 2602.03392 • Published 12 days ago • 52
RISE-Video: Can Video Generators Decode Implicit World Rules? Paper • 2602.05986 • Published 9 days ago • 26
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning Paper • 2602.04634 • Published 11 days ago • 92
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation Paper • 2602.03796 • Published 12 days ago • 56
Green-VLA: Staged Vision-Language-Action Model for Generalist Robots Paper • 2602.00919 • Published 14 days ago • 279
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published Jan 8 • 225
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published 15 days ago • 180
TTCS: Test-Time Curriculum Synthesis for Self-Evolving Paper • 2601.22628 • Published 16 days ago • 34