You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories Paper • 2605.21468 • Published 4 days ago • 44
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning Paper • 2603.00889 • Published Mar 1 • 55
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning Paper • 2603.00889 • Published Mar 1 • 55
RAST: Reasoning Activation in LLMs via Small-model Transfer Paper • 2506.15710 • Published May 30, 2025
The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning Paper • 2506.01347 • Published Jun 2, 2025 • 3
The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning Paper • 2506.01347 • Published Jun 2, 2025 • 3
AdaDecode Collection [ICML 2025] AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism. • 18 items • Updated Jun 4, 2025 • 3