OASIS: Order-Augmented Strategy for Improved Code Search • Paper • 2503.08161 • Published Mar 11, 2025
HiPO: Hybrid Policy Optimization for Dynamic Reasoning in LLMs • Paper • 2509.23967 • Published Sep 28, 2025
SAND-Math: Using LLMs to Generate Novel, Difficult and Useful Mathematics Questions and Answers • Paper • 2507.20527 • Published Jul 28, 2025