Agentic RL - a yangjinluan Collection

yangjinluan 's Collections

General Reasoning & Formal Reasoning

Agentic RL

updated about 23 hours ago

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published Jan 23 • 177
TopoCurate:Modeling Interaction Topology for Tool-Use Agent Training

Paper • 2603.01714 • Published 2 days ago
SIGHT: Reinforcement Learning with Self-Evidence and Information-Gain Diverse Branching for Search Agent

Paper • 2602.11551 • Published 20 days ago
Can Tool-Integrated Reinforcement Learning Generalize Across Diverse Domains?

Paper • 2510.11184 • Published Oct 13, 2025 • 1