yangjinluan's Collections

Agentic RL

updated about 23 hours ago

  • LongCat-Flash-Thinking-2601 Technical Report

    Paper • 2601.16725 • Published Jan 23 • 177

  • TopoCurate: Modeling Interaction Topology for Tool-Use Agent Training

    Paper • 2603.01714 • Published 2 days ago

  • SIGHT: Reinforcement Learning with Self-Evidence and Information-Gain Diverse Branching for Search Agent

    Paper • 2602.11551 • Published 20 days ago

  • Can Tool-Integrated Reinforcement Learning Generalize Across Diverse Domains?

    Paper • 2510.11184 • Published Oct 13, 2025 • 1