arxiv:2601.10201
Jiarui Yao
FlippyDora
AI & ML interests
None yet
Recent Activity
upvoted a paper about 13 hours ago
Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
upvoted a paper 9 days ago
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
upvoted a paper 9 days ago
Rethinking the Divergence Regularization in LLM RL