Kaiyan Zhang's picture

Kaiyan Zhang

iseesaw

·

https://iseesaw.github.io/

AI & ML interests

Large Reasoning Models, Reinforcement Learning, Agent

Recent Activity

authored a paper about 20 hours ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

upvoted a paper 3 days ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

upvoted a paper 2 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

View all activity

Organizations

Papers 28

arxiv:2605.18643

arxiv:2509.15207

arxiv:2509.09674

arxiv:2509.08827

models 0

None public yet

datasets 0

None public yet