arxiv:2512.22322
Shaofei Cai
phython96
AI & ML interests
Embodied Decision Making, Computer Vision, Game AI, LLM Agents
Recent Activity
authored a paper about 6 hours ago
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
authored a paper about 6 hours ago
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models
authored a paper about 6 hours ago
SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents