AI
Starstrek
Stars321123
AI & ML interests
AI
Recent Activity
upvoted a paper about 2 hours ago
Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL
upvoted a paper about 2 hours ago
HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness
upvoted a paper about 15 hours ago
OceanPile: A Large-Scale Multimodal Ocean Corpus for Foundation Models