CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation Paper • 2604.19636 • Published 10 days ago • 86
AgentSPEX: An Agent SPecification and EXecution Language Paper • 2604.13346 • Published 17 days ago • 161
Article Ecom-RLVE: Adaptive Verifiable Environments for E-Commerce Conversational Agents +2 15 days ago • 16
Seedance 2.0: Advancing Video Generation for World Complexity Paper • 2604.14148 • Published 16 days ago • 153
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents Paper • 2604.11784 • Published 18 days ago • 141
Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 15 days ago • 66
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published Mar 27 • 363
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 25 days ago • 111
Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision Paper • 2604.04934 • Published 25 days ago • 46
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 23 days ago • 323
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published 28 days ago • 375
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published 23 days ago • 187
MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale Paper • 2604.04771 • Published 25 days ago • 122
InCoder-32B-Thinking: Industrial Code World Model for Thinking Paper • 2604.03144 • Published 28 days ago • 233
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published Mar 30 • 340
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 349
ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers Paper • 2603.24414 • Published Mar 25 • 183
Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 29 days ago • 882