When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models Paper • 2604.08546 • Published 3 days ago • 107
KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation Paper • 2604.08455 • Published 3 days ago • 35
view article Article Introducing Waypoint-1: Real-time interactive video diffusion from Overworld +3 Jan 20 • 43
view article Article Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs +3 3 days ago • 18
INSPATIO-WORLD: A Real-Time 4D World Simulator via Spatiotemporal Autoregressive Modeling Paper • 2604.07209 • Published 4 days ago • 26
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding Paper • 2604.05015 • Published 6 days ago • 224
EXAONE 4.5 Collection LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 3 items • Updated 3 days ago • 28
MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU Paper • 2604.05091 • Published 6 days ago • 39