WorldKV: Efficient World Memory with World Retrieval and Compression Paper • 2605.22718 • Published 2 days ago • 30
FlowLong: Inference-time Long Video Generation via Manifold-constrained Tweedie Matching Paper • 2605.20910 • Published 3 days ago • 24
RoboAlign: Learning Test-Time Reasoning for Language-Action Alignment in Vision-Language-Action Models Paper • 2603.21341 • Published Mar 22 • 24
SCALE: Self-uncertainty Conditioned Adaptive Looking and Execution for Vision-Language-Action Models Paper • 2602.04208 • Published Feb 4 • 20
DexJoCo: A Benchmark and Toolkit for Task-Oriented Dexterous Manipulation on MuJoCo Paper • 2605.16257 • Published 8 days ago • 50
PhysHanDI: Physics-Based Reconstruction of Hand-Deformable Object Interactions Paper • 2605.09538 • Published 13 days ago
FourierHandFlow: Neural 4D Hand Representation Using Fourier Query Flow Paper • 2307.08100 • Published Jul 16, 2023
Dense Hand-Object(HO) GraspNet with Full Grasping Taxonomy and Dynamics Paper • 2409.04033 • Published Sep 6, 2024
Multi-hypotheses Conditioned Point Cloud Diffusion for 3D Human Reconstruction from Occluded Images Paper • 2409.18364 • Published Oct 29, 2024
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 19 days ago • 334
Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision Paper • 2604.04934 • Published Apr 6 • 46