arxiv:2507.07984
Chenming Zhu
ChaimZhu
AI & ML interests
Multimodal Large Language Models, 3D Perception and Understanding, Embodied AI
Recent Activity
upvoted a paper about 5 hours ago
Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens
upvoted a paper 3 months ago
MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence