LatentLens: Revealing Highly Interpretable Visual Tokens in LLMs (Paper 2602.00462, published 13 days ago)
Learning Action and Reasoning-Centric Image Editing from Videos and Simulations (Paper 2407.03471, published Jul 3, 2024)