PeppePasti's Collections: Computer Vision
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Paper
• 2409.02095
• Published
• 37
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Paper
• 2409.01704
• Published
• 83
CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation
Paper
• 2409.03643
• Published
• 19
UniDet3D: Multi-dataset Indoor 3D Object Detection
Paper
• 2409.04234
• Published
• 9
Evaluating Multiview Object Consistency in Humans and Image Models
Paper
• 2409.05862
• Published
• 11
LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation
Paper
• 2409.06703
• Published
• 3
Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models
Paper
• 2409.07452
• Published
• 21
Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering
Paper
• 2409.07441
• Published
• 12
InstantDrag: Improving Interactivity in Drag-based Image Editing
Paper
• 2409.08857
• Published
• 34
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
Paper
• 2409.16160
• Published
• 34
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Paper
• 2409.18124
• Published
• 33