PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training Paper • 2606.03264 • Published 2 days ago • 9
MONET: A Massive, Open, Non-redundant and Enriched Text-to-image dataset Paper • 2605.21272 • Published 15 days ago • 3
GPIC: A Giant Permissive Image Corpus for Visual Generation Paper • 2605.30341 • Published 7 days ago • 2
ArtiMuse: Fine-Grained Image Aesthetics Assessment with Joint Scoring and Expert-Level Understanding Paper • 2507.14533 • Published Jul 19, 2025 • 7
SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration Paper • 2501.01320 • Published Jan 2, 2025 • 14
FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution Paper • 2510.12747 • Published Oct 14, 2025 • 40
Déjà View: Looping Transformers for Multi-View 3D Reconstruction Paper • 2605.30215 • Published 7 days ago • 5
Qianfan-VL: Domain-Enhanced Universal Vision-Language Models Paper • 2509.18189 • Published Sep 19, 2025 • 4
TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs Paper • 2512.14698 • Published Dec 16, 2025 • 25
VL-JEPA: Joint Embedding Predictive Architecture for Vision-language Paper • 2512.10942 • Published Dec 11, 2025 • 60
OmniVTG: A Large-Scale Dataset and Training Paradigm for Open-World Video Temporal Grounding Paper • 2604.25276 • Published Apr 28 • 1
TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions Paper • 2602.08711 • Published Feb 9 • 29
RF-DETR: Neural Architecture Search for Real-Time Detection Transformers Paper • 2511.09554 • Published Nov 12, 2025 • 11
Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data Paper • 2107.10833 • Published Jul 22, 2021 • 2
GLiNER Guard: Unified Encoder Family for Production LLM Safety and Privacy Paper • 2605.05277 • Published 29 days ago • 4
OneFormer: One Transformer to Rule Universal Image Segmentation Paper • 2211.06220 • Published Nov 10, 2022 • 1
Towards Universal Video MLLMs with Attribute-Structured and Quality-Verified Instructions Paper • 2602.13013 • Published Feb 13 • 55