Darrow O'Lykos's picture

Darrow O'Lykos

darrowoflykos

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 6 hours ago

PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training

liked a model about 10 hours ago

hollowstrawberry/upscalers-backup

liked a model about 10 hours ago

ABDALLALSWAITI/Upscalers

View all activity

Organizations

None yet

upvoted a paper about 6 hours ago

PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training

Paper • 2606.03264 • Published 2 days ago • 9

upvoted a paper 1 day ago

MONET: A Massive, Open, Non-redundant and Enriched Text-to-image dataset

Paper • 2605.21272 • Published 15 days ago • 3

upvoted a paper 2 days ago

Mellum2 Technical Report

Paper • 2605.31268 • Published 6 days ago • 50

upvoted 2 papers 4 days ago

GPIC: A Giant Permissive Image Corpus for Visual Generation

Paper • 2605.30341 • Published 7 days ago • 2

ArtiMuse: Fine-Grained Image Aesthetics Assessment with Joint Scoring and Expert-Level Understanding

Paper • 2507.14533 • Published Jul 19, 2025 • 7

upvoted 10 papers 5 days ago

SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration

Paper • 2501.01320 • Published Jan 2, 2025 • 14

FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution

Paper • 2510.12747 • Published Oct 14, 2025 • 40

Efficient Universal Perception Encoder

Paper • 2603.22387 • Published Mar 23 • 10

Déjà View: Looping Transformers for Multi-View 3D Reconstruction

Paper • 2605.30215 • Published 7 days ago • 5

Qianfan-VL: Domain-Enhanced Universal Vision-Language Models

Paper • 2509.18189 • Published Sep 19, 2025 • 4

TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

Paper • 2512.14698 • Published Dec 16, 2025 • 25

VL-JEPA: Joint Embedding Predictive Architecture for Vision-language

Paper • 2512.10942 • Published Dec 11, 2025 • 60

OmniVTG: A Large-Scale Dataset and Training Paradigm for Open-World Video Temporal Grounding

Paper • 2604.25276 • Published Apr 28 • 1

TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions

Paper • 2602.08711 • Published Feb 9 • 29

RF-DETR: Neural Architecture Search for Real-Time Detection Transformers

Paper • 2511.09554 • Published Nov 12, 2025 • 11

upvoted 5 papers 6 days ago

Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data

Paper • 2107.10833 • Published Jul 22, 2021 • 2

GLiNER Guard: Unified Encoder Family for Production LLM Safety and Privacy

Paper • 2605.05277 • Published 29 days ago • 4

How to Steal Reasoning Without Reasoning Traces

Paper • 2603.07267 • Published Mar 7 • 4

OneFormer: One Transformer to Rule Universal Image Segmentation

Paper • 2211.06220 • Published Nov 10, 2022 • 1

Towards Universal Video MLLMs with Attribute-Structured and Quality-Verified Instructions

Paper • 2602.13013 • Published Feb 13 • 55