Wei Cheng's picture

Wei Cheng

wchengad

·

https://wchengad.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 8 hours ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

upvoted a paper 1 day ago

VINO: A Unified Visual Generator with Interleaved OmniModal Context

liked a Space 27 days ago

ziheng1234/ImageCritic

View all activity

Organizations

None yet

upvoted a paper about 8 hours ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published about 21 hours ago • 27

upvoted a paper 1 day ago

VINO: A Unified Visual Generator with Interleaved OmniModal Context

Paper • 2601.02358 • Published 2 days ago • 23

upvoted a paper 29 days ago

Relational Visual Similarity

Paper • 2512.07833 • Published 30 days ago • 24

upvoted 5 papers about 1 month ago

EditThinker: Unlocking Iterative Reasoning for Any Image Editor

Paper • 2512.05965 • Published Dec 5, 2025 • 38

Captain Safari: A World Engine

Paper • 2511.22815 • Published Nov 28, 2025 • 9

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 224

REASONEDIT: Towards Reasoning-Enhanced Image Editing Models

Paper • 2511.22625 • Published Nov 27, 2025 • 46

iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation

Paper • 2511.20635 • Published Nov 25, 2025 • 32

upvoted a paper about 2 months ago

Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19, 2025 • 53

upvoted 3 papers 2 months ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published Nov 4, 2025 • 101

RegionE: Adaptive Region-Aware Generation for Efficient Image Editing

Paper • 2510.25590 • Published Oct 29, 2025 • 27

IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction

Paper • 2510.22706 • Published Oct 26, 2025 • 40

upvoted 3 papers 3 months ago

WithAnyone: Towards Controllable and ID Consistent Image Generation

Paper • 2510.14975 • Published Oct 16, 2025 • 84

DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training

Paper • 2510.11712 • Published Oct 13, 2025 • 30

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Paper • 2510.08673 • Published Oct 9, 2025 • 125

upvoted 5 papers 4 months ago

Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation

Paper • 2509.12815 • Published Sep 16, 2025 • 40

LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence

Paper • 2509.12203 • Published Sep 15, 2025 • 19

OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling

Paper • 2509.12201 • Published Sep 15, 2025 • 106

USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning

Paper • 2508.18966 • Published Aug 26, 2025 • 56

Mixture of Contexts for Long Video Generation

Paper • 2508.21058 • Published Aug 28, 2025 • 35