Haocheng Xi's picture

Haocheng Xi

xihc-ucb

·

xijiu9

AI & ML interests

Efficient ML

Recent Activity

upvoted a paper about 5 hours ago

Flash-KMeans: Fast and Memory-Efficient Exact K-Means

upvoted a paper about 8 hours ago

SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing

upvoted a paper about 8 hours ago

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

View all activity

Organizations

upvoted a paper about 5 hours ago

Flash-KMeans: Fast and Memory-Efficient Exact K-Means

Paper • 2603.09229 • Published 1 day ago • 20

upvoted 2 papers about 8 hours ago

SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing

Paper • 2603.08982 • Published 1 day ago • 14

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

Paper • 2603.03269 • Published 8 days ago • 43

upvoted 2 papers 5 days ago

V_1: Unifying Generation and Self-Verification for Parallel Reasoners

Paper • 2603.04304 • Published 7 days ago • 14

Helios: Real Real-Time Long Video Generation Model

Paper • 2603.04379 • Published 7 days ago • 155

upvoted 3 papers about 1 month ago

Residual Context Diffusion Language Models

Paper • 2601.22954 • Published Jan 30 • 35

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

Paper • 2602.02958 • Published Feb 3 • 34

Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow

Paper • 2601.14243 • Published Jan 20 • 23

upvoted a paper 3 months ago

Arbitrage: Efficient Reasoning via Advantage-Aware Speculation

Paper • 2512.05033 • Published Dec 4, 2025 • 17

upvoted 2 papers 4 months ago

Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

Paper • 2406.10774 • Published Jun 16, 2024 • 4

Video-As-Prompt: Unified Semantic Control for Video Generation

Paper • 2510.20888 • Published Oct 23, 2025 • 50

upvoted 7 papers 5 months ago

AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders

Paper • 2510.19779 • Published Oct 22, 2025 • 61

Attention Sinks in Diffusion Language Models

Paper • 2510.15731 • Published Oct 17, 2025 • 49

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 181

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2, 2025 • 96

DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder

Paper • 2509.25182 • Published Sep 29, 2025 • 39

SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer

Paper • 2509.24695 • Published Sep 29, 2025 • 46

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention

Paper • 2509.24006 • Published Sep 28, 2025 • 118

upvoted a collection 5 months ago

Jet-Nemotron

2 items • Updated Sep 28, 2025 • 16

upvoted a paper 7 months ago

XQuant: Breaking the Memory Wall for LLM Inference with KV Cache Rematerialization

Paper • 2508.10395 • Published Aug 14, 2025 • 42