Chuanming Liu's picture

In a Training Loop 🔄

Chuanming Liu

Chuanming

·

Chuanming

AI & ML interests

Artificial Intelligence, AGI, NLP, LLMs, Multimodality, MLSys. Python/Golang/C/C++/Shell/awk&sed

Recent Activity

liked a model about 1 hour ago

mlx-community/gemma-3-27b-it-qat-bf16

liked a model about 1 hour ago

google/gemma-3-27b-it-qat-q4_0-unquantized

liked a model 14 days ago

zai-org/GLM-4.7

View all activity

Organizations

upvoted an article 14 days ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

+4

21 days ago

•

104

upvoted a paper 19 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 96

upvoted 2 articles 2 months ago

Article

Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers

Nov 3, 2022

•

339

Article

Supercharge your OCR Pipelines with Open Models

+5

Oct 21, 2025

•

295

upvoted 2 papers 3 months ago

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Paper • 2505.07916 • Published May 12, 2025 • 134

Finite Scalar Quantization Enables Redundant and Transmission-Robust Neural Audio Compression at Low Bit-rates

Paper • 2509.09550 • Published Sep 11, 2025 • 2

upvoted 2 collections 4 months ago

Qwen3Guard

7 items • Updated 7 days ago • 60

Qwen3-Omni

6 items • Updated 7 days ago • 178

upvoted an article 4 months ago

Article

Understanding Vector Quantization in VQ-VAE

Aug 28, 2024

•

52

upvoted a paper 4 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 316

upvoted an article 4 months ago

Article

From GRPO to DAPO and GSPO: What, Why, and How

Aug 9, 2025

•

72

upvoted 2 collections 4 months ago

PP-StructureV3

PP-StructureV3 is a SOTA document parsing solution on OmniDocBench, supporting the conversion of PDFs and do cument images to Markdown and JSON. • 17 items • Updated Sep 15, 2025 • 12

PP-OCRv5

PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated Sep 15, 2025 • 50

upvoted a paper 4 months ago

Step-Audio 2 Technical Report

Paper • 2507.16632 • Published Jul 22, 2025 • 73

upvoted a collection 4 months ago

Marvis-TTS-250m-v0.1

5 items • Updated Aug 26, 2025 • 26

upvoted 2 collections 5 months ago

AFM-Datasets

Training datasets of the paper: Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL • 6 items • Updated Aug 6, 2025 • 5

AFM-Models

The models and training dataset of the paper: Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL • 12 items • Updated Aug 6, 2025 • 16

upvoted a paper 5 months ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6, 2025 • 129

upvoted a collection 5 months ago

Seed-OSS

Seed-OSS Open-Source Models • 3 items • Updated Aug 20, 2025 • 59

upvoted an article 5 months ago

Article

Introducing ColQwen-Omni: Retrieve in every modality

Jul 17, 2025

•

75