ky666 (k) – Likes

liked a Space 4 months ago

The Smol Training Playbook

📚

2.98k

The secrets to building world-class LLMs

liked a model 4 months ago

microsoft/UserLM-8b

Text Generation • Updated Oct 9, 2025 • 805 • 362

liked 2 models 7 months ago

unsloth/Qwen3-32B-unsloth-bnb-4bit

Text Generation • 33B • Updated May 14, 2025 • 12.7k • 15

unsloth/Qwen3-14B-unsloth-bnb-4bit

Text Generation • 15B • Updated May 13, 2025 • 137k • 14

liked a model 8 months ago

unsloth/GLM-Z1-32B-0414

Text Generation • 33B • Updated Jul 3, 2025 • 10 • 1

liked a model 10 months ago

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18, 2025 • 69.9k • 512

liked a model 11 months ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • 685B • Updated Mar 27, 2025 • 236k • • 3.09k

liked a model 12 months ago

Qwen/QwQ-32B

Text Generation • Updated Mar 11, 2025 • 62.7k • • 2.89k

liked a dataset 12 months ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k-SFT

Viewer • Updated Feb 19, 2025 • 110k • 166 • 216

liked 2 models 12 months ago

microsoft/OmniParser-v2.0

Updated Mar 28, 2025 • 673 • 1.31k

unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF

Text Generation • 8B • Updated May 10, 2025 • 26.7k • 293

liked a dataset 12 months ago

Conard/fortune-telling

Viewer • Updated Feb 17, 2025 • 207 • 324 • 168

liked a Space 12 months ago

The Ultra-Scale Playbook

🌌

3.69k

The ultimate guide to training LLM on large GPU Clusters

liked 6 models 12 months ago

liked a dataset about 1 year ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18, 2025 • 450k • 11k • 711

k

AI & ML interests

Organizations

The Smol Training Playbook

microsoft/UserLM-8b

unsloth/Qwen3-32B-unsloth-bnb-4bit

unsloth/Qwen3-14B-unsloth-bnb-4bit

unsloth/GLM-Z1-32B-0414

ByteDance-Seed/UI-TARS-1.5-7B

deepseek-ai/DeepSeek-V3-0324

Qwen/QwQ-32B

Congliu/Chinese-DeepSeek-R1-Distill-data-110k-SFT

microsoft/OmniParser-v2.0

unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF

Conard/fortune-telling

The Ultra-Scale Playbook

Open-Reasoner-Zero/Open-Reasoner-Zero-7B

Open-Reasoner-Zero/Open-Reasoner-Zero-32B

unsloth/DeepSeek-R1-Distill-Qwen-32B-bnb-4bit

unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF

ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4

ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4

open-r1/OpenR1-Math-220k

k

AI & ML interests

Organizations

ky666's activity

The Smol Training Playbook

The Ultra-Scale Playbook