Running on CPU Upgrade Featured 2.98k The Smol Training Playbook 📚 2.98k The secrets to building world-class LLMs
Running 3.69k The Ultra-Scale Playbook 🌌 3.69k The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Llama-70B Text Generation • 71B • Updated Feb 24, 2025 • 299k • • 741
Running on CPU Upgrade Featured 1k Model Memory Utility 🚀 1k Calculate VRAM needed for training and inference of HF models