Running • 3.62k • The Ultra-Scale Playbook • The ultimate guide to training LLM on large GPU Clusters
shenzhi-wang/Gemma-2-9B-Chinese-Chat • Text Generation • 9B • Updated Jul 4, 2024 • 688 • 78