Inference Providers
Active filters: cuda
Text Generation
• 8B • Updated • 111k
• 678
prism-ml/Bonsai-1.7B-gguf
Text Generation
• 2B • Updated • 24.3k
• 69
atomicmilkshake/llama-cpp-turboquant-binaries
Multilingual-Multimodal-NLP/IndustrialCoder
Text Generation
• 32B • Updated • 439
• 60
ValiantLabs/gpt-oss-20b-ShiningValiant3
Text Generation
• 21B • Updated • 17
• 19
Jarrodbarnes/KernelBench-RLVR-120b
Text Generation
• 117B • Updated • 31
• 3
Text-to-Speech
• Updated • 386
• 3
Multilingual-Multimodal-NLP/IndustrialCoder-Thinking-32B-FP8
Text Generation
• 32B • Updated • 41
• 1
ValiantLabs/gemma-4-E2B-it-ShiningValiant3
Image-Text-to-Text
• 5B • Updated • 38
• 3
ValiantLabs/gemma-4-E4B-it-ShiningValiant3
Image-Text-to-Text
• 8B • Updated • 33
• 4
8B • Updated • 29.1k
• 104
Text Generation
• Updated • 10
• 23
CalderaAI/13B-Ouroboros-GPTQ4bit-128g-CUDA
Text Generation
• Updated • 10
marcorez8/llama-cpp-python-windows-blackwell-cuda
ValiantLabs/Qwen3-8B-ShiningValiant3
Text Generation
• 8B • Updated • 13
• 3
mradermacher/Qwen3-8B-ShiningValiant3-GGUF
8B • Updated • 1.62k
• 2
mradermacher/Qwen3-8B-ShiningValiant3-i1-GGUF
8B • Updated • 2.44k
• 2
ValiantLabs/Qwen3-1.7B-ShiningValiant3
Text Generation
• 2B • Updated • 26
• • 5
mradermacher/Qwen3-1.7B-ShiningValiant3-GGUF
2B • Updated • 169
mradermacher/Qwen3-1.7B-ShiningValiant3-i1-GGUF
2B • Updated • 416
ValiantLabs/Qwen3-4B-ShiningValiant3
Text Generation
• 4B • Updated • 33
• • 7
sequelbox/Qwen3-8B-PlumEsper
Text Generation
• 8B • Updated • 4
sequelbox/Qwen3-4B-PlumEsper
Text Generation
• 4B • Updated • 2
mradermacher/Qwen3-Shining-Lucy-CODER-3.5B-Brainstorm20x-e32-GGUF
3B • Updated • 190
• 2
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-mix2-GGUF
2B • Updated • 142
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-GGUF
2B • Updated • 127
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-mix2-i1-GGUF
2B • Updated • 384
• 1
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-i1-GGUF
2B • Updated • 570
mradermacher/Qwen3-Shining-Lucy-CODER-3.5B-Brainstorm20x-e32-i1-GGUF
3B • Updated • 426
• 1