-
-
-
-
-
-
Inference Providers
Active filters: int4
Ex0bit/Kimi-K2.5-PRISM-REAP-530B-A32B
Text Generation
• 91B • Updated
• 586
• 13
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation
• 14B • Updated
• 924k
• 13
0xSero/Kimi-K2.5-PRISM-REAP-72
Text Generation
• 91B • Updated
• 240
• 5
alecccdd/moondream3-preview-4bit
Image-Text-to-Text
• Updated
• 462
• 9
ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v1
Text Generation
• 8B • Updated
• 9
• 6
ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v2
Text Generation
• 8B • Updated
• 956
• 8
ModelCloud/QwQ-32B-gptqmodel-4bit-vortex-v1
Text Generation
• 33B • Updated
• 14
• 12
ModelCloud/Marin-32B-Base-GPTQMODEL-AWQ-W4A16
Text Generation
• 33B • Updated
• 9
• 2
ModelCloud/Granite-4.0-H-1B-GPTQMODEL-W4A16
Text Generation
• 1B • Updated
• 9
• 1
ModelCloud/Granite-4.0-H-350M-GPTQMODEL-W4A16
Text Generation
• 0.3B • Updated
• 24
• 1
ModelCloud/Brumby-14B-Base-GPTQMODEL-W4A16
Text Generation
• 15B • Updated
• 3
• 1
ModelCloud/Brumby-14B-Base-GPTQMODEL-W4A16-v2
Text Generation
• 15B • Updated
• 4
• 1
tonera/oneObsession_v16Noobai
Text-to-Image
• Updated
• 12
• 1
Teaspoon-AI/Voxtral-Mini-4B-INT4-Jetson
Updated
• 115
• 3
Advantech-EIOT/intel_llama-2-chat-7b
Text Generation
• Updated
• 4
RedHatAI/zephyr-7b-beta-marlin
Text Generation
• 1B • Updated
• 23
RedHatAI/TinyLlama-1.1B-Chat-v1.0-marlin
Text Generation
• 0.3B • Updated
• 242
• 2
RedHatAI/OpenHermes-2.5-Mistral-7B-marlin
Text Generation
• 1B • Updated
• 90
• 2
RedHatAI/Nous-Hermes-2-Yi-34B-marlin
Text Generation
• 5B • Updated
• 4
• 5
ecastera/ecastera-eva-westlake-7b-spanish-int4-gguf
7B • Updated
• 7
• 2
softmax/Llama-2-70b-chat-hf-marlin
Text Generation
• 10B • Updated
• 1
softmax/falcon-180B-chat-marlin
Text Generation
• 26B • Updated
• 3
study-hjt/Meta-Llama-3-8B-Instruct-GPTQ-Int4
Text Generation
• 8B • Updated
• 5
study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int4
Text Generation
• 71B • Updated
• 75
• 6
study-hjt/Meta-Llama-3-70B-Instruct-AWQ
Text Generation
• 71B • Updated
• 53
study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4
Text Generation
• 111B • Updated
• 56
• 2
study-hjt/CodeQwen1.5-7B-Chat-GPTQ-Int4
Text Generation
• 7B • Updated
• 51
study-hjt/Qwen1.5-110B-Chat-AWQ
Text Generation
• 111B • Updated
• 49
modelscope/Yi-1.5-34B-Chat-AWQ
Text Generation
• 34B • Updated
• 246
• 2
modelscope/Yi-1.5-6B-Chat-GPTQ
Text Generation
• 6B • Updated
• 50