Active filters: amd
djdeniro/Qwen3.5-397B-A17B-MXFP4
Image-Text-to-Text
• 215B • Updated • 41
• 2
schuttdev/hipfire-qwen3.5-2b
schuttdev/hipfire-carnice-9b
Text Generation
• Updated • 3
djdeniro/Qwen3.5-35B-A3B-MXFP4
Image-Text-to-Text
• 20B • Updated • 49
• 1
amd/Llama-3.2-1B-Instruct-onnx-ryzenai-npu
Text Generation
• Updated • 102
• 2
dahara1/llama3-8b-amd-npu
Tech-Meld/gpus-everywhere
Text-to-Image
• Updated • 8
• 1
dahara1/llama3.1-8b-Instruct-amd-npu
dahara1/ALMA-Ja-V3-amd-npu
dahara1/llama-translate-amd-npu
Translation
• Updated • 5
dahara1/llama-translate-gguf
8B • Updated • 177
• 16
amd/Llama-2-7b-hf-awq-g128-int4-asym-bf16-onnx-ryzen-strix
Text Generation
• Updated • 65
amd/Llama2-7b-chat-awq-g128-int4-asym-bf16-onnx-ryzen-strix
Text Generation
• Updated • 10
amd/Llama-3-8B-awq-g128-int4-asym-bf16-onnx-ryzen-strix
Text Generation
• Updated • 6
• 2
amd/Llama-3.1-8B-awq-g128-int4-asym-bf16-onnx-ryzen-strix
Text Generation
• Updated • 8
• 2
amd/Phi-3.5-mini-instruct-awq-g128-int4-asym-bf16-onnx-ryzen-strix
Text Generation
• Updated • 13
• 3
amd/Phi-3-mini-4k-instruct-awq-g128-int4-asym-bf16-onnx-ryzen-strix
Text Generation
• Updated • 16
• 2
uday610/Llama2-7b-chat-awq-g128-int4-asym-fp32-onnx-ryzen-strix-hybrid
Text Generation
• Updated
amd/Phi-3.5-mini-instruct-awq-g128-int4-asym-fp16-onnx-dml
Text Generation
• Updated
amd/Phi-3-mini-4k-instruct-awq-g128-int4-asym-fp16-onnx-hybrid
Text Generation
• Updated • 10
amd/Phi-3.5-mini-instruct-awq-g128-int4-asym-fp16-onnx-hybrid
Text Generation
• Updated • 10
amd/Llama-2-7b-chat-hf-awq-g128-int4-asym-fp16-onnx-dml
Text Generation
• Updated
amd/Llama-2-7b-hf-awq-g128-int4-asym-fp16-onnx-hybrid
Text Generation
• Updated • 5
amd/Llama-2-7b-chat-hf-awq-g128-int4-asym-fp16-onnx-hybrid
Text Generation
• Updated • 8
amd/Llama-3-8B-awq-g128-int4-asym-fp16-onnx-hybrid
Text Generation
• Updated • 6
sbeierle/fame-pytorch-kit
Updated
skshreyas714/AAPL_Team_ACB
Text Generation
• 4B • Updated • 4
magicunicorn/whisper-large-v3-amd-npu-int8
Updated • 2
• 3
magicunicorn/whisper-large-v2-amd-npu-int8
Updated • 6
• 5
magicunicorn/whisper-medium-amd-npu-int8