Original model: https://huggingface.co/5CD-AI/Vintern-1B-v3_5

How to use this

Install llama.cpp

Then:

llama-server -hf ngxson/Vintern-1B-v3_5-GGUF --chat-template vicuna
Downloads last month
478,632
GGUF
Model size
0.6B params
Architecture
qwen2
Hardware compatibility
Log In to view the estimation

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ngxson/Vintern-1B-v3_5-GGUF

Quantized
(3)
this model