Nano-Raccoon-Preview-1104
Prototyping checkpoint for NeAR-specialized SLM. Deployment friendly to single consumer GPU.
This model is a light SFT version from Qwen/Qwen3-14B, aimed at stable generative behavior on NeAR agent scaffold.
Serve with vllm
Single GPU
vllm serve billxbf/Nano-Raccoon-Preview-1104 \
--trust-remote-code \
--host 0.0.0.0 \
--port 8000
Use Tensor Parallel on 8xGPU
vllm serve billxbf/Nano-Raccoon-Preview-1104 \
--tensor-parallel-size 8 \
--trust-remote-code \
--host 0.0.0.0 \
--port 8000
- Downloads last month
- 3
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for billxbf/Nano-Raccoon-Preview-1104
Base model
Qwen/Qwen3-14B-Base
Finetuned
Qwen/Qwen3-14B