09.06.2026

This project is on hold now. For most stable experience currently, please use https://huggingface.co/spiritbuun/buun-Qwen3.6-chat_template

Preserve Thinking:

To enable preserve-thinking (Agent can remember what he thought about in older turns) pass the following parameter to llama.cpp startup: On Linux&MacOS (Windows need slightly different which cant be correctly displayed here...): --chat-template-kwargs '{"preserve_thinking": true}'

OpenCode:

If using OpenCode do notice they have an open bug for showing thinking content as plain text. Open Bug since 4 months(https://github.com/anomalyco/opencode/issues/11439). Meanwhile you can use the llama.cpp Server switch to hide thinking completely(but still let the model think) using the additonal llama.cpp parameter: --reasoning-format deepseek

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for StableQuant/Qwen-Templates-Rebuild-Project

Base model

Qwen/Qwen3.5-27B
Finetuned
(298)
this model