IBM-Grok4-UltraFast-Coder-1B

This model is a fully fine-tuned derivative of ibm-granite/granite-4.0-1b.

Training notes:

- Full model fine-tuning
- No adapters
- No LoRA
- No QLoRA
- Dual-GPU DDP training on Kaggle
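Because the weights are a full fine-tune with no adapters, they should load like any standalone causal LM, with no PEFT step. The following is a minimal sketch, not an official usage recipe: it assumes the repository `gss1147/IBM-Grok4-UltraFast-Coder-1B` ships a tokenizer and loads through the standard `transformers` auto classes, and that the installed `transformers` version supports the Granite 4.0 architecture. The helper names `load_model` and `generate` are hypothetical.

```python
MODEL_ID = "gss1147/IBM-Grok4-UltraFast-Coder-1B"  # repo id taken from this model card


def load_model(model_id: str = MODEL_ID):
    """Load the tokenizer and BF16 weights (downloads from the Hub on first call)."""
    # Imported lazily so that merely defining the helpers does not require torch.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # Assumption: loading in bfloat16 matches the published tensor type.
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
    return tokenizer, model


def generate(prompt: str, max_new_tokens: int = 128) -> str:
    """Run greedy generation on CPU for a single prompt (hypothetical helper)."""
    tokenizer, model = load_model()
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Write a Python function that reverses a string.")` would return the prompt followed by the model's completion. The model reloads on every call here for simplicity; a real script would load it once and reuse it.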

Model size: 2B params (Safetensors). Tensor type: BF16.
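As a rough guide to what the listed size and tensor type imply for memory, the weights alone take about two bytes per parameter in BF16. The sketch below works through that arithmetic. It assumes the rounded figure of 2B parameters shown on this page, and the helper name `weight_bytes` is hypothetical. Activations, KV cache, and framework overhead are not counted.

```python
BYTES_PER_BF16 = 2  # bfloat16 stores each value in 16 bits


def weight_bytes(num_params: int, bytes_per_param: int = BYTES_PER_BF16) -> int:
    """Return the storage needed for the raw weights only."""
    return num_params * bytes_per_param


approx_params = 2_000_000_000  # assumption: the rounded "2B params" figure
total = weight_bytes(approx_params)
print(f"{total / 1e9:.1f} GB ({total / 2**30:.2f} GiB) for weights alone")
```

Under these assumptions the weights come to roughly 4 GB, so the model should fit on a single consumer GPU for inference, memory for activations permitting.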


Datasets used to train gss1147/IBM-Grok4-UltraFast-Coder-1B