RelayLLM: Efficient Reasoning via Collaborative Decoding Paper • 2601.05167 • Published 2 days ago • 23
Jamba Reasoning 3B Collection AI21's top-performing reasoning model that packs leading scores on intelligence benchmarks and highly-efficient processing into a compact 3B build • 2 items • Updated Oct 8, 2025 • 6
mlx-community/IQuest-Coder-V1-40B-Loop-Instruct-6bit Text Generation • 40B • Updated 3 days ago • 99 • 1
abnormalmapstudio/Qwen3-Next-80B-A3B-Thinking-mxfp4-mlx Text Generation • 80B • Updated Sep 15, 2025 • 135 • 4
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 21 items • Updated about 13 hours ago • 80
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits Paper • 2512.20578 • Published 18 days ago • 68
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 67 items • Updated 10 days ago • 296
view article Article M2.1: Multilingual and Multi-Task Coding with Strong Generalization 5 days ago • 27
MiroThinker-v1.5 Collection MiroMind’s Flagship Search Agent Model • 4 items • Updated 3 days ago • 19