view article Article Continuous batching from first principles +1 ror, ArthurZ, mcpotato • Nov 25, 2025 • 411
DFlash Collection Block Diffusion for Flash Speculative Decoding • 23 items • Updated about 9 hours ago • 139
Qwen-3.5-unsloth-mlx Collection AWQ-style pre-scaling using Unsloth's imatrix calibration data, then 3-6-bit affine quantization with the Unsloth mixed-precision recipe via MLX • 20 items • Updated Mar 29 • 20