MediaTek-Research/Breeze-ASR-25 Automatic Speech Recognition • 2B • Updated Jul 8, 2025 • 5.16k • 89
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models Paper • 2511.11007 • Published Nov 14, 2025 • 15
Article We're open-sourcing our text-to-image model and the process behind it Nov 12, 2025 • 76
nvidia/diar_streaming_sortformer_4spk-v2 Automatic Speech Recognition • Updated 3 days ago • 8.52k • 87
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs Paper • 2508.16153 • Published Aug 22, 2025 • 160
facebook/dinov3-vit7b16-pretrain-lvd1689m Image Feature Extraction • 7B • Updated Aug 19, 2025 • 12.9k • 200
SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment Paper • 2507.20984 • Published Jul 28, 2025 • 57
Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders Jul 9, 2025 • 748