nvidia/diar_streaming_sortformer_4spk-v2 Automatic Speech Recognition • Updated 9 days ago • 8.5k • 92
nvidia/nemotron-speech-streaming-en-0.6b Automatic Speech Recognition • Updated 4 days ago • 1.07k • 255
Nemotron Speech Collection Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 13 items • Updated about 7 hours ago • 14
Logics-STEM: Empowering LLM Reasoning via Failure-Driven Post-Training and Document Knowledge Enhancement Paper • 2601.01562 • Published 5 days ago • 24
Z Image Turbo 🖼 Generate stunning images from text descriptions in seconds • Running on Zero • 1.21k
SAM Audio Collection The SAM Audio model licenses allow for redistribution so long as the original license files are included • 9 items • Updated 15 days ago • 4