MIT/ast-finetuned-audioset-10-10-0.4593
Audio Classification • 86.6M • Updated • 301k • 355
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
Generate music from lyrics and genre tags
Generate high-quality speech from text with optional voice cloning
Generate audio from text, video, or audio prompts
Generate expressive speech from text with voice and emotion control
Music Generation Foundation Model v1.5
Answer questions about uploaded audio or YouTube videos