audio models - a bann Collection

bann 's Collections

uncensored chat

professional 3d

Drawing Process

Video generation

video tratament

object detction

sound generation

music_generation

face manipulation

Image Captioning

Documents retriever

anime tratament

mask segmentation

Video Description

chat_with_upload

math_physics_etc

audio models

updated Apr 24

MIT/ast-finetuned-audioset-10-10-0.4593

Audio Classification • 86.6M • Updated Sep 6, 2023 • 301k • 355
Running on Zero

Agents

315

Llasa 3b Tts

🔥

315

Zero Shot voice cloning with llasa 3b (Unofficial Demo)
Paused

Agents

Featured

202

YuE

👩

202

Generate music from lyrics and genre tags
Running on Zero

Agents

Featured

413

Zonos

🌍

413

Generate high-quality speech from text with optional voice cloning
Running on Zero

Agents

Featured

171

AudioX

👀

171

Generate audio from text, video, or audio prompts
Running on Zero

Agents

804

IndexTTS 2 Demo

🏢

804

Generate expressive speech from text with voice and emotion control
Running on Zero

Agents

Featured

546

ACE-Step v1.5

🎵

546

Music Generation Foundation Model v1.5
Running on Zero

Agents

Featured

34

Audio Flamingo Next

🔊

34

Answer questions about uploaded audio or YouTube videos