-
Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models
Paper • 2507.08128 • Published • 10 -
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Paper • 2311.07919 • Published • 10 -
Pengi: An Audio Language Model for Audio Tasks
Paper • 2305.11834 • Published • 2
park
woongvy
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
25 days ago
Self-Improving VLM Judges Without Human Annotations
liked
a dataset
about 2 months ago
gamma-lab-umd/MMAU-Pro
updated
a dataset
4 months ago
woongvy/clotho-v2.1
Organizations
None yet