Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
soham97
/
mellow
like
3
small audio-language model
ALM
audio
music
sound events
audio reasoning
audio captioning
audio question answering
zero-shot
audio-text
arxiv:
2503.08540
License:
mit
Model card
Files
Files and versions
xet
Community
main
mellow
Commit History
v0_s checkpoint
83672db
soham97
commited on
Apr 17
update reasonaqa links
18b1b5d
soham97
commited on
Mar 17
demo
d849e10
soham97
commited on
Mar 15
update
a04bbe7
soham97
commited on
Mar 12
readme update
1cf68be
soham97
commited on
Mar 10
readme update
5bb426c
soham97
commited on
Mar 10
first
0e6a16e
soham97
commited on
Mar 10
first
fce207b
soham97
commited on
Mar 10
first
6b09558
soham97
commited on
Mar 10
first
f00f693
soham97
commited on
Mar 10
first
2c6e7ae
soham97
commited on
Mar 10
first
ff5da81
soham97
commited on
Mar 10
initial commit
adbfe61
verified
soham97
commited on
Mar 10