microsoft/VibeVoice-1.5B
Tags: Text-to-Speech, Transformers, Safetensors, VibeVoice, English, Chinese, text-generation, Podcast
arXiv: 2508.19205, 2412.08635
License: mit
Branch: main
Total size: 5.41 GB
5 contributors
History: 16 commits
Latest commit: Add library_name: transformers to metadata (#20), 1904eae, verified, 2 months ago (unilm, nielsr [HF Staff])
| Name | Scan | Size | Storage | Last commit | Updated |
|---|---|---|---|---|---|
| figures | | | | update README | 3 months ago |
| .gitattributes | Safe | 1.6 kB | | update README | 3 months ago |
| README.md | Safe | 7.27 kB | | Add library_name: transformers to metadata (#20) | 2 months ago |
| config.json | Safe | 2.76 kB | | Upload VibeVoice 1.5B model | 3 months ago |
| model-00001-of-00003.safetensors | Safe | 1.98 GB | xet | Upload VibeVoice 1.5B model | 3 months ago |
| model-00002-of-00003.safetensors | Safe | 1.98 GB | xet | Upload VibeVoice 1.5B model | 3 months ago |
| model-00003-of-00003.safetensors | Safe | 1.45 GB | xet | Upload VibeVoice 1.5B model | 3 months ago |
| model.safetensors.index.json | Safe | 123 kB | | Upload VibeVoice 1.5B model | 3 months ago |
| preprocessor_config.json | Safe | 351 Bytes | | Upload VibeVoice 1.5B model | 3 months ago |