Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
SDewittCLathrop3PhD 's Collections
FINANCE
SPEECH TO TEXT
AGENTS
CHARACTER AI
RESEARCH ARXIV
TTS
PERSONALIZATION
VISION
GPT-OSS
DOCUMENT WRITER
PLAYGROUND
SPREADSHEET
LORAS
EMBEDDING
LAW
SEARCH
LEADERBOARD
HEALTH
VIDEO
WRITE
HARDWARE, VRAM
MODELS
SONGS
TRAINING
IMAGE EXPLANATION
IMAGES
OCR
SPACES

SPEECH TO TEXT

updated 10 days ago
Upvote
-

  • Running
    225

    Qwen3 ASR Demo

    👀
    225

    Convert audio to text with context and language options


  • Running on Zero
    2.61k

    Whisper

    📉
    2.61k

    Transcribe audio files or YouTube videos into text


  • openai/whisper-large-v3

    Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 4.28M • • 5.11k

  • Running
    49

    Qwen3 Omni Captioner Demo

    🐠
    49

    Generate captions from audio


  • Qwen/Qwen3-Omni-30B-A3B-Captioner

    Any-to-Any • 32B • Updated Sep 22 • 13.3k • 172

  • nvidia/parakeet-tdt-0.6b-v3

    Automatic Speech Recognition • Updated Sep 18 • 77.3k • 407

  • LiquidAI/LFM2-Audio-1.5B

    Audio-to-Audio • 1B • Updated Sep 19 • 433 • 274

  • Running
    1.19k

    Whisper Web

    🎤
    1.19k

    Convert spoken words into text

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs