Edit Models filters

Apps

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

29

Full-text search

Active filters: nebius

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27 • 1.55M • • 11.8k

meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 4.98M • • 4.89k

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26 • 4.44M • • 3.88k

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26 • 3.79M • • 4.12k

Qwen/Qwen3-Coder-30B-A3B-Instruct

Text Generation • 31B • Updated Aug 21 • 415k • • 742

Qwen/Qwen3-30B-A3B-Instruct-2507

Text Generation • 31B • Updated Sep 17 • 969k • • 652

black-forest-labs/FLUX.1-schnell

Text-to-Image • Updated Aug 16, 2024 • 908k • • 4.38k

google/gemma-2-2b-it

Text Generation • 3B • Updated Aug 27, 2024 • 297k • • 1.22k

Qwen/Qwen3-Embedding-8B

Feature Extraction • 8B • Updated Jul 7 • 744k • • 427

Qwen/Qwen3-Coder-480B-A35B-Instruct

Text Generation • 480B • Updated Aug 21 • 38.1k • • 1.23k

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21 • 822k • • 1.67k

Qwen/Qwen3-32B

Text Generation • 33B • Updated Jul 26 • 1.38M • • 567

meta-llama/Llama-3.3-70B-Instruct

Text Generation • 71B • Updated Dec 21, 2024 • 677k • • 2.56k

deepseek-ai/DeepSeek-R1-0528

Text Generation • 685B • Updated May 29 • 554k • • 2.39k

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11 • 21.4k • • 1.38k

Qwen/Qwen3-235B-A22B-Instruct-2507

Text Generation • 235B • Updated Sep 17 • 84.3k • • 708

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • 73B • Updated Jun 6 • 632k • • 558

zai-org/GLM-4.5-Air

Text Generation • 110B • Updated Aug 11 • 494k • • 506

NousResearch/Hermes-4-405B

Text Generation • 406B • Updated Sep 2 • 802 • • 73

nvidia/NVIDIA-Nemotron-Nano-12B-v2

Text Generation • 12B • Updated 5 days ago • 44.1k • • 120

deepseek-ai/DeepSeek-V3-0324

Text Generation • 685B • Updated Mar 27 • 239k • • 3.08k

Qwen/Qwen3-30B-A3B-Thinking-2507

Text Generation • 31B • Updated Aug 17 • 239k • • 312

NousResearch/Hermes-4-70B

Text Generation • 71B • Updated Sep 2 • 3.45k • • 158

Qwen/Qwen2.5-Coder-7B

Text Generation • 8B • Updated Nov 18, 2024 • 20.1k • • 126

intfloat/e5-mistral-7b-instruct

Feature Extraction • 7B • Updated Apr 23, 2024 • 284k • • 549

google/gemma-2-9b-it

Text Generation • 9B • Updated Aug 27, 2024 • 112k • • 742

BAAI/bge-en-icl

Feature Extraction • 7B • Updated Jan 15 • 2.33k • • 135

BAAI/bge-multilingual-gemma2

Feature Extraction • 9B • Updated 27 days ago • 1.09M • • 191

nvidia/Llama-3_1-Nemotron-Ultra-253B-v1

Text Generation • 253B • Updated 25 days ago • 1.56k • • 339