Models: 1,932

Active filters: vllm

RamManavalan/Qwen3-VL-Embedding-8B-FP8

Feature Extraction • 9B • Updated 13 days ago • 81.3k

drmcbride/Mistral-Nemo-Instruct-2407-Q8_0-GGUF

12B • Updated 13 days ago • 15

drmcbride/Mistral-Nemo-Instruct-2407-Q5_K_M-GGUF

12B • Updated 13 days ago • 32

drmcbride/Mistral-Nemo-Instruct-2407-Q4_K_M-GGUF

12B • Updated 12 days ago • 25

Nicholas0205/Mistral-Nemo-Instruct-2407-Q4_K_M-GGUF

12B • Updated 12 days ago • 45

MuXodious/gpt-oss-20b-tainted-heresy-GGUF

Text Generation • 21B • Updated 10 days ago • 2.43k

McG-221/gpt-oss-20b-tainted-heresy-mlx-8Bit

Text Generation • 21B • Updated 11 days ago • 182

mradermacher/gpt-oss-20b-tainted-heresy-GGUF

21B • Updated 11 days ago • 735

mradermacher/gpt-oss-20b-tainted-heresy-i1-GGUF

21B • Updated 11 days ago • 3.33k

root4k/gpt-oss-120b-mlx-mxfp4

Text Generation • 117B • Updated 10 days ago • 318

jtl11/Jinx-Qwen3-4B-Q5_K_M-GGUF

Text Generation • 4B • Updated 10 days ago • 12

jtl11/Jinx-Qwen3-4B-Q6_K-GGUF

Text Generation • 4B • Updated 10 days ago • 25

BlueMoonlight/Mistral-Nemo-Instruct-2407-mlx-4Bit

12B • Updated 9 days ago • 106

weige15/gpt-oss-20b-mxfp4-gguf

Text Generation • 22B • Updated 8 days ago • 2

Abc7347/this

Audio-Text-to-Text • 24B • Updated 8 days ago • 2

TheHouseOfTheDude/GLM-4.7-Flash_AWQ

Text Generation • Updated 6 days ago • 144

chbae624/vllm-translategemma-12b-it

Text Generation • 13B • Updated 7 days ago • 47

JongYeop/Llama-3.1-8B-Instruct-NVFP4-W4A4

5B • Updated 7 days ago • 10

zxczxcsacaca/gpt-oss-120b-script

Text Generation • 120B • Updated 6 days ago • 6

Mohaddz/temp20B

Text Generation • 21B • Updated 6 days ago • 19

tellang/yeji-4b-rslora-v8-AWQ

Text Generation • 1B • Updated 4 days ago • 37

tellang/yeji-4b-rslora-v8-AWQ-fixed

Text Generation • 1B • Updated 4 days ago • 28

ATL-Machine/affine-top-5GEc6UzXjDCDxcE7cpB8yxW3g83gSNFVQYZJZRYMQXdkBU6Y

Updated 5 days ago • 6

Vishva007/Qwen3-4B-Instruct-2507-W4A16-AutoRound-AWQ

Text Generation • 4B • Updated 4 days ago • 110

mattator/test

4B • Updated 5 days ago • 8

mattator/test-gguf

8B • Updated 5 days ago • 114

cublya/GPT-OSS-Code-Reasoning-20B

Text Generation • 22B • Updated 4 days ago • 9

goodgoals/Nervus-Sapien-Lite-1.01

Text Generation • Updated 4 days ago

JongYeop/Llama-3.1-70B-Instruct-NVFP4-W4A4

41B • Updated 3 days ago • 31

GadflyII/GLM-4.7-Flash-MTP-NVFP4

Text Generation • 19B • Updated 2 days ago • 393