Models: 14

Active filters: Reward

SultanR/SmolTulu-1.7b-RM

Text Classification • 2B • Updated Dec 17, 2024 • 8 • 2

mradermacher/SmolTulu-1.7b-RM-GGUF

2B • Updated Dec 17, 2024 • 43

mradermacher/SmolTulu-1.7b-RM-i1-GGUF

2B • Updated Dec 17, 2024 • 432

Teen-Different/squiral_maze

Reinforcement Learning • Updated Mar 30, 2025

internlm/POLAR-1_8B

Text Classification • Updated Jul 15, 2025 • 16 • 9

internlm/POLAR-1_8B-Base

Text Classification • Updated Jul 15, 2025 • 14 • 1

internlm/POLAR-7B

Text Classification • Updated Jul 15, 2025 • 21 • 25

internlm/POLAR-7B-Base

Text Classification • Updated Jul 15, 2025 • 13 • 5

wangclnlp/GRAM-RR-LLaMA-3.1-8B-RewardModel

Text Generation • 8B • Updated Sep 4, 2025 • 4 • 2

wangclnlp/GRAM-RR-LLaMA-3.2-3B-RewardModel

Text Generation • 3B • Updated Sep 4, 2025 • 10

mradermacher/GRAM-RR-LLaMA-3.2-3B-RewardModel-GGUF

3B • Updated Sep 4, 2025 • 43

mradermacher/GRAM-RR-LLaMA-3.2-3B-RewardModel-i1-GGUF

3B • Updated Dec 28, 2025 • 87

mradermacher/GRAM-RR-LLaMA-3.1-8B-RewardModel-GGUF

8B • Updated Sep 4, 2025 • 38 • 1

mradermacher/GRAM-RR-LLaMA-3.1-8B-RewardModel-i1-GGUF

8B • Updated Dec 28, 2025 • 154 • 1