- Qwen/Qwen3.5-397B-A17B
  Image-Text-to-Text • 403B • Updated • 1.63M • 1.38k
- Qwen/Qwen3.5-397B-A17B-FP8
  Image-Text-to-Text • 403B • Updated • 616k • 145
- Qwen/Qwen3.5-122B-A10B
  Image-Text-to-Text • 125B • Updated • 671k • 458
- Qwen/Qwen3.5-122B-A10B-FP8
  Image-Text-to-Text • 125B • Updated • 840k • 78
Collections
Discover the best community collections!
Collections trending this week
- Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2
  Image-Text-to-Text • 28B • Updated • 6.19k • 46
- Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF
  Image-Text-to-Text • 27B • Updated • 57.1k • 164
- Jackrong/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-v2
  Image-Text-to-Text • 10B • Updated • 31.3k • 124
- Jackrong/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF
  Image-Text-to-Text • 9B • Updated • 66.9k • 167
- nvidia/Nemotron-Cascade-2-30B-A3B
  Text Generation • 32B • Updated • 55.5k • 315
- nvidia/Nemotron-Cascade-2-RL-data
  Viewer • Updated • 55.7k • 588 • 32
- nvidia/Nemotron-Cascade-2-SFT-Data
  Viewer • Updated • 15.9M • 6.58k • 31
- Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
  Paper • 2603.19220 • Published • 58
- facebook/dinov3-vit7b16-pretrain-lvd1689m
  Image Feature Extraction • 7B • Updated • 31.3k • 220
- facebook/dinov3-vits16-pretrain-lvd1689m
  Image Feature Extraction • 21.6M • Updated • 234k • 76
- facebook/dinov3-convnext-small-pretrain-lvd1689m
  Image Feature Extraction • 49.5M • Updated • 22.6k • 23
- facebook/dinov3-vitb16-pretrain-lvd1689m
  Image Feature Extraction • 85.7M • Updated • 882k • 112
- meta-llama/Meta-Llama-3-8B
  Text Generation • 8B • Updated • 3.62M • 6.49k
- meta-llama/Meta-Llama-3-8B-Instruct
  Text Generation • 8B • Updated • 1.42M • 4.44k
- meta-llama/Meta-Llama-3-70B-Instruct
  Text Generation • 71B • Updated • 82.2k • 1.51k
- meta-llama/Meta-Llama-3-70B
  Text Generation • 71B • Updated • 301k • 873
- ai-sage/GigaChat3.1-702B-A36B-GGUF
  Text Generation • 702B • Updated • 479 • 13
- ai-sage/GigaChat3.1-702B-A36B
  Text Generation • 715B • Updated • 417 • 20
- ai-sage/GigaChat3.1-702B-A36B-bf16
  Text Generation • 715B • Updated • 524 • 5
- ai-sage/GigaChat3.1-10B-A1.8B-GGUF
  Text Generation • 11B • Updated • 4.49k • 33
- Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled
  Image-Text-to-Text • 28B • Updated • 184k • 1.38k
- Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-GGUF
  Image-Text-to-Text • 27B • Updated • 511k • 422
- Jackrong/Qwen3.5-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled
  Text Generation • 36B • Updated • 5k • 75
- Jackrong/Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled
  Text Generation • 5B • Updated • 10.8k • 19
- nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16
  Text Generation • 124B • Updated • 139k • 300
- nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8
  Text Generation • 124B • Updated • 774k • 201
- nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4
  Text Generation • 67B • Updated • 1M • 211
- nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-Base-BF16
  Text Generation • 124B • Updated • 9.95k • 23
- unsloth/Qwen3.5-35B-A3B-GGUF
  Image-Text-to-Text • 35B • Updated • 2.14M • 731
- unsloth/Qwen3.5-9B-GGUF
  Image-Text-to-Text • 9B • Updated • 1.37M • 414
- unsloth/Qwen3.5-27B-GGUF
  Image-Text-to-Text • 27B • Updated • 978k • 355
- unsloth/Qwen3.5-122B-A10B-GGUF
  Image-Text-to-Text • 122B • Updated • 570k • 211