-
lightblue/DeepSeek-R1-Distill-Qwen-1.5B-Multilingual
Updated • 118 • 24 -
lightblue/DeepSeek-R1-Distill-Qwen-7B-Multilingual
8B • Updated • 84 • 22 -
lightblue/DeepSeek-R1-Distill-Qwen-14B-Multilingual
15B • Updated • 30 • 13 -
lightblue/DeepSeek-R1-Distill-Qwen-7B-Japanese
Text Generation • 8B • Updated • 18 • • 33
AI & ML interests
None defined yet.
Recent Activity
View all activity
Multipurpose RAG models for many languages
-
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-full
Text Generation • 8B • Updated • 7.65k • • 2 -
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-top75
Text Generation • 8B • Updated • 7.64k • • 4 -
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-half
Text Generation • 8B • Updated • 8.83k • • 16 -
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-top25
Text Generation • 8B • Updated • 8.77k • • 3
-
lightblue/DeepSeek-R1-Distill-Qwen-1.5B-Multilingual
Updated • 118 • 24 -
lightblue/DeepSeek-R1-Distill-Qwen-7B-Multilingual
8B • Updated • 84 • 22 -
lightblue/DeepSeek-R1-Distill-Qwen-14B-Multilingual
15B • Updated • 30 • 13 -
lightblue/DeepSeek-R1-Distill-Qwen-7B-Japanese
Text Generation • 8B • Updated • 18 • • 33
The models trained under our Karasu and Qarasu project
Multipurpose RAG models for many languages
Our latest fine-tuned models
-
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-full
Text Generation • 8B • Updated • 7.65k • • 2 -
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-top75
Text Generation • 8B • Updated • 7.64k • • 4 -
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-half
Text Generation • 8B • Updated • 8.83k • • 16 -
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-top25
Text Generation • 8B • Updated • 8.77k • • 3