Russian-Kyrgyz LoResMT'2026
Collection
This is a collection of artifacts from our submission to the Turkic languages translation challenge at LoResMT 2026 • 3 items • Updated
This model took first place in the Russian-to-Kyrgyz translation track at the LoResMT 2026 Turkic Languages Translation Challenge.
See our paper for full details: tbd (LoResMT @ EACL 2026)
| Benchmark | chrF++ | XCOMET-XXL |
|---|---|---|
| FLORES-200 devtest | 44.9 | 80.5 |
| LoResMT 2026 test | 49.1 | 69.7 |
| Benchmark | chrF++ | XCOMET-XXL |
|---|---|---|
| FLORES-200 devtest | 42.4 | 82.8 |
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
tokenizer = AutoTokenizer.from_pretrained("Novokshanov/ru-ky-mt0-loresmt2026")
model = AutoModelForSeq2SeqLM.from_pretrained("Novokshanov/ru-ky-mt0-loresmt2026")
# Availible prefixes are "<2ky>" and "<2ru>".
text = "<ky>Привет, как дела?"
inputs = tokenizer(text, max_length=512, return_tensors="pt")
outputs = model.generate(**inputs, max_length=512, num_beams=5)
translation = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(translation)
tbd
CC BY-NC 4.0
Base model
bigscience/mt0-large