This collection contains datasets and models related to "BLEUBERI: BLEU is a surprisingly effective reward for instruction following".
Yapei Chang
yapeichang
AI & ML interests: NLP
Recent Activity
updated a model about 2 months ago: yapeichang/memo-32b-4tier
published a model about 2 months ago: yapeichang/memo-32b-4tier