ADRA-RL/tulu2-7b_aime_controlled_contamination_original Text Generation • 7B • Updated 21 days ago • 71
ADRA-RL/tulu2-7b_aime_controlled_contamination_paraphrased Text Generation • 7B • Updated 21 days ago • 8
ADRA-RL/tulu2-7b_olympiads_controlled_contamination_original Text Generation • 7B • Updated 19 days ago • 9
ADRA-RL/tulu2-7b_lora_adra-plus_aime_paraphrased_lexical_unique_ngram_coverage_s70 Updated 21 days ago
ADRA-RL/tulu2-7b_olympiads_controlled_contamination_paraphrased Text Generation • 7B • Updated 19 days ago • 10
ADRA-RL/aime_lexical_unique_ngram_coverage_ref_ratio_1.50_random_7_p0.25 Viewer • Updated 21 days ago • 32 • 21
ADRA-RL/aime_lexical_unique_ngram_coverage_ref_ratio_1.50_adaptive_match_minkplus_random_7_p0.25 Viewer • Updated 21 days ago • 32 • 21
ADRA-RL/olympiads_lexical_unique_trio_penalty_2.0_augment_random_7_p0.25 Viewer • Updated 19 days ago • 64 • 14
ADRA-RL/olympiads_lexical_unique_trio_ratio_2.0_adaptive_match_minkplus_augment_random_7_p0.25 Viewer • Updated 19 days ago • 64 • 16
ADRA-RL/olympiads_paraphrased_lexical_unique_trio_ratio_2.0_adaptive_match_minkplus_random_7_p0.25 Viewer • Updated 19 days ago • 64 • 15
ADRA-RL/aya_lexical_unique_ngram_coverage_ref_ratio_1.50_random_15_p0.25 Viewer • Updated 19 days ago • 128 • 15
ADRA-RL/aya_lexical_unique_ngram_coverage_ref_ratio_1.50_adaptive_match_minkplus_random_15_p0.25 Viewer • Updated 19 days ago • 128 • 15
ADRA-RL/wildchat_lexical_unique_ngram_coverage_ref_ratio_1.50_random_7_p0.25 Viewer • Updated 19 days ago • 128 • 15
ADRA-RL/wildchat_lexical_unique_ngram_coverage_ref_ratio_1.50_adaptive_match_minkplus_random_7_p0.25 Viewer • Updated 19 days ago • 128 • 13
ADRA-RL/tulu3-8b_lora_adra-plus_wildchat_original_lexical_unique_ngram_coverage_s100 Updated 19 days ago • 18