AI & ML interests
None yet
Organizations
None yet
models 107
citrinegui/Qwen2.5-1.5B-Instruct_countdown2345_grpo_vrex_0.5_0.5_SEC1.0DRO0.0G0.0_minpTrue_FT10000_800
Updated
citrinegui/Qwen2.5-3B-Instruct_countdown2345_grpo_vrex_0.25_0.75_SEC0.0DRO0.0G1.0_minpTrue_1600
Text Generation
• 242k • Updated
• 1
citrinegui/Qwen2.5-3B-Instruct_countdown2345_grpo_vrex_0.5_0.5_SEC1.0DRO0.0G0.0_minp0.0_1600
Text Generation
• 242k • Updated
• 1
citrinegui/Llama-3.2-3B-Instruct_countdown2345_grpo_vrex_0.5_0.5_SEC1.0DRO0.0G0.0_minp0.0_1600
Text Generation
• 175k • Updated
• 1
citrinegui/Qwen2.5-1.5B-Instruct_blocksworld1246_grpo_vrex_0.5_0.5_SEC0.99DRO0.0G0.0_minp0.0_1200
Text Generation
• 2B • Updated
• 1
citrinegui/Llama-3.2-3B-Instruct_blocksworld1246_grpo_vrex_0.5_0.5_SEC1.0DRO0.0G0.0_minp0.0_1200
Text Generation
• 175k • Updated
• 2
citrinegui/Llama-3.2-3B-Instruct_blocksworld1246_grpo_vrex_0.5_0.5_SEC0.3DRO0.0G0.0_minp0.0_1200
Updated
citrinegui/Qwen2.5-1.5B-Instruct_blocksworld1246_grpo_vrex_0.5_0.5_SEC1.0DRO0.0G0.0_minp0.0_1200
Text Generation
• 2B • Updated
• 2
citrinegui/Qwen2.5-1.5B-Instruct_blocksworld1246_grpo_vrex_0.5_0.5_SEC1.0DRO0.0G0.0_minp0.0_1600
Updated
citrinegui/Qwen2.5-1.5B-Instruct_countdown2345_grpo_vrex_0.5_0.5_SEC1.0DRO0.0G0.0_minpTrue_10000
Text Generation
• 2B • Updated
• 3
datasets 11
citrinegui/countdown_n2t1000_1-100
Viewer
• Updated
• 329k • 6
citrinegui/countdown_n2t100_1-100
Viewer
• Updated
• 329k • 28
citrinegui/countdown_n6t1000_1-100
Viewer
• Updated
• 329k • 5
citrinegui/countdown_n6t100_1-100
Viewer
• Updated
• 329k • 4
citrinegui/countdown_n5t1000_1-100
Viewer
• Updated
• 329k • 5
citrinegui/countdown_n5t100_1-100
Viewer
• Updated
• 329k • 27
citrinegui/countdown_n4t1000_1-100
Viewer
• Updated
• 329k • 4
citrinegui/countdown_n4t100_1-100
Viewer
• Updated
• 329k • 29
citrinegui/countdown_n3t1000_1-100
Viewer
• Updated
• 329k • 6
citrinegui/countdown_n3t100_1-100
Viewer
• Updated
• 329k • 30