Rajat Ghosh PRO
rghosh8
AI & ML interests
None yet
Recent Activity
updated a collection about 14 hours ago
ARC-GRPO
updated a collection about 14 hours ago
ARC-GRPO
updated a model about 14 hours ago
rghosh8/arc-grpo-nemotron-mini-4b-instruct-seed-42-G-4-REDUCED-modules-layers-beta-0.01-merged
Organizations
ARC-GRPO
-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged
2B • Updated • 209 -
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new
Text Generation • Updated • 13 • 1 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4_merged
4B • Updated • 109 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4
Text Generation • Updated • 16
arc-grpo-baseline
-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-baseline
Text Generation • Updated • 14 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-4
Text Generation • Updated • 14 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-16
Text Generation • Updated • 14 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-3407-G-4
Text Generation • Updated • 20
models 125
rghosh8/arc-grpo-nemotron-mini-4b-instruct-seed-42-G-4-REDUCED-modules-layers-beta-0.01-merged
4B • Updated
rghosh8/arc-grpo-nemotron-mini-4b-instruct-seed-42-G-4-REDUCED-modules-layers-beta-0.01
Text Generation • Updated
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-4-epsilon-high-0.3_merged
7B • Updated
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-4-epsilon-high-0.3
Text Generation • Updated
rghosh8/arc-grpo-nemotron-mini-4b-instruct-beta-0.01
4B • Updated • 23
rghosh8/arc-grpo-nemotron-mini-4b-instruct-beta-0.01-adapter
Text Generation • Updated • 12
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-42-G-4-REDUCED-modules_merged
4B • Updated • 32
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-42-G-4-REDUCED-modules
Text Generation • Updated • 12
rghosh8/deepseek-llm-7b-chat-opencoder-educational-instruct-seed-42-G-4-REDUCED-LAYERS-new-params_merged
7B • Updated • 49
rghosh8/deepseek-llm-7b-chat-opencoder-educational-instruct-seed-42-G-4-REDUCED-LAYERS-new-params
Text Generation • Updated • 13
datasets 5
rghosh8/math-lighteval-processed
Viewer • Updated • 7.5k • 8
rghosh8/Codegen_Code-Search-CDP_Benchmarking
Viewer • Updated • 9 • 16
rghosh8/supportGPT-v8
Viewer • Updated • 7.92k • 12 • 1
rghosh8/supportGPT-v2
Viewer • Updated • 8.17k • 9
rghosh8/supportGPT_data
Viewer • Updated • 149 • 14