-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged
2B • Updated • 179 -
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new
Text Generation • Updated • 13 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4_merged
4B • Updated • 91 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4
Text Generation • Updated • 16
Rajat Ghosh PRO
rghosh8
AI & ML interests
None yet
Recent Activity
updated a collection about 8 hours ago
GSM8k-GRPO updated a collection about 8 hours ago
GSM8k-GRPO updated a model about 9 hours ago
rghosh8/gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-3407-G-4Organizations
Opencoder-GRPO
-
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4-merged
2B • Updated • 32 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4
Text Generation • Updated • 15 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8
Text Generation • Updated • 12 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged
2B • Updated • 65
ARC-GRPO
-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged
2B • Updated • 179 -
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new
Text Generation • Updated • 13 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4_merged
4B • Updated • 91 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4
Text Generation • Updated • 16
Opencoder-GRPO
-
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4-merged
2B • Updated • 32 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4
Text Generation • Updated • 15 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8
Text Generation • Updated • 12 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged
2B • Updated • 65
models 104
rghosh8/gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-3407-G-4
Text Generation • Updated • 11
rghosh8/gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4
Text Generation • Updated • 11
rghosh8/nemotron-mini-4b-opencoder-educational-instruct-seed-42-G-4-REDUCED-LAYERS-new-params_merged
4B • Updated • 18
rghosh8/nemotron-mini-4b-instruct-opencoder-educational-instruct-seed-42-G-4-REDUCED-LAYERS-new-params
Text Generation • Updated • 9
rghosh8/arc-grpo-Nemotron-Mini-4B-Instruct-rajat-seed-42-G-4-REDUCED-LAYERS_merged
4B • Updated • 11
rghosh8/arc-grpo-Nemotron-Mini-4B-Instruct-rajat-seed-42-G-4-REDUCED-LAYERS
Text Generation • Updated • 10
rghosh8/gsm8k-Nemotron-Mini-4B-Instruct-rajat-seed-42-G-4-REDUCED-LAYERS_merged
4B • Updated • 10
rghosh8/gsm8k-Nemotron-Mini-4B-Instruct-rajat-seed-42-G-4-REDUCED-LAYERS
Text Generation • Updated • 10
rghosh8/nemotron-mini-4b-instruct-opencoder-educational-instruct-seed-42-G-4-REDUCED-LAYERS_merged
4B • Updated • 11
rghosh8/nemotron-mini-4b-instruct-opencoder-educational-instruct-seed-42-G-4-REDUCED-LAYERS
Text Generation • Updated • 11
datasets 5
rghosh8/math-lighteval-processed
Viewer • Updated • 7.5k • 8
rghosh8/Codegen_Code-Search-CDP_Benchmarking
Viewer • Updated • 9 • 15
rghosh8/supportGPT-v8
Viewer • Updated • 7.92k • 13 • 1
rghosh8/supportGPT-v2
Viewer • Updated • 8.17k • 9
rghosh8/supportGPT_data
Viewer • Updated • 149 • 14