OpenLearnLM/special-r1-deepseek-qwen3-8b-merged-dare-v2 Text Generation • 8B • Updated about 23 hours ago • 14
OpenLearnLM/special-r1-deepseek-qwen3-8b-merged-dare-v2 Text Generation • 8B • Updated about 23 hours ago • 14
OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-reward Text Generation • 8B • Updated 18 days ago • 103
OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-reward Text Generation • 8B • Updated 18 days ago • 103
OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-noreward Text Generation • 8B • Updated 28 days ago • 230
OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-noreward Text Generation • 8B • Updated 28 days ago • 230