This collection contains Olmo models trained with curriculum reinforcement learning (RL).
SeanWang0027
AI & ML interests
LLM Post-Training
Recent Activity
published a model 2 days ago: SeanWang0027/rl_warm_up_mixed_minesweeper_correct_thinking-parquet_qwen3-1.7b_epoch_3_mask_k4096
updated a model 2 days ago: SeanWang0027/rl_warm_up_mixed_minesweeper_correct_thinking-parquet_qwen3-1.7b_epoch_3_mask_k4096
published a model 3 days ago: SeanWang0027/rl_warm_up_mixed_minesweeper_correct_thinking-parquet_qwen3-1.7b_epoch_3_mask