Robert Mueller
bordeauxred
AI & ML interests
RL, RLHF, RLAIF, meta learning
Recent Activity
updated a model about 20 hours ago
GoodStartLabs/qwen3-8b-openspiel-mix8-selfplay-randmix-100iter
published a model about 20 hours ago
GoodStartLabs/qwen3-8b-openspiel-mix8-selfplay-randmix-100iter
updated a model about 21 hours ago
GoodStartLabs/qwen3-8b-openspiel-mix8-selfplay-randmix-300iter