RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards Paper • 2605.10899 • Published 9 days ago • 74
SkillOS: Learning Skill Curation for Self-Evolving Agents Paper • 2605.06614 • Published 13 days ago • 45
MARS: Modular Agent with Reflective Search for Automated AI Research Paper • 2602.02660 • Published Feb 2 • 67