Auto Research with Specialist Agents Develops Effective and Non-Trivial Training Recipes • Paper • 2605.05724 • Published 11 days ago • 15
Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models • Paper • 2604.26951 • Published 19 days ago • 46