UniSD: Towards a Unified Self-Distillation Framework for Large Language Models Paper • 2605.06597 • Published 9 days ago • 15
One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue Paper • 2605.05630 • Published 4 days ago • 10