ICML 2026 paper: "Disentangling Intent from Role: Adversarial Self-Play for Persona-Invariant Safety Alignment"
Xiaoyu Wen
XiaoyuWen
AI & ML interests
None yet
Recent Activity
updated a dataset 4 days ago: XiaoyuWen/PIA-Persona-Dataset
updated a collection 4 days ago: PIA
updated a collection 4 days ago: PIA