Announcement_15
I’m excited to be a mentor for SPAR 2025 this Fall! I’ll be mentoring students on using SAEs for interpretable and tamper-resistant alignment. If you’re interested in advancing AI safety through hands-on interpretability research, please apply!