Announcement_6 | Akshit Achara

Our paper entitled “Watching the AI Watchdogs: A Fairness and Robustness Analysis of AI Safety Moderation Classifiers” has been accepted at the NAACL 2025 Main Conference. I am grateful for the mentorship I received from Dr. Anshuman Chhabra.