In-House Evaluation is Not Enough: Towards Robust Third-Party Flaw Disclosure for General-Purpose AI
Shayne Longpre, Kevin Klyman, Ruth E. Appel,Sayash Kapoor,Rishi Bommasani, Michelle Sahar,Sean McGregor,Avijit Ghosh, Borhane Blili-Hamelin, Nathan Butters,Alondra Nelson, Amit Elazari, Andrew Sellars, Casey John Ellis, Dane Sherrets,Dawn Song, Harley Geiger, Ilona Cohen, Lauren McIlvenny, Madhulika Srikumar, Mark M. Jaycox,Markus Anderljung, Nadine Farid Johnson,Nicholas Carlini, Nicolas Miailhe, Nik Marda,Peter Henderson, Rebecca S. Portnoff,Rebecca Weiss, Victoria Westerhoff,Yacine Jernite, Rumman Chowdhury,Percy Liang,Arvind Narayanan arxiv(2025)
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper