Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations
Swapnaja Achintalwar, Adriana Alvarado Garcia, Ateret Anaby-Tavor, Ioana Baldini, Sara E. Berger, Bishwaranjan Bhattacharjee, Djallel Bouneffouf, Subhajit Chaudhury, Pin-Yu Chen, Lamogha Chiazor, Elizabeth M. Daly, Kirushikesh DB, Rogério Abreu de Paula, Pierre Dognin, Eitan Farchi, Soumya Ghosh, Michael Hind, Raya Horesh, George Kour, Ja Young Lee, Nishtha Madaan, Sameep Mehta, Erik Miehling, Keerthiram Murugesan, Manish Nagireddy, Inkit Padhi, David Piorkowski, Ambrish Rawat, Orna Raz, Prasanna Sattigeri, Hendrik Strobelt, Sarathkrishna Swaminathan, Christoph Tillmann, Aashka Trivedi, Kush R. Varshney, Dennis Wei, Shalisha Witherspoon, Marcel Zalmanovici. CoRR (2024)
Keywords
Detector Performance, Silicon Detectors