STAR: SocioTechnical Approach to Red Teaming Language ModelsLaura Weidinger,John F J Mellor,Bernat Guillén Pegueroles,Nahema Marchal,Ravin Kumar,Kristian Lum,Canfer Akbulut,Mark Diaz,A. Stevie Bergman,Mikel D. Rodriguez,Verena Rieser,William IsaacEMNLP 2024(2024)引用 12|浏览28AI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要