Detecting Privacy-Sensitive Code Changes with Language Modeling

2022 MINING SOFTWARE REPOSITORIES CONFERENCE (MSR 2022)(2022)

引用 0|浏览21
暂无评分
摘要
At Meta, we work to incorporate privacy-by-design into all of our products and keep user information secure. We have created an ML model that detects code changes ("diffs") that have privacy-sensitive implications. At our scale of tens of thousands of engineers creating hundreds of thousands of diffs each month, we use automated tools for detecting such diffs. Inspired by recent studies on detecting defects [2, 3, 5] and security vulnerabilities [4, 6, 7], we use techniques from natural language processing to build a deep learning system for detecting privacy-sensitive code.
更多
查看译文
关键词
privacy,software,repository,change,detection,machine learning,privacy sensitive,neural networks
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要