Measuring interpersonal firearm violence: natural language processing methods to address limitations in criminal charge data

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION(2024)

引用 0|浏览0
暂无评分
摘要
Objective Firearm violence constitutes a public health crisis in the United States, but comprehensive data infrastructure is lacking to study this problem. To address this challenge, we used natural language processing (NLP) to classify court record documents from alleged violent crimes as firearm-related or non-firearm-related.Materials and Methods We accessed and digitized court records from the state of Washington (n = 1472). Human review established a gold standard label for firearm involvement (yes/no). We developed a key term search and trained supervised machine learning classifiers for this labeling task. Results were evaluated in a held-out test set.Results The decision tree performed best (F1 score: 0.82). The key term list had perfect recall (1.0) and a modest F1 score (0.65).Discussion and Conclusion This case report highlights the accuracy, feasibility, and potential time-saved by using NLP to identify firearm involvement in alleged violent crimes based on digitized narratives from court documents.
更多
查看译文
关键词
firearms,violence,natural language processing,crime
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要