ARTERIAL: A Natural Language Processing Model for Prevention of Information Leakage from Electronic Health Records

2023 XIII BRAZILIAN SYMPOSIUM ON COMPUTING SYSTEMS ENGINEERING, SBESC(2023)

引用 0|浏览3
暂无评分
摘要
Over the past decade, there has been a steady increase in health security breaches. Therefore, healthcare organizations must protect their sensitive information such as test results, diagnoses, prescriptions, research, and customer personal information. A leak of sensitive data can result in significant economic loss and damage to the organization's image. In this sense, Data Leakage Prevention (DLP) systems can help to identify, monitor, protect, and reduce the risks of leaking sensitive data. However, state-of-the-art DLP solutions only use signature comparisons and static comparisons. Therefore, we propose to develop the ARTERIAL model based on Natural Language Processing (NLP), Entity Recognition (NER), and Artificial Neural Networks (ANN) to be more assertive in extracting information and recognizing entities from Electronic Health Records (EHR). Different from the current literature, ARTERIAL considers semantic features present in the EHR. Three approaches were implemented and tested, two based on ANN and the following based on machine learning algorithms. As a result, the approach taken in its implementation using a machine learning algorithm reached 98.0% of Precision, 86.0% of Recall, and 91.0% of F1-Score.
更多
查看译文
关键词
component,formatting,style,styling,insert
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要