Protected Health Information Recognition of Unstructured Code-Mixed Electronic Health Records in Taiwan.

World Congress on Medical and Health (Medical) Informatics (MedInfo)(2022)

Abstract
Electronic health records (EHRs) at medical institutions are a valuable source for research in both clinical and biomedical domains. However, before such records can be used for research purposes, protected health information (PHI) mentioned in the unstructured text must be removed. In Taiwan's EHR systems, unstructured text is usually written in a mix of English and Chinese, which makes de-identification challenging. This paper presents, to the best of our knowledge, the first study to construct a code-mixed EHR de-identification corpus and to evaluate several established entity recognition methods on the code-mixed PHI recognition task.
Keywords
Code-Mixing, Data Anonymization, Electronic Health Record