Mining the Clinical Narrative: All Text are Not Equal

2016 IEEE International Conference on Healthcare Informatics (ICHI)(2016)

引用 27|浏览16
暂无评分
摘要
Over the past decade, the application of data science techniques to clinical data has allowed practitioners and researchers to develop a sundry of analytical models. These models have traditionally relied on structured data drawn from Electronic Medical Records (EMR). Yet, a large portion of EMR data remains unstructured, primarily held within clinical notes. While recent work has produced techniques for extracting structured features from unstructured text, this work generally operates under the untested assumption that all clinical text can be processed in a similar manner. This paper provides what we believe to be the first comprehensive evaluation of the differences between four major sources of clinical text, providing an evaluation of the structural, linguistic, and topical differences among notes of each category. Our conclusions support the premise that tools designed to extract structured data from clinical text must account for the categories of text they process.
更多
查看译文
关键词
structured data extraction,unstructured text,structured feature extraction,EMR data,electronic medical records,data science techniques,clinical data,clinical narrative mining
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要