Identifying and characterizing highly similar notes in big clinical note datasets.

Journal of Biomedical Informatics (2018)

Abstract
• We aimed to use a scalable algorithm to de-duplicate notes in big datasets.
• We use a three-phase algorithm that minimizes pairwise comparisons.
• Duplicate notes were very prevalent in our institutional electronic medical record.
Keywords
Electronic medical record, De-duplication, Natural language processing