Extracting Formulaic and Free Text Clinical Research Articles Metadata using Conditional Random Fields.

Louhi '10: Proceedings of the NAACL HLT 2010 Second Louhi Workshop on Text and Data Mining of Health Documents(2010)

引用 9|浏览14
暂无评分
摘要
We explore the use of conditional random fields (CRFs) to automatically extract important metadata from clinical research articles. These metadata fields include formulaic meta-data about the authors, extracted from the title page, as well as free text fields concerning the study's critical parameters, such as longitudinal variables and medical intervention methods, extracted from the body text of the article. Extracting such information can help both readers conduct deep semantic search of articles and policy makers and sociologists track macro level trends in research. Preliminary results show an acceptable level of performance for formulaic metadata and a high precision for those found in the free text.
更多
查看译文
关键词
body text,formulaic metadata,free text,free text field,important metadata,metadata field,acceptable level,clinical research article,formulaic meta-data,macro level trend,Extracting formulaic,conditional random field
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要