Parsing citations in biomedical articles using conditional random fields.

Computers in Biology and Medicine(2011)

引用 21|浏览0
暂无评分
摘要
Citations are used ubiquitously in biomedical full-text articles and play an important role for representing both the rhetorical structure and the semantic content of the articles. As a result, text mining systems will significantly benefit from a tool that automatically extracts the content of a citation. In this study, we applied the supervised machine-learning algorithms Conditional Random Fields (CRFs) to automatically parse a citation into its fields (e.g., Author, Title, Journal, and Year). With a subset of html format open-access PubMed Central articles, we report an overall 97.95% F1-score. The citation parser can be accessed at: http://www.cs.uwm.edu/∼qing/projects/cithit/index.html.
更多
查看译文
关键词
citation parsing,biomedical full-text article,rhetorical structure,biomedical text mining,important role,biomedical article,supervised machine-learning algorithms conditional,text mining system,citation parser,random fields,citation indexing,parsing citation,central article,conditional random field,semantic content,information extraction,natural language processing,conditional random fields,machine learning,html format open-access
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要