Event frame extraction based on a gene regulation corpus
COLING(2008)
摘要
This paper describes the supervised acquisition of semantic event frames based on a corpus of biomedical abstracts, in which the biological process of E. coli gene regulation has been linguistically annotated by a group of biologists in the EC research project "BOOTStrep". Gene regulation is one of the rapidly advancing areas for which information extraction could boost research. Event frames are an essential linguistic resource for extraction of information from biological literature. This paper presents a specification for linguistic-level annotation of gene regulation events, followed by novel methods of automatic event frame extraction from text. The event frame extraction performance has been evaluated with 10-fold cross validation. The experimental results show that a precision of nearly 50% and a recall of around 20% are achieved. Since the goal of this paper is event frame extraction, rather than event instance extraction, the issue of low recall could be solved by applying the methods to a larger-scale corpus.
更多查看译文
关键词
gene regulation event,event instance extraction,gene regulation,semantic event frame,e. coli gene regulation,event frame extraction,event frame,automatic event frame extraction,gene regulation corpus,event frame extraction performance,information extraction
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络