What can linguistics contribute to event extraction?

msra(2006)

引用 25|浏览20
暂无评分
摘要
This paper examines the question of how a linguistic analysis of a written document can contribute to identifying, tracking and populating the "eventualities" that are presented in the document, either directly or indirectly, and representing de- grees of belief concerning them. It is our view that the role of lexical analysis (as exemplified in the research carried out in the FrameNet project) is greater than usually assumed, so this paper is partly an attempt to clarify the boundary between on the one hand the information that can be derived on the basis of linguistic knowledge alone (composed of lexical meanings and the meanings of grammatical constructions) and on the other hand, reasoning based on beliefs about the source of a document, world knowledge, and "common sense". Since the general linguistic processes described in this paper will apply to eventualities in general (by which we mean acts, happenings, states of affairs, and relations, whether real, pro- posed, imagined, or denied ), our presentation will emphasize the linguistic processes themselves. In particular, we show that the kind of information produced by the lexicon-building project FrameNet can have a special role in contributing to text understanding, starting from the basic facts of the combi- natorial properties of frame-bearing words (verbs, nouns, ad- jectives and prepositions) and arriving at the means of recog- nizing the anaphoric properties of specific unexpressed event participants, for all parts of speech, in defining a new layer of anaphora resolution and text cohesion. Using as a starting point the challenge text presented in the call for this work- shop (hereafter referred to as the Hijacking text), we show the points at which a thorough linguistic analysis can articu- late with the kind of simulation formalism demonstrated in X-schema diagram, Figure 2 , which itself incorporates a great deal of world knowledge connected with the events in- troduced in the Hijacking text.
更多
查看译文
关键词
noun,lexical analysis,part of speech
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要