Developing Probabilistic Models for Identifying Semantic Patterns in Texts

Minhua Huang,Haralick, R.M.

Semantic Computing(2011)

引用 1|浏览0
暂无评分
摘要
We present a probabilistic graphical model that finds a sequence of optimal categories for a sequence of input symbols. Based on this mode, three algorithms are developed for identifying semantic patterns in texts. They are the algorithm for extracting semantic arguments of a verb, the algorithm for classifying the sense of an ambiguous word, and the algorithm for identifying noun phrases from a sentence. Experiments conducted on standard data sets show good results. For example, our method achieves an average precision of 92:96% and an average recall of 94:94% for extracting semantic argument boundaries of verbs on WSJ data from Penn Tree bank and Prop Bank, an average accuracy of 81:12% for recognizing the six sense word 'line', and an average precision of 97:7% and an average recall of 98:8% for recognizing noun phrases on WSJ data from Penn Tree bank.
更多
查看译文
关键词
computational linguistics,probability,text analysis,Penn Tree bank,Prop Bank,ambiguous word,noun phrase,optimal categories,probabilistic graphical model,probabilistic models,semantic argument boundary,semantic pattern,text,verb,
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要