A Semantic Triplet Based Story Classifier

Advances in Social Networks Analysis and Mining(2012)

引用 33|浏览0
暂无评分
摘要
A story is defined as “an actor(s) taking action(s) that culminates in a resolution(s).” In this paper, we investigate the utility of standard keyword based features, statistical features based on shallow-parsing (such as density of POS tags and named entities), and a new set of semantic features to develop a story classifier. This classifier is trained to identify a paragraph as a “story,” if the paragraph contains mostly story(ies). Training data is a collection of expert-coded story and non-story paragraphs from RSS feeds from a list of extremist web sites. Our proposed semantic features are based on suitable aggregation and generalization of <;Subject, Verb, Object>; triplets that can be extracted using a parser. Experimental results show that a model of statistical features alongside memory-based semantic linguistic features achieves the best accuracy with a Support Vector Machine (SVM) classifier.
更多
查看译文
关键词
support vector machine,semantic triplet,pos tag,proposed semantic feature,best accuracy,expert-coded story,semantic feature,non-story paragraph,memory-based semantic linguistic feature,statistical feature,story classifier,feature extraction,accuracy,linguistics,organizations,support vector machines,semantics,statistical analysis,literature,artificial intelligence,grammars
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要