INFORMATION ORDERING WITH AN EVENT‐ENRICHED VECTOR SPACE MODEL FOR MULTI‐DOCUMENT NEWS SUMMARIZATION

COMPUTATIONAL INTELLIGENCE(2016)

引用 3|浏览31
暂无评分
摘要
Information ordering is a nontrivial task in multi-document summarization (MDS), which typically relies on the traditional vector space model (VSM) notorious for semantic deficiency. In this article, we propose a novel event-enriched VSM to alleviate the problem by building event semantics into sentence representations. The mediation of event information between sentence and term, especially in the news domain, has an intuitive appeal as well as technical advantage in common sentence-level operations such as sentence similarity computation. Inspired by the block-style writing by humans, we base the sentence ordering algorithm on sentence clustering. To accommodate the complexity introduced by event information, we adopt a soft-to-hard clustering strategy on the event and sentence levels, using expectation-maximization clustering and K-means, respectively. For the purpose of cluster-based sentence ordering, the event-enriched VSM enables us to design an ordering algorithm to enhance event coherence computed between sentence and sentence-context pairs. Drawing on the findings of earlier research, we also incorporate topic continuity measures and time information into the scheme. We evaluate the performance of the model and its variants automatically and manually, with experimental results showing clear advantage of the event-based model over baseline and non-event-based models in information ordering for multi-document news summarization. We are confident that the event-enriched VSM has even greater potential in summarization and beyond, which awaits further research.
更多
查看译文
关键词
event,vector space model,two-layered clustering,coherence,MDS ordering
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要