What's Important in a Text? An Extensive Evaluation of Linguistic Annotations for Summarization

2018 Fifth International Conference on Social Networks Analysis, Management and Security (SNAMS)(2018)

引用 2|浏览97
暂无评分
摘要
Automatic text summarization aims at reducing the length of input documents while preserving the most important information. A key challenge in automatic summarization is therefore to estimate the importance of information. Most extractive summarization systems, however, usually only consider bigrams as the representation from which importance can be estimated. The potential of other text annotations such as frames or named-entities remains unexplored. In this paper, we evaluate the application potential of linguistic annotations for automatic text summarization. To this end, we extend a previously presented summarization system by replacing bigrams with a multitude of different linguistic annotation types, including n-grams, verb stems, frames, concepts, chunks, connotation frames, entity types, and discourse relation sense-types. We propose two novel evaluation methods to evaluate information importance detection capabilities. In our experiments, bigrams show the best overall performance when source document sentences have to be ranked. These results support the decision of summarization system developers to use bigrams in summarization systems. However, other annotation types perform better if the model has to distinguish between source and reference sentences.
更多
查看译文
关键词
extensive evaluation,linguistic annotations,automatic text summarization,automatic summarization,extractive summarization systems,text annotations,entity types,evaluation methods,information importance detection capabilities,summarization system developers,linguistic annotation types
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要