Manual Typification of Source Texts and Multi-document Summaries Alignments

Procedia - Social and Behavioral Sciences(2013)

引用 4|浏览9
暂无评分
摘要
The Multi-document Summarization (MDS) has been focused in Natural Language Processing (NLP) and its aim is to produce automatic summaries from a collection of texts that deal with the same subject (Mani, 2001). The alignment of human-written abstracts to their source documents makes explicit the correspondences that exist in such documents/abstract pairs and create a potentially rich data source to create of rules and models to support more linguistically motivated MDS methods. In this paper we describe the typification of such alignments in the CSTNews corpus. This work is part of two larger projects called Sucinto and Sustento, and it supports MSD researches of Brazilian Portuguese language. Specifically, the typification process consisted of assigning labels to the alignment between a summary sentence and its corresponding source sentence which codify formal and content aspects of the alignment. In order to present this work, we outline the alignment, and detail the typification process, the results of our work and some conclusions.
更多
查看译文
关键词
typification,alignment,summarization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要