Generating Extraction-Based Summaries from Hand-Written Summaries by Aligning Text Spans

msra(1999)

引用 37|浏览59
暂无评分
摘要
Human-quality text summarization systems based on sentence extraction are difficult to design because doc- uments can differ along several dimensions, such as length, writing style and lexical usage. The lack of suitable corpor a of extraction-based summaries makes it difficult to evaluate a nd improve existing algorithms. However, there are a large num- ber of hand-written (not extraction-based) summaries avai l- able for news-wire stories. This paper discusses our work on generating a corpus of approximately 25,000 extraction- based summaries from hand-written summaries. We discuss how text-span alignment can be applied to this problem and how this problem differs from previous work on aligning par- allel texts. In addition, we briefly analyze differences bet ween handwritten and extracted summaries.
更多
查看译文
关键词
text summarization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要