A Platform for Multilingual News Summarization

msra(2003)

引用 27|浏览6
暂无评分
摘要
We have developed a multilingual version of Columbia Newsblaster as a testbed for multilingual multi-document summarization. The system collects, clusters, and summarizes news documents from sources all over the world daily. It crawls news sites in many dieren t countries, written in dieren t languages, extracts the news text from the HTML pages, uses a variety of methods to translate the documents for clustering and summarization, and produces an English summary for each cluster. The system is robust, running daily over real-world data. The multilingual version of Columbia Newsblaster provides a platform for testing dieren t strategies for multilingual document clustering, and approaches for multilingual multi-document summarization.
更多
查看译文
关键词
summarization,multilingual,computer science
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要