Domain adaptation to summarize human conversations

DANLP 2010: Proceedings of the 2010 Workshop on Domain Adaptation for Natural Language Processing (2010)

Abstract
We are interested in improving the summarization of conversations by using domain adaptation. Since very few email corpora have been annotated for summarization purposes, we attempt to leverage the labeled data available in the multiparty meetings domain for the summarization of email threads. In this paper, we compare several approaches to supervised domain adaptation using out-of-domain labeled data, and also try to use unlabeled data in the target domain through semi-supervised domain adaptation. From the results of our experiments, we conclude that with some in-domain labeled data, training in-domain with no adaptation is most effective, but that when there is no labeled in-domain data, domain adaptation algorithms such as structural correspondence learning can improve summarization.
Keywords
domain adaptation, multiparty meetings domain, semi-supervised domain adaptation, supervised domain adaptation, target domain, in-domain data, unlabeled data, summarization purpose, training in-domain, email corpus, human conversation