Multi-Stage Pre-training for Low-Resource Domain Adaptation

Conference on Empirical Methods in Natural Language Processing (2020)

Citations: 33 | Views: 751
Abstract
Transfer learning techniques are particularly useful for NLP tasks where a sizable amount of high-quality annotated data is difficult to obtain. Current approaches directly adapt a pretrained language model (LM) on in-domain text before fine-tuning to downstream tasks. We show that extending the vocabulary of the LM with domain-specific terms leads to further gains. To greater effect, we utilize structure in the unlabeled data to create auxiliary synthetic tasks, which helps the LM transfer to downstream tasks. We apply these approaches incrementally on a pretrained RoBERTa-large LM and show considerable performance gains on three tasks in the IT domain: Extractive Reading Comprehension, Document Ranking and Duplicate Question Detection.
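The abstract describes two ingredients beyond plain in-domain LM adaptation: extending the LM vocabulary with domain-specific terms, and creating auxiliary synthetic tasks from structure in the unlabeled data. The sketch below illustrates only the first ingredient, vocabulary extension, using the Hugging Face transformers API; it is not the authors' code, and the list of IT-domain terms is a made-up placeholder.

```python
# Minimal sketch (not the paper's implementation): add domain-specific terms
# to a pretrained RoBERTa-large vocabulary before continued in-domain
# masked-LM pre-training.
from transformers import RobertaTokenizer, RobertaForMaskedLM

tokenizer = RobertaTokenizer.from_pretrained("roberta-large")
model = RobertaForMaskedLM.from_pretrained("roberta-large")

# Hypothetical domain terms mined from unlabeled IT-domain text.
domain_terms = ["hypervisor", "kubectl", "segfault", "systemd"]

# add_tokens skips terms the tokenizer already covers as single tokens.
num_added = tokenizer.add_tokens(domain_terms)

# Grow the embedding matrix so the new tokens get trainable vectors;
# these are then learned during continued in-domain pre-training.
model.resize_token_embeddings(len(tokenizer))
print(f"Added {num_added} domain tokens; vocabulary size is now {len(tokenizer)}")
```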
Keywords
multi-stage, pre-training, low-resource