Cross-Cultural Transfer Learning for Text Classification

International Joint Conference on Natural Language Processing (2019)

Abstract
Text classification is considered one of the key tasks in natural language understanding. A computerized system capable of performing such classification makes it possible to use computers to identify written texts and classify them, and in this sense exhibits a capability that has so far been typical of humans only. In addition to the research significance of gaining such a capability, the success of these systems has widespread social, economic, and business implications, as well as many applications in today's digital and global world. Examples of such computerized text classification systems include systems that can identify the key topics expressed in a text, recognize whether a text is written in a way that can be perceived as offensive, or classify a text as formal or informal.

The most prominent results in natural language text classification in recent years have been achieved by employing supervised machine learning algorithms. These algorithms use large amounts of labeled training data to learn a model. Once training is completed, the model is expected to generalize beyond the inputs on which it was trained, and thus to allow inference on additional inputs that were not part of the training dataset. The acquisition process for these labeled datasets is labor-intensive, expensive, and time-consuming. This process is also prone to human error, which impairs the quality of both the dataset itself and the models trained on it.