Multi-input CNN for Text Classification in Commercial Scenarios.

ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2019, PT I (2019)

Abstract
In this work we describe a multi-input Convolutional Neural Network for text classification that combines text preprocessed at the word level, the byte pair encoding level and the character level. We apply the model to two practical use cases: (1) classifying ingredients into their corresponding classes, using a corpus provided by Northfork; and (2) classifying texts according to the English level of their writers, using a corpus provided by ProvenWord. Additionally, we run experiments on standard classification tasks using the Yahoo! Answers and GermEval2017 task A datasets. For each dataset we compare the results with those of other classifiers, including state-of-the-art approaches, and find that the architecture obtains satisfactory and very promising results on these corpora.
Keywords
Text classification, Document classification, CNN, Multi-input network, Gastrofy, ProvenWord, Use case, Northfork, GermEval2017, Agglutinative language, Swedish, German
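
The abstract names three parallel inputs (word level, byte pair encoding level, character level) but gives no layer sizes or merge strategy. The following is only a minimal sketch of how such a multi-input CNN could be wired, in plain Python with untrained random weights. Everything specific in it is an assumption made for illustration: the toy BPE merge list, the embedding and filter sizes, the single convolution width, the concatenation of pooled branch features, and all function and class names.

```python
import math
import random

# Hypothetical toy BPE merges in priority order; a real system learns these from a corpus.
TOY_MERGES = [("s", "u"), ("su", "g"), ("a", "r")]

def bpe_segment(word, merges):
    """Apply BPE merges, in order, to one word's character sequence."""
    symbols = list(word)
    for left, right in merges:
        merged, i = [], 0
        while i < len(symbols):
            if i + 1 < len(symbols) and symbols[i] == left and symbols[i + 1] == right:
                merged.append(left + right)
                i += 2
            else:
                merged.append(symbols[i])
                i += 1
        symbols = merged
    return symbols

def three_views(text):
    """Return word-level, BPE-level and character-level token sequences for one text."""
    words = text.lower().split()
    bpe = [piece for w in words for piece in bpe_segment(w, TOY_MERGES)]
    return {"word": words, "bpe": bpe, "char": list(text.lower())}

def build_vocab(token_lists):
    """Assign ids starting at 2; id 0 is reserved for padding and id 1 for unknown tokens."""
    tokens = sorted({t for tokens in token_lists for t in tokens})
    return {t: i + 2 for i, t in enumerate(tokens)}

def encode(tokens, vocab, max_len):
    """Map tokens to integer ids, then truncate or pad with zeros to max_len."""
    ids = [vocab.get(t, 1) for t in tokens][:max_len]
    return ids + [0] * (max_len - len(ids))

class ConvBranch:
    """Embedding -> 1D convolution -> ReLU -> max-over-time pooling for one input."""
    def __init__(self, vocab_size, emb_dim, n_filters, width, rng):
        self.width = width
        self.emb = [[rng.uniform(-0.1, 0.1) for _ in range(emb_dim)]
                    for _ in range(vocab_size)]
        self.filters = [[rng.uniform(-0.1, 0.1) for _ in range(width * emb_dim)]
                        for _ in range(n_filters)]

    def forward(self, ids):
        vectors = [self.emb[i] for i in ids]
        features = []
        for f in self.filters:
            best = 0.0  # starting the max at 0 is equivalent to ReLU before pooling
            for start in range(len(vectors) - self.width + 1):
                window = [x for v in vectors[start:start + self.width] for x in v]
                best = max(best, sum(w * x for w, x in zip(f, window)))
            features.append(best)
        return features

def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

class MultiInputCNN:
    """One ConvBranch per input view; pooled features are concatenated (an assumption)."""
    def __init__(self, vocabs, max_lens, n_classes, seed=0):
        rng = random.Random(seed)
        self.vocabs, self.max_lens = vocabs, max_lens
        self.branches = {name: ConvBranch(len(v) + 2, 8, 4, 3, rng)
                         for name, v in vocabs.items()}
        self.out = [[rng.uniform(-0.1, 0.1) for _ in range(4 * len(vocabs))]
                    for _ in range(n_classes)]

    def predict_proba(self, text):
        views = three_views(text)
        merged = []
        for name, branch in self.branches.items():
            ids = encode(views[name], self.vocabs[name], self.max_lens[name])
            merged.extend(branch.forward(ids))
        return softmax([sum(w * x for w, x in zip(row, merged)) for row in self.out])

# Tiny made-up corpus, used only to build vocabularies for the demonstration.
corpus = ["brown sugar", "olive oil", "sea salt"]
views = [three_views(t) for t in corpus]
vocabs = {name: build_vocab([v[name] for v in views]) for name in ("word", "bpe", "char")}
max_lens = {"word": 4, "bpe": 8, "char": 16}
model = MultiInputCNN(vocabs, max_lens, n_classes=3)
probs = model.predict_proba("brown sugar")  # untrained, so the values carry no meaning
```

Keeping one branch per granularity lets each input use its own vocabulary and sequence length, which is one plausible reason to prefer a multi-input design over a single mixed token stream. A practical version would use a deep learning framework and train the weights.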