POST-PROCESSING AND DIMENSIONALITY REDUCTION FOR EXTREME LEARNING MACHINE IN TEXT CLASSIFICATION

Maria Mihaela Trusca,Anamaria Aldea,Simona Elena Gradinaru,Crisan Albu

ECONOMIC COMPUTATION AND ECONOMIC CYBERNETICS STUDIES AND RESEARCH（2021）

引用 1|浏览1

暂无评分

摘要

Text classification is one of the core technologies of textual analysis, with interesting applications varying from sentiment classification, language identification to online abuse detection and many more. Many approaches have been taken to the machine learning classifiers employed in text categorization, with a focus on boosting model performance and efficiency. This paper proposes a new effective framework for the input processing of the Extreme Learning Machine (EML) algorithm, illustrated on the Reuters-21578 test collection of documents. We employ Glove word embeddings to provide a compact and semantic meaningful representation for each word of the input documents. Given the bi-dimensionality constraint of ELM inputs, we reduce the dimensionality of word embeddings using Latent Semantic Analysis and Principal Components Analysis (PCA). Our results reveal that PCA together with the post-processing operation led to more accurate results with lower computational costs.

查看译文

关键词

Extreme Learning Machine, Glove Word Embeddings, Post-processing Algorithm, Principal Component Analysis, Latent Semantic Analysis

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要