Empirical Study of Filtered-Based Feature Selection Methods for Arabic Text Classification

2023 6th World Symposium on Communication Engineering (WSCE)(2023)

引用 0|浏览1
暂无评分
摘要
Text classification has many applications in various fields; such as news categorization, sentiment analysis, E-mail spam filtering, and others. However, handling textual data is a challenging task owing to the potentially massive number of features (words). The presence of redundant irrelevant features deteriorates the performance of a learning algorithm and makes the process of text classification more complex. This research conducts a comparison study of several filtering-based feature se-lection methods in the context of Arabic text classification. Arabic is a highly complex language syntactically and morphologically which leads to more complicated learning tasks. Proposing a ro-bust classification model is demanding. Remarkably, integrating filtering approaches results in significant improvements in the performance of classification algorithms.
更多
查看译文
关键词
Feature Selection,Text classification,Chi Square,Mutual Information,Maximum Features per Document,Maximum Features per Document Reduced
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要