A Study on Different Text Representation Methods for the Negative Selection Algorithm.

Matheus A. Ferraria, Vinicius A. Ferraria,Leandro Nunes de Castro

DCAI(2022)

引用 0|浏览0
暂无评分
摘要
Unstructured data, such as text, usually have to be structured before standard machine learning classifiers are applied. In such cases, different representation schemes can be used, such as Bag of Words, the Linguistic Inquiry and Word Count (LIWC), Part-of-Speech Tagging (POS Tagging), and others. The Negative Selection Algorithm (NSA) was designed with inspiration in the immune system to solve binary classification problems, more specifically anomaly detection. This paper investigates the performance of various text representation schemes as input to the NSA. Three different datasets and text representation methods are used, and the results are presented in terms of Accuracy and False Positive Rate.
更多
查看译文
关键词
Negative selection algorithm,BOW,LIWC,POS Tagging,Text representation,Binary classification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要