Temporal Spam Identification: A Multifaceted Approach To Identifying Review Spam

INTELLIGENT SYSTEMS AND APPLICATIONS, INTELLISYS, VOL 2(2019)

Abstract
Over the last decade, a variety of machine-learning techniques have been proposed for building spam identification models. However, most of these models depend entirely on the extracted features and perform more efficiently when used with large datasets. This paper proposes a temporal spam identification algorithm that uses time series to filter suspicious reviews from a Yelp review dataset. The algorithm then applies feature-engineering techniques to the reviews labelled as suspicious, combining behavioral and review-centric features with word and character n-grams. A support vector machine classifies the reviews as spam or ham. The proposed method can be used in real-time spam detection systems. A comparison with two other approaches indicates that the proposed algorithm achieves a higher accuracy (94%). It also narrows the search scope and reduces the heavy computation required for spam detection in large datasets.
Keywords
Spam, Time series, Review spam, SVM
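
The abstract describes two steps that a small example can make concrete: using a time series of reviews to narrow the search to suspicious ones, and extracting word and character n-grams from them. The sketch below is only an illustration under stated assumptions, not the paper's method. In particular, the burst criterion (a z-score threshold on daily review counts), the function names, and the default parameters are all hypothetical, since the abstract does not specify them. The behavioral features and the support vector machine are omitted.

```python
from collections import Counter
from datetime import date
from statistics import mean, pstdev


def suspicious_days(review_dates, z_threshold=2.0):
    """Return the set of days with an unusually high number of reviews.

    Assumption (not from the paper): a day is flagged when its review
    count exceeds the mean daily count by more than `z_threshold`
    population standard deviations. Only days that have at least one
    review enter the statistics.
    """
    counts = Counter(review_dates)      # the daily time series
    values = list(counts.values())
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:                      # perfectly flat series: no bursts
        return set()
    return {day for day, n in counts.items() if (n - mu) / sigma > z_threshold}


def char_ngrams(text, n=3):
    """Count overlapping character n-grams in a lowercased review."""
    text = text.lower()
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def word_ngrams(text, n=2):
    """Count overlapping word n-grams (as tuples) in a lowercased review."""
    words = text.lower().split()
    return Counter(tuple(words[i:i + n]) for i in range(len(words) - n + 1))


if __name__ == "__main__":
    # Hypothetical data: one review per day for ten days, then a burst.
    burst = date(2019, 1, 15)
    dates = [date(2019, 1, d) for d in range(1, 11)] + [burst] * 10
    print(suspicious_days(dates))
    print(word_ngrams("great food great food").most_common(1))
```

Only reviews posted on the flagged days would then go through feature extraction and classification, which is one way to read the abstract's claim that the temporal filter reduces the search scope on large datasets.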