A Preliminary Performance Comparison of Machine Learning Algorithms for Web Author Identification of Vietnamese Online Messages

2020 26th Conference of Open Innovations Association (FRUCT)(2020)

引用 1|浏览1
暂无评分
摘要
With the rapid development of the Internet and accompanying technologies, communication between people has become easier than ever. Email, news sites, social networking applications become an indispensable connection tool. However, the Internet is also a favorable environment for cybercriminals with malicious activities. Therefore, it is necessary to develop a method to determine which user is the author of the online message. There has been a lot of researches with different corpora and various languages. In this article, we propose an approach to identify the authors of online messages in Vietnamese based on machine learning algorithms. Algorithms used include Naïve Bayes, SVM, Random Forest, and Logistic Regression. The algorithm that has yielded the best results in most cases is Random Forest.
更多
查看译文
关键词
machine learning algorithms,Vietnamese online messages,social networking applications,Web author identification,naïve Bayes,SVM,random forest,logistic regression
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要