A machine learning-based sentiment analysis of online product reviews with a novel term weighting and feature selection approach

Information Processing & Management(2021)

引用 77|浏览40
暂无评分
摘要
Recently, online shopping has turned into a mainstream means for users to purchase as well as consume with the upsurge development of Internet technology. User satisfaction can be improved effectively by doing Sentiment Analysis (SA) of a large quantity of user reviews on e-commerce platforms. It is still challenging to envisage the accurate sentiment polarities of the user reviews because of the changes in sequence length, textual order, along with complicated logic. This paper proposes a new optimized Machine Learning (ML) algorithm called the Local Search Improvised Bat Algorithm based Elman Neural Network (LSIBA-ENN) for the SA of online product reviews. The proposed work of SA encompasses ‘4’ major steps: i) Data Collection (DC), ii) preprocessing, iii) Features Extraction (FE) or Term Weighting (TW), Feature Selection (FS), and polarity or Sentiment Classifications (SC). Initially, the Web Scrapping Tool (WST) is utilized to extract the customer reviews of the products for which the data is gathered as of the E-commerce websites. Next, preprocessing is carried out on the web scrap extracted data. Those preprocessed data go through TW and FS for additional processing by means of Log Term Frequency-based Modified Inverse Class Frequency (LTF-MICF) and Hybrid Mutation based Earth Warm Algorithm (HMEWA). Lastly, the HM-EWA data is rendered to the LSIBA-ENN, which classifies the customer reviews’ sentiment as positive, negative, and neutral. For the performance analysis of the proposed and prevailing classifiers, ‘2’ yardstick datasets are taken. The outcomes exhibit that the LSIBA-ENN attains the best performance in SC when weighted against the existing top-notch algorithms. The observations of the reviewer are exact. The prevailing ENN proffers recall of 87.79 when utilizing the proposed LTF-MICF scheme, whereas ENN only achieve 83.55, 84.03, 85.48, and 86.04 of recall whilst utilizing W2V, TF, TF-IDF, and TF-DFS schemes respectively.
更多
查看译文
关键词
Sentiment analysis,Polarity classification,Machine learning,Sentiment analysis of online product reviews,Term weighting,Elman Neural Network (ENN),Bat algorithm (BA)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要