Aspect-based opinion mining in online reviews

mag(2013)

引用 27|浏览8
暂无评分
摘要
Other people’s opinions are important piece of information for making informed decisions. Today the Web has become an excellent source of consumer opinions. However, as the volume of opinionated text is growing rapidly, it is getting impossible for users to read all reviews to make a good decision. Reading different and possibly even contradictory opinions written by different reviewers even make them more confused. In the same way, monitoring consumer opinions is getting harder for the manufactures and providers. These needs have inspired a new line of research on mining consumer reviews, or opinion mining. Aspect-based opinion mining, is a relatively new sub-problem that attracted a great deal of attention in the last few years. Extracted aspects and estimated ratings clearly provides more detailed information for users to make decisions and for suppliers to monitor their consumers. In this thesis, we address the problem of aspect-based opinion mining and seek novel methods to improve limitations and weaknesses of current techniques. We first propose a method, called Opinion Digger, that takes advantages of syntactic patterns to improve the accuracy of frequencybased techniques. We then move on to model-based approaches and propose an LDA-based model, called ILDA, to jointly extract aspects and estimate their ratings. In our next work, we compare ILDA with a series of increasingly sophisticated LDA models representing the essence of the major published methods in the literature. A comprehensive evaluation of these models indicates that while ILDA works best for items with large number of reviews, it performs poorly when the size of the training dataset is small, i.e., for cold start items. The cold start problem is critical as in real-life data sets around 90% of items are cold start. We address this problem in our last work and propose an LDA-based model, called FLDA. It models items and reviewers by a set of latent factors and learns them using reviews of an item category. Experimental results on real life data sets show that FLDA achieve significant gain for cold start items compared to the state-of-the-art models.
更多
查看译文
关键词
Cold start,Sentiment analysis,Set (psychology),Data science,Reading (process),Computer science,Work (electrical),Real life data
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要