Finding and Exploring Rating Distributions (Technical Report)

arXiv: Data Structures and Algorithms (2016)

Abstract
Online rated datasets have become a source for large-scale population studies for analysts and a means for end-users to achieve routine tasks such as finding a book club. Existing systems, however, provide only limited insight into the opinions of different segments of the rater population. In this technical report, we assume that a segment, e.g., ⟨18-29 year old males in CA⟩, has a rating distribution in the form of a histogram that aggregates its ratings for a set of items (e.g., movies starring Russell Crowe), and we are interested in comparing this distribution with a given desired input distribution. We use the Earth Mover's Distance (EMD) to compare rating distributions, and we prove that finding segments whose rating distribution is close to the input distributions is NP-complete.
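
To make the comparison step concrete, the following is a minimal sketch of the Earth Mover's Distance between two rating histograms. It is not taken from the report: it assumes an ordinal rating scale (for example 1 to 5 stars), a ground distance of 1 between adjacent rating values, and histograms normalized to unit mass, in which case the EMD reduces to the sum of absolute differences between the two cumulative distributions. The function names `normalize` and `emd_1d` are hypothetical.

```python
from itertools import accumulate


def normalize(hist):
    """Turn raw rating counts into a probability distribution."""
    total = sum(hist)
    if total == 0:
        raise ValueError("histogram contains no ratings")
    return [count / total for count in hist]


def emd_1d(segment_hist, target_hist):
    """EMD between two histograms over the same ordered rating scale.

    Assumption: adjacent rating values are at ground distance 1, so the
    EMD equals the L1 distance between the cumulative distributions.
    """
    if len(segment_hist) != len(target_hist):
        raise ValueError("histograms must cover the same rating scale")
    cdf_p = accumulate(normalize(segment_hist))
    cdf_q = accumulate(normalize(target_hist))
    return sum(abs(a - b) for a, b in zip(cdf_p, cdf_q))


# Hypothetical counts on a 1-5 star scale: a segment that rates
# everything 5 stars versus a desired distribution of all 1-star ratings.
all_fives = [0, 0, 0, 0, 10]
all_ones = [10, 0, 0, 0, 0]
print(emd_1d(all_fives, all_ones))  # every unit of mass moves 4 bins
```

Under these assumptions a segment whose histogram matches the input distribution exactly has distance 0, and the distance grows with how far rating mass must be shifted along the scale. This covers only the distance computation; the hard part identified in the abstract is the search over segments.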