ClustKNN: A Highly Scalable Hybrid Model- & Memory-Based CF Algorithm

Knowledge Discovery and Data Mining(2006)

引用 195|浏览63
暂无评分
摘要
Collaborative Filtering (CF)-based recommender systems are indispensable tools to find items of interest from the unmanageable number of available items. Moreover, com- panies who deploy a CF-based recommender system may be able to increase revenue by drawing customers' attention to items that they are likely to buy. However, the sheer num- ber of customers and items typical in e-commerce systems demand specially designed CF algorithms that can grace- fully cope with the vast size of the data. Many algorithms proposed thus far, where the principal concern is recom- mendation quality, may be too expensive to operate in a large-scale system. We propose ClustKnn, a simple and intuitive algorithm that is well suited for large data sets. The method first compresses data tremendously by build- ing a straightforward but ecient clustering model. Rec- ommendations are then generated quickly by using a simple Nearest Neighbor-based approach. We demonstrate the feasibility of ClustKnn both analytically and empirically. We also show, by comparing with a number of other pop- ular CF algorithms that, apart from being highly scalable and intuitive, ClustKnn provides very good recommenda- tion accuracy as well.
更多
查看译文
关键词
personalization,machine learning,recommender sys- tems,collaborative filtering,data mining.,clustering,recommender systems,nearest neighbor,recommender system,data mining,e commerce
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要