An Efficient Outlier Mining Algorithm for Large Dataset

Information Management, Innovation Management and Industrial Engineering, 2008. ICIII '08. International Conference(2008)

引用 5|浏览0
暂无评分
摘要
Since an outlier often contains useful information, outlier detection is becoming a hot issue in data mining. Thus, an efficient outlier mining algorithm based on KNN is proposed in this paper. It can find outlier more accurately through defining a correlation matrix considering the importance and correlation between attributes. In addition, a data structure R-tree is used in the algorithm and it utilizes pruning scheme to drastically reduce the time consuming of computing. Experimental results show that our algorithm is more efficient than the traditional KNN algorithm. It will provide an effective solution for outlier mining in large dataset.
更多
查看译文
关键词
data mining,outlier detection,efficient outlier mining algorithm,effective solution,data structure r-tree,outlier mining,hot issue,large dataset,correlation matrix,traditional knn algorithm,sorting,algorithm design and analysis,tree data structures,correlation,clustering algorithms,data structure
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要