Trend analysis of categorical data streams with a concept change method

Fuyuan Cao, Joshua Zhexue Huang,Jiye Liang

Information Sciences: an International Journal(2014)

引用 19|浏览23
暂无评分
摘要
This paper proposes a new method to trend analysis of categorical data streams. A data stream is partitioned into a sequence of time windows and the records in each window are assumed to carry a number of concepts represented as clusters. A data labeling algorithm is proposed to identify the concepts or clusters of a window from the concepts of the preceding window. The expression of a concept is presented and the distance between two concepts in two consecutive windows is defined to analyze the change of concepts in consecutive windows. Finally, a trend analysis algorithm is proposed to compute the trend of concept change in a data stream over the sequence of consecutive time windows. The methods for measuring the significance of an attribute that causes the concept change and the outlier degrees of objects are presented to reveal the causes of concept change. Experiments on real data sets are presented to demonstrate the benefits of the trend analysis method.
更多
查看译文
关键词
CLUSTERING DATA STREAMS,EVOLVING DATA,FRAMEWORK,ALGORITHM,SETS
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要