Building a Data Warehouse for Twitter Stream Exploration

Advances in Social Networks Analysis and Mining(2012)

引用 52|浏览2
暂无评分
摘要
In the recent year Twitter has evolved into an extremely popular social network and has revolutionized the ways of interacting and exchanging information on the Internet. By making its public stream available through a set of APIs Twitter has triggered a wave of research initiatives aimed at analysis and knowledge discovery from the data about its users and their messaging activities. While most of the projects and tools are tailored towards solving specific tasks, we pursue a goal of providing an application in dependent and universal analytical platform for supporting any kind of analysis and knowledge discovery. We employ the well established data warehousing technology with its underlying multidimensional data model, ETL routine for loading and consolidating data from different sources, OLAP functionality for exploring the data and data mining tools for more sophisticated analysis. In this work we describe the process of transforming the original stream into a set of related multidimensional cubes and demonstrate how the resulting data warehouse can be used for solving a variety of analytical tasks. We expect our proposed approach to be applicable for analyzing the data of other social networks as well.
更多
查看译文
关键词
Internet,application program interfaces,data mining,data models,data warehouses,social networking (online),API Twitter,ETL routine,Internet,OLAP functionality,Twitter stream exploration,data consolidation,data loading,data mining tools,data warehouse,knowledge discovery,messaging activities,multidimensional cubes,multidimensional data model,social network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要