Microblogs data management: a survey

The VLDB Journal (2019)

Abstract
Microblogs data is microlength user-generated data posted on the web, such as tweets, online reviews, and comments on news and social media. It has gained considerable attention in recent years due to its widespread popularity, rich content, and value in several societal applications. Microblogs applications now span a wide spectrum of interests, including targeted advertising, market reports, news delivery, political campaigns, rescue services, and public health. Consequently, major research efforts have been devoted to managing, analyzing, and visualizing microblogs to support different applications. This paper gives a comprehensive review of major research and system work in microblogs data management. The paper reviews the core components that enable large-scale querying and indexing of microblogs data. A dedicated part focuses on system-level issues and ongoing efforts to support microblogs in the rising wave of big data systems. In addition, we review the major research topics that exploit these core data management components to provide innovative and effective analysis and visualization for microblogs, such as event detection, recommendations, automatic geotagging, and user queries. Throughout the different parts, we highlight the challenges, innovations, and future opportunities in microblogs data research.
Keywords
Microblogs, Social media, Twitter, Data management, Systems, Indexing, Query processing, Memory management, Main-memory, Flushing policy, Data analysis, Visual analysis, Event, Event detection, Event analysis, Recommendation, Geotagging, Geo, Spatial, Temporal, Top-k, Textual, Keyword, User, Aggregation, Sampling, Clustering, Classification, Probabilistic models, Statistical, Graph, Summarization, Ranking