Source integration for data warehousing
Multidimensional Databases(2003)
摘要
While the main goal of a data warehouse is to provide support for data analysis and management's decisions, a fundamental aspect in design of a data warehouse system is the process of acquiring the raw data from a set of relevant information sources. We will call source integration system the component of a data warehouse system dealing with this process. The main goal of a source integration system is to deal with the transfer of data from the set of sources constituting the application-oriented operational environment, to the data warehouse. Since sources are typically autonomous, distributed, and heterogeneous, this task has to deal with the problem of cleaning, reconciling, and integrating data coming from the sources. The design of a source integration system is a very complex task, which comprises several different issues. The purpose of this chapter is to discuss the most important problems arising in the design of a source integration system, with special emphasis on schema integration, processing queries for data integration, and data cleaning and reconciliation.
更多查看译文
关键词
data warehouse system,relevant information source,data analysis,data warehouse,main goal,data warehousing,schema integration,data integration,complex task,raw data,source integration system
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络