Conceptual Formalization of Massive Storage for Advancing Decision-Making with Data Analytics.

CAiSE Forum(2023)

引用 0|浏览6
暂无评分
摘要
Data Lakes have been widely used to handle massive amounts of data arriving at high velocity and variety. However, if proper data management concerns are not addressed, this massive data storage can easily turn Data Lakes into Data Swamps. Furthermore, data must be associated with the data artefacts created to extract value from it, such as pipelines used to collect, treat, or process data and analytical artefacts such as analytical dashboards and machine learning models. This paper proposes a more comprehensive view of a Data Lake, in which all of these resources can be stored and managed. To that end, the conceptual meta-model incorporates a data catalog, data at various stages of maturity, pipelines, dashboards, and machine learning models. The proposed meta-model was instantiated in the ADM.IN (Advanced Decision Making in Productive Systems through Intelligent Networks) project, showing how vast amounts of data and their related artefacts can be managed to support decision-making processes with data analytics.
更多
查看译文
关键词
massive storage,conceptual formalization,decision-making decision-making,data
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要