Automatic, Efficient and Scalable Provenance Registration for FAIR HPC Workflows

2022 IEEE/ACM Workshop on Workflows in Support of Large-Scale Science (WORKS)(2022)

引用 0|浏览13
暂无评分
摘要
Provenance registration is becoming more and more important, as we increase the size and number of experiments performed using computers. In particular, when provenance is recorded in HPC environments, it must be efficient and scalable. In this paper, we propose a provenance registration method for scientific workflows, efficient enough to run in supercomputers (thus, it could run in other environments with more relaxed restrictions, such as distributed ones). It also must be scalable in order to deal with large workflows, that are more typically used in HPC. We also target transparency for the user, shielding them from having to specify how provenance must be recorded. We implement our design using the COMPSs programming model as a Workflow Management System (WfMS) and use RO-Crate as a well-established specification to record and publish provenance. Experiments are provided, demonstrating the run time efficiency and scalability of our solution.
更多
查看译文
关键词
Provenance,Reproducibility,Replicability,Scientific Workflows,FAIR,High Performance Computing,COMPSs,RO-Crate
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要