On Metadata Quality in Sceiba, a Platform for Quality Control and Monitoring of Cuban Scientific Publications.

Eduardo Arencibia, Rafael Martinez,Yohannis Martí-Lahera,Marc Goovaerts

International Conference on Metadata and Semantics Research (MTSR)(2021)

引用 1|浏览4
暂无评分
摘要
It is introduced a platform for quality control and monitoring of Cuban scientific publications named Sceiba. To this end, it needs to collect scientific publications comprehensively at the national level. Metadata quality is crucial for Sceiba interoperability and development. This paper exposes how metadata quality is assured and enhanced in Sceiba. The metadata aggregation pipeline is worked out to collect, transform, store and exposemetadata on Persons, Organizations, Sources, and Scientific Publications. Raw data transformation into Sceiba's internal metadata models includes cleaning, disambiguation, deduplication, entity linking, validation, standardization, and enrichment using a semi-automated approach aligned with the findability, accessibility, interoperability, and reusability principles. To meet the requirements of metadata quality in Sceiba, a three-layer structure for metadata is used, including 1) discovery metadata, which allows the discovery of relevant scientific publications by browsing or query, 2) contextual metadata, which allows a) rich information on persons, organizations and other aspects associated with publications, b) interoperation among common metadata formats used in Current Research Information Systems, journals systems or Institutional Repositories; 3) detailed metadata, which is specific to the domain of scientific publication evaluation. The example provided shows how the metadata quality is improved in the Identification System for CubanResearch Organizations, one of Sceiba ' s component applications.
更多
查看译文
关键词
metadata quality,cuban scientific publications,quality control,sceiba
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要