Rare disease registries classification and characterization: a data mining approach.

Michele Santoro,Alessio Coi,Michele Lipucci Di Paola,Anna Maria Bianucci,Sabina Gainotti,Emanuela Mollo,Domenica Taruscio,Luciano Vittozzi,Fabrizio Bianchi

PUBLIC HEALTH GENOMICS（2015）

引用 20|浏览19

暂无评分

摘要

Background: The European Commission and Patients Organizations identify rare disease registries (RDRs) as strategic instruments to develop research and improve knowledge in the field of rare diseases. Interoperability between RDRs is needed for research activities, validation of therapeutic treatments, and public health actions. Sharing and comparing information requires a uniform and standardized way of data collection, so levels of interconnection between RDRs with similar aims and/or nature of data should be identified. The objective of this study is to define a classification and characterization of RDRs in order to identify different profiles and informative needs. Methods: Exploratory statistical analyses (cluster analysis and random forest) were applied to data derived from the EPIRARE project ('Building Consensus and Synergies for the EU Rare Disease Patient Registration') survey on the activities and needs of RDRs. Results: The cluster analysis identified 3 main typologies of RDRs: public health, clinical and genetic research, and treatment registries. The analysis of the most informative variables, identified by the random forest method, led to the characterization of 3 types of RDRs and the definition of different profiles and informative needs. Conclusions: These results represent a useful source of information to facilitate the harmonization and interconnection of RDRs in accordance with the different profiles identified. It could help sharing the information between RDRs with similar profiles and, whenever possible, interconnections between registries with different profiles. (C) 2015 S. Karger AG, Basel

查看译文

关键词

Cluster analysis,Data mining,Random forest,Rare disease registries

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要