A robust fusion-extraction procedure with summary statistics in the presence of biased sources

arXiv (2023)

Abstract
Information from multiple data sources is increasingly available. However, some data sources may produce biased estimates due to biased sampling, data corruption or model misspecification. Thus there is a need for robust data combination methods that can be used with biased sources. In this paper, a robust data fusion-extraction method is proposed. Unlike existing methods, the proposed method can be applied in the important case where researchers have no knowledge of which data sources are unbiased. The proposed estimator is easy to compute and employs only summary statistics; hence it can be applied in many different fields, such as meta-analysis, Mendelian randomization and distributed systems. The proposed estimator is consistent, even if many data sources are biased, and is asymptotically equivalent to the oracle estimator that uses only unbiased data. Asymptotic normality of the proposed estimator is also established. In contrast to existing meta-analysis methods, the theoretical properties are guaranteed for our estimator, even if the number of data sources and the dimension of the parameter diverge as the sample size increases. Furthermore, the proposed method provides consistent selection for unbiased data sources with probability approaching 1. Simulation studies demonstrate the efficiency and robustness of the proposed method empirically. The method is applied to a meta-analysis dataset to evaluate surgical treatment for moderate periodontal disease and to a Mendelian randomization dataset to study the risk factors for head and neck cancer.
Keywords
Data fusion, Inverse-variance weighting, Mendelian randomization, Meta-analysis, Robust statistics
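
To make the setting in the abstract concrete, the following minimal sketch shows how a few biased sources can distort a plain inverse-variance weighted (IVW) combination of summary statistics, and how screening sources before combining can help. This is not the paper's estimator: the median-based screening rule, the threshold `z`, the function names `ivw` and `screened_ivw`, and the toy numbers are all assumptions made for illustration. Unlike the proposed method, this simple rule only works when most sources are unbiased.

```python
import statistics


def ivw(estimates, variances):
    """Inverse-variance weighted mean and its variance."""
    weights = [1.0 / v for v in variances]
    total = sum(weights)
    est = sum(w * e for w, e in zip(weights, estimates)) / total
    return est, 1.0 / total


def screened_ivw(estimates, variances, z=3.0):
    """Illustrative stand-in for a robust combination (hypothetical rule).

    Keeps sources whose estimate lies within z standard errors of the
    median estimate, then applies IVW to the retained sources.
    """
    center = statistics.median(estimates)
    keep = [
        i
        for i, (e, v) in enumerate(zip(estimates, variances))
        if abs(e - center) <= z * v ** 0.5
    ]
    est, var = ivw([estimates[i] for i in keep], [variances[i] for i in keep])
    return est, var, keep


# Toy summary statistics: four sources near 1.0 and two biased sources.
estimates = [1.02, 0.97, 1.05, 0.99, 2.50, 2.40]
variances = [0.01, 0.01, 0.01, 0.01, 0.01, 0.01]

naive, _ = ivw(estimates, variances)
robust, _, kept = screened_ivw(estimates, variances)
print("naive IVW:", naive)
print("screened IVW:", robust, "kept sources:", kept)
```

In this toy example the naive IVW estimate is pulled toward the two biased sources, while the screened estimate uses only the four sources that agree with the median. Only per-source estimates and variances are needed, which mirrors the summary-statistics setting described in the abstract.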