Addressing Bias in Preterm Birth Research: The Role of Advanced Imputation Techniques for Missing Race and Ethnicity in Perinatal Health Data

Jihye Kim Scroggins, Ismael Ibrahim Hulchafo,Maxim Topaz,Kenrick Cato,Veronica Barcelona

Annals of Epidemiology(2024)

引用 0|浏览0
暂无评分
摘要
Objectives To evaluate the effectiveness of Bayesian Improved Surname Geocoding (BISG) and Bayesian Improved First Name Surname Geocoding (BIFSG) in estimating race and ethnicity, and how they influence odds ratios for preterm birth. Methods We analyzed hospital birth admission electronic health records (EHR) data (N=9,985). We created two simulation sets with 40% of race and ethnicity data missing randomly or more likely for non-Hispanic black birthing people who had preterm birth. We calculated C-statistics to evaluate how accurately BISG and BIFSG estimate race and ethnicity. We examined the association between race and ethnicity and preterm birth using logistic regression and reported odds ratios (OR). Results BISG and BIFSG showed high accuracy for most racial and ethnic categories (C-statistics=0.94-0.97, 95% confidence intervals [CI]=0.92-0.97). When race and ethnicity were not missing at random, BISG (OR=1.25, CI=0.97-1.62) and BIFSG (OR=1.38, CI=1.08-1.76) resulted in positive estimates mirroring the true association (OR=1.68, CI=1.34-2.09) for Non-Hispanic Black birthing people, while traditional methods showed contrasting estimates (Complete case OR=0.62, CI=0.41-0.94; multiple imputation OR=0.63, CI=0.40-0.98). Conclusions BISG and BIFSG accurately estimate missing race and ethnicity in perinatal EHR data, decreasing bias in preterm birth research, and are recommended over traditional methods to reduce potential bias.
更多
查看译文
关键词
Race and ethnicity,preterm birth,health disparities,missing data,electronic health records
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要