Investigation of Data Leakage in Deep-Learning-Based Blood Pressure Estimation Using Photoplethysmogram/Electrocardiogram

IEEE Sensors Journal(2023)

引用 0|浏览2
暂无评分
摘要
Cuff-less blood pressure (BP) estimation methods using a deep-learning model from photoplethysmogram (PPG) or electrocardiogram (ECG) have been actively studied in recent years. However, we found that most previous studies incur data leakage, where segments or records measured from the same subject appear in both the training and test datasets. Furthermore, many previous studies are suspected to have misinterpreted a record in the public dataset used for their evaluations as a subject. To investigate data leakage in BP estimation methods, this article first organizes previous studies in terms of data leakage. We then quantitatively evaluate the effect of a data leakage caused by the segment-level and the record-level train–test split using the public dataset, cuff-less BP estimation dataset. Our experimental results showed that the segment-level split and record-level split erroneously improved the estimation accuracy of mean BP from the (quasi-)subject-level split by the Pearson’s correlation coefficient of 0.56 and 0.40 when using PPG, and 0.82 and 0.69 when using ECG, respectively. These results confirmed that the train–test split used in many previous studies, including the one that described its evaluation as causing no data leakage, causes a high level of data leakage, and that a record in the cuff-less BP estimation dataset, often misinterpreted as a subject in previous studies, is not a subject and that a high level of data leakage occurs when the record is considered as a subject.
更多
查看译文
关键词
Blood pressure (BP),data leakage,deep learning,electrocardiogram (ECG),intersubject,intrasubject,photoplethysmogram (PPG)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要