MSDN: A Multistage Deep Network for Heart-Rate Estimation From Facial Videos

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT(2023)

Abstract
Noncontact heart-rate (HR) measurement is an important trend in clinical medicine. Recently, a variety of deep networks have been applied to estimate HR from facial videos. However, because of limited data resources and poor parameter optimization, few existing models perform well in complicated scenarios, such as those with illumination changes, different skin tones, and facial motion. To address these challenges, this article proposes a novel multistage deep network (MSDN) that distributes the learnable parameters across different stages, reducing the difficulty of learning through multiple training steps. Specifically, the proposed network consists of three stages connected end to end. In the first stage, an HR-aware feature extractor uses ConvNeXt, embedded with a newly designed bandpass filter, as its backbone to extract spatiotemporal features for determining HR changes. Moreover, pseudolabels are generated to make the features robust to variations in illumination, motion, and color. In the second stage, several modules, including singular value decomposition (SVD) pooling and enhanced difference convolution (EDC) modules, are designed and combined with a transformer encoder to construct a feature-compressed remote photoplethysmography (rPPG) generator. In the third stage, a newly designed HR estimator with an interbeat interval (IBI) analyzer and a 1-D filter performs HR estimation. Extensive experiments are performed on three publicly available databases (VIPL-HR, COHFACE, and PURE), and ablation studies and comparisons with state-of-the-art (SOTA) methods demonstrate the effectiveness of the proposed method.
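The third stage derives HR from interbeat intervals in the generated rPPG signal. The abstract does not specify how the IBI analyzer or the 1-D filter work, so the following is only a minimal sketch of the general idea, not the authors' method: a centered moving average stands in for the 1-D filter, peaks are found by a simple local-maximum rule, and HR is taken as 60 divided by the median IBI. All function names, the window size, and the 240 bpm upper bound are assumptions made for illustration, and the input is a synthetic sinusoid rather than a real rPPG trace.

```python
import math
from statistics import median


def moving_average(signal, window):
    """Smooth with a centered moving average (a stand-in for a 1-D filter)."""
    half = window // 2
    out = []
    for i in range(len(signal)):
        lo = max(0, i - half)
        hi = min(len(signal), i + half + 1)
        out.append(sum(signal[lo:hi]) / (hi - lo))
    return out


def find_peaks(signal, min_distance):
    """Return indices of local maxima at least min_distance samples apart."""
    peaks = []
    for i in range(1, len(signal) - 1):
        if signal[i - 1] < signal[i] >= signal[i + 1]:
            if not peaks or i - peaks[-1] >= min_distance:
                peaks.append(i)
    return peaks


def hr_from_ibi(signal, fps, max_bpm=240.0):
    """Estimate HR in beats per minute from the median interbeat interval."""
    smoothed = moving_average(signal, window=5)
    # Two beats cannot be closer than one period of the fastest plausible HR.
    min_distance = max(1, int(fps * 60.0 / max_bpm))
    peaks = find_peaks(smoothed, min_distance)
    if len(peaks) < 2:
        raise ValueError("need at least two peaks to compute an IBI")
    # Interbeat intervals in seconds between consecutive peaks.
    ibis = [(b - a) / fps for a, b in zip(peaks, peaks[1:])]
    return 60.0 / median(ibis)


# Synthetic 10-second pulse signal at 72 bpm, sampled at 30 frames per second.
fps = 30.0
true_bpm = 72.0
n = int(10 * fps)
rppg = [math.sin(2 * math.pi * (true_bpm / 60.0) * t / fps) for t in range(n)]
estimated = hr_from_ibi(rppg, fps)
```

Using the median rather than the mean IBI keeps a single missed or spurious peak from shifting the estimate much, which matters for noisy signals recovered from video.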
Keywords
Feature extraction, Estimation, Heart rate, Videos, Training, Band-pass filters, Skin, Feature extractor, heart rate (HR) estimation, interbeat interval (IBI), multistage deep network (MSDN), remote photoplethysmography (rPPG) generator