...
首页> 外文期刊>Biomedical signal processing and control >A novel one-stage framework for visual pulse rate estimation using deep neural networks
【24h】

A novel one-stage framework for visual pulse rate estimation using deep neural networks

机译:深度神经网络的视觉脉冲速率估计的一种新型单阶段框架

获取原文
获取原文并翻译 | 示例

摘要

Estimation of the visual pulse rate (also called heart rate) refers to extraction of the pulse rate from a facial video. With the studies on extracting photoplethysmography (PPG) signals from a facial video, the non-contacted measurement method has aroused great interest among researchers over the past few years. In this study, a novel one-stage spatio-temporal framework, namely PRnet, is proposed to estimate the pulse rate from a stationary facial video. First, visual pulse rate estimation is defined as a regression task based on deep neural networks, in which a video is mapped to a pulse rate value. Then, 3D convolutional neural networks (Conv3D) and Long short-term memory (LSTM) modules are used to extract spatial and latent temporal information that is hidden in a video. Subsequently, one fully connected layer is applied in the last layer of PRnet to estimate the pulse rate directly. Based on the exquisite framework design, our proposed method realizes competitive performance, especially in terms of processing latency, since it does not rely on power spectral density (PSD) and traditional Fast Fourier Transform (FFT) algorithms. Using our method, only 60 frames of video (2 s) are required for the robust prediction of the pulse rate, whereas 6-30 s of video are typically required for other methods. Finally, a novel visual pulse rate estimation database, which includes pulse rate range at various times of day, is collected to evaluate the proposed framework. The results of extensive experiments demonstrate that PRnet performs competitively while compared with state-of-the-art methods.
机译:视觉脉冲速率(也称为心率)的估计是指从面部视频提取脉冲速率。随着从面部视频中提取光电电敏感(PPG)信号的研究,未接触的测量方法在过去几年中对研究人员引起了极大的兴趣。在该研究中,提出了一种新颖的单级时空框架,即PRNET,以估计来自固定面部视频的脉搏率。首先,视觉脉冲速率估计被定义为基于深神经网络的回归任务,其中视频被映射到脉冲率值。然后,3D卷积神经网络(CONV3D)和长短短期存储器(LSTM)模块用于提取隐藏在视频中的空间和潜在的时间信息。随后,在PRNET的最后一层中应用一个完全连接的层以直接估计脉冲率。基于精湛的框架设计,我们提出的方法实现了竞争性能,尤其是在处理延迟方面,因为它不依赖于功率谱密度(PSD)和传统的快速傅里叶变换(FFT)算法。使用我们的方法,脉冲速率的稳健预测仅需要60帧视频(2秒),而其他方法通常需要6-30S视频。最后,收集了一种新的视觉脉冲速率估计数据库,其包括多次在一天中的脉冲率范围,以评估所提出的框架。广泛实验的结果表明,与最先进的方法相比,PRNET在竞争性上表现得很竞争。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号