...
首页> 外文期刊>Frontiers in Pediatrics >Development and Technical Validation of a Smartphone-Based Cry Detection Algorithm
【24h】

Development and Technical Validation of a Smartphone-Based Cry Detection Algorithm

机译:基于智能手机的呼声检测算法的开发和技术验证

获取原文
           

摘要

Introduction: The duration and frequency of crying of an infant can be indicative of its health. Manual tracking and labeling of crying is laborious, subjective, and sometimes inaccurate. The aim of this study was to develop and technically validate a smartphone-based algorithm able to automatically detect crying. Methods: For the development of the algorithm a training dataset containing 897 5-s clips of crying infants and 1,263 clips of non-crying infants and common domestic sounds was assembled from various online sources. OpenSMILE software was used to extract 1,591 audio features per audio clip. A random forest classifying algorithm was fitted to identify crying from non-crying in each audio clip. For the validation of the algorithm, an independent dataset consisting of real-life recordings of 15 infants was used. A 29-min audio clip was analyzed repeatedly and under differing circumstances to determine the intra- and inter- device repeatability and robustness of the algorithm. Results: The algorithm obtained an accuracy of 94% in the training dataset and 99% in the validation dataset. The sensitivity in the validation dataset was 83%, with a specificity of 99% and a positive- and negative predictive value of 75 and 100%, respectively. Reliability of the algorithm appeared to be robust within- and across devices, and the performance was robust to distance from the sound source and barriers between the sound source and the microphone. Conclusion: The algorithm was accurate in detecting cry duration and was robust to various changes in ambient settings.
机译:简介:婴儿哭泣的持续时间和频率可以表明其健康。手动跟踪和哭泣的标签是费力的,主观的,有时候是不准确的。本研究的目的是开发和技术上,验证能够自动检测哭泣的智能手机的算法。方法:为了开发算法,培训数据集包含897个哭婴儿的897个5-S剪辑,并从各种在线来源中组装了1,263个非哭婴儿剪辑和普通的国内声音。 OpenSmile软件用于提取每个音频剪辑的1,591个音频功能。装配随机森林分类算法,以识别在每个音频剪辑中从非哭泣的哭泣。对于算法的验证,使用了由15个婴儿的现实历史记录组成的独立数据集。重复分析29分钟的音频剪辑,并在不同的情况下进行分析,以确定算法的内部和内部可重复性和鲁棒性。结果:算法在训练数据集中获得了94%的精度,验证数据集中的99%。验证数据集中的灵敏度为83%,特异性为99%,积极和负预测值分别为75%和100%。算法的可靠性似乎是在设备内部和跨设备内的稳健性,并且性能对于从声源和麦克风之间的声源和屏障之间的距离是强大的。结论:算法在检测呼叫持续时间方面是准确的,并且对环境设置的各种变化具有稳健性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号