首页> 外文会议>IEEE International Conference on Multimedia and Expo Workshops >ROBUST detection of infant crying in adverse environments using weighted segmental two-dimensional linear frequency cepstral coefficients
【24h】

ROBUST detection of infant crying in adverse environments using weighted segmental two-dimensional linear frequency cepstral coefficients

机译:使用加权节段二维线性频率谱系十大稳健检测不利环境中的婴儿哭泣

获取原文

摘要

This paper addresses the problem of automatically detecting infant crying sounds. Infant crying sounds show the distinct and regular time-frequency patterns that include a clear harmonic structure and a unique melody. Therefore, extracting appropriate features to properly represent these characteristics is important in achieving a good performance. In this paper, we propose weighted segment-based two-dimensional linear-frequency cepstral coefficients to characterize the time-frequency patterns within a long-range segment of the target signal. A Gaussian mixture model is adopted to statistically represent the crying and non-crying sounds, and test sounds are classified by using a likelihood ratio test. Evaluation of the proposed feature extraction method on a database of several hundred crying and non-crying sound clips yields an average equal error rate of 4.42% in various noisy environments, showing over 20% relative improvements compared to conventional feature extraction methods.
机译:本文解决了自动检测婴幼儿哭泣的问题。 婴儿哭泣声称显示了包括明确谐波结构和独特旋律的不同和规则的时频图案。 因此,提取适当的特征以正确代表这些特征在实现良好的性能方面是重要的。 在本文中,我们提出了基于加权的基于分段的二维线性频率谱系数,以表征目标信号的远程段内的时频模式。 采用高斯混合模型在统计上代表哭泣和非哭声,通过使用似然比测试来分类测试声音。 评估拟议的特征提取方法对数百个哭泣和非哭声的声音剪辑的数据库中的平均相等误差率为4.42%,与传统特征提取方法相比,相对改善超过20%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号