首页> 外文会议>IEEE International Conference on Multimedia and Expo Workshops >ROBUST detection of infant crying in adverse environments using weighted segmental two-dimensional linear frequency cepstral coefficients
【24h】

ROBUST detection of infant crying in adverse environments using weighted segmental two-dimensional linear frequency cepstral coefficients

机译:使用加权分段二维线性频率倒谱系数对不良环境中的婴儿哭声进行强健检测

获取原文

摘要

This paper addresses the problem of automatically detecting infant crying sounds. Infant crying sounds show the distinct and regular time-frequency patterns that include a clear harmonic structure and a unique melody. Therefore, extracting appropriate features to properly represent these characteristics is important in achieving a good performance. In this paper, we propose weighted segment-based two-dimensional linear-frequency cepstral coefficients to characterize the time-frequency patterns within a long-range segment of the target signal. A Gaussian mixture model is adopted to statistically represent the crying and non-crying sounds, and test sounds are classified by using a likelihood ratio test. Evaluation of the proposed feature extraction method on a database of several hundred crying and non-crying sound clips yields an average equal error rate of 4.42% in various noisy environments, showing over 20% relative improvements compared to conventional feature extraction methods.
机译:本文解决了自动检测婴儿哭声的问题。婴儿的哭声显示出独特而规则的时频模式,包括清晰的谐波结构和独特的旋律。因此,提取适当的特征以正确表示这些特征对于获得良好的性能很重要。在本文中,我们提出了基于加权段的二维线性频率倒谱系数,以表征目标信号远段内的时频模式。采用高斯混合模型统计表示哭声和非哭声,并通过似然比检验对测试声音进行分类。在数百个哭声和非哭声片段的数据库上对提出的特征提取方法进行评估,在各种嘈杂环境中产生的平均均等错误率为4.42%,与传统特征提取方法相比,显示出超过20%的相对改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号