首页> 外文会议>2014 International Conference on Embedded Systems >Unsupervised environmental sound recognition
【24h】

Unsupervised environmental sound recognition

机译:无监督的环境声音识别

获取原文
获取原文并翻译 | 示例

摘要

Environmental sound recognition is an audio scene identification process to locate a person by analyzing the background sound. This paper deals with the prototype modeling of environmental sound recognition that is based on unsupervised learning. The unsupervised learning finds a hidden structure in a group of data given as input. There is no need of a label to which the input data belongs. So this could be used for the practical cases. Sound recognition involves the collection of audio data, extraction of significant features and finding a common structure between them, thus leading to grouping of the data. The Mel frequency cepstrum coefficients are extracted. These features are used for clustering by a Gaussian mixture model which is a probabilistic model. The clustering leads to the identification of the correct audio scene. The implementation is done with the help of MATLAB and ModelSim. Five major environmental sounds which include the sound of car, office, restaurant, street, subway are considered. The parameters of the Gaussian mixture model are estimated in the training phase. The model is tested with the inputs considering the parameters. The MATLAB implementation shows an efficiency of 98%. The hardware implementation of the same shows an efficiency of 96.4%.
机译:环境声音识别是通过分析背景声音来定位人的音频场景识别过程。本文讨论了基于无监督学习的环境声音识别原型模型。无监督学习在作为输入给出的一组数据中找到隐藏的结构。不需要输入数据所属的标签。因此,可以将其用于实际情况。声音识别涉及音频数据的收集,重要特征的提取以及在它们之间找到通用结构,从而导致数据分组。提取梅尔频率倒谱系数。这些特征通过高斯混合模型(一种概率模型)用于聚类。聚类导致正确音频场景的识别。该实现是在MATLAB和ModelSim的帮助下完成的。考虑了五种主要的环境声音,包括汽车,办公室,餐厅,街道,地铁的声音。在训练阶段估算高斯混合模型的参数。使用考虑参数的输入对模型进行测试。 MATLAB实现显示效率为98%。相同的硬件实现显示效率为96.4%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号