首页> 外文会议>International Conference on Awareness Science and Technology >Environmental sound recognition using time-frequency intersection patterns
【24h】

Environmental sound recognition using time-frequency intersection patterns

机译:使用时频交叉模式的环境声音识别

获取原文

摘要

Environmental sound recognition is an important function of robots and intelligent computer systems. In this research, we tried to use a multi-stage perceptron type neural network system for environmental sound recognition. The input data is the one-dimensional combination of instantaneous spectrum at power peak and the power pattern in time domain. Since for almost environmental sounds, their spectrum changes are not remarkable compared with speech or voice, the combination of power and frequency pattern will preserve the major features of environmental sounds but with drastically reduced data. Two experiments were conducted using an original database and a database created by the RWCP. The recognition rate for about 45 data kinds of environmental sound was about 92%. The merit of this method is the use of a one-dimensional input which combines the power pattern and the instantaneous spectrum of sound data. Comparing with the method using only instantaneous spectrum, the new method are sufficient for larger sound database and the recognition rate was increased about 12%. The results are also comparable with the methods of HMM, while those methods require 2-dimensional spectrum time series data and more complicated computation.
机译:环境声音识别是机器人和智能计算机系统的重要功能。在这项研究中,我们尝试使用多级的Perceptron型神经网络系统进行环境声音识别。输入数据是功率峰值处的瞬时频谱和时域中功率图案的一维组合。由于对于几乎环境声音,与语音或语音相比,它们的频谱变化并不显着,电力和频率模式的组合将保持环境声音的主要特征,但数据急剧减少。使用原始数据库和由RWCP创建的数据库进行了两个实验。约45个数据的识别率约为92%。该方法的优点是使用一维输入,其结合了功率模式和声音数据的瞬时频谱。与使用仅瞬时频谱的方法相比,新方法足以进行更大的声音数据库,识别率提高约12%。结果也与HMM的方法相当,而这些方法需要二维频谱时间序列数据和更复杂的计算。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号