首页> 外文会议>Signal Processing and Communications Applications Conference >Recognition of non-speech sounds using Mel-frequency cepstrum coefficients and dynamic time warping method
【24h】

Recognition of non-speech sounds using Mel-frequency cepstrum coefficients and dynamic time warping method

机译:使用梅尔频率倒谱系数和动态时间扭曲方法识别非语音

获取原文

摘要

With the developing technology, speech recognition systems are getting more space in our daily lives. Sounds in our environment are not only pure speech. Because of this, it is important for cochlear implants, unmanned vehicles and security systems to be able to recognize other sounds. In this work, Mel-frequency cepstrum coefficients, one of the most widely used methods for feature extraction in speech recognition, applied to various nature and animal sounds. Because each sound does not have the same duration, dynamic time warping, one of the methods used in speech recognition, is preferred to classify the feature vectors. The difference in durations of sounds affects the lengths of the feature vectors. With dynamic time warping method, one can overcome these differences. One reference record and 10 test records obtained from 10 different sound sources. True classification rate is found as 88%.
机译:随着技术的发展,语音识别系统在我们的日常生活中获得了更多的空间。我们环境中的声音不仅是纯语音。因此,重要的是耳​​蜗植入物,无人驾驶车辆和安全系统能够识别其他声音。在这项工作中,梅尔频率倒谱系数是语音识别中特征提取最广泛使用的方法之一,它应用于各种自然和动物的声音。因为每种声音的持续时间都不相同,所以动态时间扭曲是语音识别中使用的一种方法,它是对特征向量进行分类的首选方法。声音持续时间的差异会影响特征向量的长度。使用动态时间规整方法,可以克服这些差异。从10种不同的声音源获得1条参考记录和10条测试记录。真实分类率为88%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号