首页> 外文会议>International Conference on e-Business and Telecommunications >SPEECH/MUSIC DISCRIMINATION BASED ON WAVELETS FOR BROADCAST PROGRAMS
【24h】

SPEECH/MUSIC DISCRIMINATION BASED ON WAVELETS FOR BROADCAST PROGRAMS

机译:基于广播节目​​的小波的语音/音乐辨别

获取原文

摘要

The problem of speech/music discrimination is a challenging research problem which significantly impacts Automatic Speech Recognition (ASR) performance. This paper proposes new features for the Speech/Music discrimination task. We propose to use a decomposition of the audio signal based on wavelets, which allows a good analysis of non stationary signal like speech or music. We compute different energy types in each frequency band obtained from wavelet decomposition. Two class/non-class classifiers are used; one for speech/non-speech, one for music/non-music. On the broadcast test corpus, the proposed wavelet approach gives better results than the MFCC one. For instance, we have a significant relative improvements of the error rate of 39% for the speech/music discrimination task.
机译:语音/音乐歧视的问题是一个具有挑战性的研究问题,这显着影响了自动语音识别(ASR)性能。本文提出了语音/音乐歧视任务的新功能。我们建议使用基于小波的音频信号的分解,这允许良好地分析语音或音乐等非固定信号。我们计算从小波分解获得的每个频带中的不同能量类型。使用两个类/非类分类器;一个用于言语/非演讲,一个用于音乐/非音乐。在广播测试语料库上,所提出的小波方法提供比MFCC ON的更好结果。例如,对于语音/音乐歧视任务,我们的错误率为39%的错误相对改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号