【24h】

Speech-music segmentation system for speech recognition

机译:用于语音识别的语音音乐分割系统

获取原文

摘要

Using posterior probability based features to segment an audio signal as speech and music has been commonly used method. In this study Hidden-Markov-Model (HMM) based acoustic models are used to calculate posterior probabilities. Acoustic Models includes states of context-independent phones as modeling unit. Entropy and dynamism are found using via the posterior probabilities and these values are used as feature for speech-music discrimination. An HMM based classifier that uses Viterbi decoding is implemented and using discriminative features, audio signals are segmented as speech and music. As a result of the tests, it was found that applied speech-music segmentation method decreases Word-Error-Rate and increases the speed of recognition.
机译:使用基于后验概率的特征来分割音频信号作为语音和音乐已经是常用的方法。在这项研究中,基于隐马尔可夫模型(HMM)的声学模型用于计算后验概率。声学模型包括上下文无关电话的状态作为建模单元。通过后验概率发现熵和动力,这些值被用作语音-音乐辨别的特征。实现了使用维特比解码的基于HMM的分类器,并使用区分功能将音频信号分割为语音和音乐。作为测试的结果,发现应用语音-音乐分割方法降低了字错误率并提高了识别速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号