Speech-music segmentation system for speech recognition

机译：用于语音识别的语音音乐分割系统

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Using posterior probability based features to segment an audio signal as speech and music has been commonly used method. In this study Hidden-Markov-Model (HMM) based acoustic models are used to calculate posterior probabilities. Acoustic Models includes states of context-independent phones as modeling unit. Entropy and dynamism are found using via the posterior probabilities and these values are used as feature for speech-music discrimination. An HMM based classifier that uses Viterbi decoding is implemented and using discriminative features, audio signals are segmented as speech and music. As a result of the tests, it was found that applied speech-music segmentation method decreases Word-Error-Rate and increases the speed of recognition.

机译：使用基于后验概率的特征来分割音频信号作为语音和音乐已经是常用的方法。在这项研究中，基于隐马尔可夫模型（HMM）的声学模型用于计算后验概率。声学模型包括上下文无关电话的状态作为建模单元。通过后验概率发现熵和动力，这些值被用作语音-音乐辨别的特征。实现了使用维特比解码的基于HMM的分类器，并使用区分功能将音频信号分割为语音和音乐。作为测试的结果，发现应用语音-音乐分割方法降低了字错误率并提高了识别速度。

著录项

来源
《Signal Processing and Communications Applications Conference, 2009. SIU 2009》|2009年|624-627|共4页
会议地点 Antalya(TR);Antalya(TR)
作者
Demir C.; Dogan M.U.;
展开▼
作者单位

TUBITAK-UEKAE (Turkiye Bilim ve Teknoloji Arastirma Kurumu-Ulusal Elektron. ve Kriptoloji Arastirma Enstitusu), Kocaeli, Turkey;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Viterbi decoding; audio signal processing; hidden Markov models; music; speech recognition; acoustic model; audio signal; context independent phone; entropy method; hidden Markov model; posterior probability; speech music segmentation system;

机译：维特比解码;音频信号处理;隐马尔可夫模型;音乐;语音识别;声学模型;音频信号;上下文无关电话;熵方法;隐马尔可夫模型;后验概率;语音音乐分割系统;

相似文献

外文文献
中文文献
专利

1. Automatic speech segmentation in syllable centric speech recognition system [J] . Soumya Priyadarsini Panda, Ajit Kumar Nayak International journal of speech technology . 2016,第1期

机译：音节中心语音识别系统中的自动语音分割
2. The effectiveness of Speech-Music Therapy for Aphasia (SMTA) in five speakers with Apraxia of Speech and aphasia [J] . Hurkmans Joost, Jonkers Roel, de Bruijn Madeleen, Aphasiology . 2015,第7a9期

机译：失语言语音乐疗法（SMTA）在五位言语失语症和失语症患者中的有效性
3. Brain-inspired speech segmentation for automatic speech recognition using the speech envelope as a temporal reference [J] . Byeongwook Lee, Kwang-Hyun Cho Scientific reports. . 2016,第1期

机译：以语音包络作为时间参考的自动语音识别的大脑启发式语音分割
4. Analysis of effect of single-channel speech-music separation using NMF to automatic speech recognition [C] . Demir Cemil, Cemgil A.Taylan, Saraclar Murat Signal Processing and Communications Applications Conference . 2014

机译：使用NMF的单通道语音音乐分离对自动语音识别的影响分析
5. An automatic speech recognition oriented study on segmentation, low dimensional feature extraction, and temporal trajectory information capture. [D] . Zhu, Yonggang. 2002

机译：面向语音识别的自动研究，涉及分割，低维特征提取和时间轨迹信息捕获。
6. Brain-inspired speech segmentation for automatic speech recognition using the speech envelope as a temporal reference [O] . Byeongwook Lee, Kwang-Hyun Cho -1

机译：以语音包络作为时间参考的自动语音识别的大脑启发式语音分割
7. Catalog-Based Single-Channel Speech-Music Separation for Automatic Speech Recognition [O] . Cemgil Ali Taylan, Demir Cemil, Saraclar Murat 2011

机译：基于目录的单通道语音-音乐分离，用于自动语音识别
8. Simulation and Evaluation of Phonetic Speech Recognition Techniques. Volume II. Segmentation of Continuous Speech into Phonemes [R] . Otten, K. W. 1964

机译：语音识别技术的仿真与评估。第二卷。将连续语音分割成音素

Speech-music segmentation system for speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅