Integrating Stress Information in Large Vocabulary Continuous Speech Recognition



Abstract

In this paper we propose a novel method for integrating stress information in the decoding step of a speech recognizer. A multiscale rhythm model was used to determine a stress score for each syllable, and these scores are then used to reinforce paths during search. Two strategies for integrating the stress were employed: the first reinforces paths through all syllables with a value proportional to their stress score, while the second enhances only paths passing through stressed syllables, but with a constant value. The former strategy slightly outperforms the latter, bringing a relative improvement of more than 2% over the baseline. Furthermore, the stress information proved to be a robust feature, performing well even for foreign-accented speech.
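The two strategies in the abstract can be illustrated with a minimal sketch. The abstract does not say how stress scores are combined with decoder scores, so everything concrete here is an assumption: the additive log-domain bonus, the `weight`, `threshold`, and `bonus` parameters, and the function names are all hypothetical. The sketch also rescores complete hypotheses for simplicity, whereas the paper reinforces paths during search.

```python
from typing import Callable, List, Sequence, Tuple

# A hypothesis is (base_score, stress_scores): a log-domain decoder score plus
# one stress score in [0, 1] per syllable on the path. Both are hypothetical.
Hypothesis = Tuple[float, Sequence[float]]


def proportional_bonus(stress_scores: Sequence[float], weight: float = 1.0) -> float:
    """Strategy 1: every syllable adds a bonus proportional to its stress score."""
    return weight * sum(stress_scores)


def constant_bonus(stress_scores: Sequence[float],
                   threshold: float = 0.5, bonus: float = 1.0) -> float:
    """Strategy 2: only syllables judged stressed add a bonus, and it is constant."""
    return bonus * sum(1 for s in stress_scores if s >= threshold)


def best_hypothesis(hypotheses: List[Hypothesis],
                    bonus_fn: Callable[[Sequence[float]], float]) -> int:
    """Return the index of the hypothesis with the highest reinforced score."""
    scores = [base + bonus_fn(stress) for base, stress in hypotheses]
    return max(range(len(scores)), key=scores.__getitem__)


# Two made-up competing hypotheses: the second has the better base score,
# the first passes through a clearly stressed syllable.
hypotheses = [(-10.0, [0.9, 0.1]), (-9.8, [0.2, 0.1])]
```

With these made-up numbers, calling `best_hypothesis(hypotheses, proportional_bonus)` or `best_hypothesis(hypotheses, constant_bonus)` selects the first hypothesis, while ranking on the base scores alone would select the second.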


Bibliographic details

Source
Annual conference of the International Speech Communication Association, 2012, pp. 2641-2644 (4 pages)
Authors
Bogdan Ludusan; Stefan Ziegler; Guillaume Gravier
Original format: PDF
Keywords
speech recognition; stress; rhythm

Similar literature

1. An improved two-stage mixed language model approach for handling out-of-vocabulary words in large vocabulary continuous speech recognition [J]. Bert Reveil, Kris Demuynck, Jean-Pierre Martens. Computer speech and language. 2014, No. 1
2. Effect of Vocabulary Extension using Word Sequence Concatenation for Large Vocabulary Continuous Speech Recognition [J]. Yosuke Wada, Norihiko Kobayashi, Yuichiro Nakano. 情報処理学会論文誌. 1999, No. 4
3. Acoustic-Phonetic Approaches for Improving Segment-Based Speech Recognition for Large Vocabulary Continuous Speech [J]. Krerksak Likitsupin, Proadpran Punyabukkana, Chai Wutiwiwatchai. Engineering journal. 2016, No. 2
4. Integrating Stress Information in Large Vocabulary Continuous Speech Recognition [C]. Bogdan Ludusan, Stefan Ziegler, Guillaume Gravier. INTERSPEECH 2012. 2012
5. An Error Detection and Correction Framework to Improve Large Vocabulary Continuous Speech Recognition [D]. Zhou, Zhengyu. 2009
6. The integration of a continuous-speech-recognition system with the QMR diagnostic program [O]. S. Shiffman, C. D. Lane, K. B. Johnson. 1992
7. Integrating a Non-Probabilistic Grammar into Large Vocabulary Continuous Speech Recognition [O]. 2008
