首页> 外文学位 >Run-time information fusion in large vocabulary continuous speech recognition.

【24h】

Run-time information fusion in large vocabulary continuous speech recognition.

机译：大词汇量连续语音识别中的运行时信息融合。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Continuous speech recognition systems are environmentally sensitive and suffer from the great variability of speech. In order to achieve recognition robustness, there's a strong interest among researchers on how to fuse different information sources for speech recognition. A common problem of those approaches is that complementary information is lost either before or after recognition.; To avoid this unrecoverable information loss, and to better utilize this complementary information, we proposed a run time information fusion scheme. The hypothesis of this thesis is that by performing fusion at different levels and stages of a Large Vocabulary Continuous Speech Recognition (LVCSR) system, especially inside the decoder, more reliable and efficient fusion is possible.; The hypothesis is first tested in a speech segmentation task, which is essential to the performance of an LVCSR system. Furthermore, three different approaches of run time fusion are proposed and implemented inside an LVCSR decoder. The experiments demonstrate the effectiveness and potential of these approaches.

机译：连续语音识别系统对环境敏感，并且遭受语音的巨大变化。为了实现识别的鲁棒性，研究人员对如何融合不同的信息源进行语音识别有着浓厚的兴趣。这些方法的一个普遍问题是互补信息在识别之前或之后都会丢失。为了避免这种不可恢复的信息丢失，并更好地利用此补充信息，我们提出了一种运行时信息融合方案。本文的假设是，通过在大型词汇连续语音识别（LVCSR）系统的不同级别和阶段执行融合，尤其是在解码器内部，可以实现更可靠，更有效的融合。该假设首先在语音分割任务中进行测试，这对于LVCSR系统的性能至关重要。此外，在LVCSR解码器内部提出并实现了三种不同的运行时融合方法。实验证明了这些方法的有效性和潜力。

著录项

作者
Zheng, Chengyi.;
展开▼
作者单位

OGI School of Science & Engineering.;

展开▼
授予单位 OGI School of Science & Engineering.;
学科 Computer Science.; Engineering Electronics and Electrical.; Information Science.; Artificial Intelligence.
学位 Ph.D.
年度 2004
页码 161 p.
总页数 161
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;无线电电子学、电信技术;信息与知识传播;人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. An improved two-stage mixed language model approach for handling out-of-vocabulary words in large vocabulary continuous speech recognition [J] . Bert Reveil, Kris Demuynck, Jean-Pierre Martens Computer speech and language . 2014,第1期

机译：一种改进的两阶段混合语言模型方法，用于处理大词汇量连续语音识别中的词汇外单词
2. Effect of Vocabulary Extension using Word Sequence Concatenation for Large Vocabulary Continuous Speech Recognition [J] . YOSUKE WADA, NORIHIKO KOBAYASHI, YUICHIRO NAKANO 情報処理学会論文誌 . 1999,第4期

机译：单词序列级联对词汇扩展对大词汇量连续语音识别的影响
3. Acoustic-Phonetic Approaches for Improving Segment-Based Speech Recognition for Large Vocabulary Continuous Speech [J] . Krerksak Likitsupin, Proadpran Punyabukkana, Chai Wutiwiwatchai, Engineering journal . 2016,第2期

机译：改进大词汇量连续语音基于片段的语音识别的声学方法
4. Model-based compensation of the additive noise for continuous speech recognition. Experiments using the AURORA II database and tasks [C] . J. C. Segura, A. de la Torre, M. C. Benitez, European conference on speech communication and technology . 2001

机译：基于模型的连续语音识别添加剂噪声补偿。使用Aurora II数据库和任务的实验
5. Modeling lexical tones for Mandarin large vocabulary continuous speech recognition. [D] . Lei, Xin. 2006

机译：为普通话大词汇量连续语音识别建模词汇声调。
6. State of the art in continuous speech recognition. [O] . J Makhoul, R Schwartz 1995

机译：连续语音识别的最新技术。
7. Linguistic constraints for large vocabulary speech recognition. [O] . 1999

机译：大词汇量语音识别的语言约束。
8. Vocabulary and Environment Adaptation in Vocabulary-Independent Speech Recognition. [R] . Hon, H., Lee, K. 1992

机译：词汇独立语音识别中的词汇与环境适应。

Run-time information fusion in large vocabulary continuous speech recognition.

摘要

著录项

相似文献

相关主题

期刊订阅