Dealing with Out-of vocabulary Words and Filled Pauses in Word N-gram Based Speech Recognition System

ATSUHIKO KAI; YOSHIFUMI HIROSE; SEIICHI NAKAGAWA

首页> 外文期刊>情報処理学会論文誌 >Dealing with Out-of vocabulary Words and Filled Pauses in Word N-gram Based Speech Recognition System

【24h】

Dealing with Out-of vocabulary Words and Filled Pauses in Word N-gram Based Speech Recognition System

机译：基于单词N-gram的语音识别系统处理词汇外单词和填充的暂停

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

For practical use of spoken dialog systems and dictation systems, it is important to cope with out-of vocabulary words and filled pauses including the phenomena such as interjection, restart and hesitation. To address these problems, this study tries to use an unknown-word processing (UWP) method for a word N-gram language model based continuous speech recognition system. We investigate an UWP method which employs a subword sequence decoder with subword acoustic models to produce unknown-word hypotheses. This method has been shown to be effective on a small vocabulary task tested with a context-free grammar-based recognition system. This paper proposes an efficient method for incorporating the UWP into a word N-gram language model--based recognition system. We performed a series of experi- ments to show the effectiveness of the method for spoken dialog tasks and a dictation task. The experimental results show that a semantic accuracy was improved by 48/100 using the UWP method. Also, in compared with the result of a system using context-free grammar, the word N-gram based system could further improve the semantic accuracy for spontaneous speech. Furthermore, we performed a recognition experiment for a large-vocabulary dictation task. As a result, although only a slight improvement was observed in terms of the word accuracy, the high performance for detecting the existence of unknown-word in an utterance could be achieved.

机译：为了实际使用口语对话系统和听写系统，应对词汇量过大和停顿的情况（包括诸如插入，重启和犹豫等现象）非常重要。为了解决这些问题，本研究尝试将未知词处理（UWP）方法用于基于词N-gram语言模型的连续语音识别系统。我们研究了一种UWP方法，该方法使用带有子词声学模型的子词序列解码器来产生未知词假设。该方法已证明对使用基于上下文的无语法识别系统测试的小词汇量任务有效。本文提出了一种将UWP纳入基于单词N-gram语言模型的识别系统的有效方法。我们进行了一系列实验，以证明该方法对口语对话任务和听写任务的有效性。实验结果表明，使用UWP方法可以使语义准确性提高48/100。而且，与使用无上下文语法的系统的结果相比，基于单词N-gram的系统可以进一步提高自发语音的语义准确性。此外，我们对大型词汇听写任务进行了识别实验。结果，尽管在单词准确度方面仅观察到了轻微的改善，但是可以实现用于检测发声中未知单词的存在的高性能。

著录项

来源
《情報処理学会論文誌》 |1999年第4期|p.1383-1394|共12页
作者
ATSUHIKO KAI; YOSHIFUMI HIROSE; SEIICHI NAKAGAWA;
展开▼
作者单位

展开▼
收录信息美国《科学引文索引》(SCI);
原文格式 PDF
正文语种 jpn
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Vocabulary independent discriminative utterance verification for nonkeyword rejection in subword based speech recognition [J] . Sukkar R.A., Chin-Hui Lee IEEE Transactions on Speech and Audio Proceeding . 1996,第6期

机译：基于单词的语音识别中非关键词拒绝的词汇无关判别发声验证
2. Vocabulary independent discriminative utterance verification fornonkeyword rejection in subword based speech recognition [J] . Sukkar R.A., Chin-Hui Lee IEEE Transactions on Speech and Audio Proceessing . 1996,第6期

机译：基于子词的语音识别中非关键词拒绝的词汇无关判别发声验证
3. A Russian Keyword Spotting System Based on Large Vocabulary Continuous Speech Recognition and Linguistic Knowledge [J] . Valentin Smirnov, Dmitry Ignatov, Michael Gusev, Journal of electrical and computer engineering . 2016,第PTa2期

机译：基于大词汇量连续语音识别和语言知识的俄语关键词点播系统
4. Using sub-word n-gram models for dealing with OOV in large vocabulary speech recognition for Latvian [C] . Askars Salimbajevs, Jevgenijs Strigins 20th Nordic Conference of Computational Linguistics . 2015

机译：使用子词n元语法模型处理拉脱维亚语的大词汇量语音识别中的OOV
5. Learning Out-of-Vocabulary Words in Automatic Speech Recognition. [D] . Qin, Long. 2013

机译：在自动语音识别中学习词汇外单词。
6. Words prediction based on N-gram model for free-text entry in electronic health records [O] . Azita Yazdani, Reza Safdari, Ali Golkar, 2019

机译：基于N-GRAM模型的电子健康记录中自由文本输入的单词预测
7. Transcription Of Out-Of-Vocabulary Words In Large Vocabulary Speech Recognition Based On Phoneme-To-Grapheme Conversion [O] . Bart Decadt, Jacques Duchateau, Walter Daelemans, 2002

机译：基于音素到音素转换的大词汇量语音识别中词汇外单词的转录
8. A Real-time, Small-vocabulary, Connected-Word Speech Recognition System [R] . Yilmaz, H., Beninghof, W., Ferber, L., 1972

机译：一种实时，小词汇，连接词语音识别系统

Dealing with Out-of vocabulary Words and Filled Pauses in Word N-gram Based Speech Recognition System

摘要

著录项

相似文献

相关主题

期刊订阅