首页> 外文期刊>情報処理学会論文誌 >Dealing with Out-of vocabulary Words and Filled Pauses in Word N-gram Based Speech Recognition System
【24h】

Dealing with Out-of vocabulary Words and Filled Pauses in Word N-gram Based Speech Recognition System

机译:基于单词N-gram的语音识别系统处理词汇外单词和填充的暂停

获取原文
获取原文并翻译 | 示例
           

摘要

For practical use of spoken dialog systems and dictation systems, it is important to cope with out-of vocabulary words and filled pauses including the phenomena such as interjection, restart and hesitation. To address these problems, this study tries to use an unknown-word processing (UWP) method for a word N-gram language model based continuous speech recognition system. We investigate an UWP method which employs a subword sequence decoder with subword acoustic models to produce unknown-word hypotheses. This method has been shown to be effective on a small vocabulary task tested with a context-free grammar-based recognition system. This paper proposes an efficient method for incorporating the UWP into a word N-gram language model--based recognition system. We performed a series of experi- ments to show the effectiveness of the method for spoken dialog tasks and a dictation task. The experimental results show that a semantic accuracy was improved by 48/100 using the UWP method. Also, in compared with the result of a system using context-free grammar, the word N-gram based system could further improve the semantic accuracy for spontaneous speech. Furthermore, we performed a recognition experiment for a large-vocabulary dictation task. As a result, although only a slight improvement was observed in terms of the word accuracy, the high performance for detecting the existence of unknown-word in an utterance could be achieved.
机译:为了实际使用口语对话系统和听写系统,应对词汇量过大和停顿的情况(包括诸如插入,重启和犹豫等现象)非常重要。为了解决这些问题,本研究尝试将未知词处理(UWP)方法用于基于词N-gram语言模型的连续语音识别系统。我们研究了一种UWP方法,该方法使用带有子词声学模型的子词序列解码器来产生未知词假设。该方法已证明对使用基于上下文的无语法识别系统测试的小词汇量任务有效。本文提出了一种将UWP纳入基于单词N-gram语言模型的识别系统的有效方法。我们进行了一系列实验,以证明该方法对口语对话任务和听写任务的有效性。实验结果表明,使用UWP方法可以使语义准确性提高48/100。而且,与使用无上下文语法的系统的结果相比,基于单词N-gram的系统可以进一步提高自发语音的语义准确性。此外,我们对大型词汇听写任务进行了识别实验。结果,尽管在单词准确度方面仅观察到了轻微的改善,但是可以实现用于检测发声中未知单词的存在的高性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号