首页> 外文会议>Mexican International Conference on Artificial Intelligence(MICAI 2007); 20071104-10; Aguascalientes(MX) >Spoken Commands in a Smart Home: An Iterative Approach to the Sphinx Algorithm
【24h】

Spoken Commands in a Smart Home: An Iterative Approach to the Sphinx Algorithm

机译:智能家居中的语音命令:Sphinx算法的迭代方法

获取原文
获取原文并翻译 | 示例

摘要

An algorithm for decoding commands spoken in an intelligent environment through iterative vocabulary reduction is presented. Current research in the field of speech recognition focuses primarily on the optimization of algorithms for single pass decoding using large vocabularies. While this is ideal for processing conversational speech, alternative methods should be explored for different domains of speech, specifically commands issued verbally in an intelligent environment. Such commands have both an explicitly defined structure and a vocabulary limited to valid task descriptions. We propose that a multiple pass context-driven decoding scheme utilizing dictionary pruning yields improved accuracy; this occurs when one deals with command structure and a reduced vocabulary. Each iteration incorporates the hypothesis of the previous into its decoding scheme by removing unlikely words from the current language model. We have applied this decoding method to a comprehensive set of spoken commands through the use of Sphinx-4, an Automatic Speech Recognition (ASR) engine using the Hidden Markov Model (HMM). When decoding via HMM, multiple previous states are used to determine the current state, thus utilizing context to aid in intelligent recognition. Our results show that within a fixed domain, multiple pass decoding yields recognition accuracy. Further research must be conducted to optimize practical context driven decoding and to apply the method to larger domains, primarily those of intelligent environments.
机译:提出了一种通过迭代词汇约简来解码智能环境中口头命令的算法。语音识别领域的当前研究主要集中在使用大词汇量的单遍解码算法的优化上。虽然这对于处理会话语音是理想的,但应针对语音的不同领域(尤其是在智能环境中以口头方式发出的命令)探索替代方法。这样的命令既有明确定义的结构,又有限制于有效任务描述的词汇。我们提出利用字典修剪的多遍上下文驱动解码方案可以提高准确性;当人们处理命令结构和减少词汇量时,就会发生这种情况。每次迭代通过从当前语言模型中删除不太可能的单词,将先前的假设纳入其解码方案。我们通过使用Sphinx-4(一种使用隐马尔可夫模型(HMM)的自动语音识别(ASR)引擎)将此解码方法应用于一组全面的语音命令。通过HMM解码时,使用多个先前状态来确定当前状态,从而利用上下文来帮助智能识别。我们的结果表明,在固定域内,多遍解码可产生识别精度。必须进行进一步的研究,以优化实际的上下文驱动的解码,并将该方法应用于更大的领域,主要是智能环境领域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号