首页> 外文会议>Mexican International Conference on Artificial Intelligence >Spoken Commands in a Smart Home: An Iterative Approach to the Sphinx Algorithm
【24h】

Spoken Commands in a Smart Home: An Iterative Approach to the Sphinx Algorithm

机译:智能家居中的口头命令:窥探狮身态算法的迭代方法

获取原文

摘要

An algorithm for decoding commands spoken in an intelligent environment through iterative vocabulary reduction is presented. Current research in the field of speech recognition focuses primarily on the optimization of algorithms for single pass decoding using large vocabularies. While this is ideal for processing conversational speech, alternative methods should be explored for different domains of speech, specifically commands issued verbally in an intelligent environment. Such commands have both an explicitly defined structure and a vocabulary limited to valid task descriptions. We propose that a multiple pass context-driven decoding scheme utilizing dictionary pruning yields improved accuracy; this occurs when one deals with command structure and a reduced vocabulary. Each iteration incorporates the hypothesis of the previous into its decoding scheme by removing unlikely words from the current language model. We have applied this decoding method to a comprehensive set of spoken commands through the use of Sphinx-4, an Automatic Speech Recognition (ASR) engine using the Hidden Markov Model (HMM). When decoding via HMM, multiple previous states are used to determine the current state, thus utilizing context to aid in intelligent recognition. Our results show that within a fixed domain, multiple pass decoding yields recognition accuracy. Further research must be conducted to optimize practical context driven decoding and to apply the method to larger domains, primarily those of intelligent environments.
机译:介绍了通过迭代词汇减少在智能环境中解码命令的解码命令算法。语音识别领域的当前研究主要侧重于使用大词汇表进行单通解码的算法的优化。虽然这是处理会话语音的理想选择,但应该为不同的语音域探索替代方法,特别是在智能环境中口头发出的命令。此类命令具有明确定义的结构和词汇,限制为有效的任务描述。我们建议利用字典修剪的多遍的上下文驱动的解码方案产生提高精度;当有一个涉及命令结构和减少的词汇表时发生这种情况。每次迭代通过从当前语言模型中删除不太可能的单词来结合前一个进入其解码方案的假设。我们已通过使用斯芬克斯-4,使用隐马尔可夫模型(HMM)的自动语音识别(ASR)引擎的应用这个解码方法,以一组全面口头命令。当通过HMM解码时,使用多个先前状态来确定当前状态,从而利用上下文来帮助智能识别。我们的结果表明,在固定域中,多次通过解码产生识别准确性。必须进行进一步的研究以优化实际的上下文驱动的解码,并将该方法应用于更大的域,主要是智能环境。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号