首页> 外文OA文献 >Efficient error correction for speech systems using constrained re-recognition
【2h】

Efficient error correction for speech systems using constrained re-recognition

机译:使用约束重新识别的语音系统的有效纠错

摘要

Efficient error correction of recognition output is a major barrier in the adoption of speech interfaces. This thesis addresses this problem through a novel correction framework and user interface. The system uses constraints provided by the user to enhance re-recognition, correcting errors with minimal user effort and time. In our web interface, users listen to the recognized utterance, marking incorrect words as they hear them. After they have finished marking errors, they submit the edits back to the speech recognizer where it is merged with previous edits and then converted into a finite state transducer. This FST, modeling the regions of correct and incorrect words in the recognition output, is then composed with the recognizer's language model and the utterance is re-recognized. We explored the use of our error correction technique in both the lecture and restaurant domain, evaluating the types of errors and the correction performance in each domain. With our system, we have found significant improvements over other error correction techniques such as n-best lists, re-speaking or verbal corrections, and retyping in terms of actions per correction step, corrected output rate, and ease of use.
机译:识别输出的有效纠错是采用语音接口的主要障碍。本文通过一种新颖的校正框架和用户界面解决了这一问题。该系统使用用户提供的约束来增强重新识别,并以最少的用户工作量和时间纠正错误。在我们的Web界面中,用户聆听识别的语音,在听到不正确的单词时将其标记出来。完成标记错误后,他们将编辑提交回语音识别器,在此与先前的编辑合并,然后转换为有限状态转换器。然后,该FST对识别输出中正确和错误单词的区域进行建模,然后与识别器的语言模型一起构成,并重新识别话语。我们探索了我们的错误纠正技术在演讲和餐厅领域的使用,评估了错误的类型以及每个领域的纠正性能。通过我们的系统,我们发现与其他纠错技术相比,例如n-best列表,重新说出或口头的纠正,以及在每个纠正步骤的动作,纠正的输出率和易用性方面的重新键入,都有了显着的改进。

著录项

  • 作者

    Yu Gregory T;

  • 作者单位
  • 年度 2008
  • 总页数
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号