首页> 外文会议> >Semantics synchronous understanding for robust spoken language applications
【24h】

Semantics synchronous understanding for robust spoken language applications

机译:健壮的口语应用程序的语义同步理解

获取原文

摘要

In this paper, we describe our recent effort in combining speech recognition and understanding into a single pass decoding process. The goal is to utilize the semantic structure not only to better handle disfluencies and improve the overall understanding accuracy, but also to shorten the response time and achieve higher interactivity. Three related techniques are instrumental in our approach. First, we employ the unified language model (ULM) to incorporate semantic schema into the recognition language model, and extend the search process from word synchronous to semantic object synchronous (SOS) decoding. Finally, we utilize sequential detection to defer, reject, or accept semantic hypotheses and execute consequent dialog actions while the user's utterance is ongoing. We incorporated these methods into SALT and HTML and conducted comparative user studies based on the MiPad scenarios. The experimental results show the system can gracefully cope with spontaneous speech and the users prefer the highly interactive nature of such systems even though there are no significant differences in the task completion rate and the understanding accuracy. However, the interactive interface does allow a more effective visual prompting strategy that contributes to the significantly lower out of grammar utterances.
机译:在本文中,我们描述了我们最近在将语音识别和理解结合到单遍解码过程中的工作。目标是利用语义结构不仅可以更好地处理歧义和提高整体理解的准确性,而且可以缩短响应时间并实现更高的交互性。三种相关技术对我们的方法至关重要。首先,我们采用统一语言模型(ULM)将语义模式合并到识别语言模型中,并将搜索过程从单词同步扩展到语义对象同步(SOS)解码。最后,我们利用顺序检测来推迟,拒绝或接受语义假设,并在用户发声进行时执行随后的对话动作。我们将这些方法合并到SALT和HTML中,并根据MiPad方案进行了比较用户研究。实验结果表明,该系统可以很好地应对自发语音,即使任务完成率和理解准确度没有显着差异,用户也喜欢这种系统的高度交互性。但是,交互式界面的确允许使用更有效的视觉提示策略,从而大大降低了语法语调。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号