Source: Dialogues with social robots: enablements, analyses, and evaluation (conference proceedings)

Recognising Conversational Speech: What an Incremental ASR Should Do for a Dialogue System and How to Get There


Abstract

Automatic speech recognition (ASR) is not only becoming increasingly accurate, but also increasingly adapted for producing timely, incremental output. However, overall accuracy and timeliness alone are insufficient for interactive dialogue systems, which require stable output and responsiveness to the utterance as it unfolds. Furthermore, for a dialogue system to handle phenomena such as disfluencies and achieve deep understanding of user utterances, these phenomena should be preserved or marked up for use by downstream components, such as language understanding, rather than filtered out. Similarly, word timing can be informative for analyzing deictic expressions in a situated environment and should be available for analysis. Here we investigate the overall accuracy and incremental performance of three widely used systems and discuss their suitability with respect to these requirements. From the differing performance along these measures we provide a picture of the requirements for incremental ASR in dialogue systems and describe freely available tools for using and evaluating incremental ASR.
