Recognising Conversational Speech: What an Incremental ASR Should Do for a Dialogue System and How to Get There

机译：识别对话语音：增量ASR对对话系统应该做什么以及如何到达对话系统

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic speech recognition (ASR) is not only becoming increasingly accurate, but also increasingly adapted for producing timely, incremental output. However, overall accuracy and timeliness alone are insufficient when it comes to interactive dialogue systems which require stability in the output and responsivity to the utterance as it is unfolding. Furthermore, for a dialogue system to deal with phenomena such as disfluencies, to achieve deep understanding of user utterances these should be preserved or marked up for use by downstream components, such as language understanding, rather than be filtered out. Similarly, word timing can be informative for analyzing deictic expressions in a situated environment and should be available for analysis. Here we investigate the overall accuracy and incremental performance of three widely used systems and discuss their suitability for the aforementioned perspectives. From the differing performance along these measures we provide a picture of the requirements for incremental ASR in dialogue systems and describe freely available tools for using and evaluating incremental ASR.

机译：自动语音识别（ASR）不仅变得越来越准确，而且越来越适合于产生及时的增量输出。但是，对于交互式对话系统而言，仅其整体准确性和及时性是不够的，交互式对话系统要求输出的稳定性和对发声的响应性，因为它正在发展。此外，对于处理诸如流离失所现象的对话系统，要深入理解用户的话语，应保留或标记这些内容以供下游组件使用，例如语言理解，而不是将其过滤掉。类似地，单词计时可以为分析环境中的动词表达提供参考，并应可用于分析。在这里，我们研究了三种广泛使用的系统的整体精度和增量性能，并讨论了它们对于上述观点的适用性。从这些措施的不同性能中，我们提供了对话系统中增量ASR要求的图片，并描述了使用和评估增量ASR的免费工具。

著录项

来源
《Dialogues with social robots: enablements, analyses, and evaluation》|2016年|421-432|共12页
会议地点 Saariselka(FI)
作者
Timo Baumann; Casey Kennington; Julian Hough; David Schlangen;
展开▼
作者单位

Natural Language Systems Group, Informatics Department, Universitaet Hamburg, Hamburg, Germany;

Dialogue Systems Group, Faculty of Linguistics and Literature and CITEC, Bielefeld University, Bielefeld, Germany;

Dialogue Systems Group, Faculty of Linguistics and Literature and CITEC, Bielefeld University, Bielefeld, Germany;

Dialogue Systems Group, Faculty of Linguistics and Literature and CITEC, Bielefeld University, Bielefeld, Germany;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Incremental ASR; Conversational speech; System requirements; Evaluation;

机译：增量ASR；对话演讲；系统要求;评价;

相似文献

外文文献
中文文献
专利

1. Some background on dialogue management and conversational speech for dialogue systems [J] . Yorick Wilks, Roberta Catizone, Simon Worgan, Computer speech and language . 2011,第2期

机译：对话系统对话管理和对话语音的一些背景
2. Towards incremental speech generation in conversational systems [J] . Gabriel Skantze, Anna Hjalmarsson Computer speech and language . 2013,第1期

机译：在对话系统中实现增量语音生成
3. Dialogue Act Modeling for Automatic Tagging and Recognition of Conversational Speech [J] . Andreas Stolcke, Klaus Ries, Noah Coccaro, Computational linguistics . 2000,第3期

机译：自动标记和识别对话语音的对话行为建模
4. Recognising Conversational Speech: What an Incremental ASR Should Do for a Dialogue System and How to Get There [C] . Timo Baumann, Casey Kennington, Julian Hough, International workshop on spoken dialogue systems technology . 2017

机译：认识到会话语音：渐进的ASR应该为对话系统做些什么以及如何到达那里
5. Dialogue systems as conversational partners: Applying conversation acts theory to natural language generation for task-oriented mixed-initiative spoken dialogue. [D] . Stent, Amanda Joy. 2001

机译：作为对话伙伴的对话系统：将对话行为理论应用于自然语言生成，以实现面向任务的混合式口语对话。
6. A systematic comparison of contemporary automatic speech recognition engines for conversational clinical speech [O] . Jodi Kodish-Wachs, Emin Agassi, Patrick Kenny III, 2018

机译：当代自动语音识别引擎用于对话式临床语音的系统比较
7. A RERANKING APPROACH FOR RECOGNITION AND CLASSIFICATION OF SPEECH INPUT IN CONVERSATIONAL DIALOGUE SYSTEMS [O] . Fabrizio Morbini, Kartik Audhkhasi, Ron Artstein, 2013

机译：对流对话系统中语音输入识别和分类的再生方法

Recognising Conversational Speech: What an Incremental ASR Should Do for a Dialogue System and How to Get There

摘要

著录项

相似文献

相关主题

期刊订阅