Posteriori Restoration of Turn-Taking and ASR Results for Incorrectly Segmented Utterances

Komatani Kazunori; Hotta Naoki; Sato Satoshi; Nakano Mikio

Abstract

Appropriate turn-taking is as important in spoken dialogue systems as generating correct responses. Especially when the dialogue features quick responses, voice activity detection (VAD) often segments a user utterance incorrectly because of short pauses within it. Incorrectly segmented utterances cause problems in both the automatic speech recognition (ASR) results and turn-taking: an incorrect VAD result leads to ASR errors and causes the system to start responding while the user is still speaking. We develop a method that performs a posteriori restoration for incorrectly segmented utterances and implement it as a plug-in for the MMDAgent open-source software. A crucial part of the method is deciding whether restoration is required. We cast this as a binary classification problem: detecting, from pairs of utterance fragments, those that were originally a single utterance. The classifier uses various features representing timing, prosody, and ASR result information. Experiments show that the proposed method outperformed a baseline with manually selected features by 4.8 and 3.9 in cross-domain evaluations with two domains. More detailed analysis revealed that the dominant and domain-independent features were utterance intervals and results from the Gaussian mixture model (GMM).
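
The classification step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the classifier, so logistic regression trained by stochastic gradient descent is used here as a stand-in, and the `FragmentPair` fields, the toy data, and all function names are hypothetical. The sketch only shows the idea of deciding, from timing, prosody, and ASR features of two consecutive fragments, whether they should be restored into one utterance.

```python
import math
from dataclasses import dataclass


@dataclass
class FragmentPair:
    """Two consecutive VAD segments. All fields are illustrative assumptions."""
    interval_s: float      # timing: silence between the two fragments, in seconds
    f0_slope: float        # prosody: pitch slope at the end of the first fragment
    asr_confidence: float  # ASR: confidence of the first fragment, in [0, 1]


def features(pair):
    # The leading 1.0 acts as the bias term.
    return [1.0, pair.interval_s, pair.f0_slope, pair.asr_confidence]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def score(weights, pair):
    """Estimated probability that the pair was originally a single utterance."""
    return sigmoid(sum(w * x for w, x in zip(weights, features(pair))))


def train(pairs, labels, lr=0.1, epochs=500):
    """Fit logistic-regression weights with plain stochastic gradient descent."""
    weights = [0.0] * 4
    for _ in range(epochs):
        for pair, label in zip(pairs, labels):
            error = score(weights, pair) - label
            weights = [w - lr * error * x
                       for w, x in zip(weights, features(pair))]
    return weights


def needs_restoration(weights, pair, threshold=0.5):
    return score(weights, pair) >= threshold


# Synthetic toy data, invented for illustration only.
# Label 1: the fragments were originally one utterance (restore them).
# Label 0: the fragments are genuinely separate utterances.
TOY_PAIRS = [
    FragmentPair(0.2, 0.1, 0.4),
    FragmentPair(0.3, 0.0, 0.5),
    FragmentPair(0.4, 0.2, 0.3),
    FragmentPair(1.5, -0.3, 0.9),
    FragmentPair(2.0, -0.2, 0.8),
    FragmentPair(1.8, -0.4, 0.85),
]
TOY_LABELS = [1, 1, 1, 0, 0, 0]

if __name__ == "__main__":
    w = train(TOY_PAIRS, TOY_LABELS)
    print(needs_restoration(w, FragmentPair(0.25, 0.1, 0.4)))
    print(needs_restoration(w, FragmentPair(1.9, -0.3, 0.9)))
```

In a running dialogue system, a positive decision would presumably trigger the restoration itself (re-recognizing the joined audio and cancelling the premature response); that part is outside this sketch.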

Bibliographic details

Source
IEICE Transactions on Information and Systems | 2015, No. 11 | pp. 1923-1931 | 9 pages
Authors
Komatani Kazunori; Hotta Naoki; Sato Satoshi; Nakano Mikio
Author affiliations

Nagoya Univ, Grad Sch Engn, Nagoya, Aichi 4648603, Japan;

Honda Res Inst Japan Co Ltd, Wako, Saitama 3510188, Japan;

Osaka Univ, Inst Sci & Ind Res, Ibaraki 5670047, Japan;

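Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
Original format: PDF
Language: English
Chinese Library Classification: Communications
Keywords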
spoken dialogue system; VAD error; turn taking; a posteriori restoration;
