Out-of-Domain Utterance Detection Using Classification Confidences of Multiple Topics

Ian Lane; Tatsuya Kawahara; Tomoko Matsui; Satoshi Nakamura

首页> 外文期刊>IEEE transactions on audio, speech and language processing >Out-of-Domain Utterance Detection Using Classification Confidences of Multiple Topics

【24h】

Out-of-Domain Utterance Detection Using Classification Confidences of Multiple Topics

机译：使用多个主题的分类置信度进行域外话语检测

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

One significant problem for spoken language systems is how to cope with users' out-of-domain (OOD) utterances which cannot be handled by the back-end application system. In this paper, we propose a novel OOD detection framework, which makes use of the classification confidence scores of multiple topics and applies a linear discriminant model to perform in-domain verification. The verification model is trained using a combination of deleted interpolation of the in-domain data and minimum-classification-error training, and does not require actual OOD data during the training process, thus realizing high portability. When applied to the "phrasebook" system, a single utterance read-style speech task, the proposed approach achieves an absolute reduction in OOD detection errors of up to 8.1 points (40% relative) compared to a baseline method based on the maximum topic classification score. Furthermore, the proposed approach realizes comparable performance to an equivalent system trained on both in-domain and OOD data, while requiring no OOD data during training. We also apply this framework to the "machine-aided-dialogue" corpus, a spontaneous dialogue speech task, and extend the framework in two manners. First, we introduce topic clustering which enables reliable topic confidence scores to be generated even for indistinct utterances, and second, we implement methods to effectively incorporate dialogue context. Integration of these two methods into the proposed framework significantly improves OOD detection performance, achieving a further reduction in equal error rate (EER) of 7.9 points

机译：口语系统的一个重要问题是如何应对用户无法通过后端应用程序系统处理的域外（OOD）语音。在本文中，我们提出了一种新颖的OOD检测框架，该框架利用多个主题的分类置信度得分，并应用线性判别模型进行域内验证。使用删除的域内数据插值和最小分类错误训练的组合来训练验证模型，并且在训练过程中不需要实际的OOD数据，从而实现了高度的可移植性。与基于最大主题分类的基线方法相比，当将其应用于“短语手册”系统（一种单语音朗读式语音任务）时，与基于基线的方法相比，所提出的方法可将OOD检测错误绝对减少多达8.1点（相对于40％）得分了。此外，所提出的方法实现了与在域内和OOD数据上训练的等效系统相当的性能，同时在训练过程中不需要OOD数据。我们还将此框架应用于自发对话语音任务“机器辅助对话”语料库，并以两种方式扩展了该框架。首先，我们引入主题聚类，即使对于含糊不清的话语，也可以生成可靠的主题置信度分数；其次，我们实现了有效合并对话上下文的方法。将这两种方法集成到建议的框架中可显着提高OOD检测性能，从而使等错误率（EER）进一步降低7.9点

著录项

来源
《IEEE transactions on audio, speech and language processing 》 |2007年第1期| p.150-161| 共12页
作者
Ian Lane; Tatsuya Kawahara; Tomoko Matsui; Satoshi Nakamura;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术 ;
关键词
natural language processing; speech recognition; in-domain verification; linear discriminant model; machine-aided-dialogue corpus; minimum-classification-error training; out-of-domain utterance detection; phrasebook system; single utterance read-style speech tas;

机译：自然语言处理;语音识别;域内验证;线性判别模型;机器辅助对话语料库;最小分类错误训练;域外话语检测;短语书系统;单话语朗读式话语;

相似文献

外文文献
中文文献
专利

1. Out-of-Domain Utterance Detection based on Confidence Measures from Multiple Topic Classification [J] . Ian R. LANE, Tatsuya KAWAHARA, Tomoko MATSUI, 電子情報通信学会技術研究報告. 音声. Speech . 2003 ,第520期

机译：基于多主题分类置信度的域外话语检测
2. Out-of-Domain Utterance Detection based on Confidence Measures from Multiple Topic Classification [J] . Ian R. LANE, Tatsuya KAWAHARA, Tomoko MATSUI, 電子情報通信学会技術研究報告. 言語理解とコミュニケーション. Natural Language Understanding and Models of Communication . 2003 ,第518期

机译：基于多主题分类置信度的域外话语检测
3. Detection of Topic and Speech Act Type on Utterance-by-Utterance Basis for Conversational Interfaces [J] . Katsushi Asami, Toshiyuki Takezawa, Genichiro Kikui Systems and Computers in Japan . 2005 ,第12期

机译：对话界面逐词的基础上话题和言语行为类型的检测
4. OUT-OF-DOMAIN DETECTION BASED ON CONFIDENCE MEASURES FROM MULTIPLE TOPIC CLASSIFICATION [C] . Ian R. Lane, Tatsuya Kawahara, Tornoko Matsui, IEEE International Conference on Acoustics, Speech, and Signal Processing . 2004

机译：基于多主题分类的置信度量的域外探测
5. Hidden target detection and classification using multiple modalities. [D] . Saponaro, Philip. 2016

机译：使用多种方式进行隐藏目标检测和分类。
6. Microblog Topic-Words Detection Model for Earthquake Emergency Responses Based on Information Classification Hierarchy [O] . Xiaohui Su, Shurui Ma, Xiaokang Qiu, 2021

机译：基于信息分类层次结构的地震应急响应的微博主题词检测模型
7. Out-of-Domain Utterance Detection Using Classification Confidences of Multiple Topics [O] . Lane Ian, Kawahara Tatsuya, Matsui Tomoko, 2007

机译：使用多个主题的分类置信度进行域外话语检测
8. Integrating Multiple Knowledge Sources for Utterance-Level Confidence Annotation in the CMU Communicator Spoken Dialog System [R] . Bohus, D. , Rudnicky, A. 2002

机译：在CmU Communicator口语对话系统中集成多个知识源用于话语级置信度注释

Out-of-Domain Utterance Detection Using Classification Confidences of Multiple Topics

摘要

著录项

相似文献

相关主题

期刊订阅