Efficient language model development for spoken dialogue recognition and its evaluation on operator's speech at call centers

机译：语音对话识别的有效语言模型开发及其对呼叫中心话务员语音的评估

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

While a language model for recognition of spoken dialogue is ideally built from a very large, specific-task-oriented corpus, a great amount of time and effort is required to develop such a corpus, and this involves both the audio recording and written transcription of large amounts of speech data. Training data for a language model should match the target task in both topic and style. What is needed, then, is a method to utilize previously existing spoken dialogue corpora that are not necessarily related to the specific target-task. Such corpora would be combined with documents related to the topic of the target-task to develop a language model for the target spoken-dialogue. In this paper, we propose a method for combining previously existing corpora with key phrases (i.e. phrases that contain keywords) extracted from task related documents. Even though the added data is from documents related to the target dialogue, since it consists of key phrases, stylistic differences (between document data and the actual dialogue to which the model will be applied) are not a problem. We have produced a model using this method and have evaluated it in use on actual spoken dialogue collected at call centers. Experimental results show that a relative 13% reduction in word error rate could be achieved with the addition of key phrases. This performance is nearly as good as that which would be achieved on the basis of a large, expensive transcript-corpus, and the cost of producing the key phrase data is essentially negligible. Such cost reduction achieved by our method will enable speech recognition applications to be more widely used.

机译：理想情况下，识别语音对话的语言模型是由非常大型的，面向特定任务的语料库构建的，但开发这种语料库需要大量的时间和精力，并且涉及录音和书面抄录。大量语音数据。语言模型的训练数据应在主题和样式上与目标任务匹配。因此，需要一种方法，其利用不一定与特定目标任务相关的先前存在的口头对话语料库。这种语料库将与与目标任务的主题有关的文档结合在一起，以开发目标口语对话的语言模型。在本文中，我们提出了一种将先前存在的语料库与从任务相关文档中提取的关键短语（即包含关键字的短语）相结合的方法。即使添加的数据来自与目标对话相关的文档，但由于它是由关键短语组成的，因此样式差异（文档数据与将要应用该模型的实际对话之间）不会出现问题。我们使用此方法制作了一个模型，并在呼叫中心收集的实际语音对话中对其进行了评估。实验结果表明，添加关键短语可以使单词错误率降低13％。这种性能几乎与在大型，昂贵的笔录语料库基础上所获得的性能一样好，并且生成关键短语数据的成本基本上可以忽略不计。通过我们的方法实现的这种成本降低将使语音识别应用程序得到更广泛的应用。

著录项

来源
《Pacific Asia Conference on Language, Information and Computation; 20061101-03; Wuhan(CN)》|2006年|P.80-86|共7页
会议地点 Wuhan(CN)
作者
Kiyokazu MIKI; Kaichiro HATAZAKI; Hiroaki HATTORI;
展开▼
作者单位

Media and Information Research Laboratories, NEC Corporation 1753 Shimonumabe, Nakahara-Ku, Kawasaki, Kanagawa 211-8666, JAPAN;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算机网络;
关键词
spoken dialogue recognition; language model adaptation; call center;

机译：口语对话识别;语言模型适应;呼叫中心;
入库时间 2022-08-26 14:20:59

相似文献

外文文献
中文文献
专利

1. Lip Detection and Lip Geometric Feature Extraction using Constrained Local Model for Spoken Language Identification using Visual Speech Recognition [J] . Aparna Brahme, Umesh Bhadade Indian Journal of Science and Technology . 2016,第32期

机译：基于视觉语音识别的受限局部模型用于口语识别的嘴唇检测和嘴唇几何特征提取
2. Speech act modeling and verification of spontaneous speech with disfluency in a spoken dialogue system [J] . Chung-Hsien Wu, Gwo-Lang Yan IEEE Transactions on Speech and Audio Proceessing . 2005,第3期

机译：语音行为建模和口语对话系统中具有自发性的自发语音验证
3. On the Use of Speech Recognition Parameters for Speech Synthesis in Spoken Language Interfaces [J] . ALEXANDRU CARUNTU, ALINA NICA, GAVRIL TODEREAN WSEAS Transactions on Signal Processing . 2006,第10期

机译：口语界面中语音识别参数在语音合成中的应用
4. Efficient language model development for spoken dialogue recognition and its evaluation on operator's speech at call centers [C] . Kiyokazu MIKI, Kaichiro HATAZAKI, Hiroaki HATTORI, Pacific Asia Conference on Language, Information and Computation . 2006

机译：高效的语言模型开发，对话对话识别及其在呼叫中心讲师演讲评估
5. Automatic Speech Recognition for Low-Resource and Morphologically Complex Languages [D] . Morris, Ethan. 2021

机译：用于低资源和形态复杂语言的自动语音识别
6. Towards spoken clinical-question answering: evaluating and adapting automatic speech-recognition systems for spoken clinical questions [O] . Feifan Liu, Gokhan Tur, Dilek Hakkani-Tür, 2011

机译：走向口语临床问题的答案：针对口语临床问题评估和改编自动语音识别系统
7. Efficient language model development for spoken dialogue recognition and its evaluation on operatoru27s speech at call centers [O] . 三木清一, 畑崎香一郎, 服部浩明 2006

机译：语音对话识别的有效语言模型开发及其对呼叫中心话务员语音的评估

Efficient language model development for spoken dialogue recognition and its evaluation on operator's speech at call centers

摘要

著录项

相似文献

相关主题

期刊订阅