首页> 外文会议>Pacific Asia Conference on Language, Information and Computation; 20061101-03; Wuhan(CN) >Efficient language model development for spoken dialogue recognition and its evaluation on operator's speech at call centers
【24h】

Efficient language model development for spoken dialogue recognition and its evaluation on operator's speech at call centers

机译:语音对话识别的有效语言模型开发及其对呼叫中心话务员语音的评估

获取原文
获取原文并翻译 | 示例

摘要

While a language model for recognition of spoken dialogue is ideally built from a very large, specific-task-oriented corpus, a great amount of time and effort is required to develop such a corpus, and this involves both the audio recording and written transcription of large amounts of speech data. Training data for a language model should match the target task in both topic and style. What is needed, then, is a method to utilize previously existing spoken dialogue corpora that are not necessarily related to the specific target-task. Such corpora would be combined with documents related to the topic of the target-task to develop a language model for the target spoken-dialogue. In this paper, we propose a method for combining previously existing corpora with key phrases (i.e. phrases that contain keywords) extracted from task related documents. Even though the added data is from documents related to the target dialogue, since it consists of key phrases, stylistic differences (between document data and the actual dialogue to which the model will be applied) are not a problem. We have produced a model using this method and have evaluated it in use on actual spoken dialogue collected at call centers. Experimental results show that a relative 13% reduction in word error rate could be achieved with the addition of key phrases. This performance is nearly as good as that which would be achieved on the basis of a large, expensive transcript-corpus, and the cost of producing the key phrase data is essentially negligible. Such cost reduction achieved by our method will enable speech recognition applications to be more widely used.
机译:理想情况下,识别语音对话的语言模型是由非常大型的,面向特定任务的语料库构建的,但开发这种语料库需要大量的时间和精力,并且涉及录音和书面抄录。大量语音数据。语言模型的训练数据应在主题和样式上与目标任务匹配。因此,需要一种方法,其利用不一定与特定目标任务相关的先前存在的口头对话语料库。这种语料库将与与目标任务的主题有关的文档结合在一起,以开发目标口语对话的语言模型。在本文中,我们提出了一种将先前存在的语料库与从任务相关文档中提取的关键短语(即包含关键字的短语)相结合的方法。即使添加的数据来自与目标对话相关的文档,但由于它是由关键短语组成的,因此样式差异(文档数据与将要应用该模型的实际对话之间)不会出现问题。我们使用此方法制作了一个模型,并在呼叫中心收集的实际语音对话中对其进行了评估。实验结果表明,添加关键短语可以使单词错误率降低13%。这种性能几乎与在大型,昂贵的笔录语料库基础上所获得的性能一样好,并且生成关键短语数据的成本基本上可以忽略不计。通过我们的方法实现的这种成本降低将使语音识别应用程序得到更广泛的应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号