首页> 外文期刊>Knowledge-Based Systems >Two-level speech recognition to enhance the performance of spoken dialogue systems
【24h】

Two-level speech recognition to enhance the performance of spoken dialogue systems

机译:两级语音识别可增强口语对话系统的性能

获取原文
获取原文并翻译 | 示例

摘要

Spoken dialogue systems can be considered knowledge-based systems designed to interact with users using speech in order to provide information or carry out simple tasks. Current systems are restricted to well-known domains that provide knowledge about the words and sentences the users will likely utter. Basically, these systems rely on an input interface comprised of speech recogniser and semantic analyser, a dialogue manager, and an output interface comprised of response generator and speech synthesiser. As an attempt to enhance the performance of the input interface, this paper proposes a technique based on a new type of speech recogniser comprised of two modules. The first one is a standard speech recogniser that receives the sentence uttered by the user and generates a graph of words. The second module analyses the graph and produces the recognised sentence using the context knowledge provided by the current prompt of the system. We evaluated the performance of two input interfaces working in a previously developed dialogue system: the original interface of the system and a new one that features the proposed technique. The experimental results show that when the sentences uttered by the users are out-of-context analysed by the new interface, the word accuracy and sentence understanding rates increase by 93.71 and 77.42% absolute, respectively, regarding the original interface. The price to pay for this clear enhancement is a little reduction in the scores when the new interface analyses sentences in-context, as they decrease by 2.05 and 3.41% absolute, respectively, in comparison with the original interface. Given that in real dialogues sentences may be out-of-context analysed, specially when they are uttered by inexperienced users, the technique can be very useful to enhance the system performance.
机译:语音对话系统可以被认为是基于知识的系统,旨在通过语音与用户进行交互,以提供信息或执行简单的任务。当前系统仅限于提供有关用户可能会说出的单词和句子的知识的众所周知的域。基本上,这些系统依赖于由语音识别器和语义分析器组成的输入接口,对话管理器以及由响应生成器和语音合成器组成的输出接口。为了提高输入接口的性能,本文提出了一种基于新型语音识别器的技术,该语音识别器由两个模块组成。第一个是标准语音识别器,它接收用户说出的句子并生成单词图。第二个模块使用系统的当前提示提供的上下文知识分析图形并生成识别的句子。我们评估了在以前开发的对话系统中工作的两个输入界面的性能:系统的原始界面和具有建议技术的新界面。实验结果表明,在新界面下对用户说出的句子进行语境外分析时,相对于原始界面,单词准确率和句子理解率绝对值分别提高了93.71%和77.42%。当新界面分析上下文中的句子时,为此明显增强所付出的代价是分数的降低,因为与原始界面相比,它们分别减少了2.05%和3.41%的绝对值。鉴于在实际对话中可能会在上下文之外分析句子,尤其是当它们由经验不足的用户说出时,该技术对于增强系统性能可能非常有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号