【24h】

Empowering End Users to Personalize Dialogue Systems through Spoken Interaction

机译:通过语音交互使最终用户能够个性化对话系统

获取原文
获取原文并翻译 | 示例

摘要

This paper describes recent advances we have made towards the goal of empowering end users to automatically expand the knowledge base of a dialogue system through spoken interaction, in order to personalize it to their individual needs. We describe techniques used to incrementally reconfigure a preloaded trained natural language grammar, as well as the lexicon and language models for the speech recognition system. We also report on advances in the technology to integrate a spoken pronunciation with a spoken spelling, in order to improve spelling accuracy. While the original algorithm was designed for a "speak and spell" input mode, we have shown here that the same methods can be applied to separately uttered spoken and spelled forms of the word. By concatenating the two waveforms, we can take advantage of the mutual constraints realized in an integrated composite FST. Using an OGI corpus of separately spoken and spelled names, we have demonstrated letter error rates of under 6% for in-vocabulary words and under 11% for words not contained in the training lexicon, a 44% reduction in error rate over that achieved without use of the spoken form. We anticipate applying this technique to unknown words embedded in a larger context, followed by solicited spellings.
机译:本文介绍了我们在实现最终目标方面的最新进展,这些目标是使最终用户能够通过语音交互自动扩展对话系统的知识库,从而使其个性化以满足他们的个人需求。我们描述了用于逐步重新配置预加载的经过训练的自然语言语法的技术,以及用于语音识别系统的词典和语言模型。我们还报告了将口语发音与口语拼写整合在一起以提高拼写准确性的技术进步。虽然原始算法是为“说和拼写”输入模式设计的,但我们在此处显示,可以将相同的方法应用于单词的单独说出的口语和拼写形式。通过串联两个波形,我们可以利用集成复合FST中实现的相互约束。使用带有单独说出名字和拼写名字的OGI语料库,我们证明词汇中单词的字母错误率低于6%,而训练词典中未包含的单词的字母错误率低于11%,错误率比没有词汇时降低44%使用口头形式。我们期望将这种技术应用于更大上下文中嵌入的未知单词,然后进行拼写检查。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号