...
首页> 外文期刊>Computer speech and language >Beyond ASR 1-best: Using word confusion networks in spoken language understanding
【24h】

Beyond ASR 1-best: Using word confusion networks in spoken language understanding

机译:超越ASR 1-最佳:在单词理解中使用单词混淆网络

获取原文
获取原文并翻译 | 示例
           

摘要

We are interested in the problem of robust understanding from noisy spontaneous speech input. With the advances in automated speech recognition (ASR), there has been increasing interest in spoken language understanding (SLU). A challenge in large vocabulary spoken language understanding is robustness to ASR errors. State of the art spoken language understanding relies on the best ASR hypotheses (ASR 1-best). In this paper, we propose methods for a tighter integration of ASR and SLU using word confusion networks (WCNs). WCNs obtained from ASR word graphs (lattices) provide a compact representation of multiple aligned ASR hypotheses along with word confidence scores, without compromising recognition accuracy. We present our work on exploiting WCNs instead of simply using ASR one-best hypotheses. In this work, we focus on the tasks of named entity detection and extraction and call classification in a spoken dialog system, although the idea is more general and applicable to other spoken language processing tasks. For named entity detection, we have improved the F-measure by using both word lattices and WCNs, 6–10% absolute. The processing of WCNs was 25 times faster than lattices, which is very important for real-life applications. For call classification, we have shown between 5% and 10% relative reduction in error rate using WCNs compared to ASR 1-best output.
机译:我们对从嘈杂的自发语音输入中获得强大理解的问题感兴趣。随着自动语音识别(ASR)的进步,人们对口语理解(SLU)的兴趣日益增加。大词汇量口语理解中的挑战是对ASR错误的鲁棒性。最先进的口头语言理解依赖于最佳ASR假设(ASR 1最佳)。在本文中,我们提出了使用单词混淆网络(WCN)紧密集成ASR和SLU的方法。从ASR词图(晶格)获得的WCN提供了多个对齐的ASR假设以及词置信度得分的紧凑表示,而不会影响识别准确性。我们介绍了我们在开发WCN方面的工作,而不是简单地使用ASR最佳假设。在这项工作中,我们将重点放在口语对话系统中的命名实体检测,提取和呼叫分类等任务上,尽管该想法更为笼统并适用于其他口语处理任务。对于命名实体检测,我们通过同时使用单词格和WCN(绝对值6–10%)改进了F度量。 WCN的处理速度比晶格快25倍,这对于实际应用非常重要。对于呼叫分类,与ASR 1最佳输出相比,使用WCN的错误率相对降低了5%至10%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号