首页> 外文会议>International Conference on Spoken Language Processing; 20041004-08; Jeju(KR) >Keyword Recognition and Extraction by Multiple-LVCSRs with 60,000 Words in Speech-driven WEB Retrieval Task
【24h】

Keyword Recognition and Extraction by Multiple-LVCSRs with 60,000 Words in Speech-driven WEB Retrieval Task

机译:语音驱动的WEB检索任务中具有60,000个单词的多个LVCSR对关键字的识别和提取

获取原文
获取原文并翻译 | 示例

摘要

This paper presents speech-driven Web retrieval models which accepts spoken search topics (queries) in the NTCIR-3 Web retrieval task. We experimentally evaluate the techniques of combining outputs of multiple LVCSR models with a language model(LM) with a 60,000 vocabulary size in recognition of spoken queries. As model combination techniques, we use the SVM learning. We show that the techniques of multiple LVCSR model combination can achieve improvement both in speech recognition and retrieval accuracies in speech-driven text retrieval. Comparing with the retrieval accuracies when a LM with a 20,000/60,000 vocabulary size is used in LVCSRs, the LM that has larger size of the vocabulary improves also retrieval accuracies.
机译:本文介绍了语音驱动的Web检索模型,该模型在NTCIR-3 Web检索任务中接受语音搜索主题(查询)。我们通过实验评估了将多个LVCSR模型的输出与具有60,000词汇量的语言模型(LM)组合在一起以识别语音查询的技术。作为模型组合技术,我们使用SVM学习。我们表明,多种LVCSR模型组合技术可以在语音驱动的文本检索中实现语音识别和检索精度方面的改进。与在LVCSR中使用词汇量为20,000 / 60,000的LM时的检索准确度相比,具有较大词汇量的LM也会提高检索准确度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号