首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP 2009 >Leveraging multiple query logs to improve language models for spoken query recognition
【24h】

Leveraging multiple query logs to improve language models for spoken query recognition

机译:利用多个查询日志来改善语言模型以进行口头查询识别

获取原文
获取外文期刊封面目录资料

摘要

A voice search system requires a speech interface that can correctly recognize spoken queries uttered by users. The recognition performance strongly relies on a robust language model. In this work, we present the use of multiple data sources, with the focus on query logs, in improving ASR language models for a voice search application. Our contributions are three folds: (1) the use of text queries from web search and mobile search in language modeling; (2) the use of web click data to predict query forms from business listing forms; and (3) the use of voice query logs in creating a positive feedback loop. Experiments show that by leveraging these resources, we can achieve recognition performance comparable to, or even better than, that of a previously deploy system where a large amount of spoken query transcripts are used in language modeling.
机译:语音搜索系统需要语音接口,该接口可以正确识别用户说出的口头查询。识别性能强烈依赖于强大的语言模型。在这项工作中,我们提出了使用多个数据源(重点是查询日志)来改善语音搜索应用程序的ASR语言模型。我们的贡献包括三个方面:(1)在语言建模中使用网络搜索和移动搜索中的文本查询; (2)使用网页点击数据从商户列表中预测查询形式; (3)使用语音查询日志来建立积极的反馈循环。实验表明,通过利用这些资源,我们可以实现与以前部署的系统相当甚至更好的识别性能,该系统以前在语言建模中使用了大量口头查询笔录。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号