Leveraging multiple query logs to improve language models for spoken query recognition

机译：利用多个查询日志来改善语言模型以进行口头查询识别

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

A voice search system requires a speech interface that can correctly recognize spoken queries uttered by users. The recognition performance strongly relies on a robust language model. In this work, we present the use of multiple data sources, with the focus on query logs, in improving ASR language models for a voice search application. Our contributions are three folds: (1) the use of text queries from web search and mobile search in language modeling; (2) the use of web click data to predict query forms from business listing forms; and (3) the use of voice query logs in creating a positive feedback loop. Experiments show that by leveraging these resources, we can achieve recognition performance comparable to, or even better than, that of a previously deploy system where a large amount of spoken query transcripts are used in language modeling.

机译：语音搜索系统需要语音接口，该接口可以正确识别用户说出的口头查询。识别性能强烈依赖于强大的语言模型。在这项工作中，我们提出了使用多个数据源（重点是查询日志）来改善语音搜索应用程序的ASR语言模型。我们的贡献包括三个方面：（1）在语言建模中使用网络搜索和移动搜索中的文本查询；（2）使用网页点击数据从商户列表中预测查询形式；（3）使用语音查询日志来建立积极的反馈循环。实验表明，通过利用这些资源，我们可以实现与以前部署的系统相当甚至更好的识别性能，该系统以前在语言建模中使用了大量口头查询笔录。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP 2009》|2009年|3713-3716|共4页
会议地点 Taipei(CT);Taipei(CT)
作者
Xiao Li; Nguyen, P.; Zweig, G.; Bohus, D.;
展开▼
作者单位

Microsoft Res. Redmond WA;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
query processing; speech recognition; ASR language models; Web search; mobile search; multiple data sources; multiple query logs; positive feedback loop; speech interface; spoken query recognition; voice search system; click data; language modeling; query log; voice search;

机译：查询处理；语音识别; ASR语言模型；网络搜索;移动搜索；多个数据源；多个查询日志；正反馈回路；语音接口；语音查询识别；语音搜索系统；点击数据；语言建模；查询日志；声音搜索;

相似文献

外文文献
中文文献
专利

1. Improving Keyword Recognition of Spoken Queries by Combining Multiple Speech Recognizer's Outputs for Speech-driven WEB Retrieval Task [J] . Masahiko MATSUSHITA, Hiromitsu NISHIZAKI, Takehito UTSURO, IEICE Transactions on Information and Systems . 2005,第3期

机译：通过组合多个语音识别器的输出以执行语音驱动的WEB检索任务，提高口语查询的关键字识别
2. Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition [J] . Akinori Ito, Yasutomo Kajiura, Motoyuki Suzuki, EURASIP journal on audio, speech, and music processing . 2009,第009期

机译：语音识别的无监督语言模型自适应自动查询生成和查询相关性度量
3. Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition [J] . Akinori Ito, Yasutomo Kajiura, Motoyuki Suzuki, EURASIP journal on audio, speech, and music processing . 2009,第1期

机译：语音识别的无监督语言模型自适应自动查询生成和查询相关性度量
4. LEVERAGING MULTIPLE QUERY LOGS TO IMPROVE LANGUAGE MODELS FOR SPOKEN QUERY RECOGNITION [C] . Xiao Li, Patrick Nguyen, Geoffrey Zweig, IEEE International Conference on Acoustics, Speech, and Signal Processing . 2009

机译：利用多个查询日志来提高语言模型，用于语音查询识别
5. Database models and query languages for relational data and metadata query processing. [D] . Jain, Manoj Kumar. 1996

机译：用于关系数据和元数据查询处理的数据库模型和查询语言。
6. Leveraging blockchain for immutable logging and querying across multiple sites [O] . Mustafa Safa Ozdayi, Murat Kantarcioglu, Bradley Malin 2020

机译：利用区块链在多个站点之间进行不变的日志记录和查询
7. LEVERAGING MULTIPLE QUERY LOGS TO IMPROVE LANGUAGE MODELS FOR SPOKEN QUERY RECOGNITION [O] . Xiao Li, Patrick Nguyen, Geoffrey Zweig, 2009

机译：利用多个查询日志来改进语言模型以进行口语查询识别

Leveraging multiple query logs to improve language models for spoken query recognition

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅