On-Demand Language Model Interpolation for Mobile Speech Input

Abstract

Google offers several speech features on the Android mobile operating system: search by voice, voice input to any text field, and an API for application developers. As a result, our speech recognition service must support a wide range of usage scenarios and speaking styles: relatively short search queries, addresses, business names, dictated SMS and e-mail messages, and a long tail of spoken input to any of the applications users may install. We present a method of on-demand language model interpolation in which contextual information about each utterance determines interpolation weights among a number of n-gram language models. On-demand interpolation results in an 11.2% relative reduction in WER compared to using a single language model to handle all traffic.
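
The idea in the abstract can be illustrated with a minimal sketch. It assumes plain linear interpolation, P(w | h) = sum_i lambda_i * P_i(w | h), which the abstract does not spell out. The component models, the context labels (`search_box`, `sms_field`), the weights, and all function names below are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
from typing import Callable, Dict, Sequence

# Hypothetical type: a language model maps (history, word) to P(word | history).
LanguageModel = Callable[[Sequence[str], str], float]


def interpolate(models: Dict[str, LanguageModel],
                weights: Dict[str, float],
                history: Sequence[str],
                word: str) -> float:
    """Linearly interpolate component models; weights are renormalized to sum to 1."""
    total = sum(weights.get(name, 0.0) for name in models)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum((weights.get(name, 0.0) / total) * lm(history, word)
               for name, lm in models.items())


# Toy component models with made-up probabilities (assumption, not real data).
def search_lm(history: Sequence[str], word: str) -> float:
    return {"pizza": 0.4, "hello": 0.1}.get(word, 0.01)


def dictation_lm(history: Sequence[str], word: str) -> float:
    return {"pizza": 0.1, "hello": 0.5}.get(word, 0.01)


MODELS: Dict[str, LanguageModel] = {"search": search_lm, "dictation": dictation_lm}

# Hypothetical mapping from utterance context to interpolation weights.
CONTEXT_WEIGHTS: Dict[str, Dict[str, float]] = {
    "search_box": {"search": 0.9, "dictation": 0.1},
    "sms_field": {"search": 0.2, "dictation": 0.8},
}
DEFAULT_WEIGHTS: Dict[str, float] = {"search": 0.5, "dictation": 0.5}


def score(context: str, history: Sequence[str], word: str) -> float:
    """Pick weights for this utterance's context, then interpolate on demand."""
    weights = CONTEXT_WEIGHTS.get(context, DEFAULT_WEIGHTS)
    return interpolate(MODELS, weights, history, word)
```

In this sketch, calling `score("search_box", [], "pizza")` weights the search model heavily, while `score("sms_field", [], "hello")` favors the dictation model; an unknown context falls back to equal weights. Choosing weights per utterance, rather than baking one mixture into a single static model, is the "on-demand" aspect the abstract describes.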

Bibliographic details

Source
Annual conference of the International Speech Communication Association; INTERSPEECH 2010 | 2011 | p. 1812-1815 | 4 pages
Authors
Brandon Ballinger; Cyril Allauzen; Alexander Gruenstein; Johan Schalkwyk
Original format: PDF
CLC classification: Communications
Keywords
language modeling; interpolation; mobile


