Pronunciation prediction in speech recognition

Abstract

An automatic speech recognition (ASR) device may be configured to predict pronunciations of textual identifiers (for example, song names) by first predicting one or more languages of origin of each textual identifier. The languages of origin may be determined from the textual identifier itself. The predicted pronunciations may include a pronunciation in one language, a pronunciation in a second language, and a hybrid pronunciation that combines multiple languages. The pronunciations may be added to a lexicon and matched to the content item (e.g., a song) and/or its textual identifier. The ASR device may receive a spoken utterance from a user requesting access to the content item. The ASR device then determines whether the spoken utterance matches one of the pronunciations stored in the lexicon for that content item and, if it does, accesses the content.
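The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the hint vocabularies, the word-level pronunciation tables, and every function name (`predict_languages`, `predict_pronunciations`, `add_to_lexicon`, `access_content`) are hypothetical stand-ins, and the sketch assumes an acoustic front end has already turned the spoken utterance into a phoneme string. A real system would likely use trained language-identification and grapheme-to-phoneme models instead of lookup tables.

```python
from itertools import product

# Hypothetical hint vocabularies used to guess the languages of origin.
LANGUAGE_HINTS = {
    "en": {"my", "love"},
    "es": {"vida", "loca"},
}

# Hypothetical word-level pronunciations per language (toy phoneme strings).
WORD_PRONUNCIATIONS = {
    "en": {"my": "M AY", "vida": "V IY D AH"},
    "es": {"my": "m i", "vida": "b i d a"},
}


def predict_languages(identifier):
    """Return every language whose hint vocabulary shares a word with the identifier."""
    words = identifier.lower().split()
    return [lang for lang, hints in LANGUAGE_HINTS.items()
            if any(word in hints for word in words)]


def predict_pronunciations(identifier):
    """Build monolingual and hybrid pronunciations by picking a language per word."""
    words = identifier.lower().split()
    languages = predict_languages(identifier)
    pronunciations = set()
    # Each tuple in the product assigns one candidate language to each word.
    # All-same tuples give monolingual pronunciations; mixed tuples give hybrids.
    for choice in product(languages, repeat=len(words)):
        phones = [WORD_PRONUNCIATIONS[lang].get(word)
                  for lang, word in zip(choice, words)]
        if all(phones):  # skip combinations with a missing word pronunciation
            pronunciations.add(" ".join(phones))
    return pronunciations


def add_to_lexicon(lexicon, identifier, content_item):
    """Map every predicted pronunciation of the identifier to the content item."""
    for pronunciation in predict_pronunciations(identifier):
        lexicon[pronunciation] = content_item


def access_content(lexicon, recognized_phonemes):
    """Return the matching content item, or None when no pronunciation matches."""
    return lexicon.get(recognized_phonemes)


lexicon = {}
add_to_lexicon(lexicon, "my vida", "song-001")
print(access_content(lexicon, "M AY b i d a"))  # hybrid: English "my" + Spanish "vida"
```

Enumerating one language choice per word is only one possible way to form hybrids; the abstract does not say at what granularity languages are combined, so the word-level split here is an assumption.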