首页> 外国专利> Indexing digitized speech with words represented in the digitized speech

Indexing digitized speech with words represented in the digitized speech

机译:用数字化语音中表示的单词索引数字化语音

摘要

Indexing digitized speech with words represented in the digitized speech, with a multimodal digital audio editor operating on a multimodal device supporting modes of user interaction, the modes of user interaction including a voice mode and one or more non-voice modes, the multimodal digital audio editor operatively coupled to an ASR engine, including providing by the multimodal digital audio editor to the ASR engine digitized speech for recognition; receiving in the multimodal digital audio editor from the ASR engine recognized user speech including a recognized word, also including information indicating where, in the digitized speech, representation of the recognized word begins; and inserting by the multimodal digital audio editor the recognized word, in association with the information indicating where, in the digitized speech, representation of the recognized word begins, into a speech recognition grammar, the speech recognition grammar voice enabling user interface commands of the multimodal digital audio editor.
机译:用在数字化语音中表示的单词对数字化语音进行索引,并在支持用户交互模式的多模式设备上运行的多模式数字音频编辑器进行操作,该用户交互模式包括语音模式和一个或多个非语音模式,多模式数字音频在操作上耦合到ASR引擎的编辑器,包括由多模式数字音频编辑器提供给ASR引擎数字化语音以进行识别;在多模态数字音频编辑器中从ASR引擎接收识别的用户语音,该用户语音包括识别的单词,还包括指示在数字化语音中识别的单词的表示从哪里开始的信息;然后,由多峰数字音频编辑器将识别出的单词与指示数字化语音中识别出的单词的表示从何处开始的信息相关联,插入语音识别语法中,该语音识别语法语音启用了多模式的用户界面命令数字音频编辑器。

著录项

  • 公开/公告号US9123337B2

    专利类型

  • 公开/公告日2015-09-01

    原文格式PDF

  • 申请/专利权人 NUANCE COMMUNICATIONS INC.;

    申请/专利号US201414204544

  • 发明设计人 CHARLES W. CROSS;FRANK L. JANIA;

    申请日2014-03-11

  • 分类号G10L15/22;G10L15/19;G10L21/06;G10L15/193;G10L15/183;G10L15/197;

  • 国家 US

  • 入库时间 2022-08-21 15:19:30

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号