首页> 外国专利> Audio tagging for the portable possible device which has the after-treatment the selective freedom, audio annotation, and speech recognition

Audio tagging for the portable possible device which has the after-treatment the selective freedom, audio annotation, and speech recognition

机译:用于便携式可能设备的音频标签,该设备具有选择性自由,音频注释和语音识别的后处理功能

摘要

A media capture device has an audio input receptive of user speech relating to a media capture activity in close temporal relation to the media capture activity. A plurality of focused speech recognition lexica respectively relating to media capture activities are stored on the device, and a speech recognizer recognizes the user speech based on a selected one of the focused speech recognition lexica. A media tagger tags captured media with generated speech recognition text, and a media annotator annotates the captured media with a sample of the user speech that is suitable for input to a speech recognizer. Tagging and annotating are based on close temporal relation between receipt of the user speech and capture of the captured media. Annotations may be converted to tags during post processing, employed to edit a lexicon using letter-to-sound rules and spelled word input, or matched directly to speech to retrieve captured media.
机译:媒体捕获设备具有音频输入,该音频输入接收与媒体捕获活动密切相关的与媒体捕获活动有关的用户语音。分别与媒体捕获活动有关的多个关注语音识别词典被存储在设备上,并且语音识别器基于所选择的关注语音识别词典中的一个来识别用户语音。媒体标记器用生成的语音识别文本标记捕获的媒体,并且媒体注释器用适合于输入到语音识别器的用户语音样本来注释捕获的媒体。标记和注释基于用户语音的接收与捕获媒体的捕获之间的紧密时间关系。批注可以在后期处理期间转换为标签,可以使用字母到声音的规则和拼写单词输入来编辑词典,或者可以直接与语音匹配以获取捕获的媒体。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号