首页> 外国专利> ADAPTIVE END-OF-UTTERANCE TIMEOUT FOR REAL-TIME SPEECH RECOGNITION

ADAPTIVE END-OF-UTTERANCE TIMEOUT FOR REAL-TIME SPEECH RECOGNITION

机译:自适应语音结束超时,用于实时语音识别

摘要

Real-time speech recognition systems extend an end-of-utterance timeout period in response to the presence of a disfluency at the end of speech, and by so doing avoid cutting off speakers mid-sentence. Approaches to detecting disfluencies include the application of disfluency n-gram language models, acoustic models, prosody models, and phrase spotting. Explicit pause phrases can also be detected to extend sentence parsing until relevant semantic information is gathered from the speaker or another voice. Disfluency models can be trained such as by searching by successive deletion of tokens, phonemes, or acoustic segments to convert sentences that cannot be parsed into ones that can. Disfluency-based timeout adaptation is applicable to safety-critical systems.
机译:实时语音识别系统响应于语音结束时有不满情绪而延长了语音结束超时时间,从而避免切断说话者的中间句子。检测差异性的方法包括应用差异性n-gram语言模型,声学模型,韵律模型和短语识别。还可以检测到显式的暂停短语以扩展句子的解析,直到从说话者或其他声音中收集了相关的语义信息。可以通过例如连续删除标记,音素或声学片段进行搜索,将无法解析的句子转换为可以解析的句子,从而训练流语模型。基于差异性的超时适应适用于安全性至关重要的系统。

著录项

  • 公开/公告号US2019325898A1

    专利类型

  • 公开/公告日2019-10-24

    原文格式PDF

  • 申请/专利权人 SOUNDHOUND INC.;

    申请/专利号US201815959590

  • 申请日2018-04-23

  • 分类号G10L25/78;G10L15/197;G10L15/06;G10L15/18;G10L15/02;

  • 国家 US

  • 入库时间 2022-08-21 12:10:59

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号