首页> 外国专利> ADAPTIVE END-OF-UTTERANCE TIMEOUT FOR REAL-TIME SPEECH RECOGNITION

ADAPTIVE END-OF-UTTERANCE TIMEOUT FOR REAL-TIME SPEECH RECOGNITION

机译：自适应语音结束超时，用于实时语音识别

页面导航

摘要
著录项
相似文献

摘要

Real-time speech recognition systems extend an end-of-utterance timeout period in response to the presence of a disfluency at the end of speech, and by so doing avoid cutting off speakers mid-sentence. Approaches to detecting disfluencies include the application of disfluency n-gram language models, acoustic models, prosody models, and phrase spotting. Explicit pause phrases can also be detected to extend sentence parsing until relevant semantic information is gathered from the speaker or another voice. Disfluency models can be trained such as by searching by successive deletion of tokens, phonemes, or acoustic segments to convert sentences that cannot be parsed into ones that can. Disfluency-based timeout adaptation is applicable to safety-critical systems.

机译：实时语音识别系统响应于语音结束时有不满情绪而延长了语音结束超时时间，从而避免切断说话者的中间句子。检测差异性的方法包括应用差异性n-gram语言模型，声学模型，韵律模型和短语识别。还可以检测到显式的暂停短语以扩展句子的解析，直到从说话者或其他声音中收集了相关的语义信息。可以通过例如连续删除标记，音素或声学片段进行搜索，将无法解析的句子转换为可以解析的句子，从而训练流语模型。基于差异性的超时适应适用于安全性至关重要的系统。

著录项

公开/公告号US2019325898A1

专利类型
公开/公告日2019-10-24

原文格式PDF
申请/专利权人 SOUNDHOUND INC.;
展开▼

申请/专利号US201815959590
发明设计人 LIAM OHART KINNEY;JOEL MCKENZIE;ANITHA KANDASAMY;
展开▼

申请日2018-04-23
分类号G10L25/78;G10L15/197;G10L15/06;G10L15/18;G10L15/02;
国家 US
入库时间 2022-08-21 12:10:59

相似文献

专利
外文文献
中文文献