首页>
外国专利>
ADAPTIVE END-OF-UTTERANCE TIMEOUT FOR REAL-TIME SPEECH RECOGNITION
ADAPTIVE END-OF-UTTERANCE TIMEOUT FOR REAL-TIME SPEECH RECOGNITION
展开▼
机译:自适应语音结束超时,用于实时语音识别
展开▼
页面导航
摘要
著录项
相似文献
摘要
Real-time speech recognition systems extend an end-of-utterance timeout period in response to the presence of a disfluency at the end of speech, and by so doing avoid cutting off speakers mid-sentence. Approaches to detecting disfluencies include the application of disfluency n-gram language models, acoustic models, prosody models, and phrase spotting. Explicit pause phrases can also be detected to extend sentence parsing until relevant semantic information is gathered from the speaker or another voice. Disfluency models can be trained such as by searching by successive deletion of tokens, phonemes, or acoustic segments to convert sentences that cannot be parsed into ones that can. Disfluency-based timeout adaptation is applicable to safety-critical systems.
展开▼