首页> 外国专利> Adaptation of a speech recognition system across multiple remote sessions with aspeaker

Adaptation of a speech recognition system across multiple remote sessions with aspeaker

机译：演讲者在多个远程会话中对语音识别系统的适应

页面导航

摘要
著录项
相似文献

摘要

A technique for adaptation of a speech recognizing system across multiple remote communication sessions with a speaker. The speaker can be a telephone caller. An acoustic model is utilized for recognizing the speaker's speech. Upon initiation of a first remote session with the speaker, the acoustic model is speaker-independent. During the first session, the speaker is uniquely identified and speech samples are obtained from the speaker. In the preferred embodiment, the samples are obtained without requiring the speaker to engage in a training session. The acoustic model is then modified based upon the samples thereby forming a modified model. The model can be modified during the session or after the session is terminated. Upon termination of the session, the modified model is then stored in association with an identification of the speaker. During a subsequent remote session, the speaker is identified and, then, the modified acoustic model is utilized to recognize the speaker's speech. Additional speech samples are obtained during the subsequent session and, then, utilized to further modify the acoustic model. In this manner, an acoustic model utilized for recognizing the speech of a particular speaker is cumulatively modified according to speech samples obtained during multiple sessions with the speaker. As a result, the accuracy of the speech recognizing system improves for the speaker even when the speaker only engages in relatively short remote sessions.

机译：一种用于跨多个与扬声器进行远程通信会话的语音识别系统的技术。扬声器可以是电话呼叫者。声学模型用于识别说话者的语音。在开始与说话者的第一次远程会话时，声学模型是与说话者无关的。在第一会话期间，唯一地识别说话者，并从说话者获得语音样本。在优选实施例中，不需要说话者参加训练课程就获得样本。然后基于样本修改声学模型，从而形成修改的模型。可以在会话期间或会话终止后修改模型。在会话终止时，然后将修改的模型与讲话者的标识相关联地存储。在随后的远程会话期间，识别说话者，然后，使用修改后的声学模型来识别说话者的语音。在后续会话期间获取其他语音样本，然后将其用于进一步修改声学模型。以这种方式，根据在与说话者的多个会话期间获得的语音样本来累积地修改用于识别特定说话者的语音的声学模型。结果，即使说话者仅参加相对较短的远程会话，说话者的语音识别系统的准确性也得以提高。

著录项

公开/公告号AU4840800A

专利类型
公开/公告日2000-11-21

原文格式PDF
申请/专利权人 NUANCE COMMUNICATIONS;
展开▼

申请/专利号AU20000048408
发明设计人 ASHVIN KANNAN;HY MURVEIT;
展开▼

申请日2000-05-10
分类号G10L15/06;
国家 AU
入库时间 2022-08-22 01:20:30

相似文献

专利
外文文献
中文文献