Apparatus and method for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases

Abstract

A method of controlling access of a speaker to one of a service and a facility, the method comprising the steps of:
(a) receiving first spoken utterances of the speaker, the first spoken utterances containing indicia of the speaker;
(b) decoding the first spoken utterances;
(c) accessing a database corresponding to the decoded first spoken utterances, the database containing information attributable to a speaker candidate having indicia substantially similar to the speaker;
(d) querying the speaker with at least one question based on the information contained in the accessed database;
(e) receiving second spoken utterances of the speaker, the second spoken utterances being representative of at least one answer to the at least one question;
(f) decoding the second spoken utterances;
(g) verifying the accuracy of the decoded answer against the information contained in the accessed database serving as the basis for the question;
(h) taking a voice sample from the utterances of the speaker and processing the voice sample against an acoustic model attributable to the speaker candidate;
(i) generating a score corresponding to the accuracy of the decoded answer and the closeness of the match between the voice sample and the model; and
(j) comparing the score to a predetermined threshold value and if the score is one of substantially equivalent to and above the threshold value, then permitting speaker access to one of the service and the facility.
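Steps (g) through (j) can be illustrated with a minimal sketch. The abstract does not say how the answer accuracy and the acoustic match are combined into one score, so the weighted average below is an assumption, as are the names `combined_score` and `decide_access`, the 0-to-1 score range, the default weight and threshold, and the `tolerance` used to model "substantially equivalent to" the threshold.

```python
from dataclasses import dataclass


@dataclass
class VerificationResult:
    score: float
    granted: bool


def combined_score(answers_correct, acoustic_match, answer_weight=0.5):
    """Blend answer accuracy (step g) with acoustic closeness (step h).

    The weighted average is an illustrative assumption; the abstract only
    says the score corresponds to both quantities (step i).
    """
    if not answers_correct:
        raise ValueError("at least one answer is required")
    if not 0.0 <= acoustic_match <= 1.0:
        raise ValueError("acoustic_match must be in [0, 1]")
    if not 0.0 <= answer_weight <= 1.0:
        raise ValueError("answer_weight must be in [0, 1]")
    # Fraction of decoded answers that matched the database record.
    answer_accuracy = sum(bool(a) for a in answers_correct) / len(answers_correct)
    return answer_weight * answer_accuracy + (1.0 - answer_weight) * acoustic_match


def decide_access(answers_correct, acoustic_match,
                  threshold=0.8, tolerance=0.01, answer_weight=0.5):
    """Step (j): grant access if the score is at or near-at the threshold.

    `threshold` and `tolerance` are hypothetical values chosen for the example.
    """
    score = combined_score(answers_correct, acoustic_match, answer_weight)
    return VerificationResult(score=score, granted=score >= threshold - tolerance)


if __name__ == "__main__":
    # Both answers correct and a close acoustic match.
    print(decide_access([True, True], acoustic_match=0.9))
    # One wrong answer and a weaker acoustic match.
    print(decide_access([True, False], acoustic_match=0.6))
```

Folding both signals into a single score means a strong acoustic match can partly offset a wrong answer, and vice versa; a deployment that wanted each check to pass independently would need separate thresholds, which the abstract does not describe.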