首页> 外文会议> >Cohort selection and word grammar effects for speaker recognition
【24h】

Cohort selection and word grammar effects for speaker recognition

机译:说话人识别的同类群组选择和单词语法效果

获取原文

摘要

Automatic speaker recognition systems are maturing and databases have been designed to specifically compare algorithms and results to target error rates. The LDC YOHO speaker verification database was designed to test error rates at the 1% false rejection and 0.1% false acceptance level. This work examines the use of speaker-dependent (SD) monophone models to meet these requirements. By representing each speaker with 22 monophones, both closed-set speaker identification and global-threshold verification was performed. Using four combination lock phrases, speaker identification error rates are obtained at 0.19% for males and 0.31% for females. By defining a test hypothesis, a critical error analysis for speaker verification is developed and new results reported for YOHO. A new Bhattacharyya distance is developed for cohort selection. This method, based on the second order statistics of the enrolment Viterbi log-likelihoods, determines the optimal cohorts and achieves an equal error rate of 0.282%.
机译:自动说话人识别系统已经日趋成熟,并且已经设计了数据库来专门比较算法和结果与目标错误率。 LDC YOHO扬声器验证数据库旨在以1%的错误拒绝和0.1%的错误接受水平测试错误率。这项工作研究了使用扬声器相关(SD)单声道电话型号来满足这些要求。通过用22个单声道电话代表每个扬声器,可以执行封闭设置的扬声器识别和全局阈值验证。使用四个密码锁定短语,男性说话者识别错误率分别为0.19%和女性0.31%。通过定义测试假设,开发了用于说话人验证的关键错误分析,并为YOHO报告了新结果。为队列选择开发了新的Bhattacharyya距离。该方法基于维特比对数入组可能性的二阶统计量,确定了最佳队列,并实现了0.282%的均等错误率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号