【24h】

Speaker age estimation using i-vectors

机译:使用i向量估算说话者年龄

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, a new approach for age estimation from speech signals based on i-vectors is proposed. In this method, each utterance is modeled by its corresponding i-vector. Then, a Within-Class Covariance Normalization technique is used for session variability compensation. Finally, a least squares support vector regression (LSSVR) is applied to estimate the age of speakers. The proposed method is trained and tested on telephone conversations of the National Institute for Standard and Technology (NIST) 2010 and 2008 speaker recognition evaluation databases. Evaluation results show that the proposed method yields significantly lower mean absolute error and higher Pearson correlation coefficient between chronological speaker age and estimated speaker age compared to different conventional schemes. The obtained relative improvements of mean absolute error and correlation coefficient compared to our best baseline system are around 5% and 2% respectively. Finally, the effect of some major factors influencing the proposed age estimation system, namely utterance length and spoken language are analyzed.
机译:本文提出了一种基于i向量的语音信号年龄估计新方法。在这种方法中,每个话语都通过其对应的i-vector进行建模。然后,将类内协方差归一化技术用于会话可变性补偿。最后,应用最小二乘支持向量回归(LSSVR)估计说话者的年龄。该方法在美国国家标准技术研究院(NIST)2010和2008演讲者识别评估数据库的电话交谈中经过培训和测试。评估结果表明,与不同的传统方案相比,该方法在按时间顺序说话者年龄与估计说话者年龄之间的平均绝对误差明显较低,而皮尔逊相关系数则较高。与我们的最佳基准系统相比,平均绝对误差和相关系数的相对改进分别约为5%和2%。最后,分析了影响拟议年龄估计系统的一些主要因素的影响,即话语长度和口语。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号