首页> 外文会议>AES international conference >QUANTIFYING THE SPEAKING VOICE:FURTHER INVESTIGATION INTO SPEAKER IDENTIFICATION BY A SIMPLE CODE-MATCHING TECHNIQUE
【24h】

QUANTIFYING THE SPEAKING VOICE:FURTHER INVESTIGATION INTO SPEAKER IDENTIFICATION BY A SIMPLE CODE-MATCHING TECHNIQUE

机译:量化语音:通过简单的代码匹配技术进一步调查说话人的身份

获取原文

摘要

This paper reports on the techniques refined for a method of speaker identification through the automated comparison of spectral, timbral, and temporal features unique to an individual's speech production. This method was first described in Convention Paper 7274 presented by the co-author of this paper, Richard Sanders, at the 123~(rd) Convention of the Audio Engineering Society. Since its first publication, the system (now referred to as SIDNI or Speaker Identification by Numerical Imprint) has improved from 79% correct identifications in 78 comparisons from the speech of 26 males to 100% correct identifications in 150 comparisons from the speech of 50 males. This paper will provide more information on these results and the results of several other tests while also elaborating on the specific speech characteristics exploited by the system and their potential for identification. Some characteristics include: average fundamental speaking frequency, ratio of spectral densities below 1 kHz to those above 1 kHz (Alpha ratio), average rate of vowels, jitter, and shimmer.
机译:本文通过对个人语音产生所特有的频谱,音色和时间特征的自动比较,报告了针对说话人识别方法进行改进的技术。该方法的首次描述是由该论文的合著者Richard Sanders在音频工程学会的123〜(rd)Convention上提出的Convention Paper 7274中进行的。自首次发布以来,该系统(现称为SIDNI或通过数字印记进行的说话人识别)已从26位男性的语音中78个比较中的79%正确识别提高到50位男性的150个比较中100%的正确识别。 。本文将提供有关这些结果以及其他一些测试结果的更多信息,同时还将详细介绍系统利用的特定语音特征及其识别潜力。一些特征包括:平均基本说话频率,低于1 kHz的频谱密度与高于1 kHz的频谱密度之比(阿尔法比),元音,抖动和闪光的平均速率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号