首页> 外文会议>Croatian Society Electronics in Marine International Symposium >Acoustic and auxiliary speech features for speaker identification system
【24h】

Acoustic and auxiliary speech features for speaker identification system

机译:说话人识别系统的语音和辅助语音功能

获取原文

摘要

The focus of the article is on the selection, adjustment and overall performance of speech features at acoustical and prosodic level for speaker recognition task. Namely: perceptual linear prediction, Mel frequency cepstra, cepstral linear prediction, formant frequencies, and different auxiliary features. Both brief theoretical backgrounds and possible computational methods are outlined in regard to the speaker recognition task. In the series of experiments using 114 speakers database, it was observed that a model based method slightly outperformed the perceptual ones. Furthermore, it was found that auxiliary and prosodic features may not always improve scores when processed together with acoustic ones. On average the success rate was about 90% whereas the best recorded score was 99.1% for cepstral linear prediction coefficients in connection with k-nearest neighbor classifier.
机译:本文的重点是针对说话人识别任务,在声学和韵律级别上对语音特征的选择,调整和总体性能。即:感知线性预测,梅尔频率倒谱,倒谱线性预测,共振峰频率和不同的辅助特征。关于说话人识别任务概述了简要的理论背景和可能的计算方法。在使用114个发言人数据库的一系列实验中,观察到基于模型的方法略胜于感知方法。此外,还发现辅助和韵律特征在与声学特征一起处理时可能并不总能提高得分。平均而言,与k最近邻分类器有关的倒谱线性预测系数的最佳记录分数是99.1%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号