Acoustic and auxiliary speech features for speaker identification system

机译：说话人识别系统的语音和辅助语音功能

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The focus of the article is on the selection, adjustment and overall performance of speech features at acoustical and prosodic level for speaker recognition task. Namely: perceptual linear prediction, Mel frequency cepstra, cepstral linear prediction, formant frequencies, and different auxiliary features. Both brief theoretical backgrounds and possible computational methods are outlined in regard to the speaker recognition task. In the series of experiments using 114 speakers database, it was observed that a model based method slightly outperformed the perceptual ones. Furthermore, it was found that auxiliary and prosodic features may not always improve scores when processed together with acoustic ones. On average the success rate was about 90% whereas the best recorded score was 99.1% for cepstral linear prediction coefficients in connection with k-nearest neighbor classifier.

机译：本文的重点是针对说话人识别任务，在声学和韵律级别上对语音特征的选择，调整和总体性能。即：感知线性预测，梅尔频率倒谱，倒谱线性预测，共振峰频率和不同的辅助特征。关于说话人识别任务概述了简要的理论背景和可能的计算方法。在使用114个发言人数据库的一系列实验中，观察到基于模型的方法略胜于感知方法。此外，还发现辅助和韵律特征在与声学特征一起处理时可能并不总能提高得分。平均而言，与k最近邻分类器有关的倒谱线性预测系数的最佳记录分数是99.1％。

著录项

来源
《Croatian Society Electronics in Marine International Symposium》|2015年|109-112|共4页
会议地点
作者
Kacur Juraj; Truchly Peter;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
cepstral analysis; prediction theory; speaker recognition; acoustic feature; auxiliary speech feature; cepstral linear prediction; formant frequency; k-nearest neighbor classifier; mel frequency cepstra; perceptual linear prediction; speaker identification system; speaker recognition; Mel frequency cepstral coefficient; Speaker recognition; Speech; Speech recognition; Statistics; CLPC; LPC; MFCC; PLP; Speaker Recognition;

机译：倒频谱分析;预测理论;说话人识别;声学特征;辅助语音特征;倒频谱线性预测;共振峰频率; k近邻分类器;梅尔频率倒谱;感知线性预测;扬声器识别系统;扬声器识别;梅尔频率倒谱系数;扬声器识别;语音;语音识别;统计; CLPC; LPC; MFCC; PLP;说话人识别;

相似文献

外文文献
中文文献
专利

1. A Speech-and-Speaker Identification System: Feature Extraction, Description, and Classification of Speech-Signal Image [J] . Khalid Saeed, Mohammad Kheir Nammous IEEE Transactions on Industrial Electronics . 2007,第期

机译：语音识别系统：语音信号图像的特征提取，描述和分类
2. Performance enhancement of speaker identification systems using speech encryption and cancelable features [J] . Soliman Naglaa F.M., Mostfa Zhraa, El-Samie Fathi E.Abd, International journal of speech technology . 2017,第4期

机译：使用语音加密和可取消功能增强说话者识别系统的性能
3. Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition [J] . Arata ITOH, Sunao HARA, Norihide KITAOKA, IEICE transactions on information and systems . 2012,第10期

机译：使用由MLLR转换生成的伪扬声器特征进行声学模型训练，以实现与扬声器无关的可靠语音识别
4. Acoustic and auxiliary speech features for speaker identification system [C] . Kacur Juraj, Truchly Peter Croatian Society Electronics in Marine International Symposium . 2015

机译：扬声器识别系统的声学和辅助语音功能
5. Speaker Characteristic-based Acoustic Model Adaptation Method for Speaker Recognition Systems [D] . Millington, Daniel S. 2011

机译：基于说话者特征的说话人识别系统声学模型自适应方法
6. Acoustic and perceptual speech characteristics of native Mandarin speakers with Parkinsons disease [O] . Sih-Chiao Hsu, Yishan Jiao, Megan J. McAuliffe, -1

机译：患有帕金森氏病的普通话母语者的声学和感知语音特征
7. Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition [O] . Arata Itoh, Sunao Hara, Norihide Kitaoka, 2012

机译：使用由MLLR转换生成的伪扬声器特征进行声学模型训练，以实现与扬声器无关的可靠语音识别

Acoustic and auxiliary speech features for speaker identification system

摘要

著录项

相似文献

相关主题

期刊订阅