首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Physiologically-motivated feature extraction for speaker identification

【24h】

Physiologically-motivated feature extraction for speaker identification

机译：生理动机特征提取以识别说话人

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper introduces the use of three physiologically-motivated features for speaker identification, Residual Phase Cepstrum Coefficients (RPCC), Glottal Flow Cepstrum Coefficients (GLFCC) and Teager Phase Cepstrum Coefficients (TPCC). These features capture speaker-discriminative characteristics from different aspects of glottal source excitation patterns. The proposed physiologically-driven features give better results with lower model complexities, and also provide complementary information that can improve overall system performance even for larger amounts of data. Results on speaker identification using the YOHO corpus demonstrate that these physiologically-driven features are both more accurate than and complementary to traditional mel-frequency cepstral coefficients (MFCC). In particular, the incorporation of the proposed glottal source features offers significant overall improvement to the robustness and accuracy of speaker identification tasks.

机译：本文介绍了使用三种生理动机特征进行说话人识别的方法：残余相位倒谱系数（RPCC），声门倒谱系数（GLFCC）和Teager倒谱系数（TPCC）。这些特征从声门源激励模式的不同方面捕获了说话人的辨别特征。拟议的生理学驱动的特征以较低的模型复杂度提供了更好的结果，并且还提供了即使对于大量数据也可以改善整体系统性能的补充信息。使用YOHO语料库对说话人进行识别的结果表明，这些生理驱动的特征比传统的mel-frequency倒谱系数（MFCC）更为精确和互补。尤其是，所提出的声门源特征的结合大大提高了说话人识别任务的鲁棒性和准确性。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing 》|2014年|1690-1694|共5页
会议地点
作者
Wang Jianglin; Johnson Michael T.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Glottal source excitation and GMM-UBM; Speaker distinctive feature; Speaker identification;

机译：声门源激发和GMM-UBM;扬声器特色鲜明;说话人识别;

相似文献

外文文献
中文文献
专利

1. A network model of speaker identification with new feature extraction methods and asymmetric BLSTM [J] . Wang Xingmei, Xue Fuzhao, Wang Wei, Neurocomputing . 2020 ,第Auga25期

机译：具有新特征提取方法和非对称布斯特的扬声器识别网络模型
2. Speaker identification features extraction methods: A systematic review [J] . Tirumala Sreenivas Sremath, Shahamiri Seyed Reza, Garhwal Abhimanyu Singh, Expert Systems with Application . 2017 ,第deca30期

机译：说话人识别特征提取方法：系统综述
3. Acoustic feature extraction method for robust speaker identification [J] . Li Zuoqiang, Gao Yong Multimedia Tools and Applications . 2016 ,第12期

机译：鲁棒说话人识别的声学特征提取方法
4. Physiologically-motivated feature extraction for speaker identification [C] . Wang Jianglin, Johnson Michael T. IEEE International Conference on Acoustics, Speech and Signal Processing . 2014

机译：扬声器识别的生理动机特征提取
5. Physiologically-motivated feature extraction methods for speaker recognition. [D] . Wang, Jianglin. 2013

机译：用于说话人识别的生理动机特征提取方法。
6. A Markov chain-based feature extraction method for classification and identification of cancerous DNA sequences [O] . Amin Khodaei, Mohammad-Reza Feizi-Derakhshi, Behzad Mozaffari-Tazehkand 2021

机译：基于Markov链的分类和鉴定基于Markov链的特征提取方法
7. A Wavelet Packet and Mel-Frequency Cepstral Coefficients-Based Feature Extraction Method for Speaker Identification [O] . Turner Claude, Joseph Anthony 2015

机译：基于小波包和梅尔频率倒谱系数的说话人识别特征提取方法

Physiologically-motivated feature extraction for speaker identification

摘要

著录项

相似文献

相关主题

期刊订阅