首页> 美国卫生研究院文献>other >A Robust Speaker Identification System Using the Responses from a Model of the Auditory Periphery

【2h】

A Robust Speaker Identification System Using the Responses from a Model of the Auditory Periphery

机译：强大的说话人识别系统利用听觉外围模型的响应

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speaker identification under noisy conditions is one of the challenging topics in the field of speech processing applications. Motivated by the fact that the neural responses are robust against noise, this paper proposes a new speaker identification system using 2-D neurograms constructed from the responses of a physiologically-based computational model of the auditory periphery. The responses of auditory-nerve fibers for a wide range of characteristic frequency were simulated to speech signals to construct neurograms. The neurogram coefficients were trained using the well-known Gaussian mixture model-universal background model classification technique to generate an identity model for each speaker. In this study, three text-independent and one text-dependent speaker databases were employed to test the identification performance of the proposed method. Also, the robustness of the proposed method was investigated using speech signals distorted by three types of noise such as the white Gaussian, pink, and street noises with different signal-to-noise ratios. The identification results of the proposed neural-response-based method were compared to the performances of the traditional speaker identification methods using features such as the Mel-frequency cepstral coefficients, Gamma-tone frequency cepstral coefficients and frequency domain linear prediction. Although the classification accuracy achieved by the proposed method was comparable to the performance of those traditional techniques in quiet, the new feature was found to provide lower error rates of classification under noisy environments.

机译：嘈杂条件下的说话人识别是语音处理应用领域中具有挑战性的主题之一。由于神经反应对噪声具有鲁棒性，因此本文提出了一种新的说话人识别系统，该系统使用基于听觉外围的基于生理的计算模型的反应构建的二维神经图。听觉神经纤维对宽范围的特征频率的响应被模拟到语音信号以构造神经图。使用众所周知的高斯混合模型-通用背景模型分类技术训练神经图系数，以为每个说话者生成一个身份模型。在这项研究中，三个独立于文本和一个独立于文本的说话者数据库被用来测试该方法的识别性能。此外，使用由三种类型的噪声（例如具有不同信噪比的高斯白噪声，粉红色噪声和街道噪声）失真的语音信号研究了该方法的鲁棒性。将所提出的基于神经响应的方法的识别结果与传统说话人识别方法的性能进行了比较，这些方法具有梅尔频率倒谱系数，伽玛音频率倒谱系数和频域线性预测等功能。尽管通过所提出的方法实现的分类准确度可与那些传统技术在安静环境下的表现相媲美，但发现该新功能在嘈杂的环境下可提供较低的分类错误率。

著录项

期刊名称 other
作者
Md. Atiqul Islam; Wissam A. Jassim; Ng Siew Cheok; Muhammad Shamsul Arefeen Zilany;
展开▼
作者单位

展开▼
年(卷),期 -1(11),7
年度 -1
页码 e0158520
总页数 21
原文格式 PDF
正文语种
中图分类
关键词
入库时间 2022-08-21 11:12:07

相似文献

外文文献
中文文献
专利

1. Computational modeling of the human auditory periphery: Auditory-nerve responses, evoked potentials and hearing loss [J] . Verhulst Sarah, Altoe Alessandro, Vasilkov Viacheslav Hearing Research: An International Journal . 2018,第期

机译：人体听觉周边的计算建模：听觉神经反应，唤起潜力和听力损失
2. Modeling auditory-nerve responses for high sound pressure levels in the normal and impaired auditory periphery [J] . Zilany MSA, Bruce IC The Journal of the Acoustical Society of America . 2006,第3期

机译：模拟正常和受损听觉外周中高声压水平的听觉-神经反应
3. Speaker Modeling Using Emotional Speech for More Robust Speaker Identification [J] . Journal of Communications Technology and Electronics . 2019,第11期

机译：使用情感语音进行说话人建模，以更可靠地识别说话人
4. A Robust Text Dependent Speaker Identification Using Neural Responses from the Model of the Auditory System [C] . Md. Ibrahim Khalil, Nursadul Mamun, Khadija Akter International Conference on Electrical, Computer and Communication Engineering . 2019

机译：基于听觉系统模型的神经响应的鲁棒文本相关说话人识别
5. Essays in Empirical Macroeconomics: Identification in Vector Autoregressive Models and Robust Inference in Early Warning Systems [D] . Bruns, Martin 2019

机译：经验宏观经济学中的论文：矢量自回归模型中的识别和预警系统中的稳健推断
6. Robust decoding of selective auditory attention from MEG in a competing-speaker environment via state-space modeling [O] . Sahar Akram, Alessandro Presacco, Jonathan Z. Simon, -1

机译：通过状态空间建模对来自演讲者环境中MEG的选择性听觉注意力进行可靠解码
7. A Robust Speaker Identification System Using the Responses from a Model of the Auditory Periphery. [O] . Md Atiqul Islam, Wissam A Jassim, Ng Siew Cheok, 2016

机译：一种基于听外围模型响应的鲁棒说话人识别系统。

A Robust Speaker Identification System Using the Responses from a Model of the Auditory Periphery

摘要

著录项

相似文献

相关主题

期刊订阅