Speaker Identification Using a Nonlinear Speech Model and ANN

Lajish V.L; Sunil Kumar R.K; Vivek P

首页> 外文期刊>International Journal of Advanced Information Technology >Speaker Identification Using a Nonlinear Speech Model and ANN

【24h】

Speaker Identification Using a Nonlinear Speech Model and ANN

机译：使用非线性语音模型和ANN的说话人识别

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper introduces a nonlinear speech model for improved speaker identification. We modelled the speaker identity using Reconstructed Phase Space (RPS) of the speech signal and the Phase Space Point Distribution (PSPD) parameters. The PSPD parameters are extracted from five vowels uttered by different speakers. The speaker identification experiments are conducted based on the PSPD parameters using Feed Forward Multi Layer Perceptron (FFMLP). Overall performance of the Mel-frequency cepstral baseline system is compared with the proposed composite classifier system using both cepstral and PSPD features across ten different speakers. The experimental results indicate that the accuracy of the phase space approach by itself is below that of MFCC features and further it shows that the proposed approach in which PSPD features when used with MFCC, pitch and first formant frequency offers reasonable improvement in speaker identification accuracy of the system

机译：本文介绍了一种非线性语音模型，用于改进说话人识别。我们使用语音信号的重构相空间（RPS）和相空间点分布（PSPD）参数对说话人身份进行建模。从不同说话者发出的五个元音中提取PSPD参数。使用前馈多层感知器（FFMLP）基于PSPD参数进行说话人识别实验。将梅尔频率倒谱基线系统的整体性能与在十个不同扬声器中同时使用倒谱和PSPD功能的拟议复合分类器系统进行了比较。实验结果表明，相空间方法本身的准确性低于MFCC特征的准确性，并且还表明，所提出的方法（其中PSPD特征与MFCC，基音和第一共振峰频率一起使用）可以合理地提高扬声器的说话人识别准确性。系统

著录项

来源
《International Journal of Advanced Information Technology》 |2012年第5期|共页
作者
Lajish V.L; Sunil Kumar R.K; Vivek P;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类信息处理技术;
关键词

相似文献

外文文献
中文文献
专利

1. Speaker Modeling Using Emotional Speech for More Robust Speaker Identification [J] . Journal of Communications Technology and Electronics . 2019,第11期

机译：使用情感语音进行说话人建模，以更可靠地识别说话人
2. Speech parameterization based on AM-FM model and its application in speaker identification using AANN [J] . D. Giften Francis Samuel, D. Synthiya Vinothini International Journal of Advanced Computer Research . 2013,第9期

机译：基于AM-FM模型的语音参数化及其在AANN说话人识别中的应用
3. A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech [J] . Yan-Hui Tu, Jun Du, Chin-Hui Lee Journal of signal processing systems for signal, image, and video technology . 2018,第7期

机译：基于说话者的基于深度神经网络的单通道联合语音分离和声学建模方法，用于多语音对话的鲁棒识别
4. Speaker identification based on nonlinear speech models [C] . Wenndt, S., Shamsander, . 1996

机译：基于非线性语音模型的说话人识别
5. Advancements in robust algorithm formulation for speaker identification of whispered speech. [D] . Fan, Xing. 2012

机译：说话人识别低声语音的鲁棒算法公式化的进展。
6. Identification of nonlinear-nonlinear neuron models and stimulus decoding [O] . Aurel A Lazar, Yevgeniy B Slutskiy 2013

机译：非线性非线性神经元模型的辨识与刺激解码
7. Speaker Identification Enhancement under Co-Channel Conditions using Sinusoidal Model based Usable Speech Detection [O] . Saurabh S. Khanwalkar, Brett Y. Smolenski, Robert E. Yantorno 2004

机译：基于正弦模型的可用语音检测在同信道条件下的说话人识别增强
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

Speaker Identification Using a Nonlinear Speech Model and ANN

摘要

著录项

相似文献

相关主题

期刊订阅