New Issues on Nonlinear Discriminant Analysis for Speaker Recognition


Abstract

In this paper, we study a nonlinear discriminant analysis (NLDA) technique that extracts a speaker-discriminant feature set. We present different system architectures for extracting features that are more invariant to non-speaker-related conditions such as handset type and channel effects: (a) the first approach uses a Time Delay Neural Network (TDNN); (b) the second approach uses the Dynamic Decay Adjustment (DDA) algorithm for Radial Basis Function (RBF) networks; (c) finally, we linearly combine the normalized scores of the TDNN and RBF architectures to maximize the separation between speakers by nonlinearly projecting a large set of acoustic features onto a lower-dimensional feature set. The proposed architecture takes into account both the temporal variation of the speech signal and the modeling power of neural networks (NN). The extracted features are optimized to discriminate between speakers and to be robust to mismatched training and testing conditions. The transformed features are used to train a GMM-based speaker identification system. We trained and tested the proposed architectures on the 45-speaker SPIDRE corpus of telephone conversations. The results show an improvement of more than 12% over our standard system.
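The linear combination of normalized scores in approach (c) might look roughly like the sketch below. The abstract does not say which normalization or which combination weight was used, so the z-score normalization, the `weight` parameter, the function names, and the example scores are all assumptions made for illustration, not the author's method.

```python
from statistics import mean, pstdev


def z_normalize(scores):
    """Rescale a {speaker: score} map to zero mean and unit variance.

    Z-score normalization is an assumption; the abstract only says
    the scores are "normalized".
    """
    values = list(scores.values())
    mu = mean(values)
    sigma = pstdev(values) or 1.0  # avoid dividing by zero when all scores match
    return {spk: (s - mu) / sigma for spk, s in scores.items()}


def fuse_scores(tdnn_scores, rbf_scores, weight=0.5):
    """Linearly combine normalized scores: weight * TDNN + (1 - weight) * RBF.

    `weight` is a hypothetical tuning parameter, not a value from the paper.
    """
    if tdnn_scores.keys() != rbf_scores.keys():
        raise ValueError("both systems must score the same set of speakers")
    tdnn_n = z_normalize(tdnn_scores)
    rbf_n = z_normalize(rbf_scores)
    return {
        spk: weight * tdnn_n[spk] + (1.0 - weight) * rbf_n[spk]
        for spk in tdnn_n
    }


def identify(fused_scores):
    """Return the speaker with the highest fused score."""
    return max(fused_scores, key=fused_scores.get)


# Made-up per-speaker scores on different scales (e.g. log-likelihoods vs. activations).
tdnn = {"spk_a": -41.2, "spk_b": -39.8, "spk_c": -44.5}
rbf = {"spk_a": 0.61, "spk_b": 0.58, "spk_c": 0.20}

fused = fuse_scores(tdnn, rbf, weight=0.5)
best = identify(fused)
```

Normalizing before combining matters because the two systems may produce scores on very different scales; without it, the system with the larger numeric range would dominate the sum regardless of the weight.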


Bibliographic details

Source
World Multiconference on Systemics, Cybernetics and Informatics and 5th International Conference on Information Systems Analysis and Synthesis, Vol. 6: Image, Acoustic, Speech and Signal Processing, Jul 31-Aug 4, 1999, Orlando, Florida. 1999, pp. 354-361 (8 pages)
Conference location Orlando, FL (US)
Author
Chakib Tadj
Author affiliation

Ecole de Technologie Superieure, 1100 rue Notre-Dame Ouest, Montreal, QC H3C 1K3, Canada

Original format PDF
Language English
CLC classification Radio electronics, telecommunications technology
Keywords
speaker-discriminant feature; feature extraction; nonlinear; discriminant analysis; neural networks;

Date indexed 2022-08-26 13:56:11

Similar literature


1. Speaker Recognition Using Sparse Probabilistic Linear Discriminant Analysis [J]. Hai YANG, Yunfei XU, Qinwei ZHAO. IEICE Transactions on fundamentals of electronics, communications & computer sciences. 2013, No. 10
2. Speaker adaptation using probabilistic linear discriminant analysis for continuous speech recognition [J]. Y. Jeong. Electronics Letters. 2013, No. 25
3. Fuzzy linear and nonlinear discriminant analysis algorithms for face recognition [J]. Khalid Chougdali, Mohamed Jedra, Noureddine Zahid. Intelligent data analysis. 2009, No. 4
4. New Issues on Nonlinear Discriminant Analysis for Speaker Recognition [C]. Chakib Tadj. World multiconference on systemics, cybernetics and informatics. 1999
5. Nonlinear discriminant analysis based feature dimensionality reduction for automatic speech recognition [D]. Hu, Hongbing. 2010
6. Neural Quadratic Discriminant Analysis: Nonlinear decoding with V1-like computation [O]. Marino Pagan, Eero P. Simoncelli, Nicole C. Rust
7. Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition [O]. Shuai Wang, Zili Huang, Yanmin Qian. 2018
8. Sheep, Goats, Lambs and Wolves: A Statistical Analysis of Speaker Performance in the NIST 1998 Speaker Recognition Evaluation [R]. 1998
