首页> 外文会议>European Signal Processing Conference >Text-dependent speaker recognition by compressed feature-dynamics derived from sinusoidal representation of speech

【24h】

Text-dependent speaker recognition by compressed feature-dynamics derived from sinusoidal representation of speech

机译：通过从语音的正弦表示得出的压缩特征动力学来识别与文本相关的说话人

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Prevalent speaker recognition methods use only spectral-envelope based features such as MFCC, ignoring the rich speaker identity information contained in the temporal-spectral dynamics of the entire speech signal. We propose a new feature for speaker recognition based on sinusoidal representation of speech called compressed spectral dynamics (Sinogram-CSD), which effectively captures such spectral dynamics and the inherent speaker identity. The discriminative power of CSD allows classification to remain simple. The proposed CSD-MSRI method uses a simple nearest neighbor classifier to deliver performance competitive to conventional MFCC+DTW based text-dependent speaker recognition methods at significantly lower complexity.

机译：流行的说话人识别方法仅使用基于频谱包络的功能（例如MFCC），而忽略了整个语音信号的时域频谱动态中包含的丰富的说话人身份信息。我们提出了一种基于语音正弦表示的说话人识别新功能，称为压缩频谱动力学（Sinogram-CSD），可有效捕获此类频谱动力学和固有的说话人身份。 CSD的辨别力使分类保持简单。提出的CSD-MSRI方法使用一个简单的最近邻分类器，以较低的复杂度提供与传统的基于MFCC + DTW的基于文本的说话人识别方法相比具有竞争力的性能。

著录项

来源
《European Signal Processing Conference》|2008年|1-5|共5页
会议地点
作者
Das Amitava; Chittaranjan Gokul; Srinivasan V.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Integration strategies for audio-visual speech processing: applied to text-dependent speaker recognition [J] . Lucey S., Chen T., Sridharan S., IEEE transactions on multimedia . 2005,第3期

机译：视听语音处理的集成策略：应用于与文本相关的说话人识别
2. Restricted Boltzmann machines for vector representation of speech in speaker recognition [J] . Omid Ghahabi, Javier Hernando Computer speech and language . 2018,第JANa期

机译：说话人识别中用于语音矢量表示的受限玻尔兹曼机
3. Session compensation using binary speech representation for speaker recognition [J] . Gabriel Hernandez-Sierra, Jose R. Calvo, Jean-Francois Bonastre, Pattern recognition letters . 2014,第nova1期

机译：使用二进制语音表示进行会话补偿以进行说话人识别
4. Text-dependent speaker recognition by compressed feature-dynamics derived from sinusoidal representation of speech [C] . Das Amitava, Chittaranjan Gokul, Srinivasan V. European Signal Processing Conference . 2008

机译：通过从语音的正弦表示的压缩特征 - 动态进行文本依赖扬声器识别
5. Neural Network Based Representation Learning and Modeling for Speech and Speaker Recognition [D] . Guo, Jinxi. 2019

机译：基于神经网络的语言和扬声器识别的模拟
6. Recognition of time-compressed speech does not predict recognition of natural fast-rate speech by older listeners [O] . Sandra Gordon-Salant, Danielle J. Zion, Carol Espy-Wilson -1

机译：时间压缩语音的识别无法预测年长听众对自然快速语音的识别
7. Text-Dependent Speaker Recognition By compressed Feature-Dynamics Derived From Sinusoidal Representation of Speech [O] . Das Amitav 2008

机译：基于语音正弦表示的压缩特征动力学的文本相关说话人识别
8. Speaker Recognition on Lossy Compressed Speech Using the Speex Codec [R] . Stauffer, A. R., Lawson, A. D. 2009

机译：利用speex编解码器对有损压缩语音进行说话人识别

Text-dependent speaker recognition by compressed feature-dynamics derived from sinusoidal representation of speech

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅