Speaker recognition using hidden Markov models, dynamic time warping and vector quantisation

Yu K.; Mason J.

首页> 外文期刊>IEE Proceedings. Part K >Speaker recognition using hidden Markov models, dynamic time warping and vector quantisation

【24h】

Speaker recognition using hidden Markov models, dynamic time warping and vector quantisation

机译：使用隐马尔可夫模型，动态时间扭曲和矢量量化的说话人识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The authors evaluate continuous density hidden Markov models (CDHMM), dynamic time warping (DTW) and distortion-based vector quantisation (VQ) for speaker recognition, emphasising the performance of each model structure across incremental amounts of training data. Text-independent (TI) experiments are performed with VQ and CDHMMs, and text-dependent (TD) experiments are performed with DTW, VQ and CDHMMs. For TI speaker recognition, VQ performs better than an equivalent CDHMM with one training version, but is outperformed by CDHMM when trained with ten training versions. For TD experiments, DTW outperforms VQ and CDHMMs for sparse amounts of training data, but with more data the performance of each model is indistinguishable. The performance of the TD procedures is consistently superior to TI, which is attributed to subdividing the speaker recognition problem into smaller speaker-word problems. It is also shown that there is a large variation in performance across the different digits, and it is concluded that digit zero is the best digit for speaker discrimination.

机译：作者评估了连续密度隐藏马尔可夫模型（CDHMM），动态时间规整（DTW）和基于失真的矢量量化（VQ）来进行说话人识别，从而强调了每种模型结构在增量训练数据上的性能。使用VQ和CDHMM进行文本无关（TI）实验，使用DTW，VQ和CDHMM进行文本无关（TD）实验。对于TI说话人识别，VQ的性能优于具有一个培训版本的等效CDHMM，但是在经过十个培训版本的培训后，VQ的性能要优于CDHMM。对于TD实验，在稀疏的训练数据量方面，DTW优于VQ和CDHMM，但是随着数据的增加，每种模型的性能都难以区分。 TD程序的性能始终优于TI，这归因于将说话人识别问题细分为更小的说话人单词问题。还表明，不同数字在演奏上有很大的差异，并且得出结论，数字零是说话者辨别的最佳数字。

著录项

来源
《IEE Proceedings. Part K》 |1995年第5期|P.313-318|共6页
作者
Yu K.; Mason J.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类电声技术和语音信号处理;电工技术;
关键词

相似文献

外文文献
中文文献
专利

1. Isolated Malay Digit Recognition Using Pattern Recognition Fusion of Dynamic Time Warping and Hidden Markov Models | Science Publications [J] . A. Hussain, K. A. Ishak, S. A. Samad, American journal of applied sciences . 2008,第6期

机译：动态时间规整和隐马尔可夫模型的模式识别融合融合孤立马来数字科学出版物
2. Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: A comparative study [J] . Joseph A. Kogan, Daniel Margoliash The Journal of the Acoustical Society of America . 1998,第4期

机译：使用连续时间扭曲和隐马尔可夫模型从连续录音中自动识别鸟类歌曲元素的比较研究
3. Parameter re-estimation in semicontinuous hidden Markov modelling of speech with feedback to vector quantisation codebook [J] . Huang X.D., Jack M.A. Electronics Letters . 1988,第22期

机译：带有反馈到矢量量化码本的语音的半连续隐马尔可夫建模中的参数重新估计
4. Speaker identification using autoregressive hidden Markov models and adaptive vector quantisation [C] . Eugeny E. Bovbel, Igor E. Kheidorov, Michael E. Kotlyar International Workshop on Text, Speech and Dialogue . 2000

机译：扬声器识别使用自回归隐藏的马尔可夫模型和自适应矢量定量
5. Gesture Recognition using Hidden Markov Models, Dynamic Time Warping, and Geometric Template Matching. [D] . Hunter, Garett. 2013

机译：使用隐马尔可夫模型，动态时间扭曲和几何模板匹配进行手势识别。
6. One-against-All Weighted Dynamic Time Warping for Language-Independent and Speaker-Dependent Speech Recognition in Adverse Conditions [O] . Xianglilan Zhang, Jiping Sun, Zhigang Luo 2010

机译：不利条件下与语言无关和与说话者相关的语音识别的一对多加权动态时间规整
7. Isolated Malay Digit Recognition Using Pattern Recognition Fusion of Dynamic Time Warping and Hidden Markov Models [O] . S. A. R. Al-haddad, S. A. Samad, A. Hussain, 2010

机译：动态时间规整与隐马尔可夫模型的模式识别融合融合的孤立马来数字识别
8. Speaker Recognition by Hidden Markov Models and Neural Networks [R] . Zeek, E. J. 1996

机译：隐马尔可夫模型和神经网络的说话人识别

Speaker recognition using hidden Markov models, dynamic time warping and vector quantisation

摘要

著录项

相似文献

相关主题

期刊订阅