Speaker independent discriminant feature extraction for acoustic pattern-matching

机译：独立于说话人的判别特征提取，用于声学模式匹配

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Acoustic pattern-matching algorithms have recently become prominent again for automatically processing speech utterances where no prior knowledge of the spoken language is required. Applications of such technology include, but are not limited to, query-by-example search, spoken term detection and automatic word discovery. Obtaining content-aware acoustic features as independent as possible from speaker and acoustic environment variations is a key step in these algorithms. Currently, GMM posteriorgrams are found to outperform the standard MFCC features even though they were not designed to optimize the discrimination between acoustic classes. In this paper we combine the K-means clustering algorithm with the GMM posteriorgrams front-end to obtain more discriminant features. Results on a query-by-example task show that the proposed approaches outperform standard MFCC features by 7.8% absolute P@N and GMM-based posteriorgram features by 3.7% absolute P@N when using a 64-dimensional feature vector.

机译：声学模式匹配算法最近在自动处理语音发声方面再次变得很重要，而无需先验口语知识。这种技术的应用包括但不限于按示例查询，口语检测和自动单词发现。在这些算法中，获取与扬声器和声学环境变化尽可能独立的内容感知声学特征是关键。当前，即使GMM后验图的设计不是为了优化声学类别之间的区分，也发现它们优于标准MFCC功能。在本文中，我们将K-means聚类算法与GMM后验图前端结合使用，以获得更多的判别特征。以示例查询任务的结果表明，当使用64维特征向量时，所提出的方法优于标准MFCC特征7.8％的绝对P @ N，而基于GMM的后部特征优于3.7％的绝对P @ N。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP》|2012年|p.485- 488|共4页
会议地点 Kyoto(JP)
作者
Anguera, Xavier;
展开▼
作者单位

Telefonica Research Torre Telefonica-Diagonal 00 08019 Barcelona Spain;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition [J] . Arata ITOH, Sunao HARA, Norihide KITAOKA, IEICE transactions on information and systems . 2012,第10期

机译：使用由MLLR转换生成的伪扬声器特征进行声学模型训练，以实现与扬声器无关的可靠语音识别
2. Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition [J] . Arata ITOH, Sunao HARA, Norihide KITAOKA, IEICE Transactions on Information and Systems . 2012,第10期

机译：使用由MLLR转换生成的伪扬声器特征进行声学模型训练，以实现与扬声器无关的可靠语音识别
3. Linear discriminant analysis, principal component analysis and sequential forward search for speaker feature extraction: a comparative study [J] . A. Harrag, D. Saigaa, N. Harrag International Journal of Engineering Intelligent Systems for Electrical Engineering and Co . 2011,第4期

机译：线性判别分析，主成分分析和顺序正向搜索以进行说话人特征提取：一项比较研究
4. Speaker independent discriminant feature extraction for acoustic pattern-matching [C] . Anguera Xavier IEEE International Conference on Acoustics, Speech and Signal Processing . 2011

机译：声音模式匹配的扬声器独立判别特征提取
5. Discriminant analysis based feature extraction for pattern recognition. [D] . Wu, Wei. 2009

机译：基于判别分析的特征提取用于模式识别。
6. A Feature Extraction Method Based on Differential Entropy and Linear Discriminant Analysis for Emotion Recognition [O] . Dong-Wei Chen, Rui Miao, Wei-Qi Yang, 2019

机译：基于微分熵和线性判别分析的情绪识别特征提取方法
7. Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition [O] . Arata Itoh, Sunao Hara, Norihide Kitaoka, 2012

机译：使用由MLLR转换生成的伪扬声器特征进行声学模型训练，以实现与扬声器无关的可靠语音识别
8. Word Recognition and Speaker Authentication Using Amplitude Independent and Time Independent Word Features. [R] . preusse,john w. 1971

机译：使用幅度独立和时间无关词特征的词识别和说话者认证。

Speaker independent discriminant feature extraction for acoustic pattern-matching

摘要

著录项

相似文献

相关主题

期刊订阅