首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >Speaker independent discriminant feature extraction for acoustic pattern-matching
【24h】

Speaker independent discriminant feature extraction for acoustic pattern-matching

机译:独立于说话人的判别特征提取,用于声学模式匹配

获取原文

摘要

Acoustic pattern-matching algorithms have recently become prominent again for automatically processing speech utterances where no prior knowledge of the spoken language is required. Applications of such technology include, but are not limited to, query-by-example search, spoken term detection and automatic word discovery. Obtaining content-aware acoustic features as independent as possible from speaker and acoustic environment variations is a key step in these algorithms. Currently, GMM posteriorgrams are found to outperform the standard MFCC features even though they were not designed to optimize the discrimination between acoustic classes. In this paper we combine the K-means clustering algorithm with the GMM posteriorgrams front-end to obtain more discriminant features. Results on a query-by-example task show that the proposed approaches outperform standard MFCC features by 7.8% absolute P@N and GMM-based posteriorgram features by 3.7% absolute P@N when using a 64-dimensional feature vector.
机译:声学模式匹配算法最近在自动处理语音发声方面再次变得很重要,而无需先验口语知识。这种技术的应用包括但不限于按示例查询,口语检测和自动单词发现。在这些算法中,获取与扬声器和声学环境变化尽可能独立的内容感知声学特征是关键。当前,即使GMM后验图的设计不是为了优化声学类别之间的区分,也发现它们优于标准MFCC功能。在本文中,我们将K-means聚类算法与GMM后验图前端结合使用,以获得更多的判别特征。以示例查询任务的结果表明,当使用64维特征向量时,所提出的方法优于标准MFCC特征7.8%的绝对P @ N,而基于GMM的后部特征优于3.7%的绝对P @ N。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号