MAXIMUM-LIKELIHOOD AFFINE CEPSTRAL FILTERING (MLACF) TECHNIQUE FOR SPEAKER NORMALIZATION

机译：扬声器标准化的最大似然仿制药倒谱滤波（MLACF）技术

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present a novel technique of minimizing the acoustic variability of speakers by transforming the features extracted from the speaker's data to better fit the recognition model. The concept of maximum-likelihood affine cepstral filtering (MLACF) will be introduced for feature transformation, along with solutions for the transformation parameters that maximize the likelihood of the test data with respect to a given recognition model. It is shown that for log-concave distributions, the solution of the MLACF problem can be obtained using convex programming. HMM-based digit recognition on the TIDIGITS database is presented to demonstrate the flexibility of the transformation in compensating for large acoustic mismatches between the speakers in the training and test database. In addition, it will be shown that the technique requires estimation of far fewer transformation parameters compared to existing techniques, thus allowing fast, real-time compensation.

机译：我们提出了一种新颖的技术，通过转换从说话者数据中提取的特征以更好地拟合识别模型，将说话者的声音变异性降至最低。将针对特征变换引入最大似然仿射倒谱滤波（MLACF）的概念，以及针对给定识别模型最大程度地提高测试数据可能性的变换参数解决方案。结果表明，对于对数凹面分布，可以使用凸规划获得MLACF问题的解。展示了TIDIGITS数据库上基于HMM的数字识别，以演示转换的灵活性，以补偿培训和测试数据库中扬声器之间的巨大声学失配。另外，将表明，与现有技术相比，该技术需要估计的变换参数要少得多，从而可以进行快速，实时的补偿。

著录项

来源
《European Conference on Speech Communication and Technology v.2; 20010903-20010907; Aalborg; DK》|2001年|P.1211-1214|共4页
会议地点 Aalborg(DK);Aalborg(DK)
作者
Yoon Kim;
展开▼
作者单位

Center for Computer Research in Music and Acoustics (CCRMA) Stanford University, Stanford, CA 94305 USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类传播理论;
关键词
入库时间 2022-08-26 13:48:55

相似文献

外文文献
中文文献
专利

1. Speaker identification based on normalized pitch frequency and Mel Frequency Cepstral Coefficients [J] . Marwa A. Nasr, Mohammed Abd-Elnaby, Adel S. El-Fishawy, International journal of speech technology . 2018,第4期

机译：基于归一化基音频率和梅尔频率倒谱系数的说话人识别
2. Arabic Audio News Retrieval System Using Dependent Speaker Mode, Mel Frequency Cepstral Coefficient and Dynamic Time Warping Techniques [J] . Hasan Muaidi, Ayat Al-Ahmad, Thaer Khdoor, Research journal of applied science, engineering and technology . 2014,第24期

机译：阿拉伯音频新闻检索系统，使用相关的扬声器模式，梅尔频率倒谱系数和动态时间扭曲技术
3. Arabic Audio News Retrieval System Using Dependent Speaker Mode, Mel Frequency Cepstral Coefficient and Dynamic Time Warping Techniques [J] . Hasan Muaidi, Ayat Al-Ahmad, Thaer Khdoor, Research journal of applied science, engineering and technology . 2014,第24期

机译：阿拉伯音频新闻检索系统，使用相关的扬声器模式，梅尔频率倒谱系数和动态时间扭曲技术
4. MAXIMUM-LIKELIHOOD AFFINE CEPSTRAL FILTERING (MLACF) TECHNIQUE FOR SPEAKER NORMALIZATION [C] . Yoon Kim European conference on speech communication and technology . 2001

机译：扬声器归一化的最大似然染粒度滤波（MLACF）技术
5. The modified-mean cepstral mean normalization (MMCMN) method for channel-robust automatic speaker recognition. [D] . Garcia, Alvin A. 2002

机译：改进的均值倒谱均值归一化（MMCMN）方法用于声道鲁棒性自动说话人识别。
6. Joint pre-processing framework for two-dimensional gel electrophoresis images based on nonlinear filtering background correction and normalization techniques [O] . Manuel Mauricio Goez, Maria C. Torres-Madronero, Sarah Rothlisberger, 2020

机译：基于非线性滤波背景校正和归一化技术的二维凝胶电泳图像的联合预处理框架
7. Cepstral Feature Normalization Methods Using Pole Filtering and Scale Normalization for Robust Speech Recognition [O] . Bo Kyeong Choi, Sung Min Ban, Hyung Soon Kim 2015

机译：抗骨刺特征使用杆滤波和尺度标准化进行规范化方法，用于强大的语音识别
8. Maximum-Likelihood Spectral Estimation and Adaptive Filtering Techniques withApplication to Airborne Doppler Weather Radar [R] . Lai, J. Y. 1994

机译：机载多普勒天气雷达的最大似然谱估计和自适应滤波技术

MAXIMUM-LIKELIHOOD AFFINE CEPSTRAL FILTERING (MLACF) TECHNIQUE FOR SPEAKER NORMALIZATION

摘要

著录项

相似文献

相关主题

期刊订阅