Dialect and Accent Recognition using Phonetic-Segmentation Supervectors

机译：使用语音分段超向量的方言和口音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We describe a new approach to automatic dialect and accent recognition which exceeds state-of-the-art performance in three recognition tasks. This approach improves the accuracy and substantially lower the time complexity of our earlier phonetic-based kernel approach for dialect recognition. In contrast to state-of-the-art acoustic-based systems, our approach employs phone labels and segmentation to constrain the acoustic models. Given a speaker's utterance, we first obtain phone hypotheses using a phone recognizer and then extract GMM-supervectors for each phone type, effectively summarizing the speaker's phonetic characteristics in a single vector of phone-type supervectors. Using these vectors, we design a kernel function that computes the phonetic similarities between pairs of utterances to train SVM classifiers to identify dialects. Comparing this approach to the state-of-the-art, we obtain a 12.9% relative improvement in EER on Arabic dialects, and a 17.9% relative improvement for American vs. Indian English dialects. We also see a 53.5% relative improvement over a GMM-UBM on American Southern vs. Non-Southern English.

机译：我们介绍了一种新的自动方言和重音识别方法，该方法在三个识别任务中都超过了最新的性能。这种方法提高了准确性，并大大降低了我们较早的基于语音的基于核的方言识别方法的时间复杂度。与最新的基于声学的系统相比，我们的方法采用电话标签和分段来约束声学模型。给定讲话者的话语，我们首先使用电话识别器获得电话假设，然后为每种电话类型提取GMM超向量，从而在单个电话类型超向量中有效地总结了讲话者的语音特征。使用这些向量，我们设计了一个内核函数，该函数计算发声对之间的语音相似度，以训练SVM分类器识别方言。将此方法与最新技术进行比较，我们发现阿拉伯方言的EER相对提高了12.9％，美国和印度英语方言的EER相对提高了17.9％。我们还发现，相对于美国南方英语和非南方英语，GMM-UBM相对提高了53.5％。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2011》|2011年|p.752-755|共4页
会议地点
作者
Fadi Biadsy; Julia Hirschberg; Daniel P. W. Ellis;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词

相似文献

外文文献
中文文献
专利

1. Tibetan Multi-Dialect Speech and Dialect Identity Recognition [J] . Yue Zhao, Jianjian Yue, Wei Song, Computers, Materials & Continua . 2019,第3期

机译：西藏多方面言语和方言识别识别
2. A Pashtu speakers database using accent and dialect approach [J] . Shahid Munir Shah, Shahzad Ahmed Memon, Khalil-ur-Rehman Khoumbati, International Journal of Applied Pattern Recognition . 2017,第4期

机译：使用口音和方言方法的Pashtu演讲者数据库
3. Accent Sandhi Estimation of Tokyo Dialect of Japanese Using Conditional Random Fields [J] . Masayuki SUZUKI, Ryo KUROIWA, Keisuke INNAMI, IEICE transactions on information and systems . 2017,第4期

机译：使用条件随机场的日语东京方言口音Sandhi估计
4. Accent recognition using i-vector, Gaussian Mean Supervector and Gaussian posterior probability supervector for spontaneous telephone speech [C] . Bahari Mohamad Hasan, Saeidi Rahim, Van hamme Hugo, IEEE International Conference on Acoustics, Speech and Signal Processing . 2013

机译：使用i-vector，高斯平均超向量和高斯后验概率超向量的自发电话语音口音识别
5. Automatic Dialect and Accent Recognition and its Application to Speech Recognition [D] . Biadsy, Fadi 2011

机译：方言和重音自动识别及其在语音识别中的应用
6. Audiovisual cues benefit recognition of accented speech in noise but not perceptual adaptation [O] . Briony Banks, Emma Gowen, Kevin J. Munro, 2015

机译：视听提示有助于识别噪声中的重音但不能感知适应
7. Dialect and Accent Recognition using Phonetic-Segmentation Supervectors [O] . Biadsy Fadi, Hirschberg Julia Bell, Ellis Daniel P. W. 2011

机译：使用语音分段超向量的方言和口音识别
8. Eigen-Channel Compensation and Discriminatively Trained Gaussian Mixture Models for Dialect and Accent Recognition. [R] . Torres-Carrasquillo, P. A., Sturim, D., Reynolds, D. A., 2016

机译：用于方言和口音识别的特征信道补偿和判别训练的高斯混合模型。

Dialect and Accent Recognition using Phonetic-Segmentation Supervectors

摘要

著录项

相似文献

相关主题

期刊订阅