首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2011 >Dialect and Accent Recognition using Phonetic-Segmentation Supervectors
【24h】

Dialect and Accent Recognition using Phonetic-Segmentation Supervectors

机译:使用语音分段超向量的方言和口音识别

获取原文

摘要

We describe a new approach to automatic dialect and accent recognition which exceeds state-of-the-art performance in three recognition tasks. This approach improves the accuracy and substantially lower the time complexity of our earlier phonetic-based kernel approach for dialect recognition. In contrast to state-of-the-art acoustic-based systems, our approach employs phone labels and segmentation to constrain the acoustic models. Given a speaker's utterance, we first obtain phone hypotheses using a phone recognizer and then extract GMM-supervectors for each phone type, effectively summarizing the speaker's phonetic characteristics in a single vector of phone-type supervectors. Using these vectors, we design a kernel function that computes the phonetic similarities between pairs of utterances to train SVM classifiers to identify dialects. Comparing this approach to the state-of-the-art, we obtain a 12.9% relative improvement in EER on Arabic dialects, and a 17.9% relative improvement for American vs. Indian English dialects. We also see a 53.5% relative improvement over a GMM-UBM on American Southern vs. Non-Southern English.
机译:我们介绍了一种新的自动方言和重音识别方法,该方法在三个识别任务中都超过了最新的性能。这种方法提高了准确性,并大大降低了我们较早的基于语音的基于核的方言识别方法的时间复杂度。与最新的基于声学的系统相比,我们的方法采用电话标签和分段来约束声学模型。给定讲话者的话语,我们首先使用电话识别器获得电话假设,然后为每种电话类型提取GMM超向量,从而在单个电话类型超向量中有效地总结了讲话者的语音特征。使用这些向量,我们设计了一个内核函数,该函数计算发声对之间的语音相似度,以训练SVM分类器识别方言。将此方法与最新技术进行比较,我们发现阿拉伯方言的EER相对提高了12.9%,美国和印度英语方言的EER相对提高了17.9%。我们还发现,相对于美国南方英语和非南方英语,GMM-UBM相对提高了53.5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号