首页> 外文期刊>IEEE transactions on audio, speech and language processing >Dialect/Accent Classification Using Unrestricted Audio
【24h】

Dialect/Accent Classification Using Unrestricted Audio

机译:使用无限制音频的方言/重音分类

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This study addresses novel advances in English dialect/accent classification. A word-based modeling technique is proposed that is shown to outperform a large vocabulary continuous speech recognition (LVCSR)-based system with significantly less computational costs. The new algorithm, which is named Word-based Dialect Classification (WDC), converts the text-independent decision problem into a text-dependent decision problem and produces multiple combination decisions at the word level rather than making a single decision at the utterance level. The basic WDC algorithm also provides options for further modeling and decision strategy improvement. Two sets of classifiers are employed for WDC: a word classifier DW(k) and an utterance classifier D u. DW(k) is boosted via the AdaBoost algorithm directly in the probability space instead of the traditional feature space. Du is boosted via the dialect dependency information of the words. For a small training corpus, it is difficult to obtain a robust statistical model for each word and each dialect. Therefore, a context adapted training (CAT) algorithm is formulated, which adapts the universal phoneme Gaussian mixture models (GMMs) to dialect-dependent word hidden Markov models (HMMs) via linear regression. Three separate dialect corpora are used in the evaluations that include the Wall Street Journal (American and British English), NATO N4 (British, Canadian, Dutch, and German accent English), and IViE (eight British dialects). Significant improvement in dialect classification is achieved for all corpora tested
机译:这项研究探讨了英语方言/重音分类的新进展。提出了一种基于单词的建模技术,该技术表现出比基于大词汇量的连续语音识别(LVCSR)的系统优越的性能,而计算成本却大大降低。新算法称为基于单词的方言分类(WDC),它将基于文本的决策问题转换为基于文本的决策问题,并在单词级别产生多个组合决策,而不是在语音级别上做出单个决策。基本的WDC算法还提供了用于进一步建模和决策策略改进的选项。 WDC使用两组分类器:单词分类器DW(k)和发声分类器D u。 DW(k)通过AdaBoost算法直接在概率空间而不是传统特征空间中提升。通过单词的方言相关性信息来增强Du。对于小的训练语料库,很难为每个单词和每个方言获得可靠的统计模型。因此,制定了一种上下文适应训练(CAT)算法,该算法通过线性回归将通用音素高斯混合模型(GMM)改编为方言相关的词隐马尔可夫模型(HMM)。评估中使用了三个独立的方言语料库,其中包括《华尔街日报》(美国和英国英语),北约N4(英国,加拿大,荷兰和德国口音英语)和IViE(八个英国方言)。所有经过测试的语料库在方言分类方面均取得了显着改善

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号