Dialect/Accent Classification Using Unrestricted Audio

Huang R.; Hansen J. H. L.; Angkititrakul P.

Abstract

This study addresses novel advances in English dialect/accent classification. A word-based modeling technique is proposed that is shown to outperform a large vocabulary continuous speech recognition (LVCSR)-based system at significantly lower computational cost. The new algorithm, named Word-based Dialect Classification (WDC), converts the text-independent decision problem into a text-dependent decision problem and produces multiple combination decisions at the word level rather than making a single decision at the utterance level. The basic WDC algorithm also provides options for further modeling and decision strategy improvement. Two sets of classifiers are employed for WDC: a word classifier DW(k) and an utterance classifier Du. DW(k) is boosted via the AdaBoost algorithm directly in the probability space instead of the traditional feature space. Du is boosted via the dialect dependency information of the words. For a small training corpus, it is difficult to obtain a robust statistical model for each word and each dialect. Therefore, a context adapted training (CAT) algorithm is formulated, which adapts the universal phoneme Gaussian mixture models (GMMs) to dialect-dependent word hidden Markov models (HMMs) via linear regression. Three separate dialect corpora are used in the evaluations: the Wall Street Journal (American and British English), NATO N4 (British, Canadian, Dutch, and German accent English), and IViE (eight British dialects). Significant improvement in dialect classification is achieved for all corpora tested.
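The idea of making decisions at the word level and then combining them for the utterance can be illustrated with a minimal sketch. This is not the WDC algorithm from the paper: the abstract does not give the scoring or combination rules, so the sketch assumes that each recognized word already has a per-dialect log-likelihood and that the utterance decision is a weighted sum of those scores. The names `word_decision`, `classify_utterance`, and `word_weights` are hypothetical, and the weights are only a stand-in for the dialect dependency information the abstract mentions.

```python
def word_decision(scores):
    """Return the dialect with the highest score for a single word.

    scores: dict mapping dialect label -> log-likelihood (assumed input).
    """
    return max(scores, key=scores.get)


def classify_utterance(word_scores, word_weights=None):
    """Combine word-level scores into one utterance-level decision.

    word_scores: list of (word, {dialect: log_likelihood}) pairs.
    word_weights: optional {word: weight}; hypothetical stand-in for how
        strongly a word depends on dialect. Missing words get weight 1.0.
    Returns (best_dialect, totals) where totals maps dialect -> summed score.
    """
    totals = {}
    for word, scores in word_scores:
        weight = 1.0 if word_weights is None else word_weights.get(word, 1.0)
        for dialect, loglik in scores.items():
            # Assumed combination rule: weighted sum of log-likelihoods.
            totals[dialect] = totals.get(dialect, 0.0) + weight * loglik
    best = max(totals, key=totals.get)
    return best, totals
```

With this rule, a word that carries little dialect information can be given a small weight so that it barely influences the utterance decision, while a strongly dialect-dependent word dominates it.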

Bibliographic record

Source: IEEE Transactions on Audio, Speech and Language Processing, 2007, no. 2, pp. 453-464 (12 pages)
Authors: Huang R.; Hansen J. H. L.; Angkititrakul P.
Original format: PDF
Language of text: English
Chinese Library Classification: automation technology; computer technology
Keywords: Gaussian processes; hidden Markov models; natural language processing; regression analysis; speech recognition; AdaBoost algorithm; English; Gaussian mixture models; accent classification; context adapted training algorithm; decision strategy; dialect-dependent wor;
