Dialect Classification via Text-Independent Training and Testing for Arabic, Spanish, and Chinese

Lei Y.Hansen J. H. L.

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Dialect Classification via Text-Independent Training and Testing for Arabic, Spanish, and Chinese

【24h】

Dialect Classification via Text-Independent Training and Testing for Arabic, Spanish, and Chinese

机译：通过独立于文本的培训和测试对阿拉伯语，西班牙语和汉语进行方言分类

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Automatic dialect classification has emerged as an important area in the speech research field. Effective dialect classification is useful in developing robust speech systems, such as speech recognition and speaker identification. In this paper, two novel algorithms are proposed to improve dialect classification for text-independent spontaneous speech in Arabic and Spanish languages, along with probe results for Chinese. The problem considers the case where no transcripts but dialect labels are available for training and test data, and speakers are speaking spontaneously, which is defined as text-independent dialect classification. The Gaussian mixture model (GMM) is used as the baseline system for text-independent dialect classification. The major motivation is to suppress confused/distractive regions from the dialect language space and emphasize discriminative/sensitive information of the available dialects. In the training phase, a symmetric version of the Kullback–Leibler divergence is used to find the most discriminative GMM mixtures (KLD-GMM), where the confused acoustic GMM region is suppressed. For testing, the more discriminative frames are detected and used via the location of where the frames are in the GMM mixture feature space, which is termed frame selection decoding (FSD-GMM). The first KLD-GMM and second FSD-GMM techniques, are shown to improve dialect classification performance for three-way dialect tasks. The two algorithms and their combination are evaluated on dialects of Arabic and Spanish corpora. Measurable improvement is achieved in both two cases, over a generalized maximum-likelihood estimation GMM baseline (MLE-GMM).

机译：自动方言分类已成为语音研究领域的重要领域。有效的方言分类对于开发健壮的语音系统（例如语音识别和说话者识别）很有用。本文提出了两种新颖的算法来改善阿拉伯文和西班牙文与文本无关的自发语音的方言分类以及中文的探测结果。该问题考虑了以下情况：没有笔录而是方言标签可用于训练和测试数据，并且说话者自发讲话，这被定义为与文本无关的方言分类。高斯混合模型（GMM）用作独立于文本的方言分类的基准系统。主要动机是抑制方言语言空间中的混淆/分散区域，并强调可用方言的区分/敏感信息。在训练阶段，使用Kullback-Leibler散度的对称形式来查找最有区别的GMM混合（KLD-GMM），在该混合中，混淆的声学GMM区域被抑制。为了进行测试，通过帧在GMM混合特征空间中的位置来检测和使用更具区分性的帧，这被称为帧选择解码（FSD-GMM）。显示了第一种KLD-GMM和第二种FSD-GMM技术可提高三向方言任务的方言分类性能。对阿拉伯语和西班牙语语料库的方言评估了这两种算法及其组合。在这两种情况下，都可以通过广义最大似然估计GMM基线（MLE-GMM）实现可衡量的改善。

著录项

来源
《Audio, Speech, and Language Processing, IEEE Transactions on》 |2011年第1期|p.85-96|共12页
作者
Lei Y.Hansen J. H. L.;
展开▼
作者单位

CenterforRobustSpeechSystems(CRSS),TheUniversityofTexasatDallas,Richardson,TX,USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Arabic dialects; Gaussian mixture; Kullback–Leibler divergence; Spanish dialects; dialect classification; frame selection;

机译：阿拉伯方言;高斯混合;Kullback-Leibler散度;西班牙方言;方言分类;框架选择;

相似文献

外文文献
中文文献
专利

1. Dictionary of Arabic and Allied Loanwords: Spanish, Portuguese, Catalan, Galician and Kindred Dialects [J] . Stuart James Reference reviews . 2009,第6期

机译：阿拉伯和盟国外来语字典：西班牙语，葡萄牙语，加泰罗尼亚语，加利西亚语和同类
2. Automatic Arabic Dialect Classification Using Deep Learning Models [J] . Leena Lulu, Ashraf Elnagar Procedia Computer Science . 2018,第1期

机译：使用深度学习模型自动进行阿拉伯语方言分类
3. Investigating the effects of gender, dialect, and training size on the performance of Arabic speech recognition [J] . Alsharhan Eiman, Ramsay Allan Language Resources and Evaluation . 2020,第4期

机译：调查性别，方言和培训规模对阿拉伯语演讲表现的影响
4. Self-Training Pre-Trained Language Models for Zero- and Few-Shot Multi-Dialectal Arabic Sequence Labeling [C] . Muhammad Khalifa, Muhammad Abdul-Mageed, Khaled Shaalan Conference of the European Chapter of the Association for Computational Linguistics . 2021

机译：自我训练预先训练的语言模型，用于零射门和少量射击多方面老化阿拉伯语序列标签
5. Memory and comprehension of inferences in complex sentences: A comparison of English, Spanish, Chinese and Arabic. [D] . Bechtold, John Ivan. 1990

机译：复杂句子中的推论的记忆和理解：英语，西班牙语，中文和阿拉伯语的比较。
6. Morphological structure in the Arabic mental lexicon: Parallels between standard and dialectal Arabic [O] . Sami Boudelaa, William D. Marslen-Wilson -1

机译：阿拉伯语心理词典中的形态结构：标准阿拉伯语与方言阿拉伯语之间的平行
7. The Spanish 18th Century and the Study of Arabic: Dialectal Arabic in Father Cañes’s Grammar [O] . Moscoso García, Francisco 2017

机译：18世纪的西班牙语和阿拉伯语的研究：父亲Cañes的语法中的方言阿拉伯语
8. Accurate Arabic Script Language/Dialect Classification. [R] . S. C. Tratz 2014

机译：准确的阿拉伯语脚本语言/方言分类。

Dialect Classification via Text-Independent Training and Testing for Arabic, Spanish, and Chinese

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅