首页> 外文会议>International Conference on Pattern Recognition Association of South Africa and Robotics and Mechatronics >Using generalized maxout networks and phoneme mapping for low resource ASR- a case study on Flemish-Afrikaans
【24h】

Using generalized maxout networks and phoneme mapping for low resource ASR- a case study on Flemish-Afrikaans

机译:使用广义maxout网络和音素映射处理低资源ASR-以佛兰德语-南非语为例

获取原文

摘要

Recently, multilingual deep neural networks (DNNs) have been successfully used to improve under-resourced speech recognizers. Common approaches use either a merged universal phoneme set based on the International Phonetic Alphabet (IPA) or a language specific phoneme set to train a multilingual DNN. In this paper, we investigate the effect of both knowledge-based and data-driven phoneme mapping on the multilingual DNN and its application to an under-resourced language. For the data-driven phoneme mapping we propose to use an approximation of Kullback Leibler Divergence (KLD) to generate a confusion matrix and find the best matching phonemes of the target language for each individual phoneme in the donor language. Moreover, we explore the use of recently proposed generalized maxout network in both multilingual and low resource monolingual scenarios. We evaluate the proposed phoneme mappings on a phoneme recognition task with both HMM/GMM and DNN systems with generalized maxout architecture where Flemish and Afrikaans are used as donor and under-resourced target languages respectively.
机译:最近,多语言深度神经网络(DNN)已成功用于改善资源不足的语音识别器。常用方法是使用基于国际语音字母(IPA)的合并通用音素集或特定于语言的音素集来训练多语言DNN。在本文中,我们研究了基于知识的和基于数据的音素映射对多语言DNN的影响及其在资源匮乏的语言中的应用。对于数据驱动的音素映射,我们建议使用Kullback Leibler散度(KLD)的近似值来生成混淆矩阵,并为施主语言中的每个单个音素找到目标语言的最佳匹配音素。此外,我们探索了在多语言和低资源单语言场景中最近提出的广义maxout网络的使用。我们在HMM / GMM和具有通用maxout架构的DNN系统的音素识别任务上评估建议的音素映射,其中Flemish和Afrikaans分别用作供体和资源不足的目标语言。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号