首页> 外文会议>International Workshop on Soft Computing Models in Industrial Applications >Language Identification for Under-ResourcedLanguages in the Basque Context
【24h】

Language Identification for Under-ResourcedLanguages in the Basque Context

机译:巴斯克语境中的资源下的语言识别

获取原文

摘要

Automatic Speech Recognition (ASR) is a broad research area that ab-sorbs many efforts from the research community. The interest on MultilingualSystems arouses in the Basque Country because there are three official languages(Basque, Spanish, and French), and there is much linguistic interaction amongthem, even if Basque has very different roots than the other two languages. Thedevelopment of Multilingual Large Vocabulary Continuous Speech Recognitionsystems involves issues as: Language Identification, Acoustic Phonetic Decoding,Language Modeling or the development of appropriate Language Resources. Thispaper describes the development of a Language Identification (LID) system ori-ented to robust Multilingual Speech Recognition in the Basque context. The workpresents hybrid strategies for LID, based on the selection of system elements byseveral classifiers and Discriminant Analysis improved with robust regularizedcovariance matrix estimation methods oriented to under-resourced languages andstochastic methods for speech recognition tasks (Hidden Markov Modelsand n-grams).
机译:自动语音识别(ASR)是一项广泛的研究区,即Ab-Sorb来自研究界的努力。对波斯克国家的兴趣引起了巴斯克国家,因为有三种官方语言(巴斯克,西班牙语和法语),并且即使巴斯克的根源与另外两种语言具有非常不同的根,也存在许多语言互动。多语言大词汇表连续语音识别系统涉及问题:语言识别,声学语音解码,语言建模或适当语言资源的开发。此纸纸描述了在巴斯克语境中的语言识别(LID)系统的开发,以强大的多语言语音识别。基于系统元素的选择对称分类器和判别分析的工作人员进行混合策略,并通过稳健的规则化转换矩阵估计方法,以资源为导向,并对语音识别任务进行了交换的语言(隐藏的Markov Models和N-Grams)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号