首页> 外文会议> >Language-identification using language-dependent phonemes and language-independent speech units
【24h】

Language-identification using language-dependent phonemes and language-independent speech units

机译:使用与语言相关的音素和与语言无关的语音单元进行语言识别

获取原文

摘要

The paper reports on results from ongoing research on language identification (LID) performed on the three languages: American-English, German and Spanish. The speech material used is from the Oregon Graduate Institute Spontaneous Telephone Speech Corpus, OGI-TS. The baseline LID system consists of three parallel phoneme recognisers, each of which are followed by three language modelling modules each characterising the bigram probabilities. The phoneme models used are derived on the basis of the combined speech corpus comprising the three languages. The phonemes are handled differently in analysis performed in two experiments. In the first experiment they are trained and tested language specifically. In the second, they are separated into a number of groups, one of which contains those language independent speech units which are similar enough to be equated across the training languages, the remaining containing the non combinable language dependent phonemes for each of the languages. A data driven technique has been devised to separate the speech sounds contained within the training corpus into these groups. In order to prepare for an optimal separation between the input classes, a linear discriminant analysis is performed on the training speech material. Results from a number of experiments show that average language identification scores of close to 90% can be retained by the LID system presented here, even for a high number of language independent speech units.
机译:本文报告了对三种语言进行了语言识别(盖子)的持续研究的结果:美国英语,德语和西班牙语。使用的语音材料来自俄勒冈州毕业研究所自发电话语音语料库,ogi-ts。基线盖系统由三个并行音素识别器组成,每个识别器之后是三种语言建模模块,每个都表征Bigram概率。使用的音素模型是基于包括三种语言的组合语音语料库。在两个实验中进行的分析中不同地处理音素。在第一个实验中,他们专门培训和测试语言。在第二中,它们被分成了多个组,其中一个组包含这些语言独立的语音单元,这些语言与培训语言相似,其余包含每个语言的非组合语言依赖性音素。已经设计了一种数据驱动技术,以将培训语料库中包含的语音分开到这些组中。为了准备输入类别之间的最佳分离,对训练言语材料进行线性判别分析。来自许多实验的结果表明,即使对于大量语言独立语音单元,盖子系统也可以保留接近90%的平均语言识别分数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号