首页> 外文会议>2015 International Conference on Control, Communication amp; Computing India >Pair-wise language discrimination using phonotactic information
【24h】

Pair-wise language discrimination using phonotactic information

机译:使用语音信息的成对语言歧视

获取原文
获取原文并翻译 | 示例

摘要

This paper describes a novel method for automatic language identification using phonotactics. Conventional phonotactic approach using N-gram language modeling requires several hours of speech data along with the corresponding orthographic transcriptions, which is not available for many of the Indian languages. This paper proposes a method which captures the language discriminating cue in co-occurance of phones using limited data. Here speech utterance is decoded into a sequence of chosen phones using an automatic phone recognizer. A unique code is assigned for each phone to obtain feature vectors corresponding to five consecutive phones. These feature vectors are then used to train a neural network / SVM based classifier at the back-end. A pair-wise language discrimination system for Hindi and Malayalam is developed using manual and automatic transcriptions.
机译:本文介绍了一种新的使用光变法自动识别语言的方法。使用N-gram语言建模的传统音韵方法需要几个小时的语音数据以及相应的正字法转录,这对于许多印度语言而言是不可用的。本文提出了一种使用有限数据捕获电话共现中的语言区分提示的方法。在这里,语音发声使用自动电话识别器解码为一系列选定的电话。为每个电话分配一个唯一的代码以获得对应于五个连续电话的特征向量。这些特征向量然后用于在后端训练基于神经网络/ SVM的分类器。使用手动和自动抄写开发了印地语和马拉雅拉姆语的成对语言歧视系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号