首页> 外文会议>International Conference on Control, Communication Computing India >Pair-wise language discrimination using phonotactic information
【24h】

Pair-wise language discrimination using phonotactic information

机译:配对语言歧视使用致素发音信息

获取原文

摘要

This paper describes a novel method for automatic language identification using phonotactics. Conventional phonotactic approach using N-gram language modeling requires several hours of speech data along with the corresponding orthographic transcriptions, which is not available for many of the Indian languages. This paper proposes a method which captures the language discriminating cue in co-occurance of phones using limited data. Here speech utterance is decoded into a sequence of chosen phones using an automatic phone recognizer. A unique code is assigned for each phone to obtain feature vectors corresponding to five consecutive phones. These feature vectors are then used to train a neural network / SVM based classifier at the back-end. A pair-wise language discrimination system for Hindi and Malayalam is developed using manual and automatic transcriptions.
机译:本文介绍了使用致素发音的自动语言识别的新方法。使用N-GRAM语言建模的传统音牙方法需要几个小时的语音数据以及相应的正交转录,这对于许多印度语言不可用。本文提出了一种方法,该方法使用有限数据捕获手机共同发生中的语言辨别提示。这里使用自动电话识别器解码语音话语被解码为一系列所选电话。为每辆手机分配唯一代码以获取对应于连续五个电话的特征向量。然后使用这些特征向量在后端训练基于神经网络/ SVM的分类器。使用手动和自动转录开发了一种用于印地语和Malayalam的一对语言辨别系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号