首页> 外文期刊>Neural Networks: The Official Journal of the International Neural Network Society >Frame-by-frame language identification in short utterances using deep neural networks
【24h】

Frame-by-frame language identification in short utterances using deep neural networks

机译:使用深度神经网络在短话语中逐帧识别语言

获取原文
获取原文并翻译 | 示例
           

摘要

This work addresses the use of deep neural networks (DNNs) in automatic language identification (LID) focused on short test utterances. Motivated by their recent success in acoustic modelling for speech recognition, we adapt DNNs to the problem of identifying the language in a given utterance from the short-term acoustic features. We show how DNNs are particularly suitable to perform LID in real-time applications, due to their capacity to emit a language identification posterior at each new frame of the test utterance. We then analyse different aspects of the system, such as the amount of required training data, the number of hidden layers, the relevance of contextual information and the effect of the test utterance duration. Finally, we propose several methods to combine frame-by-frame posteriors. Experiments are conducted on two different datasets: the public NIST Language Recognition Evaluation 2009 (3 s task) and a much larger corpus (of 5 million utterances) known as oogle 5M LID, obtained from different Google Services. Reported results show relative improvements of DNNs versus the i-vector system of 40% in LRE09 3 second task and 76% in Google 5M LID. (C) 2014 Elsevier Ltd. All rights reserved.
机译:这项工作解决了深度神经网络(DNN)在自动语言识别(LID)中的使用,重点是简短的测试话语。由于他们最近在语音识别的声学建模方面取得的成功,我们使DNN适应了从短期声学特征以给定发音识别语言的问题。我们将展示DNN如何特别适合在实时应用中执行LID,这是因为DNN能够在测试发声的每个新帧后方发出语言标识。然后,我们分析系统的不同方面,例如所需训练数据的数量,隐藏层的数量,上下文信息的相关性以及测试话语持续时间的影响。最后,我们提出了几种组合逐帧后验的方法。实验是在两个不同的数据集上进行的:公开的NIST语言识别评估2009(3秒钟的任务)和从不同的Google服务获得的称为oogle 5M LID的更大的语料库(500万语音)。报告的结果显示,与i-vector系统相比,DNN的相对改进在LRE09 3秒任务中为40%,在Google 5M LID中为76%。 (C)2014 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号