...
首页> 外文期刊>Computer speech and language >Residual convolutional neural network with attentive feature pooling for end-to-end language identification from short-duration speech
【24h】

Residual convolutional neural network with attentive feature pooling for end-to-end language identification from short-duration speech

机译:带有注意力特征池的残差卷积神经网络用于从短时语音识别端到端语言

获取原文
获取原文并翻译 | 示例
           

摘要

The problem of language identification from speech is tackled in this work. Residual convolutional neural networks are employed to this end, aiming at exploiting the ability of such architectures to take into account large contextual segments of input data. Moreover, learnable attention mechanisms are introduced on top of the convolutional stack for data-driven feature pooling across time, enabling the computation of fixed-dimension representations given varying-length speech segments as input. Training is performed using a combination of language identification and metric learning via triplet loss minimization, aimed at enforcing class separability within the embeddings space. Evaluation is performed across different conditions, such as multi-class classification, short-duration test utterances, and confusing languages, for the closed-set case, while open-set performance is evaluated with the introduction of unseen languages. At test time, end-to-end scoring along with cosine similarity and PLDA are employed, outperforming state-of-the-art benchmark methods, such as i-vectors by improving the average cost by 30% to 40% depending on the evaluation condition. (C) 2019 Elsevier Ltd. All rights reserved.
机译:这项工作解决了从语音中识别语言的问题。为此,采用了残差卷积神经网络,其目的是利用这种体系结构考虑输入数据的较大上下文段的能力。此外,在卷积堆栈的顶部引入了可学习的注意力机制,用于跨时间的数据驱动的特征池,可在给定长度可变的语音段作为输入的情况下,计算固定尺寸的表示形式。培训是通过将语言识别和度量学习结合在一起,并通过三元组损失最小化来进行的,目的是在嵌入空间内增强类的可分离性。对于封闭式案例,评估是在不同条件下进行的,例如多类分类,短期测试话语和令人困惑的语言,而开放式绩效则通过引入看不见的语言来评估。在测试时,采用了端到端评分以及余弦相似度和PLDA,优于最新的基准测试方法,例如i-vector,通过根据评估将平均成本提高了30%至40%健康)状况。 (C)2019 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号