...
首页> 外文期刊>Digital Signal Processing >An adapted data selection for deep learning-based audio segmentation in multi-genre broadcast channel
【24h】

An adapted data selection for deep learning-based audio segmentation in multi-genre broadcast channel

机译:多类型广播频道中基于深度学习的音频分割的适应性数据选择

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Broadcast audio transcription is still a challenging problem because of the complexity of diverse speech and audio signals. Audio segmentation, which is an essential module in a broadcast audio transcription system, has benefited greatly from the development of deep learning theory. However, the need of large amounts of labeled training data becomes a bottleneck of deep learning-based audio segmentation methods. To tackle this problem, an adapted segmentation method is proposed to select speech/non-speech segments with high confidence from unlabeled training data as complements to the labeled training data. The new method relies on GMM-based speech/non-speech models trained on an utterance-by-utterance basis. The long-term information is used to choose reliable training data for speech/non-speech models from the utterances at hand. Experimental results show that this data selection method is a powerful audio segmentation algorithm of its own. We also observed that the deep neural networks trained using data selected by this method are superior to those trained with data chosen by two comparing methods. Moreover, better performance could be obtained by combining the deep learning-based audio segmentation method with the adapted data selection method.
机译:由于不同语音和音频信号的复杂性,广播音频转录仍然是一个具有挑战性的问题。作为广播音频转录系统中的重要模块,音频分割,从深度学习理论的发展中受益匪浅。然而,大量标记训练数据的需要成为基于深度学习的音频分段方法的瓶颈。为了解决这个问题,提出了一种适应的分割方法,以从未标记的训练数据从未与标记的训练数据的补充选择具有高信心的语音/非语音段。新方法依赖于基于GMM的语音/非语音模型,逐字培训。长期信息用于从手中的话语中选择用于语音/非语音模型的可靠培训数据。实验结果表明,该数据选择方法是其自身的强大音频分段算法。我们还观察到使用该方法选择的数据训练的深神经网络优于由两种比较方法选择的数据训练的数据。此外,通过将基于深度学习的音频分割方法与适应的数据选择方法组合来获得更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号