首页> 外文会议>International Conference on Neural Information Processing >Language Identification Using Deep Convolutional Recurrent Neural Networks
【24h】

Language Identification Using Deep Convolutional Recurrent Neural Networks

机译:使用深卷积经常性神经网络的语言识别

获取原文

摘要

Language Identification (LID) systems are used to classify the spoken language from a given audio sample and are typically the first step for many spoken language processing tasks, such as Automatic Speech Recognition (ASR) systems. Without automatic language detection, speech utterances cannot be parsed correctly and grammar rules cannot be applied, causing subsequent speech recognition steps to fail. We propose a LID system that solves the problem in the image domain, rather than the audio domain. We use a hybrid Convolutional Recurrent Neural Network (CRNN) that operates on spectrogram images of the provided audio snippets. In extensive experiments we show, that our model is applicable to a range of noisy scenarios and can easily be extended to previously unknown languages, while maintaining its classification accuracy. We release our code and a large scale training set for LID systems to the community.
机译:语言识别(LID)系统用于将口语从给定的音频样本分类,通常是许多口语处理任务的第一步,例如自动语音识别(ASR)系统。没有自动语言检测,无法正确解析语音话语,无法应用语法规则,导致后续的语音识别步骤失败。我们提出了一种盖子系统,可以解决图像域中的问题,而不是音频域。我们使用的混合卷积经常性神经网络(CRNN),其在提供的音频片段的谱图图像上运行。在我们展示的广泛实验中,我们的模型适用于一系列嘈杂的场景,并且可以轻松扩展到以前未知的语言,同时保持其分类准确性。我们释放了我们的代码和大规模培训,用于社区的盖系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号