Language Identification Using Deep Convolutional Recurrent Neural Networks


Abstract

Language Identification (LID) systems classify the spoken language of a given audio sample and are typically the first step in many spoken language processing tasks, such as Automatic Speech Recognition (ASR). Without automatic language detection, speech utterances cannot be parsed correctly and grammar rules cannot be applied, causing subsequent speech recognition steps to fail. We propose an LID system that solves the problem in the image domain rather than the audio domain. We use a hybrid Convolutional Recurrent Neural Network (CRNN) that operates on spectrogram images of the provided audio snippets. In extensive experiments we show that our model is applicable to a range of noisy scenarios and can easily be extended to previously unknown languages while maintaining its classification accuracy. We release our code and a large-scale training set for LID systems to the community.
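To illustrate what "spectrogram images of the provided audio snippets" means in practice, the sketch below turns a short audio signal into a 2-D time-frequency array using a windowed short-time Fourier transform. It is a minimal illustration only, not the authors' preprocessing pipeline: the function names `hann` and `spectrogram`, the frame length, the hop size, and the synthetic 1 kHz test tone are all assumptions made for this example.

```python
import cmath
import math


def hann(n: int) -> list[float]:
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / n) for i in range(n)]


def spectrogram(samples, frame_len=64, hop=32):
    """Magnitude spectrogram via a naive short-time DFT.

    Returns a list of frames; each frame holds the magnitudes of
    frequency bins 0..frame_len // 2. Parameter values are illustrative
    assumptions, not values taken from the paper.
    """
    window = hann(frame_len)
    frames = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        # Apply the window to reduce spectral leakage at frame edges.
        frame = [samples[start + i] * window[i] for i in range(frame_len)]
        mags = []
        for k in range(frame_len // 2 + 1):
            acc = sum(
                frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                for n in range(frame_len)
            )
            mags.append(abs(acc))
        frames.append(mags)
    return frames


# Hypothetical input: a 1 kHz tone sampled at 8 kHz (512 samples).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000 * t / sample_rate) for t in range(512)]
spec = spectrogram(tone)

# Each bin spans sample_rate / frame_len = 125 Hz, so the tone falls in bin 8.
peak_bins = [max(range(len(f)), key=f.__getitem__) for f in spec]
```

The resulting array (time frames by frequency bins) can be treated as a single-channel image, which is the kind of input a convolutional front end followed by a recurrent layer would consume. A practical system would normally use an FFT-based library routine instead of this naive DFT.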


Bibliographic record

Source
International Conference on Neural Information Processing | 2017 | 912p | 10 pages in total
Authors
Christian Bartz; Tom Herold; Haojin Yang; Christoph Meinel
Original format: PDF
Chinese Library Classification: TP183-53

