SpeeD's DNN approach to Romanian speech recognition

Abstract

This paper presents the main improvements recently made to the large-vocabulary continuous speech recognition (LVCSR) system for the Romanian language developed by the Speech and Dialogue (SpeeD) research laboratory. The most important improvement is the use of DNN-based acoustic models instead of the classic HMM-GMM approach, but several other aspects are also discussed: a significant increase in the size of the speech training corpus; the use of additional algorithms for feature processing, speaker adaptive training, and discriminative training; and lattice rescoring with significantly expanded language models (n-gram models up to order 5, based on vocabularies of up to 200k words). The ASR experiments were performed with several types of acoustic and language models, in different configurations, on the standard read and conversational speech corpora created by SpeeD in 2014. The results show that extending the training speech corpus yields a relative word error rate (WER) improvement of between 15% and 17%, while replacing HMM-GMM-based acoustic models with DNN-based ones yields a relative WER improvement of between 18% and 23%, depending on the nature of the evaluation speech corpus (read or conversational, clean or noisy). The best configuration of the LVCSR system was integrated into a live transcription web application, available online on the SpeeD laboratory's website at https://speed.pub.ro/live-transcriber-2017.
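
The relative WER improvements quoted in the abstract can be illustrated with a short sketch. The code below is not taken from the paper: it assumes the usual definition of WER (word-level edit distance divided by the number of reference words) and the usual definition of relative improvement, (baseline - new) / baseline. The function names and the sample strings are hypothetical and chosen only for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count.

    Assumes whitespace tokenization; real scoring tools also normalize text.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Word-level Levenshtein distance, keeping only the previous DP row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def relative_wer_improvement(baseline_wer: float, new_wer: float) -> float:
    """Fraction by which new_wer is lower than baseline_wer."""
    if baseline_wer <= 0:
        raise ValueError("baseline WER must be positive")
    return (baseline_wer - new_wer) / baseline_wer
```

As a purely hypothetical example, a system whose WER drops from 20% to 16% has improved by 4 points in absolute terms, but `relative_wer_improvement(20.0, 16.0)` reports a 20% relative improvement. The figures in the abstract are of this relative kind.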

Bibliographic details

Source
International Conference on Speech Technology and Human-Computer Dialogue | 2017 | 161p | 8 pages
Authors
Alexandru-Lucian Georgescu; Horia Cucu; Corneliu Burileanu;
Original format: PDF
Chinese Library Classification: TN912-53
Keywords
Hidden Markov models; Acoustics; Training; Speech; Adaptation models; Speech recognition; Biological neural networks;
Date indexed: 2022-08-20 23:19:03

Similar literature

1. DNN based continuous speech recognition system of Punjabi language on Kaldi toolkit [J]. Jyoti Guglani, A. N. Mishra. International Journal of Speech Technology, 2021, Issue 1
2. Advances in subword-based HMM-DNN speech recognition across languages [J]. Peter Smit, Sami Virpioja, Mikko Kurimo. Computer Speech and Language, 2021, Issue Mara
3. Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition [J]. Novotny Ondrej, Plchot Oldrich, Glembek Ondrej. Computer Speech and Language, 2019, Issue NOVa
4. SpeeD's DNN approach to Romanian speech recognition [C]. Alexandru-Lucian Georgescu, Horia Cucu, Corneliu Burileanu. International Conference on Speech Technology and Human-Computer Dialogue, 2017
5. A multimodal fusion approach for automatic postal address recognition system using Optical Character Recognition (OCR) and Automatic Speech Recognition (ASR) techniques [D]. Singh, Amriteshwar. 2011
6. Effect of (Mis)Matched Compression Speed on Speech Recognition in Bimodal Listeners [O]. Dimitar Spirrov, Eugen Kludt, Eline Verschueren. 2020
7. Advances in subword-based HMM-DNN speech recognition across languages [O]. Peter Smit, Sami Virpioja, Mikko Kurimo. 2021
8. An articulatorily constrained, maximum entropy approach to speech recognition and speech coding [R]. 1996
