Supervised and unsupervised active learning for automatic speech recognition of low-resource languages

机译：有监督和无监督的主动学习，可自动识别低资源语言的语音

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Automatic speech recognition (ASR) systems rely on large quantities of transcribed acoustic data. The collection of audio data is relatively cheap, whereas the transcription of that data is relatively expensive. Thus there is an interest in the ASR community in active learning, in which only a small subset of highly representative data chosen from a large pool of untranscribed audio need be transcribed in order to approach the performance of the system trained with much larger amounts of transcribed audio. In this paper, we compare two basic approaches to active learning: a supervised approach in which we build a speech recognition system from a small amount of seed data in order to make the selection of a limited amount of additional audio for transcription, and an unsupervised approach in which no intermediate system recognition system built from seed data is necessary. Our best unsupervised approach performs quite close to our supervised approach, with both outperforming a random selection scheme.

机译：自动语音识别（ASR）系统依赖于大量转录的声学数据。音频数据的收集相对便宜，而该数据的转录则相对昂贵。因此，ASR社区对主动学习产生了兴趣，在这种学习中，仅转录从大量未转录音频中选择的极具代表性的数据的一小部分，即可达到通过大量转录而训练的系统的性能声音的。在本文中，我们比较了主动学习的两种基本方法：一种有监督的方法，其中我们从少量的种子数据中构建了一个语音识别系统，以便选择数量有限的其他音频进行转录;以及一种无监督的方法。这种方法不需要由种子数据构建的中间系统识别系统。我们最好的无监督方法在性能上与我们的监督方法非常接近，两者均优于随机选择方案。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2016年|5320-5324|共5页
会议地点
作者
Ali Raza Syed; Andrew Rosenberg; Ellen Kislal;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
active learning; limited-resource automatic speech recognition; supervised active learning; unsupervised active learning;

机译：主动学习;有限资源自动语音识别;监督主动学习;无监督主动学习;

相似文献

外文文献
中文文献
专利

1. Investigation of Automatic Speech Recognition Systems via the Multilingual Deep Neural Network Modeling Methods for a Very Low-Resource Language, Chaha [J] . Tessfu Geteye Fantaye, Junqing Yu, Tulu Tilahun Hailu Journal of Signal and Information Processing . 2020,第1期

机译：Chaha非常低于资源语言的多语言深神经网络建模方法对自动语音识别系统的研究
2. Investigation of Automatic Speech Recognition Systems via the Multilingual Deep Neural Network Modeling Methods for a Very Low-Resource Language, Chaha [J] . Tessfu Geteye Fantaye, Junqing Yu, Tulu Tilahun Hailu 信号与信息处理（英文） . 2020,第001期

机译：资源非常少的语言Chaha通过多语言深层神经网络建模方法研究自动语音识别系统
3. Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition [J] . Akinori Ito, Yasutomo Kajiura, Motoyuki Suzuki, EURASIP journal on audio, speech, and music processing . 2009,第009期

机译：语音识别的无监督语言模型自适应自动查询生成和查询相关性度量
4. Supervised and unsupervised active learning for automatic speech recognition of low-resource languages [C] . Ali Raza Syed, Andrew Rosenberg, Ellen Kislal IEEE International Conference on Acoustics, Speech and Signal Processing . 2016

机译：用于低资源语言的自动语音识别的监督和无人监督的主动学习
5. Automatic Speech Recognition for Low-Resource and Morphologically Complex Languages [D] . Morris, Ethan. 2021

机译：用于低资源和形态复杂语言的自动语音识别
6. Task‐induced brain functional connectivity as a representation of schema for mediating unsupervised and supervised learning dynamics in language acquisition [O] . Hiroyuki Akama, Yixin Yuan, Shunji Awazu 2021

机译：任务诱导的脑功能连通性作为调解语言习得中无监督和监督学习动态的模式的代表
7. Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition [O] . 2009

机译：语音识别的无监督语言模型自适应自动查询生成和查询相关性度量

Supervised and unsupervised active learning for automatic speech recognition of low-resource languages

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅