
Exploring the use of Common Label Set to Improve Speech Recognition of Low Resource Indian Languages

Abstract

In many Indian languages, written characters are organized according to sound phonetic principles, and the ordering of characters is the same across many of them. However, when training conventional end-to-end (E2E) multilingual speech recognition systems, characters or target subword units from different languages are treated as separate entities. Since only the visual rendering of these characters differs, in this paper we explore the benefits of representing such similar target subword units (e.g., Byte Pair Encoded (BPE) units) through a Common Label Set (CLS). The CLS can be created very easily using automatic methods, since the ordering of characters is the same in many Indian languages. E2E models are trained using a transformer-based encoder-decoder architecture. During testing, given Mel-filterbank features as input, the system outputs a sequence of BPE units in the CLS representation. Depending on the language, we then map the recognized CLS units back to the language-specific grapheme representation. Results show that models trained using the CLS improve over the monolingual baseline and over a multilingual framework with separate symbols for each language. Similar experiments on a subset of the Voxforge dataset also confirm the benefits of the CLS. An extension of this idea is to decode an unseen (zero-resource) language using the CLS-trained model.
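The abstract does not spell out the automatic CLS construction, but because the ISCII-derived Unicode blocks for Indic scripts share the same in-block character ordering, one possible realization is to use a character's offset within its Unicode block as the common label. The Python sketch below illustrates this idea; the block ranges are standard Unicode, while the `<CLS_xx>` label format and the function names grapheme_to_cls / cls_to_grapheme are illustrative assumptions, not taken from the paper.

```python
# Illustrative sketch of an automatic CLS mapping for Indic scripts.
# Block start points are the standard Unicode ranges; the label format and
# language list are assumptions for illustration, not taken from the paper.

INDIC_BLOCK_START = {
    "hindi": 0x0900,      # Devanagari
    "bengali": 0x0980,
    "gujarati": 0x0A80,
    "odia": 0x0B00,
    "tamil": 0x0B80,
    "telugu": 0x0C00,
    "kannada": 0x0C80,
    "malayalam": 0x0D00,
}

def grapheme_to_cls(ch: str, lang: str) -> str:
    """Map a language-specific Indic character to a script-neutral CLS label.

    ISCII-derived Unicode blocks keep the same in-block offset for the same
    sound (e.g. KA is at offset 0x15 in Devanagari, Tamil, Telugu, ...),
    so the offset itself can serve as the common label.
    """
    offset = ord(ch) - INDIC_BLOCK_START[lang]
    if 0 <= offset < 0x80:
        return f"<CLS_{offset:02X}>"
    return ch  # digits, punctuation, spaces pass through unchanged

def cls_to_grapheme(label: str, lang: str) -> str:
    """Inverse mapping: render a CLS label in the requested script."""
    if label.startswith("<CLS_") and label.endswith(">"):
        return chr(INDIC_BLOCK_START[lang] + int(label[5:-1], 16))
    return label

if __name__ == "__main__":
    # The syllable KA written in three different scripts maps to one CLS label.
    assert (grapheme_to_cls("क", "hindi")
            == grapheme_to_cls("க", "tamil")
            == grapheme_to_cls("ക", "malayalam"))
    # A recognized CLS label can be rendered in any target script.
    print(cls_to_grapheme("<CLS_15>", "telugu"))  # -> క
```

Under such a scheme, training text from all languages would first be rewritten into CLS labels, BPE units would be learned on the CLS text, and recognized CLS units would be mapped back to the target script (here via cls_to_grapheme), matching the decoding flow described in the abstract.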

