首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Investigation of Methods to Improve the Recognition Performance of Tamil-English Code-Switched Data in Transformer Framework

【24h】

Investigation of Methods to Improve the Recognition Performance of Tamil-English Code-Switched Data in Transformer Framework

机译：在变压器框架中提高泰米尔语-英语代码交换数据的识别性能的方法的研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Code-switching (CS) refers to (inter/intra-word) switching between multiple languages in a single conversation. In multilingual countries like India, CS occurs very often in everyday speech, resulting in a new breed of languages in urban regions like Hinglish (Hindi-English), Tanglish (Tamil-English), etc. Research in Indic CS speech recognition is primarily affected by insufficient data. In this paper, we investigate methods to deal with such very low resource scenarios. Recently, Transformers have shown promising results on automatic speech recognition (ASR) tasks. In a Transformer based framework, we investigate two methods for Tamil-English CS speech recognition, namely, (i) well-trained encoders of Monolingual Transformers as feature extractors to provide language discrimination, (ii) language information as tokens at the targets. Our results show that CS is efficiently handled by the second method, while the first method was efficient in discriminating languages.

机译：代码切换（CS）是指单个对话中多种语言之间的（/内/内）切换。在像印度这样的多语种国家，CS经常发生在日常演讲中，导致城市地区的新品种，如HINGISH（Hindi-English），Tanglish（泰米尔英语）等。在CS语音识别中的研究主要受到影响数据不足。在本文中，我们调查了处理如此低的资源方案的方法。最近，变形金刚在自动语音识别（ASR）任务上显示了有希望的结果。在基于变压器的框架中，我们调查了泰米尔英语CS语音识别的两种方法，即我们的结果表明，CS通过第二种方法有效处理，而第一种方法以辨别语言有效。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing 》|2020年|7889-7893|共5页
会议地点
作者
Metilda Sagaya Mary N J; Vishwas M. Shetty; S. Umesh;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Transformers; code-switched speech; low resource speech recognition;

机译：变压器;代码转换语音;低资源语音识别;

相似文献

外文文献
中文文献
专利

1. Improving diagnostic performance of a power transformer using an adaptive over-sampling method for imbalanced data [J] . Viet Tra, Bach-Phi Duong, Jong-Myon Kim IEEE Transactions on Dielectrics and Electrical Insulation . 2019 ,第4期

机译：使用自适应过采样方法针对不平衡数据提高电力变压器的诊断性能
2. An Improved Framework for Recognizing Highly Imbalanced Bilingual Code-Switched Lectures with Cross-Language Acoustic Modeling and Frame-Level Language Identification [J] . Yeh Ching-Feng, Lee Lin-Shan Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2015 ,第7期

机译：跨语言声学建模和框架级语言识别的高度识别双语代码转换演讲的改进框架
3. Investigation of ANFIS and FFBNN Recognition Methods Performance in Tamil Speech Word Recognition [J] . S. Rojathai, M. Venkatesulu International journal of software innovation . 2014 ,第2期

机译：ANFIS和FFBNN识别方法在泰米尔语语音单词识别中的性能研究
4. Investigation of Methods to Improve the Recognition Performance of Tamil-English Code-Switched Data in Transformer Framework [C] . Metilda Sagaya Mary N J, Vishwas M. Shetty, S. Umesh IEEE International Conference on Acoustics, Speech and Signal Processing . 2020

机译：改善变压器框架中泰米尔英语代码交换数据识别性能的方法研究
5. A Database Tuning Framework for Improving Stored Procedure Performance [D] . ?Suver, Nathan 2020

机译：用于提高存储过程性能的数据库调整框架
6. Comparative Investigation on the Performance of Modified System Poles and Traditional System Poles Obtained from PDC Data for Diagnosing the Ageing Condition of Transformer Polymer Insulation Materials [O] . Jiefeng Liu, Hanbo Zheng, Yiyi Zhang, 2018

机译：从PDC数据获取的用于诊断变压器聚合物绝缘材料老化状况的修正系统极与传统系统极性能的比较研究。
7. Transformer-Transducers for Code-Switched Speech Recognition [O] . Siddharth Dalmia, Yuzong Liu, Srikanth Ronanki, 2021

机译：用于代码切换语音识别的变压器传感器

Investigation of Methods to Improve the Recognition Performance of Tamil-English Code-Switched Data in Transformer Framework

摘要

著录项

相似文献

相关主题

期刊订阅