Development of speech corpora in Gujarati and Marathi for phonetic transcription

机译：在古吉拉特语和马拉地语中发展语音语料库以进行语音转录

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

There have been growing interest to use speech technology for rural areas. In this context, this paper describes the development of speech corpora in Indian languages (viz., Gujarati and Marathi from remote villages) for the task of phonetic transcription. This paper also presents related analysis of phonetic transcription. The manual phonetic transcription was done for two Indian languages, viz., Gujarati and Marathi for 8 hours of field recorded speech data in real-life settings. Dialectal variations are also analyzed using spectrograms and phonetic transcription. In addition, it was found that for consonant sounds, plosive sounds are having large coverage in broad phonetic category. The collected speech corpora can be very useful for speech and speaker recognition tasks.

机译：在农村地区使用语音技术的兴趣日益浓厚。在这种情况下，本文描述了印度语语音语料库（来自偏远村庄的古吉拉特语和马拉地语）的发展，用于语音转录的任务。本文还介绍了语音转录的相关分析。手动语音转录是针对两种印度语言（即古吉拉特语和马拉地语）进行的，在真实环境中进行了8小时的现场记录语音数据。方言变化也可以通过频谱图和语音转录进行分析。另外，发现对于辅音，爆破音在广泛的语音类别中具有较大的覆盖范围。收集的语音语料库对于语音和说话者识别任务非常有用。

著录项

来源
《2013 International Conference on Oriental COCOSDA》|2013年|1-6|共6页
会议地点 Gurgaon(IN)
作者
Malde Kewal D.; Vachhani Bhavik B.; Madhavi Maulik C.; Chhayani Nirav H.;
展开▼
作者单位

Dhirubhai Ambani Institute of Information and Communication Technology Gandhinagar, Gujarat, Indiac;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Database collection; Indian languages; dialectal variation; phonetic transcription;

机译：数据库集合;印度语言;方言变化;语音转录;;
入库时间 2022-08-26 14:02:11

相似文献

外文文献
中文文献
专利

1. Automatic phonetic transcription of large speech corpora [J] . Christophe Van Bael, Lou Boves, Henk van den Heuvel, Computer speech and language . 2007,第4期

机译：大型语音语料库的自动语音转录
2. Phonetically rich and balanced text and speech corpora for Arabic language [J] . Mohammad A. M. Abushariah, Raja N. Ainon, Roziati Zainuddin, Language Resources and Evaluation . 2012,第4期

机译：语音丰富且平衡的阿拉伯语文字和语音语料库
3. A Review on Marathi Language Speech Database Development for Automatic Speech Recognition (ASR) System [J] . Mrs. Chhaya S. Patil, Prof.Dr.Vaishali B.Patil International Journal of Engineering Research and Applications . 2017,第3期

机译：用于自动语音识别（ASR）系统的Marathi语言语音数据库开发的回顾
4. Development of speech corpora in Gujarati and Marathi for phonetic transcription [C] . Malde Kewal D., Vachhani Bhavik B., Madhavi Maulik C., International Conference on Oriental COCOSDA . 2013

机译：古吉拉蒂和马拉萨语音转录的发展
5. Experiments on automatic phonetic segmentation and transcription of speech. [D] . Lennig, Matthew. 1984

机译：自动语音分割和语音转录的实验。
6. Conventions for sign and speech transcription of child bimodal bilingual corpora in ELAN [O] . Deborah Chen Pichler, Julie A. Hochgesang, Diane Lillo-Martin, -1

机译：伊朗儿童双峰双语语料库的签署和语音转录的公约
7. A Phonetic Study for Constructing a Database of Gujarati Characters for Speech Synthesis of Gujarati Text [O] . Prof Jj Kothari 2015

机译：古吉拉特语文本合成古吉拉特语数据库构建的语音研究
8. Military Typesetting Equipment and Systems for Indo-Aryan and Dravidian Languages (Hindi, Marathi, Bengali, Punjabi, Gujarati, Malayalam, Tamil, and Telugu) (1961-1963) [R] . Nitenson, E. 1964

机译：印度 - 雅利安语和德拉威语的军事排版设备和系统（印地语，马拉地语，孟加拉语，旁遮普语，古吉拉特语，马拉雅拉姆语，泰米尔语和泰卢固语）（1961-1963）

Development of speech corpora in Gujarati and Marathi for phonetic transcription

摘要

著录项

相似文献

相关主题

期刊订阅