Conference: 2013 International Conference on Oriental COCOSDA

Development of speech corpora in Gujarati and Marathi for phonetic transcription

Abstract

There has been growing interest in using speech technology in rural areas. In this context, this paper describes the development of speech corpora in two Indian languages, Gujarati and Marathi, recorded in remote villages, for the task of phonetic transcription. It also presents a related analysis of the phonetic transcription. Manual phonetic transcription was performed for both languages on 8 hours of field-recorded speech data collected in real-life settings. Dialectal variations are also analyzed using spectrograms and the phonetic transcription. In addition, it was found that, among consonant sounds, plosives have large coverage within the broad phonetic categories. The collected speech corpora can be useful for speech and speaker recognition tasks.