Conference: International Conference on Speech Database and Assessments

Bengali speech corpus for continuous automatic speech recognition system


Abstract

This paper presents the development of a Bengali speech corpus for speaker-independent continuous speech recognition. A speech corpus is the backbone of an automatic speech recognition (ASR) system. Speech corpora can be classified in several ways; for example, a corpus may be language dependent or age dependent. We have developed a speech corpus for two age groups: a younger group aged 20 to 40 years and an older group aged 60 to 80 years. We have created phone-labeled and triphone-labeled speech corpora. Initially, the speech samples are aligned using a statistical modeling technique; the Hidden Markov Model Toolkit (HTK) was used to align the speech data. The statistically labeled files are then corrected manually. We measured phoneme recognition and continuous word recognition performance to check the quality of the speech corpus.
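To illustrate how phone labels relate to triphone labels, the following is a minimal sketch that expands a phone sequence into context-dependent labels written in the `l-p+r` notation commonly used with HTK. The helper name `to_triphones` and the example phones are hypothetical placeholders, not the authors' tooling or the paper's Bengali phone set, and the sketch ignores special handling of silence and word boundaries.

```python
from typing import List


def to_triphones(phones: List[str]) -> List[str]:
    """Expand a phone sequence into context-dependent labels (l-p+r).

    Hypothetical helper for illustration only. Phones at the start or
    end of the sequence lack a left or right neighbour, so they become
    biphones (p+r or l-p); a single phone stays a monophone.
    """
    labels = []
    last = len(phones) - 1
    for i, phone in enumerate(phones):
        left = f"{phones[i - 1]}-" if i > 0 else ""      # left context, if any
        right = f"+{phones[i + 1]}" if i < last else ""  # right context, if any
        labels.append(f"{left}{phone}{right}")
    return labels


if __name__ == "__main__":
    # Placeholder phone symbols, assumed for the example.
    print(to_triphones(["k", "a", "l"]))  # ['k+a', 'k-a+l', 'a-l']
```

A triphone-labeled corpus built this way has many more distinct labels than the phone-labeled one, which is one reason the two label sets are usually kept as separate annotations.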
