Bengali speech corpus for continuous auutomatic speech recognition system

机译：孟加拉语音语料库用于连续自动语音识别系统

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents Bengali speech corpus development for speaker independent continuous speech recognition. speech corpora is the backbone of automatic speech recognition (ASR) system. Speech corpus can be classified into several class. It may be language dependent or age dependent. We have developed speech corpus for two age groups. Younger group belongs to 20 to 40 years of age whereas older group is distributed into 60 to 80 years. We have created phone and triphone labeled speech corpora. Initially, speech samples are aligned with statistical modeling technique. Statistically labeled files are then pruned by manual correction. Hidden Markov Model Toolkit (HTK) has been used for aligning the speech data. We have observed phoneme recognition and continuous word recognition performance to check speech corpus quality.

机译：本文展示了孟加拉语音语料库开发，为扬声器独立的连续演讲识别。语音Corpora是自动语音识别（ASR）系统的骨干。语音语料库可以分为几个类。它可能是依赖或年龄依赖的语言。我们为两个年龄组开发了言语语料库。年龄较小的群体属于20至40岁，而较大的小组分发给60至80年。我们创建了电话和Triphone标记的演讲语料。最初，语音样本与统计建模技术对齐。然后通过手动校正来修剪统计标记的文件。隐藏的Markov模型工具包（HTK）已用于对齐语音数据。我们观察了音素识别和连续字识别性能来检查语音语料库质量。

著录项

来源
《International Conference on Speech Database and Assessments》|2011年||共5页
会议地点
作者
Das Biswajit; Mandal Sandipan; Mitra Pabitra;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.13-53;
关键词
Bengali speech corpus; HTK; SPHINX; Speech labeling; Speech recognition;

机译：孟加拉语音语料库;htk;sphinx;言语标签;语音识别;

相似文献

外文文献
中文文献
专利

1. Modern standard Arabic speech corpus for implementing and evaluating automatic continuous speech recognition systems [J] . Mohammad Abd-Alrahman Mahmoud Abushariah, Raja Noor Ainon, Roziati Zainuddin, Journal of the Franklin Institute . 2012,第7期

机译：用于实现和评估自动连续语音识别系统的现代标准阿拉伯语语音语料库
2. Aging speech recognition with speaker adaptation techniques: Study on medium vocabulary continuous Bengali speech [J] . Biswajit Das, Sandipan Mandal, Pabitra Mitra, Pattern recognition letters . 2013,第3期

机译：说话人适应技术对语音的老化识别：中词汇连续孟加拉语语音研究
3. Arabic Speaker-Independent Continuous Automatic Speech Recognition Based on a Phonetically Rich and Balanced Speech Corpus [J] . Mohammad Abushariah, Raja Ainon, Roziati Zainuddin, The international arab journal of information technology . 2012,第1期

机译：基于语音丰富均衡的语料库的阿拉伯语独立于说话人的连续自动语音识别
4. Bengali speech corpus for continuous auutomatic speech recognition system [C] . Das Biswajit, Mandal Sandipan, Mitra Pabitra 2011 International Conference on Speech Database and Assessments . 2011

机译：孟加拉语语料库用于连续自动语音识别系统
5. Large-vocabulary speaker-independent continuous speech recognition: The SPHINX system. [D] . Lee, Kai-Fu. 1988

机译：独立于大词汇的说话者的连续语音识别：SPHINX系统。
6. Evaluation of the accuracy of a continuous speech recognition software system in radiology [O] . Kalpana M. Kanal, Nicholas J. Hangiandreou, Anne-Marie G. Sykes, 2000

机译：放射学中连续语音识别软件系统准确性的评估
7. JNAS: Japanese speech corpus for large vocabulary continuous speech recognition research. [O] . Katunobu Itou, Mikio Yamamoto, Kazuya Takeda, 1999

机译：JNAS：日语语音语料库，用于大词汇连续语音识别研究。
8. Use of Computer Speech Understanding in Training: A Preliminary Investigation of a Limited Continuous Speech Recognition Capability. [R] . Porter, J. E., Grady, M. W., Hicklin, M. B., 1977

机译：计算机语音理解在训练中的运用：有限连续语音识别能力的初步研究。

Bengali speech corpus for continuous auutomatic speech recognition system

摘要

著录项

相似文献

相关主题

期刊订阅