Development of vocal tract length normalized phonetic engine for Gujarati and Marathi languages


Abstract

A phonetic engine (PE) is a system that converts speech sound units into symbols without using any higher-level information (such as semantic or linguistic details). This paper presents the development of PEs for two Indian languages, Gujarati and Marathi. To investigate PE performance, speech recorded in three different modes (read, spontaneous, and lecture) is considered. The database for each language contains a large number of speakers in each mode. To reduce the effects of speaker differences in the databases, vocal tract length normalization (VTLN) using the Lee-Rose method is incorporated. The PEs are tested using state-of-the-art Mel frequency cepstral coefficients (MFCCs) and vocal tract length normalized features. A hidden Markov model (HMM)-based approach is used to model the phonetic units. On average, the vocal tract length normalized PE achieves an improvement of 3.12 % and 1.32 % over MFCCs for Gujarati and Marathi, respectively.
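The abstract names the Lee-Rose method but gives no implementation details. As a rough illustration only, the sketch below assumes the commonly described form of that approach: a grid search over frequency-warping factors that keeps the factor with the best model score. The function names, the piecewise-linear warp, the 0.88 to 1.12 grid, and the `score_fn` callback (standing in for the likelihood of the warped features under the HMMs) are all assumptions made for this example, not details taken from the paper.

```python
import numpy as np

def warp_frequency(freqs, alpha, f_max, f_break_ratio=0.875):
    """Piecewise-linear frequency warp (hypothetical helper).

    Frequencies below a breakpoint are scaled by alpha; above it, a
    straight line is used so that f_max still maps to f_max.
    """
    freqs = np.asarray(freqs, dtype=float)
    # Breakpoint chosen so the scaled segment never exceeds the band edge.
    f_break = f_break_ratio * f_max * min(1.0, 1.0 / alpha)
    low = alpha * freqs
    slope = (f_max - alpha * f_break) / (f_max - f_break)
    high = alpha * f_break + slope * (freqs - f_break)
    return np.where(freqs <= f_break, low, high)

def select_warp_factor(score_fn, alphas=None):
    """Grid search: return the warp factor with the highest score.

    score_fn(alpha) is assumed to return a model score (for example a
    log-likelihood) for one speaker's features warped by alpha.
    """
    if alphas is None:
        # Assumed grid: 13 factors from 0.88 to 1.12 in steps of 0.02.
        alphas = np.round(np.linspace(0.88, 1.12, 13), 2)
    scores = [score_fn(a) for a in alphas]
    return float(alphas[int(np.argmax(scores))])

# Toy usage: a made-up score that peaks at alpha = 1.04.
best_alpha = select_warp_factor(lambda a: -(a - 1.04) ** 2)
warped_centres = warp_frequency([500.0, 1500.0, 8000.0], best_alpha, f_max=8000.0)
```

In a real system the warp would be applied to the Mel filterbank centre frequencies before computing MFCCs, and the score would come from the trained acoustic models rather than a toy function.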


Bibliographic details

Source
2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques, 2014, pp. 1-6 (6 pages)
Conference location: Phuket (TH)
Authors
Sharma Shubham; Madhavi Maulik C.; Patil Hemant A.
Affiliation

Dhirubhai Ambani Inst. of Inf. Commun. Technol., Gandhinagar, India;

Original format: PDF
Language: English (eng)
Keywords
Lee-Rose method; MFCC; Phonetic engine; VTLN; hidden Markov model;


Similar documents

1. Time Scale Modification And Vocal Tract Length Normalization For Improving The Performance Of Tamil Speech Recognition System Implemented Using Language Independent Segmentation Algorithm [J]. S. Saraswathi, T.V. Geetha. International Journal of Speech Technology. 2006, no. 3-4
2. Vocal Tract Length Normalization using a Gaussian mixture model framework for query-by-example spoken term detection [J]. Madhavi Maulik C., Patil Hemant A. Computer Speech and Language. 2019, Nov. issue
3. Development of vocal tract length normalized phonetic engine for Gujarati and Marathi languages [C]. Sharma Shubham, Madhavi Maulik C., Patil Hemant A. Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques. 2014
4. Frequency warping by linear transformation, and vocal tract inversion for speaker normalization in automatic speech recognition [D]. Panchapagesan, Sankaran. 2008
5. A statistical formant-pattern model for segregating vowel type and vocal-tract length in developmental formant data [O]. Richard E. Turner, Thomas C. Walters, Jessica J. M. Monaghan
6. The ΔF method of vocal tract length normalization for vowels [O]. Keith Johnson. 2020
7. Military Typesetting Equipment and Systems for Indo-Aryan and Dravidian Languages (Hindi, Marathi, Bengali, Punjabi, Gujarati, Malayalam, Tamil, and Telugu) (1961-1963) [R]. Nitenson, E. 1964
