Automatic phonetic segmentation of Hindi speech using hidden Markov model

Archana Balyan; S. S. Agrawal; Amita Dev

Abstract

In this paper, we study the performance of a baseline hidden Markov model (HMM) for the segmentation of speech signals. The model is applied to a single-speaker segmentation task using a Hindi speech database. The automatic phoneme segmentation framework that was developed imitates the human phoneme segmentation process. A set of 44 Hindi phonemes was chosen for the segmentation experiment, in which we used continuous density hidden Markov models (CDHMMs) with mixtures of Gaussian distributions. A left-to-right topology with no skip states was selected because it is effective in speech recognition, being consistent with the natural way spoken words are articulated. The system accepts speech utterances along with their orthographic transcriptions and generates segmentation information for the speech. The corpus was used to develop context-independent HMMs for each of the Hindi phonemes. The system was trained on numerous sentences relevant to providing information to passengers of the Metro Rail, and it was validated against a few manually segmented speech utterances. The evaluation shows that the best performance is obtained with a combination of two Gaussian mixtures and five HMM states. A category-wise phoneme error analysis was performed, and the performance of the phonetic segmentation is reported. The HMM modeling was implemented in C++ using Microsoft Visual Studio 2005, and the system is designed to run on the Windows operating system. The goal of this study is the automatic segmentation of speech at the phonetic level.
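The left-to-right, no-skip topology and the transcription-driven segmentation described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' C++ implementation. The function names (`left_to_right_transitions`, `forced_align`, `phone_boundaries`) are hypothetical, a single stay/advance probability is shared by all states, and per-frame log-likelihoods are assumed to be supplied as input rather than computed from Gaussian mixtures.

```python
import math

def left_to_right_transitions(n_states, stay=0.5):
    """Transition matrix of a left-to-right HMM with no skip states.

    Each state may only loop on itself or advance to the next state.
    Assumption for this sketch: the final state is absorbing.
    """
    A = [[0.0] * n_states for _ in range(n_states)]
    for i in range(n_states):
        if i + 1 < n_states:
            A[i][i] = stay
            A[i][i + 1] = 1.0 - stay
        else:
            A[i][i] = 1.0
    return A

def forced_align(log_lik, log_stay, log_next):
    """Viterbi alignment of T frames to S states visited strictly in order.

    log_lik[t][s] is the log-likelihood of frame t under state s, where the
    states are the concatenated phoneme models of the known transcription.
    Returns the state index assigned to each frame.
    """
    T, S = len(log_lik), len(log_lik[0])
    if T < S:
        raise ValueError("need at least one frame per state")
    neg_inf = float("-inf")
    score = [[neg_inf] * S for _ in range(T)]
    back = [[0] * S for _ in range(T)]
    score[0][0] = log_lik[0][0]  # the path must start in the first state
    for t in range(1, T):
        for s in range(S):
            stay = score[t - 1][s] + log_stay
            move = score[t - 1][s - 1] + log_next if s > 0 else neg_inf
            if move > stay:
                score[t][s], back[t][s] = move + log_lik[t][s], s - 1
            else:
                score[t][s], back[t][s] = stay + log_lik[t][s], s
    path = [S - 1]  # the path must end in the last state
    for t in range(T - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return path

def phone_boundaries(path, states_per_phone):
    """Frame indices at which a new phoneme begins."""
    phones = [s // states_per_phone for s in path]
    return [t for t in range(1, len(phones)) if phones[t] != phones[t - 1]]

# Toy input: 6 frames, 2 phonemes with 2 states each (4 states in total).
preferred = [0, 0, 1, 2, 3, 3]
log_lik = [[0.0 if s == p else -10.0 for s in range(4)] for p in preferred]
path = forced_align(log_lik, math.log(0.5), math.log(0.5))
boundaries = phone_boundaries(path, states_per_phone=2)
```

In this toy case the second phoneme starts at the frame index held in `boundaries`. A fuller system would use five states per phoneme, as the abstract reports, and would score each frame with that state's Gaussian mixture density.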

Bibliographic details

Source
AI & Society, 2012, No. 4, pp. 543-549 (7 pages)
Authors
Archana Balyan; S. S. Agrawal; Amita Dev
Affiliations

Maharaja Surajmal Institute of Technology, Guru Gobind Singh Indraprastha University, C-4, Janakpuri, New Delhi 110058, India;

KIIT College of Engineering, KIIT Campus, Sohna Road, Gurgaon, Haryana, India;

Bhai Parmanand Institute of Business Studies, Shakurpur, Delhi, India;

Original format: PDF
Language: English
Keywords
automatic phonetic segmentation; hidden Markov models; text to speech; corpus-based speech synthesis; Gaussian mixture models; unit selection


