...
首页> 外文期刊>AI & society >Automatic phonetic segmentation of Hindi speech using hidden Markov model
【24h】

Automatic phonetic segmentation of Hindi speech using hidden Markov model

机译:使用隐马尔可夫模型对印地语语音进行自动语音分割

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we study the performance of baseline hidden Markov model (HMM) for segmentation of speech signals. It is applied on single-speaker segmentation task, using Hindi speech database. The automatic phoneme segmentation framework evolved imitates the human phoneme segmentation process. A set of 44 Hindi phonemes were chosen for the segmentation experiment, wherein we used continuous density hidden Markov model (CDHMM) with a mixture of Gaussian distribution. The left-to-right topology with no skip states has been selected as it is effective in speech recognition due to its consistency with the natural way of articulating the spoken words. This system accepts speech utterances along with their orthographic "transcriptions" and generates segmentation information of the speech. This corpus was used to develop context-independent hidden Markov models (HMMs) for each of the Hindi phonemes. The system was trained using numerous sentences that are relevant to provide information to the passengers of the Metro Rail. The system was validated against a few manually segmented speech utterances. The evaluation of the experiments shows that the best performance is obtained by using a combination of two Gaussians mixtures and five HMM states. A category-wise phoneme error analysis has been performed, and the performance of the phonetic segmentation has been reported. The modeling of HMMs has been implemented using Microsoft Visual Studio 2005 (C++), and the system is designed to work on Windows operating system. The goal of this study is automatic segmentation of speech at phonetic level.
机译:在本文中,我们研究了用于语音信号分割的基线隐马尔可夫模型(HMM)的性能。使用印地语语音数据库,它可用于单扬声器分割任务。不断发展的自动音素分割框架模仿了人类的音素分割过程。选择了一组44种印地语音素进行分割实验,其中我们使用了具有高斯分布混合的连续密度隐藏马尔可夫模型(CDHMM)。选择了无跳过状态的从左到右拓扑,因为它与语音发音的自然方式保持一致,因此在语音识别中很有效。该系统接受语音发声及其正交的“转录”,并生成语音的分段信息。该语料库用于为每个北印度语音素开发与上下文无关的隐藏马尔可夫模型(HMM)。该系统使用大量与向地铁乘客提供信息有关的句子进行了培训。该系统已针对一些手动分段的语音进行了验证。实验评估表明,结合使用两种高斯混合物和五个HMM状态可获得最佳性能。已经执行了类别明智的音素错误分析,并且已经报告了语音分割的性能。 HMM的建模已使用Microsoft Visual Studio 2005(C ++)实现,并且该系统旨在在Windows操作系统上工作。这项研究的目标是在语音级别自动分割语音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号