首页> 外文会议>International Conference on Speech and Computer >A Phonetic Segmentation Procedure Based on Hidden Markov Models
【24h】

A Phonetic Segmentation Procedure Based on Hidden Markov Models

机译:基于隐马尔可夫模型的语音分割过程

获取原文

摘要

In this paper, a novel variant of an automatic phonetic segmentation procedure is presented, especially useful if data is scarce. The procedure uses the Kaldi speech recognition toolkit as its basis, and combines and modifies several existing methods and Kaldi recipes. Both the specifics of model training and test data alignment are explained in detail. Effectiveness of artificial extension of the starting amount of manually labeled material during training is examined as well. Experimental results show the admirable overall correctness of the proposed procedure in the given test environment. Several variants of the procedure are compared, and the usage of speaker-adapted context-dependent triphone models trained without the expanded manually checked data is proven to produce the best results. A few ways to improve the procedure even more, as well as future work, are also discussed.
机译:在本文中,提出了一种新的自动语音分割过程的变体,如果数据稀缺,特别有用。该过程使用Kaldi语音识别工具包作为其基础,并组合和修改几种现有方法和KALDI配方。详细解释了模型训练和测试数据对齐的细节。检查了在训练期间手动标记材料起始量的人工延伸的有效性。实验结果表明,在给定的测试环境中提出的程序的令人钦佩的整体正确性。比较了几个程序的变体,并经过证明没有扩展的手动检查数据培训的扬声器适应的上下文相关的三磡模型的使用是为了产生最佳结果。还讨论了几种方法来提高程序,以及未来的工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号