【24h】

Automatic phonetic segmentation

机译:自动语音分割

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents the results and conclusions of a thorough study on automatic phonetic segmentation. It starts with a review of the state of the art in this field. Then, it analyzes the most frequently used approach-based on a modified Hidden Markov Model (HMM) phonetic recognizer. For this approach, a statistical correction procedure is proposed to compensate for the systematic errors produced by context-dependent HMMs, and the use of speaker adaptation techniques is considered to increase segmentation precision. Finally, this paper explores the possibility of locally refining the boundaries obtained with the former techniques. A general framework is proposed for the local refinement of boundaries, and the performance of several pattern classification approaches (fuzzy logic, neural networks and Gaussian mixture models) is compared within this framework. The resulting phonetic segmentation scheme was able to increase the performance of a baseline HMM segmentation tool from 27.12%, 79.27%, and 97.75% of automatic boundary marks with errors smaller than 5, 20, and 50 ms, respectively, to 65.86%, 96.01%, and 99.31% in speaker-dependent mode, which is a reasonably good approximation to manual segmentation.
机译:本文介绍了对自动语音分割进行深入研究的结果和结论。首先回顾该领域的最新技术。然后,它基于修改后的隐马尔可夫模型(HMM)语音识别器分析最常用的方法。对于这种方法,提出了一种统计校正程序来补偿由上下文相关的HMM产生的系统误差,并且考虑使用说话人自适应技术来提高分割精度。最后,本文探讨了局部细化使用前一种技术获得的边界的可能性。提出了用于边界的局部细化的通用框架,并且在该框架内比较了几种模式分类方法(模糊逻辑,神经网络和高斯混合模型)的性能。所得的语音分割方案能够将基线HMM分割工具的性能从自动边界标记的27.12%,79.27%和97.75%提高到错误分别小于5、20和50 ms的65.86%,96.01 %,以及99.31%(与说话者相关的模式),这对于手动细分来说是一个很好的近似值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号