首页> 外文会议>Chinese Spoken Language Processing; Lecture Notes in Artificial Intelligence; 4274 >A Minimum Boundary Error Framework for Automatic Phonetic Segmentation
【24h】

A Minimum Boundary Error Framework for Automatic Phonetic Segmentation

机译:自动语音分割的最小边界误差框架

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This paper presents a novel framework for HMM-based automatic phonetic segmentation that improves the accuracy of placing phone boundaries. In the framework, both training and segmentation approaches are proposed according to the minimum boundary error (MBE) criterion, which tries to minimize the expected boundary errors over a set of possible phonetic alignments. This framework is inspired by the recently proposed minimum phone error (MPE) training approach and the minimum Bayes risk decoding algorithm for automatic speech recognition. To evaluate the proposed MBE framework, we conduct automatic phonetic segmentation experiments on the TIMIT acoustic-phonetic continuous speech corpus. MBE segmentation with MBE-trained models can identify 80.53% of human-labeled phone boundaries within a tolerance of 10 ms, compared to 71.10% identified by conventional ML segmentation with ML-trained models. Moreover, by using the MBE framework, only 7.15% of automatically labeled phone boundaries have errors larger than 20 ms.
机译:本文提出了一种基于HMM的自动语音分割新框架,该框架提高了放置电话边界的准确性。在该框架中,根据最小边界误差(MBE)准则提出了训练和分割方法,该准则试图在一组可能的语音对齐方式上将预期的边界误差最小化。该框架的灵感来自于最近提出的最小电话错误(MPE)训练方法和用于自动语音识别的最小贝叶斯风险解码算法。为了评估提出的MBE框架,我们在TIMIT语音连续语音语料库上进行了自动语音分割实验。用MBE训练的模型进行MBE细分可以在10毫秒的容差范围内识别80.53%的人类标记电话边界,而使用ML训练的模型通过传统的ML细分可以识别出71.10%的电话边界。此外,通过使用MBE框架,只有7.15%的自动标记电话边界的错误大于20 ms。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号