首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >MINIMUM SEGMENTATION ERROR BASED DISCRIMINATIVE TRAINING OF HMM FOR AUTOMATIC PHONETIC SEGMENTATION
【24h】

MINIMUM SEGMENTATION ERROR BASED DISCRIMINATIVE TRAINING OF HMM FOR AUTOMATIC PHONETIC SEGMENTATION

机译:基于最低分割误差的自动语音分割的HMM辨别训练

获取原文
获取原文并翻译 | 示例
           

摘要

In the conventional HMM-based segmentation method, the HMM training is based on MLE criteria, which links the segmentation task to the problem of distribution estimation. The HMMs are built to identify the phonetic segments, not to detect the boundary. This kind of inconsistency between training and application limited the performance of segmentation. In this paper, we adopt the discriminative training method and introduce a new criterion, named Minimum Segmentation Error (MSE), for HMM training. In this method, a loss function directly related to the segmentation error is defined. By minimizing the overall empirical loss with the Generalized Probabilistic Descent (GPD) algorithm, the segmentation error is also minimized. From the results on both Chinese and Japanese data, the accuracy of segmentation is improved, where the error average is reduced 0.7 ms on Chinese and 1.6 ms on Japanese. Moreover, this method is robust even when we do not have enough knowledge on HMM modeling, e.g. the number of states is not optimized.
机译:在基于迁移的基于HMM的分割方法中,HMM培训基于MLE标准,这将分割任务链接到分发估计问题。构建HMMS以识别语音段,而不是检测边界。培训和应用之间这种不一致限制了分割的性能。在本文中,我们采用了鉴别的培训方法,并引入了一个名为最小分割错误(MSE)的新标准,用于HMM培训。在此方法中,定义了与分段错误直接相关的丢失函数。通过使广义概率下降(GPD)算法的总体经验损失最小化,还原误差也最小化。从汉语和日本数据的结果,分割的准确性得到改善,其中错误平均值减少了汉语和1.6毫秒的日语。此外,即使我们没有足够的HMM建模,这种方法也是坚固的稳健性,例如,即使我们没有足够的知识,例如,也是如此。状态的数量未得到优化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号