...
首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Accurate phoneme segmentation method using combination of HMM and Fuzzy Inference system
【24h】

Accurate phoneme segmentation method using combination of HMM and Fuzzy Inference system

机译:HMM与模糊推理系统相结合的精确音素分割方法

获取原文
获取原文并翻译 | 示例
           

摘要

The aim of this study, is to improve the accuracy of automatic segmentation. In the last twenty years, manual speech segmentation is always considered as the most accurate method for speech segmentation. However, this is a handwork by experts and a time-consuming work. Compared with this, automatic segmentation methods are much stable and faster. Unfortunately, achieving accurate segmentation is still a challenging task. Recently, some researches attempt to improve the accuracy of automatic segmentation by using some statistical correction procedures or learning methods on HMM-based forced-alignment. The refinement for HMM-based forced alignment in automatic speech segmentation is still not accurate enough. This paper presents an effective approach based on an adaptive neuro fuzzy inference system (ANFIS) for refining the output of the traditional HMM. Compared with other competitive methods, such as SVM, handmade fuzzy-logic and linear method, ANFIS has advantages in dealing with the problem of non-linear, fuzzy and can be trained in a completely automatic way. This study combined ANFIS instead of linear system used in the state-of-the-art research with HMM-based forced alignment to improve the automatic phoneme segmentation. The results show that ANFIS successful learn the rules of manual speech segmentation strategies and can significantly improve forced alignment accuracy. The proposed system already achieved 91.6% agreement within 20 msec for manual segmentation on the TIMIT corpus, comparing the 89.98% for the linear system used in the outstanding work.
机译:这项研究的目的是提高自动分割的准确性。在过去的二十年中,手动语音分割始终被认为是语音分割的最准确方法。但是,这是专家的手工工作,是一项耗时的工作。与此相比,自动分割方法更加稳定和快捷。不幸的是,实现准确的分割仍然是一项艰巨的任务。最近,一些研究试图通过使用一些基于HMM的强制对齐的统计校正程序或学习方法来提高自动分割的准确性。自动语音分割中基于HMM的强制对齐的改进仍然不够准确。本文提出了一种基于自适应神经模糊推理系统(ANFIS)的有效方法,用于细化传统HMM的输出。与SVM,手工模糊逻辑和线性方法等其他竞争方法相比,ANFIS在处理非线性,模糊问题方面具有优势,并且可以完全自动化地进行训练。这项研究将最新研究中使用的ANFIS而非线性系统与基于HMM的强制对齐相结合,以改善自动音素分割。结果表明,ANFIS成功地学习了手动语音分割策略的规则,并可以显着提高强制对齐的准确性。拟议的系统已经在20毫秒内在TIMIT语料库上实现了手动分割的91.6%协议,相比之下,出色工作中使用的线性系统为89.98%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号