首页> 外文期刊>International journal of speech technology >An Acoustic-Phonetic and a Model-Theoretic Analysis of Subspace Distribution Clustering Hidden Markov Models
【24h】

An Acoustic-Phonetic and a Model-Theoretic Analysis of Subspace Distribution Clustering Hidden Markov Models

机译:子空间分布聚类隐马尔可夫模型的声学和模型理论分析

获取原文
获取原文并翻译 | 示例
       

摘要

Recently, we proposed a new derivative to conventional continuous density hidden Markov modeling (CDHMM) that we call "subspace distribution clustering hidden Markov modeling" (SDCHMM). SDCHMMs can be created by tying low-dimensional subspace Gaussians in CDHMMs. In tasks we tried, usually only 32-256 subspace Gaussian prototypes were needed in SDCHMM-based system to maintain recognition performance of its original CDHMM-based system―a reduction of Gaussian parameters by one to three orders of magnitude. Consequently, both recognition time and memory were greatly reduced. We also have showed that if the underlying subspace distribution tying structure is known, it may be used to train an SDCHMM-based system with as little as eight minutes of speech from scratch. All the results suggest that there is substantial redundancy in conventional CDHMM and that SDCHMM is a more compact model. In this paper, we analyze the tying structure from two perspectives: from the acoustic-phonetic perspective showing that the tying structure seems to capture prominent relationship among phones; and, from the model-theoretic perspective showing that SDCHMMs, if properly created from CDHMMs, may be preferred over the latter as they are less complex and have the potential of greater generalization power.
机译:最近,我们提出了一种常规连续密度隐马尔可夫建模(CDHMM)的新衍生物,我们将其称为“子空间分布聚类隐马尔可夫建模”(SDCHMM)。 SDCHMM可以通过将CDHMM中的低维子空间高斯联系起来来创建。在我们尝试的任务中,在基于SDCHMM的系统中通常仅需要32-256个子空间高斯原型即可维持其原始基于CDHMM的系统的识别性能-将高斯参数减少1-3个数量级。因此,识别时间和记忆都大大减少了。我们还表明,如果底层的子空间分布绑定结构是已知的,则可以将其用于训练基于SDCHMM的系统,该系统从头开始仅需八分钟的语音。所有结果都表明,传统的CDHMM具有很大的冗余性,而SDCHMM是更紧凑的模型。在本文中,我们从两个角度分析了打结结构:从声学的角度来看,打结结构似乎抓住了电话之间的重要关系;从模型理论的角度来看,如果从CDHMM中正确创建SDCHMM,则它们可能比CDHMM更可取,因为它们不那么复杂,并且具有更大的泛化能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号