首页> 外文会议>Annual meeting of the Association for Computational Linguistics >A Nonparametric Bayesian Approach to Acoustic Model Discovery
【24h】

A Nonparametric Bayesian Approach to Acoustic Model Discovery

机译:声学模型发现的非参数贝叶斯方法

获取原文

摘要

We investigate the problem of acoustic modeling in which prior language-specific knowledge and transcribed data are unavailable. We present an unsupervised model that simultaneously segments the speech, discovers a proper set of sub-word units (e.g., phones) and learns a Hidden Markov Model (HMM) for each induced acoustic unit. Our approach is formulated as a Dirichlet process mixture model in which each mixture is an HMM that represents a sub-word unit. We apply our model to the TIMIT corpus, and the results demonstrate that our model discovers sub-word units that are highly correlated with English phones and also produces better segmentation than the state-of-the-art unsupervised baseline. We test the quality of the learned acoustic models on a spoken term detection task. Compared to the baselines, our model improves the relative precision of top hits by at least 22.1 % and outperforms a language-mismatched acoustic model.
机译:我们调查了声学建模的问题,其中先前的语言特定知识和转录数据不可用。我们介绍了一个同时段演讲的无监督模型,发现了一组适当的子字单元(例如,电话),并为每个感应的声学单元学习隐藏的马尔可夫模型(HMM)。我们的方法被配制成Dirichlet过程混合物模型,其中每个混合物是表示子字单元的HMM。我们将模型应用于Timit语料库,结果表明,我们的模型发现与英语电话高度相关的子字单元,也会产生比最先进的无监督的基线产生更好的细分。我们在口语术语检测任务上测试学习声学模型的质量。与基线相比,我们的模型提高了顶部命中的相对精度至少22.1%,优于语言 - 不匹配的声学模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号