首页> 外文会议>Annual meeting of the Association for Computational Linguistics;ACL 2012 >A Nonparametric Bayesian Approach to Acoustic Model Discovery
【24h】

A Nonparametric Bayesian Approach to Acoustic Model Discovery

机译:声学模型发现的非参数贝叶斯方法

获取原文

摘要

We investigate the problem of acoustic modeling in which prior language-specific knowledge and transcribed data are unavailable. We present an unsupervised model that simultaneously segments the speech, discovers a proper set of sub-word units (e.g., phones) and learns a Hidden Markov Model (HMM) for each induced acoustic unit. Our approach is formulated as a Dirichlet process mixture model in which each mixture is an HMM that represents a sub-word unit. We apply our model to the TIMIT corpus, and the results demonstrate that our model discovers sub-word units that are highly correlated with English phones and also produces better segmentation than the state-of-the-art unsupervised baseline. We test the quality of the learned acoustic models on a spoken term detection task. Compared to the baselines, our model improves the relative precision of top hits by at least 22.1 % and outperforms a language-mismatched acoustic model.
机译:我们调查了声学建模问题,在该模型中先前的语言特定知识和转录数据不可用。我们提供了一种无监督模型,该模型可同时对语音进行细分,发现适当的子词单元集(例如电话),并为每个感应声单元学习隐马尔可夫模型(HMM)。我们的方法被公式化为Dirichlet过程混合模型,其中每种混合都是代表子词单元的HMM。我们将模型应用于TIMIT语料库,结果表明我们的模型发现了与英语电话高度相关的子词单元,并且比最新的无监督基线产生了更好的细分。我们在语音术语检测任务上测试学习的声学模型的质量。与基准相比,我们的模型将热门歌曲的相对精度提高了至少22.1%,并且优于语言不匹配的声学模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号