首页> 外文会议>Annual conference of the International Speech Communication Association >Are Sparse Representations Rich Enough for Acoustic Modeling?
【24h】

Are Sparse Representations Rich Enough for Acoustic Modeling?

机译:稀疏表示是否足以进行声学建模?

获取原文

摘要

We propose a novel approach to acoustic modeling based on recent advances in sparse representations. The key idea in sparse coding is to compute a compressed local representation of a signal via an over-complete basis or dictionary that is learned in an unsupervised way. In this study, we compute the local representation on speech spectrogram as the raw "signal" and use it as the local sparse code to perform a standard phone classification task. A linear classifier is used that directly receives the coding space for making the classification decision. The simplicity of the linear classifier allows us to assess whether the sparse representations are sufficiently rich to serve as effective acoustic features for discriminating speech classes. Our experiments demonstrate competitive error rates when compared to other shallow approaches. An examination of the dictionary learned in sparse feature extraction demonstrates meaningful acoustic-phonetic properties that are captured by a collection of the dictionary entries.
机译:我们基于稀疏表示的最新进展提出了一种新颖的声学建模方法。稀疏编码的关键思想是通过过完整的基础或以无监督方式学习的字典来计算信号的压缩本地表示。在这项研究中,我们将语音频谱图上的本地表示计算为原始“信号”,并将其用作本地稀疏代码以执行标准的电话分类任务。使用线性分类器,其直接接收编码空间以做出分类决策。线性分类器的简单性使我们能够评估稀疏表示是否足够丰富,可以用作区分语音类别的有效声学特征。与其他浅层方法相比,我们的实验证明了竞争性错误率。对稀疏特征提取中学习的词典的检查表明,有意义的声学特性被词典条目的集合所捕获。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号