首页> 外文期刊>Neurocomputing >Discovering Speech Phones Using Convolutive Non-negative Matrix Factorisation With A Sparseness Constraint
【24h】

Discovering Speech Phones Using Convolutive Non-negative Matrix Factorisation With A Sparseness Constraint

机译:使用具有稀疏约束的卷积非负矩阵分解发现语音电话

获取原文
获取原文并翻译 | 示例
           

摘要

Discovering a representation that allows auditory data to be parsimoniously represented is useful for many machine learning and signal processing tasks. Such a representation can be constructed by non-negative matrix factorisation (NMF), a method for finding parts-based representations of non-negative data. Here, we present an extension to convolutive NMF that includes a sparseness constraint, where the resultant algorithm has multiplicative updates and utilises the beta divergence as its reconstruction objective. In combination with a spectral magnitude transform of speech, this method discovers auditory objects that resemble speech phones along with their associated sparse activation patterns. We use these in a supervised separation scheme for monophonic mixtures, finding improved separation performance in comparison to standard convolutive NMF.
机译:发现可以简化听觉数据表示的表示形式对于许多机器学习和信号处理任务很有用。可以通过非负矩阵分解(NMF)(一种用于查找基于零件的非负数据的表示方法)来构造这样的表示。在这里,我们提出了对卷积NMF的扩展,其中包括稀疏约束,其中所得算法具有乘法更新,并利用beta散度作为其重建目标。结合语音的频谱幅度变换,此方法可发现类似于语音电话的听觉对象及其关联的稀疏激活模式。我们将它们用于单音混合物的监督分离方案中,发现与标准卷积NMF相比,分离性能得到了改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号