...
首页> 外文期刊>Information Sciences: An International Journal >Unsupervised learning of phonemes of whispered speech in a noisy environment based on convolutive non-negative matrix factorization
【24h】

Unsupervised learning of phonemes of whispered speech in a noisy environment based on convolutive non-negative matrix factorization

机译:基于卷积非负矩阵分解的无声环境下低语语音音素的无监督学习

获取原文
获取原文并翻译 | 示例
           

摘要

This paper focuses on the development of an algorithm that can be optimized for a specific acoustic environment to improve the intelligibility of whispered speech. A new convolutive non-negative matrix factorization (NMF) algorithm is proposed to extract phoneme bases from noisy whispered speech with the noise bases from prior learning; these noise bases are obtained from training using the conventional non-negative matrix factorization. The divergence function with a sparseness constraint term is selected as the objective function in the developed algorithm to obtain multiplicative update rules of the phoneme base matrix and the corresponding weight matrix. The weights of the noise bases from prior learning are also updated in the phoneme learning stage. Listening experiments were conducted to assess the intelligibility performance of speech synthesized using the proposed algorithm. The experimental results indicate that the proposed algorithm is very effective for improving the intelligibility of whispers in various noise contexts, and it outperforms conventional algorithms.
机译:本文重点研究可以针对特定的声学环境进行优化以提高耳语语音清晰度的算法。提出了一种新的卷积非负矩阵分解算法,该算法可以从嘈杂的耳语中提取音素基,而在先的学习中则采用噪声基。这些噪声基是通过使用常规非负矩阵分解进行训练而获得的。在所开发的算法中,选择具有稀疏约束项的散度函数作为目标函数,以获得音素基矩阵和相应权重矩阵的乘法更新规则。在音素学习阶段,也会更新来自先前学习的噪声基准的权重。进行了听力实验,以评估使用该算法合成的语音的清晰度。实验结果表明,该算法在各种噪声环境下对窃窃私语的清晰度都非常有效,优于传统算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号