首页> 外文期刊>電子情報通信学会技術研究報告. 応用音響. Engineering Acoustics >Quality improvement of HMM-based synthesized speech based on decomposition of naturalness and intelligibility using asymmetric bilinear model with non-negative matrix factorization
【24h】

Quality improvement of HMM-based synthesized speech based on decomposition of naturalness and intelligibility using asymmetric bilinear model with non-negative matrix factorization

机译:使用非负矩阵分解的不对称双线性模型基于自然和清晰度分解的基于HMM的合成语音质量改进

获取原文
获取原文并翻译 | 示例
           

摘要

HMM-based synthesized voices are intelligible but not natural especially in limited data condition due to over-smoothing of speech spectra. One solution for this problem is to use voice conversion techniques to convert over-smoothed spectra into natural spectra. Although conventional conversion techniques can transform speech spectra to natural ones in order to improve naturalness, they cause unexpected distortions on acceptable intelligibility of synthesized speech. The aim of this paper is to improve naturalness without violating acceptable intelligibility employing a novel asymmetric bilinear model using non-negative matrix factorization (NMF) to separate the naturalness and intelligibility of the synthesized speech. Subjective evaluations carried out on English dataset confirm that the achieved synthesis quality is higher than other methods in limited data condition and competitive in large data condition. Moreover, non-negativity constrain in NMF helps to explain the physical meanings of factored matrices as follows: the first matrix describes naturalness and the second describes intelligibility of the speech.
机译:基于HMM的合成语音是可理解的,但不是自然的,特别是在有限的数据条件下,由于语音频谱的过度平滑。解决此问题的一种方法是使用语音转换技术,将过度平滑的频谱转换为自然频谱。尽管常规转换技术可以将语音频谱转换为自然频谱以提高自然度,但它们会在合成语音的可接受清晰度上引起意外的失真。本文的目的是使用非负矩阵分解(NMF)来分离合成语音的自然性和清晰度,采用新颖的不对称双线性模型,在不破坏可接受的清晰度的情况下提高自然度。在英语数据集上进行的主观评估证实,在有限的数据条件下,所获得的合成质量要高于其他方法,在大数据条件下具有竞争力。此外,NMF中的非负约束有助于如下解释因式矩阵的物理含义:第一个矩阵描述自然性,第二个矩阵描述语音的清晰度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号