首页> 外国专利> Use of an auditory model to improve quality or lower the bit rate of speech synthesis systems

Use of an auditory model to improve quality or lower the bit rate of speech synthesis systems

机译:使用听觉模型来提高语音合成系统的质量或降低其比特率

摘要

Low bit rate speech coding algorithms are mostly based on the use of voice production models in which vocal tract filters are excited by vectors chosen from fixed and adaptive codebooks. It has been recognized that to improve the perceptual quality of such coders it is necessary to also allow for the pyschoacoustic properties of the human ear. The weighting filter (5, of Fig. 1B) traditionally used for this purpose is sub-optimal as it doesnot explicitly evaluate auditory characteristics. Disclosed in the preferred embodiment of the present invention, the weighting filter is replaced with an auditory model which enables the search for the optimum stochastic code vector in the psychoacoustic domain. An algorithm, which has been termed PERCELP (for Perceptually Enhanced Random Codebook Excited Linear Prediction), is disclosed which produces speech that is of considerably better quality than obtained with a weighting filter. The computational overhead is low enough to warrant the use of this approach in new speech coders.
机译:低比特率语音编码算法主要基于语音产生模型的使用,在该模型中,通过从固定和自适应码本中选择的矢量来激励声道滤波器。已经认识到,为了提高这种编码器的感知质量,还必须考虑到人耳的心理声学特性。传统上用于此目的的加权滤波器(图1B中的5)是次优的,因为它没有明确评估听觉特征。在本发明的优选实施例中公开了,加权滤波器被替换为听觉模型,该听觉模型使得能够在心理声学域中搜索最优随机码向量。公开了一种算法,该算法被称为PERCELP(用于感知增强型随机码本激励线性预测),其产生的语音质量比用加权滤波器获得的语音质量好得多。计算开销很低,足以保证在新的语音编码器中使用此方法。

著录项

  • 公开/公告号AU6672094A

    专利类型

  • 公开/公告日1994-11-21

    原文格式PDF

  • 申请/专利权人 UNISEARCH LIMITED;

    申请/专利号AU19940066720

  • 发明设计人 DIPANJAN SEN;WARWICK HARVEY HOLMES;

    申请日1994-04-29

  • 分类号G10L3/02;G10L9/10;

  • 国家 AU

  • 入库时间 2022-08-22 04:15:24

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号