首页> 外文学位 >A 9600 BIT PER SECOND SPEECH COMPRESSION ALGORITHM BASED ON LINEAR PREDICTION.
【24h】

A 9600 BIT PER SECOND SPEECH COMPRESSION ALGORITHM BASED ON LINEAR PREDICTION.

机译:基于线性预测的每秒9600位压缩算法。

获取原文
获取原文并翻译 | 示例

摘要

The use of digital technologies in voice communication systems has created the need for efficient methods of converting analog speech signals into digital data formats. Often this conversion must be accomplished subject to a fixed bit rate constraint. For rates below 16,000 bits per second (bps), the speech data is usually processed by a compression algorithm to preserve the quality and intelligibility of the speech.;A simulation of the new algorithm was developed to evaluate the synthesized speech. Quality and intelligibility tests were performed. They demonstrated that the use of the residual-derived excitation increases the naturalness and intelligibility of the synthesized speech relative to the baseline. The performance of the new algorithm was also compared to that of a 9600 bps adaptive predictive coder (APC). For male speakers, the speech synthesized by the new algorithm was generally preferred, while for female speakers, APC was judged superior.;The new algorithm contains features that makes it particularly suited for applications where both 2400 bps LPC and a 9600 bps compression algorithm was required. For example, the bit stream of the new algorithm is a super set of the LPC bit stream. This greatly facilitates conferencing between users who have only the low rate capability and users whose compression systems operate at the higher rate. Also, much of the software required to implement the new algorithm is identical to that of LPC. A 9600 bps capability can be added to a 2400 bps LPC compression system with only a modest increase in program memory.;This dissertation presents a new 9600 bps speech compression algorithm. The new algorithm is based on a state-of-the-art 2400 bps linear predictive coder (LPC). A synthesizer excitation is derived from the prediction residual and is used to replace the fixed impulse excitation used in LPC. A method for selecting, evaluating, and coding the excitation is presented.
机译:在语音通信系统中数字技术的使用引起了对将模拟语音信号转换成数字数据格式的有效方法的需求。通常,必须在固定的比特率约束下完成此转换。对于低于16,000比特/秒(bps)的速率,通常使用压缩算法处理语音数据,以保留语音的质量和清晰度。开发了一种新算法的仿真,以评估合成语音。进行了质量和清晰度测试。他们证明,使用残留的激励可以提高合成语音相对于基线的自然度和清晰度。还将新算法的性能与9600 bps自适应预测编码器(APC)的性能进行了比较。对于男性发言者,通常首选采用新算法合成的语音,而对于女性发言者,则认为APC更具优势。该新算法的功能使其特别适用于同时使用2400 bps LPC和9600 bps压缩算法的应用。需要。例如,新算法的比特流是LPC比特流的超集。这极大地促进了仅具有低速率能力的用户和其压缩系统以较高速率运行的用户之间的会议。同样,实现新算法所需的许多软件与LPC相同。可以将9600 bps的功能添加到2400 bps的LPC压缩系统中,而程序存储器只需要适度增加即可。本论文提出了一种新的9600 bps语音压缩算法。新算法基于最新的2400 bps线性预测编码器(LPC)。合成器激励是从预测残差中导出的,用于代替LPC中使用的固定脉冲激励。提出了一种选择,评估和编码激励的方法。

著录项

  • 作者

    ABZUG, BARRY MARTIN.;

  • 作者单位

    University of Maryland, College Park.;

  • 授予单位 University of Maryland, College Park.;
  • 学科 Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 1981
  • 页码 177 p.
  • 总页数 177
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号