首页> 外国专利> Method and system for low bit rate speech coding with speech recognition features and pitch providing reconstruction of the spectral envelope

Method and system for low bit rate speech coding with speech recognition features and pitch providing reconstruction of the spectral envelope

机译:具有语音识别特征和音调的低比特率语音编码的方法和系统,可重构频谱包络

摘要

A method for encoding a digitized speech signal so as to generate data capable of being decoded as speech. A digitized speech signal is first converted to a series of feature vectors using for example known Mel-frequency Cepstral coefficients (MFCC) techniques. At successive instances instance of time a respective pitch value of the digitized speech signal is computed, and successive acoustic vectors each containing the respective pitch value and feature vector are compressed so as to derive therefrom a bit stream. A suitable decoder reverses the operation so as to extract the features vectors and pitch values, thus allowing speech reproduction and playback. In addition, speech recognition is possible using the decompressed feature vectors, with no impairment of the recognition accuracy and no computational overhead.
机译:一种用于对数字化语音信号进行编码以便生成能够被解码为语音的数据的方法。首先使用例如已知的梅尔频率倒谱系数(MFCC)技术将数字化语音信号转换为一系列特征向量。在时间的连续实例处,计算数字化语音信号的相应音调值,并且压缩每个均包含相应音调值和特征矢量的连续声学矢量,以便从中导出比特流。合适的解码器将操作逆转以提取特征向量和音调值,从而允许语音再现和回放。另外,使用解压缩的特征向量可以进行语音识别,而不会降低识别精度,也不会增加计算量。

著录项

  • 公开/公告号US2003088402A1

    专利类型

  • 公开/公告日2003-05-08

    原文格式PDF

  • 申请/专利权人 IBM CORP.;

    申请/专利号US20020291590

  • 申请日2002-11-12

  • 分类号G10L11/04;

  • 国家 US

  • 入库时间 2022-08-22 00:07:39

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号