首页> 外文会议>2011 Data Compression Conference >Hybrid Scalar/Vector Quantization of Mel-Frequency Cepstral Coefficients for Low Bit-Rate Coding of Speech

【24h】

Hybrid Scalar/Vector Quantization of Mel-Frequency Cepstral Coefficients for Low Bit-Rate Coding of Speech

机译：语音的低比特率编码的Mel频率倒谱系数的混合标量/矢量量化

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, we propose a low bit-rate speech codec based on a hybrid scalar/vector quantization of the mel-frequency cepstral coefficients (MFCCs). We begin by showing that if a high-resolution mel-frequency cepstrum (MFC) is computed, good-quality speech reconstruction is possible from the MFCCs despite the lack of explicit phase information. By evaluating the contribution toward speech quality that individual MFCCs make and applying appropriate quantization, our results show perceptual evaluation of speech quality (PESQ) of the MFCC-based codec matches the state-of-the-art MELPe codec at 600 bps and exceeds the CELP codec at 2000 -- 4000 bps coding rates. The main advantage of the proposed codec is in distributed speech recognition (DSR) since speech features based on MFCCs can be directly obtained from code words thus eliminating additional decode and feature extract stages.

机译：在本文中，我们提出了一种基于梅尔频率倒谱系数（MFCC）的混合标量/矢量量化的低比特率语音编解码器。我们首先显示，如果计算出高分辨率的mel-cepstrum（MFC），尽管缺少明确的相位信息，但仍可以从MFCC进行高质量的语音重建。通过评估各个MFCC对语音质量的贡献并应用适当的量化，我们的结果表明，基于MFCC的编解码器对语音质量（PESQ）的感知评估与600 bps的最新MELPe编解码器相匹配，并且超出了CELP编解码器的编码速率为2000-4000 bps。所提出的编解码器的主要优点在于分布式语音识别（DSR），因为可以直接从代码字获得基于MFCC的语音特征，从而消除了额外的解码和特征提取阶段。

著录项

来源
《2011 Data Compression Conference》|2011年|p.103-112|共10页
会议地点
作者
Boucheron Laura E.; Leon Phillip L. De; Sandoval Steven;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.56;
关键词

相似文献

外文文献
中文文献
专利

1. Low Bit-Rate Speech Coding Through Quantization of Mel-Frequency Cepstral Coefficients [J] . Boucheron L. E., De Leon P. L., Sandoval S. Audio, Speech, and Language Processing, IEEE Transactions on . 2012,第2期

机译：通过梅尔频率倒谱系数的量化实现低比特率语音编码
2. Speech Recognition for Isolated Words using Mel-Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) [J] . Yogesh S. Angal, R. H. Chile, R. S. Holambe Journal of the Instrument Society of India: Proceedings of the national symposium on instrumentation . 2011,第3期

机译：使用Mel频率倒谱系数（MFCC）和矢量量化（VQ）对孤立单词进行语音识别
3. Predictive Trellis-Coded Quantization of the Cepstral Coefficients for the Distributed Speech Recognition [J] . Sangwon KANG, Joonseok LEE IEICE Transactions on Communications . 2007,第6期

机译：分布语音识别的倒谱系数的预测网格编码量化
4. Hybrid Scalar/Vector Quantization of Mel-Frequency Cepstral Coefficients for Low Bit-Rate Coding of Speech [C] . Boucheron Laura E., Leon Phillip L. De, Sandoval Steven Data Compression Conference . 2011

机译：用于低比特率编码的熔融谱系/矢量量化的语音低比特率编码
5. A new technique for low bit rate scalar quantization of image sub-band coefficients. [D] . Kurdziel, Michael Thomas. 2001

机译：用于图像子带系数的低比特率标量量化的新技术。
6. A scalable speech coding scheme using compressive sensing and orthogonal mapping based vector quantization [O] . M.S. Arun Sankar, P.S. Sathidevi 2019

机译：使用压缩感知和基于正交映射的矢量量化的可伸缩语音编码方案
7. APLIKASI SPEECH RECOGNITION BAHASA INDONESIA DENGAN METODE MEL-FREQUENCY CEPSTRAL COEFFICIENT DAN LINEAR VECTOR QUANTIZATION UNTUK PENGENDALIAN GERAK ROBOT [O] . Wicaksono Anggoro, Endah Sukmawati Nur, Adhy Satriyo, 100

机译：利用mEL频率系数法和线性矢量量化技术在机器人运动控制中应用印度尼西亚语音识别
8. Efficient Derivation and Approximation of Cepstral Coefficients for Speech Coding [R] . Limcangco, K. A. 1992

机译：语音编码倒谱系数的有效推导与逼近

Hybrid Scalar/Vector Quantization of Mel-Frequency Cepstral Coefficients for Low Bit-Rate Coding of Speech

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅