AN SVD-BASED SCHEME FOR MFCC COMPRESSION IN DISTRIBUTED SPEECH RECOGNITION SYSTEM

机译：分布式语音识别系统中的MFCC压缩基于SVD的方案

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes a new scheme for low bit-rate source coding of Mel Frequency Cepstral Coefficients (MFCCs) in Distributed Speech Recognition (DSR) system. The method uses the compressed ETSI Advanced Front-End (ETSI-AFE) features factorized into SVD components. By investigating the correlation property between successive MFCC frames, the odd ones are encoded using ETSI-AFE, while only the singular values and the nearest left singular vectors index are encoded and transmitted for the even frames. At the server side, the non-transmitted MFCCs are evaluated through their quantized singular values and the nearest left singular vectors. The system provides a compression bit-rate of 2.7 kbps. The recognition experiments were carried out on the Aurora-2 database for clean and multi-condition training modes. The simulation results show good recognition performance without significant degradation, with respect to the ETSI-AFE encoder.

机译：本文提出了分布式语音识别（DSR）系统中MEL频率谱系数（MFCC）的低比特率源编码的新方案。该方法使用压缩的ETSI高级前端（ETSI-AFE）分为SVD组件。通过研究连续MFCC帧之间的相关性，使用ETSI-AFE对奇数进行编码，而仅对偶数帧进行编码和传输奇异值和最近的左字奇异矢量索引。在服务器端，通过它们量化的奇异值和最近的左奇异向量来评估未发送的MFCC。该系统提供了2.7 kbps的压缩比特率。识别实验是在Aurora-2数据库上进行的，以进行清洁和多条件培训模式。仿真结果表明，对于ETSI-AFE编码器，良好的识别性能而无需显着降级。

著录项

来源
《Workshop on Automatic Speech Recognition and Understanding》|2013年||共6页
会议地点
作者
Azzedine Touazi; Mohamed Debyeche;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912.3-532;
关键词
Distributed speech recognition; MFCC coefficients; ETSI-AFE standard; SVD decomposition;

机译：分布式语音识别;MFCC系数;ETSI-AFE标准;SVD分解;

相似文献

外文文献
中文文献
专利

1. An efficient low bit-rate compression scheme of acoustic features for distributed speech recognition [J] . Touazi Azzedine, Debyeche Mohamed Computers and Electrical Engineering . 2016,第Null期

机译：用于分布式语音识别的声学特征的高效低比特率压缩方案
2. Energy-Efficient Floating-Point MFCC Extraction Architecture for Speech Recognition Systems [J] . Jo Jihyuck, Yoo Hoyoung, Park In-Cheol Very Large Scale Integration (VLSI) Systems, IEEE Transactions on . 2016,第2期

机译：语音识别系统的节能浮点MFCC提取架构
3. Arabic Speech Recognition System Based on MFCC and HMMs [J] . Hussien A. Elharati, Mohamed Alshaari, Veton Z. K?puska Journal of Computer and Communications . 2020,第3期

机译：基于MFCC和HMMS的阿拉伯语语音识别系统
4. An SVD-based scheme for MFCC compression in distributed speech recognition system [C] . Touazi Azzedine, Debyeche Mohamed IEEE Workshop on Automatic Speech Recognition and Understanding . 2013

机译：分布式语音识别系统中基于SVD的MFCC压缩方案
5. Compression algorithms for distributed classification with applications to distributed speech recognition. [D] . Srinivasamurthy, Naveen. 2007

机译：用于分布式分类的压缩算法及其在分布式语音识别中的应用。
6. Comparison of two SVD-based color image compression schemes [O] . Ying Li, Musheng Wei, Fengxia Zhang, 2012

机译：两种基于SVD的彩色图像压缩方案的比较
7. Speaker Independent Speech Recognition using MFCC with Cubic-Log Compression and VQ Analysis [O] . Neeraj Kaberpanthi, Ashutosh Datar 2015

机译：使用mFCC与立方对数压缩和VQ分析的扬声器独立语音识别

AN SVD-BASED SCHEME FOR MFCC COMPRESSION IN DISTRIBUTED SPEECH RECOGNITION SYSTEM

摘要

著录项

相似文献

相关主题

期刊订阅