Recognizing GSM digital speech

Gallardo-Antolin A.; Pelaez-Moreno C.; Diaz-de-Maria F.

首页> 外文期刊>IEEE Transactions on Speech and Audio Proceessing >Recognizing GSM digital speech

【24h】

Recognizing GSM digital speech

机译：识别GSM数字语音

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The Global System for Mobile (GSM) environment encompasses three main problems for automatic speech recognition (ASR) systems: noisy scenarios, source coding distortion, and transmission errors. The first one has already received much attention; however, source coding distortion and transmission errors must be explicitly addressed. In this paper, we propose an alternative front-end for speech recognition over GSM networks. This front-end is specially conceived to be effective against source coding distortion and transmission errors. Specifically, we suggest extracting the recognition feature vectors directly from the encoded speech (i.e., the bitstream) instead of decoding it and subsequently extracting the feature vectors. This approach offers two significant advantages. First, the recognition system is only affected by the quantization distortion of the spectral envelope. Thus, we are avoiding the influence of other sources of distortion as a result of the encoding-decoding process. Second, when transmission errors occur, our front-end becomes more effective since it is not affected by errors in bits allocated to the excitation signal. We have considered the half and the full-rate standard codecs and compared the proposed front-end with the conventional approach in two ASR tasks, namely, speaker-independent isolated digit recognition and speaker-independent continuous speech recognition. In general, our approach outperforms the conventional procedure, for a variety of simulated channel conditions. Furthermore, the disparity increases as the network conditions worsen.

机译：全球移动系统（GSM）环境包含自动语音识别（ASR）系统的三个主要问题：嘈杂的情况，源代码失真和传输错误。第一个已经引起了很多关注。但是，必须明确解决源代码编码失真和传输错误。在本文中，我们提出了另一种用于GSM网络上语音识别的前端。特别设计了此前端，可有效防止源代码失真和传输错误。具体而言，我们建议直接从编码语音（即比特流）中提取识别特征向量，而不是对其进行解码，然后再提取特征向量。这种方法具有两个明显的优点。首先，识别系统仅受频谱包络的量化失真的影响。因此，我们避免了由于编码解码过程而导致的其他失真源的影响。其次，当发生传输错误时，由于不受分配给激励信号的位错误的影响，我们的前端变得更加有效。我们已经考虑了半速率和全速率标准编解码器，并将拟议的前端与常规方法在两个ASR任务中进行了比较，即独立于说话者的隔离数字识别和独立于说话者的连续语音识别。通常，对于各种模拟信道条件，我们的方法要优于常规方法。此外，随着网络状况的恶化，视差增加。

著录项

来源
《IEEE Transactions on Speech and Audio Proceessing》 |2005年第6期|p.1186-1205|共20页
作者
Gallardo-Antolin A.; Pelaez-Moreno C.; Diaz-de-Maria F.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类电声技术和语音信号处理;
关键词
cellular radio; code standards; combined source-channel coding; decoding; distortion; feature extraction; quantisation (signal); radio networks; speech codecs; speech coding; speech recognition; GSM network; bit allocation; decoding process; digital speech recognition;

机译：蜂窝无线电;代码标准;组合信源信道编码;解码;失真;特征提取;量化（信号）;无线电网络;语音编解码器;语音编码;语音识别;GSM网络;比特分配;解码过程;数字语音识别;
入库时间 2022-08-18 00:12:54

相似文献

外文文献
中文文献
专利

1. "A speech recognizer" a tool to recognize the high clarity speech signal based on existing speech using ISCA [J] . Velammal M. Navaneetha, Kumar P. Nirmal Analog Integrated Circuits and Signal Processing . 2019,第1期

机译：“语音识别器”一种基于使用ISCA的现有语音识别高清晰度语音信号的工具
2. Data transmission over GSM voice channel using digital modulation technique based on autoregressive modeling of speech production [J] . Kotnik B., Mezgec Z., Sve?ko J., Digital Signal Processing . 2009,第4期

机译：基于语音产生自回归建模的数字调制技术通过GSM语音通道进行数据传输
3. Multi-level error detection and concealment algorithm to improve speech quality in GSM full rate speech codecs [J] . Wang Linfang, Liu Jia, Liu Xiaoqing, Tsinghua Science and Technology . 2011,第3期

机译：用于提高GSM全速率语音编解码器语音质量的多级错误检测和隐藏算法
4. Testing the ability of speech recognizers to measure the effectiveness of encoding algorithms for digital speech transmission [C] . Chernick, C.M., Leigh, . 1999

机译：测试语音识别器测量数字语音传输编码算法有效性的能力
5. A 1.5V multirate multibit sigma delta modulator for GSM/WCDMA in a 90nm digital CMOS process. [D] . Altun, Oguz. 2005

机译：适用于GSM / WCDMA的1.5V多速率多比特sigma delta调制器，采用90nm数字CMOS工艺。
6. Recognizing visual speech: Reduced responses in visual-movement regions but not other speech regions in autism [O] . Kamila Borowiak, Stefanie Schelinski, Katharina von Kriegstein 2018

机译：识别视觉语音：视觉运动区域的反应减少但自闭症的其他语音区域却没有
7. Recognizing GSM Digital Speech [O] . Ascensión Gallardo-antolín, Carmen Peláez-moreno, O Díaz-de-maría 2013

机译：认识到Gsm数字语音
8. Can Speech Recognizers Measure the Effectiveness of Encoding Algorithms for Digital Speech Transmission [R] . Chernick, C. M., Leigh, S., Mills, K. L., 1999

机译：语音识别器能否衡量数字语音传输编码算法的有效性

Recognizing GSM digital speech

摘要

著录项

相似文献

相关主题

期刊订阅