Speaker and Language Recognition Using Speech Codec Parameters

Abstract

In this paper, we investigate the effect of speech coding on speaker and language recognition tasks. Three coders were selected to cover a wide range of quality and bit rates: GSM at 12.2 kb/s, G.729 at 8 kb/s, and G.723.1 at 5.3 kb/s. Our objective is to measure recognition performance from either the synthesized speech or directly from the coder parameters themselves. We show that using speech synthesized from the three codecs, GMM-based speaker verification and phone-based language recognition performance generally degrades with coder bit rate, i.e., from GSM to G.729 to G.723.1, relative to an uncoded baseline. In addition, speaker verification for all codecs shows a performance decrease as the degree of mismatch between training and testing conditions increases, while language recognition exhibits no decrease in performance. We also present initial results in determining the relative importance of codec system components in their direct use for recognition tasks. For the G.729 codec, it is shown that removal of the post-filter in the decoder helps speaker verification performance under the mismatched condition. On the other hand, with use of G.729 LSF-based mel-cepstra, performance decreases under all conditions, indicating the need for a residual contribution to the feature representation.

Bibliographic Details

Authors
Quatieri, T. F.; Singer, E.; Dunn, R. B.; Reynolds, D. A.; Campbell, J. P.
Year: 1999
Pages: 1-5
Total pages: 5
Original format: PDF
Language of text: English
Chinese Library Classification: Industrial technology
Keywords
Speech recognition; Training; Performance (Engineering); Coders; Words (Language); Voice communications; Verification; Coding

Similar Literature

1. Novel Detection Algorithm of Speech Activity and the impact of Speech Codecs on Remote Speaker Recognition System [J]. Riadh Ajgou, Salim Sbaa, Said Ghendir. WSEAS Transactions on Signal Processing, 2014, No. Pta1.
2. A method to compensate the influence of speech codec in speaker recognition [J]. José R. Calvo de Lara, Flavio J. Reyes Diaz, Gabriel Hernández Sierra. International Journal of Speech Technology, 2018, No. 4.
3. Development of speech corpora for speaker recognition research and evaluation in Indian languages [J]. Hemant A. Patil, T.K. Basu. International Journal of Speech Technology, 2008, No. 1.
4. Speaker recognition using G.729 speech codec parameters [C]. Quatieri, T.F., Dunn, . 2000.
5. Objective speech intelligibility assessment using speech recognition and bigram statistics with application to low bit-rate codec evaluation [D]. Teng, Yan. 2006.
6. One-against-All Weighted Dynamic Time Warping for Language-Independent and Speaker-Dependent Speech Recognition in Adverse Conditions [O]. Xianglilan Zhang, Jiping Sun, Zhigang Luo. 2010.
7. Speaker And Language Recognition Using Speech Codec Parameters [O]. T. F. Quatieri et al. 2007.
