Estimation of Speech Intelligibility Using Speech Recognition Systems

Yusuke TAKANO; Kazuhiro KONDO


Abstract

We attempted to estimate subjective scores of the Japanese Diagnostic Rhyme Test (DRT), a forced-choice speech intelligibility test in which the listener selects one of two words. We used automatic speech recognizers with language models that force a choice of one of the words in the word pair, mimicking the human recognition process of the DRT. Initial testing used speaker-independent models, which yielded scores significantly lower than the subjective scores. The acoustic models were then adapted to each of the speakers in the corpus, and then adapted to noise at a specified SNR. Three types of noise were tested: white noise, multi-talker (babble) noise, and pseudo-speech noise. The match between subjective and estimated scores improved significantly with the noise-adapted models compared to the speaker-independent and speaker-adapted models, provided that the adapted noise level and the tested noise level matched. However, when the SNR conditions did not match, the recognition scores degraded, especially when the tested SNR was higher than the adapted noise level. Accordingly, we adapted the models to mixed levels of noise, i.e., multi-condition training. The models adapted in this way yielded intelligibility scores that matched subjective intelligibility relatively well across all noise levels. The correlation between subjective and estimated intelligibility scores increased to 0.94 with multi-talker noise, 0.93 with white noise, and 0.89 with pseudo-speech noise, while the root mean square error (RMSE) decreased from more than 40 to 13.10, 13.05, and 16.06, respectively.
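The evaluation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the use of per-word recognizer scores (for example, log-likelihoods), the plain percent-correct scoring, and all numbers are assumptions made for illustration. The sketch shows a two-way forced choice between the words of a pair, followed by the two agreement measures named in the abstract, Pearson correlation and RMSE.

```python
import math


def forced_choice(score_a, score_b):
    """Pick word 0 or word 1 of a pair, whichever has the higher score.

    The scores stand in for recognizer outputs such as log-likelihoods
    (an assumption; the abstract does not specify the score type).
    """
    return 0 if score_a >= score_b else 1


def percent_correct(trials):
    """Percentage of trials in which the forced choice matches the spoken word.

    trials: list of (score_word_0, score_word_1, index_of_spoken_word).
    Plain percent correct is assumed here; the paper may score differently.
    """
    hits = sum(1 for a, b, spoken in trials if forced_choice(a, b) == spoken)
    return 100.0 * hits / len(trials)


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def rmse(x, y):
    """Root mean square error between two equal-length sequences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)) / len(x))


# Hypothetical recognizer scores for four word-pair trials.
trials = [(-10.0, -12.0, 0), (-8.0, -7.5, 0), (-9.0, -11.0, 0), (-6.0, -5.0, 1)]
estimated_score = percent_correct(trials)  # 3 of 4 choices are correct

# Hypothetical subjective vs. estimated intelligibility per noise condition.
subjective = [90.0, 70.0, 50.0]
estimated = [80.0, 60.0, 40.0]
print(estimated_score, pearson(subjective, estimated), rmse(subjective, estimated))
```

In this made-up example the estimated scores track the subjective ones perfectly in rank and spacing, so the correlation is high even though the RMSE is 10 points. This is why the abstract reports both measures.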

Bibliographic details

Source
IEICE Transactions on Information and Systems, 2010, No. 12, pp. 3368-3376 (9 pages)
Authors
Yusuke TAKANO; Kazuhiro KONDO
Affiliation
Graduate School of Science and Engineering, Yamagata University, Yonezawa-shi, 992-8510 Japan
Indexed in
Science Citation Index (SCI); Engineering Index (EI)
Original format
PDF
Language
English
Chinese Library Classification
Radio electronics; telecommunications technology
Keywords
objective estimation; speech intelligibility; speech recognition; Japanese Diagnostic Rhyme Test; noise adaptation
Date added to database
2024-01-25 20:33:59
