首页> 外文期刊>Trends in Hearing >Measuring Speech Recognition With a Matrix Test Using Synthetic Speech
【24h】

Measuring Speech Recognition With a Matrix Test Using Synthetic Speech

机译:使用合成语音的矩阵测试测量语音识别

获取原文
       

摘要

Speech audiometry is an essential part of audiological diagnostics and clinical measurements. Development times of speech recognition tests are rather long, depending on the size of speech corpus and optimization necessity. The aim of this study was to examine whether this development effort could be reduced by using synthetic speech in speech audiometry, especially in a matrix test for speech recognition. For this purpose, the speech material of the German matrix test was replicated using a preselected commercial system to generate the synthetic speech files. In contrast to the conventional matrix test, no level adjustments or optimization tests were performed while producing the synthetic speech material. Evaluation measurements were conducted by presenting both versions of the German matrix test (with natural or synthetic speech), alternately and at three different signal-to-noise ratios, to 48 young, normal-hearing participants. Psychometric functions were fitted to the empirical data. Speech recognition thresholds were 0.5?dB signal-to-noise ratio higher (worse) for the synthetic speech, while slopes were equal for both speech types. Nevertheless, speech recognition scores were comparable with the literature and the threshold difference lay within the same range as recordings of two different natural speakers. Although no optimization was applied, the synthetic-speech signals led to equivalent recognition of the different test lists and word categories. The outcomes of this study indicate that the application of synthetic speech in speech recognition tests could considerably reduce the development costs and evaluation time. This offers the opportunity to increase the speech corpus for speech recognition tests with acceptable effort.
机译:语音测听是听力学诊断和临床测量的重要组成部分。语音识别测试的开发时间相当长,具体取决于语音语料库的大小和优化的必要性。这项研究的目的是研究通过在语音测听中使用合成语音,特别是在语音识别矩阵测试中,是否可以减少这种开发工作。为此,使用预选的商业系统复制了德语矩阵测试的语音材料,以生成合成语音文件。与常规矩阵测试相反,在生产合成语音材料时不执行任何级别调整或优化测试。通过对48名年轻的正常听觉参与者交替和以三种不同的信噪比呈现德国矩阵测试的两种版本(使用自然或合成语音)进行评估测量。心理测验函数适合于经验数据。合成语音的语音识别阈值比信噪比高0.5?dB(更差),而两种语音类型的斜率均相等。然而,语音识别分数与文献相当,并且阈值差异与两个自然说话者的录音在同一范围内。尽管未应用优化,但合成语音信号导致对不同测试列表和单词类别的等同识别。这项研究的结果表明,合成语音在语音识别测试中的应用可以大大降低开发成本和评估时间。这提供了以可接受的努力增加用于语音识别测试的语音语料库的机会。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号