首页> 美国卫生研究院文献>Trends in Hearing >Measuring Speech Recognition With a Matrix Test Using SyntheticSpeech
【2h】

Measuring Speech Recognition With a Matrix Test Using SyntheticSpeech

机译:使用合成的矩阵测试测量语音识别言语

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Speech audiometry is an essential part of audiological diagnostics and clinical measurements. Development times of speech recognition tests are rather long, depending on the size of speech corpus and optimization necessity. The aim of this study was to examine whether this development effort could be reduced by using synthetic speech in speech audiometry, especially in a matrix test for speech recognition. For this purpose, the speech material of the German matrix test was replicated using a preselected commercial system to generate the synthetic speech files. In contrast to the conventional matrix test, no level adjustments or optimization tests were performed while producing the synthetic speech material. Evaluation measurements were conducted by presenting both versions of the German matrix test (with natural or synthetic speech), alternately and at three different signal-to-noise ratios, to 48 young, normal-hearing participants. Psychometric functions were fitted to the empirical data. Speech recognition thresholds were 0.5 dB signal-to-noise ratio higher (worse) for the synthetic speech, while slopes were equal for both speech types. Nevertheless, speech recognition scores were comparable with the literature and the threshold difference lay within the same range as recordings of twodifferent natural speakers. Although no optimization was applied, thesynthetic-speech signals led to equivalent recognition of the different testlists and word categories. The outcomes of this study indicate that theapplication of synthetic speech in speech recognition tests could considerablyreduce the development costs and evaluation time. This offers the opportunity toincrease the speech corpus for speech recognition tests with acceptableeffort.
机译:语音测听是听力学诊断和临床测量的重要组成部分。语音识别测试的开发时间相当长,具体取决于语音语料库的大小和优化的必要性。这项研究的目的是研究通过在语音测听中使用合成语音,特别是在语音识别的矩阵测试中,是否可以减少这种开发工作。为此,使用预选的商业系统复制了德语矩阵测试的语音材料,以生成合成语音文件。与常规矩阵测试相反,在生产合成语音材料时不执行任何级别调整或优化测试。通过对48名年轻的正常听觉参与者交替和以三种不同的信噪比呈现德国矩阵测试的两种版本(使用自然或合成语音)进行评估测量。心理测量函数适合于经验数据。合成语音的语音识别阈值比信噪比高0.5?dB(更差),而两种语音类型的斜率均相等。然而,语音识别分数与文献相当,并且阈值差异与两个录音的阈值在同一范围内不同的自然说话者。尽管未应用优化,但合成语音信号导致对不同测试的等同识别列表和单词类别。这项研究的结果表明合成语音在语音识别测试中的应用可能会很大减少开发成本和评估时间。这提供了机会增加可接受的语音识别测试的语料库努力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号