首页> 外文期刊>International journal of speech technology >Automatic speech recognition in neurodegenerative disease
【24h】

Automatic speech recognition in neurodegenerative disease

机译:神经变性疾病中的自动语音识别

获取原文
获取原文并翻译 | 示例
       

摘要

Automatic speech recognition (ASR) could potentially improve communication by providing transcriptions of speech in real time. ASR is particularly useful for people with progressive disorders that lead to reduced speech intelligibility or difficulties performing motor tasks. ASR services are usually trained on healthy speech and may not be optimized for impaired speech, creating a barrier for accessing augmented assistance devices. We tested the performance of three state-of-the-art ASR platforms on two groups of people with neurodegenerative disease and healthy controls. We further examined individual differences that may explain errors in ASR services within groups, such as age and sex. Speakers were recorded while reading a standard text. Speech was elicited from individuals with multiple sclerosis, Friedreich's ataxia, and healthy controls. Recordings were manually transcribed and compared to ASR transcriptions using Amazon Web Services, Google Cloud, and IBM Watson. Accuracy was measured as the proportion of words that were correctly classified. ASR accuracy was higher for controls than clinical groups, and higher for multiple sclerosis compared to Friedreich's ataxia for all ASR services. Amazon Web Services and Google Cloud yielded higher accuracy than IBM Watson. ASR accuracy decreased with increased disease duration. Age and sex did not significantly affect ASR accuracy. ASR faces challenges for people with neuromuscular disorders. Until improvements are made in recognizing less intelligible speech, the true value of ASR for people requiring augmented assistance devices and alternative communication remains unrealized. We suggest potential methods to improve ASR for those with impaired speech.
机译:自动语音识别(ASR)可能通过实时提供语音的转录来改善通信。 ASR对渐进式疾病的人群特别有用,导致演讲可懂度或难以执行运动任务的困难。 ASR服务通常受到健康演讲的培训,可能不会针对损害的语音进行优化,从而为访问增强辅助设备的障碍。我们在两组患有神经变性疾病和健康对照的两组人群中测试了三个最先进的ASR平台的表现。我们进一步审查了可能在年龄和性别等团体中解释ASR服务错误的个人差异。读写标准文本时录制了扬声器。讲话是从具有多发性硬化症,弗里德莱希的共济失调和健康控制的个体的讲话。手动转录并将录制转录并与使用亚马逊Web服务,Google云和IBM Watson的ASR转录进行比较。测量准确性作为正确分类的词语比例。对于对照组的准确性比临床组更高,并且与Friedreich的所有ASR服务的共济失调相比,多发性硬化的较高。亚马逊网络服务和谷歌云的准确性高于IBM Watson。 ASR精度随着疾病持续时间的增加而降低。年龄和性别没有显着影响ASR的准确性。 ASR面临着神经肌肉疾病的人的挑战。直到在认识到较少可理解的演讲中,为需要增强辅助设备和替代通信的人的ASR的真实价值仍然是未实现的。我们建议改善言论减弱的人的潜在方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号