Automatic speech recognition in neurodegenerative disease

Schultz Benjamin G.; Tarigoppula Venkata S. Aditya; Noffs Gustavo; Rojas Sandra; van der Walt Anneke; Grayden David B.; Vogel Adam P.

首页> 外文期刊>International journal of speech technology >Automatic speech recognition in neurodegenerative disease

【24h】

Automatic speech recognition in neurodegenerative disease

机译：神经变性疾病中的自动语音识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Automatic speech recognition (ASR) could potentially improve communication by providing transcriptions of speech in real time. ASR is particularly useful for people with progressive disorders that lead to reduced speech intelligibility or difficulties performing motor tasks. ASR services are usually trained on healthy speech and may not be optimized for impaired speech, creating a barrier for accessing augmented assistance devices. We tested the performance of three state-of-the-art ASR platforms on two groups of people with neurodegenerative disease and healthy controls. We further examined individual differences that may explain errors in ASR services within groups, such as age and sex. Speakers were recorded while reading a standard text. Speech was elicited from individuals with multiple sclerosis, Friedreich's ataxia, and healthy controls. Recordings were manually transcribed and compared to ASR transcriptions using Amazon Web Services, Google Cloud, and IBM Watson. Accuracy was measured as the proportion of words that were correctly classified. ASR accuracy was higher for controls than clinical groups, and higher for multiple sclerosis compared to Friedreich's ataxia for all ASR services. Amazon Web Services and Google Cloud yielded higher accuracy than IBM Watson. ASR accuracy decreased with increased disease duration. Age and sex did not significantly affect ASR accuracy. ASR faces challenges for people with neuromuscular disorders. Until improvements are made in recognizing less intelligible speech, the true value of ASR for people requiring augmented assistance devices and alternative communication remains unrealized. We suggest potential methods to improve ASR for those with impaired speech.

机译：自动语音识别（ASR）可能通过实时提供语音的转录来改善通信。 ASR对渐进式疾病的人群特别有用，导致演讲可懂度或难以执行运动任务的困难。 ASR服务通常受到健康演讲的培训，可能不会针对损害的语音进行优化，从而为访问增强辅助设备的障碍。我们在两组患有神经变性疾病和健康对照的两组人群中测试了三个最先进的ASR平台的表现。我们进一步审查了可能在年龄和性别等团体中解释ASR服务错误的个人差异。读写标准文本时录制了扬声器。讲话是从具有多发性硬化症，弗里德莱希的共济失调和健康控制的个体的讲话。手动转录并将录制转录并与使用亚马逊Web服务，Google云和IBM Watson的ASR转录进行比较。测量准确性作为正确分类的词语比例。对于对照组的准确性比临床组更高，并且与Friedreich的所有ASR服务的共济失调相比，多发性硬化的较高。亚马逊网络服务和谷歌云的准确性高于IBM Watson。 ASR精度随着疾病持续时间的增加而降低。年龄和性别没有显着影响ASR的准确性。 ASR面临着神经肌肉疾病的人的挑战。直到在认识到较少可理解的演讲中，为需要增强辅助设备和替代通信的人的ASR的真实价值仍然是未实现的。我们建议改善言论减弱的人的潜在方法。

著录项

来源
《International journal of speech technology》 |2021年第3期|771-779|共9页
作者
Schultz Benjamin G.; Tarigoppula Venkata S. Aditya; Noffs Gustavo; Rojas Sandra; van der Walt Anneke; Grayden David B.; Vogel Adam P.;
展开▼
作者单位

Univ Melbourne Dept Audiol & Speech Pathol Ctr Neurosci Speech 550 Swanston St Melbourne Vic 3053 Australia|Maastricht Univ Fac Psychol & Neurosci Dept Neuropsychol & Psychopharmacol Maastricht Netherlands;

Univ Melbourne Dept Biomed Engn Melbourne Vic Australia|Univ Melbourne ARC Training Ctr Cognit Comp Med Technol Melbourne Vic Australia;

Univ Melbourne Dept Audiol & Speech Pathol Ctr Neurosci Speech 550 Swanston St Melbourne Vic 3053 Australia;

Univ Melbourne Dept Audiol & Speech Pathol Ctr Neurosci Speech 550 Swanston St Melbourne Vic 3053 Australia;

Monash Univ Dept Neurosci Cent Clin Sch Melbourne Vic Australia;

Univ Melbourne Dept Biomed Engn Melbourne Vic Australia|Univ Melbourne ARC Training Ctr Cognit Comp Med Technol Melbourne Vic Australia;

Univ Melbourne Dept Audiol & Speech Pathol Ctr Neurosci Speech 550 Swanston St Melbourne Vic 3053 Australia|Redenlab Melbourne Vic Australia;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Automatic Speech Recognition; Dysarthria; Neurodegenerative disease; Augmented assistive communication technology; Communication;

机译：自动语音识别;讨厌;神经退行性疾病;增强辅助通信技术;沟通;
入库时间 2022-08-19 02:31:22

相似文献

外文文献
中文文献
专利

1. Bridging automatic speech recognition and psycholinguistics: Extending Shortlist to an end-to-end model of human speech recognition (L) [J] . Odette Scharenborg, Louis ten Bosch, Lou Boves, The Journal of the Acoustical Society of America . 2003,第6期

机译：桥接自动语音识别和心理语言学：将候选清单扩展到人类语音识别的端到端模型（L）
2. Brain-inspired speech segmentation for automatic speech recognition using the speech envelope as a temporal reference [J] . Byeongwook Lee, Kwang-Hyun Cho Scientific reports. . 2016,第1期

机译：以语音包络作为时间参考的自动语音识别的大脑启发式语音分割
3. Auditory-Inspired Morphological Processing of Speech Spectrograms: Applications in Automatic Speech Recognition and Speech Enhancement [J] . Joyner Cadore, Francisco J. Valverde-Albacete, Ascensión Gallardo-Antolín, Cognitive Computation . 2013,第4期

机译：语音频谱图的听觉启发式形态处理：在自动语音识别和语音增强中的应用
4. Effectiveness of Speech Analysis in Classification of Neurodegenerative Diseases: A Study on Parkinson's Disease [C] . Sai Bharadwaj Appakaya, Ravi Sankar SoutheastCon . 2018

机译：语音分析在神经退行性疾病分类中的有效性：帕金森氏病研究
5. A multimodal fusion approach for automatic postal address recognition system using Optical Character Recognition (OCR) and Automatic Speech Recognition (ASR) techniques. [D] . Singh, Amriteshwar. 2011

机译：一种使用光学字符识别（OCR）和自动语音识别（ASR）技术的自动邮政地址识别系统的多模式融合方法。
6. Brain-inspired speech segmentation for automatic speech recognition using the speech envelope as a temporal reference [O] . Byeongwook Lee, Kwang-Hyun Cho -1

机译：以语音包络作为时间参考的自动语音识别的大脑启发式语音分割
7. Automatic speech recognition in neurodegenerative disease [O] . Benjamin G. Schultz, Venkata S. Aditya Tarigoppula, Gustavo Noffs, 2021

机译：神经变性疾病中的自动语音识别

Automatic speech recognition in neurodegenerative disease

摘要

著录项

相似文献

相关主题

期刊订阅