Towards Enhancing the Acoustic Models for Dysarthric Speech

Abstract

Dysarthria is a set of congenital and traumatic neuromotor disorders that impair the physical production of speech. These impairments reduce or remove the normal control of the vocal articulators. The acoustic characteristics of dysarthric speech are very different from those of speech collected from a normative population, with relatively larger intra-speaker inconsistencies in the temporal dynamics of the dysarthric speech. These inconsistencies result in poor audible quality for the dysarthric speech, and in low phone/speech recognition accuracy. Further, collecting and labeling dysarthric speech is extremely difficult, given the small number of people with these disorders and the poor quality of the speech, which complicates labeling. Hence, it would be of great interest to explore how to improve the efficiency of acoustic models built on small dysarthric speech databases such as Nemours [3], or to use speech databases collected from a normative population to build acoustic models for dysarthric speakers. In this work, we explore the latter approach.