首页> 外文期刊>Circuits, systems and signal processing >Improving Speech to Text Alignment Based on Repetition Detection for Dysarthric Speech
【24h】

Improving Speech to Text Alignment Based on Repetition Detection for Dysarthric Speech

机译:基于重复检测对发育arthric语音的重复检测改进语音

获取原文
获取原文并翻译 | 示例
           

摘要

Alignment of transcription to the speech finds applications in video subtitling, human-computer interaction by means of natural language communication, etc. In spite of many advancements, alignment of transcription to speech remains a challenging task and may become even more challenging for dysarthric speech. Dysarthria is a motor speech disorder resulting from damaged peripheral or central nervous system and causes slow speaking rate, pronunciation deviations, and prolonged pause interval between words and syllables. One of the problems in aligning dysarthric speech to text is the presence of repetition. Repetition can be at syllable/word/phrase level. In this work, we proposed an algorithm for syllable boundary detection followed by syllable repetition detection in dysarthric speech. When a syllable is found to be repeated, that syllable is repeated automatically in the transcription also. Modified transcription is given to the aligner along with the dysarthric speech. The proposed system when tested for word alignment with 15 utterances containing 146 words resulted in root mean square error (RMSE) of 0.138 when compared with the existing work in the literature, which gives an RMSE of 0.276.
机译:转录对语音的对准发现视频字幕的应用,通过自然语言通信等的人机交互等。尽管有许多进步,转录对语音的对准仍然是一个具有挑战性的任务,并且对于发育不良言论可能会变得更具挑战性。扰动性是由损坏的外围或中枢神经系统产生的电机语音障碍,并导致口语速度慢,发音偏差和延长单词和音节之间的延长暂停间隔。将发育不良语音与文本对齐的问题之一是存在重复。重复可以是音节/单词/短语级别。在这项工作中,我们提出了一种用于音节边界检测的算法,其次是在发育arthric语音中的音节重复检测。当发现一个音节重复时,还会在转录中自动重复该音节。改性转录与对准器一起以及缺陷言论。当与文献中的现有工作相比,当与包含146个单词的15个话有关的146个单词进行146个单词的话语,提出的系统,其具有0.276的RMSE,导致0.138的均方根误差(RMSE)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号