Improving Speech to Text Alignment Based on Repetition Detection for Dysarthric Speech

Diwakar G.; Karjigi Veena

Abstract

Aligning a transcription to speech has applications in video subtitling and in human-computer interaction through natural-language communication, among others. Despite many advances, aligning a transcription to speech remains challenging, and it can be even more challenging for dysarthric speech. Dysarthria is a motor speech disorder that results from damage to the peripheral or central nervous system and causes a slow speaking rate, pronunciation deviations, and prolonged pauses between words and syllables. One problem in aligning dysarthric speech to text is repetition, which can occur at the syllable, word, or phrase level. In this work, we propose an algorithm for syllable boundary detection followed by syllable repetition detection in dysarthric speech. When a syllable is found to be repeated in the speech, it is automatically repeated in the transcription as well. The modified transcription is then given to the aligner along with the dysarthric speech. Tested on word alignment with 15 utterances containing 146 words, the proposed system achieved a root mean square error (RMSE) of 0.138, compared with an RMSE of 0.276 for existing work in the literature.
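The two steps that the abstract describes in enough detail to illustrate are the transcription modification and the RMSE evaluation. The Python sketch below is illustrative only and is not the authors' implementation: the function names `repeat_syllables` and `rmse`, the dictionary format for detected repetitions, and the example syllables and boundary values are all assumptions. The syllable boundary detector, the repetition detector, and the aligner are not specified in the abstract and are not reproduced here.

```python
import math


def repeat_syllables(syllables, extra_repeats):
    """Return a modified syllable sequence for the aligner.

    syllables: list of str, the original transcription split into syllables.
    extra_repeats: dict mapping a syllable index to the number of extra
        times that syllable was detected in the speech (hypothetical format;
        in the paper this information comes from the repetition detector).
    """
    modified = []
    for index, syllable in enumerate(syllables):
        # Emit the syllable once, plus once more per detected repetition.
        modified.extend([syllable] * (1 + extra_repeats.get(index, 0)))
    return modified


def rmse(reference, predicted):
    """Root mean square error between two equal-length boundary lists."""
    if len(reference) != len(predicted) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(r - p) ** 2 for r, p in zip(reference, predicted)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical example: the first syllable was spoken twice.
print(repeat_syllables(["wa", "ter"], {0: 1}))  # ['wa', 'wa', 'ter']

# Hypothetical reference and predicted word boundaries.
print(rmse([1.0, 2.0], [1.5, 2.5]))  # 0.5
```

Keeping the transcription modification separate from the aligner mirrors the pipeline in the abstract, where the aligner itself is unchanged and only its text input is adjusted.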

Bibliographic details

Source: Circuits, Systems and Signal Processing, 2020, No. 11, pp. 5543-5567 (25 pages)
Authors: Diwakar G.; Karjigi Veena
Affiliation (both authors): Siddaganga Inst Technol, Dept Elect & Commun, Tumakuru, Karnataka, India
Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
Original format: PDF
Language: English
Keywords: Alignment; Dysarthria; Repetition; Transcription

