Speech Clarity Index (Ψ): A Distance-based Speech Quality Indicator And Recognition Rate Prediction For Dysarthric Speakers With Cerebral Palsy

Prakasith KAYASITH; Thanaruk THEERAMUNKONG

首页> 外文期刊>IEICE Transactions on Information and Systems >Speech Clarity Index (Ψ): A Distance-based Speech Quality Indicator And Recognition Rate Prediction For Dysarthric Speakers With Cerebral Palsy

【24h】

Speech Clarity Index (Ψ): A Distance-based Speech Quality Indicator And Recognition Rate Prediction For Dysarthric Speakers With Cerebral Palsy

机译：语音清晰度指数（Ψ）：基于距离的语音麻痹性说话者说话者的语音质量指标和识别率预测

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

It is a tedious and subjective task to measure severity of a dysarthria by manually evaluating his/her speech using available standard assessment methods based on human perception. This paper presents an automated approach to assess speech quality of a dysarthric speaker with cerebral palsy. With the consideration of two complementary factors, speech consistency and speech distinction, a speech quality indicator called speech clarity index (Ψ) is proposed as a measure of the speaker's ability to produce consistent speech signal for a certain word and distinguished speech signal for different words. As an application, it can be used to assess speech quality and forecast speech recognition rate of speech made by an individual dysarthric speaker before actual exhaustive implementation of an automatic speech recognition system for the speaker. The effectiveness of Ψ as a speech recognition rate predictor is evaluated by rank-order inconsistency, correlation coefficient, and root-mean-square of difference. The evaluations had been done by comparing its predicted recognition rates with ones predicted by the standard methods called the articulatory and intelligibility tests based on the two recognition systems (HMM and ANN). The results show that Ψ is a promising indicator for predicting recognition rate of dysarthric speech. All experiments had been done on speech corpus composed of speech data from eight normal speakers and eight dysarthric speakers.

机译：通过使用基于人类感知的可用标准评估方法手动评估语音异常来测量构音障碍的严重程度是一项繁琐而主观的任务。本文提出了一种自动方法来评估患有脑瘫的发音异常的说话者的语音质量。考虑到语音互补性和语音区分性这两个互补因素，提出了一种语音质量指标，称为语音清晰度指数（a），用于衡量说话者针对某个单词产生一致的语音信号以及针对不同单词产生区分的语音信号的能力。。作为一种应用程序，它可以用于评估语音质量，并预测各个发音异常的说话者的语音识别率，然后再实际穷举实施自动语音识别系统。 rank作为语音识别率预测指标的有效性通过等级顺序不一致，相关系数和差异的均方根来评估。评估是通过将其预测的识别率与基于两种识别系统（HMM和ANN）的称为清晰度和清晰度测试的标准方法所预测的识别率进行比较而完成的。结果表明，Ψ是预测构音障碍语音识别率的有前途的指标。所有实验都是在语音语料库上进行的，该语料库由来自八名正常说话者和八位发音异常者的语音数据组成。

著录项

来源
《IEICE Transactions on Information and Systems》 |2009年第3期|p.460-468|共9页
作者
Prakasith KAYASITH; Thanaruk THEERAMUNKONG;
展开▼
作者单位

Information and Computer Technology School, Sirindhorn International Institute of Technology, Thammasat University, Thailand;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词
speech disorder; dysarthric speech recognition; speech assessment; speech quality index; recognition rate prediction;

机译：语言障碍;构音障碍语音识别;语音评估;语音质量指标;识别率预测;
入库时间 2022-08-18 00:27:24

相似文献

外文文献
中文文献
专利

1. Pronouncibility index (II): a distance-based and confusion-based speech quality measure for dysarthric speakers [J] . Prakasith Kayasith, Thanaruk Theeramunkong Knowledge and information systems . 2011,第3期

机译：发音指数（II）：反距离说话者的基于距离和基于混淆的语音质量度量
2. Pronouncibility index (Π): a distance-based and confusion-based speech quality measure for dysarthric speakers [J] . Prakasith Kayasith, Thanaruk Theeramunkong Knowledge and Information Systems . 2011,第3期

机译：发音指数（Π）：构音扬声器的基于距离和基于混淆的语音质量度量
3. Speech confusion index (Φ): A confusion-based speech quality indicator and recognition rate prediction for dysarthria [J] . Prakasith Kayasith, Thanaruk Theeramunkong Computers & mathematics with applications . 2009,第8期

机译：语音混淆指数（Φ）：基于混淆的语音质量指标和构音障碍的识别率预测
4. Speech Confusion Index (Φ): A Recognition Rate Indicator for Dysarthric Speakers [C] . Prakasith Kayasith, Thanaruk Theeramunkong, Nuttakorn Thubthong International Conference on Advances in Natural Language Processing(NLP, FinTAL2006); 20060823-25; Turku(FI) . 2006

机译：语音混淆指数（Φ）：说话异常的说话者识别率指标
5. Dysarthric Speech Recognition and Offline Handwriting Recognition using Deep Neural Networks. [D] . Pillai, Suhas Balkrishna. 2017

机译：使用深度神经网络的表情异常语音识别和离线手写识别。
6. Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition [O] . Myungjong Kim, Younggwan Kim, Joohong Yoo, -1

机译：KL-HMM的正则化说话人适应用于音调异常语音识别
7. Speech Clarity Index (.PSI.): A Distance-Based Speech Quality Indicator and Recognition Rate Prediction for Dysarthric Speakers with Cerebral Palsy [O] . Prakasith KAYASITH, Thanaruk THEERAMUNKONG 2009

机译：言语清晰度指数（.psi。）：一种基于距离的语音质量指标和患有脑瘫扰动扬声器的识别率预测
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

Speech Clarity Index (Ψ): A Distance-based Speech Quality Indicator And Recognition Rate Prediction For Dysarthric Speakers With Cerebral Palsy

摘要

著录项

相似文献

相关主题

期刊订阅