Journal: Logopedics, Phoniatrics, Vocology
Automatic intelligibility assessment of pathologic speech over the telephone

Abstract

Objective assessment of intelligibility on the telephone is desirable for voice and speech assessment and rehabilitation. A total of 82 patients after partial laryngectomy read a standardized text which was synchronously recorded by a headset and via telephone. Five experienced raters assessed intelligibility perceptually on a five-point scale. Objective evaluation was performed by support vector regression on the word accuracy (WA) and word correctness (WR) of a speech recognition system, and a set of prosodic features. WA and WR alone exhibited correlations to human evaluation between |r| = 0.57 and |r| = 0.75. The correlation was r = 0.79 for headset and r = 0.86 for telephone recordings when prosodic features and WR were combined. The best feature subset was optimal for both signal qualities. It consists of WR, the average duration of the silent pauses before a word, the standard deviation of the fundamental frequency on the entire sample, the standard deviation of jitter, and the ratio of the durations of the voiced sections and the entire recording.
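
The abstract does not spell out how WA and WR are computed or how agreement with the raters is measured. As a minimal sketch, the Python below assumes the definitions commonly used in speech recognition scoring: with N reference words and S substitutions, D deletions, and I insertions from a minimum-edit alignment, WR = (N - S - D) / N and WA = (N - S - D - I) / N, both as percentages. It also assumes that agreement is measured with Pearson's r. The function names and the example sentence are hypothetical and not taken from the study. The support vector regression step is not reproduced here.

```python
from math import sqrt


def align_counts(reference, hypothesis):
    """Return (substitutions, deletions, insertions) of a minimum-edit alignment."""
    n, m = len(reference), len(hypothesis)
    # cost[i][j] = (total_edits, subs, dels, ins) for reference[:i] vs hypothesis[:j]
    cost = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = (0, 0, 0, 0)
    for i in range(1, n + 1):
        cost[i][0] = (i, 0, i, 0)  # every reference word deleted
    for j in range(1, m + 1):
        cost[0][j] = (j, 0, 0, j)  # every hypothesis word inserted
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            t, s, d, ins = cost[i - 1][j - 1]
            if reference[i - 1] == hypothesis[j - 1]:
                diagonal = (t, s, d, ins)          # match, no edit
            else:
                diagonal = (t + 1, s + 1, d, ins)  # substitution
            t, s, d, ins = cost[i - 1][j]
            deletion = (t + 1, s, d + 1, ins)
            t, s, d, ins = cost[i][j - 1]
            insertion = (t + 1, s, d, ins + 1)
            cost[i][j] = min(diagonal, deletion, insertion, key=lambda c: c[0])
    _, subs, dels, inss = cost[n][m]
    return subs, dels, inss


def word_scores(reference, hypothesis):
    """Return (WA, WR) in percent under the assumed definitions."""
    subs, dels, inss = align_counts(reference, hypothesis)
    n = len(reference)
    wr = 100.0 * (n - subs - dels) / n         # word correctness ignores insertions
    wa = 100.0 * (n - subs - dels - inss) / n  # word accuracy penalizes them
    return wa, wr


def pearson_r(x, y):
    """Pearson correlation between two equally long sequences of numbers."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


# Hypothetical example: one substituted word ("win") and one inserted word ("a").
reference = "the north wind and the sun".split()
hypothesis = "the north win and a the sun".split()
wa, wr = word_scores(reference, hypothesis)
```

Because WA subtracts insertions and WR does not, WA is never higher than WR for the same recognition result. The sign of r depends on the direction of the rating scale, which the abstract does not state; this may be why the single-feature correlations are given as |r|.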