Conference: ITG Symposium on Speech Communication

Voice and Speech Assessment From Telephone Recordings Using Prosodic Analysis Based on μ-Law-Companded Features


Abstract

Objective assessment of voice and speech properties via telephone is desirable for rehabilitation purposes. A total of 82 patients who had undergone partial laryngectomy read a standardized text over the phone. Five experienced raters perceptually assessed speech effort, match of breath and sense units, vocal tone, intelligibility, and overall voice quality based on these recordings. Objective evaluation used the word accuracy and word correctness of a speech recognition system, together with a set of prosodic features. The speech recognition system used mu-law features, i.e., modified Mel-Frequency Cepstrum Coefficients (MFCCs). The prosodic features were computed from the word hypothesis graphs produced by the speech recognizer. The human-machine correlation between these features and the perceptual evaluation shows slightly better results for the system based on mu-law features than for the baseline MFCC system.
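
The abstract names word correctness and word accuracy as objective measures but does not define them. The sketch below uses the definitions common in speech recognition evaluation, which are assumed here rather than taken from the paper: with N reference words and S substitutions, D deletions, and I insertions in a minimum-edit alignment, word correctness is (N - S - D) / N and word accuracy is (N - S - D - I) / N. The function names and the example sentence are hypothetical and serve only as illustration.

```python
def align_counts(reference, hypothesis):
    """Count (substitutions, deletions, insertions) in a minimum-edit alignment."""
    n, m = len(reference), len(hypothesis)
    # cost[i][j] = (total_errors, subs, dels, ins) for reference[:i] vs hypothesis[:j]
    cost = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = (0, 0, 0, 0)
    for i in range(1, n + 1):
        cost[i][0] = (i, 0, i, 0)      # only deletions
    for j in range(1, m + 1):
        cost[0][j] = (j, 0, 0, j)      # only insertions
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            total, subs, dels, ins = cost[i - 1][j - 1]
            if reference[i - 1] == hypothesis[j - 1]:
                diagonal = (total, subs, dels, ins)          # correct word
            else:
                diagonal = (total + 1, subs + 1, dels, ins)  # substitution
            total, subs, dels, ins = cost[i - 1][j]
            deletion = (total + 1, subs, dels + 1, ins)
            total, subs, dels, ins = cost[i][j - 1]
            insertion = (total + 1, subs, dels, ins + 1)
            # Tuples compare by total error count first.
            cost[i][j] = min(diagonal, deletion, insertion)
    _, subs, dels, ins = cost[n][m]
    return subs, dels, ins


def word_scores(reference, hypothesis):
    """Return (word correctness, word accuracy) in percent."""
    if not reference:
        raise ValueError("reference must contain at least one word")
    subs, dels, ins = align_counts(reference, hypothesis)
    n = len(reference)
    correctness = 100.0 * (n - subs - dels) / n
    accuracy = 100.0 * (n - subs - dels - ins) / n
    return correctness, accuracy


# Hypothetical example: one inserted word and one substituted word.
reference = "the north wind and the sun".split()
hypothesis = "the north wind and and the son".split()
correctness, accuracy = word_scores(reference, hypothesis)
```

Because insertions are penalized only in word accuracy, accuracy can never exceed correctness and can become negative for a recognizer that inserts many spurious words.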
