...
首页> 外文期刊>International journal of speech technology >TEO-based speaker stress assessment using hybrid classification and tracking schemes
【24h】

TEO-based speaker stress assessment using hybrid classification and tracking schemes

机译:使用混合分类和跟踪方案的基于TEO的说话人压力评估

获取原文
获取原文并翻译 | 示例
           

摘要

Speaker variability is known to have an adverse impact on speech systems that process linguistic content, such as speech and language recognition. However, speech production changes in individuals due to stress and emotions have similarly detrimental effect also on the task of speaker recognition as they introduce mismatch with the speaker models typically trained on modal speech. The focus of this study is on the analysis of stress-induced variations in speech and design of an automatic stress level assessment scheme that could be used in directing stress-dependent acoustic models or normalization strategies. Current stress detection methods typically employ a binary decision based on whether the speaker is or not under stress. In reality, the amount of stress in individuals varies and can change gradually. Using speech and biometric data collected in a real-world, variable-stress level law enforcement training scenario, this study considers two methods for stress level assessment. The first approach uses a nearest neighbor clustering scheme at the vowel token and sentence levels to classify speech data into three levels of stress. The second approach employs Euclidean distance metrics within the multi-dimensional feature space to provide real-time stress level tracking capability. Evaluations on audio data confirmed by biometric readings show both methods to be effective in assessment of stress level within a speaker (average accuracy of 55.6 % in a 3-way classification task). In addition, an impact of high-level stress on in-set speaker recognition is evaluated and shown to reduce the accuracy from 91.7 % (low/mid stress) to 21.4 % (high level stress).
机译:众所周知,说话人的可变性会对处理语言内容(例如语音和语言识别)的语音系统产生不利影响。然而,由于压力和情绪引起的个人语音产生变化也对说话者识别的任务具有类似的有害影响,因为它们引入了与通常在模态语音上训练的说话者模型的不匹配。这项研究的重点是分析语音中由语音引起的变化,以及设计一种自动应力水平评估方案,该方案可用于指导依赖于应力的声学模型或归一化策略。当前的压力检测方法通常基于说话者是否处于压力下而采用二进制判定。实际上,个人的压力大小是变化的,并且可以逐渐变化。使用在现实世界中可变压力水平的执法培训场景中收集的语音和生物统计数据,本研究考虑了两种压力水平评估方法。第一种方法在元音标记和句子级别使用最近邻居聚类方案,以将语音数据分为三个压力级别。第二种方法在多维特征空间内采用欧几里得距离度量,以提供实时应力水平跟踪功能。通过生物识别读数确认的音频数据评估显示,这两种方法均能有效评估说话者内的压力水平(三向分类任务中的平均准确度为55.6%)。此外,评估了高强度压力对内置说话人识别的影响,并显示将其准确性从91.7%(低/中压力)降低到21.4%(高压力)。

著录项

  • 来源
    《International journal of speech technology》 |2012年第3期|p.295-311|共17页
  • 作者单位

    Center for Robust Speech Systems (CRSS), University of Texas at Dallas, 800 West Campbell Rd, EC33, Richardson,TX 75080-3021, USA;

    Center for Robust Speech Systems (CRSS), University of Texas at Dallas, 800 West Campbell Rd, EC33, Richardson,TX 75080-3021, USA;

    Center for Robust Speech Systems (CRSS), University of Texas at Dallas, 800 West Campbell Rd, EC33, Richardson,TX 75080-3021, USA;

    Center for Robust Speech Systems (CRSS), University of Texas at Dallas, 800 West Campbell Rd, EC33, Richardson,TX 75080-3021, USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    stress assessment from speech; FLETC corpus; TEO operator;

    机译:言语压力评估;FLETC语料库;TEO操作员;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号