TEO-based speaker stress assessment using hybrid classification and tracking schemes

John H.L. Hansen; Evan Ruzanski; Hynek Bofil; James Meyerhoff

首页> 外文期刊>International journal of speech technology >TEO-based speaker stress assessment using hybrid classification and tracking schemes

【24h】

TEO-based speaker stress assessment using hybrid classification and tracking schemes

机译：使用混合分类和跟踪方案的基于TEO的说话人压力评估

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speaker variability is known to have an adverse impact on speech systems that process linguistic content, such as speech and language recognition. However, speech production changes in individuals due to stress and emotions have similarly detrimental effect also on the task of speaker recognition as they introduce mismatch with the speaker models typically trained on modal speech. The focus of this study is on the analysis of stress-induced variations in speech and design of an automatic stress level assessment scheme that could be used in directing stress-dependent acoustic models or normalization strategies. Current stress detection methods typically employ a binary decision based on whether the speaker is or not under stress. In reality, the amount of stress in individuals varies and can change gradually. Using speech and biometric data collected in a real-world, variable-stress level law enforcement training scenario, this study considers two methods for stress level assessment. The first approach uses a nearest neighbor clustering scheme at the vowel token and sentence levels to classify speech data into three levels of stress. The second approach employs Euclidean distance metrics within the multi-dimensional feature space to provide real-time stress level tracking capability. Evaluations on audio data confirmed by biometric readings show both methods to be effective in assessment of stress level within a speaker (average accuracy of 55.6 % in a 3-way classification task). In addition, an impact of high-level stress on in-set speaker recognition is evaluated and shown to reduce the accuracy from 91.7 % (low/mid stress) to 21.4 % (high level stress).

机译：众所周知，说话人的可变性会对处理语言内容（例如语音和语言识别）的语音系统产生不利影响。然而，由于压力和情绪引起的个人语音产生变化也对说话者识别的任务具有类似的有害影响，因为它们引入了与通常在模态语音上训练的说话者模型的不匹配。这项研究的重点是分析语音中由语音引起的变化，以及设计一种自动应力水平评估方案，该方案可用于指导依赖于应力的声学模型或归一化策略。当前的压力检测方法通常基于说话者是否处于压力下而采用二进制判定。实际上，个人的压力大小是变化的，并且可以逐渐变化。使用在现实世界中可变压力水平的执法培训场景中收集的语音和生物统计数据，本研究考虑了两种压力水平评估方法。第一种方法在元音标记和句子级别使用最近邻居聚类方案，以将语音数据分为三个压力级别。第二种方法在多维特征空间内采用欧几里得距离度量，以提供实时应力水平跟踪功能。通过生物识别读数确认的音频数据评估显示，这两种方法均能有效评估说话者内的压力水平（三向分类任务中的平均准确度为55.6％）。此外，评估了高强度压力对内置说话人识别的影响，并显示将其准确性从91.7％（低/中压力）降低到21.4％（高压力）。

著录项

来源
《International journal of speech technology》 |2012年第3期|p.295-311|共17页
作者
John H.L. Hansen; Evan Ruzanski; Hynek Bofil; James Meyerhoff;
展开▼
作者单位

Center for Robust Speech Systems (CRSS), University of Texas at Dallas, 800 West Campbell Rd, EC33, Richardson,TX 75080-3021, USA;

Center for Robust Speech Systems (CRSS), University of Texas at Dallas, 800 West Campbell Rd, EC33, Richardson,TX 75080-3021, USA;

Center for Robust Speech Systems (CRSS), University of Texas at Dallas, 800 West Campbell Rd, EC33, Richardson,TX 75080-3021, USA;

Center for Robust Speech Systems (CRSS), University of Texas at Dallas, 800 West Campbell Rd, EC33, Richardson,TX 75080-3021, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
stress assessment from speech; FLETC corpus; TEO operator;

机译：言语压力评估;FLETC语料库;TEO操作员;

相似文献

外文文献
中文文献
专利

1. Cortical encoding and neurophysiological tracking of intensity and pitch cues signaling English stress patterns in native and nonnative speakers [J] . Chung Wei-Lun, Bidelman Gavin M. Brain and language . 2016,第期

机译：皮质编码和神经生理追踪强度和音高提示，发信号通知本地和非扬声器中英语应力模式
2. Thermal infrared and visible sequences fusion tracking based on a hybrid tracking framework with adaptive weighting scheme [J] . Luo Chengwei, Sun Bin, Yang Ke, Infrared physics and technology . 2019,第期

机译：具有自适应加权方案的混合跟踪框架的热红外和可见序列融合跟踪
3. Tracking and Classification of Action Units in Facial Expressions using Hybrid Tracking System and Probabilistic Neural Networks [J] . HADI SEYEDARABI, WON-SOOK LEE, ALI AGHAGOLZADEH, WSEAS Transactions on Signal Processing . 2006,第5期

机译：使用混合跟踪系统和概率神经网络跟踪面部表情中动作单元的分类
4. Vehicles tracking and classification using Traffic zones in a hybrid scheme for intersection traffic management by smart cameras [C] . Babaei Peyman Proceedings of the 2010 International Conference on Signal and Image Processing . 2010

机译：在混合方案中使用交通区域对车辆进行跟踪和分类，以通过智能相机进行路口交通管理
5. Assessment and optimization of schemes for tracking and routing to mobile users in packet-based networks. [D] . Hristea, Cristina Ana-Maria. 2002

机译：评估和优化用于在基于分组的网络中跟踪和路由到移动用户的方案。
6. A Novel Loss Recovery and Tracking Scheme for Maneuvering Target in Hybrid WSNs [O] . Hanwang Qian, Pengcheng Fu, Baoqing Li, 2018

机译：混合无线传感器网络中机动目标的新型损耗恢复与跟踪方案
7. TEO-based speaker stress assessment using hybrid classification and tracking schemes [O] . John H. L. Hansen, Evan Ruzanski, Hynek Bořil, 2012

机译：使用混合分类和跟踪方案的基于TEO的说话人压力评估

TEO-based speaker stress assessment using hybrid classification and tracking schemes

摘要

著录项

相似文献

相关主题

期刊订阅