首页> 外文会议>International Conference on Advanced Technologies for Signal and Image Processing >Combining Speech Features for Aggression Detection Using Deep Neural Networks
【24h】

Combining Speech Features for Aggression Detection Using Deep Neural Networks

机译:结合语音特征使用深度神经网络进行攻击检测

获取原文

摘要

Predicting the intensity level of aggression is a challenging problem in surveillance applications. Since there are no trivial fusion rules or classifiers, we developed a fusion framework to accomplish this complex task using Deep Neural Networks. This framework used a low level that contains the audio-visual features, an intermediate level composed of a set of concepts (meta-features) and a high level which is a final evaluation of the multimodal aggression detection. In this paper, we study the prediction of multimodal level for aggression detection and both Context and Semantics meta-features. This prediction is based on the audio modality using sensor and semantic information. Using meta-features for the semantic part of speech, we show the added value of such extra-information on the fusion process when the situations are more complicated. We also propose to use different text-based features such as linguistic and word affect features that will provide significant results for predicting the two meta-features and the multimodal aggression level using Deep Neural Networks when they are fused with the acoustic features although the nature of spontaneous speech.
机译:在监视应用中,预测攻击的强度水平是一个具有挑战性的问题。由于没有琐碎的融合规则或分类器,我们开发了融合框架以使用深度神经网络来完成此复杂任务。该框架使用了一个包含视听功能的低级别,一个由一组概念(元特征)组成的中间级别以及一个对多模式攻击检测的最终评估的高级别。在本文中,我们研究了用于攻击检测以及上下文和语义元特征的多模式水平的预测。该预测基于使用传感器和语义信息的音频模态。通过将元功能用于语音的语义部分,我们展示了当情况更加复杂时,此类额外信息在融合过程中的附加值。我们还建议使用不同的基于文本的功能,例如语言和单词情感功能,这些功能将在将深层神经网络与声学功能融合时,通过深度神经网络预测两个元功能和多模态的攻击水平提供重要的结果,尽管自发的讲话。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号