Combining Speech Features for Aggression Detection Using Deep Neural Networks

Abstract

Predicting the intensity level of aggression is a challenging problem in surveillance applications. Since no trivial fusion rules or classifiers exist for this task, we developed a fusion framework based on Deep Neural Networks. The framework has three levels: a low level containing the audio-visual features, an intermediate level composed of a set of concepts (meta-features), and a high level that gives the final multimodal assessment of aggression. In this paper, we study the prediction of the multimodal aggression level and of both the Context and Semantics meta-features. The prediction is based on the audio modality, using sensor and semantic information. Using meta-features for the semantic part of speech, we show the added value of this extra information to the fusion process in more complicated situations. We also propose different text-based features, such as linguistic and word-affect features; when fused with the acoustic features, they yield significant results for predicting the two meta-features and the multimodal aggression level using Deep Neural Networks, despite the spontaneous nature of the speech.
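
The three-level structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the feature dimensions, layer sizes, random untrained weights, the choice of concatenation as the fusion step, and the scalar aggression output are all assumptions made for illustration, and the helper names (`init_layer`, `make_mlp`, `mlp_forward`) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

def init_layer(n_in, n_out):
    """Return randomly initialised weights and zero biases (untrained)."""
    w = rng.normal(0.0, 0.1, size=(n_in, n_out))
    b = np.zeros(n_out)
    return w, b

def make_mlp(n_in, n_hidden, n_out):
    """Build a two-layer perceptron as a list of (weights, bias) pairs."""
    return [init_layer(n_in, n_hidden), init_layer(n_hidden, n_out)]

def mlp_forward(x, layers):
    """Forward pass: one ReLU hidden layer followed by a linear output."""
    (w1, b1), (w2, b2) = layers
    hidden = np.maximum(x @ w1 + b1, 0.0)
    return hidden @ w2 + b2

# Assumed sizes: 40 acoustic and 12 text-based features per utterance.
N_ACOUSTIC, N_TEXT, BATCH = 40, 12, 5
acoustic = rng.normal(size=(BATCH, N_ACOUSTIC))  # placeholder acoustic features
text = rng.normal(size=(BATCH, N_TEXT))          # placeholder linguistic / word-affect features

# Low level: fuse the two feature sets by concatenation (assumed fusion rule).
low_level = np.concatenate([acoustic, text], axis=1)

# Intermediate level: predict the two meta-features (Context, Semantics).
meta_net = make_mlp(low_level.shape[1], 16, 2)
meta_features = mlp_forward(low_level, meta_net)

# High level: predict one aggression score from low-level and meta-features.
high_input = np.concatenate([low_level, meta_features], axis=1)
level_net = make_mlp(high_input.shape[1], 16, 1)
aggression_level = mlp_forward(high_input, level_net)
```

Because the weights are random, the outputs carry no meaning; the sketch only shows how data would flow from fused low-level features through meta-features to a final aggression estimate.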

Bibliographic record

Source
International Conference on Advanced Technologies for Signal and Image Processing | 2020 | pp. 1-6 | 6 pages
Authors
Noussaiba Jaafar; Zied Lachiri
Original format
PDF
Keywords
aggression detection; multimodal fusion; acoustic features; text-based features; deep neural networks
