IEEE Transactions on Affective Computing

Integrating Deep and Shallow Models for Multi-Modal Depression Analysis—Hybrid Architectures

Abstract

At present, although great progress has been made in automatic depression assessment, most recent works consider only the audio and video paralinguistic information, rather than the linguistic information in the spoken content. In this work, we argue that, besides developing good audio and video features, building reliable depression detection systems also requires text-based content features to analyse depression-related textual indicators. Furthermore, improving the performance of automatic depression assessment systems requires powerful models capable of modelling the characteristics of depression embedded in the audio, visual and text descriptors. This paper proposes new text and video features and hybridizes deep and shallow models for depression estimation and classification from audio, video and text descriptors. The proposed hybrid framework consists of three main parts: 1) a Deep Convolutional Neural Network (DCNN) and Deep Neural Network (DNN) based audio-visual multi-modal depression recognition model for estimating the Patient Health Questionnaire depression scale (PHQ-8); 2) a Paragraph Vector (PV) and Support Vector Machine (SVM) based model for inferring the physical and mental conditions of the individual from the transcripts of the interview; 3) a Random Forest (RF) model for depression classification from the estimated PHQ-8 score and the inferred conditions of the individual. In the PV-SVM model, PV embedding is used to obtain fixed-length feature vectors from transcripts of the answers to questions associated with psychoanalytic aspects of depression, which are subsequently fed into SVM classifiers to detect the presence or absence of the considered psychoanalytic symptoms. To the best of our knowledge, this is the first attempt to apply PV to depression analysis. In addition, we propose a new visual descriptor, the Histogram of Displacement Range (HDR), to characterize the displacement and velocity of the facial landmarks in a video segment. Experiments carried out on the Audio Visual Emotion Challenge (AVEC2016) depression dataset demonstrate that: 1) the proposed hybrid framework effectively improves the accuracy of both depression estimation and depression classification, with an average F1 measure of up to 0.746, which is higher than the best result (0.724) of the depression sub-challenge of AVEC2016; and 2) HDR achieves better depression recognition performance than Bag-of-Words (BoW) and Motion History Histogram (MHH) features.
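The abstract mentions two concrete technical components that a short sketch can make more tangible. The code below is illustrative only and is not taken from the paper. First, a minimal NumPy sketch of an HDR-style descriptor, assuming landmark trajectories are supplied as a (frames x landmarks x 2) array; the bin count, value range and normalisation are placeholders rather than the paper's exact definition.

```python
import numpy as np

def hdr_descriptor(landmarks, bins=8, max_range=20.0):
    """landmarks: (T, N, 2) array -- T frames of N 2-D facial landmark positions (pixels)."""
    # Frame-to-frame displacement magnitude of every landmark.
    disp = np.linalg.norm(np.diff(landmarks, axis=0), axis=-1)   # shape (T-1, N)
    # Range of movement per landmark over the segment: a crude proxy for the
    # displacement/velocity variation described in the abstract.
    rng = disp.max(axis=0) - disp.min(axis=0)                    # shape (N,)
    hist, _ = np.histogram(rng, bins=bins, range=(0.0, max_range))
    return hist / max(hist.sum(), 1)                             # histogram normalised over landmarks

segment = np.random.rand(300, 68, 2) * 5   # toy segment: 300 frames, 68 landmarks
print(hdr_descriptor(segment))
```

Second, a hedged sketch of the text branch and the final fusion step: Paragraph Vector (Doc2Vec in gensim) embeddings of interview transcripts feed an SVM symptom classifier, and a Random Forest combines its output with an assumed PHQ-8 estimate from the audio-visual model. All data, labels and hyperparameters here are toy placeholders; the paper trains one SVM per considered psychoanalytic symptom.

```python
import numpy as np
from gensim.models.doc2vec import Doc2Vec, TaggedDocument
from sklearn.svm import SVC
from sklearn.ensemble import RandomForestClassifier

transcripts = [["i", "feel", "tired", "all", "day"], ["sleep", "has", "been", "fine"]]  # toy token lists
symptom_labels = np.array([1, 0])        # hypothetical presence/absence of one symptom
phq8_estimates = np.array([14.0, 3.0])   # stand-in for the DCNN-DNN audio-visual model's output
depressed = np.array([1, 0])             # toy binary depression labels

# Paragraph Vector embeddings of the transcripts.
pv = Doc2Vec([TaggedDocument(t, [i]) for i, t in enumerate(transcripts)],
             vector_size=50, min_count=1, epochs=40)
X_text = np.vstack([pv.infer_vector(t) for t in transcripts])

# SVM detects the presence/absence of the symptom from the PV embedding.
symptom_pred = SVC().fit(X_text, symptom_labels).predict(X_text)

# Random Forest fuses the estimated PHQ-8 score with the inferred conditions.
X_fusion = np.column_stack([phq8_estimates, symptom_pred])
rf = RandomForestClassifier(n_estimators=100).fit(X_fusion, depressed)
print(rf.predict(X_fusion))
```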
