Automatic Assessment of Depression From Speech via a Hierarchical Attention Transfer Network and Attention Autoencoders

Zhao Ziping; Bao Zhongtian; Zhang Zixing; Deng Jun; Cummins Nicholas; Wang Haishuai; Tao Jianhua; Schuller Bjoern

Abstract

Early interventions in mental health conditions such as Major Depressive Disorder (MDD) are critical to improved health outcomes, as they can help reduce the burden of the disease. As the efficient diagnosis of depression severity is therefore highly desirable, the use of behavioural cues such as speech characteristics in diagnosis is attracting increasing interest in the field of quantitative mental health research. However, despite the widespread use of machine learning methods in the depression analysis community, the lack of adequate labelled data has become a bottleneck preventing the broader application of techniques such as deep learning. Accordingly, we herein describe a deep learning approach that combines unsupervised learning, knowledge transfer and hierarchical attention for the task of speech-based depression severity measurement. Our novel approach, a Hierarchical Attention Transfer Network (HATN), uses hierarchical attention autoencoders to learn attention from a source task, followed by speech recognition, and then transfers this knowledge into a depression analysis system. Experiments based on the depression sub-challenge dataset of the Audio/Visual Emotion Challenge (AVEC) 2017 demonstrate the effectiveness of our proposed model. On the test set, our technique outperformed other speech-based systems presented in the literature, achieving a Root Mean Square Error (RMSE) of 5.51 and a Mean Absolute Error (MAE) of 4.20 on a Patient Health Questionnaire (PHQ)-8 scale [0, 24]. To the best of our knowledge, these scores represent the best-known speech results on the AVEC 2017 depression corpus to date.
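
The results above are reported as RMSE and MAE on the PHQ-8 scale. As a minimal sketch of how these two metrics are computed, the following Python may help; the function names and the sample scores are hypothetical illustrations and are not taken from the paper or the AVEC 2017 data.

```python
import math


def rmse(y_true, y_pred):
    """Root Mean Square Error between two equal-length score sequences."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


def mae(y_true, y_pred):
    """Mean Absolute Error between two equal-length score sequences."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical PHQ-8 scores in the range [0, 24]; illustration only.
true_scores = [2, 10, 17, 5]
predicted_scores = [4.0, 7.0, 24.0, 5.0]

print(f"RMSE = {rmse(true_scores, predicted_scores):.2f}")
print(f"MAE  = {mae(true_scores, predicted_scores):.2f}")
```

RMSE squares each error before averaging, so it penalises large individual errors more heavily than MAE and is never smaller than MAE on the same predictions.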

Bibliographic details

Source
Selected Topics in Signal Processing, IEEE Journal of | 2020, Issue 2 | pp. 423-434 | 12 pages
Authors
Zhao Ziping; Bao Zhongtian; Zhang Zixing; Deng Jun; Cummins Nicholas; Wang Haishuai; Tao Jianhua; Schuller Bjoern
Author affiliations

Tianjin Normal Univ Coll Comp & Informat Engn Tianjin 300387 Peoples R China;

Tianjin Normal Univ Coll Comp & Informat Engn Tianjin 300387 Peoples R China;

Imperial Coll London Dept Comp London SW7 2AZ England;

Agile Robots AG D-82205 Gilching Germany;

Univ Augsburg Embedded Intelligence Hlth Care & Wellbeing D-86159 Augsburg Germany;

Fairfield Univ Dept Comp Sci Fairfield CT 06824 USA;

Chinese Acad Sci Univ Chinese Acad Sci Natl Lab Pattern Recognit Inst Automat CEBSIT Sch Artificial Intelligence Beijing 100190 Peoples R China;

Univ Augsburg Embedded Intelligence Hlth Care & Wellbeing D-86159 Augsburg Germany;

Record information
Original format: PDF
Language: English
Keywords
Task analysis; Deep learning; Speech recognition; Training; Feature extraction; Depression; attention transfer; hierarchical attention; monotonic attention

Date added to database: 2022-08-18 22:04:16

Similar literature

1. A time-frequency channel attention and vectorization network for automatic depression level prediction [J]. Niu Mingyue, Liu Bin, Tao Jianhua. Neurocomputing, 2021, Issue Aug. 25.
2. Tamper-Proofing Video With Hierarchical Attention Autoencoder Hashing on Blockchain [J]. Tu Bui, Daniel Cooper, John Collomosse. Multimedia, IEEE Transactions on, 2020, Issue 11.
3. Spatial Attention Gated Variational Autoencoder Enhanced Cycle-Consistent Generative Adversarial Networks for MRI to CT Translation [J]. Kearney V., Zeimer B. P., Perry A. International Journal of Radiation Oncology, Biology, Physics, 2019, Issue 1 Suppl.
4. Hierarchical Attention Transfer Networks for Depression Assessment from Speech [C]. Ziping Zhao, Zhongtian Bao, Zixing Zhang. IEEE International Conference on Acoustics, Speech and Signal Processing, 2020.
5. Hierarchical Attention Networks for Fake News Detection [D]. Jeong, Haeseung. 2021.
6. Patient Representation Transfer Learning from Clinical Notes based on Hierarchical Attention Network [O]. Yuqi Si, Kirk Roberts. 2020.
7. Identify Vulnerability Fix Commits Automatically Using Hierarchical Attention Network [O]. Mingxin Sun, Wenjie Wang, Hantao Feng. 2020.
8. Hierarchical Neural Network (HNN) for Closed Loop Decision Making: Designing the Architecture of a Hierarchical Neural Network to Model Attention, Learning and Goal Oriented Behavior [R]. Guez, A. 1990.
