首页> 外文期刊>Selected Topics in Signal Processing, IEEE Journal of >Automatic Assessment of Depression From Speech via a Hierarchical Attention Transfer Network and Attention Autoencoders
【24h】

Automatic Assessment of Depression From Speech via a Hierarchical Attention Transfer Network and Attention Autoencoders

机译:通过分层关注传输网络和注意力自动评估抑郁症的抑郁症和注意力自动化

获取原文
获取原文并翻译 | 示例
       

摘要

Early interventions in mental health conditions such as Major Depressive Disorder (MDD) are critical to improved health outcomes, as they can help reduce the burden of the disease. As the efficient diagnosis of depression severity is therefore highly desirable, the use of behavioural cues such as speech characteristics in diagnosis is attracting increasing interest in the field of quantitative mental health research. However, despite the widespread use of machine learning methods in the depression analysis community, the lack of adequate labelled data has become a bottleneck preventing the broader application of techniques such as deep learning. Accordingly, we herein describe a deep learning approach that combines unsupervised learning, knowledge transfer and hierarchical attention for the task of speech-based depression severity measurement. Our novel approach, a Hierarchical Attention Transfer Network (HATN), uses hierarchical attention autoencoders to learn attention from a source task, followed by speech recognition, and then transfers this knowledge into a depression analysis system. Experiments based on the depression sub-challenge dataset of the Audio/Visual Emotion Challenge (AVEC) 2017 demonstrate the effectiveness of our proposed model. On the test set, our technique outperformed other speech-based systems presented in the literature, achieving a Root Mean Square Error (RMSE) of 5.51 and a Mean Absolute Error (MAE) of 4.20 on a Patient Health Questionnaire (PHQ)-8 scale [0, 24]. To the best of our knowledge, these scores represent the best-known speech results on the AVEC 2017 depression corpus to date.
机译:诸如重大抑郁症(MDD)等心理健康状况的早期干预对于改善健康结果至关重要,因为它们可以帮助减少疾病的负担。因此,随着抑郁严重程度的有效诊断,非常需要,诊断中的行为提示诸如语音特征的使用是在定量心理健康研究领域吸引越来越兴趣。然而,尽管在抑郁症分析社区中广泛使用了机器学习方法,但缺乏足够的标记数据已成为阻止更广泛应用的瓶颈,如深度学习等技术。因此,我们在本文中描述了一种深入的学习方法,将无监督的学习,知识转移和分层关注结合了基于语音的抑郁严重性测量的任务。我们的新方法是一种分层关注传输网络(HATN),使用分层关注自动泊车从源任务中学习注意力,然后学习语音识别,然后将这些知识传送到凹陷分析系统中。基于抑郁症的子挑战数据集(AVEC)2017的抑郁症次挑战数据集证明了我们所提出的模型的有效性。在测试集上,我们的技术表现出文献中的其他基于语音的系统,实现了5.51的根均线误差(RMSE)和4.20的平均绝对误差(MAE)在患者健康问卷(PHQ)-8规模上[0,24]。据我们所知,这些分数代表Avec 2017抑郁症迄今为止的最佳语音结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号