首页> 外文会议>IEEE International Conference on Semantic Computing >Classifying Depression in Imbalanced Datasets Using an Autoencoder- Based Anomaly Detection Approach
【24h】

Classifying Depression in Imbalanced Datasets Using an Autoencoder- Based Anomaly Detection Approach

机译:使用基于自动编码器的异常检测方法对不平衡数据集中的压抑进行分类

获取原文

摘要

Depression is the most prevalent mental health ailment in the United States, affecting 15% of the population. Untreated depression can significantly decrease quality of life, physical health, and has significant economic and societal costs. The traditional method of diagnosing depression requires the patient to respond to medical questionnaires and is subjective. Passive methods to autonomously detect depression are desirable. Prior work on smartphone sensing of depression has utilized machine learning classification of smartphone sensor data. However, as with many ailments, the percentage of afflicted users in most populations is small compared with those unaffected, leading to severe class imbalance. In this work, we explore anomaly detection methods as a method for mitigating class imbalance for depression detection. Our approach adopts a multi-stage machine learning pipeline. First, using autoen-coders, we project the mobility features of the majority class (undepressed users). Thereafter, the trained autoencoder then classifies a test set of users as either depressed (anomalous) or not depressed (inliers) using a One Class SVM algorithm. Our method, when applied to the real-world StudentLife data set shows that even with an extremely imbalanced dataset, our method is able to detect individuals with depression symptoms with an AUC-ROC of 0.92, significantly outperforming traditional machine learning classification approaches.
机译:抑郁症是美国最普遍的心理健康疾病,影响了15%的人口。未经治疗的抑郁症会显着降低生活质量,身体健康,并产生重大的经济和社会成本。诊断抑郁症的传统方法要求患者对医疗问卷做出回应,并且是主观的。主动地检测抑郁的被动方法是合乎需要的。关于智能手机感测抑郁的先前工作已经利用智能手机传感器数据的机器学习分类。但是,与许多疾病一样,与未受影响的人相比,大多数人口中受苦用户的比例很小,从而导致严重的班级失衡。在这项工作中,我们探索异常检测方法作为缓解抑郁症检测中的类不平衡的一种方法。我们的方法采用了多阶段的机器学习管道。首先,我们使用自动编码器来投影大多数类别(未压迫的用户)的移动性功能。此后,训练有素的自动编码器随后使用One Class SVM算法将用户的测试集分类为沮丧(异常)或未沮丧(内部)。我们的方法在应用于现实世界的StudentLife数据集时显示,即使数据集非常不平衡,我们的方法也能够检测AUC-ROC为0.92的抑郁症患者,大大优于传统的机器学习分类方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号