首页> 外文会议>IEEE International Conference on Smart Instrumentation, Measurement and Application >On the Optimum Speech Segment Length for Depression Detection
【24h】

On the Optimum Speech Segment Length for Depression Detection

机译:关于抑郁检测的最佳语音段长度

获取原文

摘要

Depression is a worldwide problem, which according to the World Health Organization, is the largest contributor to global disability. According to a study, around 18336 Malaysians are suffering from depression. Therefore, an automated system that can detect depression from human speech is needed. The main objective of this paper is to investigate the optimum speech segment length that provide fast and accurate depression detection. An artificial neural network was used as classifier to detect depression using a speech feature, i.e. the averaged Mel-frequency cepstral coefficients (MFCC). The Distress Analysis Interview Corpus Wizard of Oz (DAIC-WOZ) was used to train and test the system, measured in terms of accuracy and processing time, while varying the number of neurons used. The obtained results are further optimized by investigating the ideal segment length for depression detection. Results showed that our proposed system can recognize voiced depression in 3 levels of depression with an accuracy rate up to 98.3% when given previous samples of the same speaker for training. Furthermore, the optimum speech segment length was found to be 7 seconds, when it is tested for the length between 1 to 20 seconds.
机译:抑郁症是一个全球问题,根据世界卫生组织的统计,是全球残疾的最大贡献者。根据一项研究,大约18336年马来西亚人患有抑郁症。因此,需要一种可以检测人类语音抑郁的自动化系统。本文的主要目的是研究提供快速准确的抑郁检测的最佳语音段长度。使用人工神经网络用作分类器以使用语音特征检测抑郁症,即平均熔体频率谱系数(MFCC)。遇险分析采访oz(daic-woz)的语料库向导用于培训和测试系统,以准确性和处理时间测量,同时改变所用神经元的数量。通过研究抑郁检测的理想区段长度进一步优化所得结果。结果表明,我们的建议系统可以在3级抑郁症中识别浊音,精度率高达98.3%,在同一扬声器的培训中的先前样本时。此外,发现最佳语音段长度为7秒,当测试到1至20秒之间的长度时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号