首页> 外文会议>Chinese Automation Congress >Improvement on Speech Depression Recognition Based on Deep Networks

【24h】

Improvement on Speech Depression Recognition Based on Deep Networks

机译：基于深度网络的语音抑制识别的改进

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

To reduce the burden of clinicians diagnosing a large number of depressive symptoms, the field of artificial intelligence researchers are increasingly interested in designing automatic recognition systems for depression. Depressed patient have different speech signal from normal people. Here, we present a deep model, Depression AudioNet, which encodes depression-related features in the vocal tract and provides a more comprehensive audio representation. Firstly, the Mel-frequency cepstral coefficients (MFCCs) were extracted from raw audio data. Secondly, the robust emotions features were acquired by Multiscale Audio Delta Normalization (MADN), which is a data processing algorithm we proposed. Finally, the MFCCs and the emotions features of two adjacent segments of local audio were fed into the Depression AudioNet in turn to train the network. This method solves the problem of less training data and low precision by increasing the length information of the sample without reducing the number of samples. Experiments are conducted on AVEC2014 dataset, and the results shows that the proposed method is more effective and accurate than the existing speech depression recognition algorithms.

机译：为了减轻诊断大量抑郁症状的临床医生的负担，人工智能研究人员对设计用于抑郁的自动识别系统越来越感兴趣。抑郁症患者的语音信号与正常人不同。在这里，我们介绍了一个深层模型Depression AudioNet，该模型对声道中与抑郁相关的特征进行编码，并提供更全面的音频表示。首先，从原始音频数据中提取梅尔频率倒谱系数（MFCC）。其次，通过多尺度音频三角洲归一化（MADN）获得了鲁棒的情绪特征，这是我们提出的一种数据处理算法。最后，MFCC和本地音频的两个相邻段的情感特征又被馈送到Depression AudioNet中以训练网络。该方法通过增加样本的长度信息而不减少样本的数量，解决了训练数据少，精度低的问题。在AVEC2014数据集上进行了实验，结果表明该方法比现有的语音抑郁识别算法更有效，更准确。

著录项

来源
《Chinese Automation Congress》|2018年|2705-2709|共5页
会议地点
作者
Jinming Li; Xiaoyan Fu; Zhuhong Shao; Yuanyuan Shang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Feature extraction; Speech recognition; Deep learning; Convolution; Training data; Data models;

机译：特征提取;语音识别;深度学习;卷积;训练数据;数据模型;

相似文献

外文文献
中文文献
专利

1. A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech [J] . Yan-Hui Tu, Jun Du, Chin-Hui Lee Journal of signal processing systems for signal, image, and video technology . 2018,第7期

机译：基于说话者的基于深度神经网络的单通道联合语音分离和声学建模方法，用于多语音对话的鲁棒识别
2. Deep and shallow features fusion based on deep convolutional neural network for speech emotion recognition [J] . Linhui Sun, Jia Chen, Keli Xie, International journal of speech technology . 2018,第4期

机译：基于深度卷积神经网络的深浅特征融合在语音情感识别中的应用
3. Automated speech-based screening of depression using deep convolutional neural networks [J] . Karol Chlasta, Krzysztof Wo?k, Izabela Krejtz Procedia Computer Science . 2019,第41期

机译：使用深度卷积神经网络自动进行基于语音的抑郁症筛查
4. Improvement on Speech Depression Recognition Based on Deep Networks [C] . Jinming Li, Xiaoyan Fu, Zhuhong Shao, Chinese Automation Congress . 2018

机译：基于深网络的语音抑郁识别改进
5. Dysarthric Speech Recognition and Offline Handwriting Recognition using Deep Neural Networks. [D] . Pillai, Suhas Balkrishna. 2017

机译：使用深度神经网络的表情异常语音识别和离线手写识别。
6. Multi-resolution speech analysis for automatic speech recognition using deep neural networks: Experiments on TIMIT [O] . Doroteo T. Toledano, María Pilar Fernández-Gallego, Alicia Lozano-Diez 2012

机译：基于深度神经网络的自动语音识别的多分辨率语音分析：TIMIT实验
7. Towards Robust Deep Neural Networks for Affect and Depression Recognition from Speech [O] . Alice Othmani, Daoud Kadoch, Kamil Bentounes, 2021

机译：朝着强大的深度神经网络免受言语影响和抑郁症

Improvement on Speech Depression Recognition Based on Deep Networks

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅