An Investigation of Depressed Speech Detection: Features and Normalization



Abstract

In recent years, the problem of automatic detection of mental illness from the speech signal has gained some initial interest; however, open questions remain, including how speech segments should be selected, which features provide good discrimination, and what benefits feature normalization might bring given the speaker-specific nature of mental disorders. In this paper, these questions are addressed empirically using classifier configurations employed in emotion recognition from speech, evaluated on a 47-speaker depressed/neutral read-sentence speech database. Results demonstrate that (1) detailed spectral features are well suited to the task, (2) speaker normalization provides benefits mainly for less detailed features, and (3) dynamic information appears to provide little benefit. Classification accuracy using a combination of MFCC and formant-based features approached 80% for this database.
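As an illustration of the kind of speaker normalization the abstract refers to, the sketch below z-scores each feature dimension using statistics computed from the same speaker. The abstract does not say which normalization scheme the authors used, so per-speaker mean and variance normalization is an assumption here, and the function name `speaker_normalize` and the toy data are hypothetical.

```python
import numpy as np


def speaker_normalize(features, speaker_ids):
    """Z-score each feature dimension using statistics from the same speaker.

    Assumption: per-speaker mean/variance normalization; the abstract does
    not state which scheme the paper actually uses.
    """
    features = np.asarray(features, dtype=float)
    speaker_ids = np.asarray(speaker_ids)
    normalized = np.empty_like(features)
    for speaker in np.unique(speaker_ids):
        mask = speaker_ids == speaker
        mean = features[mask].mean(axis=0)
        std = features[mask].std(axis=0)
        std[std == 0] = 1.0  # leave constant dimensions at zero instead of dividing by zero
        normalized[mask] = (features[mask] - mean) / std
    return normalized


# Hypothetical toy data: four frames, two feature dimensions, two speakers.
feats = np.array([[1.0, 10.0], [3.0, 10.0], [100.0, 5.0], [104.0, 7.0]])
ids = np.array(["a", "a", "b", "b"])
norm = speaker_normalize(feats, ids)
```

After this step each speaker's features have zero mean per dimension, so a classifier sees deviations from that speaker's own baseline rather than absolute values.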


Bibliographic details

Source
Annual Conference of the International Speech Communication Association; INTERSPEECH 2011 | 2011 | pp. 3008-3011 | 4 pages
Authors
Nicholas Cummins; Julien Epps; Michael Breakspear; Roland Goecke;
Original format: PDF
Chinese Library Classification: Communications
Keywords
mental state recognition; depressed speech; feature comparison; MFCC; Gaussian mixture models;


Similar literature

1. Iterative Feature Normalization Scheme for Automatic Emotion Detection from Speech [J]. IEEE Transactions on Affective Computing. 2013, No. 4
2. Temporal Structure Normalization of Speech Feature for Robust Speech Recognition [J]. Xiao X., Chng E. S., Li H. IEEE Signal Processing Letters. 2007, No. 7
3. Emotional speech feature normalization and recognition based on speaker-sensitive feature clustering [J]. Chengwei Huang, Baolin Song, Li Zhao. International Journal of Speech Technology. 2016, No. 4
4. Depression Speaks: Automatic Discrimination between Depressed and Non-Depressed Speakers Based on Nonverbal Speech Features [C]. F. Scibelli, G. Roffo, M. Tayarani, IEEE International Conference on Acoustics, Speech and Signal Processing. 2018
5. Duration normalization for robust recognition of spontaneous speech via missing feature methods [D]. Nedel, Jon P. 2004
6. Using speech recognition technology to investigate the association between timing-related speech features and depression severity [O]. Mao Yamamoto, Akihiro Takamiya, Kyosuke Sawada, 2020
7. Iterative Feature Normalization for Emotional Speech Detection [O]. Carlos Busso, Angeliki Metallinou, Shrikanth S. Narayanan. 2011
8. Normalized Amplitude Modulation Features for Large Vocabulary Noise-Robust Speech Recognition [R]. Mitra, V., Franco, H., Graciarena, M., 2012
