Choice of a classifier, based on properties of a dataset: case study-speech emotion recognition

Shashidhar G. Koolagudi; Y. V. Srinivasa Murthy; Siva P. Bhaskar

Abstract

This paper presents a procedure for selecting a classifier based on the properties of a dataset, since it is impractical to test the data on every available classifier. Speech emotion recognition is taken as the case study. Different combinations of spectral and prosodic features relevant to emotions are explored, and the best subset of the chosen features is recommended for each classifier based on the properties of the chosen dataset. Various statistical tests are used to estimate the properties of the dataset, and the nature of the dataset guides the selection of a suitable classifier. To make the choice more precise, three other clustering and classification techniques (K-means clustering, vector quantization, and artificial neural networks) are also used in the experiments, and their results are compared with those of the selected classifier. The prosodic features considered are pitch, intensity, jitter, and shimmer; the spectral features are mel-frequency cepstral coefficients (MFCCs) and formants. Statistical parameters of prosody, namely minimum, maximum, mean (μ), and standard deviation (σ), are extracted from the speech and combined with the basic spectral features (MFCCs) to improve performance. Five basic emotions are considered: anger, fear, happiness, neutral, and sadness. To analyse the performance of different datasets on different classifiers, content- and speaker-independent emotional data collected from Telugu movies is used. The emotional data is labelled using the mean opinion score of fifty users. To generalize the conclusions, a benchmark dataset, the IIT-Kharagpur emotional database, is also used.
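The feature construction and the dataset-property check described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not name the statistical tests used or the rule that maps test outcomes to a classifier, so the Jarque-Bera normality test here is an assumed stand-in, and the function names (prosodic_stats, combine_features, jarque_bera, looks_gaussian), the 0.05 threshold, and the sample values are all hypothetical.

```python
import math
import statistics


def prosodic_stats(contour):
    """Return [min, max, mean, std] of one prosodic contour (e.g. pitch in Hz)."""
    return [
        min(contour),
        max(contour),
        statistics.fmean(contour),
        statistics.pstdev(contour),  # population standard deviation
    ]


def combine_features(pitch, intensity, mfcc_vector):
    """Concatenate prosodic statistics with an utterance-level MFCC vector.

    Assumption: the MFCCs have already been computed elsewhere and
    summarised into one vector per utterance.
    """
    return prosodic_stats(pitch) + prosodic_stats(intensity) + list(mfcc_vector)


def jarque_bera(values):
    """Jarque-Bera normality test: return (statistic, asymptotic p-value)."""
    n = len(values)
    mean = statistics.fmean(values)
    m2 = sum((v - mean) ** 2 for v in values) / n
    if m2 == 0:
        raise ValueError("values must not all be identical")
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    skewness = m3 / m2 ** 1.5
    kurtosis = m4 / m2 ** 2
    jb = n / 6.0 * (skewness ** 2 + (kurtosis - 3.0) ** 2 / 4.0)
    # Under normality JB is asymptotically chi-square with 2 degrees of
    # freedom, whose survival function is exp(-x / 2).
    return jb, math.exp(-jb / 2.0)


def looks_gaussian(values, alpha=0.05):
    """True if the test does not reject normality at level alpha (assumed 0.05)."""
    _, p_value = jarque_bera(values)
    return p_value > alpha


if __name__ == "__main__":
    # Illustrative numbers only, not data from the paper.
    pitch = [110.0, 118.0, 125.0, 131.0, 127.0, 119.0]
    intensity = [61.0, 64.5, 66.0, 65.2, 63.8, 62.1]
    mfcc_vector = [0.0] * 13  # placeholder for 13 MFCC values
    features = combine_features(pitch, intensity, mfcc_vector)
    print(len(features), looks_gaussian(pitch))
```

In a setting like the one the abstract describes, such a check would be run per feature and per emotion class; how the outcome then steers the choice of classifier is specified in the paper itself, not in this sketch. The asymptotic p-value is also unreliable for very short contours.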


Bibliographic details

Source
International Journal of Speech Technology | 2018, Issue 1 | pp. 167-183 | 17 pages
Authors
Shashidhar G. Koolagudi; Y. V. Srinivasa Murthy; Siva P. Bhaskar
Affiliations

Department of CSE, National Institute of Technology Karnataka, Mangalore 575 025, India;

Department of CSE, National Institute of Technology Karnataka, Mangalore 575 025, India;

Department of CSE, National Institute of Technology Karnataka, Mangalore 575 025, India;

Original format: PDF
Language: English
Keywords
Properties of dataset; Normality tests; Selection of classifier; Spectral and prosodic features; Jitter; Shimmer
