首页> 外文会议>National Conference on Communications >Emotion Recognition from Varying Length Patterns of Speech using CNN-based Segment-Level Pyramid Match Kernel based SVMs

【24h】

Emotion Recognition from Varying Length Patterns of Speech using CNN-based Segment-Level Pyramid Match Kernel based SVMs

机译：使用基于CNN的段级金字塔匹配内核的SVM从不同长度的语音模式进行情绪识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Convolutional Neural Networks (CNNs) and its variants have achieved impressive performance when used for different speech processing tasks like spoken language identification, speaker verification, speech emotion recognition, etc. Conventionally, CNNs for speech applications consider input features from fixed duration speech segments as input. In this work, we attempt to consider features from complete speech signal as input to CNN. We propose to use spatial pyramid pooling (SPP) layer in CNN architecture to remove the fixed length constraint and to consider features from varying length speech signals as input to CNN for an end to end training. Proposed architecture also results in varying size set of feature maps from convolution layer. Further, we propose novel CNN-based segment-level pyramid match kernel (CNN-SLPMK) as dynamic kernel between a pair of varying size set of feature maps for the classification framework using support vector machines (SVMs) based classifier. We demonstrate that our proposed approach achieves comparable results with state-of-the-art techniques for speech emotion recognition task.

机译：当卷积神经网络（CNN）及其变体用于不同的语音处理任务（例如口语识别，说话者验证，语音情感识别等）时，已经取得了令人印象深刻的性能。常规上，用于语音应用的CNN将固定持续时间语音段的输入特征视为输入。在这项工作中，我们尝试将来自完整语音信号的特征考虑为CNN的输入。我们建议在CNN架构中使用空间金字塔池（SPP）层来消除固定长度约束，并考虑将可变长度语音信号中的特征作为输入到CNN进行端到端训练。提议的体系结构还会导致卷积层的特征图大小不同。此外，我们提出了新颖的基于CNN的段级金字塔匹配内核（CNN-SLPMK），作为使用基于支持向量机（SVM）的分类器在分类框架的一对大小不一的特征图对之间的动态内核。我们证明了我们提出的方法可以通过语音情感识别任务的最新技术获得可比的结果。

著录项

来源
《National Conference on Communications 》|2019年|1-6|共6页
会议地点
作者
Shikha Gupta; Kishalaya De; Dileep Aroor Dinesh; Veena Thenkanidiyoor;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Convolution; Computer architecture; Speech recognition; Kernel; Training; Task analysis; Speech processing;

机译：卷积;计算机体系结构;语音识别;内核;训练;任务分析;语音处理;

相似文献

外文文献
中文文献
专利

1. Segment-level probabilistic sequence kernel and segment-level pyramid match kernel based extreme learning machine for classification of varying length patterns of speech [J] . Shikha Gupta, Ahmed Karanath, Kansul Mahrifa, International journal of speech technology . 2019 ,第1期

机译：基于段级概率序列核和段级金字塔匹配核的极限学习机，用于语音不同长度模式的分类
2. Feature Selection for GUMI Kernel-Based SVM in Speech Emotion Recognition [J] . Imen Trabelsi, Med Salim Bouhlel International journal of synthetic emotions . 2015 ,第2期

机译：语音情感识别中基于GUMI内核的SVM的特征选择
3. GMM-Based Intermediate Matching Kernel for Classification of Varying Length Patterns of Long Duration Speech Using Support Vector Machines [J] . Dileep A.D., Sekhar C.C. Neural Networks and Learning Systems, IEEE Transactions on . 2014 ,第8期

机译：基于GMM的中间匹配核，使用支持向量机对长时语音变长模式进行分类
4. Emotion Recognition from Varying Length Patterns of Speech using CNN-based Segment-Level Pyramid Match Kernel based SVMs [C] . Shikha Gupta, Kishalaya De, Dileep Aroor Dinesh, National Conference on Communications . 2019

机译：使用基于CNN的段级金字塔匹配基于内核的SVM的情感识别来自不同长度的语音模式
5. Domain Adaptation for Speech Based Emotion Recognition [D] . Abdelwahab, Mohammed. 2019

机译：基于语音情感识别的域适应
6. Emotion Recognition from Single-Trial EEG Based on Kernel Fishers Emotion Pattern and Imbalanced Quasiconformal Kernel Support Vector Machine [O] . Yi-Hung Liu, Chien-Te Wu, Wei-Teng Cheng, 2014

机译：基于核Fisher情绪模式和不对称拟共形核支持向量机的单次EEG情绪识别
7. Deep-Net: A Lightweight CNN-Based Speech Emotion Recognition System Using Deep Frequency Features [O] . Tursunov Anvarjon, Soonil Kwon 2020

机译：深网络：使用深频特征的基于轻量级CNN的语音情感识别系统
8. New Kernel for SVM MLLR Based Speaker Recognition. [R] . Karam, Z. N., Campbell, W. M. 2016

机译：基于sVm mLLR的说话人识别新核。

Emotion Recognition from Varying Length Patterns of Speech using CNN-based Segment-Level Pyramid Match Kernel based SVMs

摘要

著录项

相似文献

相关主题

期刊订阅