IEEE International Conference on Acoustics, Speech and Signal Processing

AUDIOVISUAL CLASSIFICATION OF VOCAL OUTBURSTS IN HUMAN CONVERSATION USING LONG-SHORT-TERM MEMORY NETWORKS



Abstract

We investigate the classification of non-linguistic vocalisations with a novel audiovisual approach, using Long Short-Term Memory (LSTM) recurrent neural networks as highly successful dynamic sequence classifiers. The Audiovisual Interest Corpus of natural human-to-human conversation, featured in this year's Paralinguistic Challenge, serves as the evaluation database. For the video-based analysis we compare shape-based and appearance-based features; these are fused with typical audio descriptors at the feature level (early fusion). The results show significant improvements of LSTM networks over a static approach based on Support Vector Machines. More importantly, we show a significant gain in performance when fusing audio and visual shape features.
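The early fusion mentioned in the abstract amounts to concatenating the synchronised per-frame audio and visual descriptors into one joint feature vector before classification. A minimal NumPy sketch under that reading; the feature dimensions and random data are illustrative, not taken from the paper:

```python
import numpy as np

# Hypothetical per-frame descriptors for a 5-frame sequence:
# audio: e.g. cepstral-style features (5 frames x 13 dims)
# visual: e.g. facial shape features (5 frames x 8 dims)
rng = np.random.default_rng(0)
audio_feats = rng.standard_normal((5, 13))
visual_feats = rng.standard_normal((5, 8))

def early_fusion(audio, visual):
    """Feature-level fusion: concatenate frame-aligned descriptor streams."""
    assert audio.shape[0] == visual.shape[0], "streams must be frame-aligned"
    return np.concatenate([audio, visual], axis=1)

fused = early_fusion(audio_feats, visual_feats)
print(fused.shape)  # (5, 21): one joint vector per frame for the classifier
```

The fused sequence is then what a dynamic classifier such as an LSTM network consumes frame by frame.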
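The contrast drawn with a static SVM rests on the LSTM's gated recurrence: a cell state is carried across frames, so the classifier can exploit the temporal dynamics of a vocal outburst. A minimal NumPy sketch of one LSTM step, with random toy weights and illustrative dimensions (this is not the authors' implementation):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h_prev, c_prev, Wx, Wh, b):
    """One LSTM time step: gate activations from input x and previous state."""
    H = h_prev.shape[0]
    z = Wx @ x + Wh @ h_prev + b       # all four gate pre-activations stacked
    i = sigmoid(z[0:H])                # input gate
    f = sigmoid(z[H:2 * H])            # forget gate
    o = sigmoid(z[2 * H:3 * H])        # output gate
    g = np.tanh(z[3 * H:4 * H])        # candidate cell update
    c = f * c_prev + i * g             # cell state carries long-term memory
    h = o * np.tanh(c)                 # hidden state / output at this frame
    return h, c

# Run a toy 21-dimensional fused feature sequence through the recurrence.
rng = np.random.default_rng(1)
D, H, T = 21, 16, 5
Wx = rng.standard_normal((4 * H, D)) * 0.1
Wh = rng.standard_normal((4 * H, H)) * 0.1
b = np.zeros(4 * H)
h, c = np.zeros(H), np.zeros(H)
for x in rng.standard_normal((T, D)):
    h, c = lstm_step(x, h, c, Wx, Wh, b)
print(h.shape)  # final hidden state, e.g. fed to a per-class output layer
```

A static SVM, by contrast, would see each 21-dimensional frame (or one pooled vector) in isolation, which is the gap the reported LSTM improvements exploit.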
