ACM International Conference on Multimodal Interaction

Combining Video, Audio and Lexical Indicators of Affect in Spontaneous Conversation via Particle Filtering



Abstract

We present experiments on fusing facial video, audio and lexical indicators for affect estimation during dyadic conversations. We use temporal statistics of texture descriptors extracted from facial video, a combination of various acoustic features, and lexical features to create regression-based affect estimators for each modality. The single-modality regressors are then combined using particle filtering, by treating these independent regression outputs as measurements of the affect states in a Bayesian filtering framework, where previous observations provide predictions about the current state by means of learned affect dynamics. Tested on the Audio-Visual Emotion Recognition Challenge dataset, our single-modality estimators achieve substantially higher scores than the official baseline method for every dimension of affect. Our filtering-based multi-modality fusion achieves correlation performances of 0.344 (baseline: 0.136) and 0.280 (baseline: 0.096) for the fully continuous and word-level sub-challenges, respectively.
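To make the fusion scheme concrete, below is a minimal Python sketch (not the authors' code) of the idea the abstract describes: per-modality regression outputs are treated as noisy measurements of a latent affect state, which a particle filter tracks using learned affect dynamics. The AR(1) dynamics model, the Gaussian observation likelihoods, and all parameter values are illustrative assumptions; the paper's actual dynamics and noise models are not given in the abstract.

```python
import numpy as np

rng = np.random.default_rng(0)

N_PARTICLES = 500
A, Q = 0.95, 0.05          # assumed AR(1) affect dynamics: x_t = A*x_{t-1} + noise(Q)
R = {"video": 0.30,        # assumed measurement-noise variance per modality
     "audio": 0.40,
     "lexical": 0.60}

def particle_filter(measurements):
    """Fuse per-frame modality measurements into one affect estimate per frame.

    measurements: list of dicts, one per frame, mapping a modality name to that
    modality's regression output (a scalar affect score) for the frame.
    Returns the posterior-mean affect trajectory.
    """
    particles = rng.normal(0.0, 1.0, N_PARTICLES)   # initial prior over affect
    estimates = []
    for z in measurements:
        # Prediction: propagate particles through the learned affect dynamics.
        particles = A * particles + rng.normal(0.0, np.sqrt(Q), N_PARTICLES)

        # Update: weight each particle by the joint likelihood of all modality
        # measurements under the assumed Gaussian observation model.
        log_w = np.zeros(N_PARTICLES)
        for mod, value in z.items():
            log_w += -0.5 * (value - particles) ** 2 / R[mod]
        w = np.exp(log_w - log_w.max())
        w /= w.sum()

        # Posterior-mean estimate for this frame, then resampling.
        estimates.append(float(np.dot(w, particles)))
        idx = rng.choice(N_PARTICLES, size=N_PARTICLES, p=w)
        particles = particles[idx]
    return estimates

# Toy usage: three regressors give noisy readings of a slowly drifting affect state.
frames = [{"video": 0.20, "audio": 0.30, "lexical": 0.10},
          {"video": 0.25, "audio": 0.20, "lexical": 0.30},
          {"video": 0.30, "audio": 0.35, "lexical": 0.20}]
print(particle_filter(frames))
```

Because each modality contributes an independent Gaussian likelihood term, modalities with lower assumed noise variance (here, video) pull the fused estimate more strongly, while the dynamics model smooths the trajectory across frames.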
