ACM International Conference on Multimodal Interaction

Combining Video, Audio and Lexical Indicators of Affect in Spontaneous Conversation via Particle Filtering

Abstract

We present experiments on fusing facial video, audio, and lexical indicators for affect estimation during dyadic conversations. We use temporal statistics of texture descriptors extracted from facial video, a combination of various acoustic features, and lexical features to create regression-based affect estimators for each modality. The single-modality regressors are then combined using particle filtering: their independent regression outputs are treated as measurements of the affect states in a Bayesian filtering framework, where previous observations provide a prediction about the current state by means of learned affect dynamics. Tested on the Audio-visual Emotion Recognition Challenge dataset, our single-modality estimators achieve substantially higher scores than the official baseline method for every dimension of affect. Our filtering-based multi-modality fusion achieves correlation performance of 0.344 (baseline: 0.136) and 0.280 (baseline: 0.096) for the fully continuous and word-level sub-challenges, respectively.
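The fusion scheme described in the abstract can be sketched as a standard bootstrap particle filter. The sketch below is illustrative only, not the authors' implementation: it assumes hypothetical AR(1) affect dynamics (`a`, `process_std` standing in for the learned dynamics) and independent Gaussian measurement models for each modality's regression output (`meas_std` per modality); none of these values come from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def particle_filter_fusion(measurements, n_particles=500,
                           a=0.9, process_std=0.1, meas_std=(0.3, 0.3, 0.3)):
    """Fuse per-modality affect estimates (T x M array) via particle filtering.

    Hypothetical model: AR(1) dynamics x_t = a * x_{t-1} + process noise,
    with each modality's regressor output treated as an independent
    Gaussian measurement of the latent affect state.
    """
    T, M = measurements.shape
    particles = rng.normal(0.0, 1.0, n_particles)
    estimates = np.empty(T)
    for t in range(T):
        # Predict: propagate particles through the (assumed) affect dynamics
        particles = a * particles + rng.normal(0.0, process_std, n_particles)
        # Update: weight particles by the likelihood of every modality's output
        log_w = np.zeros(n_particles)
        for m in range(M):
            log_w += -0.5 * ((measurements[t, m] - particles) / meas_std[m]) ** 2
        weights = np.exp(log_w - log_w.max())
        weights /= weights.sum()
        # Posterior-mean estimate of the affect state at time t
        estimates[t] = np.sum(weights * particles)
        # Multinomial resampling to avoid weight degeneracy
        idx = rng.choice(n_particles, n_particles, p=weights)
        particles = particles[idx]
    return estimates
```

With agreeing modality measurements, the posterior mean settles near the measured value while the dynamics term smooths transient disagreements between modalities.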
