IEEE International Conference on Acoustics, Speech and Signal Processing

Lightly-supervised utterance-level emotion identification using latent topic modeling of multimodal words

Abstract

Research on multimodal emotion recognition has recently drawn much attention across diverse disciplines. With the increasing amount of multimodal data, unsupervised or semi-supervised learning has become highly desirable for automatically discovering patterns of emotional expression in behavioral data. We present a novel approach to multimodal emotion learning that uses only a small number of labels. Our approach hinges on probabilistic latent semantic analysis (pLSA) with the latent variable defined as the emotion class, motivated by the view that human emotion acts as a latent control variable regulating external behavioral manifestations such as speech and body gesture. In our approach, we represent the audio-visual information in an utterance as a bag of multimodal words. To exploit the interrelation between the speech and gesture modalities, we propose a canonical correlation analysis (CCA) based vocabulary of multimodal words. Our approach has achieved promising experimental results, and we have demonstrated the superiority of CCA-based multimodal words over words derived directly from the original cues.
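The CCA-based vocabulary of multimodal words can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: it assumes frame-level audio and gesture feature matrices for the same frames, projects them onto their top canonical directions via an SVD of the whitened cross-covariance, and quantizes the concatenated projections with a small k-means loop to form the "multimodal words". All function names and parameters here are hypothetical.

```python
import numpy as np

def cca_project(X, Y, n_components=2, reg=1e-6):
    """Project two feature views (e.g. audio and gesture frame features)
    onto their top canonical directions, computed from the SVD of the
    whitened cross-covariance matrix Cxx^{-1/2} Cxy Cyy^{-1/2}."""
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    n = X.shape[0]
    Cxx = X.T @ X / n + reg * np.eye(X.shape[1])
    Cyy = Y.T @ Y / n + reg * np.eye(Y.shape[1])
    Cxy = X.T @ Y / n

    def inv_sqrt(C):
        # Symmetric inverse square root via eigendecomposition.
        w, V = np.linalg.eigh(C)
        return V @ np.diag(1.0 / np.sqrt(w)) @ V.T

    K = inv_sqrt(Cxx) @ Cxy @ inv_sqrt(Cyy)
    U, s, Vt = np.linalg.svd(K)
    Wx = inv_sqrt(Cxx) @ U[:, :n_components]
    Wy = inv_sqrt(Cyy) @ Vt[:n_components].T
    return X @ Wx, Y @ Wy

def multimodal_vocabulary(Zx, Zy, n_words=8, n_iter=20, seed=0):
    """Quantize the concatenated canonical projections into a discrete
    vocabulary of 'multimodal words' with a small k-means loop."""
    Z = np.hstack([Zx, Zy])
    rng = np.random.default_rng(seed)
    centers = Z[rng.choice(len(Z), n_words, replace=False)]
    for _ in range(n_iter):
        labels = np.argmin(((Z[:, None] - centers) ** 2).sum(-1), axis=1)
        for k in range(n_words):
            if np.any(labels == k):  # skip empty clusters
                centers[k] = Z[labels == k].mean(axis=0)
    return labels, centers
```

Each frame is thereby mapped to a discrete word index, so an utterance becomes a bag (histogram) of multimodal words over this vocabulary.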
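The pLSA step with emotion as the latent topic can be sketched in the same spirit. Again a hedged illustration rather than the paper's exact method: a standard EM loop for pLSA over an utterance-by-word count matrix, with the light supervision approximated by clamping P(topic | utterance) for the few labeled utterances; the clamping scheme and all names are assumptions.

```python
import numpy as np

def plsa_em(counts, n_topics, labels=None, n_iter=50, seed=0):
    """Fit pLSA by EM on an utterance-by-word count matrix.

    counts[d, w] -- frequency of multimodal word w in utterance d.
    labels       -- optional {utterance_index: topic_index} for the few
                    labeled utterances (the 'light' supervision).
    Returns P(topic | utterance) and P(word | topic); in the paper's
    framing, each latent topic plays the role of an emotion class.
    """
    rng = np.random.default_rng(seed)
    n_docs, n_words = counts.shape
    p_z_d = rng.dirichlet(np.ones(n_topics), size=n_docs)   # P(z | d)
    p_w_z = rng.dirichlet(np.ones(n_words), size=n_topics)  # P(w | z)
    for _ in range(n_iter):
        # E-step: responsibilities P(z | d, w) proportional to P(z | d) P(w | z)
        r = p_z_d[:, None, :] * p_w_z.T[None, :, :]          # (D, W, Z)
        r /= r.sum(axis=2, keepdims=True) + 1e-12
        # M-step: re-estimate both distributions from expected counts
        nz = counts[:, :, None] * r                          # (D, W, Z)
        p_z_d = nz.sum(axis=1)
        p_z_d /= p_z_d.sum(axis=1, keepdims=True) + 1e-12
        p_w_z = nz.sum(axis=0).T
        p_w_z /= p_w_z.sum(axis=1, keepdims=True) + 1e-12
        # Light supervision: clamp P(z | d) toward the known emotion class
        if labels:
            for d, z in labels.items():
                p_z_d[d] = 1e-3
                p_z_d[d, z] = 1.0
                p_z_d[d] /= p_z_d[d].sum()
    return p_z_d, p_w_z
```

At test time, an unlabeled utterance would be assigned the emotion class maximizing P(topic | utterance); the clamped labeled utterances anchor the otherwise unidentifiable topics to specific emotion classes.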
