首页> 外文会议>6th workshop on ontologies and lexical resources. >Latent Mixture of Discriminative Experts for Multimodal Prediction Modeling
【24h】

Latent Mixture of Discriminative Experts for Multimodal Prediction Modeling

机译:多模式预测建模的判别专家的潜在混合

获取原文
获取原文并翻译 | 示例

摘要

During face-to-face conversation, people naturally integrate speech, gestures and higher level language interpretations to predict the right time to start talking or to give backchannel feedback. In this paper we introduce a new model called Latent Mixture of Discriminative Experts which addresses some of the key issues with multimodal language processing: (1) temporal synchrony/asynchrony between modalities, (2) micro dynamics and (3) integration of different levels of interpretation. We present an empirical evaluation on listener nonverbal feedback prediction (e.g., head nod), based on observable behaviors of the speaker. We confirm the importance of combining four types of multimodal features: lexical, syntactic structure, eye gaze, and prosody. We show that our Latent Mixture of Discriminative Experts model outperforms previous approaches based on Conditional Random Fields (CRFs) and Latent-Dynamic CRFs.
机译:在面对面的交谈中,人们自然会整合语音,手势和高级语言解释,以预测何时开始交谈或提供反向渠道反馈的正确时间。在本文中,我们介绍了一个称为“判别专家的潜在混合”的新模型,该模型解决了多模态语言处理中的一些关键问题:(1)模态之间的时间同步/异步,(2)微观动力学和(3)不同层次的语言集成解释。我们根据说话者的可观察到的行为对听者的非语言反馈预测(例如,头点头)进行实证评估。我们确认结合四种类型的多峰特征的重要性:词汇,句法结构,视线和韵律。我们表明,基于判别专家的潜在混合模型优于基于条件随机场(CRF)和潜在动态CRF的先前方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号