A Simple Analysis of Multimodal Data Fusion

Abstract

Research on multimodal data fusion dates back to audio-visual speech recognition, which was inspired by the McGurk effect. Because traditional methods had limited model capacity, multimodal data fusion attracted little research attention for a period. Recently, advances in deep learning have opened up new opportunities for the field. However, a large gap remains between artificial intelligence and human beings in the ability to process multimodal data, and many problems in multimodal data processing still need to be studied. In this work, we propose to gain insight into the information fusion level and to apply different information fusion strategies to different situations. We analyze the situations that arise in the multimodal data fusion process and divide them into two categories: consistent information fusion and contradictory information fusion. We demonstrate toy examples of the different cases of the multimodal data fusion process.
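
The split into consistent and contradictory cases can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not specify how the two cases are detected or fused, so everything below is an assumption made for illustration. Each modality is assumed to output a class probability distribution, the two are called consistent when their top classes agree, consistent inputs are fused with a normalized product, and contradictory inputs fall back to the lower-entropy (more confident) modality. The names `fuse`, `p_audio`, and `p_visual` are hypothetical.

```python
import math


def normalize(p):
    """Scale non-negative scores so they sum to 1."""
    total = sum(p)
    return [x / total for x in p]


def entropy(p):
    """Shannon entropy in nats; lower means a more confident distribution."""
    return -sum(x * math.log(x) for x in p if x > 0)


def argmax(p):
    """Index of the largest entry."""
    return max(range(len(p)), key=p.__getitem__)


def fuse(p_audio, p_visual):
    """Return (case, fused_distribution) for two class distributions.

    Assumed rule, for illustration only:
    - consistent (same top class): normalized element-wise product
    - contradictory (different top class): keep the lower-entropy modality
    """
    if argmax(p_audio) == argmax(p_visual):
        product = [a * v for a, v in zip(p_audio, p_visual)]
        return "consistent", normalize(product)
    chosen = p_audio if entropy(p_audio) <= entropy(p_visual) else p_visual
    return "contradictory", list(chosen)


if __name__ == "__main__":
    # Both modalities favour class 0: agreement sharpens the fused estimate.
    print(fuse([0.7, 0.3], [0.6, 0.4]))
    # The modalities disagree: the more confident one is kept.
    print(fuse([0.9, 0.1], [0.4, 0.6]))
```

Other choices are equally plausible for the contradictory case, such as averaging or a learned gating weight; the fallback rule here is only the simplest option that treats the two cases differently.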
