IAPR International Conference on Document Analysis and Recognition

Multimodal Classification Fusion in Real-World Scenarios



Abstract

In this paper, we propose a multimodal setting for real-world scenarios based on weighting and meta-learning combination methods that integrate the output probabilities obtained from text and visual classifiers. While a classifier built on the concatenation of text and visual features may worsen the results, the model described in this paper can increase classification accuracy by more than 6%. Typically, either text or images are used for classification; however, ambiguity in the text or the image may reduce performance. This motivates combining the text and image of an object or concept in a multimodal approach to enhance performance. In our approach, a text classifier is trained on Bag-of-Words features and a visual classifier is trained on features extracted through a Deep Convolutional Neural Network. We created a new dataset of real-world texts and images called Ferramenta. Some of the images and related texts in this dataset contain ambiguities, which makes it an ideal setting for testing a multimodal approach. Experimental results reported on the Ferramenta and PASCAL VOC2007 datasets indicate that the described combination methods perform better in a multimodal setting.
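To make the fusion step concrete, the following is a minimal sketch of late fusion by a weighted combination of the output probabilities of a text and a visual classifier, as the abstract describes. The weight value, array shapes, and toy numbers are illustrative assumptions and are not taken from the paper.

```python
import numpy as np

def weighted_fusion(p_text: np.ndarray, p_visual: np.ndarray, alpha: float = 0.5) -> np.ndarray:
    """Combine per-class probabilities from a text and a visual classifier.

    p_text, p_visual: arrays of shape (n_samples, n_classes) holding the
    class probabilities produced by the two unimodal classifiers.
    alpha: weight given to the text modality; (1 - alpha) goes to the visual one.
    """
    fused = alpha * p_text + (1.0 - alpha) * p_visual
    # Renormalize each row so the fused scores remain a probability distribution.
    return fused / fused.sum(axis=1, keepdims=True)

if __name__ == "__main__":
    # Toy example with 2 samples and 3 classes: the text classifier is
    # ambiguous on the first sample, the visual classifier on the second.
    p_text = np.array([[0.40, 0.35, 0.25],
                       [0.10, 0.80, 0.10]])
    p_visual = np.array([[0.70, 0.20, 0.10],
                         [0.30, 0.40, 0.30]])
    fused = weighted_fusion(p_text, p_visual, alpha=0.5)
    print(fused.argmax(axis=1))  # fused class predictions per sample
```

The weight alpha can be fixed or tuned on a validation set; the paper's meta-learning variant instead trains a second-level classifier on the concatenated output probabilities of the two unimodal models.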
