Towards Mathematical Reasoning: A Multimodal Deep Learning Approach

机译：迈向数学推理：一种多模式深度学习方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a new direction for the visual question answering task. Given an image with a simple linear algebraic equation system and a question in natural language based on the variables in the equations, we propose an end-to-end deep learning model that produces accurate answers to questions pertaining to the value of the variables and other related questions. Modeling the problem of solving simple linear equations as a VQA task makes it interesting as the system now requires three kinds of understanding a) visual understanding to recognize digits, variables, operators and equal sign b) conceptual understanding of the symbolic meanings of coefficients' constants, variables, operators and equality and c) high level understanding of the interaction between the image and the questions in order to accurately answer them. We also create an open-source dataset for the same and compare the performance of our model with different baselines.

机译：本文提出了视觉问题解答任务的新方向。给定具有简单线性代数方程组的图像以及基于方程中变量的自然语言问题，我们提出了一种端到端深度学习模型，该模型可为与变量和其他值有关的问题提供准确的答案相关问题。将简单线性方程式的问题建模为VQA任务使其变得很有趣，因为系统现在需要三种理解方式：a）视觉理解，以识别数字，变量，运算符和等号b）对系数常数的符号含义的概念性理解，变量，运算符和相等性;以及c）对图像和问题之间的相互作用有较高的了解，以便准确地回答它们。我们还为此创建了一个开源数据集，并将我们的模型的性能与不同的基准进行比较。

著录项

来源
《IEEE International Conference on Image Processing》|2018年|4028-4032|共5页
会议地点
作者
Abhishek Sinha; Kumar Ayush;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Mathematical model; Task analysis; Visualization; Decoding; Computational modeling; Machine learning; Numerical models;

机译：数学模型;任务分析;可视化;解码;计算模型;机器学习;数值模型;

相似文献

外文文献
中文文献
专利

1. 一种基于深度稀疏自编码的语音情感迁移学习方法 [J] . 梁镇麟, 梁瑞宇, 唐曼婷, 东南大学学报（英文版） . 2019,第002期
2. Multimodal deep learning for multimedia understanding and reasoning [J] . Han Yahong, Chen Jingjing Multimedia Tools and Applications . 2021,第11期

机译：多模式深度学习多媒体理解与推理
3. Development of Learning Materials with Realistic Mathematics Education Approach to Improve Students’ Mathematic Reasoning Ability and Self-Concept [J] . Nenta Dumalia Siregar, Elmanani Simamora, Izwita Dewi Journal of Education and Practice . 2019,第23期

机译：运用逼真的数学教育方法开发学习材料，以提高学生的数学推理能力和自我概念
4. Mathematics Learning Device Development Based On Constructivism Approach To Improve Mathematical Reasoning Skill Of Class X Students In Vocational High School (SMK) [J] . Yunita Safitri, I Made Arnawa International Journal of Scientific & Technology Research . 2019,第5期

机译：基于建构主义方法的数学学习设备开发，以提高职业高中X级学生的数学推理能力
5. TOWARDS MATHEMATICAL REASONING: A MULTIMODAL DEEP LEARNING APPROACH [C] . Abhishek Sinha, Kumar Ayush IEEE International Conference on Image Processing . 2018

机译：迈向数学推理：多峰深度学习方法
6. Deep Reinforcement Learning Approach to Multimodal Adaptive Traffic Signal Control [D] . Mohamad Alizadeh Shabestary, Soheil. 2019

机译：多模式自适应交通信号控制的深增强学习方法
7. Deep Learning Approach for Multimodal Biometric Recognition System Based on Fusion of Iris Face and Finger Vein Traits [O] . Nada Alay, Heyam H. Al-Baity 2020

机译：基于虹膜面部和手指静脉特征的多模式生物识别系统的深度学习方法
8. COMPARISON OF THE ABILITY OF MATHEMATICAL REASONING OF SMK STUDENTS USING A METACOGNITIVE SKILL APPROACH THROUGH ADVANCE ORGANIZER LEARNING MODELS WITH A SAINTIFIC APPROACH THROUGH DISCOVERY LEARNING [O] . Rosalina Rolina, Sri Aryani, Erik Taufiqurrohman 2018

机译：利用先进组织机学习模型，通过发现学习比较MSMK学生使用元认知技能方法的数学推理能力

Towards Mathematical Reasoning: A Multimodal Deep Learning Approach

摘要

著录项

相似文献

相关主题

期刊订阅