The Search for Equations — Learning to Identify Similarities Between Mathematical Expressions

Abstract

When you search for scientific articles relevant to your research question, you judge the relevance of a mathematical expression that you stumble upon using extensive background knowledge about the domain, its problems and its notations. We wonder if machine learning can support this process and work toward implementing a search engine for mathematical expressions in scientific publications. Thousands of scientific publications with millions of mathematical expressions or equations are accessible at arXiv.org. We want to use this data to learn about equations, their distribution and their relations in order to find similar equations. To this end we propose an embedding model based on convolutional neural networks that maps bitmap images of equations into a low-dimensional vector space where similarity is evaluated via the dot product. However, no annotated similarity data is available to train this mapping. We mitigate this by proposing a number of different unsupervised proxy tasks that use available features as weak labels. We evaluate our system using a number of metrics, including results on a small hand-labeled subset of equations. In addition, we show and discuss a number of result sets for some sample queries. The results show that we are able to automatically identify related mathematical expressions. Our dataset is published at https://whadup.github.io/EquationLearning/ and we invite the community to use it.
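The retrieval step described in the abstract, ranking equations by the dot product of their embeddings, can be sketched as follows. This is only an illustration and not the authors' implementation: the convolutional network is left out entirely, the three-dimensional vectors and the names `eq_a`, `eq_b`, `eq_c` are made up, and normalizing the vectors to unit length is an assumption that the abstract does not state.

```python
from math import sqrt


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    """Scale a vector to unit length (an assumption, not stated in the abstract)."""
    norm = sqrt(dot(v, v))
    return [x / norm for x in v] if norm > 0 else list(v)


def rank_by_dot_product(query, corpus, top_k=3):
    """Return the top_k (name, score) pairs, highest dot product first."""
    scores = [(name, dot(query, emb)) for name, emb in corpus.items()]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:top_k]


# Hypothetical embeddings; in the paper these would come from a CNN
# applied to bitmap images of equations.
corpus = {
    "eq_a": normalize([1.0, 0.0, 0.0]),
    "eq_b": normalize([0.6, 0.8, 0.0]),
    "eq_c": normalize([0.0, 1.0, 0.0]),
}
query = normalize([1.0, 0.1, 0.0])

results = rank_by_dot_product(query, corpus, top_k=2)
for name, score in results:
    print(f"{name}: {score:.3f}")
```

With unit-length vectors the dot product equals cosine similarity, so the ranking depends only on the direction of the embeddings and not on their magnitude.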


Bibliographic details

Source
European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2019, pp. 704-718 (15 pages)
Authors
Lukas Pfahler; Jonathan Schill; Katharina Morik
Original format
PDF
Keywords
Applied data science track; Data science; Preference learning and ranking; Deep learning

Similar literature

1. Local similarity preserved hashing learning via Markov graph for efficient similarity search [J]. Liu Hong, Jiang Aiwen, Wang Mingwen, Neurocomputing. 2015, No. jula2

2. Comparison of Mathematical Equations Applicable to Tolerance of Total Body Irradiation in Humans and Decay of Isotopes, Uranium and Thorium: Differences and Similarity [J]. Sung Jang Chung, Journal of Biomedical Science and Engineering. 2017, No. 5

3. Mathematical differences and physical similarities between Eliezer-Ford-O’connell equation and Landau-Lifshitz equation [J]. J.F. García-Camacho, E. Salinas, A. Avalos-Vargas, Revista mexicana de fisica. 2015, No. 5

4. Graphical User Interface for Search of Mathematical Expressions with Regular Expressions [C]. Takayuki Watabe, Yoshinori Miyazaki, International conference on human-computer interaction. 2015

5. Learning Effective Binary Representation with Deep Hashing Technique for Large-Scale Multimedia Similarity Search [D]. Wu, Gengshen. 2020

6. Similarities and Differences in the Learning Profiles of Adolescents with SLD and SLI in Mathematics: A Preliminary Analysis [O]. Eleni Bonti, Afroditi Kamari, Maria Sofologi, 2021

7. A geometrical similarity between diffusion of biological particles in mathematical biology and migration of human population in mathematical sociology (Dynamics of functional equations and numerical simulation) [O]. Tabata Minoru, 2006
