【24h】

Deep Learning for Math Knowledge Processing

机译:用于数学知识处理的深度学习

获取原文

摘要

The vast and fast-growing STEM literature makes it imperative to develop systems for automated math-semantics extraction from technical content, and for semantically-enabled processing of such content. Grammar-based techniques alone are inadequate for the task. We present a new project for using deep learning (DL) for that purpose. It will explore a number of DL and representation-learning models, which have shown superior performance in applications that involve sequences of data. As math and science involve sequences of text, symbols and equations, such as deep learning models are expected to deliver good performance in math-semantics extraction and processing. The project has several goals: (1) to apply different DL models to math-semantics extraction and processing, designing more suitable models as needed, for such foundational tasks as accurate tagging and automated translation from LATEX to semantically-resolved machine understandable forms such as cMathML; (2) to create and make available to the public labeled math-content datasets for model training and testing, and Word2Vec/Math2Vec representations derived from large math datasets; and (3) to conduct extensive comparative performance evaluations gaining insights into which DL models, data representations, and traditional machine learning models, are best for the above-stated goals.
机译:大量且快速增长的STEM文献迫切需要开发用于从技术内容中自动提取数学语义以及对此类内容进行语义支持的处理的系统。仅基于语法的技术不足以完成这项任务。我们提出了一个用于为此目的使用深度学习(DL)的新项目。它将探索许多DL和表示学习模型,这些模型在涉及数据序列的应用程序中显示出卓越的性能。由于数学和科学涉及文本,符号和方程的序列,因此诸如深度学习模型之类的模型有望在数学语义提取和处理中提供良好的性能。该项目有以下目标:(1)将不同的DL模型应用于数学语义提取和处理,根据需要设计更合适的模型,以完成诸如LATEX到LATEX到语义解析的机器可理解形式的准确标记和自动翻译之类的基础任务。 cMathML; (2)创建并向公众提供带有标签的数学内容数据集以进行模型训练和测试,以及从大型数学数据集派生的Word2Vec / Math2Vec表示形式; (3)进行广泛的比较性能评估,以获取最适合上述目标的DL模型,数据表示形式和传统机器学习模型的见解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号