首页> 外文会议>The 2nd International Conference on Software Engineering and Data Mining >Feature extraction and clustering-based retrieval for mathematical formulas
【24h】

Feature extraction and clustering-based retrieval for mathematical formulas

机译:基于特征提取和基于聚类的数学公式检索

获取原文

摘要

Mathematical formulas or expressions are essential for presenting scientific knowledge in many research documents in academic areas such as physics and mathematics. Searching for related mathematical formulas is an important but challenging problem as formulas contain both structural and semantic information. Such information is hidden inside the mathematical expressions of the formulas. To support effective formula search, it is necessary to extract the structural and semantic features from the mathematical presentation of the formulas faithfully. In this paper, we propose an effective approach for formula feature extraction. To evaluate the proposed approach, the extracted features are tested with three popular clustering algorithms, namely K-means, Self Organizing Map (SOM), and Agglomerative Hierarchical Clustering (AHC), for formula retrieval. The performance of the clustering-based retrieval is measured based on a dataset of 881 formulas and promising results have been achieved.
机译:数学公式或表达式对于在学术领域(例如物理和数学)中的许多研究文档中呈现科学知识至关重要。搜索相关的数学公式是一个重要但具有挑战性的问题,因为公式包含结构和语义信息。这些信息隐藏在公式的数学表达式内。为了支持有效的公式搜索,有必要从公式的数学表示中忠实地提取结构和语义特征。在本文中,我们提出了一种有效的公式特征提取方法。为了评估所提出的方法,使用三种流行的聚类算法(即K-means,自组织图(SOM)和聚集层次聚类(AHC))对提取的特征进行了测试,以进行公式检索。基于881公式的数据集对基于聚类的检索的性能进行了测量,并取得了可喜的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号