Retrieval of Mathematical Information with Syntactic and Semantic Structure over Web

Hussain Sharaf; Khoja Shakeel

首页> 外文期刊>Journal of Information Recording >Retrieval of Mathematical Information with Syntactic and Semantic Structure over Web

【24h】

Retrieval of Mathematical Information with Syntactic and Semantic Structure over Web

机译：Web上的句法和语义结构检索数学信息

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Efficient retrieval of mathematical expressions over web is a complex process as compared to simple text search. This is only possible when the syntactic (e.g. Textual) and semantic (e.g. Structural) information of a mathematical expression is retrieved properly and analyzed methodically. In this paper, we are proposing a technique that indexes expressions along with their syntactic and semantic information. These expressions are represented in ContentMathML(CMML). To improve the memory efficiency in index, an encoding technique is introduced which encode CMML mathematical expressions in Braille Unicode characters. In order to improve ranking of retrieved documents, a weighting function is introduced which assign a weight to each indexing term. The weighting score of each term contributes in ranking function that improves the rank of a document which contains query terms. The proposed technique is evaluated on NTCIR-12 Wikipedia and Arxiv corpora. Performance is also measured using NTCIR-MathIR evaluation criteria. The precision for Wikipedia-formula-queries is achieved 47% and for Arxiv is achieved 44% at top 5 documents.

机译：与简单文本搜索相比，在Web上的数学表达式的有效检索是一个复杂的过程。只有当正确并有条理地分析数学表达式的语法（例如文本）和语义（例如结构）信息时才可以获得。在本文中，我们提出了一种索引表达式以及其句法和语义信息的技术。这些表达式在ContentMathml（CMML）中表示。为了提高索引中的记忆效率，介绍了编码技术，其在盲文Unicode字符中编码CMML数学表达式。为了改善检索的文档的排名，引入了对每个索引项的权重的加权函数。每个术语的加权分数有助于排名函数，从而提高包含查询术语的文档的等级。所提出的技术在NTCIR-12维基百科和Arxiv Corpora上进行了评估。使用NTCIR-Mathir评估标准也测量性能。维基百科配方查询的精度实现了47％，对于Arxiv在前5份文件中实现了44％。

著录项

来源
《Journal of Information Recording》 |2020年第1期|75-89|共15页
作者
Hussain Sharaf; Khoja Shakeel;
展开▼
作者单位

Inst Business Adm Dept Comp Sci Karachi 74400 Pakistan;

Inst Business Adm Dept Comp Sci Karachi 74400 Pakistan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
information retrieval; formula retrieval; term ranking; structure matching; term encoding; formula indexing;

机译：信息检索;公式检索;术语排名;结构匹配;术语编码;公式索引;

相似文献

外文文献
中文文献
专利

1. Mining Semantics Structures from Syntactic Structures in Web Document Corpora [J] . Hamid Mousavi, Shi Gao, Deirdre Kerr, International journal of semantic computing . 2014,第4期

机译：从Web文档语料库的句法结构挖掘语义结构。
2. A Semantic Web data retrieval implementation with an adaptive model for supporting agent decision structures [J] . James V. Hansen, James B. McDonald, Conan C. Albrecht, Electronic Commerce Research . 2007,第1期

机译：带有支持代理决策结构的自适应模型的语义Web数据检索实现
3. Leveraging the structure of the semantic web to enhance information retrieval for proteomics [J] . Smith A, Cheung K, Krauthammer M, Bioinformatics . 2007,第22期

机译：利用语义网的结构来增强蛋白质组学的信息检索
4. A Web-based Tool for the Integrated Annotation of Semantic and Syntactic Structures [C] . Richard Eckart de Castilho, Eva Mujdricza-Maydt, Seid Muhie Yimam, Language technology resources and tools for digital humanities . 2016

机译：用于语义和句法结构集成注释的基于Web的工具
5. Information retrieval and semantic structure matching for assessing Web-service similarity. [D] . Wang, Yiqiao. 2003

机译：信息检索和语义结构匹配，用于评估Web服务的相似性。
6. The Influence of Task-Irrelevant Music on Language Processing: Syntactic and Semantic Structures [O] . Lisianne Hoch, Benedicte Poulin-Charronnat, Barbara Tillmann 2011

机译：任务无关的音乐对语言处理的影响：句法和语义结构
7. The Syntactic Web - Syntax and Semantics on the Web [O] . Jonathan Robie 2001

机译：语法Web-Web上的语法和语义

Retrieval of Mathematical Information with Syntactic and Semantic Structure over Web

摘要

著录项

相似文献

相关主题

期刊订阅