Towards a semantic book search engine

机译：迈向语义书搜索引擎

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Traditional Information Retrieval (IR) methods were initially used for searching and ranking web pages on the Web. These methods were progressively modified to exploit the peculiarities of the Web including the use of the hyperlinked structure of the Web for relevance ranking. These Web IR techniques, however, are also being applied for searching and ranking to other forms of text collections which are not inherently web documents. Books (especially in PDF form) are by nature different from web pages because they lack an explicit hypertextual structure and therefore cannot be accurately and precisely searched and ranked using traditional approaches. Books contain a highly structured content with implicit logical connections among different parts of the same book as well as to related content in other books. These book structural semantics and logical connections could be discovered and used to establish a web of books where the logical concepts, images, figures, tables, and other parts are linked with each other thus resulting in a semantic graph, which could then be exploited by a semantic book search engine for more precise and accurate indexing, searching, ranking and recommendations. Based on this hypothesis, the paper outlines a high-level architecture for one of the possible implementations of a semantic book search engine, identifies all the potential areas of research for future researchers, and reports on our work in progress in the form of the proposed model for the purpose. The proposed architecture, if implemented in its true sense, has the potential to better serve the needs of all the stakeholders including authors, publishers, readers, and librarians.

机译：最初，传统的信息检索（IR）方法用于在Web上对网页进行搜索和排名。对这些方法进行了逐步修改，以利用Web的特殊性，包括使用Web的超链接结构进行相关性排名。但是，这些Web IR技术也被用于搜索和排序其他形式的文本集合，而这些文本集合本身并不是Web文档。书籍（尤其是PDF格式的书籍）与网页本质上是不同的，因为它们缺乏明确的超文本结构，因此无法使用传统方法进行准确，精确的搜索和排名。书籍包含高度结构化的内容，在同一本书的不同部分之间以及与其他书籍中的相关内容之间具有隐式逻辑联系。这些书本的结构语义和逻辑联系可以被发现并用于建立一个书本网络，其中逻辑概念，图像，图形，表格和其他部分相互链接，从而产生一个语义图，然后可以被该图利用语义书搜索引擎，可进行更精确，更准确的索引，搜索，排名和推荐。基于此假设，本文概述了语义书搜索引擎的一种可能实现方式的高级体系结构，为未来的研究人员确定了所有潜在的研究领域，并以提议的形式报告了我们正在进行的工作目的模型。所提议的体系结构，如果以其真正的意义实施，则有可能更好地满足所有利益相关者的需求，包括作者，出版者，读者和图书馆员。

著录项

来源
《International Conference on Open Source Systems and Technologies》|2016年|106-113|共8页
会议地点
作者
Shah Khusro; Irfan Ullah;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Semantics; Search engines; Web pages; Data mining; Semantic Web; Indexing;

机译：语义;搜索引擎;网页;数据挖掘;语义网;索引;

相似文献

外文文献
中文文献
专利

1. A Study on Semantic Searching, Semantic Search Engines and Technologies Used for Semantic Search Engines [J] . Junaid Rashid, Muhammad Wasif Nisar International Journal of Information Technology and Computer Science . 2016,第10期

机译：用于语义搜索引擎的语义搜索，语义搜索引擎和技术的研究
2. ExNa: an efficient search pattern for semantic search engines‡ [J] . Wei Xiao, Zeng Daniel Dajun Concurrency and computation: practice and experience . 2016,第15期

机译：ExNa：语义搜索引擎的有效搜索模式‡
3. Evaluation of a Traditional Search Engine against a Semantic Search Engine Using User Effort Measurements [J] . Ambarnath Banerji, Bhim Singh, Sujit K. Biswas The Online Journal on Computer Science and Information Technology . 2014,第1期

机译：使用用户努力量评估针对语义搜索引擎的传统搜索引擎
4. In Search of a Semantic Book Search Engine on the Web: Are We There Yet? [C] . Irfan Ullah, Shah Khusro Computer Science On-line Conference . 2016

机译：在网上搜索语义书搜索引擎：我们是否有？
5. Semantic routed network for distributed search engines [D] . Biswas, Amitava 2010

机译：分布式搜索引擎的语义路由网络
6. Sweet google O’ mine—The importance of online search engines for MS-facilitated database-independent identification of peptide-encoded book prefaces [O] . Alexander Hogrebe, Rosa R. Jersie-Christensen 2019

机译：谷歌我的天哪！在线搜索引擎对于MS辅助的与数据库无关的肽编码书序识别的重要性
7. Semantic Vector Encoding and Similarity Search Using Fulltext Search Engines [O] . Rygl, Jan, Pomikálek, Jan, Řehůřek, Radim, 2017

机译：基于全文搜索的语义向量编码和相似性搜索引擎

Towards a semantic book search engine

摘要

著录项

相似文献

相关主题

期刊订阅