Journal: International Journal of Intelligent Information and Database Systems

Search engine indexing storage optimisation using Hamming distance


Abstract

We propose an indexing algorithm for search engines that aims to reduce time and space complexity. Existing indexing algorithms have greater space requirements because they store every word of a web page except the stop words. In this paper, we present a theory of the indexing mechanism of a search engine. Here, time complexity is the time taken by the search engine to retrieve information, and space complexity is the space required to store the indices on the hard disk. Decreasing the time complexity leads to faster retrieval of information, and decreasing the space complexity leads to more efficient utilisation of space. We deal only with the textual part of web pages. The concept of Hamming distance frames our approach to achieving better space complexity.
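The abstract names its building blocks but does not spell out how they are combined, so the following is only a minimal sketch of the two ideas it mentions: a stop-word-filtered inverted index (the baseline the abstract says stores every non-stop word) and the Hamming distance between two equal-length strings. The function names, the stop-word list, and the sample pages are hypothetical and are not taken from the paper; the paper's actual storage-reduction scheme is not reproduced here.

```python
from __future__ import annotations

from collections import defaultdict

# Illustrative stop-word list (an assumption, not the paper's list).
STOP_WORDS = {"a", "an", "the", "is", "of", "and", "to", "in"}


def hamming_distance(a: str, b: str) -> int:
    """Count the positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("Hamming distance is defined only for equal-length strings")
    return sum(ch_a != ch_b for ch_a, ch_b in zip(a, b))


def build_inverted_index(pages: dict[str, str]) -> dict[str, set[str]]:
    """Map each non-stop word to the set of page ids that contain it."""
    index: defaultdict[str, set[str]] = defaultdict(set)
    for page_id, text in pages.items():
        for word in text.lower().split():
            if word not in STOP_WORDS:
                index[word].add(page_id)
    return dict(index)


# Hypothetical sample pages, textual content only.
pages = {
    "p1": "the search engine",
    "p2": "search the index",
}
index = build_inverted_index(pages)
distance = hamming_distance("karolin", "kathrin")
```

In this baseline every distinct non-stop word gets its own index entry, which is the space cost the abstract attributes to existing algorithms. How Hamming distance is used to shrink that storage is left to the full paper.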

Bibliographic details

  • Author affiliations

    Netaji Subhash Engineering College, West Bengal University of Technology, Calcutta 700152, India; Innovation Research Lab (IRL), Capex Technologies, West Bengal 711103, India


  • Original format: PDF
  • Text language: English
  • Keywords

    search engine; forward indexing; inverted indexing; hamming distance; indexing storage minimisation;
