SEARCH ENGINE SYSTEM BASED ON DISTRIBUTED DATA STORAGE DEVICE AND SEARCH METHOD THEREOF

Abstract

The present invention relates to a search engine system based on a distributed data storage device and to a search method for that system. According to an embodiment, the system comprises: a user terminal that can connect to a network and through which a search request is input; a search engine server including an index module that, when a target document is collected for indexing, generates index data comprising an inverted index file and an original document file for that document; and a distributed storage device that receives and stores the original document file generated by the index module. When a search request is input from the user terminal, the search engine server parses and analyzes the search query corresponding to the request, computes a search result using the inverted index file that contains the words in the query, and returns that result.

According to an embodiment, of the index result files (the inverted index file and the original document file), the search engine nodes share only the inverted index file, while the original document file is stored in the distributed data storage device. This reduces the cost of transmitting unnecessary data and improves search speed, because a search query can be served from the inverted index file alone.

Copyright KIPO 2020
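
The split described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: `DocumentStore`, `SearchNode`, `tokenize`, the in-memory dictionaries, the AND-matching of query terms, and the term-frequency ranking are all assumptions made for illustration. The only points taken from the abstract are that indexing produces an inverted index plus an original document, that the original goes to a separate store, and that a query is answered from the inverted index alone.

```python
import re
from collections import defaultdict


class DocumentStore:
    """Hypothetical stand-in for the distributed storage device.

    It holds original documents only and counts reads, so the demo can
    show that searching does not touch it.
    """

    def __init__(self):
        self._docs = {}
        self.reads = 0

    def put(self, doc_id, text):
        self._docs[doc_id] = text

    def get(self, doc_id):
        self.reads += 1
        return self._docs[doc_id]


def tokenize(text):
    """Lowercase the text and split it into word tokens (assumed analyzer)."""
    return re.findall(r"\w+", text.lower())


class SearchNode:
    """Hypothetical search engine node that keeps only the inverted index."""

    def __init__(self, store):
        self.store = store
        # term -> {doc_id: term frequency}
        self.inverted = defaultdict(dict)

    def index(self, doc_id, text):
        """Build postings locally and ship the original text to the store."""
        for term in tokenize(text):
            postings = self.inverted[term]
            postings[doc_id] = postings.get(doc_id, 0) + 1
        self.store.put(doc_id, text)

    def search(self, query):
        """Return ids of documents containing every query term.

        Ranking by summed term frequency is an assumption; the abstract
        does not say how results are scored.
        """
        terms = tokenize(query)
        if not terms:
            return []
        postings = [self.inverted.get(t, {}) for t in terms]
        matching = set(postings[0]).intersection(*postings[1:])
        return sorted(matching, key=lambda d: (-sum(p[d] for p in postings), d))

    def fetch(self, doc_id):
        """Retrieve the original document from the separate store on demand."""
        return self.store.get(doc_id)


store = DocumentStore()
node = SearchNode(store)
node.index("d1", "distributed storage for search engines")
node.index("d2", "search engine index")
node.index("d3", "distributed storage device")

hits = node.search("distributed storage")
print(hits, "store reads during search:", store.reads)
print(node.fetch(hits[0]))
```

In this sketch the store is read only when `fetch` is called for a specific hit, which mirrors the abstract's claim that queries can be served without moving original documents between nodes.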

Bibliographic data

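  • Publication number: KR102089348B1
  • Publication date: 2020-03-16
  • Original format: PDF
  • Applicant/patentee: WISENUT INC.
  • Application number: KR20190010514
  • Inventors: YANG JAE SEOK; JANG JUNG HOON
  • Filing date: 2019-01-28
  • Classification: G06F16/2458; G06F16/22; G06F16/2455; H04L29/08
  • Country: KR
  • Database entry time: 2022-08-21 11:05:08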
