Homepage searching method using similarity recalculation based on URL substring relationship

Abstract

A homepage searching method recalculates similarity based on URL substring relationships. Among the web documents that belong to a homepage, the method identifies the entry point by comparing their URLs. The underlying principle is that if the URL of one web document is a substring of the URL of another, the former is more likely to be the entry point of a homepage than the latter. The invention thereby improves on conventional information searching methods by allowing the page that serves as a homepage's entry point to be retrieved ahead of other documents. As a result, a user can determine whether a retrieved web document is a homepage without visiting the URLs of all retrieved documents.
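The substring principle can be illustrated with a minimal Python sketch. The abstract does not give the recalculation formula, so everything specific here is an assumption made for illustration: the function names `normalize` and `rerank`, the additive `boost` parameter, and the choice to strip the scheme and trailing slash before comparing URLs. It shows one plausible way a result whose URL is contained in other results' URLs could be moved up, not the patented method itself.

```python
from typing import List, Tuple


def normalize(url: str) -> str:
    """Strip the scheme and trailing slashes so URLs compare consistently.

    This normalization is an assumption; the abstract does not specify one.
    """
    for scheme in ("http://", "https://"):
        if url.startswith(scheme):
            url = url[len(scheme):]
            break
    return url.rstrip("/")


def rerank(results: List[Tuple[str, float]], boost: float = 0.1) -> List[Tuple[str, float]]:
    """Re-score (url, similarity) pairs using URL substring relationships.

    Each result gains `boost` for every other result whose URL strictly
    contains it. The additive formula is hypothetical.
    """
    normalized = [normalize(url) for url, _ in results]
    rescored = []
    for i, (url, score) in enumerate(results):
        current = normalized[i]
        # Count the other results whose URL contains this one as a proper substring.
        containing = sum(
            1
            for j, other in enumerate(normalized)
            if j != i and current and current != other and current in other
        )
        rescored.append((url, score + boost * containing))
    # Highest recalculated similarity first.
    return sorted(rescored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    hits = [
        ("http://example.com/docs/intro.html", 0.90),
        ("http://example.com/", 0.80),
        ("http://example.com/docs/", 0.85),
    ]
    for url, score in rerank(hits):
        print(url, round(score, 2))
```

In this example the site root starts with the lowest similarity, but because its URL is a substring of both other URLs it receives the largest boost and is listed first, which matches the behavior the abstract describes for entry-point pages.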