【24h】

A Framework for Web Host Quality Detection

机译:Web主机质量检测的框架

获取原文

摘要

With the rapid growth of World Wide Web, finding useful and desired information in a short amount of time becomes an important issue for Web users. Search engines and focused crawlers help people to navigate the internet. A user expresses her information need in the form of a query and there is huge number of Web pages returning to this query. However, the majority of users view only a single page (the top 10 Web pages as ranked by the search engine) returned by a search engine. Even if the returned Web pages do not provide the exact information they need, the users also do not refine their query based on the returning results of their initial query. Thus, not only finding relevant Web pages but also ranking them plays an important role for the search engines. For this reason, determining the quality of Web pages is one of the main priorities of search engines, since low quality Web pages cause search engines results to be extremely vague and flooded with irrelevant Web pages. In this paper, we propose a novel method for determining the quality of Web pages. The proposed method first identifies the genre of Web pages and then it determines the quality of Web pages based on their genre. Our experimental results show that our proposed method is very effective and efficient.
机译:随着万维网的快速增长,在短时间内找到有用和所需的信息成为网络用户的重要问题。搜索引擎和集中的爬虫帮助人们浏览互联网。用户以查询的形式表达她的信息,并且有大量的网页返回此查询。但是,大多数用户只通过搜索引擎返回的单个页面(由搜索引擎排名的前10个网页)。即使返回的网页没有提供所需的确切信息,用户也不会根据其初始查询的返回结果来完善其查询。因此,不仅找到相关的网页,还为搜索引擎扮演了重要作用。因此,确定网页的质量是搜索引擎的主要优先级之一,因为低质量的网页导致搜索引擎导致非常模糊并与无关的网页淹没。在本文中,我们提出了一种确定网页质量的新方法。所提出的方法首先识别网页的类型,然后基于其类型确定网页的质量。我们的实验结果表明,我们的提出方法非常有效和有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号