Quality Information Retrieval for the World Wide Web

机译：万维网的质量信息检索

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The World Wide Web is an unregulated communication medium which exhibits very limited means of quality control. Quality assurance has become a key issue for many information retrieval services on the Internet, e.g. web search engines. This paper introduces some quality evaluation and assessment methods to assess the quality of web pages. The proposed quality evaluation mechanisms are based on a set of quality criteria which were extracted from a targeted user survey. A weighted algorithmic interpretation of the most significant user quoted quality criteria is proposed. In addition, the paper utilizes machine learning methods to produce a prediction of quality for web pages before they are downloaded. The set of quality criteria allows us to implement a web search engine with quality ranking schemes, leading to web crawlers which can crawl directly quality web pages. The proposed approaches produce some very promising results on a sizeable web repository.

机译：万维网是一个不受管制的通信介质，其呈现非常有限的质量控制手段。质量保证已成为互联网上许多信息检索服务的关键问题，例如，网络搜索引擎。本文介绍了一些质量评估和评估方法，以评估网页的质量。所提出的质量评估机制基于一系列质量标准，该质量标准从目标用户调查中提取。提出了对最重要的用户引用质量标准的加权算法解释。此外，本文利用机器学习方法在下载之前对网页的质量预测。该集合标准允许我们使用质量排名方案实现Web搜索引擎，导致Web爬虫，可以抓取直接质量的网页。拟议的方法在相当大的Web存储库上产生一些非常有前途的结果。

著录项

来源
《IEEE/WIC/ACM Joint International Conference on Web Intelligence and Intelligent Agent Technology》|2008年||共7页
会议地点
作者

展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
Crawling; Quality assessment; Quality information; Web page retrieval;

机译：爬行;质量评估;质量信息;网页检索;

相似文献

外文文献
中文文献
专利

1. A conceptual model for user-centered quality information retrieval on the World Wide Web [J] . Surya B. Yadav Journal of Intelligent Information Systems . 2010,第1期

机译：在万维网上以用户为中心的质量信息检索的概念模型
2. The Continuous Media Web: a distributed multimedia information retrieval architecture extending the World Wide Web [J] . Silvia Pfeiffer, Conrad Parker, Andre Pang Multimedia Systems . 2005,第6期

机译：连续媒体网络：扩展了万维网的分布式多媒体信息检索体系结构
3. An Information Retrieval Model from World Wide Web based on Formal Concept Analysis [J] . Minyar Sassi Hidri, Amel Grissa Touzi International Arab Journal of e-Technology . 2012,第4期

机译：基于形式概念分析的万维网信息检索模型
4. An Improved Method for Automatically Determining Webpage Cohesiveness for Quality Information Retrieval from World Wide Web [C] . Surya Yadav, Jeremy Bellah International Conference on Information Quality(ICIQ-2006); 20061110-12; Cambridge,MA(US) . 2006

机译：一种自动确定从万维网检索质量信息的网页内聚力的改进方法
5. Incorporating quality metrics in agent-based centralized/distributed information retrieval on the World Wide Web. [D] . Zhu, Xiaolan. 1999

机译：在万维网上将质量指标纳入基于代理的集中式/分布式信息检索中。
6. Filtering Web pages for quality indicators: an empirical approach to finding high quality consumer health information on the World Wide Web. [O] . S. L. Price, W. R. Hersh 1999

机译：筛选网页以获取质量指标：这是一种在万维网上查找高质量的消费者健康信息的经验方法。
7. Quality information retrieval for the World Wide Web [O] . Kc, Milly W, Hagenbuchner, Markus, Tsoi, Ah Chung 2008

机译：万维网的质量信息检索
8. Environmental Quality: The World Wide Web. The 1997 Report of the Council on Environmental Quality [R] . 1997

机译：环境质量：万维网。1997年环境质量委员会报告

Quality Information Retrieval for the World Wide Web

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅