Topic-specific crawling on the Web with concept context graph based on FCA



Abstract

Topic-specific crawling does not crawl every page on the Web; it crawls only the pages related to a user's interests. Pages that are highly relevant to those interests should be crawled first. The major problem in focused crawling is how to assign proper credit to the unvisited pages that the crawler will visit. In this paper, we propose an effective approach that uses a concept context graph based on Formal Concept Analysis (FCA) to solve this problem. We build a concept lattice from the visited pages, and then use a term-combination method to construct our concept context graph based on the upper concept lattice. Our crawler can measure a page's expected relevancy to a given topic and determine the order in which pages should be visited. An experiment illustrates that the new method is an effective mechanism that produces considerable results.
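
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy page/term context, the Jaccard-based `score` function, and the names `CONTEXT`, `FRONTIER`, and `crawl_order` are all assumptions made for illustration, and the visited pages are assumed to have already been judged on-topic. The sketch enumerates the formal concepts of a small context by brute force, then orders unvisited links by how closely their terms match any concept intent.

```python
import heapq
from itertools import combinations

# Hypothetical toy context: visited pages (objects) mapped to the terms
# (attributes) they contain. Assumed to hold only on-topic pages.
CONTEXT = {
    "page_a": {"crawler", "search", "topic"},
    "page_b": {"crawler", "topic"},
    "page_c": {"search", "index"},
}

# Hypothetical frontier: unvisited URLs mapped to terms seen around their links.
FRONTIER = {
    "url_1": {"crawler", "topic"},
    "url_2": {"index"},
    "url_3": {"sports"},
}


def intent(pages, context):
    """Terms shared by every page in `pages` (all terms if `pages` is empty)."""
    shared = set().union(*context.values())
    for page in pages:
        shared &= context[page]
    return shared


def extent(terms, context):
    """Pages that contain every term in `terms`."""
    return {page for page, page_terms in context.items() if terms <= page_terms}


def formal_concepts(context):
    """Enumerate all (extent, intent) pairs by closing every subset of pages.

    Exponential in the number of pages, so suitable for toy inputs only.
    """
    concepts = set()
    pages = sorted(context)
    for size in range(len(pages) + 1):
        for subset in combinations(pages, size):
            terms = intent(subset, context)
            concepts.add((frozenset(extent(terms, context)), frozenset(terms)))
    return concepts


def score(link_terms, concepts):
    """Assumed relevancy measure: best Jaccard overlap with any concept intent."""
    best = 0.0
    for ext, itt in concepts:
        if not ext or not itt:
            continue  # skip the degenerate top and bottom concepts
        best = max(best, len(itt & link_terms) / len(itt | link_terms))
    return best


def crawl_order(frontier, concepts):
    """Return frontier URLs ordered from highest to lowest score."""
    heap = [(-score(terms, concepts), url) for url, terms in frontier.items()]
    heapq.heapify(heap)
    return [heapq.heappop(heap)[1] for _ in range(len(heap))]


if __name__ == "__main__":
    lattice = formal_concepts(CONTEXT)
    print(crawl_order(FRONTIER, lattice))
```

A priority queue keeps the most promising link at the front of the frontier, which matches the abstract's goal of deciding which pages to visit first. A real crawler would replace the brute-force enumeration with an incremental lattice algorithm and rescore the frontier as newly visited pages extend the context.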


Bibliographic information

Source
International Conference on Management and Service Science | 2009 | 4 pages
Authors
Qiangqiang PENG; Yajun DU; Yufeng HAI; Shaoming CHEN; Zhaoqiong GAO
Original format: PDF
Chinese Library Classification: C93-53
Keywords
Search Engine; Topic-specific spider; Attribute; Formal Concept Analysis


Similar documents

1. Topic-specific crawling on the Web with the measurements of the relevancy context graph [J]. Ching-Chi Hsu, Fan Wu. Information Systems. 2006, No. 4-5
2. Clustering-Based Topical Web Crawling for Topic-Specific Information Retrieval Guided by Incremental Classifier [J]. Tao Peng, Lu Liu. International Journal of Software Engineering and Knowledge Engineering. 2015, No. 1
3. Clustering-based topical Web crawling using CFu-tree guided by link-context [J]. Lu LIU, Tao PENG. Frontiers of Computer Science in China. 2014, No. 4
4. Topic-Specific Crawling on the Web with Concept Context Graph Based on FCA [C]. Peng Qiangqiang, Du Yajun, Hai Yufeng, International Conference on Management and Service Science; MASS 2009. 2009
5. Health websites in Aboriginal context: Principles of conception based on a user-centered approach. The case of the Sioux Lookout District [D]. Gratton, Marie-France. 2009
6. Web-based public health geographic information systems for resources-constrained environment using scalable vector graphics technology: a proof of concept applied to the expanded program on immunization data [O]. Raoul Kamadjeu, Herman Tolentino. 2006
7. Heterogeneous Graph-Based Intent Learning with Queries, Web Pages and Wikipedia Concepts [O]. Xiang Ren, Yujing Wang, Xiao Yu, 2014
