A New Algorithm of Topical Crawler


Abstract

Generic crawlers help people find information on the WWW. However, because they are general-purpose rather than specialized, they have drawbacks in precision and efficiency. In this paper, we address two issues for topical web crawlers: how to define the topic, and how to efficiently order the links in the download queue. The crawler aims to visit only relevant pages and to collect a large number of hyperlinks that point to relevant pages. The crawl method proposed here is novel and is based on the semi-structured features of websites and on content information. Experimental results show that it is a very effective method for focused crawling.
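The two issues named in the abstract, defining the topic and ordering the download queue, can be illustrated with a generic best-first crawl frontier. This is only a minimal sketch and not the paper's algorithm, whose details the abstract does not give. The topic is assumed to be a plain set of keywords, link scores are assumed to be an equal mix of parent-page and anchor-text relevance, and `relevance`, `Frontier`, `crawl`, and the `fetch` callback are hypothetical names introduced for illustration. The semi-structured website features mentioned in the abstract are not modeled.

```python
import heapq
import re


def relevance(text, topic_terms):
    """Fraction of tokens in `text` that are topic terms (0.0 for empty text)."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    if not tokens:
        return 0.0
    hits = sum(1 for t in tokens if t in topic_terms)
    return hits / len(tokens)


class Frontier:
    """Priority queue of URLs; the highest-scoring URL is popped first."""

    def __init__(self):
        self._heap = []
        self._seen = set()
        self._counter = 0  # tie-breaker: equal scores pop in insertion order

    def push(self, url, score):
        """Queue `url` once; return False if it was already queued."""
        if url in self._seen:
            return False
        self._seen.add(url)
        # heapq is a min-heap, so store the negated score.
        heapq.heappush(self._heap, (-score, self._counter, url))
        self._counter += 1
        return True

    def pop(self):
        neg_score, _, url = heapq.heappop(self._heap)
        return url, -neg_score

    def __len__(self):
        return len(self._heap)


def crawl(seeds, fetch, topic_terms, max_pages=10, threshold=0.1):
    """Best-first crawl.

    `fetch(url)` is an assumed callback returning
    (page_text, [(link_url, anchor_text), ...]).
    Returns the visited URLs whose relevance reached `threshold`.
    """
    frontier = Frontier()
    for seed in seeds:
        frontier.push(seed, 1.0)
    relevant = []
    visited = 0
    while len(frontier) and visited < max_pages:
        url, _ = frontier.pop()
        text, links = fetch(url)
        visited += 1
        page_score = relevance(text, topic_terms)
        if page_score >= threshold:
            relevant.append(url)
        for link, anchor in links:
            # Assumed scoring: equal mix of parent-page and anchor-text relevance.
            score = 0.5 * page_score + 0.5 * relevance(anchor, topic_terms)
            frontier.push(link, score)
    return relevant
```

Passing `fetch` as a callback keeps the sketch free of network code, so the queue ordering can be tried on an in-memory dictionary of pages before a real HTTP fetcher is plugged in.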

Bibliographic details

Source: International Workshop on Computer Science and Engineering, 2009, 4 pages
Authors: Li Wei-jiang; Ru Hua-suo; Zhao Tie-jun; Zang Wen-mao
Original format: PDF
Chinese Library Classification: TP3-53
Keywords: Topical Crawler; Generic Crawler; Algorithm
