The Web is a commonly used medium for searching information, and search engines rely on Web crawlers to gather it. A crawler fetches pages related to a given keyword, but some of these pages contain duplicate content; different URLs with similar text are known as DUST. To improve search-engine performance, the DUSTER method is used: it detects and removes duplicate URLs without fetching their contents. A single crawler crawls one URL at a time, whereas parallel crawlers crawl multiple URLs simultaneously; the combined results of the parallel crawlers are given as input to DUSTER. Multiple sequence alignment is used to generate candidate rules and validation rules. The candidate rules are then filtered according to their performance on a validation set, and the remaining rules are applied to remove duplicate URLs. With this method, a large reduction in the number of duplicate URLs is achieved.
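The pipeline described above (generate candidate URL rewrite rules, keep only the rules that perform well on a validation set of known-duplicate URL pairs, then normalize URLs and discard duplicates without fetching content) can be sketched roughly as follows. This is only an illustrative sketch: the simple string-replacement rule format, the sample URLs, and the acceptance threshold are assumptions for demonstration, not the actual DUSTER rule representation.

```python
def apply_rule(url, rule):
    """Apply a (pattern, replacement) rewrite rule to a URL.
    Hypothetical rule format: plain substring replacement."""
    pattern, replacement = rule
    return url.replace(pattern, replacement)

def validate_rule(rule, validation_pairs, threshold=0.5):
    """Keep a candidate rule only if it maps a sufficient fraction of
    known-duplicate URL pairs in the validation set to the same form."""
    if not validation_pairs:
        return False
    hits = sum(1 for a, b in validation_pairs
               if apply_rule(a, rule) == apply_rule(b, rule))
    return hits / len(validation_pairs) >= threshold

def deduplicate(urls, rules):
    """Normalize every URL with the validated rules and keep one URL
    per canonical form -- no page content is ever fetched."""
    seen, kept = set(), []
    for url in urls:
        canonical = url
        for rule in rules:
            canonical = apply_rule(canonical, rule)
        if canonical not in seen:
            seen.add(canonical)
            kept.append(url)
    return kept

# Hypothetical candidate rules, e.g. "index.html is optional" and
# "the session-id parameter is irrelevant".
candidate_rules = [("/index.html", "/"), ("?sessionid=123", "")]

# Validation set: pairs of URLs known to point to the same page.
validation = [
    ("http://a.com/index.html", "http://a.com/"),
    ("http://a.com/x?sessionid=123", "http://a.com/x"),
]

validated = [r for r in candidate_rules if validate_rule(r, validation)]
urls = ["http://a.com/", "http://a.com/index.html", "http://a.com/x"]
print(deduplicate(urls, validated))
# → ['http://a.com/', 'http://a.com/x']
```

In this sketch the crawler outputs (here, the `urls` list) would come from merging the results of the parallel crawlers before being passed to the deduplication step.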