Ontology-Based Focused Crawler

Gechao Lu; rnWanli Zuo; rnAiqi Zhang; rnYing Wang; rnWenyan Ji

首页> 外文期刊>Journal of information and computational science >Ontology-Based Focused Crawler

【24h】

Ontology-Based Focused Crawler

机译：基于本体的爬虫

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper studies how to make the Focused Crawler collect the topical pages effectively and accurately. We analyze the inadequacy of the traditional methods and present a model used for the extraction of the feature vector. We present another model base on ontology to calculate the similarity between pages in semantic. Then we build a Focused Crawler that synthesizes the two models mentioned above and the Best-First in [1] strategy. We use the URL distribution strategy in [2] in the Focused Crawler. From the experiment's results we found that the two models are effective and accurate in feature vector extraction and similarity calculation. Therefore, the ontology-based focused crawler we present here is feasible in Focused Crawling.

机译：本文研究了如何使聚焦爬行器有效，准确地收集主题页面。我们分析了传统方法的不足，并提出了用于特征向量提取的模型。我们提出了另一个基于本体的模型，以计算语义上页面之间的相似度。然后，我们建立了一个聚焦爬虫，它综合了上述两个模型和[1]策略中的最佳优先。我们在[Focused Crawler]的[2]中使用URL分发策略。从实验结果中，我们发现这两个模型在特征向量提取和相似度计算中是有效且准确的。因此，我们在此介绍的基于本体的聚焦爬虫在聚焦爬虫中是可行的。

著录项

来源
《Journal of information and computational science》 |2010年第2期|P.577-584|共8页
作者
Gechao Lu; rnWanli Zuo; rnAiqi Zhang; rnYing Wang; rnWenyan Ji;
展开▼
作者单位

College of Computer Science and Technology, Jilin University Changchun 130012, China;

rnCollege of Computer Science and Technology, Jilin University Changchun 130012, China;

rnCollege of Computer Science and Technology, Jilin University Changchun 130012, China;

rnCollege of Computer Science and Technology, Jilin University Changchun 130012, China;

rnCollege of Computer Science and Technology, Jilin University Changchun 130012, China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
focused crawling; ontology; feature vector; topical pages;

机译：集中爬行;本体特征向量专题页面;

相似文献

外文文献
中文文献
专利

1. PDD Crawler : A Focused Web Crawler Using Link and Content Analysis for Relevence Prediction [J] . Prashant Dahiwale, M M Raghuwanshi, Latesh Malik Computer Science & Information Technology . 2014,第11期

机译：PDD爬网程序：使用链接和内容分析进行相关性预测的集中式Web爬网程序
2. ANTON Framework Based on Semantic Focused Crawler to Support Web Crime Mining Using SVM [J] . Javad Hosseinkhani, Hamed Taherdoost, Solmaz Keikhaee Annals of data science . 2021,第2期

机译：基于语义聚焦履带的Anton框架支持使用SVM的Web犯罪挖掘
3. An effective approach to enhancing a focused crawler using Google [J] . Lee Jae-Gil, Bae Donghwan, Kim Sansung, Journal of supercomputing . 2020,第10期

机译：使用谷歌加强聚焦履带的有效方法
4. The research of ontology-based focused crawler [C] . Wu Cong-Cong, Zhao Jian-li, Ma Hui-lin 2012 7th International Conference on System of Systems Engineering. . 2012

机译：基于本体的集中爬虫的研究
5. The outward-focused church: Leadership training for established northwest ministry network churches to transition church culture from inward-focused to outward-focused [D] . Cole, Dave E. 2015

机译：向外侧重的教会：为建立西北部网络教会的领导培训，从内向专注于向外关注的教会文化
6. FOCUS on clinical research informatics: Using ontology-based annotation to profile disease research [O] . Yi Liu, Adrien Coulet, Paea LePendu, 2012

机译：关于临床研究信息学的FOCUS：使用基于本体的注释来描述疾病研究
7. An Ontology-Based Focused Crawler [O] . Lefteris Kozanidis 2015

机译：基于Ontology的聚焦爬虫

Ontology-Based Focused Crawler

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅