首页> 外文会议>International Conference on Management Science and Intelligent Control >A Focused Crawler Framework for E-commerce Deep Web Based on Concept Analysis
【24h】

A Focused Crawler Framework for E-commerce Deep Web Based on Concept Analysis

机译:基于概念分析的电子商务深媒体重点履历框架

获取原文

摘要

With the development of electronic commerce, product information on the Internet is growing rapidly. This paper presents a framework for the e-commerce Deep Web focused crawler. In order to overcome the deficiency of topic filtering strategy based on keywords, an algorithm for computing the degree of correiativity based on concept analysis is proposed. A noise filtering method based on statistics is proposed to overcome the noises in result set pages and greatly reduces the topic drift. Experimental results show that the framework are effective, and the accuracy and efficiency of crawling are both improved.
机译:随着电子商务的发展,互联网的产品信息正在迅速增长。本文介绍了电子商务深网络聚焦履带的框架。为了克服基于关键字的主题过滤策略的缺陷,提出了一种计算基于概念分析的校正度的算法。提出了一种基于统计信息的噪声过滤方法来克服结果集页面中的噪声,大大减少了主题漂移。实验结果表明,该框架是有效的,爬行的准确性和效率均得到改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号