首页> 外文会议>International Conference on Service Systems and Service Management >A Framework of Petroleum Information Retrieval System Based on Web Scraping with Python
【24h】

A Framework of Petroleum Information Retrieval System Based on Web Scraping with Python

机译:基于Web Scraping的石油信息检索系统框架

获取原文

摘要

It is very necessary to build a customized retrieval system in the era of the big information explosion. This paper gives a framework of petroleum information retrieval system which will be used by petroleum exploration and development researchers. First, we use the open source framework SCRAPY to build a crawler system to crawl the information that business people pay attention to. Then k-means algorithm is used to cluster the crawled documents, therefore the key information is extracted and presented in the system. The actual effect in production and operation shows that this customized retrieval system is efficient and agile, it improves the efficiency, accuracy and automation level of the work.
机译:在大信息爆炸时代,建立定制的检索系统是非常必要的。本文给出了石油信息检索系统的框架,供石油勘探与开发研究人员使用。首先,我们使用开源框架SCRAPY来构建搜寻器系统,以搜寻商人关注的信息。然后使用k-means算法对爬虫文档进行聚类,从而提取关键信息并将其呈现在系统中。在生产和运营中的实际效果表明,这种定制的检索系统高效灵活,提高了工作效率,准确性和自动化程度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号