A Framework of Petroleum Information Retrieval System Based on Web Scraping with Python

机译：基于Web Scraping的石油信息检索系统框架

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

It is very necessary to build a customized retrieval system in the era of the big information explosion. This paper gives a framework of petroleum information retrieval system which will be used by petroleum exploration and development researchers. First, we use the open source framework SCRAPY to build a crawler system to crawl the information that business people pay attention to. Then k-means algorithm is used to cluster the crawled documents, therefore the key information is extracted and presented in the system. The actual effect in production and operation shows that this customized retrieval system is efficient and agile, it improves the efficiency, accuracy and automation level of the work.

机译：在大信息爆炸时代，建立定制的检索系统是非常必要的。本文给出了石油信息检索系统的框架，供石油勘探与开发研究人员使用。首先，我们使用开源框架SCRAPY来构建搜寻器系统，以搜寻商人关注的信息。然后使用k-means算法对爬虫文档进行聚类，从而提取关键信息并将其呈现在系统中。在生产和运营中的实际效果表明，这种定制的检索系统高效灵活，提高了工作效率，准确性和自动化程度。

著录项

来源
《International Conference on Service Systems and Service Management》|2018年|1-6|共6页
会议地点
作者
Yili Ren; Yiting Ren;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Crawlers; Clustering algorithms; Petroleum; Search engines; Partitioning algorithms; Business;

机译：爬网程序;聚类算法;石油;搜索引擎;分区算法;业务;

相似文献

外文文献
中文文献
专利

1. A generic framework for ontology-based information retrieval and image retrieval in web data [J] . V. Vijayarajan, M. Dinakaran, Priyam Tejaswin, Human-centric Computing and Information Sciences . 2016,第1期

机译：Web数据中基于本体的信息检索和图像检索的通用框架
2. SEQing: web-based visualization of iCLIP and RNA-seq data in an interactive python framework [J] . Martin Lewinski, Yannik Bramkamp, Tino K?ster, BMC Bioinformatics . 2020,第1期

机译：SEQING：在交互式Python框架中基于Web的ICLIP和RNA-SEQ数据的可视化
3. pycity_scheduling—A Python framework for the development and assessment of optimisation-based power scheduling algorithms for multi-energy systems in city districts [J] . Sebastian Schwarz, Sebastian Alexander Uerlich, Antonello Monti SoftwareX . 2021,第a期

机译：pycity_scheduling-a python框架，用于开发和评估城市区多能量系统的优化功率调度算法
4. A Framework of Petroleum Information Retrieval System Based on Web Scraping with Python [C] . Yili Ren, Yiting Ren International Conference on Service Systems and Service Management . 2018

机译：基于Web刮擦Python的石油信息检索系统框架
5. The Web interfacing repository manager: A framework for developing Web-based experiment management systems. [D] . Jakobovits, Rex Matthew. 1999

机译：Web接口存储库管理器：用于开发基于Web的实验管理系统的框架。
6. SEQing: web-based visualization of iCLIP and RNA-seq data in an interactive python framework [O] . Martin Lewinski, Yannik Bramkamp, Tino Köster, 2020

机译：SEQing：在交互式python框架中基于Web的iCLIP和RNA-seq数据可视化
7. A generic framework for ontology-based information retrieval and image retrieval in web data [O] . V. Vijayarajan, M. Dinakaran, Priyam Tejaswin, 2016

机译：Web数据中基于本体的信息检索和图像检索的通用框架

A Framework of Petroleum Information Retrieval System Based on Web Scraping with Python

摘要

著录项

相似文献

相关主题

期刊订阅