首页> 外文会议>Offshore Mediterranean Conference and Exhibition >NOVEL MACHINE LEARNING-BASED SUPPORT TOOL FOR SOURCE ROCK IDENTIFICATION FROM BIOMARKERS DATA: THEORY AND CASE STUDIES
【24h】

NOVEL MACHINE LEARNING-BASED SUPPORT TOOL FOR SOURCE ROCK IDENTIFICATION FROM BIOMARKERS DATA: THEORY AND CASE STUDIES

机译:基于新型机器学习的支持工具,用于生物标志物数据的源岩识别:理论和案例研究

获取原文
获取外文期刊封面目录资料

摘要

Petroleum biomarkers are complex carbon-based molecules derived from formerly living organisms and found in crude oils. These molecules are used by geochemists to get information on the source rocks responsible for the oils generation, such as lithology, depositional environment, organic matter, maturity and age. So they are of paramount importance for Petroleum System Modelling and more generally for exploration de-risking and sedimentary basin characterization purposes. Very often, biomarkers datasets are very large and interpretation process by geochemists can take several months to complete. For this reason, we developed an innovative Machine Learning-based support tool to facilitate and speed-up the whole process of biomarkers examination and interpretation. The core of tool is an advanced clustering method that allows expressing biomarkers data as a combination (mixing) of underlying components, directly ascribable to different source rocks. Non-negative constraint is a key aspect: the objective is to express each data sample, i.e. a vector with mainly non-negative values such as biomarkers concentrations and/or concentration ratios, as an additive combination of some of the underlying components, whereas subtracting components would not have any physical interpretation. A sparsity constraint is added to find solutions that allow to represent data as an additive combination of few source rock components. Both constraints greatly reduce non-uniqueness of the solution, greatly enhancing interpretability of the results. The tool then groups data in clusters, each one having a specific geochemical signature given by a set of scores for each of the different biomarkers’ parameters. Each sample is assigned to a specific cluster with a 'purity' percentage indicator. Geochemists can then easily use the high-purity samples to label the relevant samples as belonging to different source rocks. Moreover, the tool is able to distinguish the amount of mixing between different source rocks, through accurate deconvolution algorithms. Two applications of the tool are here presented, borrowed by real exploration case studies. In both cases the tool was able to separate samples into clusters that geochemists successfully recognized as lacustrine, marine and in some cases, transitional, with less than 10% of misclassifications, isolating also strongly biodegraded samples. This tool opens the doors also to the insertion and integration of other types of data (light hydrocarbons, diamondoids, etc.) for the whole ‘Big Data’ geochemical characterization of a sedimentary basin.
机译:石油生物标志物是衍生自以前生物的基于碳的分子,并在原油中发现。这些分子由地球化商使用,以获取有关负责油生成的源岩的信息,例如岩性,沉积环境,有机物,成熟度和年龄。因此,它们对石油系统建模至关重要,更普遍地探索失败风险和沉积盆地表征目的。通常,生物标志物数据集非常大,地球化学家的解释过程可能需要几个月的时间才能完成。因此,我们开发了一种创新的基于机器学习的支持工具,可以促进和加速生物标志物检查和解释的整个过程。工具的核心是一种高级聚类方法,允许将生物标记数据作为底层组件的组合(混合)表示,直接归属于不同的源岩石。非负约束是一个关键方面:目的是表达每个数据样本,即具有主要非负值的载体,例如生物标志物浓度和/或浓度比,作为一些下面部件的添加剂组合,而减去组件不会有任何物理解释。添加稀疏性约束以找到允许将数据表示为几个源岩联组件的添加剂组合的解决方案。两个约束都大大减少了解决方案的非唯一性,大大提高了结果的可解释性。然后,该工具将数据中的数据组分组,每个数据具有由每个不同的生物标记参数的一组分数给出的特定地球化学签名。每个样本都分配给具有“纯度”百分比指示符的特定群集。然后可以轻松地使用高纯度样本来将相关样本标记为属于不同的源岩石。此外,该工具能够通过精确的解卷积算法区分不同源岩之间的混合量。这里展示了该工具的两种应用,由实际勘探案例研究借用。在这两种情况下,该工具能够将样品分开到集群中,地球化学家成功被认为是湖泊,海洋和在某些情况下,过渡性,低于10%的错误分类,分离也是强烈的生物降解的样品。该工具还打开了用于插入和整合其他类型的数据(轻质碳氢化合物,菱形等),以实现沉积盆地的整体“大数据”地球化学特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号