首页> 外文期刊>International Journal of Innovative Computing Information and Control >EVALUATION OF TREE-BASED ENSEMBLE LEARNING ALGORITHMS TO ESTIMATE TOTAL ORGANIC CARBON FROM WIRELINE LOGS
【24h】

EVALUATION OF TREE-BASED ENSEMBLE LEARNING ALGORITHMS TO ESTIMATE TOTAL ORGANIC CARBON FROM WIRELINE LOGS

机译:基于树的集合学习算法评估从有线原木估算总有机碳的总有机碳

获取原文
获取原文并翻译 | 示例
       

摘要

To evaluate the hydrocarbon generation potential, Total Organic Carbon (TOC) of source/reservoir rocks is of vital importance. TOC estimation from well logs is challenging and in laboratory from rock specimens is costly as well as time-consuming. TOC prediction from Passey method is low whereas AI techniques such as Artificial Neural Network (ANN), Support Vector Machine (SVM) get trapped in local optima, resulting in overfitting and are also considered ambiguous if the technique is not reasonable. In this paper, we proposed four efficient tree-based ensemble techniques that include Random Forest (RF), Extra Trees (ET), Gradient Boosting (GB), and eXtremely Gradient Boosting (XGB), capable of fitting highly non-linear data with minimum data pre-processing for TOC prediction. To evaluate the efficiency of these models, 205 data points and seven well logs from the Goldwyer Formation of the Canning Basin, Australia, were used for the training and testing purpose. Results validated that the accuracy of these tree-based ensemble techniques is at exemplary level for the TOC estimation, where the XGB model (for testing R~2 94.39%, MAE 0.0447, MSE 0.0039) outperformed the other techniques, i.e., RF (for testing R~2 90.59%, MAE 0.0549, MSE 0.0055), ET (for testing R~2 90.63%, MAE 0.0583, MSE 0.0058) and GB (for testing R~2 91.23%, MAE 0.0569, MSE 0.0053). These robust tree-based ensemble techniques have not only protected overfitting but also achieved better prediction results in dealing with the multidimensional data.
机译:为了评价烃潜力,总有机碳源/储集岩的(TOC)是至关重要的。从测井TOC估计是富有挑战性和从岩石样本的实验室是昂贵和费时。从Passey方法TOC预测为低,而AI技术,如人工神经网络(ANN),支持向量机(SVM)获得陷于局部极值,导致过度拟合也被视为不明确的,如果该技术是不合理的。在本文中,我们提出了四种高效的基于树的集合技术能够嵌合用高度非线性的数据的,其包括随机森林(RF),附加树(ET),梯度增压(GB),和极梯度增压(XGB),最小数据前处理TOC预测。为了评估这些模型,205个数据点和七个测井曲线坎宁盆地,澳大利亚Goldwyer形成的效率,被用于训练和测试的目的。结果验证了这些基于树的集合技术的准确度是在对TOC估计,其中XGB模型的示例性电平(用于测试R〜2 94.39%,MAE 0.0447,MSE 0.0039)优于其他技术,即,RF(用于测试R〜2 90.59%,MAE 0.0549,0.0055 MSE),ET(用于测试R〜2 90.63%,MAE 0.0583,0.0058 MSE)和GB(用于测试R〜2 91.23%,MAE 0.0569,MSE 0.0053)。这些功能强大的基于树的集合技术不仅保护过度拟合,但在处理多维数据也取得了较好的预测结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号