首页> 外文会议>SPE Hydraulic Fracturing Technology Conference >Evaluating the Relationship Between Well Parameters and Production Using Multivariate Statistical Models: A Middle Bakken and Three Forks Case History
【24h】

Evaluating the Relationship Between Well Parameters and Production Using Multivariate Statistical Models: A Middle Bakken and Three Forks Case History

机译:使用多元统计模型评估井参数与生产之间的关系:中间Bakken和三个叉案历史

获取原文
获取外文期刊封面目录资料

摘要

Building a predictive statistical model for evaluating the impact of various fracture treatment and well completion designs on production has been of great interest in the oil and gas industry. The objectives of this study were to evaluate the benefits of advanced statistical and machine-learning techniques for predicting production from oil wells, highlight the strengths and weaknesses of these techniques, and gain insight into the relationship between well parameters and production. The predictive models are described through mathematical functions or algorithms that rely on well data (training set). The ongoing dilemma is that these models often result in poor predictions, even if they result in a high R-squared (0.7 or higher). The new perspective that this study brings is the importance of cross-validation with "hold-out" datasets in the workflow to develop reliable statistical models. A database of available completion and production data has been assembled from the North Dakota Industrial Commission (NDIC) and Frac Focus websites and from internal completion documentation. To date, there are at least 6,800 horizontal wells completed in the Middle Bakken formation and 3,600 completed in the Three Forks formation on the North Dakota side of the Williston Basin. Various models such as multiple regression, random forests, and gradient boosting machine were built to predict the cumulative oil production of the Middle Bakken and Three Forks horizontal wells. Model predictive abilities were assessed by cross-validating the root mean squared errors (in cross-validation, a hold-out set was used to assess the modelis predictive ability). The results showed the following conclusions about statistical evaluation techniques: 1) regression models that account for overfitting provided the best predictive ability, 2) gradient boosting model with the highest R-squared value had the worst predictive ability for the specific datasets in this paper— which shows why it is critical to not rely solely on R-squared value to assess a modelis predictive ability, but to also perform cross-validation, and 3) random forests and gradient boosting machine can be used for determining variable importance. Moreover, we observed that there is statistical evidence to support the presence of important interactions among variables in predicting cumulative oil production. For the Middle Bakken and Three Forks wells included in this study, the results showed that water cut, which can be used as a proxy for reservoir quality, is the most important predictor for cumulative oil production. However, the most important completion-related variables for predicting oil production were total frac fluid and proppant pumped. The analysis and results presented in this paper will enable companies to apply the approach to their own data when building production prediction models and analyzing the complex relationships of variables that control well performance.
机译:建立一种预测统计模型,用于评估各种骨折处理的影响,井完成的生产设计对石油和天然气工业有益。本研究的目标是评估先进的统计和机器学习技术,用于预测油井生产,突出这些技术的优势和弱点,并深入了解井参数和生产之间的关系。通过依赖于井数据(训练集)的数学函数或算法描述了预测模型。持续的困境是这些模型通常导致预测差,即使它们导致高r型(0.7或更高)。本研究提出的新视角是交叉验证与工作​​流程中的“扑出”数据集的重要性,以开发可靠的统计模型。已从北达科他州工业委员会(NDIC)和FRAC焦点网站和内部完成文件组装的可用完成和生产数据数据库。迄今为止,至少有6,800个水平井在中间Bakken地层完成,3,600件在威利斯顿盆地的北达科他州侧面形成的三叉形成。建立了多种回归,随机森林和梯度升压机等各种型号,以预测中间Bakken和三叉水平井的累积油生产。通过交叉验证根均方误差(在交叉验证中,使用扑出装置来评估模型预测能力,用于评估模型预测能力)。结果表明,关于统计评估技术的结论:1)占过度装备的回归模型提供了最佳的预测能力,2)梯度升压模型具有最高的R线值,具有本文的特定数据集的最糟糕的预测能力 - 这表明为什么它不仅仅是仅仅依赖于R线值来评估模型预测能力,而且还可以执行交叉验证,以及3)随机林和梯度升压机可用于确定可变重要性。此外,我们观察到存在统计证据来支持在预测累积油生产中的变量之间存在重要的相互作用。对于本研究中的中间Bakken和三个叉子井来说,结果表明,水切割,可作为储层质量的代理,是累积石油生产最重要的预测因素。然而,用于预测石油产量的最重要的完整性变量是全部FRAC液体和支撑剂泵送。本文提出的分析和结果将使公司能够在建设生产预测模型时将方法应用于自己的数据,并分析控制良好性能的变量的复杂关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号