数据分析优化复杂井况井位设计-石油圈

通过运用数据分析法，可以帮助工程师在不确定性条件下优化井位设计。

来自丨JPT
编译丨TOM

关于井位优化设计已经有大量研究，也存在着不同的方法。然而，现场经验表明这些技术很少得到应用，在实践中，专家通常首先会确定影响油未来生产的主要因素。在设计最佳的井建参数时会考虑这些因素，然后用全井场油藏模拟模型评估。通常在第二阶段处理不确定性因素，通过运行几个替代模型来测试所提出的开发方案的稳定性，少数情况下，会在完全基于经验生成的蒙特卡罗模型上测试。

理论中提到的不同优化方法并未在得到应用的原因很多，例如在优化过程中难以处理所有约束条件，模拟运行次数过多，在考虑复杂油水井设计时的效果较差以及方法及其构型的复杂性等。这里提到的方法不仅仅是一种合理的井位优化方法；它是一整套数据分析工具，旨在帮助油藏工程师探索众多可能的解决方案，并希望找到最佳的解决方案。本文的目的是提供一种适合每个特定油田的方法指南。我们的理念首先遵循油藏工程师使用的典型工作流程，同时添加数据分析技术以改进决策过程。

这项工作主要针对未开发区域，提出了一种解决油田开发规划问题的方法，尽管类似的工作流程也可以应用于老油田区块。油田开发规划的第一步将地质、地震和流体数据纳入油藏模拟模型进行研究。

假定可以预测油田开发的简要方案，更确切地说，油藏工程师将决定是使用水平井还是垂直井以及井的数量，并确定每口井的勘察区域及其运行约束条件。勘察区域最终可能是调查整个油田，但是特别是在海上区域，可以根据平台位置确定限制区域。一旦确定了给定井的勘察区域，第二步就是生成可能的井眼轨迹。

生成井眼轨迹

搜索算法是基于系统的暴力算法，所有要测试的位置都需要预先定义。实际上，这是为搜索模式提供规则，并且算法将基于这些规则将完井数据置于储层模型内。搜索模式本身基于圆形概念，井口位于中心。设定规则时要权衡各个因素，包括计算性能、调查分辨率、期望井设计和钻井约束。必须定义井口位置、完井前后的内部和外部勘察半径、勘测方向的方位角、倾角范围、最大完井长度和目标靶点区域。靶点区域被定义为一个个单元格并赋予储层属性，并显示出在哪些单元格可以完井。

井眼特征筛选和计算

下一步涉及用于表征每个井眼轨迹的特征的定义。需要区分两组特征：沿井眼轨迹交叉的单元计算的特征和在与井眼轨迹交叉的连通体上计算的特征。油藏中的连通体被定义为一组彼此相连的单元，其中至少一个单元具有高于限定的导水系数。

连通体的概念是这种方法的核心，因为它提供了每口井的流域信息，而无需通过模拟进行评估；但是校准导水系数临界值应根据使用全场模拟观测到的流域进行分析。

沿井特征通过对井穿过的单元格属性进行求和或平均来计算，而对于连通体特征，通过对井眼交叉的所有连通体的属性求和或平均值。然后针对每个井眼轨迹不同的特征通过不同的模型进行计算。需要注意的是，对于具有数百万个单元的超大型模型，这个过程可能非常耗时。因此在这个阶段可能需要选择简化的地质软件实现。对于每种井眼轨迹组合，实现的所有特征都需要计算平均值。在此阶段，可以决定是否停止考虑某些特征平均值。例如，可以决定不使井彼此太近或不使井太接近含水层。

井身结构参数的聚类算法

一旦确定了临界值，下一步就是由油藏数值模拟选择井身结构参数。这些参数应尽可能不同，为实现此目的，将聚类算法应用于由每口井及井身结构参数的特征值组成的特征矩阵，K-medoid算法用于选择在特征矩阵空间中距离最远的N个结构参数，并且允许机器学习算法学习差异较大的井身结构参数，该算法是被认为是预测性最好的。然后根据选择的所有油藏认识对所选参数进行模拟。

机器学习模型的构建，现在都可以通过培训机器学习模型以评估新井的井身结构参数。模型的输入参数是针对每口井计算的不同特征参数，输出参数是油藏数值模拟的输出参数，一般是累积石油产量、采收率或某个未来时间点的净现值。需要注意的是，如果目标是预测油田的累计产量，则必须使用针对每口井评估的特征参数进行预测。通常每口井将考虑5~10个特征参数；因此如果油田井数超过5口，那么特征参数的数量就会变得非常多。而且，累计计算后将会非常复杂且难以学习。为了简化学习任务，每口井的累积产量将单独学习。通过这种方式，问题得到简化并且向系统提供了更多信息，因为了解了每口井的产量而不仅仅是油田的整体产量。

通过测试不同的机器学习算法以找到最佳模型。为了确定每种算法的准确性，数据分为训练集和分别由总数据集的80％和20％组成的测试集。模型验证后，可用于评估数百万个新井的参数。根据模拟井参数的数量，可以选择评估所有参数，也可以使用优化方法来找到最佳参数。机器学习算法只是模拟结果的近似值；因此它将用于识别出一定数量的最佳解决方案（通常是十几个），然后用油藏数值模拟软件进行评估。

通过提出一种新颖的数据分析工作流程，以帮助油藏工程师找到最佳井位。该方法基于地质特征的启发式定义（基于专家），可用于训练机器学习回归算法，通过使用合理数量的油藏数值模拟训练（几千）来预测新井眼轨迹的最终累积产量。同时考虑了不同可能的地质认识产生的地质不确定性。这种方法的可行性已在具有三口水平井的人造模拟油田中得到证实并进行了优化，正在申请在实际油田中应用。

For English, Please click here (展开/收缩)

Extensive literature has been published about well-placement optimization; authors have proposed many different approaches. Real field experience, however, has shown that these techniques are rarely used. In practice, expert engineers usually identify the main factors affecting a future well’s production. They consider these factors in obtaining the best well configurations. These are then evaluated with a full-field reservoir-simulation model. Uncertainty is handled in the process typically in a second phase to test the robustness of the proposed development plan by running a few alternative models corresponding to some low and high cases and, more rarely, on a full Monte Carlo set of models generated by the full prior-model uncertainty.

The reasons the different optimization methods proposed in the literature are not used in operations can vary: difficulty in handling all the operational constraints during optimization, a prohibitive number of simulation runs, relatively poor performance of the optimization methods when considering many complex production and injection wells, and the complexity of the method and its configuration. The approach proposed here is not a proper well-placement optimization method; it is more a set of data-analytics tools aimed at helping reservoir engineers to explore the extremely large domain of possible solutions and, it is hoped, to find better solutions. This paper’s objective is to provide a methodology and some guidelines that can be adapted to each specific field case. The idea is to follow the typical work flow used by reservoir engineers while adding data-analytics techniques to improve the decision-making process.

This work presents a methodology to address the problem of field development planning mainly for greenfields, although a similar work flow can be applied to mature fields. The first step in field development planning is to perform a study to incorporate geological, seismic, and fluid data into a reservoir-simulation model.

A schematic idea of the ideal field development also is presumed to be available. More precisely, the engineer will decide whether to use horizontal or vertical wells and the total number of wells and define an area of investigation for each well and its operational constraints. The area of investigation could be the entire field eventually, but, particularly in offshore fields, a restricted zone can be determined on the basis of platform locations. Once the investigation area is determined for a given well, the second step is to generate possible trajectories.

Well-Trajectories Generation. The search algorithm is based on a systematic brute-force approach. All locations to be tested need to be predefined. In practical terms, this is accomplished by providing rules for a search pattern, and the algorithm will place completions inside the reservoir model on the basis of these rules. The search pattern itself is based on a circular concept with the wellhead at the center. The rules are set while keeping in mind a tradeoff between computational performance, resolution of investigation, desired well design, and drilling constraints. One must define wellhead location, inner and outer investigation radii for start and end of the completion, azimuths of investigation directions, range of dips, maximum completion length, and a target-zone definition. The target zone is defined as a reservoir property cell by cell and indicates in which cells a completion can and cannot be placed. Some examples of possible well schemes for horizontal wells are shown in Fig. 1.

Well-Features Extraction and Computation. The next step concerns the definition of the features used to characterize each well trajectory. Two set of features are distinguished: features computed along the cells crossed by a well trajectory and features computed on connected bodies crossed by trajectories. A connected body in a reservoir is defined as a set of cells connected to each other by at least one cell with a transmissibility higher than a given threshold.

The connected-body notion is central to this approach because it provides information about drainage areas for each well without having to evaluate it with a simulation; however, the calibration of the transmissibility threshold used to compute connected bodies should be conducted according to the drained areas observed by use of a full-field simulation.

The along-well features are computed by summing or averaging the cell properties crossed by the well, whereas, for connected-bodies features, the properties of all connected bodies crossed by the well are summed or averaged. The corresponding features for each well trajectory are then computed for each different model realization. Note that, for very large models with millions of cells, this could be time-consuming; therefore, a selection of a reduced number of geological realizations may be necessary at this stage. For each combination of trajectories, the average of each feature for all the realizations is then computed. At this stage, the decision can be made whether to apply a cutoff on the values of some feature average. For example, the decision can be made not to have wells too close to each other or not to have wells too close to the aquifer.

Clustering of Well Configurations. Once the threshold has been applied, the next step is to select configurations to be evaluated by the reservoir simulator. These configurations should be as different as possible. To accomplish this, a clustering algorithm is applied on the features matrix composed of the features values for each well and configuration. The K-medoid algorithm (a variation of the well-known K-means algorithm) is used for selecting N configurations that are the most distant in the features space and that should allow the machine-learning algorithm to learn the most different well configurations and, it is hoped, be more predictive. The selected configurations then are simulated for all the chosen realizations.

Machine-Learning-Model Construction. Now, all of the ingredients are available to start training a machine-learning model to evaluate new well configurations. The inputs for the model are the different features computed for each well, and the output is the reservoir-simulation output that typically will be cumulative oil production, recovery factor, or the net present value at a certain future time. Note, however, that, if a goal is to predict cumulative production for the field, the features evaluated for each well will have to be used. Usually, between five and 10 features will be considered for each well; therefore, if the field has more than five wells, the number of features can become quite high. Moreover, the cumulative function will be complex and difficult to learn. In order to simplify the learning task, each cumulative production for each well is going to be learned independently. In this way, the problem is simplified and more information is provided to the system because it is informed about the production of each separate well and not only the total field production.

Different machine-learning algorithms can then be tested to find the best model (this study tested neural networks, random forests, and gradient boosting). To qualify the accuracy of each method, the data is divided into a training set and a test set consisting of 80 and 20% of the total dataset, respectively. After the model has been validated, it can be used to evaluate millions of new well configurations. Depending on the number of well configurations, either they all can be evaluated or an optimization method can be used to find the best possible configurations. The machine-learning algorithm is only an approximation of the simulator result; therefore, it will be used to identify a certain number of best solutions (typically a dozen of them) that will then be evaluated with the reservoir simulator.

A novel data-analytics work flow is presented to assist reservoir engineers in finding optimal well locations for greenfields. The method is based on heuristic (expert-based) definitions of geological features that can be used to train machine-learning regression algorithms to predict the final cumulative production of new trajectories by training them with a reasonable number of reservoir simulations (a few thousands). Geological uncertainty provided by different possible geological realizations also is taken into account. The feasibility of this approach has been proved on a synthetic oil field with three horizontal wells to optimize. Application on a real field case is ongoing.

未经允许，不得转载本站任何文章：

相关推荐