Artificial neural network models for reservoir-aquifer dimensionless variables: influx and pressure prediction for water influx calculation

Okon, Anietie Ndarake; Ansa, Idongesit Bassey

doi:10.1007/s13202-021-01148-8

Original Paper-Production Engineering
Open access
Published: 25 March 2021

Volume 11, pages 1885–1904, (2021)

Journal of Petroleum Exploration and Production

Abstract

Calculating water influx into a petroleum reservoir is a tedious evaluation with significant reservoir engineering applications. The classical approach developed by van Everdingen–Hurst (vEH), based on the solution of the diffusivity equation, has been the fulcrum for water influx calculation in both finite and infinite-acting aquifers. The vEH model for edge-water drive reservoirs was modified by Allard and Chen for bottom-water drive reservoirs. Regrettably, the solution variables of these models, dimensionless influx ($W_{{{\text{eD}}}}$) and dimensionless pressure ($P_{D}$), were presented in tabular form. In most cases, table look-up and interpolation between time entries are necessary to determine these variables, which makes the vEH approach tedious for water influx estimation. In this study, artificial neural network (ANN) models to predict the reservoir-aquifer variables $W_{{{\text{eD}}}}$ and $P_{D}$ were developed based on the vEH datasets for the edge- and bottom-water finite and infinite-acting aquifers. The overall correlation coefficients (R) of the developed ANN models were 0.99983 and 0.99978 for the edge- and bottom-water finite aquifers, and 0.99992 and 0.99997 for the edge- and bottom-water infinite-acting aquifers, respectively. With new datasets, the generalization capacities of the developed models were evaluated using statistical tools: coefficient of determination (R²), R, mean square error (MSE), root-mean-square error (RMSE) and absolute average relative error (AARE). Comparing the $W_{{{\text{eD}}}}$ predicted by the developed finite aquifer models with the Lagrangian interpolation approach resulted in R², R, MSE, RMSE and AARE of 0.9984, 0.9992, 0.3496, 0.5913 and 0.2414 for edge-water drive and 0.9993, 0.9996, 0.1863, 0.4316 and 0.2215 for bottom-water drive.
Also, the infinite-acting aquifer models (Model-1) resulted in R², R, MSE, RMSE and AARE of 0.9999, 0.9999, 0.5447, 0.7380 and 0.2329 for edge-water drive, while bottom-water drive had 0.9999, 0.9999, 0.2299, 0.4795 and 0.1282. Again, comparing the $W_{{{\text{eD}}}}$ predicted by the edge-water infinite-acting model with that estimated by the Edwardson et al. polynomial resulted in an R² value of 0.9996, R of 0.9998, MSE of $4.740 \times 10^{-4}$, RMSE of 0.0218 and AARE of 0.0147. Furthermore, the generalization performance of the developed ANN models was compared with some models for estimating $P_{D}$. The results obtained for the finite aquifer model showed R², R, MSE, RMSE and AARE of 0.9985, 0.9993, 0.0125, 0.1117 and 0.0678 with the Chatas model and 0.9863, 0.9931, 0.1411, 0.3756 and 0.2310 with the Fanchi equation. The infinite-acting aquifer model had 0.9999, 0.9999, 0.1750, 0.0133 and $7.333 \times 10^{-3}$ with the Edwardson et al. polynomial, then 0.9865, 0.9933, 0.0143, 0.1194 and 0.0831 with the Lee model and 0.9991, 0.9996, $1.079 \times 10^{-3}$, 0.0328 and 0.0282 with the Fanchi model. Therefore, the developed ANN models can predict $W_{{{\text{eD}}}}$ and $P_{D}$ for the various aquifer sizes provided by the vEH datasets for water influx calculation.

Introduction

Most petroleum reservoirs are underlain by a water-bearing formation (aquifer), which in most cases provides the reservoir's natural energy (drive) source (Okon and Appah 2018). During oil and gas production, pressure depletion in the reservoir enables the encroachment of water (i.e., water influx) from the aquifer into the petroleum reservoir (Nashawi and Elkamel 1999). In other words, water influx contributes to the driving force (energy) used to produce hydrocarbons from the reservoir to the surface (Mustafa et al. 2018). Al-Ghanim et al. (2012) reported that calculating water influx volumes into the hydrocarbon reservoir is imperative in numerous applications, such as material balance for estimation of reserves, reservoir simulation studies for model calibration, production scheduling and setting up development strategies to optimize hydrocarbon recovery. Given this importance, a reliable model that captures the dynamics of the petroleum subsurface system is essential (Mustafa et al. 2018). In the literature, there are several models for water influx calculation that apply to different flow regimes, namely steady-state (Schilthius 1963), modified steady-state (Hurst 1943; Leung 1986a), unsteady-state (van Everdingen-Hurst 1949; Carter-Tracy 1960) and pseudo-steady-state (Fetkovitch 1971; Leung 1986b). Of all the available water influx models, the one presented by van Everdingen and Hurst (vEH) in 1949 is the most reliable. The reason is that their model is the exact solution to the radial flow diffusivity equation, that is, the partial differential equation that describes fluid flow in porous media. According to Allard and Chen (1988) and Al-Ghanim et al. (2012), the vEH model applies to all flow regimes provided the flow geometry is radial.
Also, the model solutions are for both the constant-terminal-pressure and constant-terminal-rate cases of finite and infinite-acting edge-water aquifers. This means that the vEH aquifer model is one dimensional and does not cover bottom-water aquifers.

To address this gap in the vEH solution, Coats (1962) presented a model (diffusivity equation) that considers the upward movement of water from the aquifer into the reservoir, that is, a bottom-water drive model. His model was two dimensional and provided the solution for only the constant-terminal-rate case of an infinite-acting aquifer. Thus, the Coats (1962) model was not applicable to finite aquifers. Allard and Chen (1988) then modified the Coats (1962) model to provide solutions for the constant-terminal-pressure case in both finite and infinite-acting aquifers. These classical models for calculating water influx in edge- and bottom-water drive reservoirs are quite useful in reservoir engineering applications (Etim 2019). Regrettably, their established solutions for the reservoir-aquifer variables, dimensionless influx ($W_{{{\text{eD}}}}$) and dimensionless pressure ($P_{D}$) for the constant-terminal-pressure case and constant-terminal-rate case, respectively, are in table form. Hence, the use of these models for water influx calculation requires table look-up and interpolation between time entries (Nashawi and Elkamel 1999; Al-Ghanim et al. 2012) to determine $W_{{{\text{eD}}}}$ and $P_{D}$ for the appropriate aquifer size. When these vEH-based models are applied in petroleum reservoir software, executing table look-up and interpolation between time entries requires considerable computational effort (Ansa 2019). Okotie and Ikporo (2019) added that the vEH-based approach of calculating water influx requires the principle of superposition, which is not a straightforward procedure. In this direction, Carter and Tracy (1960) and Fetkovitch (1971) developed approximation models for calculating water influx. The Carter-Tracy model handled the tedious process of superposition involved in the vEH-based approach.
However, the challenge of table look-up and interpolation between time entries remains part of the approach, as they introduced dimensionless pressure ($P_{D}$) and its derivative ($P^{\prime}_{D}$) with respect to dimensionless time ($t_{D}$) (Nashawi and Elkamel 1999). Allard and Chen (1988) maintained that these vEH-based approximate models are applicable to only a limited range of flow conditions or reservoir-aquifer geometries. Considering the wide acceptability of the vEH-based approach, it is therefore important to have models that are not limited by flow condition but are applicable across the full range of the vEH datasets. Hence, the focus of this study is to develop ANN models that cover the vEH datasets for predicting the $W_{{{\text{eD}}}}$ and $P_{D}$ variables used to calculate the water influx volume into reservoirs.

Overview of some existing models for estimating the reservoir-aquifer dimensionless variables: $W_{{{\text{eD}}}}$ and $P_{D}$

Among the available models for calculating water influx into the reservoir, there is no doubt that the vEH-based models, which are the exact solutions of the diffusivity equations, are the most applicable. As reported by Al-Ghanim et al. (2012), these models suffer a limitation because their results (i.e., the $W_{{{\text{eD}}}}$ and $P_{D}$ variables) are in tabular form, which significantly limits their application in computer analysis and simulation studies. For the constant-terminal-rate case in finite-radial and infinite-acting edge-water drive reservoirs, the equations proposed by Chatas (1953), Lee (1982) and Fanchi (1985) estimate $P_{D}$ from $t_{D}$ and $r_{{{\text{eD}}}}$. In another development, the polynomials developed by Edwardson et al. (1962) for the approximation of $q_{D}$, $P_{D}$ and $P^{\prime}_{D}$ as a function of $t_{D}$ for an infinite-acting aquifer were extended to the determination of water influx dimensionless variables, with $W_{{{\text{eD}}}}$ replacing $q_{D}$ in the equations. Also, Klins et al. (1999) developed some complex polynomials for estimating the water influx variables $W_{{{\text{eD}}}}$, $P_{D}$ and $P^{\prime}_{D}$ for finite and infinite-acting aquifers that apply to edge-water drive reservoirs. They maintained that these equations represent a tractable replacement for tabular listings of the vEH dimensionless functions. Again, Al-Ghanim et al. (2012) developed nonparametric optimal transformation models for $W_{{{\text{eD}}}}$ and $P_{D}$ for edge-water drive reservoirs that are data-driven and do not assume an a priori functional form as other models do. Regrettably, as reported by Nashawi and Elkamel (1999) and Al-Ghanim et al. (2012), the available models for estimating these water influx dimensionless variables are characterized by drawbacks. For instance, the Chatas (1953), Edwardson et al. (1962), Lee (1982), Fanchi (1985) and Klins et al. (1999) models do not apply to all the aquifer sizes (i.e., finite and infinite-acting aquifers) and reservoir drives (edge- and bottom-water drive). Also, these models do not provide values for all the aquifer sizes presented by vEH, as their estimates are limited to their correlation ranges. Again, the Klins et al. (1999) and Al-Ghanim et al. (2012) equations are not easy to implement, as they involve some complex transformations and computational effort. It is therefore necessary to have a model that handles these drawbacks of the existing models. According to Nashawi and Elkamel (1999), intelligent models, namely neural network models, would provide the values of the dimensionless variables for the various reservoir drives and aquifer sizes as presented by vEH. They developed ANN models for predicting $W_{{{\text{eD}}}}$ and $P_{D}$ in edge- and bottom-water drive finite and infinite-acting aquifers. Their ANN models were multiple-input single-output (MISO), except for the edge-water drive infinite-acting aquifer model, which was single-input single-output (SISO). The performance of these models was evaluated based on their training and testing data point errors (i.e., minimum, maximum and average errors) and compared with the results obtained from the Fanchi (1985) and Klins et al. (1999) equations. Unfortunately, the generalization capacities of these ANN models were not tested with new datasets to establish their application potential. Also, except for the finite edge-water ANN model, the scaled variables of the other models were further normalized by taking their natural logarithm. This means that de-normalizing the predicted outputs of these models would not be straightforward, as it involves a two-stage de-normalization. Besides, ANN models predict values effectively in the range of 0.00001–1.0, which is not the case for the edge- and bottom-water drive infinite-acting aquifer data.
Therefore, the potential of the Nashawi and Elkamel (1999) ANN models to predict new sets of data is in doubt. Hence, it is imperative to develop ANN models that can handle new datasets and whose predictions of $W_{{{\text{eD}}}}$ and $P_{D}$ are comparable with those of the existing models considered in this study.

Overview of artificial neural network (ANN)

According to Zou et al. (2008), the artificial neural network (ANN), often just called a neural network, is a machine learning method that evolved from the idea of simulating the human brain. An ANN is therefore modeled on the concept of a biological neural network, with interconnected nodes or neurons. An ANN consists of several artificial neurons (i.e., nonlinear processing units) connected through weights (Krenker et al. 2011). Zou et al. (2008) reported that an ANN has three major components, namely, node character, network topology and learning rules. The node (neuron) character describes how signals are processed by the neuron, such as the number of inputs and outputs and the activation (transfer) function. The network topology controls the manner in which neurons are arranged and linked in the network. Again, the learning rules determine how the weights and biases (thresholds) are initialized and adjusted in the network. There are several types of neural networks, namely, feed-forward neural network (FFNN), multilayer perceptron (MLP), generalized regression neural network (GRNN), convolution neural network (CNN), radial basis function neural network (RBFNN), recurrent neural network (RNN), etc. A typical ANN topology or architecture has three layers: input layer, hidden layer and output layer (Jiang et al. 2018; Han et al. 2018). Figure 1 depicts a simplified topology of an ANN, which can be represented mathematically as in Eq. 1 (Anifowose et al. 2012).

$$ y = f\left[ {\sum\limits_{i = 1}^{n} {\left( {x_{i} W_{i} + b_{i} } \right)} } \right] $$

(1)

where $x_{i}$ are the inputs to the neuron, $W_{i}$ are the weights attached to the inputs, $b_{i}$ is the bias (or threshold), $f$ is the network transfer function, and $y$ is the output of the neuron. Krenker et al. (2011) mentioned that the major unknown variable in Eq. 1 is the transfer function, which is chosen based on the nature of the problem to be solved by the artificial neuron. The various transfer or activation functions available in the literature are linear, nonlinear, piece-wise linear, sigmoidal, tangent, hyperbolic and polynomial functions (Anifowose et al. 2012). In any case, the most used transfer functions in a neural network are the linear function “purelin” and the nonlinear (sigmoid) function “tansig.” The sigmoid function is shown in Eq. 2:

$$ \sigma \left( z \right) = \frac{1}{{1 + \exp \left( { - z} \right)}} $$

(2)

where $z$ represents the summed variables at the node and $\sigma \left( z \right)$ denotes the transformed node output. The value processed by the sigmoid function is the network node output value. An artificial neural network learns a task by adjusting its weights (Musa and Hamisu 2019). The higher the weight of an artificial neuron, the stronger the influence of the input that is multiplied by it. The types of network learning or training are the supervised and unsupervised learning approaches. Supervised training requires the output data to learn the target data, while unsupervised learning does not need the output data to predict the target outcome (Krenker et al. 2011). There are several ANN learning algorithms available in the literature. The purpose of any training algorithm is to minimize the mean square error (MSE) between the outputs predicted by the model and the observed (target) outputs used in the network training (Okon et al. 2020). Examples of the available training algorithms include Levenberg–Marquardt, Bayesian regularization, scaled conjugate gradient, Quasi-Newton, etc. Among these ANN learning algorithms, the Levenberg–Marquardt algorithm is the most efficient (Konate et al. 2015), as it is faster and has more convergence stability than other learning algorithms (Hagan and Menhaj 1994). So far, ANNs have been applied in numerous fields, such as medicine, environmental science, software engineering and control engineering. In petroleum engineering, the most common type of ANN is the MLP, which is trained with a feed-forward back-propagation (FFBP) approach (Wood 2019). Some applications of ANN in the petroleum industry include prediction of hydrocarbon reserves (Ma and Gomez 2015); reservoir characterization (Long et al. 2016); mud loss treatment (Cristofaro et al. 2017); relative permeability interpolation (Dang et al. 2018); water saturation prediction of sandstone reservoirs (Khan et al. 2018); and development of a screening tool for CO₂ injection in naturally fractured reservoirs (Hammam and Ertekim 2018), among others.
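As an illustration of Eqs. 1 and 2, the output of a single neuron can be sketched in a few lines of Python. This is a minimal sketch and not the authors' implementation: the function names `sigmoid` and `neuron_output` are hypothetical, the per-input bias terms follow Eq. 1 exactly as written, and the transfer function is the logistic form of Eq. 2.

```python
import math


def sigmoid(z):
    """Logistic sigmoid of Eq. 2, written to avoid overflow for large |z|."""
    if z >= 0.0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)  # safe because z < 0
    return e / (1.0 + e)


def neuron_output(inputs, weights, biases, transfer=sigmoid):
    """Eq. 1: y = f(sum_i (x_i * W_i + b_i)) for one artificial neuron."""
    if not (len(inputs) == len(weights) == len(biases)):
        raise ValueError("inputs, weights and biases must have equal length")
    z = sum(x * w + b for x, w, b in zip(inputs, weights, biases))
    return transfer(z)


if __name__ == "__main__":
    # Illustrative numbers only; they are not taken from the paper.
    print(neuron_output([1.0, 2.0], [0.5, -0.25], [0.0, 0.0]))
```

With the illustrative inputs above, the weighted sum is zero, so the neuron returns the sigmoid midpoint of 0.5. Passing `transfer=lambda z: z` gives a linear neuron instead.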

Materials and methods

Data acquisition and preparation

The edge-water van Everdingen-Hurst (1949) and bottom-water Allard and Chen (1988) dimensionless datasets of time ($t_{D}$), radius ($r_{{{\text{eD}}}}$), vertical distance ($z_{D}$) and water influx ($W_{{{\text{eD}}}}$) required for the finite (bounded) and infinite-acting aquifers were extracted from Ahmed and McKinney (2005). These vEH datasets were based on the analytical solution (using Laplace transformation) to the radial diffusivity equation, which assumed a step change between the reservoir and the aquifer pressure. The constant reservoir-aquifer boundary pressure solution was presented in the form of dimensionless water influx ($W_{{{\text{eD}}}}$) as a function of $t_{D}$ and $r_{{{\text{eD}}}}$. Also, the corresponding edge-water dimensionless pressure ($P_{D}$) for the two reservoir-aquifer configurations, finite (bounded) and infinite-acting, was evaluated using the Chatas (1953) and Edwardson et al. (1962) models (Eqs. 3 and 4). In the bounded aquifer, the edge-water dimensionless influx ($W_{{{\text{eD}}}}$) and dimensionless pressure ($P_{D}$) are functions of dimensionless time ($t_{D}$) and dimensionless radius ($r_{{{\text{eD}}}}$), while in the bottom-water case they are functions of $t_{D}$, $r_{{{\text{eD}}}}$ and dimensionless vertical distance ($z_{D}$). Also, in the infinite-acting aquifer, $W_{{{\text{eD}}}}$ and $P_{D}$ are functions of $t_{D}$ in the edge-water drive reservoir and functions of $t_{D}$ and $z_{D}$ in the bottom-water drive reservoir. Regrettably, there is no empirical or analytical model available in the literature for the estimation of $P_{D}$ in the bottom-water drive reservoir type for bounded and infinite-acting aquifers.

For the finite (bounded) aquifer, the Chatas (1953) model for predicting $P_{D}$ in edge-water drive reservoir-aquifer configuration is given as:

$$ P_{D} = \frac{{0.5 + 2t_{D} }}{{r_{{{\text{eD}}}}^{2} - 1}} - \frac{{r_{{{\text{eD}}}}^{4} \left[ {3 - 4\ln \left( {r_{{{\text{eD}}}} } \right)} \right] - 2r_{{{\text{eD}}}}^{2} - 1}}{{4\left( {r_{{{\text{eD}}}}^{2} - 1} \right)^{2} }} $$

(3)
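For use in software, Eq. 3 translates directly into a short function. The sketch below is one possible transcription, assuming a hypothetical function name `chatas_pd`; the source states no validity range for the equation here, so the code only guards against $r_{{{\text{eD}}}} \le 1$, where the denominators vanish or the geometry is not meaningful.

```python
import math


def chatas_pd(t_d, r_ed):
    """Dimensionless pressure P_D from Eq. 3 (Chatas model, bounded aquifer).

    t_d  : dimensionless time
    r_ed : dimensionless radius, must be greater than 1
    """
    if r_ed <= 1.0:
        raise ValueError("r_ed must be greater than 1")
    r2 = r_ed ** 2
    # First term of Eq. 3: linear in dimensionless time.
    time_term = (0.5 + 2.0 * t_d) / (r2 - 1.0)
    # Second term of Eq. 3: depends on the aquifer size only.
    size_term = (r_ed ** 4 * (3.0 - 4.0 * math.log(r_ed)) - 2.0 * r2 - 1.0) / (
        4.0 * (r2 - 1.0) ** 2
    )
    return time_term - size_term


if __name__ == "__main__":
    # Illustrative inputs only; they are not taken from the vEH tables.
    print(chatas_pd(1.0, 2.0))
```

Because the second term does not depend on $t_{D}$, the function is linear in $t_{D}$ with slope $2/(r_{{{\text{eD}}}}^{2} - 1)$, which gives a quick sanity check on any implementation.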

For the infinite-acting aquifer, the Edwardson et al. (1962) model for predicting $P_{D}$ in the edge-water drive reservoir-aquifer configuration is given as:

$$ P_{D} = \frac{{370.529\sqrt {t_{D} } + 137.582t_{D} + 5.69549\left( {t_{D} } \right)^{1.5} }}{{328.834 + 265.488\sqrt {t_{D} } + 45.2157t_{D} + \left( {t_{D} } \right)^{1.5} }} $$

(4)

When $t_{D} > 100$, $P_{D} = 0.5\left[ {\ln \left( {t_{D} } \right) + 0.80907} \right]$.
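Eq. 4 and its large-time logarithmic form can be combined in one function that switches at $t_{D} = 100$, as the text describes. The following is a minimal sketch under that reading; the name `edwardson_pd` is hypothetical, and the guard against non-positive $t_{D}$ is an added assumption rather than something stated in the source.

```python
import math


def edwardson_pd(t_d):
    """Dimensionless pressure P_D for an infinite-acting aquifer.

    Uses the rational polynomial of Eq. 4 for t_d <= 100 and the
    logarithmic expression quoted in the text for t_d > 100.
    """
    if t_d <= 0.0:
        raise ValueError("t_d must be positive")
    if t_d > 100.0:
        return 0.5 * (math.log(t_d) + 0.80907)
    root = math.sqrt(t_d)
    numerator = 370.529 * root + 137.582 * t_d + 5.69549 * t_d ** 1.5
    denominator = 328.834 + 265.488 * root + 45.2157 * t_d + t_d ** 1.5
    return numerator / denominator


if __name__ == "__main__":
    # Illustrative dimensionless times only.
    for t_d in (1.0, 100.0, 1000.0):
        print(t_d, edwardson_pd(t_d))
```

The two branches are separate approximations, so a small jump at the switch point is expected and is not smoothed here.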

Therefore, for the bounded aquifer, the edge-water drive dataset consists of 516 data points with $t_{D}$ and $r_{{{\text{eD}}}}$ as input data and $W_{{{\text{eD}}}}$ and $P_{D}$ as output variables. The bottom-water drive reservoir type has 1218 data points with $t_{D}$, $r_{{{\text{eD}}}}$ and $z_{D}$ as input data and $W_{{{\text{eD}}}}$ as output data. Again, for the infinite-acting aquifer, there were 549 data points for the edge-water drive reservoir with $t_{D}$ as the input variable and $W_{{{\text{eD}}}}$ and $P_{D}$ as output data. The bottom-water drive dataset consists of 4098 data points with $t_{D}$ and $z_{D}$ as input data and $W_{{{\text{eD}}}}$ as output data. Tables 1 and 2 present the minimum and maximum values of the input and output variables and the statistical description of these values for the various reservoir-aquifer configurations. As observed in Table 2, especially for the infinite-acting aquifer, the differences between the maximum and minimum values of the variables (i.e., the ranges) are large, and these values would affect the network training process if not scaled down. Hence, the input and output variables were normalized to 0–1 using the maximum–minimum normalization equation (Eq. 5). This approach ensures that the neural network training algorithm adjusts the network weights and biases adequately. Again, scaling the input and output data to 0–1 reduces the sensitivity of the neural networks’ sigmoidal (i.e., activation) function to large data values (Okon et al. 2020).

$$ y_{{{\text{normalized}}}} = \frac{{y - y_{\min } }}{{y_{\max } - y_{\min } }} $$

(5)

where $y_{{{\text{normalized}}}}$ is the normalized input or output variable, $y$ is the actual variable value, and $y_{\min }$ and $y_{\max }$ are the minimum and maximum values of the variable, respectively. It is worth mentioning that after normalizing the infinite-acting aquifer edge-water and bottom-water datasets, the scaled $t_{D}$ and $W_{{{\text{eD}}}}$ datasets were in the range of $1.0 \times 10^{-12}$ to 1.0. These extreme values would affect the generalization capacity of the ANN model predictions. Therefore, the edge-water and bottom-water infinite-acting aquifer datasets were grouped into five sets and then scaled based on the minimum and maximum values in Table 3 to reduce the range of these datasets.
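The scaling of Eq. 5, together with its algebraic inverse for recovering physical values from a network output, can be sketched as follows. The function names are hypothetical, and the inverse is simply Eq. 5 solved for $y$; the source does not spell it out. The minimum and maximum in the usage lines are placeholders, not the values of Tables 1 to 3.

```python
def minmax_normalize(y, y_min, y_max):
    """Eq. 5: scale y into the 0-1 range using the variable's min and max."""
    if y_max == y_min:
        raise ValueError("y_max and y_min must differ")
    return (y - y_min) / (y_max - y_min)


def minmax_denormalize(y_normalized, y_min, y_max):
    """Inverse of Eq. 5: recover the actual value from a scaled one."""
    return y_normalized * (y_max - y_min) + y_min


if __name__ == "__main__":
    # Placeholder limits for illustration; not taken from the paper's tables.
    t_d_min, t_d_max = 0.0, 10.0
    scaled = minmax_normalize(5.0, t_d_min, t_d_max)
    print(scaled, minmax_denormalize(scaled, t_d_min, t_d_max))
```

When a dataset is split into groups that are scaled separately, as described above for the infinite-acting aquifers, the same pair of functions would be applied with the limits of the group that a given data point falls in.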

Table 1 Minimum and maximum values of the input and output variables for ANN models development
