Introduction

Most petroleum reservoirs are underlain by water-bearing formation aquifer, which in most cases provides the reservoir natural energy (drive) source (Okon and Appah 2018). During oil and gas production from the reservoir, the pressure drop depletion, in the reservoir, enables the encroachment of water (i.e., water influx) from the aquifer into the petroleum reservoir (Nashawi and Elkamel 1999). In other words, water influx contributes to the driving force (energy) used for the production of hydrocarbon from the reservoir to the surface (Mustafa et al. 2018). Al-Ghanim et al. (2012) reported that calculation of water influx volumes into the hydrocarbon reservoir is imperative in numerous applications, such as material balance for estimation of reserves, reservoir simulation studies for model calibration, production scheduling and setting up development strategies to optimize hydrocarbon recovery. Because of the mentioned importance of water influx calculation, a reliable model that captures the dynamics of petroleum subsurface system is of essence (Mustafa et al. 2018). In the literature, there are several models for water influx calculation which apply to different flow regimes, namely steady-state (Schilthius 1963), modified steady-state (Hurst 1943; Leung 1986a), unsteady-state (van Everdingen-Hurst 1949; Carter-Tracy 1960) and pseudo-steady-state (Fetkovitch 1971; Leung 1986b). In all the available water influx models, van Everdingen and Hurst (vEH) in 1949 presented the most reliable model for water influx calculation. The reason is that their developed model was the exact solution to the radial flow diffusivity equation, that is, the partial differential equation that describes the fluid flow in porous media. According to Allard and Chen (1988) and Al-Ghanim et al. (2012), the vEH model applies to all flow regimes provided the flow geometry is radial. Also, the model solutions are for both the constant-terminal-pressure and constant-terminal-rate cases of finite and infinite-acting edge-water aquifers. Meaning, the vEH aquifer model was one dimensional and does not cover bottom-water aquifer.

Looking at the gap in vEH solution, Coats (1962) presented a model (diffusivity equation) that considers the upward movement of water from the aquifer into the reservoir, thus, bottom-water drive model. His developed model was two dimensional that provides the solution for only the constant-terminal-rate case of an infinite-acting aquifer. Thus, Coats (1962) model was not applicable to the finite aquifer. Again, using Coats (1962) model, Allard and Chen (1988) modified the model to provide solutions for constant-terminal-pressure case in both finite and infinite-acting aquifers. These classical models for calculating water influx in edge- and bottom-water drive reservoirs are quite useful in reservoir engineering applications (Etim 2019). Regrettably, their established solutions for reservoir-aquifer variables: dimensionless influx (\(W_{{{\text{eD}}}}\)) and dimensionless pressure (\(P_{D}\)) for the constant-terminal-rate case and constant-terminal-pressure case, respectively, are in table form. Hence, the use of these models for water influx calculation requires table look-up and interpolation between time entries (Nashawi and Elkamel 1999; Al-Ghanim et al. 2012) to determine the \(W_{{{\text{eD}}}}\) and \(P_{D}\) for the appropriate aquifer size. In applying these vEH-based models in petroleum reservoir software, the task of executing table look-up and interpolation between time entries requires huge computer efforts (Ansa 2019). Okotie and Ikporo (2019) added that vEH-based approach of calculating water influx requires the principle of superposition which is not straightforward procedure. In this direction, Carter and Tracy (1960) and Fetkovitch (1971) developed approximation models for calculating water influx. Carter-Tracy model handled the tedious process of superposition involved in vEH-based approach. However, the challenge of table look-up and interpolation between time entries remains part of the approach, as they introduced dimensionless pressure (\(P_{D}\)) and its derivative (\(P^{\prime}_{D}\)) with respect to dimensionless time (\(t_{D}\)) (Nashawi and Elkamel 1999). Allard and Chen (1988) maintained that these vEH-based approximate models are applicable to only a limited range of flow condition or reservoir-aquifer geometries. Considering the wide acceptability of the vEH-based approach, therefore, it is important to have models that are not limited to flow condition rather applicable to all vEH datasets ranges. Hence, the focus of this study is to develop ANN models that cover the vEH datasets for predicting \(W_{{{\text{eD}}}}\) and \(P_{D}\) variables for calculation of water influx volume into the reservoirs.

Overview of some existing models for estimating the reservoir-aquifer dimensionless variables: \(W_{{{\text{eD}}}}\) and \(P_{D}\)

Among the available models for calculating water influx into the reservoir, there is no doubt that the vEH-based models, which are the exact solutions of the diffusivity equations, are the most applicable. As reported by Al-Ghanim et al. (2012), these models suffer limitation because their results (i.e., \(W_{{{\text{eD}}}}\) and \(P_{D}\) variables) are in tabular forms, which significantly limits their application in computer analysis and simulation studies. For the constant-terminal-rate in finite-radial and infinite-acting edge-water drive reservoirs, the proposed Chatas (1953), Lee (1982) and Fanchi (1985) equations for estimating \(P_{D}\) from \(t_{D}\) and \(r_{{{\text{eD}}}}\) determine this dimensionless variable. In another development, the polynomials developed by Edwardson et al. (1962) for the approximation of \(q_{D}\), \(P_{D}\) and \(P^{\prime}_{D}\) as a function of \(t_{D}\) for an infinite-acting aquifer are extended to water influx dimensionless variables determination, as \(W_{{{\text{eD}}}}\) replaces \(q_{D}\) in the equations. Also, Klins et al. (1999) developed some complex polynomials for estimating water influx variables: \(W_{{{\text{eD}}}}\), \(P_{D}\) and \(P^{\prime}_{D}\) for finite and infinite-acting aquifers that apply to edge-water drive reservoirs. They maintained that these equations represent a traceable replacement to tubular listings of the vEH dimensionless functions. Again, Al-Ghanim et al. (2012) developed nonparametric optimal transformation models for \(W_{{{\text{eD}}}}\) and \(P_{D}\) for edge-water drive reservoirs that are data-driven and do not assume a priori function form as other models do. Regrettably, as reported by Nashawi and Elkamel (1999) and Al-Ghanim et al. (2012), the available models for estimating these water influx dimensionless variables are characterized by drawbacks. For instance, Chatas (1953), Edwardson et al. (1962), Lee (1982), Fanchi (1985) and Klins et al. (1999) models do not apply to the various aquifer sizes (i.e., finite and infinite-acting aquifers) and reservoir drives (edge- and bottom-water drive). Also, these models do not provide values for all the aquifer sizes presented by vEH as their estimation is correlation range limited. Again, Klins et al. (1999) and Al-Ghanim et al. (2012) equations are not easy to implement as they involve some complex transformations and computation effort. In this direction, it is necessary to have a model that will handle the mentioned drawbacks of these existing models. According to Nashawi and Elkamel (1999), intelligent models-neural network models would provide the values of the dimensionless variables for the various reservoir drive and aquifer sizes as presented by vEH. They developed ANN models for predicting \(W_{{{\text{eD}}}}\) and \(P_{D}\) in edge- and bottom-water drive finite and infinite-acting aquifers. Their developed ANN models were multiple-inputs single-output (MISO) except for edge-water drive infinite-acting aquifer model that was single-input single-output (SISO). The performance of these models was evaluated based on their training and testing data points errors (i.e., minimum, maximum and average errors) and compared to the results obtained from Fanchi (1985) and Klins et al. (1999) equations. Unfortunately, these ANN models generalization capacities were not tested with new datasets to establish their application potentials. Also, except for the finite edge-water ANN model, other models’ scaled variables were further normalized by taking their natural logarithm. Meaning that the de-normalization of these models predicted outputs would not be straight forward, as they will involve 2-stage de-normalization. Besides, any ANN model predicts values in the range of 0.00001–1.0 effectively, which is not the case in edge- and bottom-water drive infinite-acting aquifers. Therefore, the potential of the Nashawi and Elkamel (1999) ANN models to predict new sets of data is in doubt. Hence, it is imperative to develop ANN models that can handle new datasets to predict \(W_{{{\text{eD}}}}\) and \(P_{D}\) that are comparable with the existing models in this study.

Overview of artificial neural network (ANN)

According to Zou et al. (2008), the artificial neural network (ANN), often just called a neural network, is a machine learning method evolved from the idea of simulating the human brain. Therefore, ANN is modeled on the concept of biological neural network with ANN as the interconnection nodes or neurons. An ANN consists of several artificial neurons (i.e., nonlinear processing unit) connected through weights (Krenker et al. 2011). Zou et al. (2008) reported that ANN has three major components, namely, node character, network topology and learning rules. The node or neuron character signals processes by the neuron, such as the number of inputs and output, and the activation (transfer) function. The network topology controls the manners neurons are array and linked to the network. Again, the learning rules determine how the weights and biases (threshold) are initialized and adjusted in the network. There are several types of neural networks, namely, feed-forward neural network (FFNN), multilayer perceptron (MLP), generalized regression neural network (GRNN), convolution neural network (CNN), radial basis function neural network (RBFNN), recurrent neural network (RNN), etc. A typical ANN topology or architecture has three layers: input layer, hidden layer and output layer (Jiang et al. 2018; Han et al. 2018). Figure 1 depicts a simplified topology of an ANN, which can be represented mathematically as in Eq. 1 (Anifowose et al. 2012).

$$ y = f\left[ {\sum\limits_{i = 1}^{n} {\left( {x_{i} W_{i} + b_{i} } \right)} } \right] $$
(1)

where \(x_{i}\) are the inputs to the neuron, \(W_{i}\) are the weight attached to the inputs to the neuron, \(b_{i}\) is the bias (or threshold), \(f\) is the network transfer function, and \(y\) is the output of the neuron. Krenker et al. (2011) mentioned that the major unknown variable in Eq. 1 is the transfer function, which is chosen based on the nature of the problem to be solved by the artificial neuron. The various transfer or activation functions available in the literature are linear, nonlinear, piece-wise linear, sigmoidal, tangent, hyperbolic and polynomial functions (Anifowose et al. 2012). In any case, the most used transfer functions in a neural network are linear function “purelin” and nonlinear (sigmoid) function “tansig.” The sigmoid function is as shown in Eq. 2;

$$ \sigma \left( z \right) = \frac{1}{{1 + \exp \left( { - z} \right)}} $$
(2)

where \(z\) represents the node summed variables and \(\sigma \left( z \right)\) denotes the transformed node output. The value processed by the sigmoid function is the network node output value. An artificial neural network learns a task by adjusting its weights (Musa and Hamisu 2019). The higher the weight of an artificial neuron is, the stronger the input which is multiplied by it will be. The types of network learning or training are supervised and unsupervised learning approaches. Supervised training requires the output data to learn the target data, while unsupervised learning does not need the output data to predict the target outcome (Krenker et al. 2011). There are several ANN learning algorithms available in the literature. The importance of any training algorithm is to minimize the mean square error (MSE) between the predicted output (target) datasets of the model and the observation outputs datasets used in the network training (Okon et al. 2020). Examples of the available training algorithms include Levenberg–Marquardt, Bayesian regularization, scaled conjugate gradient, Quasi-Newton, etc. Among these ANN learning algorithms, Levenberg–Marquardt algorithm is the most efficient (Konate et al. 2015), as it is faster and has more convergence stability than other learning algorithms (Hagan and Menhaj 1994). So far, ANNs have been applied in numerous fields, like medical, environmental, software engineering, control engineering, etc. In petroleum engineering, the most common type of ANN is the MLP which is train with a feed-forward back-propagation (FFBP) approach (Wood 2019). Some applications of ANN in petroleum industry include prediction of hydrocarbon reserve (Ma and Gomez 2015); reservoir characterization (Long et al. 2016); mud loss treatment (Cristofaro et al. 2017); relative permeability interpolation (Dang et al. 2018); water saturation prediction of sandstone reservoirs (Khan et al. 2018); development of screening tool for CO2 injection in naturally fractured reservoirs (Hammam and Ertekim 2018), among others.

Fig. 1
figure 1

A simplified ANN topology (architecture) (Anifowose et al. 2012)

Materials and methods

Data acquisition and preparation

The edge-water van Everdingen-Hurst (1949) and bottom-water Allard and Chen (1988) dimensionless datasets of time (\(t_{D}\)), radius (\(r_{{{\text{eD}}}}\)), vertical distance (\(z_{D}\)) and water influx (\(W_{{{\text{eD}}}}\)) required for the finite (bounded) and infinite-acting aquifers were extracted from Ahmed and McKinney (2005). These vEH datasets were based on analytical solution (using Laplace transformation) to the radial diffusivity equation, which assumed there was step change between the reservoir and the aquifer pressure. The constant reservoir-aquifer boundary pressure solution was presented in the form of dimensionless water influx (\(W_{{{\text{eD}}}}\)) as a function of \(t_{D}\) and \(r_{{{\text{eD}}}}\). Also, the corresponding edge-water dimensionless pressure (\(P_{D}\)) for the two reservoir-aquifer configurations: finite (bounded) and infinite-acting, was evaluated using Chatas (1953) and Edwardson et al. (1962) models (Eqs. 3 and 4). In the bounded aquifer, the edge-water dimensionless influx (\(W_{{{\text{eD}}}}\)) and dimensionless pressure (\(P_{D}\)) are functions of dimensionless time (\(t_{D}\)) and dimensionless radius (\(r_{{{\text{eD}}}}\)) while in the bottom-water, they are functions of \(t_{D}\), \(r_{{{\text{eD}}}}\) and dimensionless vertical distance (\(z_{D}\)). Also, in the infinite-acting aquifer, \(W_{{{\text{eD}}}}\) and \(P_{D}\) are a function of \(t_{D}\) in edge-water drive and the functions of \(t_{D}\) and \(z_{D}\) in the bottom-water drive reservoir. Regrettably, there is no available empirical or analytical model(s) in the literature for the estimation of \(P_{D}\) in the bottom-water drive reservoir type for bounded and infinite-acting aquifers.

For the finite (bounded) aquifer, the Chatas (1953) model for predicting \(P_{D}\) in edge-water drive reservoir-aquifer configuration is given as:

$$ P_{D} = \frac{{0.5 + 2t_{D} }}{{r_{{{\text{eD}}}}^{2} - 1}} - \frac{{r_{{{\text{eD}}}}^{4} \left[ {3 - 4\ln \left( {r_{{{\text{eD}}}} } \right)} \right] - 2r_{{{\text{eD}}}}^{2} - 1}}{{4\left( {r_{{{\text{eD}}}}^{2} - 1} \right)^{2} }} $$
(3)

For the infinite-acting aquifer, the Edwardson et al. (1962) model for predicting \(P_{D}\) in bottom-water drive reservoir-aquifer configuration is expanded as:

$$ P_{D} = \frac{{370.529\sqrt {t_{D} } + 137.582t_{D} + 5.69549\left( {t_{D} } \right)^{1.5} }}{{328.834 + 265.488\sqrt {t_{D} } + 45.2157t_{D} + \left( {t_{D} } \right)^{1.5} }} $$
(4)

when the \(t_{D} > 100\); thus, \(P_{D} = 0.5\left[ {\ln \left( {t_{D} } \right) + 0.80907} \right]\)

Therefore, for the bounded aquifer, the edge-water drive consists of 516 datasets with \(t_{D}\) and \(r_{{{\text{eD}}}}\) as input data and \(W_{{{\text{eD}}}}\) and \(P_{D}\) as output variables. The bottom-water drive reservoir type has 1218 data points with \(t_{D}\), \(r_{{{\text{eD}}}}\) and \(z_{D}\) as input data and \(W_{{{\text{eD}}}}\) as output data. Again, for the infinite-acting aquifer, the datasets were 549 for an edge-water drive reservoir with \(t_{D}\) as an input variable and \(W_{{{\text{eD}}}}\) and \(P_{D}\) as output data. The bottom-water drive consists of 4098 datasets with \(t_{D}\) and \(z_{D}\) as input data and \(W_{{{\text{eD}}}}\) as output data. Tables 1 and 2 present the input and output variables’ minimum and maximum values and the statistical description of these variables’ values for the various reservoir-aquifer configurations. As observed in Table 2, especially for the infinite-acting aquifer, the differences between the variables maximum and minimum values (i.e., range) are large, and these values will affect the network training process if not scaled down. Hence, the input and output variables were normalized to 0–1 using the maximum–minimum normalization equation (Eq. 5). This approach ensures that the neural network training algorithm will adjust the network weights and biases adequately. Again, scaling the input and output data to 0–1 reduces the sensitivity of the neural networks’ sigmoidal (i.e., activation) function to large data values (Okon et al. 2020).

$$ y_{{{\text{normalized}}}} = \frac{{y - y_{\min } }}{{y_{\max } - y_{\min } }} $$
(5)

where \(y_{{{\text{normalized}}}}\) is the normalized input or output variable,\(y\) is the actual variable value, and \(y_{\min }\) and \(y_{\max }\) are the minima and maxima values of the variables, respectively. It is worth mentioning that after normalizing the infinite-acting aquifer edge-water and bottom-water datasets, the scaled \(t_{D}\) and \(W_{{{\text{eD}}}}\) datasets were in the range of 1.0 × 10–12–1.0. These extreme values would affect the generalization capacity of the ANN models predictions. In this case, the edge-water and bottom-water infinite-acting aquifer datasets were grouped into five sets, and then scaled based on the minimum and maximum values in Table 3 to reduce these datasets range.

Table 1 Minimum and maximum values of the input and output variables for ANN models development
Table 2 Statistical description of the input and output variables used for the ANN models development
Table 3 Ranges of the grouped edge- and bottom-water drive infinite-acting aquifer variables

Artificial neural network development

The artificial neural network (ANN) models for predicting the dimensionless influx (\(W_{{{\text{eD}}}}\)) and the dimensionless pressure (\(P_{D}\)) of finite (bounded) and infinite-acting aquifers were developed using the neural fitting tool (nftool) in MATLAB (Matrix Laboratory) 2015a mathematical software. The normalized input (i.e., \(t_{D}\), \(r_{{{\text{eD}}}}\) and \(z_{D}\)) and output (i.e.,\(W_{{{\text{eD}}}}\) and \(P_{D}\)) datasets for the various reservoir-aquifer configurations were exported to the MATLAB nftool environment and randomly divided into three parts, namely, training set (70%), validation set (15%) and testing set (15%), as shown in Table 4. The basic settings of the MATLAB software used to train and developed the ANN models are visible in Table 5. The neural network training was supervised learning as the target data (\(W_{{{\text{eD}}}}\) and \(P_{D}\)) were provided to the network as outputs. In the training phase, the network learning algorithm: Levenberg–Marquardt (trainlm in MATLAB which takes less time) adjusts the network neurons’ weights and biases until the neural network learns the appropriate input–output pattern of the datasets (Okon et al. 2020). Again, during the training stage, the predictions performance of the neural network was assessed using the validation and test datasets. Then, the network training truncated based on the stopping criteria, that is, mean square error (MSE) and the number of the epoch (iteration). The MATLAB software default values for these stopping criteria are given in Table 5. In this software, the training automatically stopped when the validation dataset generalization (i.e., MSE value) stops improving. According to Okon et al. (2020), the weights and biases that yield the lowest error from the supervised datasets are the network best generalization. Also, this least MSE and correlation coefficient (R) values were the basis for selecting the best ANN architecture. In other words, a trial-and-error approach was used to achieve the number of neurons in the network hidden layer. Hence, the ANN architecture training was several times repeated to ensure that its random weights initialization did not result in the network’s inconsistent performance correlation.

Table 4 Training, validation and testing datasets of the finite and infinite-acting aquifers
Table 5 The basic ANN models training parameters

Results and discussion

Performance of the ANN models

Finite (Bounded) aquifer

The developed ANN models for predicting the dimensionless influx (\(W_{{{\text{eD}}}}\)) and dimensionless pressure (\(P_{D}\)) of edge-water and bottom-water drive finite (bounded) aquifer are three-layer feed-forward back-propagation networks. These developed ANN models architectures are established based on their best generalization performance with the training, validation and testing datasets. Table 6 shows the models architecture as well as their performance indexes, namely, mean square error (MSE) and correlation coefficient (R) values, for this reservoir-aquifer configuration. From this table, the ANN model for the edge-water drive has two neurons at the input layer, ten neurons at the hidden layer and two neurons at the output layer, thus, represented as 2–10–2. Also, the bottom-water drive ANN model has three neurons at the input layer, six neurons at the hidden layer and one neuron at the output layer (i.e., 3–6–1). Again, the developed ANN models performance plots that showed the training, validation, testing and overall predictions of these models to the edge-water and bottom-water drives datasets are visible in Figs. 2 and 3, respectively. For the edge-water drive ANN model, its predictions were very close to the actual \(W_{{{\text{eD}}}}\) and \(P_{D}\) datasets. This assertion is because of the MSE and R values obtained (Table 6) and the diagonal alignment of the output and target from the training, validation and testing datasets (Fig. 2). Besides, the overall performance of the edge-water ANN model prediction of \(W_{{{\text{eD}}}}\) and \(P_{D}\) data resulted in an R-value of 0.99983. This R-value further supported that the model predictions fit very close to the actual \(W_{{{\text{eD}}}}\) and \(P_{D}\) data for the edge-water drive finite aquifer. Just as in the developed edge-water model, the performance plots of the bottom-water drive ANN model as depicted in Fig. 3 implied that the neural network predictions were near-exact values of the \(W_{{{\text{eD}}}}\) datasets. The efficient predictions of this ANN model are visible in the MSE and R values obtained (Table 6), as well as the diagonal trend of the output-target datasets from the training, validation and testing in Fig. 3. Also, the robust predictions of this ANN model are visible in its overall performance R-value of 0.99978. Therefore, these empirical ANN models for the edge-water and bottom-water \(W_{{{\text{eD}}}}\) and \(P_{D}\) predictions based on the Levenberg–Marquardt learning algorithm are expanded in Eqs. 6 and 7. These equations predict the \(W_{{{\text{eD}}}}\) and \(P_{D}\) in the normalized forms which would be de-normalized using Eq. 8;

Table 6 Training, validation and testing performance of the finite aquifer ANN models
Fig. 2
figure 2

Edge-water drive finite aquifer ANN model performance plot

Fig. 3
figure 3

Bottom-water drive finite aquifer ANN model performance plot

Edge-water drive ANN model for finite aquifer:

$$ (W_{{{\text{eD}}}} ,P_{D} ) = \sum\limits_{j = 1}^{2} {\left[ {purelin\left\{ {\sum\limits_{i = 1}^{10} {\sum\limits_{j = 1}^{2} {tansig} \left[ {\left( {t_{D} j_{1} + r_{{{\text{eD}}}} j_{2} } \right)_{i} + b_{i} } \right]} } \right\} \times Lw_{i,j} + b_{{k_{i} }} } \right]} $$
(6)

Bottom-water drive ANN model for finite aquifer:

$$ (W_{{{\text{eD}}}} ) = \sum\limits_{j = 1}^{1} {\left[ {purelin\left\{ {\sum\limits_{i = 1}^{6} {\sum\limits_{j = 1}^{3} {tansig} \left[ {\left( {t_{D} j_{1} + r_{{{\text{eD}}}} j_{2} + z_{D} j_{3} } \right)_{i} + b_{i} } \right]} } \right\} \times Lw_{i,j} + b_{{k_{i} }} } \right]} $$
(7)
$$ \left( {W_{eD} ,P_{D} } \right)_{{{\text{denorm}}{.}}} = \left( {W_{eD} ,P_{D} } \right)_{{{\text{Min}}{.}}} + \left( {W_{eD} ,P_{D} } \right)\left[ {\left( {W_{eD} ,P_{D} } \right)_{{{\text{Max}}{.}}} - \left( {W_{eD} ,P_{D} } \right)_{{{\text{Min}}{.}}} } \right] $$
(8)

where \(\left( {W_{eD} ,P_{D} } \right)_{{{\text{denorm}}{.}}}\) would be the de-normalized \(W_{{{\text{eD}}}}\) or \(P_{D}\) value(s) predicted by the developed ANN models, \(\left( {W_{eD} ,P_{D} } \right)_{{{\text{Min}}{.}}}\) and \(\left( {W_{eD} ,P_{D} } \right)_{{{\text{Max}}{.}}}\) are the actual minimum and maximum values of \(W_{{{\text{eD}}}}\) or \(P_{D}\), and \(\left( {W_{{{\text{eD}}}} ,P_{D} } \right)\) is the normalized value(s) of \(W_{{{\text{eD}}}}\) or \(P_{D}\) predicted by the ANN models.

Equations 6 and 7 are the trained artificial neural network (ANN) models for predicting \(W_{eD}\) and \(P_{D}\) of edge-water and bottom-water finite aquifers. In these equations, the variables \(j_{1}\), \(j_{2}\) and \(j_{3}\) are input weights connected to the inputs data \(t_{D}\), \(r_{{{\text{eD}}}}\) and \(z_{D}\), respectively, from the input layer to the hidden layer neurons. The variable \(Lw_{i,j}\) is hidden layer weights attached to the output layer neuron(s), as the variables \(b_{i}\) and \(b_{k}\) are biases connected to the hidden layer neurons and output layer neuron(s), respectively. These variables values for the developed edge-water and bottom-water finite aquifer ANN models are as shown in Tables 7 and 8, respectively. Also, in these equations, the activation function ‘tansig’ transformed the combined inputs (\(t_{D}\), \(r_{{{\text{eD}}}}\) and \(z_{D}\)), weights (\(j_{1}\), \(j_{2}\) and \(j_{3}\)) and bias (\(b_{i}\)), that is, output(s) of the hidden layer neurons to the output layer neuron(s) and the ‘purelin’ function converts the estimated outcome of the output layer neuron(s) to established the linear relationship between the input and output datasets.

Table 7 Weights and biases of the developed edge-water ANN model
Table 8 Weights and biases of the developed bottom-water ANN model

Infinite-acting aquifer

The developed ANN models for edge-water and bottom-water infinite-acting aquifers are also a three-layer feed-forward back-propagation (FFBP) neural network. As earlier mentioned that because of the large values of \(t_{D}\) and \(W_{{{\text{eD}}}}\) datasets of the edge-water and bottom-water infinite-acting aquifer, the data were grouped into five sets. Hence, five ANN models, as indicated in Table 9, were trained to handle the predictions of \(W_{{{\text{eD}}}}\) and \(P_{D}\) for this reservoir-aquifer geometry. The developed edge-water drive ANN models have one neuron at the input layer, five neurons at the hidden layer and two neurons at the output layer (i.e., 1–5–2), while the bottom-water drive models have two neurons at the input layer, 8 neurons at the hidden layer and one neuron at the output layer (i.e., 2–8–1). Table 9 depicts the training, validation and testing performance, that is, MSE and R values of the developed models. From these MSE and R values and the performance plots obtained for the various ANN models, they showed that the models’ predictions were close to the actual \(W_{{{\text{eD}}}}\) and \(P_{D}\) datasets. Figures 4 and 5 are the plots showing the training, validation, testing and overall performance of the edge-water ANN Model-1 and bottom-water ANN Model-1, respectively. Interestingly, the similar trends were obtained for other models (i.e., model-2–5). Therefore, the overall performance of the five ANN models for the edge-water infinite-acting aquifer resulted in R values of 0.99994, 1.0, 0.99995, 0.99999 and 0.99997, while that of bottom-water resulted in R values of 1.0, 1.0, 1.0, 1.0 and 0.99959. These MSE and R values obtained, and the output-target diagonal trends of the ANN models overall performance implied that the developed models would handle the predictions of \(W_{{{\text{eD}}}}\) and \(P_{D}\) for edge-water and bottom-water infinite-acting aquifers. The mathematical equations of the developed ANN models for edge-water and bottom-water infinite-acting aquifers are as shown in Eqs. 9 and 10. The variables \(j_{1}\), \(j_{2}\), \(j_{3}\), \(Lw_{i,j}\), \(b_{i}\), \(b_{k}\), and activation functions: tansig and purelin, in these equations are the same as described for Eqs. 6 and 7. Tables 10, 11, 12, 13 and 14 depict the values of these variables in the developed edge-water infinite-acting aquifer models, while Tables 15, 16, 17, 18 and 19 present the values for the bottom-water infinite-acting aquifer models.

Table 9 Training, validation and testing performance of the infinite-acting aquifer ANN models
Fig. 4
figure 4

Edge-water drive infinite-acting aquifer ANN model (Model-1) performance plot

Fig. 5
figure 5

Bottom-water drive infinite-acting aquifer ANN model (Model-1) performance plot

Table 10 Weights and biases of the developed edge-water ANN Model-1
Table 11 Weights and biases of the developed edge-water ANN Model-2
Table 12 Weights and biases of the developed edge-water ANN Model-3
Table 13 Weights and biases of the developed edge-water ANN Model-4
Table 14 Weights and biases of the developed edge-water ANN Model-5
Table 15 Weights and biases of the developed bottom-water ANN Model-1
Table 16 Weights and biases of the developed bottom-water ANN Model-2
Table 17 Weights and biases of the developed bottom-water ANN Model-3
Table 18 Weights and biases of the developed bottom-water ANN Model-4
Table 19 Weights and biases of the developed bottom-water ANN Model-5

Edge-water drive ANN model for infinite-acting aquifer:

$$ (W_{{{\text{eD}}}} ,P_{D} ) = \sum\limits_{j = 1}^{2} {\left[ {purelin\left\{ {\sum\limits_{i = 1}^{5} {\sum\limits_{j = 1}^{1} {tansig} \left[ {\left( {t_{D} j_{1} } \right)_{i} + b_{i} } \right]} } \right\} \times Lw_{i,j} + b_{{k_{i} }} } \right]} $$
(9)

Bottom-water drive ANN model for infinite-acting aquifer:

$$ \left( {W_{{{\text{eD}}}} } \right) = \sum\limits_{j = 1} {\left[ {purelin\left\{ {\sum\limits_{i = 1}^{8} {\sum\limits_{j = 1}^{2} {tansig\left[ {\left( {t_{D} j_{1} + z_{D} j_{2} } \right) + b_{i} } \right]} } } \right\} \times Lw_{i,j} + b_{k} } \right]} $$
(10)

In all the developed empirical ANN models, the general steps of the models to predict \(W_{{{\text{eD}}}}\) and \(P_{D}\) for the finite (bounded) and infinite-acting aquifers require providing the model(s) input parameters: \(t_{D}\), \(r_{{{\text{eD}}}}\) and \(z_{D}\), values. For example, in the developed edge-water finite aquifer ANN model to predict \(W_{{{\text{eD}}}}\) and \(P_{D}\), the input parameters (i.e., \(t_{D}\) and \(r_{{{\text{eD}}}}\)) values provided to the input layer neurons will multiply with their input layer weights: \(j_{1}\) and \(j_{2}\), respectively. In the first neuron of the hidden layer, the sum of these inputs to the neuron and its bias (\(b_{i}\)), that is, \(\sum\nolimits_{i = 1} {\sum\nolimits_{j = 1}^{2} {\left( {t_{D} j_{1} + r_{eD} j_{2} } \right)_{1} + b_{1} } }\) is estimated using the values for \(i\) = 1 in Table 7, where \(j_{1}\) = −0.93605, \(j_{2}\) = −1.0019441 and \(b_{1}\) = 1.893821, and their appropriate \(t_{D}\) and \(r_{{{\text{eD}}}}\) values. This summation is transformed by the transfer function ‘tansig’ and connected to the output layer neurons. Again, the outcome from the first neuron in the hidden layer, \(\sum\nolimits_{i = 1} {\sum\nolimits_{j = 1}^{2} {\tan sig\left( {t_{D} j_{1} + r_{eD} j_{2} } \right)_{1} + b_{1} } }\) is multiplied with the hidden layer weights (\(Lw_{i,j}\)) and connected to the output neurons. Thus, this input \(\sum\nolimits_{i = 1} {\sum\nolimits_{j = 1}^{2} {\left[ {\tan sig\left( {t_{D} j_{1} + r_{eD} j_{2} } \right)_{1} + b_{1} } \right]} \times Lw_{i,j} }\) combined with the output biases (\(b_{{k_{i} }}\)) in the output neurons. The outlined computations procedures are followed by other neurons (i.e., \(i = 2,3 \ldots 10\)) in the hidden layer using their corresponding input weights (\(j_{1}\), \(j_{2}\)) and biases (\(b_{i}\)) from the input layer neurons, and weights (\(Lw_{i,j}\)) to link with the output layer neurons. The total summations in the output layer neurons, that is, \( \sum\nolimits_{{j = 1}}^{2} {\left[ {purelin\left\{ {\sum\nolimits_{{i = 1}}^{{10}} {\sum\nolimits_{{j = 1}}^{2} {tansig\left[ {\left( {t_{D} j_{1} + r_{{eD}} j_{2} } \right)_{i} + b_{i} } \right]} } } \right\} \times Lw_{{i,j}} + b_{{k_{i} }} } \right]} \) are transformed by the output layer transfer function ‘purelin’ and presented as the ANN model prediction(s) in the normalized form. Then, the ANN model predicted outputs (i.e., \(W_{{{\text{eD}}}}\) and \(P_{D}\)) would be de-normalized using Eq. 8.

Generalization of the developed ANN models predictions and comparison with other existing models

The robustness of any developed model is its ability to predict with new datasets to justify its generalization as well as application. As earlier pointed out, the used of vEH approach for estimation of water influx into the reservoir(s) is achieved by table look-up. Most times, this required interpolation between two data points to obtain the corresponding \(W_{{{\text{eD}}}}\), as the calculated \(t_{D}\) (even \(r_{{{\text{eD}}}}\) and \(z_{D}\)) may not be the exact value as provided in the vEH table. Therefore, the developed ANN models generalization potential was determined using randomly generated \(t_{D}\) datasets based on their corresponding \(r_{{{\text{eD}}}}\) and \(z_{D}\). The ANN models predicted \(W_{{{\text{eD}}}}\) were compared with the Lagrangian interpolation approach (Eq. 11) estimated \(W_{{{\text{eD}}}}\). Again, the edge-water infinite-acting models predicted \(W_{{{\text{eD}}}}\) were compared with the Edwardson et al. (1962) polynomial (Eqs. 12 and 13) estimated \(W_{{{\text{eD}}}}\). Also, the developed models (for finite and infinite-acting edge-water aquifers) predicted \(P_{D}\) were compared with some existing models, namely, Chatas (1953), Edwardson et al. (1962), Lee (1982) and Fanchi (1985) estimations. The generalization performance of these developed models was determined using some statistical measures, like, coefficient of determination (R2), R, MSE, root-mean-square error (RMSE) and average absolute relative error (AARE) and cross-plots.

$$ W_{{{\text{eD}}}} (t_{D} ) = W_{{{\text{eD}}}} (t_{{D_{1} }} )\left[ {\frac{{t_{{D_{2} }} - t_{D} }}{{t_{{D_{2} }} - t_{{D_{1} }} }}} \right] + W_{{{\text{eD}}}} (t_{{D_{2} }} )\left[ {\frac{{t_{D} - t_{{D_{1} }} }}{{t_{{D_{2} }} - t_{{D_{1} }} }}} \right] $$
(11)

where \(W_{{{\text{eD}}}} (t_{D} )\) is the required dimensionless influx at estimated \(t_{D}\), \(W_{{{\text{eD}}}} (t_{{D_{1} }} )\) and \(W_{{{\text{eD}}}} (t_{{D_{2} }} )\) are the look-up dimensionless influx at \(t_{{D_{1} }}\) and \(t_{{D_{2} }}\), respectively, from vEH table.

Edwardson et al. (1962) polynomial:

  1. (i).

    For \(0.01 < t_{D} < 200\)

    $$ W_{{{\text{eD}}}} = \frac{{\left( {1.2838\sqrt {t_{D} } + 1.19328t_{D} + 0.269872\left( {t_{D} } \right)^{1.5} + 8.55294 \times 10^{ - 3} \left( {t_{D} } \right)^{2} } \right)}}{{\left( {1 + 0.616599\sqrt {t_{D} } + 0.0413008t_{D} } \right)}} $$
    (12)
  2. (ii).

    For \(t_{D} > 200\)

    $$ W_{{{\text{eD}}}} = \frac{{ - 4.29881 + 2.02566t_{D} }}{{\ln \left( {t_{D} } \right)}} $$
    (13)

Lee (1982) equation:

$$ P_{D} = a_{1} + a_{2} \ln (t_{D} ) + a_{3} [\ln (t_{D} )]^{2} + a_{4} [\ln (t_{D} )]^{3} + a_{5} t_{D} + a_{6} (t_{D} )^{2} + a_{7} (t_{D} )^{3} + \frac{{a_{8} }}{{t_{D} }} $$
(14)

where the values for \(a_{1}\) through \(a_{8}\) are: \(a_{1}\) = 0.8085064, \(a_{2}\) = 0.29302022, \(a_{3}\) = 0.035264177, \(a_{4}\) = −1.4036304 × 10–3, \(a_{5}\) = −4.7722225 × 10–4, \(a_{6}\) = 5.1240532 × 10–7, \(a_{7}\) = 2.3033017 × 10–10 and \(a_{8}\) = −2.6723117 × 10–3.

Fanchi (1985) equation:

$$ P_{D} = a_{0} + a_{1} t_{D} + a_{2} \ln (t_{D} ) + a_{3} [\ln (t_{D} )]^{2} $$
(15)

where the values for \(a_{0}\) through \(a_{3}\) in the Fanchi (1985) equation varied depending on the \(r_{{{\text{eD}}}}\) value (i.e., 1.5, 2.0, 3.0, 4.0, 5.0, 6.0, 8.0, 10.0 and ∞) considered for the equation. The values for the constants \(a_{0}\) through \(a_{3}\) are available in Ahmed and McKinney (2005).

Finite aquifer models

The generalization performance of the developed edge-water and bottom-water finite aquifer models was determined using 105 and 213, respectively, randomly generated \(t_{D}\) datasets. Table 20 and Figs. 6, 7 and 8 present the generalization and comparison performance of these developed ANN models for the finite aquifer. From the table, the statistical measure results and the cross-plots in Figs. 6 and 8, the developed ANN models predicted \(W_{{{\text{eD}}}}\) were very close to the Lagrangian interpolated \(W_{{{\text{eD}}}}\). These models close predictions with the Lagrangian interpolation are visible in the diagonal trends of the data points along a unit slope (i.e., 45° from the origin). According to Al-Bulushi et al. (2009), this observation means a good agreement between the ANN models predicted \(W_{eD}\) and Lagrangian interpolated \(W_{{{\text{eD}}}}\). Hence, the statistical measures of these developed models performance resulted in R2, R, MSE, RMSE and AARE of 0.9984, 0.9992, 0.3496, 0.5913 and 0.2414 for edge-water ANN model and 0.9993, 0.9996, 0.1863, 0.4316 and 0.2215 for bottom-water model. This implies that the developed ANN models would conveniently predict \(W_{{{\text{eD}}}}\) for the edge-water and bottom-water finite aquifer with about 99% accuracy without any vEH table look-up. Also, comparing the developed edge-water ANN model predicted \(P_{D}\) with some existing models: Chatas (1953) and Fanchi (1985) equations for evaluating \(P_{D}\) showed that the developed ANN model predictions fit close to the Chatas (1953) and Fanchi (1985) models estimated \(P_{D}\). A comparison of the ANN model predicted \(P_{D}\) with the existing models estimated \(P_{D}\) resulted in the statistical performance of R2 of 0.9985, R of 0.9993, MSE of 0.0125, RMSE of 0.1117 and AARE of 0.0678 with Chatas (1953) model and R2, R, MSE, RMSE and AARE of 0.9863, 0.9931, 0.1411, 0.3756 and 0.2310, respectively, with Fanchi (1985) equation. Therefore, these developed ANN models can predict \(W_{{{\text{eD}}}}\) and \(P_{D}\) for edge-water and the bottom-water finite aquifer that could be used to estimate water influx into the reservoir using van Everdingen–Hurst approach or Carter–Tracy method.

Table 20 Statistical performance of the developed finite aquifer ANN models with other models
Fig. 6
figure 6

Comparison of edge-water finite aquifer ANN model predicted \(W_{eD}\) with Lagrangian interpolation

Fig. 7
figure 7

Comparison of edge-water finite aquifer ANN model predicted \(P_{D}\) with other models

Fig. 8
figure 8

Comparison of bottom-water finite aquifer ANN model predicted \(W_{eD}\) with Lagrangian interpolation

Infinite-acting aquifer

The developed edge-water and bottom-water infinite-acting aquifer ANN models generalization capacity was determined using 106 and 735 randomly generated \(t_{D}\) datasets. These generated \(t_{D}\) datasets were divided to fall within the five models (Model-1–5) data ranges in Table 3. Table 21 presents the statistical performance of the developed edge-water and bottom-water infinite-acting aquifer models predictions (i.e., \(W_{{{\text{eD}}}}\) and \(P_{D}\)) with the new \(t_{D}\) datasets when compared with Lagrangian interpolation for \(W_{{{\text{eD}}}}\) and Chatas (1953) estimation for \(P_{D}\). The R2 and R values obtained for these ANN models indicate close predictions between the ANN models and these estimation methods. Furthermore, the developed infinite-acting aquifer ANN models overall generalization potentials were compared with other existing models. The results of this evaluation are visible in Table 22 as well as cross-plots in Figs. 9, 10 and 11. For the edge-water, the developed ANN model predictions were very close to Lagrangian interpolation and Edwardson et al. (1962) polynomial estimation of \(W_{{{\text{eD}}}}\); this closeness is noted in the unit slope in Fig. 9. Again, the efficient performance of this ANN model is seen in its predicted \(P_{D}\) values, as they are comparable to Edwardson et al. (1962), Fanchi (1985) and Lee (1982) models estimated \(P_{D}\). Also, in Table 22, the overall performance of the bottom-water infinite-acting model is close to the Lagrangian interpolation for \(W_{{{\text{eD}}}}\). In other words, these developed ANN models will predict \(W_{{{\text{eD}}}}\) and \(P_{D}\) that could be functional for the calculation of water influx in edge-water and bottom-water infinite-acting reservoir.

Table 21 Statistical performance of the developed infinite-acting aquifer ANN models
Table 22 Statistical performance of the overall edge- and bottom-water infinite-acting aquifer ANN models with other models
Fig. 9
figure 9

Comparison of edge-water infinite-acting aquifer ANN model predicted \(W_{eD}\) with other models

Fig. 10
figure 10

Comparison of edge-water infinite-acting aquifer ANN model predicted \(P_{D}\) with other models

Fig. 11
figure 11

Comparison of bottom-water infinite-acting aquifer ANN model predicted \(W_{eD}\) with Lagrangian interpolation

In all, the advantages and novelty of the developed ANN models over some existing models are as follows:

  1. 1.

    Compared to Nashawi and Elkamel (1999) ANN models that are multiple-inputs single-output (MISO), the developed models in this study for \(W_{{{\text{eD}}}}\) and \(P_{D}\) prediction for edge-water drive finite and infinite-acting aquifers are multiple-inputs multiple-outputs (MIMO) ANN models.

  2. 2.

    Unlike existing models, namely Lee (1982), Fanchi (1985), etc., that are correlation range limited and do not provide values for all the aquifer sizes presented by vEH, the developed ANN models cover the entire vEH datasets. Again, the ANN models can predict the \(W_{{{\text{eD}}}}\) and \(P_{D}\) values for aquifer sizes that are not provided by vEH but are within the \(r_{eD}\) and \(z_{D}\) data ranges.

  3. 3.

    The challenges of table look-up and interpolation between time entries and preloading of the vEH data table into reservoir engineering software are taken care of with the developed ANN models for water influx calculation.

The application of the developed ANN models in water influx calculations in reservoir engineering software

Any model be it empirical, analytical or numerical is developed to drive home some or specific problems. According to Okon et al. (2020), any developed model is useless if it lacks an area of application. Therefore, these developed ANN models would be useful for the prediction of dimensionless influx (\(W_{{{\text{eD}}}}\)) and dimensionless pressure (\(P_{D}\)) for the calculation of water influx (\(W_{e}\)) into the reservoir when incorporated into the software. Worthy to note that the available reservoir engineering software, like, ECLIPSE-100, PETREL, RESOLVE, etc., calculate water influx into the reservoir using an approximate solution to diffusivity equation—Carter–Tracy model or pseudo-state aquifer productivity index—Fetkovich model. The limitations of these approaches are that Fetkovich model applies to finite aquifer edge-water and bottom-water drive reservoirs, as Carter-Tracy aquifer model applies to edge-water drive reservoir finite-acting and infinite-acting aquifers (Okotie and Ikporo 2019). Again, Fetkovich approach underestimates the water influx into the reservoir, while Carter–Tracy method overestimates the water influx. Regrettably, the exact solution to diffusivity equation—van Everdingen–Hurst model that applies for both finite and infinite-acting aquifers edge-water and bottom-water drive reservoirs is limited in the reservoir engineering software. The reason is because of the challenge of table look-up in most \(t_{D}\) entries to interpolate for \(W_{{{\text{eD}}}}\). In this direction, intelligent models that predict \(W_{{{\text{eD}}}}\) for the calculation of \(W_{e}\) using vEH approach were necessary. The conceptual schematic of the developed ANN models for reservoir engineering software for the prediction of \(W_{{{\text{eD}}}}\) and \(P_{D}\), and then the calculation of \(W_{e}\) using vEH or Carter-Tracy method is as shown in Fig. 12. The software takes basic reservoir-aquifer properties and computes reservoir-aquifer dimensionless variables, namely, \(t_{D}\), \(r_{{{\text{eD}}}}\), \(z_{D}\) and \(P^{\prime}_{D}\). It then checks the computed \(r_{{{\text{eD}}}}\) to determine the aquifer outer boundary to establish the aquifer type, that is, finite aquifer or infinite-acting aquifer. The computer program further assesses the computed \(z_{D}\) to determine the reservoir drive type: edge-water or bottom-water, and the ANN model to be used for the prediction. For instance, if the mentioned conditions established finite aquifer edge-water drive, then the developed ANN model takes input variables (i.e., computed \(t_{D}\) and \(r_{{{\text{eD}}}}\)) to predict \(W_{{{\text{eD}}}}\) and \(P_{D}\). Then, vEH or Carter–Tracy model could be used to estimate the \(W_{e}\) into the reservoir. Again, if the conditions established infinite-acting aquifer bottom-water drive, the developed ANN model takes the computed input variables (i.e., \(t_{D}\) and \(z_{D}\)) and used the \(t_{D}\) value to check for the appropriate ANN model’s weights and biases to predict the \(W_{{{\text{eD}}}}\). Hence, the \(W_{e}\) into the reservoir can be determined using vEH method.

Fig. 12
figure 12

Software conceptual schematic of the developed ANN models

Conclusion

Water influx calculation is an indispensable evaluation in reservoir engineering as an alternative method to assess reservoir underground withdrawal performance. The most reliable approach by vEH-based models is hindered by table look-up and interpolation between time entries to determine the reservoir-aquifer dimensionless variables: influx and pressure. In this study, the feed-forward back-propagation (FFBP) artificial neural network (ANN) models were developed for the edge- and bottom-water drive finite and infinite-acting aquifers. The developed models are reproducible, as their weights and biases required to replicate the models are visible in this work. From the performance of these developed ANN models, the following conclusions can be drawn:

  1. 1.

    the developed ANN models predicted reservoir-aquifer dimensionless variables: \(W_{{{\text{eD}}}}\) and \(P_{D}\) were fitted to the original vEH-based model datasets, as the overall performance R values obtained for the edge- and bottom-water finite aquifer models are 0.99983 and 0.99978, while edge- and bottom-water infinite-acting aquifer models are 0.99992 and 0.99997, respectively;

  2. 2.

    the developed finite aquifer ANN models predicted \(W_{{{\text{eD}}}}\) values are comparable with the Lagrangian interpolation approach, as the obtained statistical measures between them resulted in the R2 of 0.9984, R of 0.9992, MSE of 0.3496, RMSE of 0.5913 and AARE of 0.2414 for edge-water drive model, while bottom-water drive model resulted in the R2, R, MSE, RMSE and AARE of 0.9993, 0.9996, 0.1863, 0.4316 and 0.2215, respectively;

  3. 3.

    for the infinite-acting aquifer models, their predicted \(W_{{{\text{eD}}}}\) and Lagrangian interpolated \(W_{{{\text{eD}}}}\) resulted in the R2, R, MSE, RMSE and AARE of 0.9999, 0.9999, 0.5447, 0.7380 and 0.2329 for edge-water drive model (Model-1); with an overall performance R2 value of 0.9999, R of 0.9999, MSE of 6.329 × 1016, RMSE of 2.516 × 108 and AARE of 3.601 × 107. The bottom-water model (Model-1) had R2 of 0.9999, R of 0.9999, MSE of 0.2299, RMSE of 0.4795 and AARE of 0.1282; with an overall performance R2 value of 0.9999, R of 0.9999, MSE of 5.718 × 1016, RMSE of 2.390 × 108 and AARE of 7.0 × 107;

  4. 4.

    again, the developed edge-water infinite-acting aquifer ANN model predicted \(W_{{{\text{eD}}}}\) were comparable with Edwardson et al. (1962) polynomial estimated \(W_{{{\text{eD}}}}\), as the obtained R2 value of 0.9996, R of 0.9998, MSE of 4.740 × 10–4, RMSE of 2.178 × 10–2 and AARE of 1.470 × 10–2 for Model-1; and overall performance of R2, R, MSE, RMSE and AARE of 0.9999, 0.9999, 8.367 × 1015, 9.147 × 107 and 2.702 × 107, respectively;

  5. 5.

    the developed edge-water ANN models predicted \(P_{D}\) were closed to some existing models, namely Chatas (1953), Edwardson et al. (1962), Lee (1982) and Fanchi (1985) estimations, as the overall statistical performance: R2, R, MSE, RMSE and AARE obtained for finite aquifer were 0.9985, 0.9993, 1.250 × 10–2, 0.1117 and 6.780 × 10–2 with Chatas (1953) model and 0.9863, 0.9931, 0.1411, 0.3756 and 0.2310 with Fanchi (1985) equation, while the infinite-acting aquifer was 0.9999, 0.9999, 0.1750, 1.325 × 10–2 and 7.333 × 10–3 with Edwardson et al. (1962) polynomial, then 0.9865, 09,933, 1.425 × 10–2, 0.1194 and 8.310 × 10–2 with Lee (1982) model, and 0.9991, 0.9996, 1.079 × 10–3, 3.284 × 10–2 and 2.823 × 10–2 with Fanchi (1985) model;

  6. 6.

    the developed ANN models would perform better than the existing models since they are not correlation range limited and can predict the dimensionless variables: \(W_{{{\text{eD}}}}\) and \(P_{D}\), for aquifer sizes not provided in the vEH datasets.

For future work, as observed in the infinite-acting aquifer edge- and bottom-water models prediction, developing hybridize ANN models that minimize this aquifer-type dimensionless variables prediction error is important for water influx calculation in infinite-acting aquifers.