Forecasting of Landslide Displacement Using a Probability-Scheme Combination Ensemble Prediction Technique

Ma, Junwei; Liu, Xiao; Niu, Xiaoxu; Wang, Yankun; Wen, Tao; Zhang, Junrong; Zou, Zongxing

doi:10.3390/ijerph17134788

Open AccessArticle

Forecasting of Landslide Displacement Using a Probability-Scheme Combination Ensemble Prediction Technique

¹

Three Gorges Research Center for Geo-Hazards of the Ministry of Education, China University of Geosciences, Wuhan 430074, China

²

Faculty of Engineering, China University of Geosciences, Wuhan 430074, China

³

School of Geosciences, Yangtze University, Wuhan 430100, China

^*

Author to whom correspondence should be addressed.

Int. J. Environ. Res. Public Health 2020, 17(13), 4788; https://doi.org/10.3390/ijerph17134788

Submission received: 9 June 2020 / Revised: 29 June 2020 / Accepted: 30 June 2020 / Published: 3 July 2020

(This article belongs to the Special Issue 2nd Edition of Health Emergency and Disaster Risk Management (Health-EDRM))

Download

Browse Figures

Versions Notes

Abstract

:

Data-driven models have been extensively employed in landslide displacement prediction. However, predictive uncertainty, which consists of input uncertainty, parameter uncertainty, and model uncertainty, is usually disregarded in deterministic data-driven modeling, and point estimates are separately presented. In this study, a probability-scheme combination ensemble prediction that employs quantile regression neural networks and kernel density estimation (QRNNs-KDE) is proposed for robust and accurate prediction and uncertainty quantification of landslide displacement. In the ensemble model, QRNNs serve as base learning algorithms to generate multiple base learners. Final ensemble prediction is obtained by integration of all base learners through a probability combination scheme based on KDE. The Fanjiaping landslide in the Three Gorges Reservoir area (TGRA) was selected as a case study to explore the performance of the ensemble prediction. Based on long-term (2006–2018) and near real-time monitoring data, a comprehensive analysis of the deformation characteristics was conducted for fully understanding the triggering factors. The experimental results indicate that the QRNNs-KDE approach can perform predictions with perfect performance and outperform the traditional backpropagation (BP), radial basis function (RBF), extreme learning machine (ELM), support vector machine (SVM) methods, bootstrap-extreme learning machine-artificial neural network (bootstrap-ELM-ANN), and Copula-kernel-based support vector machine quantile regression (Copula-KSVMQR). The proposed QRNNs-KDE approach has significant potential in medium-term to long-term horizon forecasting and quantification of uncertainty.

Keywords:

landslide displacement; predictive uncertainty; ensemble prediction; probability combination scheme; quantile regression neural networks (QRNNs); kernel density estimation (KDE)

1. Introduction

As one of the most common natural hazards in the world, landslides pose a significant threat to public health and safety. According to statistics, landslides have affected 4.8 million people and caused 18,275 deaths during the period of 2009–2019 [1]. Landslide displacement prediction, which provides the necessary information to determine the extent of ongoing hazard, has proven to be the most cost-saving risk reduction measure [2,3,4]. However, landslide displacement prediction is complex and remains a key challenge in natural hazard research. This challenge arises because landslides are nonlinear, dynamic systems, and the associated movements can be induced by different causes, such as geological factors [5], hydrological factors [6,7], morphological factors, and human activities [4,8].

A large number of efforts in the literature have focused on the precise prediction of landslide displacement [9]. Currently, approaches used for landslide displacement prediction are categorized as physical modelling approaches and data-driven approaches [10]. Physical models (also known as white-box models), which rely on detailed descriptions of landslide mechanism processes, can provide clear physical explanations of landslides. The commonly used physical models include the tertiary creep model [11], the Hayashi model [12], and the general creep model [13]. Those physical models require numerous expensive geotechnical characterizations of the materials involved in landslides and therefore may be applicable only in limited cases [14].

Data-driven models differ from physical models because a characterization of the actual landslide mechanism processes is not fully required. Thus, the data-driven models are also known as black-box models. The main advantage of data-driven models is that the trained models can be easily updated on the basis of new and more recent data.

Data-driven models include but are not limited to statistical methods, artificial neural networks (ANNs), support vector machines (SVMs) [15], and extreme learning machines (ELMs) [16]. Owing to their capacity to approximate arbitrary, nonlinear, and dynamic systems with high precision, data-driven models achieve good model performance in the prediction of landslide displacement.

Despite their widespread application, the output of most existing data-driven models is a single estimate for each prediction horizon. These single estimates, which provide deterministic values, are referred to as point predictions [3]. The defining characteristic of a point prediction is its accessibility with regard to understanding and operation. The main drawback of point prediction is that it only provides the prediction error, with no information regarding the associated predictive uncertainties, which limits the use of point prediction in decision-making applications.

The predictive uncertainties consisting primarily of input uncertainty, parameter uncertainty, and model uncertainty could be substantial. It is highly desirable to know the degree of uncertainty that is associated with a particular point prediction and convert the point prediction into informative resources for emergency landslide risk management [3,17]. Only limited studies have examined the quantification of uncertainty associated with landslide displacement prediction by constructing prediction intervals (PIs). The output of a PI is an interval composed of upper and lower bounds, where we expect the predictive value of the series to fall within some (prespecified) probability, which is deemed the PI nominal confidence (PINC). A hybrid approach based on an echo state network and mean-variance estimation was proposed by Yao et al. [18] to measure the uncertainty in landslide deformation prediction and perform interval prediction. A bootstrap-based approach was proposed by Ma et al. [4] to perform interval prediction of landslide displacement. Wang et al. [2] proposed a direct interval prediction using least squares support vector machines or the construction of PIs of landslide displacement. Kernel-based support vector machine quantile regression (KSVMQR) was utilized in [3] for quantification of the predictive uncertainty of landslide displacement.

However, the traditional methods have certain disadvantages in displacement prediction and quantification of predictive uncertainty. For example, the bootstrap-based approach requires significantly high computational costs, especially for large datasets [2]. Additionally, the performances of SVM-based approaches are sensitive to the choice of kernel type and parameter values [19]. Therefore, more efforts still need to be made for the improvement of prediction performance and quantification of the predictive uncertainty.

Ensemble prediction, a state-of-the-art artificial intelligence technique, aims to improve prediction robustness and accuracy and uncertainty quantification [20,21]. Ensemble prediction has been successfully applied in a variety of fields, including prediction performance improvement and uncertainty quantification of remaining useful life [22], bankruptcy [23], shear capacity of reinforced-concrete deep beams [24], residential electricity consumption [25], wind power [26], flood susceptibility [27,28], and landslide susceptibility [29].

In this study, a probability-scheme combination ensemble prediction that employs quantile regression neural networks and kernel density estimation (QRNNs-KDE) was proposed for robust and accurate prediction and uncertainty quantification of landslide displacement. The Fanjiaping landslide with long-term and near real-time monitoring data was selected as a case study to explore the performance of the QRNNs-KDE approach. The deformation characteristics were clarified for fully understanding the triggering factors.

2. Methodology

2.1. Description of Uncertainty Sources

Predictive uncertainty in data-driven models consists primarily of input uncertainty, parameter uncertainty, and model uncertainty [30,31,32].

The input uncertainty is related to the input data uncertainty and the input variable section uncertainty. The input data uncertainty is primarily due to measurement and sampling error and environmental noise. The input variable section uncertainty accounts for uncertainty inherent in the selection of input variables from the candidate data set. For physical models, the required inputs are pre-determined, being consistent with considered rheological models. However, for data-driven models, the selection of input variables is problem-dependent and cannot be determined in advance. Only major and relevant variables are selected as final inputs to train the data-driven model. The selection of the variables to include in a data-driven model from the original data set is inherently uncertain, especially when the input candidate pool is very large. For example, in data-driven models that utilize decomposition algorithms, only a portion of the decomposed sub-components are selected as input variables. The candidate input pool, which consists of sub-components, increases very quickly with the decomposition level and potentially increases the input variable selection uncertainty.

The parameter uncertainty refers to the uncertainty in the model parameter vector and mainly arises from the inability to identify a unique set of best parameters for the model [33].

Model uncertainty arises primarily from the model structure uncertainty and model error. Model structure uncertainty is associated with the specific model setting of learning algorithms, such as the polynomial order in polynomial regression models, the number of hidden nodes in an ANN or ELM, and the type of kernel function in an SVM. The input uncertainty may also account for model structure uncertainty, because different input variables “automatically" produce different model structures. Model error refers to the difference between two model estimates with respect to the corresponding target and is caused by the inability to reproduce the real processes.

2.2. Ensemble Prediction

Ensemble prediction is not a specific learning algorithm but a strategic combination of multiple predictions into a single output with a model combination process [21]. Based on the selection of the learning algorithm, ensemble prediction models can be further classified into homogeneous and heterogeneous ensemble models (Figure 1). A homogeneous ensemble model generates multiple learners with the same learning algorithm on different training datasets, which are produced by manipulating the original training data (schematic illustrated in Figure 1a). Bootstrap aggregation, also known as bagging for short, is the most straightforward and widely used method of manipulating the training dataset. By contrast, a heterogeneous ensemble model generates multiple learners with different learning algorithms on the same training data set (schematic illustrated in Figure 1b).

The base learner combination is the main step in the ensemble prediction model. Summation and averaging are simple combination schemes. A more general approach involves assigning a weight to each base learner. In the present study, a heterogeneous ensemble model was built based on QRNNs and KDE. QRNNs serve as base learning algorithms to produce multiple base learners, and the probability combination scheme based on KDE is used to combine the base learners into the final ensemble prediction.

2.3. Quantile Regression Neural Network

2.3.1. Quantile Regression

Quantile regression is a common statistical technique for conducting inferences concerning conditional quantile functions [34,35]. More formally, any real-valued random variable Y may be characterized by its distribution function as follows:

F (y) = Prob (Y \leq y)

(1)

whereas for any

0 < τ < 1

,

Q (τ) = \inf {y : F (y) \geq τ}

(2)

is called the

τ th

quantile of Y.

Given a data set

(x_{i} (t), Y (t))

for

i = 1, 2, \dots, I

and

t = 1, 2, \dots, N

, the linear quantile regression can be expressed as follows:

{\hat{Y}}_{τ} (t) = \sum_{i = 1}^{I} θ_{i} x_{i} (t) + b

(3)

where

0 < τ < 1

is the quantile, and b is an error with zero expectation.

The estimated parameters

θ_{i}

can be approximated by minimizing a sum of the asymmetrically weighted absolute residual cost functions, which are expressed as follows:

E_{τ} = \frac{1}{N} \sum_{t = 1}^{N} ρ_{τ} (Y (t) - {\hat{Y}}_{τ} (t))

(4)

where

Y (t)

is the observation at time t and

ρ_{τ}

is the check function, which is also known as the pinball loss function and is defined as follows:

ρ_{τ} (x) = {\begin{array}{l} τ x & if x \geq 0 \\ (τ - 1) x & if x < 0 \end{array}

(5)

2.3.2. Quantile Regression Neural Network

Given inputs

x_{i} (t)

and an output

Y (t)

, the output from a QRNN is calculated as follows:

Consider a hidden-layer transfer function

h (\cdot)

; the output from the j-th hidden-layer node

g_{j} (t)

is given by applying the hidden-layer transfer function to the inner product between

x_{i} (t)

and hidden-layer weights

w_{i j}^{(h)}

plus the hidden-layer bias

b_{j}^{(h)}

, which can be calculated as follows:

g_{j} (t) = h (\sum_{i = 1}^{I} x_{i} (t) w_{i j}^{(h)} + b_{j}^{(h)})

(6)

An estimate of the conditional

τ

-quantile

{\hat{y}}_{τ} (t)

is

{\hat{Y}}_{τ} (t) = f (\sum_{j = 1}^{J} g_{j} (t) w_{j}^{(o)} + b^{(o)})

(7)

where

w_{j}^{(o)}

are the output-layer weights,

b^{(o)}

is the output-layer bias, and

f (\cdot)

is the output-layer transfer function. The transfer function

h (\cdot)

and

f (\cdot)

are usually set as the hyperbolic tangent sigmoidal and linear function, respectively [36].

As an alternative method to prevent overfitting, weight delay regularization for the magnitude of the input-hidden layer weight can be applied by setting a penalty with a nonzero value.

2.4. Kernel Density Estimation (KDE)

Nonparametric density estimation is the process of fitting a parametric density model of a random variable without making the assumption that the density belongs to a particular parametric family [37,38]. Various methods have been proposed for nonparametric density estimation, e.g., k-nearest neighbors method, Parzen windows, histogram, and KDE [38]. In the domain of nonparametric density estimation, the K-nearest neighbors method has a very limited scope of practical applications due to its very poor performance. The Parzen windows method presents slightly better performance but also produces discontinuities (stair-like curves) that are quite annoying in practice [38]. A histogram is a simple form of the nonparametric density estimation. However, it suffers serious and noticeable drawbacks. First, the resulting visualization strongly depends on the choice of binning. Second, the natural feature of the histogram is discontinuity, which causes extreme difficulty if derivatives of the estimates are required.

Fortunately, those abovementioned drawbacks can be easily eliminated by using KDE [38,39]. In fact, KDE has been extensively studied and has become the most popular method in nonparametric density estimation. Given a random sample

Y_{1}, Y_{2}, \dots, Y_{m}

, the value of the density at the point

y

estimated by the KDE method is given by the following:

\hat{f} (y, h) = \frac{1}{m h} \sum_{i = 1}^{m} K (\frac{y - Y_{i}}{h})

(8)

where

h

is the bandwidth with positive real value and

K (\cdot)

is the kernel function. In this study, the most effective Epanechnikov kernel [38] was adopted and expressed as

K (y) = \frac{3}{4} (1 - y^{2}) ℝ (| y | \leq 1)

(9)

where

ℝ (\cdot)

is the indicator function, that is,

ℝ (y \in A) = 1

for

y \in A

and

ℝ (y \in A) = 0

for

y \notin A

.

The selection of bandwidth parameter is a crucial issue in KDE. The bandwidth parameter influences the smoothness of the KDE curve and also determines the tradeoff between the bias and variance. In general, the smaller the bandwidth, the smaller the bias, and the larger the variance. A number of methods have been proposed to find the optimal bandwidth, such as Silverman’s rule of thumb and the Sheather-Jones method. Silverman’s rule of thumb bandwidth with a Gaussian kernel and Epanechnikov kernel can be computed as follows:

h^{o p t i m a l} \approx 1.06 \hat{σ} n^{- \frac{1}{5}}

(10)

h^{o p t i m a l} \approx 2.34 \hat{σ} n^{- \frac{1}{5}}

(11)

where

\hat{σ}

is the estimation of

σ

(standard deviation of the input data) [38].

2.5. Ensemble Prediction Employing QRNNs and KDE

The proposed ensemble prediction employing QRNNs and KDE is shown in Figure 2. The QRNNs-KDE approach consists of four stages: (1) data splitting and normalization, (2) QRNN modelling, (3) probability density function (PDF) estimation by KDE, and (4) final ensemble prediction.

Data splitting and normalization: The original landslide monitoring dataset is divided into training data and testing data. The training data are used for model construction, and the testing data are used to evaluate the performance of the constructed model. To eliminate the influence of dimensional data, the training data and testing data are first normalized in the range of 0 to 1.

QRNNs modelling: QRNNs serve as base learning algorithms to generate multiple base learners

Y_{1} (t), Y_{2} (t), \dots, Y_{m} (t)

by applying a finite number of conditional quantities

τ_{1} \leq τ_{2} \leq \dots \leq τ_{m}

within the domain

0 < τ < 1

, e.g.,

τ

= 0.01, 0.02, …, 0.98, 0.99. The base learners of landslide displacement are obtained after renormalizing the outputs from the QRNNs approach. To avoiding overfitting in QRNNs modelling, a penalty parameter with nonzero value is applied.

PDF estimation by KDE: Multiple base learners from the QRNNs base model are treated as the input for KDE to estimate the probability density function (PDF) of the base learners. The kernel function and bandwidth influence the shape of the KDE curve. An appropriate kernel function and an optimal bandwidth should be chosen to best match the features of the original dataset.

Final ensemble prediction: In the present study, the final ensemble prediction was obtained through a probability combination scheme as follows:

u_{t} = \sum_{i = 1}^{m} p_{i} (t) Y_{i} (t)

(12)

where

p_{i} (t)

is the probability value of the i-th base learner and

Y_{i} (t)

is obtained from the KDE for monitoring period

t

.

2.6. Evaluation Metrics and Uncertainty Quantification

In this study, five indices—coefficient of determination (R²) MSE, RMSE, NRMSE, and MAPE—were applied to assess the performance of point prediction. R², MSE, RMSE, NRMSE, and MAPE are defined as

R^{2} = {[\frac{\sum_{t = 1}^{N} (u_{t} - \bar{u}) ({\hat{u}}_{t} - \bar{\hat{u}})}{\sqrt{\sum_{t = 1}^{N} {(u_{t} - \bar{u})}^{2} {({\hat{u}}_{t} - \bar{\hat{u}})}^{2}}}]}^{2}

(13)

M S E = \frac{\sum_{t = 1}^{N} {({\hat{u}}_{t} - u_{t})}^{2}}{N}

(14)

R M S E = \sqrt{\frac{\sum_{t = 1}^{N} {({\hat{u}}_{t} - u_{t})}^{2}}{N}}

(15)

N R M S E = \sqrt{\frac{\sum_{t = 1}^{N} {({\hat{u}}_{t} - u_{t})}^{2}}{\sum_{t = 1}^{N} u_{t}^{2}}}

(16)

M A P E = \frac{1}{N} (\sum_{t = 1}^{N} | \frac{{\hat{u}}_{t} - u_{t}}{u_{t}} |) \times 100 %

(17)

where

{\hat{u}}_{t}

and

u_{t}

denote the t-th predictive value and observation, respectively, and

\bar{u}

and

\bar{\hat{u}}

denote the mean of the observation and the mean of the predictive value, respectively.

In the present study, the associated predictive uncertainties were quantified with PIs. After the above procedures, full PDFs of the future landslide displacement were achieved. An interval prediction with a

(1 - α) \times 100 %

confidence interval can be obtained from the

α / 2

and

1 - α / 2

quantiles of the obtained PDF. The

α

level, also called the significance level, ranges from 0 to 1 and is the probability of not capturing the value of the parameter. The predictive values of the

α / 2

quantity and

1 - α / 2

quantity are set as the upper bound (

U_{t}^{1 - α}

) and lower bound (

L_{t}^{1 - α}

), respectively. For example, a 90% central PI can be obtained from the 0.05 and 0.95 quantiles of the PDF. The upper bound and lower bound of the 90% confidence level correspond to the predictive values of the 0.95 and 0.05 quantiles of the obtained PDF.

The prediction interval coverage probability (PICP), normalized mean PI width (NMPIW), and coverage width-based criterion (CWC) are three indices for evaluating the correctness of the approximated PIs. The PICP reflects the degree of reliability of PIs and is defined as

PICP = \frac{1}{N} \sum_{t = 1}^{N} I_{t}^{1 - α}

(18)

where

I_{t}^{1 - α}

is defined as follows:

I_{t}^{1 - α} = {\begin{matrix} 1 u_{t} \in [L_{t}^{1 - α}, U_{t}^{_{1 - α}}] \\ 0 u_{t} \notin [L_{t}^{1 - α}, U_{t}^{_{1 - α}}] \end{matrix}

(19)

NMPIW measures the width of the PI; it is defined as

N M P I W = \frac{1}{N ς} \sum_{t = 1}^{N} (U_{t}^{1 - α} - L_{t}^{1 - α})

(20)

where

ς

is the range of the underlying targets.

For high-quality PIs, narrow PIs (smaller NMPIW) with a high coverage probability (large PICP close to 100%) have great value [40,41]. Theoretically, NMPIW and PICP are conflicting. Therefore, CWC, which is a new balance criterion between PICP and NMPIW [42], is proposed to give a comprehensive assessment of PIs. CWC is defined as

C W C = (N M P I W + ψ) e^{\frac{γ (P I C P - μ)}{2 δ^{2}}}

(21)

where

ψ

is a small positive value within the range of (0.1%, 0.5%),

μ

corresponds to the nominal confidence level associated with PIs that is usually set to

1 - α

, and

δ

is a small positive value less than 1.

γ

is set to 1 during the training process; for testing, it is defined by the following step function:

γ = {\begin{matrix} 1, P I C P \geq μ \\ 0, P I C P < μ \end{matrix}

(22)

3. Case Study: Fanjiaping Landslide

3.1. Features of the Fanjiaping Landslide

The Fanjiaping landslide is located on the southern bank of the Yangtze River and upstream of the Baishuihe landslide and downstream of the well-known Huangtupo landslide, which is approximately 56 km northwest of the Three Gorges Reservoir Dam (see Figure 3 for location). The Fanjiaping landslide is an ancient landslide [43,44] composed of two blocks: the Muyubao landslide and Fanjiaping landslide. The entire planar area of the landslide is approximately 1.96 million square meters, and the landslide volume is approximately 106 million cubic meters. The thickness of the Fanjiaping landslide ranges from 40 to 139.16 m. The Muyubao landslide is approximately 1500 m long and 1200 m wide. The average thickness of the Muyubao landslide body is approximately 50 m, and its estimated volume is 90 million m³.

The Muyubao landslide extends from an elevation of 100 m at the toe to 520 m at the crown (Figure 4a,b). The slope surface consists of alternating gentle and comparatively steep landforms. The sliding direction of the landslide is 20°. The Tanjiahe landslide, located on the downstream of the Muyubao landslide, is approximately 1000 m long and 400 m wide. The average thickness of the Tanjiahe landslide body is approximately 40 m, and its estimated volume is 16 million m³. The Tanjiahe landslide extends from an elevation of 135 m at the toe to 420 m at the crown (Figure 4c,d). The slope surface consists of alternating gentle and comparatively steep landforms. The sliding direction of the landslide is 345°.

The site-specific investigation shows that the landslide materials are arranged in two different layers: a colluvial deposit at the upper surface and highly disturbed sandstone at the lower surface. The cataclastic sandstone is underlaid by sandstone and mudstone of the Jurassic Xiangxi formation (J1x) with an average dip direction of 10–25° and a dip angle of 27–36° (Figure 4b,d). Soft coal layers are prevalent in the J1x formation, and many landslides have developed along the soft coal layers. The borehole data indicates that the landslide mass of the Muyubao and Tanjiahe landslide slide along a soft coal layer with a thickness ranging from 0.1 to 0.3 m. According to laboratory testing of sliding zone soil obtained from the borehole, the natural moisture content of the soil is 12.6%, and the natural density is 1.9 g/cm³.

3.2. Input Data

A total of sixteen GPS beacons were installed on the landslide mass to monitor the landslide movements in September 2006 (see Figure 4 for the GPS locations): four on the Tanjiahe landslide and twelve on the Muyubao landslide. The GPS monuments were manually surveyed once a month. In April 2016, four GPS monitoring points, ZG295, ZG296, ZG297, and ZG298, were updated to near real-time monitoring. At most, thirteen years’ worth of monitoring data were obtained. Figure 5 shows the monthly rainfall intensity obtained from the Shazhenxi Meteorological Station near the Fanjiaping landslide, the reservoir water level, and the displacement from GPS survey monuments over the thirteen-year period from October 2006 to March 2018. The available data indicate that the landslide was unstable and continuously deforming during the entire monitoring period. The landslide exhibits a step-like deformation behavior because of the periodic fluctuations in the reservoir water level and heavy precipitation. The monitoring data from both Muyubao and Tanjiahe show that larger displacements occurred in the upper middle part of the landslide mass. From the sequence of the surface cracks and displacement magnitude, we speculate that the movement occurred first at the rear part and progressed downslope. Based on a previous study on the relations between slip-surface geometry, material structures, and deformational structures [45,46], the observed kinematic behaviors are expected independent of the characteristics of the landslide material. However, more work is needed to confirm these findings.

3.3. Triggering Factors of the Landslide Movements

Although the Fanjiaping landslide is one of the largest landslides in the TGRA, very few publications have reported detailed information on the triggering factors of the landslide movements. Fully understanding the triggering factors is critical for landslide mitigation and early warning. In this study, long-term and near real-time monitoring data were used to comprehensively analyze the landslide movements. The cumulative displacement at monitoring point ZG295, monthly rainfall intensity, and reservoir water levels in 2009, 2011, 2012, 2015 are shown in Figure 6a–d. The available data shows the following trends:

(1) When the reservoir water level first rose from 135 to 156 m at the end of 2006, a significant annual displacement of 330 mm occurred at monitoring point ZG291 in 2007. Similarly, an annual displacement of 260 mm occurred at monitoring point ZG291 in 2009 when the reservoir water level rose from 156 to 172 m at the end of 2008. After 2009, the annual displacements shows a decreasing trend (Figure 6f). The results of the above analysis suggest that landslide deformation occurred at the preliminary operation phase, and more significant movement is likely to occur when the reservoir water level reaches a new higher level.

(2) A large deformation occurred when the reservoir water level slightly dropped from 175 m to 170 m in November to February (I in Figure 6). During 2009 to 2015, the monthly deformation rate during this drawdown period was greater than 20 mm per month (Figure 6a–d). For example, when the reservoir water level dropped from 174 m to 172.21 m in January 2012 to February 2012, the displacements measured at monitoring points ZG295, ZG296, ZG297, and ZG298 were 38.98, 26.39, 35.12, and 41.78 mm, respectively (Figure 6c). However, when the reservoir water level significantly dropped from 170 m to 145 m in February to June (II in Figure 6), the monthly deformation rate decreased to less than 20 mm per month.

(3) When the reservoir level remained at 145 m in July to September (III in Figure 6) and the landslide area suffered a heavy rainfall event, the landslide deformation was likely to be suspended except for 2012. The maximum monthly rainfall intensity during those suspended activities was 158 mm. In July 2012, the landslide area suffered from a heavy rainfall event with a monthly rainfall intensity of 208 mm, and the monitoring point deformed at a high rate. From those comparative analyses, we can speculate than the minimum triggering threshold consists of episodes lasting one month with cumulative rainfall exceeding 158 mm.

(4) When the reservoir rose from 145 m to 175 m in September to November (IV in Figure 6), monthly deformation rate decreased to a small positive (less than 10 mm per month) or even negative value.

(5) The near real-time monitoring data also showed the abovementioned trends: when the reservoir water level rose from 147.25 to 174.42 m on August 31 2016 to October 27 2016, the monthly deformation rates for ZG296 and ZG297 were 2.6 and 3.3 mm/month, respectively. When the reservoir dropped from 174.45 to 171.18 m on November 9 2016 to January 16 2017, deformations of 24.07 and 26.46 mm occurred at ZG296 and ZG297, respectively. The corresponding monthly deformation rates were 12.8 and 13.2 mm/month, respectively.

From the above analysis we can conclude that landslide movement was especially pronounced under prolonged periods of dropping reservoir levels, especially during periods of slight dropdown at the highest reservoir level, and the minimum triggering threshold consisted of episodes lasting one month, with cumulative rainfall exceeding 158 mm.

3.4. QRNNs-KDE-Based Method for Ensemble Prediction

3.4.1. Data Splitting and Normalization

The available data (Figure 5) indicate that for the two active blocks, the largest displacements were observed at monitoring points ZG289 and ZG291, respectively. Therefore, monitoring points ZG289 and ZG291 were selected to establish a prediction model for the Fanjiaping landslide.

Previous correlation analysis in [3] revealed that weak to very strong correlations exist between landslide displacement and triggering and state variables. Therefore, based on triggering factor analysis and previous work on correlation analysis in [3], seven variables including four trigger variables and three state variables were selected as the inputs: rainfall intensity over the past month (

x_{1} (t)

), rainfall intensity over the past two months (

x_{2} (t)

), average reservoir water level in the current month (

x_{3} (t)

), variation in the reservoir water level in the current month (

x_{4} (t)

), displacement over the past one month (

x_{5} (t)

), displacement over the past two months (

x_{6} (t)

), and displacement over the past three months (

x_{7} (t)

). In addition, the displacement in the current month (

Y (t)

) was selected as the output. A data set

(x_{i} (t), Y (t)), i = 1, 2, \dots, 7

was generated based on the inputs and corresponding outputs. For the Tanjiahe landslide, the data from October 2006 to January 2015 with a size of 100 were treated as the training set, and the data from February 2015 to June 2015 with a size of 5 were used as the testing set. For the Muyubao landslide, the data from October 2006 to January 2015 with a size of 133 were treated as the training set, and the data from November 2017 to October 2018 with a size of 12 were used as the testing set.

3.4.2. QRNN Modelling

Two nonlinear models with a sigmoidal transfer function and linear transfer function for

τ

= 0.01, 0.02, …, 0.98, 0.99 with an interval of 0.01 were trained for monitoring points ZG289 and ZG291 to generate multiple base learners. The number of hidden nodes in the QRNNs model was set to 5. The penalty for weight delay regularization was set to 1 to prevent overfitting in QRNNs model construction. For each monitoring period, a total of 99 base learners were obtained at conditional quantities ranging from 0.01 to 0.99 based on the QRNNs. The main parameters applied in the modelling of QRNNs are shown in Table 1.

3.4.3. PDF Estimation by KDE

The multiple base learners from QRNNs were employed as inputs of Epanechnikov KDE to estimate the PDF. The optimal bandwidths for PDF estimation were calculated based on Silverman’s rule of thumb. The optimal bandwidths for PDF estimation of testing data at ZG289 were set to 7.98, 8.05, 5.66, 7.05, and 9.26. The optimal bandwidths for PDF estimation of testing data at ZG291 were set to 5.28, 8.61, 7.77, 5.62, and 5.30.

3.4.4. Final Ensemble Prediction

Final ensemble predictions for the Fanjiaping landslide were generated through a probability combination scheme. PIs were constructed from the obtained PDF to estimate the predictive uncertainty. For the purpose of aiding decision-making, it is preferable to have prediction information with high confidence levels to reduce risks. Therefore, PIs at a high PINC value of 90% were obtained and analyzed in the study.

4. Results

PDFs: The PDFs of predictive displacement at ZG289 and ZG291 constructed by the proposed QRNNs-KDE approach are shown in Figure 7 and Figure 8. The fast movement is the main concern in landslide displacement prediction. Here, only a portion of the prediction describing the fast landslide is selected and shown. Figure 7 and Figure 8 show that rather than a single estimate, the range and complete PDF of the predictive displacement are provided by the proposed approach. All landslide displacement observations are distributed in the middle of the PDFs with high probability in addition to the observations of May and June at ZG289, which appear at the tail of the probability density curve. The small fraction falling into the right tail follows the increase in the prediction period; here, there are more uncertainties associated with longer-term landslide predictions.

Final ensemble prediction: Figure 9 shows the final ensemble predictions. As shown in Figure 9, the ensemble predictions obtained via the probability combination scheme showed a high degree of consistency in the landslide displacement observations, with coefficient of determination values of 0.999932 and 0.999944. To further evaluate the prediction performances of ensemble prediction based on the QRNNs-KDE, the evaluation metrics of the BP, RBF, ELM, and SVM approaches are shown in Table 2. As shown in Table 2, the final ensemble predictions using the QRNNs-KDE approach outperformed the persistence methods with the smallest MSE, RMSE, NRMSE, and MAPE and the largest R². Moreover, compared with predictions at monitoring point ZG289 using the Copula-KSVMQR approach in [3], the QRNNs-KDE approach provided more accurate prediction with smaller MAPE and RMSE.

Uncertainty quantification: Based on the PDFs shown in Figure 7 and Figure 8, PIs at a high confidence level (90%) were constructed for ZG289 and ZG291 (Figure 10a,c, respectively). To evaluate the prediction performances based on the QRNNs-KDE approach, 90% PIs were constructed based on the bootstrap-ELM-ANN approach (Figure 10b,d). The corresponding evaluation metrics are shown in Table 3. As shown in Figure 10 and Table 3, the constructed PIs based on the QRNNs-KDE approach perfectly covered the observations with a high percentage, and the QRNNs-KDE approach outperformed the bootstrap-ELM-ANN approach with smaller NMPIW and CWC. For example, the performance indices NPIW and CWC of 90% PIs at ZG289 were 0.0215 and 0.1661, respectively, which were lower than those obtained using the bootstrap-ELM-ANN approach. The normalized mean PI width using the QRNNs-KDE approach was approximately 90% narrower than that for the bootstrap-ELM-ANN approach.

The experimental results show that the final ensemble predictions based on the QRNNs-KDE approach outperformed the traditional BP, RBF, ELM, SVM, and Copula-KSVMQR algorithms with regard to deterministic point prediction. The QRNNs-KDE approach was more informative than traditional algorithms because it provided the likely range of landslide displacement. The landslide observations were distributed in the middle of the prediction range with high probability. Moreover, regarding the aspect of uncertainty quantification, the QRNNs-KDE provided more satisfactory PIs than the bootstrap-ELM-ANN approach. Therefore, we believe that the final ensemble predictions based on the QRNNs-KDE approach have the advantages of accurate prediction and uncertainty quantification of landslide displacement.

5. Discussion

In this study, with regard to point prediction, the probability-scheme combination ensemble prediction, which employs QRNNs-KDE, provided the best prediction. The fundamental reasons behind this can be explained from statistical, computational, and representational perspectives [47]. From a statistical perspective, the available training data set may not be able to provide sufficient information for training the true model (h^* in Figure 11). Constructing an ensemble model (h^’ in Figure 11) might not be better than the single best prediction model h^*, but it does reduce the risk of choosing a bad learner with poor generalizability (schematic in Figure 11a). From a computational perspective, in a single model the training algorithms might get stuck in lock optima by only performing a local search. Constructing an ensemble model by searching from different starting positions might be a better alternative (schematic in Figure 11b). From a representational perspective, it is possible that the searched hypothesis space might not contain the true model h^*. Constructing an ensemble model might expand the representable space (schematic in Figure 11c).

In the proposed QRNNs-KDE approach, the probability combination scheme is employed to combine 99 base learners into one final ensemble to improve the model performance. However, a concern about computational time may be associated with this ensemble strategy. The required computational time is highly related to the number of base learners. For the case of ZG 291, the required computation time is 191.85 s to train 99 base learners in RStudio Version 1.2.5042 on an Intel(R) Xeon(R) E-2176M @ 2.70 GHz CPU with 64 GB RAM. Thus, we believe that the proposed approach is computationally efficient.

Nevertheless, the probability-scheme combination ensemble prediction, which employ QRNNs and KDE, also holds inherent limitations associated with data-driven models, such as the lack of an explicit input-output relationship, and the requirement of large training data to maintain the model performance.

In practical applications, the main motivation for the construction of predictive range and complete PDF is to quantify the likely predictive uncertainty in the deterministic point predictions. Availability of range and complete PDF of the predictive displacement allows the researchers and practitioners to efficiently quantify the level of predictive uncertainty with the deterministic point predictions and to consider a multiple of solutions/scenarios for the best and worst conditions. Wide ranges are an indication of presence of a high level of uncertainty in the operation. This information can guide the researchers and practitioners to avoid the selection of risky actions under uncertain conditions. In contrast, narrow range means that decisions can be made more confidently with less chance of confronting an unexpected condition in the future, for example, if a sharp displacement increment with a wider range was predicted for the further. An alert should be carefully determined whether reaching tertiary creep stage by researchers and practitioners through comprehensive analysis. Under this circumstance, time-of-failure forecasting should be run in parallel, and a multiple of solutions/scenarios should be considered until either failure precursors are identified or the movements suspended.

The proposed QRNNs-KDE approach is suitable for medium-term to long-term horizon forecasting. Results from previous studies [2,48] have shown that the performance of data-driven models varies for landslides with different deformation behaviors. Usually, for landslides with drastic step-like deformation, the prediction accuracy is lower, and the corresponding prediction error is larger. Therefore, in practical applications of medium-term to long-term horizon forecasting, when predicting landslides with drastic deformation, the proposed QRNNs-KDE approach should be applied with caution. To achieve excellent performance, sufficient data are recommended and needed for model training.

6. Conclusions

In this study, a QRNNs-KDE approach was proposed to improve the prediction accuracy and uncertainty quantification of landslide displacement. The Fanjiaping landslide in the TGRA was selected as a case study to explore the performance of the QRNNs-KDE approach. The following conclusions from the study were obtained:

The movements of the Fanjiaping landslide was especially pronounced under prolonged periods of dropping reservoir levels, especially during periods of slight dropdown at the highest reservoir level, and the minimum triggering threshold consists of episodes lasting one month, with cumulative rainfall exceeding 158 mm.

The QRNNs-KDE approach achieves perfect performance and outperforms the traditional BP, RBF, ELM, SVM, bootstrap-ELM-ANN, and Copula-KSVMQR methods. Additionally, the proposed approach is more informative by providing the likely range and complete PDFs of landslide displacement. The landslide displacement observations are distributed in the middle of the prediction range with high probability.

In practical application, the proposed QRNNs-KDE approach is suitable for medium-term to long-term horizon forecasting. The range and complete PDF of the predictive displacement can supplement final point predictions for decision making.

Author Contributions

The work was carried out in collaboration between all the authors. J.M. and X.L. guided and supervised this research; X.N., Y.W., T.W., J.Z., and Z.Z. performed the field investigation; J.M. wrote the original draft; and J.M. and X.L. reviewed and edited the draft. All authors have contributed to, seen, and approved the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key R&D Program of China (Grant No. 2017YFC1501305), the National Natural Science Foundation of China (Grant Nos. 41702328 and 41572279), the Hubei Provincial Natural Science Foundation of China (Grant No. 2019CFB585), and the Huaneng Lancang River Hydropower Co., Ltd. (HNHJ18-H24).

Conflicts of Interest

The authors declare no conflict of interest.

References

CRED. EM-DAT: International Disaster Database. Available online: https://public.emdat.be/data (accessed on 9 June 2020).
Wang, Y.; Tang, H.; Wen, T.; Ma, J. Direct Interval Prediction of Landslide Displacements Using Least Squares Support Vector Machines. Complexity 2020, 2020, 7082594. [Google Scholar] [CrossRef]
Ma, J.W.; Niu, X.X.; Tang, H.M.; Wang, Y.K.; Wen, T.; Zhang, J.R. Displacement Prediction of a Complex Landslide in the Three Gorges Reservoir Area (China) Using a Hybrid Computational Intelligence Approach. Complexity 2020, 2020, 2624547. [Google Scholar] [CrossRef]
Ma, J.W.; Tang, H.M.; Liu, X.; Wen, T.; Zhang, J.R.; Tan, Q.W.; Fan, Z.Q. Probabilistic forecasting of landslide displacement accounting for epistemic uncertainty: A case study in the Three Gorges Reservoir area, China. Landslides 2018, 15, 1145–1153. [Google Scholar] [CrossRef]
Pinto, F.; Guerriero, L.; Revellino, P.; Grelle, G.; Senatore, M.R.; Guadagno, F.M. Structural and lithostratigraphic controls of earth-flow evolution, Montaguto earth flow, Southern Italy. J. Geol. Soc. Lond. 2016, 173, 649. [Google Scholar] [CrossRef]
Guerriero, L.; Diodato, N.; Fiorillo, F.; Revellino, P.; Grelle, G.; Guadagno, F.M. Reconstruction of long-term earth-flow activity using a hydroclimatological model. Nat. Hazards 2015, 77, 1–15. [Google Scholar] [CrossRef]
Hu, X.; Bürgmann, R.; Schulz, W.H.; Fielding, E.J. Four-dimensional surface motions of the Slumgullion landslide and quantification of hydrometeorological forcing. Nat. Commun. 2020, 11, 2792. [Google Scholar] [CrossRef]
Ma, J.W.; Tang, H.M.; Hu, X.L.; Bobet, A.; Zhang, M.; Zhu, T.W.; Song, Y.J.; Ez Eldin, M.A.M. Identification of causal factors for the Majiagou landslide using modern data mining methods. Landslides 2017, 14, 311–322. [Google Scholar] [CrossRef]
Corominas, J.; Moya, J.; Ledesma, A.; Lloret, A.; Gili, J.A. Prediction of ground displacements and velocities from groundwater level changes at the Vallcebre landslide (Eastern Pyrenees, Spain). Landslides 2005, 2, 83–96. [Google Scholar] [CrossRef]
Li, S.H.; Wu, L.Z.; Chen, J.J.; Huang, R.Q. Multiple data-driven approach for predicting landslide deformation. Landslides 2020, 17, 709–718. [Google Scholar] [CrossRef]
Saito, M. Forecasting the time of occurrence of a slope failure. In Proceedings of the 6th International Congress of Soil Mechanics and Foundation Engineering, Montreal, QC, Canada, 8–15 September 1965; pp. 537–541. [Google Scholar]
Hayashi, S.; Komamura, F.; Park, B.W.; Yamamori, T. On the forecast of time to failure of slope-Approximate forecast in the early period of the tertiary creep. J. Jpn. Landslide Soc. 1988, 25, 11–16. [Google Scholar] [CrossRef]
Federico, A.; Popescu, M.; Fidelibus, C.; Interno, G. On the prediction of the time of occurrence of a slope failure: A review. In Proceedings of the 9th International Symposium on Landslides, Rio de Janeiro, Brazil, 28 June–2 July 2004; pp. 979–983. [Google Scholar]
Ma, J.W.; Tang, H.M.; Liu, X.; Hu, X.L.; Sun, M.J.; Song, Y.J. Establishment of a deformation forecasting model for a step-like landslide based on decision tree C5.0 and two-step cluster algorithms: A case study in the Three Gorges Reservoir area, China. Landslides 2017, 14, 1275–1281. [Google Scholar] [CrossRef]
Wen, T.; Tang, H.M.; Wang, Y.K.; Lin, C.Y.; Xiong, C.R. Landslide displacement prediction using the GA-LSSVM model and time series analysis: A case study of Three Gorges Reservoir, China. Nat. Hazards Earth Syst. Sci. 2017, 17, 2181–2198. [Google Scholar] [CrossRef] [Green Version]
Huang, F.M.; Yin, K.L.; Zhang, G.R.; Gui, L.; Yang, B.B.; Liu, L. Landslide displacement prediction using discrete wavelet transform and extreme learning machine based on chaos theory. Environ. Earth Sci. 2016, 75, 1376. [Google Scholar] [CrossRef]
Zhang, J.; Wang, Z.P.; Zhang, G.D.; Xue, Y.D. Probabilistic prediction of slope failure time. Eng. Geol. 2020, 271, 105586. [Google Scholar] [CrossRef]
Yao, W.; Zeng, Z.G.; Lian, C. Generating probabilistic predictions using mean-variance estimation and echo state network. Neurocomputing 2017, 219, 536–547. [Google Scholar] [CrossRef]
Tehrany, M.S.; Pradhan, B.; Jebur, M.N. Flood susceptibility mapping using a novel ensemble weights-of-evidence and support vector machine models in GIS. J. Hydrol. 2014, 512, 332–343. [Google Scholar] [CrossRef]
Zhu, Y.J. Ensemble forecast: A new approach to uncertainty and predictability. Adv. Atmos. Sci. 2005, 22, 781–788. [Google Scholar] [CrossRef]
Wang, Z.Y.; Srinivasan, R.S. A review of artificial intelligence based building energy use prediction: Contrasting the capabilities of single and ensemble prediction models. Renew. Sust. Energ. Rev. 2017, 75, 796–808. [Google Scholar] [CrossRef]
Rigamonti, M.; Baraldi, P.; Zio, E.; Roychoudhury, I.; Goebel, K.; Poll, S. Ensemble of optimized echo state networks for remaining useful life prediction. Neurocomputing 2018, 281, 121–138. [Google Scholar] [CrossRef] [Green Version]
Kim, M.-J.; Kang, D.-K. Ensemble with neural networks for bankruptcy prediction. Expert Syst. Appl. 2010, 37, 3373–3379. [Google Scholar] [CrossRef]
Prayogo, D.; Cheng, M.-Y.; Wu, Y.-W.; Tran, D.-H. Combining machine learning models via adaptive ensemble weighting for prediction of shear capacity of reinforced-concrete deep beams. Eng. Comput. Ger. 2019, 36, 1135–1153. [Google Scholar] [CrossRef]
Chen, K.; Jiang, J.; Zheng, F.; Chen, K. A novel data-driven approach for residential electricity consumption prediction based on ensemble learning. Energy 2018, 150, 49–60. [Google Scholar] [CrossRef]
Lee, D.; Baldick, R. Short-Term Wind Power Ensemble Prediction Based on Gaussian Processes and Neural Networks. IEEE Trans. Smart Grid 2014, 5, 501–510. [Google Scholar] [CrossRef]
Choubin, B.; Moradi, E.; Golshan, M.; Adamowski, J.; Sajedi-Hosseini, F.; Mosavi, A. An ensemble prediction of flood susceptibility using multivariate discriminant analysis, classification and regression trees, and support vector machines. Sci. Total Environ. 2019, 651, 2087–2096. [Google Scholar] [CrossRef] [PubMed]
Chen, J.; Li, Q.; Wang, H.; Deng, M. A Machine Learning Ensemble Approach Based on Random Forest and Radial Basis Function Neural Network for Risk Evaluation of Regional Flood Disaster: A Case Study of the Yangtze River Delta, China. Int. J. Environ. Res. Public Health 2020, 17, 49. [Google Scholar] [CrossRef] [Green Version]
Di Napoli, M.; Carotenuto, F.; Cevasco, A.; Confuorto, P.; Di Martire, D.; Firpo, M.; Pepe, G.; Raso, E.; Calcaterra, D. Machine learning ensemble modelling as a tool to improve landslide susceptibility mapping reliability. Landslides 2020. [Google Scholar] [CrossRef]
Srivastav, R.K.; Sudheer, K.P.; Chaubey, I. A simplified approach to quantifying predictive and parametric uncertainty in artificial neural network hydrologic models. Water Resour. Res. 2007, 43, W10407. [Google Scholar] [CrossRef] [Green Version]
Tiwari, M.K.; Chatterjee, C. Development of an accurate and reliable hourly flood forecasting model using wavelet-bootstrap-ANN (WBANN) hybrid approach. J. Hydrol. 2010, 394, 458–470. [Google Scholar] [CrossRef]
Kasiviswanathan, K.S.; He, J.; Sudheer, K.P.; Tay, J.-H. Potential application of wavelet neural network ensemble to forecast streamflow for flood management. J. Hydrol. 2016, 536, 161–173. [Google Scholar] [CrossRef]
Zhang, J.; Tang, W.H.; Zhang, L.M.; Huang, H.W. Characterising geotechnical model uncertainty by hybrid Markov Chain Monte Carlo simulation. Comput. Geotech. 2012, 43, 26–36. [Google Scholar] [CrossRef]
Cannon, A.J. Quantile regression neural networks: Implementation in R and application to precipitation downscaling. Comput. Geosci. UK 2011, 37, 1277–1284. [Google Scholar] [CrossRef]
Xu, Q.F.; Liu, X.; Jiang, C.X.; Yu, K.M. Quantile autoregression neural network model with applications to evaluating value at risk. Appl. Soft Comput. 2016, 49, 1–12. [Google Scholar] [CrossRef] [Green Version]
Donaldson, R.G.; Kamstra, M. Forecast combining with neural networks. J. Forecast. 1996, 15, 49–61. [Google Scholar] [CrossRef]
Charlton, T.S.; Rouainia, M. Probabilistic capacity analysis of suction caissons in spatially variable clay. Comput. Geotech. 2016, 80, 226–236. [Google Scholar] [CrossRef] [Green Version]
Gramacki, A. Nonparametric Kernel Density Estimation and Its Computational Aspects; Springer: New York, NY, USA, 2017. [Google Scholar]
Zhang, S.; Ma, J.W.; Tang, H.M. Estimation of Risk Thresholds for a Landslide in the Three Gorges Reservoir Based on a KDE-Copula-VaR Approach. Geofluids 2020, 2020, 8030264. [Google Scholar] [CrossRef]
Wan, C.; Xu, Z.; Pinson, P.; Dong, Z.Y.; Wong, K.P. Probabilistic forecasting of wind power generation using extreme learning machine. IEEE Trans. Power Syst. 2014, 29, 1033–1044. [Google Scholar] [CrossRef] [Green Version]
Wan, C.; Xu, Z.; Wang, Y.L.; Dong, Z.Y.; Wong, K.P. A hybrid approach for probabilistic forecasting of electricity price. IEEE Trans. Smart Grid 2014, 5, 463–470. [Google Scholar] [CrossRef]
Wang, Y.K.; Tang, H.M.; Wen, T.; Ma, J.W. A hybrid intelligent approach for constructing landslide displacement prediction intervals. Appl. Soft Comput. 2019, 81, 105506. [Google Scholar] [CrossRef]
Zhang, L.; Liao, M.S.; Balz, T.; Shi, X.G.; Jiang, Y.N. Monitoring landslide activities in the Three Gorges area with multi-frequency satellite SAR data sets. In Modern Technologies for Landslide Monitoring and Prediction; Scaioni, M., Ed.; Springer Berlin Heidelberg: Berlin/Heidelberg, Germany, 2015; pp. 181–208. [Google Scholar] [CrossRef]
Fan, J.H.; Qiu, K.T.; Xia, Y.; Li, M.; Lin, H.; Zhang, H.T.; Tu, P.F.; Liu, G.; Shu, S.Q. InSAR monitoring and synthetic analysis of the surface deformation of Fanjiaping landslide in the Three Gorges Reservoir area. Geol. Bull. China 2017, 36, 1665–1673. [Google Scholar]
Guerriero, L.; Coe, J.A.; Revellino, P.; Grelle, G.; Pinto, F.; Guadagno, F.M. Influence of slip-surface geometry on earth-flow deformation, Montaguto earth flow, southern Italy. Geomorphology 2014, 219, 285–305. [Google Scholar] [CrossRef]
Guerriero, L.; Bertello, L.; Cardozo, N.; Berti, M.; Grelle, G.; Revellino, P. Unsteady sediment discharge in earth flows: A case study from the Mount Pizzuto earth flow, southern Italy. Geomorphology 2017, 295, 260–284. [Google Scholar] [CrossRef]
Dietterich, T.G. Ensemble Methods in Machine Learning. In Proceedings of the Multiple Classifier Systems, Cagliari, Italy, 21–23 June 2000; pp. 1–15. [Google Scholar]
Du, J.; Yin, K.L.; Lacasse, S. Displacement prediction in colluvial landslides, Three Gorges Reservoir, China. Landslides 2013, 10, 203–218. [Google Scholar] [CrossRef]

Figure 1. General framework for ensemble prediction models. (a) Homogeneous ensemble model and (b) heterogeneous ensemble model.

Figure 2. The overall flowchart of ensemble prediction based on the quantile regression neural networks and kernel density estimation (QRNNs-KDE) approach.

Figure 3. Location of the landslide site.

Figure 4. Topographic map and geological profile of the Fanjiaping landslide. (a) Topographic map of the Muyubao landslide. (b) Geological profile of the Muyubao landslide along sections A-A′, as recorded with monitoring instruments. (c) Topographic map of the Tanjiahe landslide. (d) Geological profile of the Tanjiahe landslide along sections B-B′, as recorded with monitoring instruments.

Figure 5. Reservoir water level, monthly rainfall intensity, and cumulative displacement from the Fanjiaping landslide area.

Figure 6. (a–d) Cumulative displacement at monitoring point ZG295, monthly rainfall intensity, and reservoir water level spanning the period of 2009, 2011, 2012, and 2015. (e) Cumulative displacement at monitoring points ZG296 and ZG297, daily rainfall intensity, and reservoir water level spanning the period of June 2016 to October 2017. (f) Annual displacement at monitoring point ZG291, ZG294, ZG288, and ZG289, and reservoir water level spanning the period of 2007 to 2017.

Figure 7. Probability density functions (PDFs) for the Fanjiaping landslide at ZG289 from February 2015 to June 2015.

Figure 8. PDFs for the Fanjiaping landslide at ZG291 from December 2017 to May 2018.

Figure 9. Comparisons of the final ensemble predictions and observations for the Fanjiaping landslide at ZG289 and ZG291.

Figure 10. Comparisons of the observations and the constructed PIs at a 90% confidence level for the Fanjiaping landslide at ZG289 and ZG291 using QRNNs-KDE and bootstrap-ELM-ANN. (a) 90% PIs at ZG291 using QRNNs-KDE; (b) 90% PIs at ZG291 using bootstrap-ELM-ANN; (c) 90% PIs at ZG289 using QRNNs-KDE, (d) 90% PIs at ZG289 using bootstrap-ELM-ANN.

Figure 11. Schematic that shows the fundamental benefits of the ensemble prediction model from statistical (a), computational (b), and representational (c) perspectives. h^* is the true prediction model; h₁, h₂, and h₃ are single prediction models; and h^’ is the ensemble prediction model obtained by combining the single prediction models h₁, h₂, and h₃. The outer black curve is the hypothesis space of all possible models. The inner blue curve denotes the subset of hypotheses that give reasonable accuracy with the available training data (modified from [47]).

Table 1. The parameters utilized in the QRNNs modeling for the Fanjiaping landslide.

Parameter	Value	Parameter	Value
Maximum number of iterations	5000	Penalty for weight decay regularization	1
Number of quantiles	99	Number of input nodes	7
Number of repeated trials	5	Number of hidden nodes	5

Table 2. Comparisons of predictions obtained from QRNNs-KDE, BP, RBF, ELM, and SVM for the Fanjiaping landslide.

Monitoring Point		BP	RBF	ELM	SVM	QRNNs-KDE
Monitoring Point	Index	BP	RBF	ELM	SVM	QRNNs-KDE
ZG289	R²	0.99730	0.99992	0.99785	0.99993	0.99997
	MSE	3192.07	99.54	2538.74	78.12	30.69
	RMSE	56.50	9.98	50.39	8.84	5.54
	NRMSE	0.032263	0.005697	0.028772	0.005047	0.003163
	MAPE	2.74	2.00	1.57	1.27	1.17
ZG291	R²	0.99991	0.99759	0.99991	0.99995	0.99997
	MSE	206.32	5684.98	215.41	119.75	70.15
	RMSE	14.36	75.40	14.68	10.94	8.38
	NRMSE	0.005953	0.031251	0.006083	0.004536	0.003471
	MAPE	3.97	1.96	2.59	2.33	0.41

Note: The most accurate prediction results are shown in bold italics.

Table 3. Comparisons of 90% PIs obtained from Bootstrap-ELM-ANN and QRNNs-KDE for the Fanjiaping landslide.

Monitoring Point		PICP	NPIW	CWC
Monitoring Point	Model	PICP	NPIW	CWC
ZG289	Bootstrap-ELM-ANN	100%	0.27	0.2071
ZG289	QRNNs-KDE	100%	0.0215	0.1661
ZG291	Bootstrap-ELM-ANN	99%	0.024	0.143
ZG291	QRNNs-KDE	99%	0.018	0.085

Note: The prediction results with a narrower PI range are shown in bold italics.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ma, J.; Liu, X.; Niu, X.; Wang, Y.; Wen, T.; Zhang, J.; Zou, Z. Forecasting of Landslide Displacement Using a Probability-Scheme Combination Ensemble Prediction Technique. Int. J. Environ. Res. Public Health 2020, 17, 4788. https://doi.org/10.3390/ijerph17134788

AMA Style

Ma J, Liu X, Niu X, Wang Y, Wen T, Zhang J, Zou Z. Forecasting of Landslide Displacement Using a Probability-Scheme Combination Ensemble Prediction Technique. International Journal of Environmental Research and Public Health. 2020; 17(13):4788. https://doi.org/10.3390/ijerph17134788

Chicago/Turabian Style

Ma, Junwei, Xiao Liu, Xiaoxu Niu, Yankun Wang, Tao Wen, Junrong Zhang, and Zongxing Zou. 2020. "Forecasting of Landslide Displacement Using a Probability-Scheme Combination Ensemble Prediction Technique" International Journal of Environmental Research and Public Health 17, no. 13: 4788. https://doi.org/10.3390/ijerph17134788

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Forecasting of Landslide Displacement Using a Probability-Scheme Combination Ensemble Prediction Technique

Abstract

1. Introduction

2. Methodology

2.1. Description of Uncertainty Sources

2.2. Ensemble Prediction

2.3. Quantile Regression Neural Network

2.3.1. Quantile Regression

2.3.2. Quantile Regression Neural Network

2.4. Kernel Density Estimation (KDE)

2.5. Ensemble Prediction Employing QRNNs and KDE

2.6. Evaluation Metrics and Uncertainty Quantification

3. Case Study: Fanjiaping Landslide

3.1. Features of the Fanjiaping Landslide

3.2. Input Data

3.3. Triggering Factors of the Landslide Movements

3.4. QRNNs-KDE-Based Method for Ensemble Prediction

3.4.1. Data Splitting and Normalization

3.4.2. QRNN Modelling

3.4.3. PDF Estimation by KDE

3.4.4. Final Ensemble Prediction

4. Results

5. Discussion

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI