Prediction of intensity and location of seismic events using deep learning

doi:10.1016/j.spasta.2020.100442

Spatial Statistics

Volume 42, April 2021, 100442

Abstract

The objective of this work is to predict the seismic rate in Chile by using two Deep Neural Network (DNN) architectures, Long Short Term Memory (LSTM) and Convolutional Neural Networks (CNN). For this, we propose a methodology based on a three-module approach: a pre-processing module, a spatial and temporal estimation module, and a prediction module. The first module uses the Epidemic-Type Aftershock Sequences (ETAS) model to estimate the intensity function, which is then used to estimate the seismic rate on a 1 × 1 degree grid, providing a sequence of daily images covering the whole seismic area of Chile. The spatial and temporal estimation module uses the LSTM and the CNN to predict the intensity and the location of earthquakes. The last module integrates the information provided by the DNNs to predict future values of the maximum seismic rate and its location. In particular, the LSTM will be trained using the maximum intensity of the last 30 days as input for predicting the maximum intensity of the next day, and the CNN will be trained on the last 30 images provided by the ETAS model for predicting the probability that the next day's maximum event will be in a certain area of Chile. Some performance indexes (such as $R^{2}$ and accuracy) will be used for validating the proposed models.

Introduction

Earthquakes are among the most destructive yet unpredictable natural disasters in the world, with a massive physical, psychological and economic impact on populations worldwide. Chile is considered one of the most seismically active countries in the world: the world’s largest instrumentally documented earthquake occurred in Valdivia (on May 22, 1960), and the country has recently been affected by three major earthquakes with magnitudes $>$ 8.0 (on the Richter scale). Thus, having a better approximation of, or additional information on, where and when an event of that magnitude could occur would be an invaluable tool for managing and designing public policies regarding natural disasters. However, earthquake prediction is a very challenging task due to the high complexity of the process itself, and also because earthquake occurrences depend on a multitude of variables that in most cases may be unidentified (Sobolev, 2015, Cimellaro and Marasco, 2018, Joffe et al., 2018). Most of the proposed prediction models focus on some form of seismic hazard estimation (Budnitz et al., 1997, Woessner et al., 2015, Petersen et al., 2018). Seismic hazard is defined as the probability that an earthquake will occur in a given geographic area, within a given window of time, and with a magnitude exceeding a given threshold. In fact, according to Allen (1976) and Joffe et al. (2018), a valid approach for earthquake prediction should consider a spatio-temporal window, a magnitude estimation, a scientifically sound validation process, and an appropriate visualization procedure.

Self-exciting point process models have become essential components in the assessment of seismic hazard. A particular class is given by the Epidemic-Type Aftershock Sequence (ETAS) models, which have proven to be extremely useful in the description and modeling of earthquake occurrence times and locations. In that sense, Ogata (1988, 1998) introduced ETAS models for temporal and spatio-temporal seismic hazard estimation, respectively. Those models use a given parametrization of the conditional intensity function associated with the occurrence rate of an earthquake and its triggering function at time $t$ and at location $(x, y)$. Aftershocks are then estimated following the seismic aftershock propagation law, or Omori’s law (Utsu, 1961). Recently, many improvements and extensions have been proposed for incorporating local features of the seismic events (Lombardi et al., 2010, Ogata, 2011, Bansal and Ogata, 2013, Kumazawa and Ogata, 2014, Guo et al., 2015, Nicolis et al., 2015).
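
For reference, the temporal ETAS conditional intensity is commonly written in the following form, which combines a background rate with Omori-type decaying contributions from past events. The symbols are the standard ones from the ETAS literature rather than notation taken from this excerpt, so the parametrization used in the paper's methodology section may differ:

$$\lambda(t \mid \mathcal{H}_t) = \mu + \sum_{i:\, t_i < t} K\, e^{\alpha (M_i - M_0)}\, (t - t_i + c)^{-p}$$

Here $\mu$ is the background rate, $M_0$ is the threshold magnitude, $(t_i, M_i)$ are the time and magnitude of the $i$-th past event, and $K$, $\alpha$, $c$ and $p$ are parameters to be estimated; the factor $(t - t_i + c)^{-p}$ is the modified Omori decay.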

Although ETAS models have proven very useful for estimating the triggering earthquakes, they often fail when predicting future events. Nicolis et al. (2017) show that the ETAS model normally underestimates the real number of seismic events, depending on the precise time at which the main shock happened. To solve this problem, they introduced a correction factor that accounts for a main earthquake happening just before the forecasting day. They also show the superiority of the temporal ETAS model over the spatio-temporal ETAS model for predicting future values of the intensity.

Joffe et al. (2018) stated that contemporary techniques are insufficiently sensitive to allow for precise modeling of future earthquake events. This raises the importance of new approaches that consider broader and bigger sources of information. In that sense, Deep Neural Networks (DNN) have state-of-the-art accuracy for most of the problems where statistical learning models are applied and where a precise mathematical formulation is hard to obtain. Moreover, DNN architectures such as Long Short Term Memory (LSTM) networks and Convolutional Neural Networks (CNN) have appeared in the last few years, with positive results in a variety of problems such as speech recognition, language modeling, translation, image classification and captioning, time series anomaly detection, and stock market prediction, to name a few (Liu et al., 2017, Kumar et al., 2018).

Some of the first machine learning applications to earthquake analysis appeared in the 1990s and used multilayer perceptrons or artificial neural networks for event detection and phase picking (Wang and Teng, 1995, Tiira, 1999, Zhao and Takano, 1999). In the following decades, along with the development of new techniques, several machine learning methods have been successfully applied. For instance, Asim et al. (2018a) used Support Vector Machine Regression and Hybrid Neural Networks to predict earthquake occurrences based on a combination of relevant seismic features such as the Gutenberg–Richter law, seismic rate changes, foreshock frequency, seismic energy release and total recurrence time, and Reyes et al. (2013) presented a Neural Network approach for earthquake prediction in Chile from 1960 to 2011, taking into account the seismic areas of Talca, Pichilemu, Santiago and Valparaíso.

Furthermore, the application of DNNs to seismological problems is at its dawn (Kriegerowski et al., 2019). Recent works have proposed DNNs for seismic analysis, most of them using CNN or LSTM networks. Kislov and Gravirov (2018) analyzed the potential of using Deep Neural Networks for the analysis of seismic data, where a DNN has a higher level of abstraction which, consequently, improves the generalization ability of the model. Zhou et al. (2019) introduced a hybrid CNN-RNN to detect earthquake events from seismogram signals. Linville et al. (2019) applied CNN and RNN to discriminate between quarry blasts and tectonic sources from event catalogs and the spectrograms of the sensors, with a performance higher than 99%. Kriegerowski et al. (2019) applied a CNN to accurately predict the hypocenter locations from full-waveform records of multiple stations. Perol et al. (2018) proposed the ConvNetQuake model, a CNN for earthquake detection and location from waveforms. Vijayasankari and Indhuja (2018) used LSTM networks for spatio-temporal earthquake forecasting. Geng et al. (2019) proposed a dilated causal temporal convolution network and a CNN-LSTM network to forecast seismic events. Huang et al. (2018) proposed projecting the seismic events onto a topographic map and generated a dataset of images in which earthquakes with magnitude $>$ 6 are marked with the label “1”. The authors used a CNN to detect and predict whether these large earthquake events will occur in the next 30 days. Li et al. (2019) proposed a method for seismic fault detection using a CNN; moreover, to augment the dataset, they developed an encoder–decoder CNN to enrich a very small training set. Wang et al. (2018) trained ResNets for seismic data antialiasing interpolation, where the model is used to reconstruct dense data with halved trace intervals. The generated data can be used to improve the accuracy of subsequent algorithms. On the other hand, Oliveira et al. (2018) assessed the performance of a conditional generative adversarial network to interpolate and generate seismic data. Recently, Plaza et al. (2019) used an LSTM for predicting the intensity function of a temporal ETAS model in Chile. LSTMs were also applied by Reyes et al. (2013), Wang et al. (2017), Asim et al. (2018b) and Vardaan et al. (2019) for the temporal and spatial prediction of earthquakes.

In this work we explore a new approach based on a Long Short Term Memory (LSTM) network (Hochreiter and Schmidhuber, 1997) and a Convolutional Neural Network (CNN) (LeCun et al., 1989) for predicting the intensity function and the probability that a seismic event with a given magnitude occurs in a certain area. In particular, we first use the seismic catalogue of Chile to estimate the ETAS model on a grid of 1 × 1 degree for the period 2000–2017 using three different threshold magnitudes ($\geq 4.0$, $\geq 5.0$ and $\geq 6.0$). Then, we take the spatial maximum value of the intensity and use an LSTM to predict the maximum rate of seismic event occurrence on the next day. A CNN is also applied to the same data to predict the probability that the maximum value of the intensity is in a certain area. For this goal we divide the data into 6 areas and a classification of the areas is provided. By crossing the outputs of the two neural networks we can predict the maximum intensity and the location of the next seismic event.
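
The way the two networks' training samples can be derived from the daily ETAS images is sketched below. This is a minimal illustration on synthetic data, not the authors' code: the helper names `make_lstm_windows` and `make_cnn_windows` are hypothetical, the 30-day window is the value stated in the abstract, the 42 × 15 grid shape follows the application section, and the split into six equal latitude bands is an assumption, since this excerpt does not say how the six areas are delimited.

```python
import numpy as np

WINDOW = 30  # days of history per sample (value stated in the abstract)


def make_lstm_windows(daily_max, window=WINDOW):
    """Pair each run of `window` daily maxima with the next day's maximum."""
    daily_max = np.asarray(daily_max, dtype=float)
    n = len(daily_max) - window
    if n <= 0:
        raise ValueError("series must be longer than the window")
    X = np.stack([daily_max[i:i + window] for i in range(n)])
    y = daily_max[window:]
    # Trailing axis: one feature per time step, the layout recurrent layers expect.
    return X[..., np.newaxis], y


def make_cnn_windows(images, area_of_cell, window=WINDOW):
    """Pair each run of `window` daily images with the area that contains
    the next day's maximum intensity.

    images:       (T, H, W) array of daily intensity grids
    area_of_cell: (H, W) integer array giving the area index of every cell
    """
    images = np.asarray(images, dtype=float)
    n = images.shape[0] - window
    if n <= 0:
        raise ValueError("image sequence must be longer than the window")
    X = np.stack([images[i:i + window] for i in range(n)])
    # Flat index of the maximum cell for each target day, mapped to its area.
    flat_argmax = images[window:].reshape(n, -1).argmax(axis=1)
    labels = np.asarray(area_of_cell).reshape(-1)[flat_argmax]
    return X, labels


# Synthetic stand-in for the ETAS intensity images (40 days, 42 x 15 cells).
rng = np.random.default_rng(0)
images = rng.random((40, 42, 15))
daily_max = images.max(axis=(1, 2))  # spatial maximum per day

# ASSUMPTION: six latitude bands of 7 rows each; the paper's areas may differ.
area_of_cell = np.repeat(np.arange(6), 7)[:, np.newaxis].repeat(15, axis=1)

X_lstm, y_lstm = make_lstm_windows(daily_max)
X_cnn, y_cnn = make_cnn_windows(images, area_of_cell)
print(X_lstm.shape, y_lstm.shape, X_cnn.shape, y_cnn.shape)
```

Building both sample sets from the same image sequence keeps the regression target (next-day maximum) and the classification target (area of that maximum) aligned day by day, which is what allows the two outputs to be crossed afterwards.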

This work is structured as follows. In Section 2 we present the modular methodology together with a brief description of the ETAS model and of the LSTM and CNN neural networks. In Section 3 we apply these models to data preprocessing and to the prediction of the intensity and location of future seismic events. Some results are shown in Section 4. Conclusions and further developments are provided in Section 5.

Section snippets

Methodology

The general purpose of this work is to use a DNN approach with a Long Short Term Memory (LSTM) network and a Convolutional Neural Network (CNN) for the conditional intensity prediction and classification of seismic events. To achieve that goal, three modules are developed: the data preprocessing module, the spatio-temporal estimation module, and the output module. The data preprocessing module processes the original data and prepares the inputs for the DNN models (LSTM and CNN). The

Application to the Chilean catalogue

As stated in Section 2.2, the preprocessing module based on the temporal ETAS model has been used for estimating the conditional intensity function in Chile in the period 2000–2017 on a $42^{\circ}$ by $15^{\circ}$ grid covering the area between latitudes $(-57, -15)$ and longitudes $(-80, -65)$. Each pixel of the grid has a size of $1^{\circ} \times 1^{\circ}$, which corresponds to an area of approximately $111\ \mathrm{km} \times 111\ \mathrm{km}$. Firstly, the parameters of the ETAS model were estimated over the whole area by considering seismic magnitudes

Results

Fig. 5 shows the maximum-intensity LSTM prediction results for events with magnitude greater than or equal to 6.0 (on the Richter scale). The figure shows that the LSTM is able both to identify patterns that characterize high-magnitude earthquakes and to predict the maximum intensity with a certain approximation. The goodness of fit of the model was confirmed by the coefficient of determination $R^{2}$, which was 0.70 for the training set and 0.66 for the testing set.
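
As a reminder of what this index measures, the sketch below computes $R^{2}$ in its usual form, one minus the ratio of the residual sum of squares to the total sum of squares. The excerpt does not state which exact definition or software routine the authors used, so the function `r_squared` and the toy numbers are illustrative assumptions only.

```python
import numpy as np


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    ss_res = np.sum((y_true - y_pred) ** 2)          # unexplained variation
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)   # variation around the mean
    return 1.0 - ss_res / ss_tot


# Toy values (not data from the paper).
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.1, 1.9, 3.2, 3.8]
score = r_squared(observed, predicted)
print(round(score, 2))
```

A value of 1 means the predictions reproduce the observations exactly, while a value of 0 means they do no better than always predicting the observed mean; a gap between the training and testing values indicates how much fit is lost on unseen data.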

In order to compare the

Discussion

In this work we propose a deep learning approach based on a Long Short Term Memory (LSTM) and a Convolutional Neural Network (CNN) for predicting the intensity and location of future seismic events in the Chilean catalogue with magnitudes greater than or equal to 4.0, 5.0, and 6.0. The results showed that LSTM can predict the maximum intensity of the seismic events with an $R^{2}$ of 0.66 on the testing set. This performance index considerably decreases when considering the LSTM prediction on events

Concluding remarks and further developments

As the prediction of seismic events is still an area that needs further development, this work constitutes a preliminary analysis of the joint use of Deep Learning methods, such as LSTM and CNN. The results can be easily extended for predicting the number of events or the maximum magnitude in a certain area. Also, this work establishes a baseline from which the proposed DL model could be further improved by incorporating some exogenous variables such as the seismic depth, crustal

Acknowledgments

Orietta Nicolis was partially supported by the National Research Center for Integrated National Disaster Management (CIGIDEN) ANID/FONDAP/15110017, by the Andres Bello University grant DI-03-19/R, and by the national grant FONDECYT Regular ID1201478. Francisco Plaza was partially supported by the CONICYT-PFCHA/DOCTORADO-BECAS-CHILE/2018–21182037.

References (61)

Asim K.M. et al. Seismic indicators based earthquake predictor system using genetic programming and AdaBoost classification. Soil Dyn. Earthq. Eng. (2018)
Kumar J. et al. Long short term memory recurrent neural network (LSTM-RNN) based workload forecasting model for cloud datacenters. Procedia Comput. Sci. (2018)
Liu W. et al. A survey of deep neural network architectures and their applications. Neurocomputing (2017)
Nicolis O. et al. Windowed ETAS models with application to the Chilean seismic catalogs. Spatial Stat. (2015)
Reyes J. et al. Neural networks to predict earthquakes in Chile. Appl. Soft Comput. (2013)
Tiira T. Detecting teleseismic events using artificial neural networks. Comput. Geosci. (1999)
Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., Isard, M., Kudlur,...
Abdel-Hamid O. et al. Exploring convolutional neural network structures and optimization techniques for speech recognition
Allen C.R. Responsibilities in earthquake prediction: to the seismological society of America, delivered in Edmonton, Alberta, May 12, 1976. Bull. Seismol. Soc. Am. (1976)
Asim K.M. et al. Earthquake prediction model using support vector regressor and hybrid neural networks. PLoS One (2018)

Bansal A. et al. A non-stationary epidemic type aftershock sequence model for seismicity prior to the December 26, 2004 M 9.1 Sumatra-Andaman Islands mega-earthquake. J. Geophys. Res. Solid Earth (2013)
Budnitz R. et al. Recommendations for probabilistic seismic hazard analysis: Guidance on uncertainty and use of experts. Technical Report (1997)
Cady F. The Data Science Handbook (2017)
Chiodi M. et al. Mixed non-parametric and parametric estimation techniques in R package etasFLP for earthquakes’ description. J. Stat. Softw. (2017)
Chollet F. et al. R interface to Keras (2017)
Chollet F. Keras (2015)
Cimellaro G.P. et al. Earthquake prediction
Geng Y. et al. Seismic events prediction using deep temporal convolution networks. J. Electr. Comput. Eng. (2019)
Goodfellow I. et al. Deep Learning (2016)
Goodfellow I. et al. Generative adversarial nets
Guo Y. et al. An improved space-time ETAS model for inverting the rupture geometry from seismicity triggering. J. Geophys. Res. Solid Earth (2015)
Harte D. PtProcess: An R package for modelling marked point processes indexed by time. J. Stat. Softw. (2010)
Hochreiter S. et al. Long short-term memory. Neural Comput. (1997)
Huang J. et al. Large earthquake magnitude prediction in Taiwan based on deep learning neural network. Neural Netw. World (2018)
Jiao P. et al. Artificial intelligence in seismology: Advent, performance and future trends. Geosci. Front. (2019)
Joffe H. et al. Stigma in science: the case of earthquake prediction. Disasters (2018)
Kislov K.V. et al. Deep artificial neural networks as a tool for the analysis of seismic data. Seism. Instrum. (2018)
Kriegerowski M. et al. A deep convolutional neural network for localization of clustered earthquakes based on multistation full waveforms. Seismol. Res. Lett. (2019)
Kumazawa T. et al. Nonstationary ETAS models for nonstandard earthquakes. Ann. Appl. Stat. (2014)
LeCun Y. et al. Backpropagation applied to handwritten zip code recognition. Neural Comput. (1989)

