Elsevier

Spatial Statistics

Volume 42, April 2021, 100442
Spatial Statistics

Prediction of intensity and location of seismic events using deep learning

https://doi.org/10.1016/j.spasta.2020.100442Get rights and content

Abstract

The object of this work is to predict the seismic rate in Chile by using two Deep Neural Network (DNN) architectures, Long Short Term Memory (LSTM) and Convolutional Neural Networks (CNN). For this, we propose a methodology based on a three-module approach: a pre-processing module, a spatial and temporal estimation module, and a prediction module. The first module considers the Epidemic-Type Aftershock Sequences (ETAS) model for estimating the intensity function, which will be used for estimating the seismic rate on a 1 × 1 degree grid providing a sequence of daily images covering all the seismic area of Chile. The spatial and temporal estimation module uses the LSTM and CNN for predicting the intensity and the location of earthquakes. The last module integrates the information provided by the DNNs for predicting future values of the maximum seismic rate and their location. In particular, the LSTM will be trained using the maximum intensity of the last 30 days as input for predicting the maximum intensity of the next day, and the CNN will be trained on the last 30 images provided by the application of the ETAS model for predicting the probability that the next day the maximum event will be in certain area of Chile. Some performance indexes (such as R2 and accuracy) will be used for validating the proposed models.

Introduction

Earthquakes represent one of the most destructive yet unpredictable natural disasters along the world, with a massive physical, psychological and economical impact in the population worldwide. Chile is considered one of most seismic-active country in the world, having the world’s largest instrumentally documented earthquake occurred in Valdivia (on May 22, 1960), as well as recently been affected by three major earthquakes with magnitudes > 8.0 (on Richter scale). Thus, to have a better approximation or additional information on where and when an event of that magnitude could occur, it would represent an invaluable tool for managing and designing public policies regarding natural disasters. However, earthquake prediction is a very challenging task due to the high complexity associated to the process itself, and also due to the fact that their occurrences depend on a multitude of variables, that in most cases could be unidentified (Sobolev, 2015, Cimellaro and Marasco, 2018, Joffe et al., 2018). Most of the proposed prediction models are focused on some form of seismic hazard estimation (Budnitz et al., 1997, Woessner et al., 2015, Petersen et al., 2018). This term is defined as the probability that an earthquake will occur in a given geographic area, within a given window of time, and with a magnitude exceeding a given threshold. In fact, according to Allen (1976) and Joffe et al. (2018), a valid approach for earthquake prediction should consider a spatio-temporal window, a magnitude estimation, a scientific sound validation process, and an appropriate visualization procedure.

Self-exciting point process models have become essential components in the assessment of seismic hazard. A particular class is given by the Epidemic Time Aftershock Sequence (ETAS) models which have proven to be extremely useful in the description and modeling of earthquake occurrence times and locations. In that sense, Ogata, 1988, Ogata, 1998, introduced ETAS models for temporal and spatio-temporal seismic hazard estimation, respectively. Those models use a given parametrization of the conditional intensity function associated with the occurrence rate of an earthquake and its triggering function at time t and within an (x,y) location. Aftershocks are then estimated following the seismic aftershock propagation law, or Omori’s law (Utsu, 1961). Recently, many improvements and extensions have been proposed for incorporating local features of the seismic events (Lombardi et al., 2010, Ogata, 2011, Bansal and Ogata, 2013, Kumazawa and Ogata, 2014, Guo et al., 2015, Nicolis et al., 2015).

Although ETAS models have shown to be very useful for estimating the triggering earthquakes, they often fail when predicting future events. Nicolis et al. (2017) show that the ETAS normally underestimate the real number of seismic events, depending on the precise time that the main shock happened. For solving this problem, they introduced a correction factor that takes into account when the main earthquake happens just before the forecasting day. They also show the superiority of the temporal ETAS for predicting future values of the intensity respect the spatio-temporal ETAS model.

Joffe et al. (2018) stated that contemporary techniques are insufficiently sensitive to allow for precise modeling of future earthquake events. This raises the importance for new approaches that consider broader and bigger sources of information. In that sense, Deep Neural Networks (DNN) have state-of-art accuracy for most of the problems where statistical learning models are applied and where a precise mathematical formulation is hard to obtain. Moreover, DNN architectures, like Long Short Term Memory (LSTM) networks and Convolutional Neural Networks (CNN) have appeared in the last few years, with positive results in a variety of problems such as speech recognition, language modeling, translation, image classification and captioning, time series anomaly detection, stock market prediction, to name a few (Liu et al., 2017, Kumar et al., 2018).

Some of the first machine learning applications on earthquake analysis appeared in the 1990s, and used multilayer perceptrons or artificial neural networks for event detection and phase picking (Wang and Teng, 1995, Tiira, 1999, Zhao and Takano, 1999). In the next decades, along with the further development of new techniques, several new machine learning methods have been successfully applied, for instance, Asim et al. (2018a) used Support Vector Machine Regression and Hybrid Neural Networks to predict earthquake occurrences based on a combination of relevant seismic features such as Gutenberg–Richter law, seismic rate changes, foreshock frequency, seismic energy release and total recurrence time, and Reyes et al. (2013) showed a Neural Network approach for earthquake prediction in Chile from 1960 to 2011, taking into account seismic areas of Talca, Pichilemu, Santiago and Valparaíso.

Furthermore, the application of DNN to seismological problems is at its dawn (Kriegerowski et al., 2019). In that sense, recent works proposed the application of DNNs for seismic analysis, most of these works used CNN or LSTM networks. Kislov and Gravirov (2018) analyzed the potential of using Deep Neural Networks for the analysis of seismic data, where DNN has a higher level of abstraction which, consequently, improves the generalization ability of the model. Zhou et al. (2019) introduced a hybrid CNN-RNN to detect earthquake events from seismograms signals. Linville et al. (2019) applied CNN and RNN to discriminate between quarry blasts and tectonic sources from event catalogs and the spectrograms of the sensors, with a performance higher than 99%. Kriegerowski et al. (2019) have applied CNN to accurately predict de hypocenter locations from full-waveform records of multiple stations. Perol et al. (2018) proposed the ConvNetQuake model, a CNN for earthquake detection and location from waveforms. Vijayasankari and Indhuja (2018) used LSTM networks for spatio-temporal earthquake forecast. Geng et al. (2019) proposed a dilated causal temporal convolution network and a CNN-LSTM network to forecast seismic events. Huang et al. (2018) proposed to project the seismic events into a topographic map and generated a dataset of images where the earthquake with magnitude > 6 is marked with a label “1”. The authors used a CNN to detect and predict if these large earthquake events will occur in the next 30 days. Li et al. (2019) proposed a method for seismic fault detection using a CNN, moreover, to augment the dataset they developed an encoder–decoder CNN to enrich very small training set. Wang et al. (2018) trained the ResNets for seismic data antialiasing interpolation, where the model is used to reconstruct dense data with halved trace intervals. The generated data can be used to improve the accuracy of subsequent algorithms. On the other hand, Oliveira et al. (2018) assessed the performance of a conditional generative adversarial network to interpolate and generate seismic data. Recently, Plaza et al. (2019) used a LSTM for predicting the intensity function of a temporal ETAS in Chile. LSTM were also applied by Reyes et al., 2013, Wang et al., 2017, Asim et al., 2018b and Vardaan et al. (2019) for the temporal and spatial prediction of earthquakes.

In this work we explore a new approach based on a Long Short Term Memory (LSTM) network (Hochreiter and Schmidhuber, 1997) and a Convolutional Neural Network (CNN) (LeCun et al., 1989)) for predicting the intensity function and the probability that a seismic event with a given magnitude occurs in a certain area. In particular, first we use the seismic catalogue of Chile for estimating the ETAS model on a grid of 1 × 1 degree for the period 2000–2017 using three different threshold magnitudes (4.0, 5.0 and 6.0). Then, we take the spatial maximum value of the intensity and we use a LSTM for predicting the maximum rate of seismic event occurrence on the next day. A CNN is also applied to the same data for predicting the probability that the maximum value of intensity is in a certain area. For this goal we divide the data in 6 areas and a classification of the areas are provided. By crossing the outputs of the two neural networks we can predict the maximum intensity and the location of the next seismic event.

This work is structured as follows. In Section 2 we briefly show the methodological modular approach with a brief description of the ETAS model, and the LSTM and CNN neural networks. In Section 3 we apply the latter models for data preprocessing and intensity and location prediction of future seismic events. Some results are shown in Section 4. Conclusions and further developments are provided in Section 5.

Section snippets

Methodology

The general purpose for this work is to use a DNN approach with a Long Short Term Memory (LSTM) network and a Convolutional Neural Network (CNN) for the conditional intensity prediction and classification of seismic events. To achieve that goal, three modules are developed, the data preprocessing module, the spatio-temporal estimation module, and the output module. The data preprocessing module processes the original data and prepare the inputs for the DNN models (LSTM and CNN). The

Application to the Chilean catalogue

As stated in Section 2.2, the preprocessing module based on the temporal ETAS model has been used for estimating the conditional intensity function in Chile in the period 2000–2017 on a 42° by 15° grid including the area between latitudes (57,15) and longitudes (80,65). Each pixel of the grid had a size of 1°×1° which represents an area of approximately 111kmm×111km for each pixel. Firstly, the parameters of the ETAS model had been estimated on all area by considering seismic magnitudes

Results

Fig. 5 represents the maximum intensity LSTM prediction results for events with magnitude greater than or equal to 6.0 (on Richter scale). The figure shows that the LSTM is able to both identify patterns that characterize high-magnitude earthquakes, and to predict the maximum intensity with a certain approximation. The goodness of fit of the model was confirmed by the R2 determinant coefficient which resulted 0.70 for the training set, and 0.66 for the testing set.

In order to compare the

Discussion

In this work we propose a deep learning approach based on a Long Short Term Memory (LSTM) and a Convolutional Neural Network (CNN) for predicting the intensity and location of future seismic events in the Chilean catalogue with magnitudes greater than or equal to 4.0, 5.0, and 6.0. The results showed that LSTM can predict the maximum intensity of the seismic events with an R2 of 0.66 on the testing set. This performance index considerably decreases when considering the LSTM prediction on events

Concluding remarks and further developments

As the prediction of seismic events still constitutes an area that needs further development, this work constitutes a preliminary analysis on the joint use of Deep Learning methods, such as LSTM and CNN. The results can be easily extended for predicting the number of events or the maximum magnitude in a certain area. Also, this work establishes a baseline from which the proposed DL model could be further improved by incorporating some exogenous variables such as the seismic depth, crustal

Acknowledgments

Orietta Nicolis was partially supported by the National Research Center for Integrated National Disaster Management (CIGIDEN)ANID/FONDAP/15110017, by the Andres Bello University grant DI-03-19/R, and by the national grant FONDECYT Regular ID1201478. Francisco Plaza was partially supported by the CONICYT-PFCHA/DOCTORADO-BECAS-CHILE/201821182037.

References (61)

  • BansalA. et al.

    A non-stationary epidemic type aftershock sequence model for seismicity prior to the December 26, 2004 M 9.1 Sumatra-Andaman Islands mega-earthquake

    J. Geophys. Res. Solid Earth

    (2013)
  • BudnitzR. et al.

    Recommendations for probabilistic seismic hazard analysis: Guidance on uncertainty and use of expertsTechnical Report

    (1997)
  • CadyF.

    The Data Science Handbook

    (2017)
  • ChiodiM. et al.

    Mixed non-parametric and parametric estimation techniques in R package etasFLP for earthquakes’ description

    J. Stat. Softw.

    (2017)
  • CholletF. et al.

    R interface to Keras

    (2017)
  • CholletF.

    Keras

    (2015)
  • CimellaroG.P. et al.

    Earthquake prediction

  • GengY. et al.

    Seismic events prediction using deep temporal convolution networks

    J. Electr. Comput. Eng.

    (2019)
  • GoodfellowI. et al.

    Deep Learning

    (2016)
  • GoodfellowI. et al.

    Generative adversarial nets

  • GuoY. et al.

    An improved space-time ETAS model for inverting the rupture geometry from seismicity triggering

    J. Geophys. Res. Solid Earth

    (2015)
  • HarteD.

    PtProcess: An R package for modelling marked point processes indexed by time

    J. Stat. Softw.

    (2010)
  • HochreiterS. et al.

    Long short-term memory

    Neural Comput.

    (1997)
  • HuangJ. et al.

    Large earthquake magnitude prediction in Taiwan based on deep learning neural network

    Neural Netw. World

    (2018)
  • JiaoP. et al.

    Artificial intelligence in seismology: Advent, performance and future trends

    Geosci. Front.

    (2019)
  • JoffeH. et al.

    Stigma in science: the case of earthquake prediction

    Disasters

    (2018)
  • KislovK.V. et al.

    Deep artificial neural networks as a tool for the analysis of seismic data

    Seism. Instrum.

    (2018)
  • KriegerowskiM. et al.

    A deep convolutional neural network for localization of clustered earthquakes based on multistation full waveforms

    Seismol. Res. Lett.

    (2019)
  • KumazawaT. et al.

    Nonstationary ETAS models for nonstandard earthquakes

    Ann. Appl. Stat.

    (2014)
  • LeCunY. et al.

    Backpropagation applied to handwritten zip code recognition

    Neural Comput.

    (1989)
  • Cited by (28)

    • End-to-end LSTM based estimation of volcano event epicenter localization

      2022, Journal of Volcanology and Geothermal Research
      Citation Excerpt :

      Nonetheless, to overcome this problem, a recurring alternative known as LSTM (long short-term memory) was proposed in (Hochreiter and Schmidhuber, 1997). LSTM has been used for the recognition system of volcanic-seismic events (Titos et al., 2018; Canario et al., 2020), identification of P waves in tectonic seismic data (Zhu et al., 2019), prediction of seismic events (Nicolis et al., 2021), earthquake signal detection (Mousavi et al., 2019), and magnitude estimation (Mousavi and Beroza, 2020a). In this paper, LSTM is employed to estimate the epicenter of volcano events by exploring the LSTM inner structure to capture the relevant characteristics of the wave signal to achieve this goal, i.e. the occurrence of P and S waves, on an end-to-end basis.

    • Dynamic seismic damage assessment of distributed infrastructure systems using graph neural networks and semi-supervised machine learning

      2022, Advances in Engineering Software
      Citation Excerpt :

      The American Lifelines Alliance model [5] uses PGV as the intensity measure, and the effects of pipe material, joint type, and soil corrosiveness are considered when computing the fragility for the pipelines. In the civil engineering community, researchers have been exploring the application of machine learning methods for estimating the seismic response, detecting/classifying damage and predicting earthquake events [6–9]. Sun et al. [10] summarizes the application of machine learning in structural design and performance assessment into four categories: (1) predicting structural response and performance, (2) models developed using data from physical experiments, (3) information retrieval using images and written text and (4) models developed using field reconnaissance and structural health monitoring data.

    • A long short-term memory based deep learning algorithm for seismic response uncertainty quantification

      2022, Probabilistic Engineering Mechanics
      Citation Excerpt :

      Over the last few years, CNN has been implemented in nonlinear dynamic response quantification [25–27]. Various CNN based approaches for detection, visualization and declaration of earthquake events have also been proposed [28–30]. However, these studies used CNN as a feature extractor mostly to conduct classification and recognition tasks.

    View all citing articles on Scopus
    View full text