L-moments for automatic threshold selection in extreme value analysis

Silva Lomba, Jessica; Fraga Alves, Maria Isabel

doi:10.1007/s00477-020-01789-x

L-moments for automatic threshold selection in extreme value analysis

Original Paper
Published: 23 March 2020

Volume 34, pages 465–491, (2020)
Cite this article

Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

735 Accesses
12 Citations
1 Altmetric
Explore all metrics

Abstract

In extreme value analysis, sensitivity of inference to the definition of extreme event is a paramount issue. Under the peaks-over-threshold approach, this translates directly into the need of fitting a Generalized Pareto distribution to observations above a suitable level that balances bias versus variance of estimates. Selection methodologies established in the literature face recurrent challenges such as an inherent subjectivity or high computational intensity. We suggest a truly automated method for threshold detection, aiming at time efficiency and elimination of subjective judgment. Based on the well-established theory of L-moments, this versatile data-driven technique can handle batch processing of large collections of extremes data, while also presenting good performance on small samples. The technique’s performance is evaluated in a large simulation study and illustrated with significant wave height data sets from the literature. We find that it compares favorably to other state-of-the-art methods regarding the choice of threshold, associated parameter estimation and the ultimate goal of computationally efficient return level estimation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Computational Study of the Adaptive Estimation of the Extreme Value Index with Probability Weighted Moments

Adaptive Choice of Thresholds and the Bootstrap Methodology: An Empirical Study

Multiple thresholds in extremal parameter estimation

Article 03 October 2018

References

Alonso MA, de Zea Bermudez P, Scotto MG (2014) Comparing generalized Pareto models fitted to extreme observations: an application to the largest temperatures in Spain. Stoch Env Res Risk Assess 28:1221–1233
Article Google Scholar
Bader B, Yan J, Zhang X (2018) Automated threshold selection for extreme value analysis via ordered goodness-of-fit tests with adjustment for false discovery rate. Ann Appl Stat 12(1):310–329
Article Google Scholar
Bader B, Yan J (2016) eva: Extreme value analysis with goodness-of-fit testing. R package version 0.2.4. Retrieved from https://CRAN.R-project.org/package=eva
Beirlant J, Dierckx G, Guillou A, Stǎricǎ C (2002) On exponential representations of log-spacings of extreme order statistics. Extremes 5:157–180
Article Google Scholar
Bernardara P, Schertzer D, Sauquet E, Tchiguirinskaia I, Lang M (2008) The flood probability distribution tail: how heavy is it? Stoch Env Res Risk Assess 22:107–122
Article Google Scholar
Choulakian V, Stephens MA (2001) Goodness-of-fit tests for the generalized Pareto distribution. Technometrics 43(4):478–484
Article Google Scholar
Clauset A, Shalizi CR, Newman MEJ (2009) Power-law distributions in empirical data. SIAM Rev 51:661–703
Article Google Scholar
Coles S (2001) An introduction to statistical modeling of extreme values. Springer, London
Book Google Scholar
Danielsson J, de Haan L, Peng L, de Vries CG (2001) Using a bootstrap method to choose the sample fraction in tail index estimation. J Multivar Anal 76:226–248
Article Google Scholar
Danielsson J, Ergun LM, de Haan L, de Vries C (2016) Tail index estimation: quantile driven threshold selection. Systemic Risk Centre’s Discussion Paper Series, SRC Discussion Paper 58, pp 1–74. Retrieved from SRC: http://eprints.lse.ac.uk/id/eprint/66193
Deidda R (2010) A multiple threshold method for fitting the generalized Pareto distribution to rainfall time series. Hydrol Earth Syst Sci 14:2559–2575
Article Google Scholar
Deidda R, Puliga M (2009) Performances of some parameter estimators of the generalized Pareto distribution over rounded-off samples. Phys Chem Earth 34:626–634
Article Google Scholar
Ferreira A, de Haan L, Peng L (2003) On optimising the estimation of high quantiles of a probability distribution. Statistics 37(5):401–434
Article Google Scholar
Greenwood JA, Landwehr JM, Matalas NC, Wallis JR (1979) Probability weighted moments: definition and relation to parameters of several distributions expressable in inverse form. Water Resour Res 15(5):1049–1054
Article Google Scholar
G’Sell MG, Wager S, Chouldechova A, Tibshirani R (2016) Sequential selection procedures and false discovery rate control. J R Stat Soc B 78(2):423–444
Article Google Scholar
Hosking JRM (1986) The theory of probability weighted moments. Research Report RC12210-IBM Research
Hosking JRM (2006) On the characterization of distributions by their L-moments. J Stati Plan Inference 136:193–198
Article Google Scholar
Hosking JRM, Balakrishnan N (2015) A uniqueness result for L-estimators, with applications to L-moments. Stat Methodol 24:69–80
Article Google Scholar
Hosking JRM, Wallis JR (1987) Parameter and quantile estimation estimation for the generalized Pareto distribution. Technometrics 29(3):339–349
Article Google Scholar
Hosking JRM, Wallis JR (1997) Regional frequency analysis: an approach based on L-moments. Cambridge University Press, Cambridge
Book Google Scholar
Langousis A, Mamalakis A, Puliga M, Deidda R (2016) Threshold detection for the generalized Pareto distribution: review of representative methods and application to the NOAA NCDC daily rainfall database. Water Resour Res 52:2659–2681
Article Google Scholar
Lee J, Fan Y, Sisson SA (2015) Bayesian threshold selection for extremal models using measures of surprise. Comput Stat Data Anal 85:84–99
Article Google Scholar
Mamalakis A, Langousis A, Deidda R, Marrocu M (2017) A parametric approach for simultaneous bias correction and high-resolution downscaling of climate model rainfall. Water Resour Res 53:2149–2170
Article Google Scholar
Manurung A, Wigena AH, Djuraidah A (2018) GPD threshold estimation using measure of surprise. Int J Sci Basic Appl Res 44(3):16–25
Google Scholar
Northrop PJ, Attalides N (2017) threshr: threshold selection and uncertainty for extreme value analysis. R package version 1.0.0. Retrieved from https://CRAN.R-project.org/package=threshr
Northrop PJ, Coleman CL (2014) Improved threshold diagnostic plots for extreme value analyses. Extremes 17:289–303
Article Google Scholar
Northrop PJ, Attalides N, Jonathan P (2017) Cross-validatory extreme value threshold selection and uncertainty with application to ocean storm severity. J R Stat Soc Ser C Appl Stat 66(1):93–120
Article Google Scholar
Pickands J (1975) Statistical inference using extreme order statistics. Ann Stat 3:119–131
Article Google Scholar
Ribatet MA (2011) A user’s guide to the POT package (version 1.4). Retrieved from http://cran.r-project.org/package=POT
Scarrott C, MacDonald A (2012) A review of extreme value threshold estimation and uncertainty quantification. REVSTAT Stat J 10(1):33–60
Google Scholar
Schneider LF, Krajina A, Krivobokova T (2019) Threshold selection in univariate extreme value analysis. arXiv:1903.02517
Silva Lomba J (2016) Extreme value analysis of competitive freediving records. Master’s thesis, Faculty of Sciences, University of Lisbon. http://hdl.handle.net/10451/24672
Thompson P, Cai Y, Reeve D, Stander J (2009) Automated threshold selection methods for extreme wave analysis. Coast Eng 56:1013–1021
Article Google Scholar
Um M, Cho W, Heo J (2010) A comparative study of the adaptive choice of thresholds in extreme hydrologic events. Stoch Env Res Risk Assess 24(5):611–623
Article Google Scholar
Wadsworth J (2016) Exploiting structure of maximum likelihood estimators for extreme value threshold selection. Technometrics 58(1):116–126
Article Google Scholar
Wadsworth J, Tawn J (2012) Likelihood-based procedures for threshold diagnostics and uncertainty in extreme value modelling. J R Stat Soc Ser B (Stat Methodol) 74:543–567
Article Google Scholar
Withers CS, Nadarajah S (2011) Bias-reduced estimates for skewness, kurtosis, L-skewness and L-kurtosis. J Stat Plan Inference 141:3839–3861
Article Google Scholar

Download references

Acknowledgements

Partial support of the Fundação para a Ciência e a Tecnologia, I.P. under doctoral grant SFRH/BD/130764/2017 (J.S.L.) and through Project UIDB/00006/2020 (J.S.L. and M.I.F.A.) is gratefully acknowledged. We would also like to thank Dr. Kate J. E. Lee and Dr. Paul J. Northrop for kindly providing us with the computer code for the referred alternative methodologies, as well as the editor and anonymous reviewers for their insightful suggestions and remarks that helped improve this work.

Author information

Authors and Affiliations

CEAUL and DEIO, Faculty of Sciences, University of Lisbon, Lisbon, Portugal
Jessica Silva Lomba & Maria Isabel Fraga Alves

Authors

Jessica Silva Lomba
View author publications
You can also search for this author in PubMed Google Scholar
Maria Isabel Fraga Alves
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Maria Isabel Fraga Alves.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendixes

1.1 Appendix 1: ALRSM distances for the hybrid illustrative sample

See Table 6.

Table 6 Automatic selection procedure for a simulated sample of size $n=500$ from a Hybrid($0.75,\,0.2$) distribution, with sample quantile threshold candidates $\left\{ u_i\right\} _{i=1}^{20}$, as in (6) : minimum Euclidean distance $d_{u_i}$ between each point $( t _{3,u_i}, t _{4,u_i})$ for the excesses over each candidate threshold $u_i$ and the GPd theoretical curve (4). Selected threshold, $i=14$, in bold

Full size table

1.2 Appendix 2: Complementary simulation study: GEVd data

In the interest of briefly analysing the performance of the proposed methodology against the presented alternatives in a broader non-GPd scenario, we performed a complementary simulation study to that presented in Sect. 3, using samples drawn from distributions without an exact GPd tail. For this succinct presentation, we considered two GEVd with tail weights given by two of the previously used EVIs—again, both heavy and light tail scenarios were treated. The GEV cumulative distribution function is generally given in terms of its location $\mu \in \mathbb {R}$, scale $\sigma >0$ and shape $\xi \ne 0$ parameters as ${\text {GEV}}(x|\mu ,\,\sigma ,\,\xi ):= \exp \left\{ -\left( 1+\xi \,\frac{x-\mu }{\sigma }\right) ^{-\nicefrac {1}{\xi }}\right\}$, while for the case $\xi =0$ it is taken as the limit of this expression. The remaining parameters were procured as to have equivalence, after some point, of the new distributions with the tail of the Hybrid($0.75,\,\xi$) distributions previously used—this serves the purpose of comparability of results under both studies. Thus, location and scale of the GEVd were defined as functions of $\xi$, respectively as

$$\begin{aligned} \mu =0.75+\frac{1-0.75}{\xi }\left[ \left( \frac{1}{1-0.75}\right) ^{-\xi }-1\right] \;, \qquad \sigma =\left( 1-0.75\right) ^{\xi +1}\;. \end{aligned}$$

(11)

Note, however, that the GEVd does not carry a true threshold, as the Hybrid did, and as such assessment of selection results is more problematic: although we show the mean selected thresholds for each case in Fig. 15, given the lack of interpretability for this case, we choose to focus mainly on the selections’ performance regarding estimation of the EVI and $p=0.001$ exceedance probability quantile.

Analogously to the scheme presented before, estimated entities were computed from a set of 1500 samples generated according to each combination of the proposed scenarios:

GEVd tail shape parameter: $\xi =-0.2$; $\xi =0.2$;
Sample size: $n=1000$; $n=500$; $n=200$;
Sets of candidate thresholds: $I=10$ as in (5); $I=20$ as in (6).

Analysis of the results comprised in Figs. 15 and 16 yields very similar conclusions to the study presented in the main text (colour scheme remains as green for the ALRSM, red for the aSTSM and yellow for the SGFSM). We firstly observe, from Fig. 15, that the mean selected threshold by the ALRSM is consistently higher than that of the competing SGFSM and aSTSM for every scenario posed. In terms of proximity of estimated EVI and 99.9% quantile to their true values, the proposed methodology clearly shows superior performance, in what regards the smaller bias values depicted in Fig. 16a and closer to 1 values for the ratios shown in Fig. 16c. Furthermore, the RMSE associated with estimation of these parameters also appears to be competitive against the alternative selection methodologies, as can be assessed in Fig. 16b, d, resp., corresponding to estimation of the EVI and 99.9% quantile.

In Tables 7 and 8 we can see that the conclusions in Sect. 3 concerning the computational efficiency of the methodologies remain true when dealing with these samples: the ALRSM is, among the three considered, by far the fastest methodology to perform the complete 1500 simulations, and although it shows some failures (2.6% in the worst case), as previously mentioned, these are due to the fitting methodology used, contrary to the selection failures of the SGFSM—which is, once again, the most direct competitor to the ALRSM in terms of bias and RMSE of estimates.

Thus, general conclusions regarding validity of our automated method remain valid for these additional scenarios, even without the possibility of evaluating the threshold selection in itself.

Table 7 Total run time, in seconds, of threshold selection and parameter estimation for 1500 simulated samples with size $n=1000,\,500,\,200$ from the GEV($\mu ,\,\sigma ,\,\xi$) with $\xi =-0.2,\,0.2$, $\mu$ and $\sigma$ as in Eq. (11) and sample quantile threshold candidates $\left\{ u_i\right\} _{i=1}^{I}$, $I=10,\,20$; best run times for each scenario in bold

Full size table

Table 8 Total failure count for 1500 simulated samples with size $n=1000,\,500,\,200$ from the GEV($\mu ,\,\sigma ,\,\xi$) with $\xi =-0.2,\,0.2$, $\mu$ and $\sigma$ as in Eq. (11) and sample quantile threshold candidates $\left\{ u_i\right\} _{i=1}^{I}$, $I=10,\,20$

Full size table

1.3 Appendix 3: Complementary simulation study: quantized data

A known issue in estimation under the GPd is, as mentioned before, the effect of data quantization. Several authors have addressed this problem more or less extensively, from different perspectives which we can find amongst the already referred works. For example, Bader et al. (2018) attempt to “fix” the quantized data by applying a jittering procedure previous to the application of their threshold selection procedure—the objective is to approximate the data to that obtained from a continuous process, in order to use the known results for continuous distributions without further issues. On the other hand, other works such as that of Deidda (2010) choose to develop methods that can deal with quantized samples in themselves, that is, that perform their task well whether the data is discretized or not. Although it is our view that the first approach would be the most indicated to deal with the problem, in this brief simulation study we will assess the robustness of the proposed ALRSM in the face of considerably discretized data. However, we limit the investigation to rounded samples from the Hybrid distribution, for the reasons presented in Sect. 3.

For illustration purposes, consider again the Hybrid($0.75,\,0.2$) sample of dimension $n=500$ used in the examples of Sects. 2 and 3, Figs. 3 and 4 and Table 1. The data was used with the machine precision of the R software. At this point, we round the data points to 2 decimal places, and consider this to be our quantized sample, with a quantization level $\Delta =0.01$, which we feel is an appropriate representation for a variety of real world discretized data sets (note that the real data sets used in Sect. 4 were recorded to 3 decimal places). Figure 17 is parallel to Fig. 3 of the main text: it shows the LMRD and the selection process for the same quantized sample, and as we can see the results are analogous, hinting at robustness of the ALRSM against quantization. As before, the heuristic’s distances are comprised in Table 9.

Table 9 Automatic selection procedure for a simulated sample of size $n=500$ from a Hybrid($0.75,\,0.2$) distribution, quantized with $\Delta =0.01$, with sample quantile threshold candidates $\left\{ u_i\right\} _{i=1}^{20}$, as in (6) : minimum Euclidean distance $d_{u_i}$ between each point $( t _{3,u_i}, t _{4,u_i})$ for the excesses over each candidate threshold $u_i$ and the GPd theoretical curve (4). Selected threshold, $i=14$, in bold

Full size table

Analogously to the study in Sect. 3, we will more thoroughly evaluate performance of the methodologies in selecting the threshold and estimating the EVI and 99.9% quantile, regarding bias and RMSE of estimation, computed from the same set of 1500 Hybrid($0.75,\,\xi$) samples, now subjected to rounding (to the second decimal place, as mentioned—$\Delta =0.01$), generated according to each combination of the proposed scenarios:

GP tail shape parameter: $\xi =-0.2$; $\xi =0.2$;
Sample size: $n=1000$; $n=500$; $n=200$;
Sets of candidate thresholds: $I=10$ as in (5); $I=20$ as in (6).

Looking at Fig. 18 we can see that the ALRSM again outperforms the competitors in terms of threshold selection bias and RMSE across the considered sample sizes and number of candidates, as well as for both heavy and bounded tails. From comparison with Fig. 5 of the main text, the suggested methodology appears to be robust to quantization of data in this point, while the SGFSM seems to show the most impacted selection performance, being outdone by even the aSTSM in several cases.

Concerning EVI estimation, Fig. 19a, b shows once more close similarity to the bottom and middle panels of Fig. 6, with common underestimation of the shape parameter to the three selection methodologies. Again, both in terms of bias and RMSE, the ALRSM performs either better or comparably well to the alternatives for every considered scenario—EVI estimation through these selections does not appear here to be very sensitive to the quantization level applied.

For the heavy tail samples ($\xi =0.2$), estimation of the 99.9% quantile seems to have suffered some impact from the discretization, when we compare Fig. 19c with Fig. 7b. Here, the ALRSM seems to be slightly more sensitive to the rounding of data than the SGFSM. However, it still performs satisfactorily well, specially for the bounded tail case and regarding robustness against sample size and number of candidate thresholds. Estimation RMSE for this return level, seen in Fig. 19d, is also comparable between the 3 processes, across the proposed scenarios and similar to the depicted in Fig. 8b for the continuous case.

Conclusions drawn in Sect. 3 (and also Appendix 2) regarding superiority of the ALRSM, in what concerns computational efficiency, over the competing selection methodologies, remains true when considering this quantized data scenario, both in terms of total run time and number of failed simulations. Corresponding results are summarized in Tables 10 and 11.

Table 10 Total run time, in seconds, of threshold selection and parameter estimation for 1500 simulated samples with size $n=1000,\,500,\,200$ from a Hybrid($0.75,\,\xi$), with $\xi =-0.2,\,0.2$, quantized with $\Delta =0.01$, and sample quantile threshold candidates $\left\{ u_i\right\} _{i=1}^{I}$, $I=10,\,20$; best run times for each scenario in bold

Full size table

Table 11 Total failure count for 1500 simulated samples with size $n=1000,\,500,\,200$ from a Hybrid($0.75,\,\xi$), with $\xi =-0.2,\,0.2$, quantized with $\Delta =0.01$, and sample quantile threshold candidates $\left\{ u_i\right\} _{i=1}^{I}$, $I=10,\,20$

Full size table

As we can infer from this small study, our selection methodology reveals itself as a good tool for choosing a single threshold even when in the presence of rounded data. However, it is our view, as mentioned in the Concluding Remarks (Sect. 5), that it can be preferable as a general principle to firstly submit the raw, discretized data to an appropriate jittering process, before further extreme value analysis—this was amply studied, for example, in the Supplementary Material of Bader et al. (2018).

1.4 Appendix 4: Mean residual life plots: significant wave height data sets

See Fig. 20.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Silva Lomba, J., Fraga Alves, M.I. L-moments for automatic threshold selection in extreme value analysis. Stoch Environ Res Risk Assess 34, 465–491 (2020). https://doi.org/10.1007/s00477-020-01789-x

Download citation

Published: 23 March 2020
Issue Date: April 2020
DOI: https://doi.org/10.1007/s00477-020-01789-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

L-moments for automatic threshold selection in extreme value analysis

Abstract

Access this article

Similar content being viewed by others

Computational Study of the Adaptive Estimation of the Extreme Value Index with Probability Weighted Moments

Adaptive Choice of Thresholds and the Bootstrap Methodology: An Empirical Study

Multiple thresholds in extremal parameter estimation

References

Acknowledgements