Abstract
In extreme value analysis, sensitivity of inference to the definition of extreme event is a paramount issue. Under the peaks-over-threshold approach, this translates directly into the need of fitting a Generalized Pareto distribution to observations above a suitable level that balances bias versus variance of estimates. Selection methodologies established in the literature face recurrent challenges such as an inherent subjectivity or high computational intensity. We suggest a truly automated method for threshold detection, aiming at time efficiency and elimination of subjective judgment. Based on the well-established theory of L-moments, this versatile data-driven technique can handle batch processing of large collections of extremes data, while also presenting good performance on small samples. The technique’s performance is evaluated in a large simulation study and illustrated with significant wave height data sets from the literature. We find that it compares favorably to other state-of-the-art methods regarding the choice of threshold, associated parameter estimation and the ultimate goal of computationally efficient return level estimation.
Similar content being viewed by others
References
Alonso MA, de Zea Bermudez P, Scotto MG (2014) Comparing generalized Pareto models fitted to extreme observations: an application to the largest temperatures in Spain. Stoch Env Res Risk Assess 28:1221–1233
Bader B, Yan J, Zhang X (2018) Automated threshold selection for extreme value analysis via ordered goodness-of-fit tests with adjustment for false discovery rate. Ann Appl Stat 12(1):310–329
Bader B, Yan J (2016) eva: Extreme value analysis with goodness-of-fit testing. R package version 0.2.4. Retrieved from https://CRAN.R-project.org/package=eva
Beirlant J, Dierckx G, Guillou A, Stǎricǎ C (2002) On exponential representations of log-spacings of extreme order statistics. Extremes 5:157–180
Bernardara P, Schertzer D, Sauquet E, Tchiguirinskaia I, Lang M (2008) The flood probability distribution tail: how heavy is it? Stoch Env Res Risk Assess 22:107–122
Choulakian V, Stephens MA (2001) Goodness-of-fit tests for the generalized Pareto distribution. Technometrics 43(4):478–484
Clauset A, Shalizi CR, Newman MEJ (2009) Power-law distributions in empirical data. SIAM Rev 51:661–703
Coles S (2001) An introduction to statistical modeling of extreme values. Springer, London
Danielsson J, de Haan L, Peng L, de Vries CG (2001) Using a bootstrap method to choose the sample fraction in tail index estimation. J Multivar Anal 76:226–248
Danielsson J, Ergun LM, de Haan L, de Vries C (2016) Tail index estimation: quantile driven threshold selection. Systemic Risk Centre’s Discussion Paper Series, SRC Discussion Paper 58, pp 1–74. Retrieved from SRC: http://eprints.lse.ac.uk/id/eprint/66193
Deidda R (2010) A multiple threshold method for fitting the generalized Pareto distribution to rainfall time series. Hydrol Earth Syst Sci 14:2559–2575
Deidda R, Puliga M (2009) Performances of some parameter estimators of the generalized Pareto distribution over rounded-off samples. Phys Chem Earth 34:626–634
Ferreira A, de Haan L, Peng L (2003) On optimising the estimation of high quantiles of a probability distribution. Statistics 37(5):401–434
Greenwood JA, Landwehr JM, Matalas NC, Wallis JR (1979) Probability weighted moments: definition and relation to parameters of several distributions expressable in inverse form. Water Resour Res 15(5):1049–1054
G’Sell MG, Wager S, Chouldechova A, Tibshirani R (2016) Sequential selection procedures and false discovery rate control. J R Stat Soc B 78(2):423–444
Hosking JRM (1986) The theory of probability weighted moments. Research Report RC12210-IBM Research
Hosking JRM (2006) On the characterization of distributions by their L-moments. J Stati Plan Inference 136:193–198
Hosking JRM, Balakrishnan N (2015) A uniqueness result for L-estimators, with applications to L-moments. Stat Methodol 24:69–80
Hosking JRM, Wallis JR (1987) Parameter and quantile estimation estimation for the generalized Pareto distribution. Technometrics 29(3):339–349
Hosking JRM, Wallis JR (1997) Regional frequency analysis: an approach based on L-moments. Cambridge University Press, Cambridge
Langousis A, Mamalakis A, Puliga M, Deidda R (2016) Threshold detection for the generalized Pareto distribution: review of representative methods and application to the NOAA NCDC daily rainfall database. Water Resour Res 52:2659–2681
Lee J, Fan Y, Sisson SA (2015) Bayesian threshold selection for extremal models using measures of surprise. Comput Stat Data Anal 85:84–99
Mamalakis A, Langousis A, Deidda R, Marrocu M (2017) A parametric approach for simultaneous bias correction and high-resolution downscaling of climate model rainfall. Water Resour Res 53:2149–2170
Manurung A, Wigena AH, Djuraidah A (2018) GPD threshold estimation using measure of surprise. Int J Sci Basic Appl Res 44(3):16–25
Northrop PJ, Attalides N (2017) threshr: threshold selection and uncertainty for extreme value analysis. R package version 1.0.0. Retrieved from https://CRAN.R-project.org/package=threshr
Northrop PJ, Coleman CL (2014) Improved threshold diagnostic plots for extreme value analyses. Extremes 17:289–303
Northrop PJ, Attalides N, Jonathan P (2017) Cross-validatory extreme value threshold selection and uncertainty with application to ocean storm severity. J R Stat Soc Ser C Appl Stat 66(1):93–120
Pickands J (1975) Statistical inference using extreme order statistics. Ann Stat 3:119–131
Ribatet MA (2011) A user’s guide to the POT package (version 1.4). Retrieved from http://cran.r-project.org/package=POT
Scarrott C, MacDonald A (2012) A review of extreme value threshold estimation and uncertainty quantification. REVSTAT Stat J 10(1):33–60
Schneider LF, Krajina A, Krivobokova T (2019) Threshold selection in univariate extreme value analysis. arXiv:1903.02517
Silva Lomba J (2016) Extreme value analysis of competitive freediving records. Master’s thesis, Faculty of Sciences, University of Lisbon. http://hdl.handle.net/10451/24672
Thompson P, Cai Y, Reeve D, Stander J (2009) Automated threshold selection methods for extreme wave analysis. Coast Eng 56:1013–1021
Um M, Cho W, Heo J (2010) A comparative study of the adaptive choice of thresholds in extreme hydrologic events. Stoch Env Res Risk Assess 24(5):611–623
Wadsworth J (2016) Exploiting structure of maximum likelihood estimators for extreme value threshold selection. Technometrics 58(1):116–126
Wadsworth J, Tawn J (2012) Likelihood-based procedures for threshold diagnostics and uncertainty in extreme value modelling. J R Stat Soc Ser B (Stat Methodol) 74:543–567
Withers CS, Nadarajah S (2011) Bias-reduced estimates for skewness, kurtosis, L-skewness and L-kurtosis. J Stat Plan Inference 141:3839–3861
Acknowledgements
Partial support of the Fundação para a Ciência e a Tecnologia, I.P. under doctoral grant SFRH/BD/130764/2017 (J.S.L.) and through Project UIDB/00006/2020 (J.S.L. and M.I.F.A.) is gratefully acknowledged. We would also like to thank Dr. Kate J. E. Lee and Dr. Paul J. Northrop for kindly providing us with the computer code for the referred alternative methodologies, as well as the editor and anonymous reviewers for their insightful suggestions and remarks that helped improve this work.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendixes
Appendixes
1.1 Appendix 1: ALRSM distances for the hybrid illustrative sample
See Table 6.
1.2 Appendix 2: Complementary simulation study: GEVd data
In the interest of briefly analysing the performance of the proposed methodology against the presented alternatives in a broader non-GPd scenario, we performed a complementary simulation study to that presented in Sect. 3, using samples drawn from distributions without an exact GPd tail. For this succinct presentation, we considered two GEVd with tail weights given by two of the previously used EVIs—again, both heavy and light tail scenarios were treated. The GEV cumulative distribution function is generally given in terms of its location \(\mu \in \mathbb {R}\), scale \(\sigma >0\) and shape \(\xi \ne 0\) parameters as \({\text {GEV}}(x|\mu ,\,\sigma ,\,\xi ):= \exp \left\{ -\left( 1+\xi \,\frac{x-\mu }{\sigma }\right) ^{-\nicefrac {1}{\xi }}\right\}\), while for the case \(\xi =0\) it is taken as the limit of this expression. The remaining parameters were procured as to have equivalence, after some point, of the new distributions with the tail of the Hybrid(\(0.75,\,\xi\)) distributions previously used—this serves the purpose of comparability of results under both studies. Thus, location and scale of the GEVd were defined as functions of \(\xi\), respectively as
Note, however, that the GEVd does not carry a true threshold, as the Hybrid did, and as such assessment of selection results is more problematic: although we show the mean selected thresholds for each case in Fig. 15, given the lack of interpretability for this case, we choose to focus mainly on the selections’ performance regarding estimation of the EVI and \(p=0.001\) exceedance probability quantile.
Analogously to the scheme presented before, estimated entities were computed from a set of 1500 samples generated according to each combination of the proposed scenarios:
GEVd tail shape parameter: \(\xi =-0.2\); \(\xi =0.2\);
Sample size: \(n=1000\); \(n=500\); \(n=200\);
Sets of candidate thresholds: \(I=10\) as in (5); \(I=20\) as in (6).
Analysis of the results comprised in Figs. 15 and 16 yields very similar conclusions to the study presented in the main text (colour scheme remains as green for the ALRSM, red for the aSTSM and yellow for the SGFSM). We firstly observe, from Fig. 15, that the mean selected threshold by the ALRSM is consistently higher than that of the competing SGFSM and aSTSM for every scenario posed. In terms of proximity of estimated EVI and 99.9% quantile to their true values, the proposed methodology clearly shows superior performance, in what regards the smaller bias values depicted in Fig. 16a and closer to 1 values for the ratios shown in Fig. 16c. Furthermore, the RMSE associated with estimation of these parameters also appears to be competitive against the alternative selection methodologies, as can be assessed in Fig. 16b, d, resp., corresponding to estimation of the EVI and 99.9% quantile.
In Tables 7 and 8 we can see that the conclusions in Sect. 3 concerning the computational efficiency of the methodologies remain true when dealing with these samples: the ALRSM is, among the three considered, by far the fastest methodology to perform the complete 1500 simulations, and although it shows some failures (2.6% in the worst case), as previously mentioned, these are due to the fitting methodology used, contrary to the selection failures of the SGFSM—which is, once again, the most direct competitor to the ALRSM in terms of bias and RMSE of estimates.
Thus, general conclusions regarding validity of our automated method remain valid for these additional scenarios, even without the possibility of evaluating the threshold selection in itself.
1.3 Appendix 3: Complementary simulation study: quantized data
A known issue in estimation under the GPd is, as mentioned before, the effect of data quantization. Several authors have addressed this problem more or less extensively, from different perspectives which we can find amongst the already referred works. For example, Bader et al. (2018) attempt to “fix” the quantized data by applying a jittering procedure previous to the application of their threshold selection procedure—the objective is to approximate the data to that obtained from a continuous process, in order to use the known results for continuous distributions without further issues. On the other hand, other works such as that of Deidda (2010) choose to develop methods that can deal with quantized samples in themselves, that is, that perform their task well whether the data is discretized or not. Although it is our view that the first approach would be the most indicated to deal with the problem, in this brief simulation study we will assess the robustness of the proposed ALRSM in the face of considerably discretized data. However, we limit the investigation to rounded samples from the Hybrid distribution, for the reasons presented in Sect. 3.
For illustration purposes, consider again the Hybrid(\(0.75,\,0.2\)) sample of dimension \(n=500\) used in the examples of Sects. 2 and 3, Figs. 3 and 4 and Table 1. The data was used with the machine precision of the R software. At this point, we round the data points to 2 decimal places, and consider this to be our quantized sample, with a quantization level \(\Delta =0.01\), which we feel is an appropriate representation for a variety of real world discretized data sets (note that the real data sets used in Sect. 4 were recorded to 3 decimal places). Figure 17 is parallel to Fig. 3 of the main text: it shows the LMRD and the selection process for the same quantized sample, and as we can see the results are analogous, hinting at robustness of the ALRSM against quantization. As before, the heuristic’s distances are comprised in Table 9.
Analogously to the study in Sect. 3, we will more thoroughly evaluate performance of the methodologies in selecting the threshold and estimating the EVI and 99.9% quantile, regarding bias and RMSE of estimation, computed from the same set of 1500 Hybrid(\(0.75,\,\xi\)) samples, now subjected to rounding (to the second decimal place, as mentioned—\(\Delta =0.01\)), generated according to each combination of the proposed scenarios:
GP tail shape parameter: \(\xi =-0.2\); \(\xi =0.2\);
Sample size: \(n=1000\); \(n=500\); \(n=200\);
Sets of candidate thresholds: \(I=10\) as in (5); \(I=20\) as in (6).
Looking at Fig. 18 we can see that the ALRSM again outperforms the competitors in terms of threshold selection bias and RMSE across the considered sample sizes and number of candidates, as well as for both heavy and bounded tails. From comparison with Fig. 5 of the main text, the suggested methodology appears to be robust to quantization of data in this point, while the SGFSM seems to show the most impacted selection performance, being outdone by even the aSTSM in several cases.
Concerning EVI estimation, Fig. 19a, b shows once more close similarity to the bottom and middle panels of Fig. 6, with common underestimation of the shape parameter to the three selection methodologies. Again, both in terms of bias and RMSE, the ALRSM performs either better or comparably well to the alternatives for every considered scenario—EVI estimation through these selections does not appear here to be very sensitive to the quantization level applied.
For the heavy tail samples (\(\xi =0.2\)), estimation of the 99.9% quantile seems to have suffered some impact from the discretization, when we compare Fig. 19c with Fig. 7b. Here, the ALRSM seems to be slightly more sensitive to the rounding of data than the SGFSM. However, it still performs satisfactorily well, specially for the bounded tail case and regarding robustness against sample size and number of candidate thresholds. Estimation RMSE for this return level, seen in Fig. 19d, is also comparable between the 3 processes, across the proposed scenarios and similar to the depicted in Fig. 8b for the continuous case.
Conclusions drawn in Sect. 3 (and also Appendix 2) regarding superiority of the ALRSM, in what concerns computational efficiency, over the competing selection methodologies, remains true when considering this quantized data scenario, both in terms of total run time and number of failed simulations. Corresponding results are summarized in Tables 10 and 11.
As we can infer from this small study, our selection methodology reveals itself as a good tool for choosing a single threshold even when in the presence of rounded data. However, it is our view, as mentioned in the Concluding Remarks (Sect. 5), that it can be preferable as a general principle to firstly submit the raw, discretized data to an appropriate jittering process, before further extreme value analysis—this was amply studied, for example, in the Supplementary Material of Bader et al. (2018).
1.4 Appendix 4: Mean residual life plots: significant wave height data sets
See Fig. 20.
Rights and permissions
About this article
Cite this article
Silva Lomba, J., Fraga Alves, M.I. L-moments for automatic threshold selection in extreme value analysis. Stoch Environ Res Risk Assess 34, 465–491 (2020). https://doi.org/10.1007/s00477-020-01789-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00477-020-01789-x