1 Introduction

An observation in a time series is called an upper (lower) record if it is greater (smaller) than all previous observations in the series. A new record is therefore a remarkable event that attracts great attention in numerous applications, whether in environmental fields, economics, sports, physics or biology (see, e.g., Wergen 2013, and references therein). Particularly interesting is the study of record events in environmental sciences and their connection with climate change. For example, Benestad (2004) compared the observed and expected number of records under stationarity by means of a \(\chi ^2\)-test and graphical tools, while Coumou et al. (2013) and Lehmann et al. (2015) found a global increase in temperature and precipitation record-breaking events, respectively, with respect to a stationary climate. Beyond these many applications, the theoretical foundations of the theory of records can be found in the monographs by Arnold et al. (1998) and Nevzorov (2001).

An aspect of interest is the study of the evolution of the number of records over time, in particular the identification of changes in their behavior. To analyze this type of change, changepoint detection methods that make use of the record occurrence should be considered.

The changepoint problem consists of identifying times when the probability distribution of a time series changes. In general, the problem concerns both detecting whether or not a change has occurred and identifying its time of occurrence. Although several changes might be considered, our work resides in the at most one changepoint (AMOC) domain. The first results on changepoint detection date back to Page (1954, 1955), who introduced a cumulative sum (CUSUM) statistic to locate a shift in the mean of independent and identically distributed (IID) normal random variables (RVs). Since then, several methods have been proposed, many of which can be found in the monographs by Brodsky and Darkhovsky (1993) and Csörgő and Horváth (1997). Changepoint detection techniques are particularly important in climatology (Reeves et al. 2007), but also in fields as diverse as economics and speech processing.

Traditional changepoint detection methods attempt to find changes in location or scale; more recently, changepoint detection in extreme values has also become an active area of research. For example, Dierckx and Teugels (2010) introduced likelihood-based tests to detect changes in the parameters of the generalized Pareto distribution for models of excesses over a threshold, Kojadinovic and Naveau (2017) studied several tests for independent samples of block maxima, and e Silva et al. (2020) proposed a changepoint model for the r-largest order statistics. Ratnasingam and Ning (2021) proposed procedures based on the modified information criterion and the confidence distribution for detecting changepoints in the three-parameter Weibull distribution. Non-homogeneous Poisson processes have also been considered to study changepoints in the occurrence of peaks over a threshold (Achcar et al. 2010, 2016; Rodrigues et al. 2019). To the best of our knowledge, there is no changepoint detection method based on the breaking of records. There are, however, tests for trend detection based on the breaking of records. Foster and Stuart (1954) proposed two simple statistics based on the number of records to test the hypothesis that T observations have been drawn independently from the same continuous distribution. These tests were later improved by Diersen and Trenkler (1996), and more recently new tests and graphical tools were introduced by Cebrián et al. (2022).

The aim of this paper is to develop changepoint detection tests based on the record occurrence to detect changes in the tails of the distribution. The first use of the tests introduced in this paper is to detect changes in the record occurrence and therefore in the extreme values; however, they are also useful against other types of change, such as a change in location or scale. When there is a gradual change in location or scale, it will generally take time for that change to be significantly reflected in the behavior of the number of records, so the second use of the proposed methodology lies in analyzing how long a series takes, from when a changepoint is detected using another method (see, e.g., Pettitt 1979, for a change in location), until that change is reflected in the observed records. Beyond its theoretical and descriptive interest, the third use of these changepoint detection tests based on records is that they are uniquely appropriate whenever the original data are not available but the records are.

The proposed tests make use of CUSUM-type statistics based on the record indicator RVs. The functional central limit theorem for independent but nonidentically distributed RVs is used to show that the functional evolution of the number of records, adequately standardized, behaves asymptotically as a Wiener process and, as a consequence, the CUSUM-type statistics follow the Kolmogorov distribution. This characterization allows us to obtain exact p-values for the tests. The use of weights in the statistics can improve the power of the tests under certain scenarios. However, we prove that the weighted statistics do not share the asymptotic properties of the unweighted ones, so their p-values must be calculated using Monte Carlo techniques. An approach to analyze series with a seasonal component or serial correlation is also proposed. The statistics based on the record indicators allow the extreme values of the distribution to be studied without specifying an underlying distribution for the data, i.e., they are distribution-free. In addition, the assumptions on the variance of the data required by other CUSUM-type statistics are avoided here.

The rest of the paper is organized as follows. Section 2 introduces our records statistics, establishes their asymptotic distribution under the null hypothesis and proposes some generalizations. Section 3 compares these tests under various scenarios by means of Monte Carlo simulations. An application to temperature data is presented in Sect. 4, and Sect. 5 concludes the paper with final comments, conclusions and future work.

Finally, note that the proposed tests for changepoint detection are available from the R (R Core Team 2021) package RecordTest (Castillo-Mateo 2021).

2 Tests based on theory of records

Let \(X_1, \ldots , X_T\) be a sequence of IID continuous RVs. The sequences of upper and lower record indicators, \((I_t)\) and \((I_t^L)\), are defined by \(I_1 = I_1^L = 1\) and for \(t=2,\ldots ,T\), by

$$\begin{aligned} I_t = {\left\{ \begin{array}{ll} 1 & \text {if } X_t > \max \{X_1, \ldots , X_{t-1}\}, \\ 0 & \text {otherwise}, \end{array}\right. } \quad I_t^L = {\left\{ \begin{array}{ll} 1 & \text {if } X_t < \min \{X_1, \ldots , X_{t-1}\}, \\ 0 & \text {otherwise}. \end{array}\right. } \end{aligned}$$

The sequence of differences in the upper and lower record indicators, \((d_t)\), is given by \(d_t = I_t - I_t^L\), while the sequence of sums, \((s_t)\), is given by \(s_t = I_t + I_t^L\).
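To fix notation computationally, these sequences can be obtained in a few lines of R, the language of the accompanying RecordTest package. The following is a minimal sketch with function names of our own choosing, not the package's API:

```r
# Record indicator sequences for a numeric series x; under a continuous
# distribution ties occur with probability zero, so equality with the
# running maximum/minimum identifies a strict record.
record_indicators <- function(x) {
  I  <- as.integer(x == cummax(x))  # upper record indicators I_t
  IL <- as.integer(x == cummin(x))  # lower record indicators I_t^L
  list(I = I, IL = IL, d = I - IL, s = I + IL)
}

set.seed(1)
record_indicators(rnorm(10))$I  # I_1 = 1 always; later records become rarer
```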

The following lemma is a well-known distribution-free result within the theory of records that characterizes the distribution of the record indicators; it is equally valid for upper and lower records (Arnold et al. 1998; Nevzorov 2001).

Lemma 2.1

Let \(X_1, \ldots , X_T\) be a sequence of IID continuous RVs. Then, the record indicators \(I_1, \ldots , I_T\) are independent and

$$\begin{aligned} p_t = P(I_t = 1) = \frac{1}{t}, \quad t = 1,\ldots ,T. \end{aligned}$$

It is easily checked that the expectations and variances, for \(t = 2,\ldots ,T\), are

$$\begin{aligned}&E(I_t) = \frac{1}{t}, \quad Var(I_t) = \frac{1}{t} \left( 1-\frac{1}{t} \right) , \\&E(d_t) = 0, \quad Var(d_t) = \frac{2}{t}, \\&E(s_t) = \frac{2}{t}, \quad Var(s_t) = \frac{2}{t} \left( 1-\frac{2}{t} \right) . \end{aligned}$$
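These moments are easy to check by simulation; a quick sketch reusing record_indicators() from above (any continuous parent distribution gives the same result):

```r
# Monte Carlo check of Lemma 2.1: the empirical record rates should be
# close to 1/t, whatever the continuous parent distribution.
set.seed(42)
T <- 10
I_mat <- replicate(1e4, record_indicators(rnorm(T))$I)  # T x 10000 matrix
round(rowMeans(I_mat), 3)  # compare with:
round(1 / (1:T), 3)
```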

Denoting by \(p_t\) the probability of an upper (or lower) record at time t, our aim is to construct asymptotic tests with null hypothesis

$$\begin{aligned} {\mathcal {H}}_0: p_t = 1 / t, \quad 1 \le t \le T, \end{aligned}$$

against the two-sided alternative hypothesis given by

$$\begin{aligned} {\mathcal {H}}_1: p_t = 1 / t, \quad 1 \le t \le t_0 \quad \text {and} \quad p_t \ne 1 / t, \quad t_0 < t \le T, \end{aligned}$$
(2.1)

where \(t_0\) denotes the time of a possible change in the probabilities of observing new records with respect to the stationary case. The alternative hypothesis covers many nonstationary scenarios, for example a shift or a drift in location or scale, or in one or both tails.

2.1 Tests based on asymptotic results

Since the exact distribution of a changepoint statistic is usually impractical for obtaining p-values, asymptotic results are generally preferable. Wiener processes, Brownian bridges and other Gaussian processes arise as asymptotic distributions in many limit problems, providing exact tail probabilities. Our first objective is to build from the indicators above a random function \(W_T(\nu )\), for \(\nu \in [0,1]\), in such a way that \(W_T(\nu )\) converges in distribution to a Wiener process. For this purpose, we define the standardized record indicators \(\xi _{T1},\ldots ,\xi _{TT}\) as

$$\begin{aligned} \xi _{Tt} = \frac{I_t - E(I_t)}{\sigma _T}, \end{aligned}$$
(2.2)

where \(\sigma _t^{2} = \sum _{k=1}^{t} Var(I_k)\). We also define the standardized number of records \(S_{Tt} = \sum _{k=1}^{t} \xi _{Tk}\), its variance \(\nu _{Tt} = \sum _{k=1}^{t} Var(\xi _{Tk}) = \sigma _t^2 / \sigma _T^2\), and finally the random function

$$\begin{aligned} W_T(\nu ) = S_{Tt} + \xi _{T,t+1} \frac{\nu - \nu _{Tt}}{\nu _{T,t+1} - \nu _{Tt}} \end{aligned}$$
(2.3)

for \(\nu \in [\nu _{Tt}, \nu _{T,t+1}]\). Note that \(S_{T1}=0\) and \(\nu _{T1}=0\) (since \(I_1 = 1\) almost surely, so \(Var(I_1) = 0\)), and \(\nu _{TT}=1\). The function \(W_T(\nu )\) is a random broken line connecting the points in the plane with coordinates \((\nu _{Tt},S_{Tt})\) for \(t=1,\ldots ,T\).

One of our major results is the asymptotic characterization of the functional evolution of the standardized number of records, \(W_T(\nu )\), as a Wiener process. The result is essentially a consequence of the functional central limit theorem for independent but nonidentically distributed RVs (see, e.g., Gikhman and Skorokhod 1969). To be under the conditions of the theorem, Lindeberg’s condition needs to be proved for the variables \(\xi _{Tt}\) in (2.2), which follows immediately from

$$\begin{aligned} \lim _{T \rightarrow \infty } \sum _{t=1}^T E\left( \xi _{Tt}^2 \times \mathbf{1 }_{\{|\xi _{Tt}|> \varepsilon \}} \right) \le \lim _{T \rightarrow \infty } \mathbf{1 }_{\{1/\sigma _T > \varepsilon \}} = 0 \end{aligned}$$

for all \(\varepsilon > 0\), where \(\mathbf{1 }_{\{\cdot \}}\) is the indicator function. The inequality holds because \(|\xi _{Tt}| \le 1/\sigma _T\) and \(\sum _{t=1}^{T} E(\xi _{Tt}^2) = 1\), and the limit is zero since \(\sigma _T \rightarrow \infty \) as \(T \rightarrow \infty \).

Theorem 2.1

Let \(X_1,\ldots ,X_T\) be a sequence of IID continuous RVs with \(W_T(\nu )\) in (2.3). Then, as \(T \rightarrow \infty ,\)

$$\begin{aligned} W_T(\nu ) \overset{{\mathcal {D}}}{\longrightarrow }W(\nu ), \quad \nu \in [0,1], \end{aligned}$$

in the metric space \({\mathcal {C}}[0,1],\) where \(W(\nu )\) is a standard Wiener process.

Thus, the proposed changepoint records statistic is

$$\begin{aligned} K_T = \max _{1\le t \le T} |B_T(\nu _{Tt})|, \end{aligned}$$
(2.4)

where \(B_T(\nu ) = W_T(\nu ) - \nu W_T(1)\), \(\nu \in [0,1]\). The time t where (2.4) takes its maximum is the changepoint estimate \({\hat{t}}_0\). As a consequence of Theorem 2.1, \(B_T(\nu )\) is asymptotically distributed as a standard Brownian bridge process. Moreover, the distribution of the supremum of the absolute value of a Brownian bridge is known as the Kolmogorov distribution. As \(\sup _{0 \le \nu \le 1} |f(\nu ) - \nu f(1)|\) is a continuous functional for f in \({\mathcal {C}}[0,1]\), the asymptotic characterization under the null hypothesis of the statistic \(K_T\) is as follows.

Theorem 2.2

Let \(X_1,\ldots ,X_T\) be a sequence of IID continuous RVs with \(K_T\) in (2.4). Then, as \(T \rightarrow \infty ,\)

$$\begin{aligned} K_T \overset{{\mathcal {D}}}{\longrightarrow }K = \sup _{0 \le \nu \le 1} |B(\nu )|, \end{aligned}$$

where \(B(\nu )\) is a standard Brownian bridge process and K is a Kolmogorov distributed RV.

The null hypothesis is rejected when \(K_T\) is too large to be explained by chance variation. In particular, if the alternative hypothesis in (2.1) is true for some time \(t_0\), then \(|B_T(\nu _{Tt_0})|\) tends to be large, providing statistical evidence that a change occurred at time \(t_0\). Under the null hypothesis, the p-value of the two-sided test can be calculated from either of the following expressions of the Kolmogorov distribution

$$\begin{aligned} P(K \ge x)= & {} 2 \sum _{k=1}^{\infty } (-1)^{k-1} \exp \left\{ -2(kx)^2\right\} \\= & {} 1 - \frac{\sqrt{2 \pi }}{x} \sum _{k=1}^{\infty } \exp \left\{ -\left( \frac{(2k-1) \pi }{2\sqrt{2} x}\right) ^2\right\} . \end{aligned}$$
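As an illustration, (2.2)-(2.4) and the first series above translate directly into a few lines of R. This is our own sketch under the null moments of Lemma 2.1, not the RecordTest implementation:

```r
# CUSUM-type records statistic K_T in (2.4) from upper record indicators,
# with its changepoint estimate.
records_cusum <- function(x) {
  T <- length(x)
  I <- as.integer(x == cummax(x))        # upper record indicators
  p <- 1 / seq_len(T)                    # E(I_t) = 1/t under H0 (Lemma 2.1)
  sigma2 <- cumsum(p * (1 - p))          # sigma_t^2 = sum of Var(I_k)
  S  <- cumsum(I - p) / sqrt(sigma2[T])  # standardized number of records S_Tt
  nu <- sigma2 / sigma2[T]               # nu_Tt = sigma_t^2 / sigma_T^2
  B  <- S - nu * S[T]                    # B_T(nu_Tt) = W_T(nu_Tt) - nu_Tt W_T(1)
  list(K = max(abs(B)), t0.hat = which.max(abs(B)))
}

# P(K >= x) for the Kolmogorov distribution, truncating the first series
# above; a handful of terms is enough for double precision.
pvalue_kolmogorov <- function(x, terms = 100) {
  if (x <= 0) return(1)
  k <- seq_len(terms)
  min(1, 2 * sum((-1)^(k - 1) * exp(-2 * (k * x)^2)))
}

set.seed(1)
res <- records_cusum(rnorm(100))  # IID series, so H0 holds
pvalue_kolmogorov(res$K)          # a large p-value is expected here
```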

To give a clear interpretation of \(K_T\), we define \(N_t = I_1 + \cdots + I_t\), the number of records up to time t, and \(N_{t_1:t_2} = I_{t_1} + \cdots + I_{t_2}\), the number of records between times \(t_1 \le t_2\). Then, \(B_T(\nu _{Tt})\) can be rewritten as

$$\begin{aligned} B_T(\nu _{Tt}) = \frac{1}{\sqrt{Var(N_T)}} \left( (N_t - E(N_t)) - \frac{Var(N_t)}{Var(N_T)} (N_T - E(N_T)) \right) . \end{aligned}$$

Weighting for differences in the effective sample sizes of the number of records in the two segments \(\{1,\ldots ,t\}\) and \(\{t+1,\ldots ,T\}\), \(B_T(\nu _{Tt})\) can be viewed as a scaled difference between \(Var(N_t)^{-1} (N_t - E(N_t))\) and \(Var(N_{(t+1):T})^{-1} (N_{(t+1):T} - E(N_{(t+1):T}))\). Consequently, \(K_T\) compares the number of records in both segments for every t and assigns as estimator, \({\hat{t}}_0\), the point that separates the segment that deviates the most from the null hypothesis. The mean is \(E(B_T(\nu _{Tt})) = 0\) and a simple calculation leads to \(Var(B_T(\nu _{Tt})) = \nu _{Tt} (1 - \nu _{Tt})\). The nonuniform variance, small near the ends of \(\{1,\ldots ,T\}\), makes changepoints occurring near the beginning or the end of the series more difficult to detect (see “Appendix A” for further details). This is common to CUSUM-type statistics.

The proposed statistic only uses information from one tail of the distribution: the right tail if upper records are used or the left tail if lower records are used. To study both tails and collect more evidence with a single statistic, it is enough to consider the variables \(d_t\) and \(s_t\). Since the \(d_t\)'s and \(s_t\)'s also fulfill Lindeberg's condition, all the previous results remain valid after substituting \(\xi _{Tt}\) in (2.2) by \(\xi _{Tt} = d_t / \sigma _T\) with \(\sigma _t^2 = \sum _{k=1}^{t} Var(d_k)\), \(t=1,\ldots ,T\); or, respectively, \(\xi _{Tt} = (s_t - E(s_t)) / \sigma _T\) with \(\sigma _t^2 = \sum _{k=1}^{t} Var(s_k)\), \(t=1,\ldots ,T\). The statistic (2.4) based on \(d_t\) can be used when an increase in upper records and a decrease in lower records are expected with respect to the null hypothesis, while the statistic based on \(s_t\) can be used when an increase in both types of records is expected. In particular, the statistic based on \(d_t\) can be useful against the alternative hypothesis of a trend in location, while the statistic based on \(s_t\) can be useful against a trend in variability.
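For instance, the d-based variant of the sketch above only changes the indicator sequence and its null moments (again our own illustrative code):

```r
# d-based variant: d_t = I_t - I_t^L with E(d_t) = 0 and Var(d_t) = 2/t
# for t >= 2; d_1 = 0 deterministically, so Var(d_1) = 0.
records_cusum_d <- function(x) {
  T <- length(x)
  d <- as.integer(x == cummax(x)) - as.integer(x == cummin(x))
  v <- 2 / seq_len(T); v[1] <- 0       # Var(d_t) under H0
  sigma2 <- cumsum(v)
  S  <- cumsum(d) / sqrt(sigma2[T])    # E(d_t) = 0, so no centering needed
  nu <- sigma2 / sigma2[T]
  B  <- S - nu * S[T]
  list(K = max(abs(B)), t0.hat = which.max(abs(B)))
}
```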

2.2 Tests with weighted statistics

Under the null hypothesis, the probability of a record decreases as the series evolves. To give more importance to the most recent records, and thus increase the power of the tests, we propose giving increasing weights \(\omega _t\) to the record indicators according to their position in the series, as

$$\begin{aligned} \xi _{Tt}^{\omega } = \omega _t \frac{ I_t -E(I_t)}{\sigma _T}, \end{aligned}$$
(2.5)

where \(\sigma _t^2 = \sum _{k=1}^t \omega _k^2 Var(I_k)\), \(t = 1,\ldots ,T\). According to Proposition 2.1 (proved in “Appendix B”), these variables do not in general have an asymptotically normal sum, so asymptotic results such as those of Theorem 2.1 are not available.

Proposition 2.1

Let \(X_1,\ldots ,X_T\) be a sequence of IID continuous RVs with the sequence of RVs \(\xi _{Tt}^\omega \) in (2.5) and \(\omega _t \sim t^n\) as \(t \rightarrow \infty \). If \(n > 0,\) then the central limit theorem does not hold for the \(\xi _{Tt}^\omega \)’s.

Nevertheless, a \(K_T\)-type statistic as in (2.4) can be defined from the weighted variables, whose distribution under the null hypothesis can be simulated by means of Monte Carlo techniques.

In this work we consider two different weights. First, linear weights \(\omega _t = t-1\) (see Diersen and Trenkler 1996, for a detailed explanation). Second, weights that make the discrete sequence of times of the process, \(\nu _{Tt} = \sigma _t^2 / \sigma _T^2\), \(t=1,\ldots ,T\), equally spaced, that is, weights proportional to the inverse of the standard deviation (SD) of \(I_t\): \(\omega _1 = 0\) and \(\omega _t = Var(I_t)^{-1/2} = t / \sqrt{t-1}\) for \(t=2,\ldots ,T\). These weights make the variance of \(B_T(\nu _{Tt})\) symmetric in \(\{1,\ldots ,T\}\) (see “Appendix A”).

As above, the statistic has been defined in terms of the \(I_t\)'s, but it is equivalent for the \(d_t\)'s or \(s_t\)'s. In these cases, the weights making the observed times of the process equally spaced are proportional to the inverse of the SD: \(\omega _1 = 0\) and \(\omega _t = \sqrt{t}\) for \(t=2,\ldots ,T\), for the statistic based on \(d_t\); and \(\omega _1 = \omega _2 = 0\) and \(\omega _t = t / \sqrt{t-2}\) for \(t=3,\ldots ,T\), for the statistic based on \(s_t\).
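Because the weighted statistics lack the Kolmogorov limit, their p-values can be simulated directly from Lemma 2.1: under \({\mathcal {H}}_0\), the \(I_t\)'s are independent Bernoulli(1/t), whatever the parent distribution. A sketch of this Monte Carlo scheme with the inverse-SD weights (our own code, not the package's):

```r
# Weighted K_T-type statistic computed from a 0/1 indicator sequence.
weighted_cusum <- function(I, w) {
  T <- length(I)
  p <- 1 / seq_len(T)
  sigma2 <- cumsum(w^2 * p * (1 - p))
  S  <- cumsum(w * (I - p)) / sqrt(sigma2[T])
  nu <- sigma2 / sigma2[T]
  max(abs(S - nu * S[T]))
}

# Monte Carlo p-value: simulate the null by drawing I_t ~ Bernoulli(1/t).
mc_pvalue <- function(x, w, B = 1000) {
  T <- length(x)
  K.obs  <- weighted_cusum(as.integer(x == cummax(x)), w)
  K.null <- replicate(B, weighted_cusum(rbinom(T, 1, 1 / seq_len(T)), w))
  mean(K.null >= K.obs)
}

set.seed(1)
T <- 100
w <- c(0, (2:T) / sqrt((2:T) - 1))  # weights prop. to inverse SD of I_t
mc_pvalue(rnorm(T), w)
```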

2.3 Tests for seasonal series

Hirsch et al. (1982) introduced a seasonal version of tests of randomness based on ranks. Following their ideas, we propose tests that are insensitive to the existence of seasonality and serial correlation. If the time series of interest consists of daily (or monthly) data, the null hypothesis of randomness where all the observations come from the same continuous distribution may be too restrictive. For example, most series of daily temperature or precipitation show a very strong seasonality and serial correlation. Let \(\mathbf{X } = (\mathbf{X }_1,\ldots ,\mathbf{X }_M)\) be a sequence of series where \(\mathbf{X }_m = (X_{1m}, \ldots , X_{Tm})'\) is a series of RVs. That is, \(\mathbf{X }\) is the entire series, made up of subseries \(\mathbf{X }_1\) through \(\mathbf{X }_M\) (one for each day), and each subseries \(\mathbf{X }_m\) contains the annual values from day m, for \(m=1,\ldots ,M\). For the development below the M subseries must be independent, so in practice a subset of independent subseries will be used; the notation is maintained for simplicity. Then, we define the tth upper record indicator for the mth subseries as \(I_{tm} = 1\) if \(X_{tm} > \max \{X_{1m},\ldots ,X_{t-1,m}\}\) and \(I_{tm} = 0\) otherwise; analogously for lower records. That is, records are computed independently for each subseries, and the null hypothesis is relaxed by allowing observations of different subseries not to come from the same distribution. To define a \(K_T\)-type statistic that joins the information of all the subseries, we simply take the \(\xi _{Tt}\)'s in (2.2) as

$$\begin{aligned} \xi _{Tt}^{\omega } = \omega _t \frac{\frac{1}{M}\sum _{m=1}^{M} I_{tm} - E(I_{t})}{\sigma _T}, \end{aligned}$$

where \(\sigma _t^2 = \sum _{k=1}^{t} \omega _k^2 Var(I_k) / M\); or their respective versions based on \(d_t\) or \(s_t\). Thus, the alternative hypothesis is that of (2.1) with a common changepoint \(t_0\) for all the subseries. This approach not only allows the analysis of series with a seasonal component, it also joins the information from several series, so the number of records, and therefore the information used by the tests, is greater.
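Computationally, the extension only requires averaging the per-subseries indicators; a sketch assuming the columns of a \(T \times M\) matrix hold the independent subseries:

```r
# Seasonal K_T-type statistic: record indicators are computed per column
# (subseries) and averaged across the M columns, as in the display above.
seasonal_cusum <- function(X, w = rep(1, nrow(X))) {
  T <- nrow(X); M <- ncol(X)
  I.bar <- rowMeans(apply(X, 2, function(x) as.integer(x == cummax(x))))
  p <- 1 / seq_len(T)
  sigma2 <- cumsum(w^2 * p * (1 - p) / M)  # variance of the averaged indicators
  S  <- cumsum(w * (I.bar - p)) / sqrt(sigma2[T])
  nu <- sigma2 / sigma2[T]
  B  <- S - nu * S[T]
  list(K = max(abs(B)), t0.hat = which.max(abs(B)))
}
```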

3 Monte Carlo experiments

We investigate the empirical size, power and changepoint estimate of the changepoint tests based on the records statistics introduced in Sect. 2. Nine records statistics are considered: \(N \equiv K_T\) in (2.4) with \(\xi _{Tt}\) in (2.2); d and \(s \equiv K_T\) in (2.4) substituting \(I_t\) in (2.2) by \(d_t\) and \(s_t\), respectively; the previous statistics with weights proportional to the inverse of the SD of \(I_t\), \(d_t\) and \(s_t\), respectively (superscript var); and with linear weights \(t-1\) (superscript linear). Thus, three types of records statistics are analyzed: we refer to the statistic N and its weighted versions as N-type statistics, and equivalently for d and s. Recall that, under the null hypothesis, the statistics N, d and s are asymptotically Kolmogorov distributed, while the weighted statistics require Monte Carlo simulations to estimate their distribution (1000 replicates are considered).

3.1 Analysis of size

We simulate 10,000 replicates of M independent series formed by T independent samples from the standard normal distribution, i.e.,

$$\begin{aligned} Y_{tm} = \epsilon _{tm} \sim N(0,1), \quad \text {for } t = 1,\ldots ,T \quad \text {and}\quad m = 1,\ldots ,M. \end{aligned}$$

The size results generalize to any other continuous distribution given the distribution-free property of the tests under the null hypothesis. The size of the tests is simulated for the combinations of values \(T = 50,100\) and \(M = 1,12,36\), and for a long series with \(T=500\), \(M=1\).

Table 1 reports the empirical size of the changepoint tests based on the records statistics N, d and s for nominal values \(\alpha =0.01,0.05,0.10\); i.e., we count how often the records statistics exceed the 99th, 95th and 90th percentiles of the Kolmogorov distribution. We do not show the rejection frequencies of the tests based on the weighted statistics since their size is assured by simulating their p-value under the null hypothesis. All tests show an acceptable size for the levels \(\alpha \) considered. Most of the tests are conservative, but their size approaches the nominal values as T increases. When M is greater than 1, the size of the statistics is considerably less than the nominal value. The size of d is particularly low, implying that these tests are very conservative.

For conservative tests, Fisher and Robbins (2019) proposed a general method to obtain a size closer to the nominal value and thereby increase the power of the tests. For our proposed tests, the method simply consists of replacing the \(K_T\)-type statistic by \(-\sqrt{T} \log (1 - K_T/\sqrt{T})\). Although we do not apply this method in the present paper, it may be worth considering in applications with weak evidence, since the power can increase while maintaining a proper size.
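In code, the adjustment is a one-line transformation of the statistic (a sketch only; as noted, we do not apply it in this paper):

```r
# Fisher-Robbins style adjustment for conservative tests: the transformed
# statistic is then compared with the same critical values as K_T.
adjust_K <- function(K, T) -sqrt(T) * log(1 - K / sqrt(T))
```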

Table 1 Empirical size of the tests at levels \(\alpha =0.01,0.05,0.10\)

3.2 Analysis of power

The power analysis consists of 10,000 simulations of M independent series with T observations following two scenarios under the alternative hypothesis.

Scenario A. Linear drift model in the mean:

$$\begin{aligned} Y_{tm} = \mu _t + \epsilon _{tm}, \quad \text {for } t = 1,\ldots ,T \quad \text {and}\quad m = 1,\ldots ,M, \end{aligned}$$

where \(\epsilon _{tm} \sim N(0,1)\), and \(\mu _t = 0\) if \(1 \le t \le t_0\) and \(\mu _t = \theta (t - t_0)\) if \(t_0 < t \le T\).

Scenario B. Linear drift model in the SD:

$$\begin{aligned} Y_{tm} = \sigma _t \epsilon _{tm}, \quad \text {for } t = 1,\ldots ,T\quad \text {and}\quad m = 1,\ldots ,M, \end{aligned}$$

where \(\epsilon _{tm} \sim N(0,1)\), and \(\sigma _t = 1\) if \(1 \le t \le t_0\) and \(\sigma _t = 1 + \theta (t - t_0)\) if \(t_0 < t \le T\).

We report results for \(T=100\), \(M = 1,12,36\), \(t_0 = 25,50,75\) and the drift term \(\theta = -0.10,-0.09,\ldots ,-0.02,-0.01,-0.005,0.005,0.01,0.02,\ldots ,0.09,0.10\) for Scenario A and \(\theta = 0.005,0.01,0.02,\ldots ,0.09,0.10\) for Scenario B. N-type statistics are analyzed against both scenarios, d-type statistics against Scenario A and s-type statistics against Scenario B.
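For reproducibility, both scenarios can be generated as follows (an illustrative sketch; function and variable names are ours):

```r
# Scenario A: linear drift in the mean after t0; Scenario B: in the SD.
simulate_scenario <- function(T, M, t0, theta, scenario = c("A", "B")) {
  scenario <- match.arg(scenario)
  drift <- pmax(seq_len(T) - t0, 0) * theta  # 0 up to t0, linear afterwards
  eps <- matrix(rnorm(T * M), T, M)          # N(0,1) errors, T x M
  if (scenario == "A") drift + eps else (1 + drift) * eps
}

set.seed(1)
Y <- simulate_scenario(T = 100, M = 12, t0 = 50, theta = 0.05, "A")
```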

Figures 1 and 2 show, for \(\alpha = 0.05\), plots of the power of the tests versus the trend \(\theta \) for Scenarios A and B, respectively. We make the following observations:

(1) All tests increase their power as the magnitude of the drift \(\theta \) or the number of series M increases. In Scenario A, d-type statistics behave symmetrically with respect to a vertical line at \(\theta =0\), but when the drift is negative, N-type statistics have a power close to the nominal value unless M is large. This phenomenon is due to the fact that the greatest effect a negative trend can cause is that only one record is observed in each series, and under the null hypothesis a single record is likely in a small number of series but unlikely in many series. Finally, note that the power of tests with upper records against a positive drift is equivalent to that of tests with lower records against a negative drift.

(2) The power of the statistics according to the position of the changepoint depends on the type of weight used. The tests have a lower power for a changepoint \(t_0\) close to the end of the series, since the accumulated trend is smaller. The unweighted statistics have a higher power when the changepoint is at the beginning of the series and lose power as it approaches the middle and especially the end of the series. The statistics with weights proportional to the inverse of the SD, in Scenario A, maintain the same power when the changepoint is in the first half of the series and lose power if the changepoint is at the end; in Scenario B, they have a higher power when the changepoint is in the middle of the series. The statistics with linear weights have a higher power when the changepoint is in the middle or at the end of the series than at the beginning.

(3) For positive drifts, comparing statistics with the same type of weight: in Scenario A, N-type statistics have a higher power than d-type for low M, but when M is large this difference decreases and d-type statistics have an equal or higher power than N-type. In Scenario B, s-type statistics have a higher power than N-type.

(4) The statistics with weights proportional to the inverse of the SD turn out to have the best overall performance, with the most balanced behavior. The statistics without weights have the highest power when the changepoint is at the beginning of the series, the statistics with weights proportional to the inverse of the SD when the changepoint is in the middle of the series, and the statistics with linear weights when the changepoint is at the end of the series. While the second group shows a power close to the best in each case, the first and third have considerably less power than the others when they are not the most powerful.

(5) Some cases in which the records tests reach a power between 0.85 and 1 for \(T=100\) are the following. Under Scenario A, when \(t_0=25\), we would detect \(\theta =0.05\) with \(M=1\) for the statistic N; \(\theta =0.02\) with \(M=12\) for all statistics except those with linear weights; or \(\theta =0.01\) with \(M=36\) for d. When \(t_0=50\), we would detect \(\theta =0.10\) with \(M=1\) for all statistics, or \(\theta =0.03\) with \(M=12\). Under Scenario B, when \(t_0=25\), we would detect \(\theta =0.04\) with \(M=1\) for the statistic s; \(\theta =0.01\) with \(M=12\); or \(\theta =0.005\) with \(M=36\) for the statistics s and \(s^{var}\). When \(t_0=50\), we would detect \(\theta =0.05\) with \(M=1\), or \(\theta =0.01\) with \(M=12\), for the statistics s and \(s^{var}\).

Fig. 1 Power functions of N and d-type statistics for Scenario A (\(T = 100\))

Fig. 2 Power functions of N and s-type statistics for Scenario B (\(T = 100\))

3.3 Analysis of changepoint estimation

The analysis of the changepoint estimation reports results for Scenarios A and B considered in Sect. 3.2 for \(T=100\), \(M=1\) and \(\theta = 0.10\), and for \(T=100\), \(M=36\) and \(\theta = 0.05\), both for a wide range of changepoints \(t_0 = 10,20,\ldots ,80,90\).

Figures 3 and 4 show boxplots of the estimated changepoint for Scenarios A and B, respectively. We highlight the following conclusions:

(1) As anticipated in Sect. 2.1, the nonuniform variance in CUSUM-type statistics means that changepoints occurring near the data boundaries are more difficult to detect; hence, the tests have trouble detecting changes occurring away from the middle of the series. This effect is reduced as the number of series M or the magnitude of the drift \(\theta \) increases.

(2) Comparing statistics with the same type of weight: in Scenario A, N-type statistics place the changepoint slightly better than d-type; in Scenario B, s-type statistics place the changepoint better than N-type.

(3) The performance of the changepoint estimates depends on the type of weight used. The statistics without weights properly place the changepoint when it is at the beginning or in the middle of the series, but not at the end. The statistics with weights proportional to the inverse of the SD properly place the changepoint when it is not at the beginning or the end of the series. The statistics with linear weights place the majority of changepoints in the second half of the series, so their estimate is not reliable for practical use, although this effect is reduced by increasing M.

These changepoint detection tests based on the breaking of records only make use of the record occurrence to determine the changepoint estimate. For that reason, the estimated changepoint will usually be placed just before a record time, i.e., the effect of the drift is not immediately reflected in the observed record occurrence. This means that a proper estimate of the changepoint in the record occurrence will often be placed later in the series than the actual changepoint in the mean or variance. Thus, the main question is whether the correctly detected, but possibly displaced, changepoints are clustered near the actual value or not.

Fig. 3 Boxplots without outliers of the estimated changepoint versus the actual changepoint of N and d-type statistics for Scenario A (\(T=100\)). Crosses represent the actual changepoint

Fig. 4 Boxplots without outliers of the estimated changepoint versus the actual changepoint of N and s-type statistics for Scenario B (\(T=100\)). Crosses represent the actual changepoint

This section has been useful to illustrate the behavior of the tests against usual alternative hypotheses. Other scenarios could be considered, e.g., (C) a shift model in the mean, i.e., \(\mu _t = \theta \) if \(t_0 < t \le T\) under Scenario A; or (D) a mixture model with a drift in the right tail, i.e., \(Y_{tm} = \epsilon ^{(0)}_{tm}\) if \(u_{tm} \le \tau \) and \(Y_{tm} = \mu _t + \epsilon ^{(1)}_{tm}\) if \(u_{tm} > \tau \), where \(u_{tm} \sim U(0,1)\), \(\tau \) is a high quantile order (e.g., \(\tau = 0.95\)), \(\mu _t\) is as under Scenario A, and \(\epsilon ^{(0)}_{tm}\) and \(\epsilon ^{(1)}_{tm}\) are N(0, 1) truncated to \((-\infty ,\Phi ^{-1}(\tau ))\) and \((\Phi ^{-1}(\tau ),\infty )\), respectively. Preliminary analyses show that the tests perform poorly against Scenario C, but have great power against Scenario D, even outperforming commonly used changepoint detection tests (e.g., the Pettitt test).
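Scenario D is simple to simulate: \(u_{tm}\) both selects the mixture component and, through \(\Phi ^{-1}(u_{tm})\), provides the corresponding truncated normal draw, so the drift only needs to be added above the \(\tau \) quantile. A sketch under these assumptions:

```r
# Scenario D: qnorm(u) is a N(0,1) draw whose tail membership is decided
# by u itself, so adding mu_t only when u > tau reproduces the mixture.
simulate_D <- function(T, M, t0, theta, tau = 0.95) {
  mu <- pmax(seq_len(T) - t0, 0) * theta  # drift in the right tail after t0
  u  <- matrix(runif(T * M), T, M)
  qnorm(u) + mu * (u > tau)
}
```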

4 Application to temperature series

To illustrate the practical use of the three types of records tests, we apply them to the daily maximum temperature series, measured in degrees Celsius (\(^{\circ }\)C), from 1940 to 2019 at Madrid, Spain. Data are provided by the European Climate Assessment & Dataset (ECA&D; Klein Tank et al. 2002), available online at https://www.ecad.eu. Madrid is located in the center of the Iberian Peninsula (\(40.4^{\circ }\) N, \(3.7^{\circ }\) W) at 667 m a.s.l., and its daily temperature series has a seasonal component and a strong serial correlation. This series is analyzed using three different approaches to show the performance of the tests in different situations. The first approach considers the series of annual maximum temperatures, which corresponds to the traditional block maxima. The second approach considers the series of annual mean temperatures. Finally, the series is considered on a daily scale. To do this, we first take 365 subseries, each corresponding to the data of a given day across years, and then select a subset of uncorrelated subseries (Cebrián et al. 2022) on which the procedure of Sect. 2.3 is applied. The three approaches have series of length \(T = 80\), the first two with \(M = 1\) and the third with \(M = 58\) uncorrelated series out of the 365 dependent subseries.

In the context of global warming, it is reasonable to assume an increasing trend in location that can cause an increase in the number of upper records and a decrease in the number of lower records with respect to the values expected under a stationary climate, i.e., an IID series. For this reason, only results for N and d-type statistics are shown; these statistics are powerful against this scenario and obtained more evidence than s-type statistics. To compare the detection time of a changepoint in location versus a changepoint in the record occurrence, we consider the Pettitt (1979) test, a nonparametric rank-based test widely used to detect AMOC in location.

Figure 5 shows time series plots of the annual maximum (a) and annual mean (b) temperature at Madrid with their records and changepoint estimates. Table 2 shows, for the two previous series and for the series on the daily scale, the p-values and changepoint estimates for the six records tests and the Pettitt test. Small p-values in the records tests provide evidence against the null hypothesis of stationarity; in particular, all tests are significant at level \(\alpha = 0.10\), all but one are significant at level \(\alpha = 0.05\), and fourteen out of eighteen are significant at \(\alpha = 0.01\). The Pettitt test is also significant at any usual significance level in both series with \(M = 1\). The estimated changepoint for the annual maximum temperature series is \({\hat{t}}_0 = 51\) (year 1990) for all the records statistics and \({\hat{t}}_0 = 38\) (1977) for the Pettitt test. The minimum p-value for the records tests is 0.0013, for the statistic \(N^{var}\). The estimated changepoint for the annual mean temperature series is \({\hat{t}}_0 = 55\) (1994) with the statistics without weights and with weights proportional to the inverse of the SD, but \({\hat{t}}_0 = 69\) (2008) for the statistics with linear weights and \({\hat{t}}_0 = 41\) (1980) for the Pettitt test. The minimum p-value of the records tests is 0.0004, for the tests \(N^{var}\) and \(N^{linear}\). For the daily scale series the changepoint estimate is \({\hat{t}}_0 = 38\) (1977) for all records statistics, and here the minimum p-value is \(4\times 10^{-5}\), for \(d^{var}\).

The results in Table 2 agree with those obtained in Sect. 3. When \(M = 1\), N-type statistics obtain lower p-values than d-type, and the statistics with weights proportional to the inverse of the SD are those that obtain the strongest evidence. The changepoint estimate of the records tests is usually placed between 10 and 15 years after the Pettitt test estimates a changepoint in location. The changepoints estimated by the statistics with linear weights tend to locate the change very late. It is noteworthy that the changepoint is always estimated just before a record (see Fig. 5), so the changepoint estimate of a significant records test can be interpreted as the time from which there is evidence that the record occurrence is no longer stationary and the tail of the distribution begins to take on ever greater values, not previously seen. When \(M > 1\) the results are more stable, the estimated changepoint appears earlier as more information is available, and d-type statistics obtain smaller p-values than N-type.

Fig. 5 Annual maximum (a) and mean (b) temperature series, and lower and upper records at Madrid, Spain. The vertical solid line is the estimated changepoint using the records tests, while the vertical dashed line is the estimated changepoint using the Pettitt test

Table 2 Estimated changepoint \({\hat{t}}_0\) and p-value of the records tests and the Pettitt test for the annual maximum, annual mean and daily scale temperature series at Madrid, Spain

Figure 6 plots the year versus the absolute value of the processes associated with the records statistics for the annual maximum (a) and mean (b) temperature series, along with \(95\%\) confidence thresholds based on the Kolmogorov distribution (they are very similar even for the non-Kolmogorov distributed statistics). These plots show the evolution of the processes and other possible points with a greater record probability than under the null hypothesis. Again, the stationary null hypothesis is rejected, indicating potential changepoints in 1990 and 1994, respectively. The equivalent plot for the daily scale temperature series is shown in (c), with a clear maximum in 1977.

Fig. 6 Absolute value of the processes associated with the records statistics for changepoints of the annual maximum (a), annual mean (b) and daily (c) temperature series at Madrid, Spain. The horizontal dashed line represents the 95th percentile of the Kolmogorov distribution

5 Discussion, conclusions and future work

The interest in statistical tools to analyze nonstationary behavior in the extreme values of a distribution is growing. While extreme value analysis has traditionally been based on block maxima and excesses over a threshold, this paper proposes the use of records to study changes in the tails of the distribution. In particular, this paper proposes three novel distribution-free changepoint detection tests, and some generalizations, based on the breaking of records to (1) detect changes in the extreme events of the distribution, (2) learn about features of the record occurrence and (3) analyze data when only their records are available.

The proposed statistics are CUSUM-type statistics based on the record indicators. Statistics to deal with seasonal series have also been considered. Despite using very little sample information compared with the full series, the Monte Carlo simulations have shown that the proposed records tests are capable of detecting deviations from the null hypothesis, together with a reasonable changepoint at which this deviation in the probabilities of record becomes significant. However, care must be taken in the interpretation of the changepoint estimate: on the one hand, it is usually misplaced when the actual changepoint is located near the ends of the series; on the other hand, when it is well located, it often falls slightly after the actual changepoint in location or scale, if it exists, i.e., the effect of the change is not immediately reflected in the observed record occurrence.

The recommendation for use, according to the power and changepoint estimation accuracy of the tests, is as follows. If an increase in the number of records with respect to the stationary case is expected in a single tail of the distribution, the results show that N-type statistics are usually recommended. If an increase in the number of records is expected in both tails, then s-type statistics are preferred. The statistics without weights have the advantage of a known asymptotic distribution, while the statistics with weights proportional to the inverse of the SD have shown a more balanced behavior against the alternative hypothesis in the simulation results, with the disadvantage that their distribution must be calculated using Monte Carlo techniques.

The proposed tests join two important aspects in the study of climate change: changepoint detection methods and record-breaking events. This has been made apparent by applying the tests to different summary series (block maxima and annual means) and to the daily-scale temperature series at Madrid, Spain, detecting significant evidence of warming since the late 1970s and early 1990s.

Future work may go in different directions. (1) Combining the information from the different statistics could be of interest to increase the power and decrease the chance of mis-detection; e.g., the harmonic mean p-value of Wilson (2019) could be used to obtain a single p-value from all the tests. (2) The idea of splitting the series is fundamental to dealing with seasonal behavior. Here we use the method of Cebrián et al. (2022) to extract uncorrelated subseries. Another alternative would be to implement permutation tests, i.e., the distribution of the test statistic under the null hypothesis would be obtained by calculating its value under all possible rearrangements of the observed years, \(t=1,\ldots ,T\). In this way the dependence structure between the subseries is maintained without the need for a subset of independent subseries. (3) Our method has been developed within the AMOC domain; however, its extension to the multiple changepoint domain could be of interest. The simplest procedure would be to split the series where the changepoint is detected and retest the two subseries separately. However, this can cause the number of records in the new subseries to be too small to detect new changepoints, so other alternatives should be studied.

Finally, it is noteworthy that the proposed changepoint detection records tests are useful not only for analyzing the effect of global warming on the occurrence of records, but also in other fields where records are important, such as other environmental sciences affected by climate change, the study of extreme values in stock prices, or the influence of new sports equipment on the occurrence of sports records. To facilitate their use, all the statistical tools proposed in this paper are included in the R package RecordTest (Castillo-Mateo 2021), available from CRAN at https://CRAN.R-project.org/package=RecordTest.