Elsevier

Journal of Hydrology

Volume 594, March 2021, 125954
Journal of Hydrology

Research papers
An unbiased estimator of coefficient of variation of streamflow

https://doi.org/10.1016/j.jhydrol.2021.125954Get rights and content

Highlights

  • Daily streamflow exhibits periodic, skewed, multimodal and intermittent behavior.

  • Those factors lead to severe bias and variability in coefficient of variation C, estimates.

  • A zero-inflated lognormal monthly mixture, ΔLN3MM, model mimics observations.

  • A ΔLN3MM estimator leads to reliable and unbiased estimates of C.

  • Comparisons reveal an entirely new level of streamflow variability never before witnessed.

Abstract

Given increasing demand for high frequency streamflow series (HFSS) at daily and subdaily time scales there is increasing need for reliable metrics of relative variability for such series. HFSS can exhibit enormous relative variability especially in comparison with low frequency streamflow series formed by aggregation of HFSS. The product moment estimator of the coefficient of variation C, defined as the ratio of sample standard deviation to sample mean, as well as ten other common estimators of C, are shown to provide severely downward biased and highly variable estimates of C for very long records of highly skewed and periodic HFSS particularly for rivers which exhibit zeros. Resorting to the theory of compound distributions, we introduce an estimator of C corresponding to a mixture of monthly zero-inflated lognormal distributions denoted as a delta lognormal monthly mixture ΔLN3MM model. Through monthly stratification, our ΔLN3MM model accounts for the seasonality, skewness, multimodality, and the possible intermittency of HFSS. In comparisons among estimators, our ΔLN3MM based C estimator is shown to yield much more reliable and approximately unbiased estimates of C not only for small samples but also for very large samples (tens of thousands of observations). We document values of C in the range of [0.18, 42,000] with a median of 1.9 and an interquartile range of [1.34, 3.75] for 6807 daily streamflow series across the U.S. from GAGES-II dataset, with the highest values of C occurring in arid and semiarid regions. A multivariate analysis and national contour map reveal that extremely large values of C, never previously documented, tend to occur in arid watersheds with low runoff ratios, which tend to also exhibit a considerable number of zero streamflows.

Introduction

Streamflow variability has a profound impact on nearly every aspect of water resource design, planning and management, and therefore its quantification plays a key role. Reliable and unbiased metrics for streamflow variability are needed in a range of activities including, but not limited to: classification of regional hydrologic homogeneity; goodness-of-fit assessments of hydrologic models; evaluation of impacts of climatic variability and change on hydrologic systems; and in a wide range of research activities which seek to understand the hydroclimatic mechanisms which give rise to hydrologic variability. Recent research has shown that high levels of hydrologic variability play a dominant role and create considerable obstacles relating to our ability to estimate very common hydrologic statistics such as the Pearson correlation coefficient (Barber et al., 2019) and the Nash-Sutcliffe efficiency goodness-of-fit metric (Lamontagne et al., 2020).

Among the existing measures of relative variability, the coefficient of variation, C, introduced by Pearson (1896, pp. 276–277), is now perhaps the most widely used metric. C is defined as C=σ/μ where μ and σ denote the population mean and standard deviation respectively, of the random variable of interest. The index C has been applied in many other research areas including business, engineering, science, medicine, economics, psychology and other social sciences as well as many other fields (Nairy and Rao, 2003, Kelley, 2007, Soliman et al., 2012). Alternatively, the standard deviation σ is often used for comparing the variability of samples, because it has the same units as the random variable of interest. However, when one’s interest is in comparing the relative variability of several samples, each with different mean values, C is preferred to σ because it is nondimensional and thus accounts for differences in the mean of the samples. In finance, the inverse of C is commonly used as a measure of the performance of an investment portfolio (Knight and Satchell, 2005) and is often termed the risk to reward ratio.

We note at the outset, that C should only be computed for data measured on a ratio scale and would have no meaning for summarizing data on an interval scale (Velleman and Wilkinson, 1993). Ratio scale data are equally spaced data which exhibit a zero, whereas interval scale data are equally spaced data without a predefined zero point. Examples of ratio scale variables include streamflow, precipitation, and temperature measured in Kelvin. In contrast, most common temperature scales (e.g., Celsius, Fahrenheit etc.) are interval scales with arbitrary zeros, so C would be different depending on which scale is used. Most statistics including C, are meaningful for ratio data because their interpretation is unchanged when linear transformations are applied to the data. Other statistics such as means, standard deviations, and product moment correlations are meaningful for summarizing data on both ratio and interval scales.

Streamflow variability and skewness are linked. Vargo et al. (2010, Fig. 3) document the theoretical relationship between C and the coefficient of skewness γ for 36 probability distribution functions (pdfs). For two-parameter pdfs a unique relationship usually exists between C and γ (see Fig. 4 and discussion in Vogel and Fennessey, 1993). For example, for positively skewed Gamma and LN2 variables, γ is related to C via the relations γ=2C, and γ=3C+C3, respectively. For more complex pdfs, with more than two parameters, there is usually no unique relationship between C and γ, however there still remains a linkage between the two, given by a two-dimensional region within the plot of C versus γ, as depicted in Fig. 3 of Vargo et al. (2010).

It is widely understood that streamflow observations exhibit both variability and skewness, yet due to numerous factors discussed in Section 1.5, the most common and widely used product moment estimators of the coefficients of variation C and skewness γ, exhibit both severe downward bias and variability, and are generally not to be trusted, even for sample sizes in the tens of thousands (see Vogel and Fennessey, 1993).

Obtaining reliable estimates of C for daily flow series constitutes the central challenge of this study. Initial efforts to obtain reliable estimates of C for daily streamflow series were made by Limbrunner et al., 2000, Vogel et al., 2003 who applied an L-moment estimator of C based on a three-parameter lognormal distribution to flow series at 1571 watersheds across the U.S. Vogel et al. (2003) reported values of C for daily flow series ranging from approximately 0.5 to 10,000 with a median value of 10, and an interquartile range from 3 to 33. We were unable to find examples of such high values of C reported for any other variables, across multiple disciplines, which in part explains the need for new methods introduced here.

In the past, many water resources design, planning and management problems relied on low frequency streamflow series (LFSS) such as annual and monthly series resulting from the temporal aggregation (average) of daily, hourly, or subhourly high frequency streamflow series (HFSS). All such LFSS exhibit much less variability than the HFSS from which they were created. Regardless of the random variable of interest, aggregation leads to a reduction in variability. Given our focus on streamflow variability it is instructive to first consider the impact of aggregation of independent and identically distributed series. The aggregation (i.e. taking the average) of any independent and identically distributed (iid) random variable X over n intervals leads to a drop in its standard deviation σ to σ/n, so that the coefficient of variation of the aggregated variable denoted as Cn is reduced to Cn=C/n. Assume a daily time scale as a reference and denote the corresponding C as C1. Since average annual streamflows are known to exhibit values of C365 in the range of [0.2, 1.5] across the conterminous U.S. (see Vogel et al., 1998), one would expect C for daily streamflows to be in the range [3.8, 28.5] resulting from the relationship C1=365C36519C3653.8,28.5. These initial results are only very crude approximations because daily streamflows are neither identically distributed nor independent, and both of these factors affect the estimation of C, a central focus of this study.

Most streamflow statistics attempt to provide a summary of the statistical behavior of streamflow, as distinguished from its physical or deterministic characteristics. Daily streamflow is subject to numerous deterministic characteristics including a periodic component which is dominated by seasonal climatic conditions which can lead to intermittent and ephemeral streamflows and the occurrence of observations equal to zero. HFSS, such as hourly streamflow, may be subject to other forms of periodic behavior such as diurnal variations.

In a recent study closely related to this study, the need to account for periodicity when estimating sample statistics of daily streamflow series was documented by Lamontagne et al. (2020) in their Monte-Carlo experiments which evaluated the sampling variability of estimates of the goodness-of-fit metric termed efficiency E. Fig. 4 of Lamontagne et al. (2020) documents that accounting for the periodic behavior of daily streamflow led to marked reductions in the variability of estimates of E when compared with estimators of E which did not account for streamflow periodicity.

Zero streamflows are defined as streamflow below the measurement threshold, which is approximately 0.01 cfs in the U.S. (Granato et al., 2017). Of the 20,438 U.S. Geological Survey (USGS) river gages evaluated by Granato et al. (2017), 36% of those gages had at least one occurrence of zero streamflow and 2.6% of those gages had more than 297 days per year (or 81.3%) of zero streamflow. According to Levick et al. (2008), ephemeral and intermittent streams make up approximately 59% of all streams in the United States (excluding Alaska), and over 81% in the arid and semi-arid Southwest according to the USGS National Hydrography Dataset. Such streams usually reside in the headwaters or major tributaries of perennial streams in the Southwest.

Since the occurrence of zero daily streamflow is so common, we introduce a model which accommodates their occurrence; such a model that allows for frequent zero-valued observations is known as a zero-inflated model. We document later using both a zero-inflated model and streamflow observations, that the occurrence of zero streamflows leads to considerable increases in C, requiring estimators of C that accommodate their occurrence.

All statistics have both a theoretical and empirical interpretation. For example, the sample mean x- computed from a single sample of length n, is a sample estimate of the true or population mean μ. The theoretical sampling properties of x- are known for any iid variable and can be summarized by the mean and variance of x-. Since Ex-=μ, x- is said to be an unbiased estimator of μ. Barber et al., 2019, Lamontagne et al., 2020 provided examples of the sampling properties of estimates of the correlation coefficient and the Nash-Sutcliffe efficiency, respectively.

The difference between the population C and the mean of estimates of C is often referred to as ‘sampling bias’ because it results from estimating the true population value by a finite sample. Wallis et al. (1974) first exposed the importance of sampling bias in their seminal paper “Just a moment”. Bias results from the combination of numerous phenomena acting together. The occurrence of zeros, skewness, persistence, and seasonality characterizing HFSS lead to considerable challenges associated with estimation of C. Kirby (1974) also derived an upper bound on the product moment estimator of C which, until this study, was only considered to be important for small samples.

Values of population C exceeding unity correspond to samples with extremely high skewness, leading to enormous downward bias in all ratio estimators commonly used in hydrology. For example, for the case of population C = 10, Vogel and Fennessey (1993) reported downward bias associated with conventional product moment estimator of C of about 40% and 80% for samples of length 10,000 from synthetic iid samples from lognormal and generalized Pareto distributions, respectively. Note that Limbrunner et al., 2000, Vogel et al., 2003 and this study all document that a value of C = 10 for daily streamflows is not uncommon.

It is well known that autocorrelation inflates the sampling variance of most statistics. Vogel et al., 1998, Lombardo et al., 2014 analyzed the effect of the autocorrelation on the sampling properties of various moments and moment ratios, expanding the results of Wallis et al. (1974). One can think of the impact of autocorrelation as decreasing the effective sample size, so that in the limit, the sample size approaches unity as the autocorrelation approaches unity (which is the case for HFSS). Moreover, all ratio estimators are known to exhibit bias, which is induced by the fact that the numerator and denominator are often correlated random variables each with different sampling properties of their own, resulting in bias in estimation of the overall ratio.

All product moment ratio estimators are known to exhibit bias due to outliers, because very large/small observations, which are far away from the sample mean, exert much more influence than the other observations, due to the exponentiation involved in higher-order moments. In such instances, which occur frequently in daily streamflow series, observations do not exert the same weight, and single or few observations can dictate the value of the sample estimates. Vogel and Fennessey (1993, Fig. 3) documented the enormous impact of the largest observation even for very large samples of daily streamflows in the tens of thousands; they also discussed remarkably large downward bias associated with product moment estimates of skewness and by analogy the same would be true for kurtosis and all other higher order moment ratios. For highly skewed bivariate lognormal samples, Lai et al. (1999) concluded that significant upward bias in estimates of the Pearson correlation coefficient exist, and only begins to disappear for sample sizes in the range of 3–4 million observations. Barber et al. (2019) extend the results of Lai et al. (1999) to highly skewed and periodic hydrologic series, showing clearly that the ordinary product moment estimator of the Pearson correlation coefficient should generally be avoided for use with daily and sub daily streamflow series. Thus, there is ample evidence in the literature that new methods are needed to better estimate C for highly skewed hydrologic data, even for very large samples.

The study of the sampling bias associated with commonly used product moment ratio estimators of HFSS has received very little attention, which is surprising when one considers the increasing attention being given to the application of HFSS in water resource management activities. It is now commonplace for hydrologists to model streamflow at sub daily scales (including hourly and even sub hourly scales) for use in flood forecasting as well as for real-time stormwater and water quality management activities and in hillslope hydrology applications. For example, the National Water Model (Office of Water Prediction, 2017) provides hourly streamflow forecasts for any location within the U.S. With increasing focus on challenges relating to big data, combined with increasing access to graphical processing units, supercomputer resources and the internet of things, one can expect to see continuing development of new HFSS modeling and data acquisition approaches which will result in profound challenges associated with estimation of various summary statistics corresponding to zero-inflated, periodic and highly skewed HFSS. Examples of research which address such challenges include this study, as well as the two recent studies by Barber et al., 2019, Lamontagne et al., 2020.

The primary goals of this paper are (1) to develop and compare approximately unbiased estimators of C for use with highly skewed, periodic, and possibly intermittent daily flow series and (2) to apply those estimators to large samples of daily streamflow data to enable a better understanding of the influence of skewness, zeros, periodicity and other physical factors, on the relative variability of streamflow. We begin with a literature review of estimators of C, followed by introduction of zero-inflated and monthly mixture models to deal with periodicity, skewness and non-perennial rivers. Three classes of estimators of C are introduced. The first class of estimators account for the high degree of variability and skewness associated with daily streamflows. The second class of estimators account for both skewness and zero-inflation, while the third class of estimators account for skewness, zero-inflation and the periodicity of daily streamflows. Then, we perform Monte Carlo experiments which compare the behavior of these estimators of C, and finally we apply those estimators to observed daily flow series across the U.S. A multivariate analysis and contour map of estimates of C enable us to summarize some of the hydroclimatic mechanisms which drive daily streamflow variability. Summary and recommendations conclude this study.

Section snippets

Estimation of C – Literature review

Our review reveals that nearly all approaches to estimation of C and its sampling properties assume that the random variable of interest is independent and identically distributed (iid). Daily streamflow is neither independent, nor identically distributed, yet since this is the assumption behind every estimator of C we could locate, we begin with a review of the estimators of C which stem from this assumption.

Most literature on the sampling properties of estimators of C assume independent

Summary of estimators of C

Three classes of estimators of C are introduced in this study: (1) sample estimators based on the assumption that daily streamflow arises from a single distribution which is equivalent to the assumption of identically distributed streamflows, throughout the year, (2) sample estimators based on a single zero-inflated which assume two populations, one for zero flows and one for nonzero flows, and (3) a mixture model which combines monthly zero-inflated LN3 model with a monthly mixture model. The

Results

In the following section we first evaluate and compare the goodness-of-fit of the various pdfs introduced for modeling daily streamflow observations at thousands of watersheds across the conterminous U.S. We also perform Monte Carlo experiments which evaluate the sampling properties (bias and RMSE) of the various estimators of C introduced previously with the goal of determining which estimator is best suited to use with HFSS. We then apply the estimator of C which performs best in those

Summary of behavior of estimates of C

Of considerable interest are the factors which give rise to such extremely large values of ĈΔLN3MM depicted in Fig. 9 across the conterminous U.S. Naturally there are many factors which would explain the gross variations in ĈΔLN3MM including, but not limited to: physical watershed characteristics, climatic characteristics, and numerous water management variables relating to withdrawals, diversions, return flows, reservoir operations, and other factors. It is beyond the scope of this initial

Impact of persistence on the behavior of estimates of C

Previous sections have accounted for the influence of periodicity, skewness, intermittent and ephemeral streamflow conditions on the properties of C. In this section we discuss briefly the influence of persistence on flow variability. Evidence of streamflow persistence can be described by its autocorrelation structure, which is known to have an impact on the sampling properties of estimators of C, the topic of this section. Vogel et al. (1998) documented the impact of serial correlation of the

Conclusions and recommendations

This study has sought to improve our understanding of the behavior of, and our ability to estimate the coefficient of variation C of high frequency streamflow series (HFSS), such as daily and subdaily series. We have also documented the critical need to account for the high levels of periodicity, skewness and zero streamflows associated with HFSS, when attempting to estimate summary statistics of such series.

Although other measures of relative variability exist, we focus on C because it is easy

CRediT authorship contribution statement

Lei Ye: Software, Validation, Formal analysis, Investigation, Resources, Supervision, Visualization, Data curation, Project administration, Funding acquisition. Xuezhi Gu: Software, Formal analysis, Data curation, Investigation. Dingbao Wang: Conceptualization, Supervision, Investigation, Writing - review & editing. Richard M. Vogel: Conceptualization, Methodology, Formal analysis, Writing - original draft, Writing - review & editing, Validation, Visualization, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

We are indebted to the late Nicholas C. Matalas, who provided some input on a very early version of this manuscript and who provided some of the inspiration for this work. We are also indebted to Francesco Serinaldi for his rigorous review of an early draft of this manuscript which led to considerable improvements in our presentation. We are also particularly indebted to Robert Hirsch for his extremely detailed and constructive comments. This work was partially supported by National Key

References (62)

  • W.-K. Pang et al.

    On interval estimation of the coefficient of variation for the three parameter Weibull, log-normal and gamma distribution: A simulation-based approach

    Eur. J. Oper. Res

    (2005)
  • W.K. Pang et al.

    A simulation-based approach to the study of coefficient of variation of dividend yields

    Eur. J. Oper. Res.

    (2008)
  • J. Aitchison

    On the distribution of a positive random variable having a discrete probability mass at origin

    J. Am. Stat. Assoc.

    (1955)
  • S. Amiri et al.

    Assessing the coefficient of variations of chemical data using Bootstap method

    J. Chemometrics

    (2010)
  • C.K. Baldwin et al.

    Seasonality of streamflow: The upper Mississippi River

    Water Resour. Res.

    (1999)
  • S. Banik et al.

    Estimating the population coefficient of variation by confidence intervals

    Commun. Stat. – Simul. Comput.

    (2011)
  • C. Barber et al.

    Improved estimators of correlation and R2 for skewed hydrologic data

    Hydrol. Sci. J.

    (2019)
  • A.G. Blum et al.

    The probability distribution of daily streamflow in the United States

    Hydrol. Earth Syst. Sci.

    (2017)
  • R. Breunig

    An almost unbiased estimator of the coefficient of variation

    Econ. Lett.

    (2011)
  • M.I. Brunner et al.

    Stochastic simulation of streamflow and spatial extremes: a continuous, wavelet-based approach

    Hydrol. Earth Syst. Sci.

    (2020)
  • Castellarin, A., Botter, G., Hughes, D.A., Liu, S., Ouarda, T.B.M.J., Parajka, J., Post, D.A., Sivapalan, M., Spence,...
  • V.T. Chow

    The log-probability law and its engineering applications

    Proc. ASCE, New York

    (1954)
  • M. Christman

    Review of estimation methods for parameters of the delta-lognormal distribution

    Technical Report

    (2019)
  • Crow, E.L., Shimizu, K. (Eds.), 1988. Lognormal Distributions: Theory and Applications. Marcel Dekker,...
  • S.D. Dubey

    Compound gamma, beta and F distributions

    Metrika

    (1970)
  • Falcone, J.A., (2011). GAGES–II—Geospatial attributes of gages for evaluating streamflow: U.S. Geological Survey...
  • D.J. Finney

    On the distribution of a variate whose logarithm is normally distributed

    Suppl. J. Roy. Stat. Soc.

    (1941)
  • F.F. Gan et al.

    Goodness-of-fit tests based on P-P probability plots

    Technometrics

    (1990)
  • Granato G.E., Ries, K.G., III, Steeves, P.A., 2017. Compilation of streamflow statistics calculated from daily mean...
  • Y. Guo et al.

    Analytical estimation of geomorphic discharge indices for small intermittent streams

    J. Hydrol. Eng.

    (2016)
  • Helsel, D.R., Hirsch, R.M., Ryberg, K.R., Archfield, S.A., Gilroy, E.J., (2020). Statistical methods in water...
  • W.A. Hendricks et al.

    The sampling distribution of the coefficient of variation

    Ann. Math. Stat.

    (1936)
  • J.R.M. Hosking

    L-moments: analysis and estimation of distributions using linear combinations of order statistics

    J. Roy. Stat. Soc. B

    (1990)
  • J.R.M. Hosking

    The four parameter kappa distribution

    IBM J. Res. Dev.

    (1994)
  • J.R.M. Hosking et al.

    Regional frequency analysis: an approach based on L-moments

    (1997)
  • Hosking J. R. M., (2017). L-Moments. R package, version 2.6. URL:...
  • J.C. Houghton

    Birth of a parent: The Wakeby distribution for modeling flood flows

    Water Resour. Res.

    (1978)
  • N.L. Johnson et al.

    Application of the nonecentral t-distribution

    Biometrika

    (1940)
  • B. Kedeem et al.

    Estimation of mean rain rate: application to satellite observations

    J. Geophys. Res.

    (1990)
  • K. Kelley

    Sample size planning for the coefficient of variation from the accuracy in parameter estimation approach

    Behav. Res. Methods

    (2007)
  • W. Kirby

    Algebraic boundedness of sample statistics

    Water Resour. Res.

    (1974)
  • Cited by (12)

    • New method for scaling nonpoint source pollution by integrating the SWAT model and IHA-based indicators

      2023, Journal of Environmental Management
      Citation Excerpt :

      The simulation performance at the daily step was slightly worse than that at the monthly step, reflected by the difference in NSE. This is consistent with previous studies (Ye et al., 2021). The amount of simulated data increased with decreasing temporal scale, leading to greater errors in recession calculations in the calibration and validation of SWAT.

    View all citing articles on Scopus
    View full text