A generic framework to analyse the spatiotemporal variations of water quality data on a catchment scale

https://doi.org/10.1016/j.envsoft.2017.11.003Get rights and content

Highlights

  • Propose a generic framework for spatiotemporal variation analysis of water quality.

  • Inter- and intra-change of spatiotemporal variations of water quality were studied.

  • Integrate data mining techniques to solve environmental problems.

  • The proposed framework has the potential to be applied to other environmental data.

Abstract

Most spatiotemporal studies treat spatial and temporal analysis separately. However, spatial and temporal changes occur simultaneously and are correlated. In this study, we propose a generic framework to simultaneously analyse the spatial and temporal variations of water quality on a catchment scale. Specifically, we analyse the heterogeneity of temporal evolution of water quality data among different sampling sites, and the heterogeneity of spatial distribution of water quality data over different sampling times, respectively, by integrating the techniques of normalized mutual information, dynamic time wrapping and cluster analysis. To bring deep insight into the spatiotemporal variations, inter-change and intra-change are further defined and distinguished, respectively. Taking the Fuxi River catchment as a case study, results indicate that the proposed framework is intuitive and efficient. Beyond this, the generic framework can be expanded for other catchments and various environmental data.

Introduction

The water quality of a river can be attributed to both natural processes and anthropogenic activities including surface runoff (Abbaspour et al., 2007), climate change (Whitehead et al., 2009), geological structure (Bilgin and Konanç, 2016), land use (Sliva and Williams, 2001, Ding et al., 2015) and sewage discharge (Zhen and Zhu, 2016). Different atmospheric inputs, climatic conditions and anthropogenic inputs may result in spatial variation of water quality in rivers (Bricker and Jones, 1995). On the other hand, seasonal variation in precipitation may cause river discharge variations and subsequently affect the concentration of pollutants in river water (Vega et al., 1998). Therefore, a good understanding of temporal and spatial variations of water quality on a catchment scale is of great importance for water pollution control, aquatic ecosystem restoration and water management by regional communities (Bu et al., 2010, Iscen et al., 2008).

The methods for temporal and/or spatial analysis on water quality data have developed from univariate analysis to multivariate analysis. In the early stage, spatial and/or temporal analysis of water quality data mainly focused on a single argument (Laznik et al., 1999, Niu et al., 2004). This kind of analysis method cannot fit well to more and more complex water challenges. With the development of computer science, multivariate statistical analysis methods have grown, mainly including principal component analysis (PCA), principal factor analysis (PFA), factor analysis (FA), cluster analysis (CA) and discriminant analysis (DA) (Singh et al., 2004, Smeti and Golfinopoulos, 2016). These multivariate statistical techniques have been well-accepted and widely applied in water quality assessments (Shrestha and Kazama, 2007, Iscen et al., 2008, Wang et al., 2013), particularly for key parameter extraction and main pollution source identification.

For instance, applying CA and FA, Bu et al. (2010) grouped 12 sampling sites into three pollution level clusters (no pollution, moderate pollution and high pollution) and identified five factors of pollution sources for the Jinshui River of the South Qinling Mountains in China. Wang et al. (2013) applied CA and PCA/FA to evaluate temporal/spatial variations in water quality and identify latent sources of water pollution in the Songhua River Harbin region, China. Smeti and Golfinopoulos (2016) applied DA on surface water quality data of Yliki Lake to determine which variables were the most efficient in discriminating between clusters.

However, most existing spatiotemporal variation analytical works treat the temporal and spatial analysis of water quality data separately. For instance, Ouyang et al. (2006) assessed the seasonal variations in surface water quality in the lower St. Johns River. Wang et al. (2013) performed the temporal and spatial cluster analysis of water quality in the Songhua River Harbin region separately, classifying monitoring time into three periods and classifying monitoring stations into three groups. Chang (2008) conducted spatial analysis of water quality trends in the Han River basin, South Korea. However, the co-occurrence and correlation of temporal and spatial variations were not thoroughly considered during the previous spatiotemporal variation assessment.

Therefore, an approach, which allows to comprehensively analyse the temporal and spatial variations of water quality data simultaneously, is highly needed. Especially, in the context of a changing environment, it is essential to study the spatiotemporal variations of water quality data, so as to adapt to climate change and environment deterioration.

This paper aims to propose a generic framework to simultaneously analyse the spatiotemporal variations of water quality data on a catchment scale. The importance of this paper lies in teaching environmental engineers and scientists without a computational background to systemically analyse temporal and spatial variations of water quality data in a more effective, intuitive and easier way.

The objectives of this paper are as follows.

  • 1)

    to analyse how the spatial distributions of water quality data change over time;

  • 2)

    to analyse the spatial heterogeneity of temporal evolution of water quality data; and

  • 3)

    to demonstrate the procedure and verify the effectiveness of the proposed framework, taking a water quality dataset in Fuxi River catchment as a typical example.

Section snippets

Study area

To demonstrate the proposed framework, any catchment could have been chosen for illustration. Here, the Fuxi River catchment is used as a case study as the authors are familiar with this example catchment and reliable data have been collected by trusted sources.

The Fuxi River catchment (28°58′-29°46′ N, 103°43′-105°36′ E, Fig. 1) covers 3490 km2 of drainage area with a mainstream (Fuxi River) length of 73.2 km, a bending coefficient of 2.21 and an average gradient of 0.27‰. According to the

Overview of the framework

An overview of the proposed framework to analyse the spatiotemporal variation of water quality data is presented in Fig. 2. In general, the framework consists of two routes. The first route is to analyse the heterogeneity of temporal evolution of water quality among different sampling sites. Namely, how the water quality of each sampling site evolves over time, and is the temporal evolution of water quality at different sampling sites similar or not? The second route focuses on the

Overview

This section presents the spatiotemporal analysis results of water quality by applying the proposed generic framework on Fuxi River catchment. The results include two parts: the results of spatial heterogeneity of temporal evaluation of water quality and the results of distinct changes of spatial distribution over time. In addition, potential reasons behind the results have been discussed by integrating the meteorological environment, agriculture and human activities in the studied catchment.

Inter-change assessment

Conclusions and recommendations

A generic framework for the spatiotemporal variations of water quality on the catchment scale has been proposed. This framework includes two tasks of water quality analysis: spatial heterogeneity of temporal evolution and changes of spatial distribution over time. The results with respect to the specific case study indicate that our framework allows revealing the inter- and intra-change of water quality systematically and report the spatial and temporal changes of water quality simultaneously.

Acknowledgments

Author contributions: J. S. and Q. Y. designed the research; G. W. and M. S. made valuable suggestions and comments to the research design; Q.Y., and J. S. analysed the data; G. W. and X. L. provided the data, Q. Y. wrote the first manuscript draft; all authors read and commented on the paper, and M. S. edited the final paper.

This work has been financially supported by the National Natural Science Foundation of China (grant numbers 61403062 and 41601025), The National Key Research and

References (33)

  • M. Vega et al.

    Assessment of seasonal and polluting effects on the quality of river water by exploratory data analysis

    Water Res.

    (1998)
  • Y. Wang et al.

    Assessment of surface water quality via multivariate statistical techniques: a case study of the Songhua River Harbin region, China

    J. Hydro-Environment Res.

    (2013)
  • S. Zhen et al.

    Analysis of isotope tracing of domestic sewage sources in Taihu Lake — a case study of Meiliang Bay and Gonghu Bay

    Ecol. Indic.

    (2016)
  • A. Bilgin et al.

    Evaluation of surface water quality and heavy metal pollution of Coruh River Basin (Turkey) by multivariate statistical methods

    Environ. Earth Sci.

    (2016)
  • O.P. Bricker et al.

    Main Factors Affecting the Composition of Natural Waters. Trace Elements in Natural Waters

    (1995)
  • T.M. Cover et al.

    Entropy, relative entropy and mutual information

    Elem. Inf. Theory

    (1991)
  • Cited by (4)

    • The DiscoverFramework freeware toolkit for multivariate spatio-temporal environmental data visualization and evaluation

      2021, Environmental Modelling and Software
      Citation Excerpt :

      Thus, the data and the insights that data contain are largely inaccessible to most stakeholders. The DiscoverFramework fills this gap by taking advantage of recent approaches that support analysis and visualization of complex spatio-temporal multivariate data sets (Lundblad and Jern, 2012; Yang et al., 2017; Yigitbasioglu and Velcu, 2012). Of most importance to DiscoverFramework are (1) the widespread and ever increasing access to technologies such as laptops, tablets, and smart phones; (2) expanding web access to public domain data sets on environmental systems; (3) access to free cloud data storage; (4) new web access and design ideas that enable increasingly comprehensive displays of data; (5) increasingly robust free GIS capabilities through QGIS; and (6) increasing ability to include interactive maps, charts, and communication between users in a single web page, recently applied mostly in financial and news sectors.

    • Assessing climate impact on forest cover in areas undergoing substantial land cover change using Landsat imagery

      2019, Science of the Total Environment
      Citation Excerpt :

      As a result, monthly temperature and precipitation during December 1988 and February 2015 and annual temperature and precipitation during 1989–2014 on pixel scale are constructed. Diverse methods of spatiotemporal variation analysis have been proposed (Yang et al., 2017). In this study, to characterize the spatial variation features of forest EVI, spatial heterogeneity of seasonal and annual EVI over three different phases are displayed and compared.

    • Dynamic runoff simulation in a changing environment: A data stream approach

      2019, Environmental Modelling and Software
      Citation Excerpt :

      There have been extensive studies on runoff simulation, yet more accurate runoff simulation still remains a challenge in the context of a changing environment (Montanari et al., 2013). In recent decades, the formation and variation of runoff is suggested to have been considerably impacted by climate change and increasingly intensive human activities (Wang et al., 2016; Yang et al., 2017). The runoff series and the rainfall-runoff relationship have exhibited non-stationary features (Milly et al., 2005), which result in poor performances of many existing runoff simulation models.

    View full text