Position paper: Sensitivity analysis of spatially distributed environmental models- a pragmatic framework for the exploration of uncertainty sources

https://doi.org/10.1016/j.envsoft.2020.104857Get rights and content

Highlights

  • A pragmatic framework of sensitivity analyses is provided for spatially distributed environmental models.

  • The framework prescribes sequential steps in which important considerations are highlighted.

  • The framework benefits users of sensitivity analyses in environmental modeling and GIS communities.

Abstract

Sensitivity analysis (SA) has been used to evaluate the behavior and quality of environmental models by estimating the contributions of potential uncertainty sources to quantities of interest (QoI) in the model output. Although there is an increasing literature on applying SA in environmental modeling, a pragmatic and specific framework for spatially distributed environmental models (SD-EMs) is lacking and remains a challenge. This article reviews the SA literature for the purposes of providing a step-by-step pragmatic framework to guide SA, with an emphasis on addressing potential uncertainty sources related to spatial datasets and the consequent impact on model predictive uncertainty in SD-EMs. The framework includes: identifying potential uncertainty sources; selecting appropriate SA methods and QoI in prediction according to SA purposes and SD-EM properties; propagating perturbations of the selected potential uncertainty sources by considering the spatial structure; and verifying the SA measures based on post-processing. The proposed framework was applied to a SWAT (Soil and Water Assessment Tool) application to demonstrate the sensitivities of the selected QoI to spatial inputs, including both raster and vector datasets - for example, DEM and meteorological information - and SWAT (sub)model parameters. The framework should benefit SA users not only in environmental modeling areas but in other modeling domains such as those embraced by geographical information system communities.

Introduction

Sensitivity analysis (SA) and uncertainty analysis (UA) are important tools for investigating model behavior, testing model hypotheses, and exploring the potential for simplifying models (Wagener and Pianosi, 2019). Uncertainty is intrinsic to all modeling work that involves representing natural processes and/or human behavior. Sources of uncertainty that need to be considered in such exercises are model input datasets, model structure, and model parameters. SA studies the influence of input factors (e.g., parameters, forcing, initial value of model states, model resolution, and model structure such as different parameterization schemes of a model or submodel) on model outputs. It is considered a key practice in the assessment of environmental models (Chen et al., 2019; Gan et al., 2014; Jakeman et al., 2006; Matott et al., 2009; Oakley and O'Hagan, 2004; Pianosi et al., 2016; Yue et al., 2020). In comparison, UA quantifies the uncertainty of model outputs from input datasets and model parameters, typically characterized by empirical probability distributions and/or confidence bounds for the model parameters and outputs. UA can be considered an extension of SA with the uncertainty distributions for the input factors being used as the perturbations. Therefore, SA can be used to indicate when uncertainty in input factors matters in terms of the impact on the uncertainty in the outputs. Care must however be taken in the interpretation of SA results as sensitivities can be dependent on parameter ranges selected, model structure assumed, length of data period examined and its climatic forcing (Shin et al., 2013).

The use of SA and UA in environmental modeling has become of particular importance due to the highly complex nature of environmental systems, and the attendant complexity of models typically invoked to represent them. This is especially the case for spatially distributed environmental models (referred to from hereon as SD-EMs), where there tends to be a considerable number of model parameters due to their spatially variant nature, and substantial uncertainty in the model and its predictions. Uncertainty and sensitivity related studies in environmental modeling are rising in popularity because of the growing awareness of the importance of models in supporting informed decision making (Douglas-Smith et al., 2020), coupled with the fact that current process-based environmental models are typically, and perhaps necessarily, deterministic in their representation (Farmer and Vogel, 2016; Uusitalo et al., 2015).

This paper focuses on Monte Carlo simulation-based SA of SD-EMs, which is a valuable tool per se and one that can also inform uncertainty analyses. Monte Carlo simulation-based approaches are widely applied due to their ease of implementation, yet there is a lack of a comprehensive pragmatic framework for conducting such approaches for SD-EMs (Yang et al., 2018). An SD-EM is intrinsically tied to the spatial dimensions of producing and utilizing data that represent the spatially distributed nature of the modeling context. Grid-based digital elevation models (DEMs), site-specific point measurements, and remotely sensed images are examples of such data. However, SAs are rarely conducted for DEM and DEM-derived parameters even though the inherent scale and errors of a spatial dataset and/or of the whole environmental model can have a significant impact on model outputs (Tran et al., 2018). A crucial issue to take into account regarding spatial datasets is the spatial structure of their uncertainty. Generally, spatial datasets are characterized by spatial dependence (i.e., spatial coherence), and their uncertainties are also spatially autocorrelated (Oksanen and Sarjakoski, 2005a; Wechsler, 2007). Thus, ignoring such characteristics can lead to erroneous estimation of sensitivity measurements. Moreover, because spatial datasets often determine the uncertainty in model resolution and structures through their boundaries, discretization and scale, exploring uncertainty related to spatial datasets can partly account for model uncertainty in SD-EMs.

This article introduces a pragmatic framework for the application of SA to an SD-EM, using a scenario/simulation-based approach to investigate the significance of potential uncertainties in the model inputs, which can not only explore model and data assumptions transparently but also be an informative precursor to a more thorough UA. The objectives of the framework are to provide sufficient information and background in order to guide the selection of more appropriate choices at each step of the SA process: potential uncertainty source identification; selection of SA method(s) and quantities of interest (QoI); perturbation propagation; and SA evaluation and post-processing. The framework emphasizes the following aspects: it attempts to address potential uncertainty sources related to spatial datasets; and assists in propagating the potential uncertainty sources by considering their likely spatial structure. Therefore, the framework helps to explore the impact of potential uncertainty of spatial datasets in an SD-EM, and to compare their relative impacts with the usual factors in SA (e.g., model parameters). The framework is intended to benefit both non-experts and SA users in environmental modeling and geographical information system (GIS) communities.

The remainder of this article is organized as follows. Section 2 broadly introduces the pragmatic framework for applying SA to SD-EMs, covering potential uncertainty source identification, selection of SA method and QoI, perturbation propagation, and SA evaluation and post-processing. Then, from Sections 3 Identification of potential uncertainty sources, 4 Selection of SA method(s) and quantities of interest, 5 Perturbation propagation, 6 SA evaluation and post-processing, the detailed steps and their corresponding considerations are discussed. Section 7 provides a concise example of the SA framework. The article concludes in Section 8 with a discussion of future needs and opportunities.

Section snippets

A pragmatic SA framework for SD-EMs

The presented framework prescribes sequential steps in which important considerations are highlighted to guide modelers towards the selection of appropriate choices for the pragmatic application of SA to uncertainty exploration in SD-EMs. The overarching steps and the corresponding considerations are depicted in Fig. 1. The main purpose of the framework is to identify the contributions of potential uncertainty sources to the selected QoI. This section introduces the pragmatic framework to

Classification of uncertainty sources

This initial step involves the identification of potential uncertainty sources associated with the model's input factors that influence the selected outputs of an environmental model, or functions of those outputs (i.e., QoI). Various types of uncertainty sources could influence the outputs of the models, and numerous classification schemes for uncertainty sources have been introduced to categorize them (Matott et al., 2009; Refsgaard et al., 2007; Beck, 1987; Linkov and Burmistrov, 2003).

Selection of SA method(s) and quantities of interest

This step firstly provides guidance on SA method selection based on two main criteria: the purposes of the SA and the characteristics of the SD-EM. This guidance includes only two fundamental SA purposes (i.e., ranking and screening), but SA can have additional purposes such as factor mapping that provides further descriptions for the input space related to QoI (Saltelli et al., 2008). SA methods for ranking generate the order of input factors based on their relevant influence on QoI, and

Perturbation propagation

Monte Carlo simulation-based SA needs to propagate the perturbations of input factors through the model to analyze the sensitivity of model outputs and their QoI to those input factors (Saltelli and Tarantola, 2002). Proper selection of the perturbations within plausible ranges and distributional assumptions is a crucial step in SA because the perturbation attempts to reflect the degree of uncertainty in input factors. This section introduces useful methods for perturbation propagation of the

Assessing convergence and credibility

The convergence of SA measures needs to be assessed because SA measures sometimes are not constant and vary with sample sizes, especially when they are obtained from smaller sample sizes than required sizes suggested in the literature (Sarrazin et al., 2016; Vanrolleghem et al., 2015). Two methods are generally utilized to evaluate the convergence of SA measures, which are based on the central limit theorem (CLT) and the bootstrapping technique (Yang, 2011). According to the CLT, the sample

An example of the framework with SWAT

This section illustrates the SA framework applied to a widely used SD-EM, the Soil and Water Assessment Tool (SWAT) (Neitsch et al., 2011; Zhang et al., 2019), based on an extension of the SA applied in (Koo et al., 2020). Following the framework, the first step is uncertainty source identification. The application of SWAT can be divided into three sub-models: the watershed delineation model, the HRU (hydrological response units) generation model for preprocessing, and the SWAT model for the

Conclusions

This article presents a pragmatic framework for the application of sensitivity analysis (SA) to a spatially distributed environmental model (SD-EM). The suggested framework for SA consists of four general steps: potential uncertainty source identification, selection of SA method and predictive quantities of interest, perturbation propagation, and SA evaluation and post-processing. This framework also provides useful background and general guidance on applying SA to other areas of environmental

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

The primary idea of this manuscript was discussed during two workshops in Nanjing (May 2019) and Canberra (December 2019), and the manuscript was drafted through several Zoom meetings. This work was supported by the Key Project of NSF of China (Grant 41930648), the NSF for Excellent Young Scholars of China (Grant 41622108), National Key Research and Development Program of China (Grant 2017YFB0503500), the Australian Government Research Training Program (AGRTP) Scholarship and a top-up

References (173)

  • A. Comber et al.

    Spatial analysis of remote sensing image classification accuracy

    Remote Sens. Environ.

    (2012)
  • D. Douglas-Smith et al.

    Certain trends in uncertainty and sensitivity analysis: an overview of software tools and techniques

    Environ. Model. Software

    (2020)
  • C.R. Ehlschlaeger et al.

    Visualizing spatial data uncertainty using animation

    Comput. Geosci.

    (1997)
  • M.W. Farthing et al.

    Effects of model resolution on optimal design of subsurface flow and transport problems

    Adv. Water Resour.

    (2012)
  • Y. Gan et al.

    A comprehensive evaluation of various sensitivity analysis methods: a case study with a hydrological model

    Environ. Model. Software

    (2014)
  • Y. Ge et al.

    Exploring uncertainty in remotely sensed data with parallel coordinate plots

    Int. J. Appl. Earth Obs. Geoinf.

    (2009)
  • J.H. Guillaume et al.

    Introductory overview of identifiability analysis: a guide to evaluating whether you have the right type of data for your modeling purpose

    Environ. Model. Software

    (2019)
  • J.C. Helton

    Uncertainty and sensitivity analysis techniques for use in performance assessment for radioactive waste disposal

    Reliab. Eng. Syst. Saf.

    (1993)
  • J.C. Helton et al.

    Survey of sampling-based methods for uncertainty and sensitivity analysis

    Reliab. Eng. Syst. Saf.

    (2006)
  • K.W. Holmes et al.

    Error in a USGS 30-meter digital elevation model and its impact on terrain modeling

    J. Hydrol.

    (2000)
  • A.J. Jakeman et al.

    Ten iterative steps in development and evaluation of environmental models

    Environ. Model. Software

    (2006)
  • C. Kelleher et al.

    Ten guidelines for effective data visualization in scientific publications

    Environ. Model. Software

    (2011)
  • H. Koo et al.

    Geovisualizing attribute uncertainty of interval and ratio variables: a framework and an implementation for vector data

    J. Vis. Lang. Comput.

    (2018)
  • X. Li

    Characterization, controlling, and reduction of uncertainties in the modeling and observation of land-surface systems

    Sci. China Earth Sci.

    (2014)
  • H. Lin et al.

    Virtual geographic environments (VGEs): a new generation of geographic analysis tool

    Earth Sci. Rev.

    (2013)
  • L.T.H. Newham et al.

    Sensitivity analysis for assessing the behaviour of a landscape-based sediment source and transport model

    Environ. Model. Software

    (2003)
  • J. Norton

    An introduction to sensitivity assessment of simulation models

    Environ. Model. Software

    (2015)
  • J. Nossent et al.

    Sobol’sensitivity analysis of a complex environmental model

    Environ. Model. Software

    (2011)
  • A. O'Hagan

    Probabilistic uncertainty specification: overview, elaboration techniques and their application to a mechanistic model of carbon flux

    Environ. Model. Software

    (2012)
  • J. Oksanen et al.

    Error propagation of DEM-based surface derivatives

    Comput. Geosci.

    (2005)
  • P.S. Palar et al.

    Global sensitivity analysis via multi-fidelity polynomial chaos expansion

    Reliab. Eng. Syst. Saf.

    (2018)
  • K.C. Abbaspour

    SWAT Calibration and Uncertainty Programs—A User Manual

    (2015)
  • J.C. Aerts et al.

    Accounting for spatial uncertainty in optimization with spatial decision support systems

    Trans. GIS

    (2003)
  • D.P. Ahlfeld et al.

    Impact of simulation model solver performance on ground water management problems

    Ground Water

    (2008)
  • L. Anselin

    Local indicators of spatial association—LISA

    Geogr. Anal.

    (1995)
  • L. Anselin

    Spatial Regression Analysis in R: A Workbook

    (2005)
  • L. Anselin

    A local indicator of multivariate spatial association: extending Geary's C

    Geogr. Anal.

    (2019)
  • Ansi (American National Standards Institute)

    Spatial Data Transfer Standard (SDTS) - Part 1. Logical Specifications. ANSI NCITS 320-1998. Washington, DC

    (1998)
  • J. Aouissi et al.

    Sensitivity analysis of SWAT model to the spatial rainfall distribution and watershed subdivision in streamflow simulations in the Mediterranean context: a case study in the Joumine watershed

  • A. Bárdossy et al.

    Influence of rainfall observation network on model calibration and application

    Hydrol. Earth Syst. Sci.

    (2008)
  • M.B. Beck

    Water quality modeling: a review of the analysis of uncertainty

    Water Resour. Res.

    (1987)
  • F. Berzaghi et al.

    Carbon stocks in central African forests enhanced by elephant disturbance

    Nat. Geosci.

    (2019)
  • D. Brockmann et al.

    Comparing global sensitivity analysis for a biofilm model for two-step nitrification using the qualitative screening method of Morris or the quantitative variance-based Fourier Amplitude Sensitivity Test (FAST)

    Water Sci. Technol.

    (2007)
  • F. Campolongo et al.

    Comparing different sensitivity analysis methods on a chemical reactions model

  • I. Chaubey et al.

    Effect of DEM data resolution on SWAT output

    Hydrol. Process.

    (2005)
  • J. Cho et al.

    Effect of spatial distribution of rainfall on temporal and spatial uncertainty of SWAT output

    Trans. ASABE (Am. Soc. Agric. Biol. Eng.)

    (2009)
  • N.R. Chrisman

    The error component in spatial data

  • M. Crosetto et al.

    Uncertainty and sensitivity analysis: tools for GIS-based model implementation

    Int. J. Geogr. Inf. Sci.

    (2001)
  • R.I. Cukier et al.

    Study of the sensitivity of coupled reaction systems to uncertainties in rate coefficients. I Theory

    J. Chem. Phys.

    (1973)
  • C.V. Deutsch et al.

    GSLIB Geostatistical Software Library and User's Guide

    (1998)
  • Cited by (37)

    • Reproducing computational processes in service-based geo-simulation experiments

      2023, International Journal of Applied Earth Observation and Geoinformation
    • Assessing safety functionalities in the design and validation of driving automation

      2023, Transportation Research Part C: Emerging Technologies
    • Global sensitivity and uncertainty analysis of a coastal morphodynamic model using Polynomial Chaos Expansions

      2023, Environmental Modelling and Software
      Citation Excerpt :

      Wang et al. (2020) illustrated the strength of PCE compared to the Sobol method (a Monte-Carlo-based method) in evaluating the effect of parameter interactions on hydrological models using a smaller number of simulations, e.g. 1,000,000 versus 600 simulations. Many other studies (Shahsavani and Grimvall, 2011; Hu et al., 2015; Fan et al., 2016; Kerrou et al., 2017; Christelis and Hughes, 2018; Goeury et al., 2022; Koo et al., 2020; Ayyad et al., 2021) used the PCE methods in the sensitivity analysis of hydrodynamic, hydrological, and hydraulic models. Yet, to the authors’ knowledge, the use of a robust and cost-effective sensitivity analysis method, based on the PCE, in the field of coastal erosion modeling has not been introduced.

    View all citing articles on Scopus
    View full text