Continuous evaluation of the spatial representativeness of land surface temperature validation sites
Introduction
Land surface temperature (LST) is a vital indicator of the interaction between the Earth's surface and atmosphere, and it is an essential input parameter for associated models (Kalma et al., 2008; Liang et al., 2010; Tomlinson et al., 2011; Zhan et al., 2014). Compared with LST measured at ground sites, LST retrieved from satellite remote sensing has the advantage of spatial high-density coverage, continuity, and low cost. Therefore, it is efficient to use remote sensing to obtain LST on regional and global scales. Accordingly, the validation of satellite-retrieved LST is important: on the one hand, it is essential for developers of LST products, since it allows them to improve their retrieval algorithms and parameterization schemes (Ghent et al., 2017; Wan, 2014; Liang et al., 2021); on the other hand, satellite LST accuracy is important information for many applications, e.g. the target accuracy of satellite LST is 1 K for climate-related studies (Guillevic et al., 2018; WMO, 2020). Therefore, it is crucial to quantify the accuracy of a satellite LST product before its application.
Four LST validation methods are recommended by the Committee on Earth Observation Satellite (CEOS) working group on calibration and validation: temperature-based (T-based), radiance-based, cross-validation/intercomparison, and time-series analysis (Guillevic et al., 2018). The T-based method, which compares satellite LST with in-situ LST directly, is the most widely used method in satellite LST validation and the only method that qualifies as validation in the classical, metrological sense (Coll et al., 2012; Duan et al., 2019; Göttsche et al., 2016; Guillevic et al., 2014; Li et al., 2013; Martin et al., 2019; Ouyang et al., 2017; Simó et al., 2016; Zhang et al., 2019; Zhang et al., 2021). The T-based method makes the hypothesis that in-situ LST is representative on the spatial scale of the satellite. Theoretically, only ground sites with perfectly homogeneous surfaces (e.g. lakes, grassland, and desert) on the satellite pixel scale as well as on the in-situ measurement scale are appropriate for LST validation, that is, the in-situ LST must be spatially representative on satellite pixel scale. In practice, most ground sites are located over inhomogeneous landscapes and are not dedicated to satellite LST validation (e.g. FluxNET, ChinaFlux, and HiWATER), i.e., the in-situ LST does not represent ground truth on satellite pixel scale. Can we evaluate the spatial representativeness of such sites and use them for satellite LST validation? How should we evaluate their spatial representativeness and quantify its influence on the LST validation results?
Several methods have been used to quantify a ground site's spatial representativeness in the context of validating LST and other surface parameters. Generally, those methods can be divided into two categories. The first category is based on analyses of the surface heterogeneity within a pixel. A more heterogeneous surface yields lower spatial representativeness, e.g. spatial heterogeneity was analyzed with semi-variograms in albedo validation (Román et al., 2009; Wang et al., 2014); the site was classified into different heterogeneity levels by combining semi-variogram, coefficient of variation, and dominant land cover type in LAI (leaf area index) and LST validation (Xu et al., 2016; Yu et al., 2017); other indicators, e.g. standard deviation (Coll et al., 2012; Duan et al., 2019; Guillevic et al., 2014; Li et al., 2014), and coefficient of variation (Zhou et al., 2015), were also used to analyze spatial heterogeneity. Li et al. (2020) proposed a spatial and diurnal temperature cycle-constrained sampling strategy, which provides an effective way to validate LST over heterogeneous surfaces with dramatic diurnal LST changes. The second category analyses the relationship between various parameters on the in-situ and pixel scale, e.g. Huang et al. (2016) compared multi-point observed surface solar radiation with their averages to quantify the representativeness error; Hakuba et al. (2013) assessed the spatial sampling error of a site on the grid-scale with the corresponding collocated pixel and surrounding area averages; Martin et al. (2019) selected the representativeness period based on the annual variation of multi-year monthly averaged daytime differences between AATSR LST and in-situ LST; Liu et al. (2013) use the footprint model to assess the representativeness area of evapotranspiration measured at a ground site.
In general, in LST validation one cannot use those methods directly to evaluate the spatial representativeness. On the one hand, spatial heterogeneity characterizes the complexity of the surface, and spatial representativeness characterizes the degree of difference between ground measured and pixel observed parameters. However, satellite observations reflect the averaged value within a pixel: therefore, heterogeneity cannot fully characterize representativeness. On the other hand, compared to land surface parameters with low temporal variation, e.g. LAI, LST varies strongly on diurnal and intra-annual scales (Göttsche and Olesen, 2001; Zhan et al., 2014). In addition, due to differences in thermal inertia, LST variations in the ground radiometer's field-of-view (FOV) and on satellite pixel scale also differ in their responses to the variations of weather conditions. Therefore, it is reasonable to speculate that a ground site's spatial representativeness also varies on the temporal scale and, thus, the temporal evaluation of spatial representativeness is an important issue for validating satellite LST. Overall, the current methods do not provide a temporally continuous evaluation of spatial representativeness.
Therefore, this study proposes a temporal variation method (TVM) for evaluating the spatial representativeness of LST validation sites. The basic idea of the method is that a ground site's spatial representativeness is a function of time and can be described by a spatial representativeness indicator (SRI), which is defined as the LST difference between the ground radiometer's FOV and satellite pixel. Under ideal conditions, LST variation depends exclusively on the revolution of Earth around its axis and the Sun: the temporally continuous SRI introduced here describes this ideal scenario as a smooth curve. In reality, the SRI curve fluctuates around this ideal smooth curve as it is affected by changing surface characteristics (e.g. soil moisture, vegetation, land cover) and weather conditions. The variation of SRI can be mainly attributed to spatial variations of LST that follow typical intra-annual variations, e.g. due to vegetation changes, but are also affected significantly by weather conditions. Since the indicator describes the LST difference between the two scales, it can also be used as a bridge to convert the in-situ LST to pixel scale, and the satellite LST can be directly compared with the scale-converted LST to address the uncertainty caused by the spatial representativeness. The remainder of this article introduces and demonstrates applications of the TVM method and is structured into the sections Datasets, Methods, Results, Discussion, and Conclusions.
Section snippets
Ground sites and measurements
Sixteen ground sites with longwave radiation measurements in China were selected. Their geolocations and details are shown in Fig. 1 and Table 1. These sites were operated during different campaigns and are in different climate regions and biomes; thus, they represent a broad range of surface and climatic situations.
The selected sites were equipped with different models of longwave radiometers or four-component radiometers for the outgoing and incoming longwave radiation measurements, including
Temporal variation method
Following Nappo et al. (1982), we define the spatial representativeness of a site as the LST difference between the surface within the ground radiometer's FOV and the surface within the satellite pixel covering the site. Inheriting the rapid variation characteristics of LST over time, the spatial representativeness of a site can be quantified as:where SRI is the spatial representativeness indicator in K; t denotes time, for the purpose in this study, it denotes the day of the year
∆ATC
TP on the MODIS pixel scale and TF on the ground radiometer's FOV scale (extracted from the TM/ETM+ data) for the 16 sites are shown in Fig. 3. TP on the AATSR pixel scale is similar to that on the MODIS pixel scale; thus, the corresponding TP for AATSR is not shown. For all sites, the extracted samples of TF and TP are relatively evenly distributed over the intra-annual cycle, allowing satisfactory fits of the ATC model. Since a ground radiometer's FOV is within its corresponding MODIS/AATSR
Uncertainty sources in the TVM method
The TVM decomposes the SRI into three parts (see Eq. (6)) and the resulting temporally extended SRITPR is partly predicted. Besides the uncertainty of the model (see Eq. (12)), the uncertainty sources of the SRITPR include the bias of Landsat LST in ∆ATC, the uncertainty of the various input factors in ∆USC, and the neglected ∆DTCF-P (Eq. (13)). In the following, each uncertainty contribution is analyzed separately.
Let the Landsat LST mean bias on the spatial scale of satellite pixel and the
Conclusions
Being able to quantify the spatial representativeness of a ground site is a requirement for performing meaningful T-based satellite LST validations. Since LST has evident variations on different temporal scales and is significantly affected by the underlying surface as well as weather conditions, a site's spatial representativeness for a specific satellite LST product should be evaluated continuously rather than just once. In this study, we propose a temporal variation method (TVM) to evaluate
Declaration of Competing Interest
The authors declare that there is no conflict of interests regarding the publication of this article.
Acknowledgments
This work was supported in part by the Strategic Priority Research Program of Chinese Academy of Science under Grant XDA20100101, the National Natural Science Foundation of China under Grants 41871241, by the Fundamental Research Funds for the Central Universities of China, University of Electronic Science and Technology of China under Grant ZYGX2019J069. This work was also supported by the ESA-MOST Dragon 5 Cooperation Programme under Grant 59318. The authors would like to thank LP DAAC for
References (63)
- et al.
GEOV1: LAI, FAPAR essential climate variables and FCOVER global time series capitalizing over existing products. Part 2: validation and intercomparison with reference products
Remote Sens. Environ.
(2013) - et al.
Long-term accuracy assessment of land surface temperatures derived from the advanced along-track scanning radiometer
Remote Sens. Environ.
(2012) - et al.
Evaluation of six land-surface diurnal temperature cycle models using clear-sky in situ and satellite data
Remote Sens. Environ.
(2012) - et al.
Validation of collection 6 MODIS land surface temperature product using in situ measurements
Remote Sens. Environ.
(2019) - et al.
Modelling of diurnal cycles of brightness temperature extracted from Meteosat data
Remote Sens. Environ.
(2001) - et al.
Validation of land surface temperature products derived from the visible infrared imaging radiometer suite (VIIRS) using ground-based and heritage satellite measurements
Remote Sens. Environ.
(2014) - et al.
Representativeness errors of point-scale ground-based solar radiation measurements in the validation of remote sensing products
Remote Sens. Environ.
(2016) - et al.
Satellite-derived land surface temperature: current status and perspectives
Remote Sens. Environ.
(2013) - et al.
Evaluation of the VIIRS and MODIS LST products in an arid area of Northwest China
Remote Sens. Environ.
(2014) - et al.
An optimal sampling method for multi-temporal land surface temperature validation over heterogeneous surfaces
ISPRS J. Photogramm. Remote Sens.
(2020)