Abstract
Background and aims
The quantitative retrieval of soil organic carbon (SOC) storage, particularly for soils with a large potential for carbon sequestration, is of global interest due to its link with the carbon cycle and the mitigation of climate change. However, complex ecosystems with good soil qualities for SOC storage are poorly studied.
Methods
The interrelation between SOC and various vegetation remote sensing drivers is understood to demonstrate the link between the carbon stored in the vegetation layer and SOC of the top soil layers. Based on the mapping of SOC in two horizons (0–30 cm and 30–60 cm) we predict SOC with high accuracy in the complex and mountainous heterogeneous páramo system in Ecuador. A large SOC database (in weight % and in Mg/ha) of 493 and 494 SOC sampling data points from 0–30 cm and 30–60 cm soil profiles, respectively, were used to calibrate GPR models using Sentinel-2 and GIS predictors (i.e., Temperature, Elevation, Soil Taxonomy, Geological Unit, Slope Length and Steepness (LS Factor), Orientation and Precipitation).
Results
In the 0–30 cm soil profile, the models achieved a R2 of 0.85 (SOC%) and a R2 of 0.79 (SOC Mg/ha). In the 30–60 cm soil profile, models achieved a R2 of 0.86 (SOC%), and a R2 of 0.79 (SOC Mg/ha).
Conclusions
The used Sentinel-2 variables (FVC, CWC, LCC/Cab, band 5 (705 nm) and SeLI index) were able to improve the estimation accuracy between 3–21% compared to previous results of the same study area. CWC emerged as the most relevant biophysical variable for SOC prediction.
Similar content being viewed by others
Introduction
Soil organic carbon (SOC) levels are directly related to the amount of soil organic matter (SOM), due to its composition of organic compounds that are highly enriched in carbon. SOM is a heterogeneous mixture of materials that range in stage of decomposition from fresh plant residues to highly decomposed material known as humus (Ontl and Schulte 2012). SOM contains roughly 55–60% C by mass. In many soils, C stored in SOM comprises most or all of the C stock – referred to as SOC – except where inorganic forms of soil C occur (FAO and ITPS 2015). A majority of studies since the last 120 years it is found that carbon comprises 58% of organic matter obtaining a conversion factor of 1.724 (Van Bemmelen 1890). However, the assumption that organic matter is 50% carbon, would in almost all cases, be more accurate (Pribyl 2010). The SOC pool is also a key indicator of soil quality as it affects essential biological, chemical and physical soil functions such as nutrient cycling, pesticide and water retention, and soil structure maintenance (Jandl et al. 2014). Moreover, the quantitative retrieval of SOC storage is of global interest due to its link with the carbon cycle, and the monitoring of soils with a large potential for carbon sequestration in the mitigation of climate change (Crowther et al. 2016; Zahasky and Krevor 2020; Tan et al. 2020). A pressing need is to reduce the human-caused elevated CO2 concentrations and to protect valuable natural carbon sinks such as preserving ecosystems with high SOC storage capacity (Meinshausen et al. 2009; Corbeels et al. 2019). On the other hand, ecosystems with spatial heterogeneity involve variations between land and atmospheric conditions (Adhikari et al. 2020), and SOC storage strongly relates to the physical, chemical and biological behavior of the soil (Mirchooli et al. 2020). The depletion of SOC stock is attributed to numerous factors including: decrease in the amount of biomass returned to the soil, change in soil moisture and temperature regimes (which accentuate the rate of decomposition of organic matter), high decomposability of crop residues due to differences in C:N ratio and lignin content, tillage-induced perturbations, decrease in soil aggregation and reduction in physical protection of the soil organic matter, and increase in soil erosion (Lal 2005, 2021). Therefore, SOC variations also are related to land use changes and events related to climate change (Grace et al. 2006; Martin et al. 2010; Müller-Nedebock et al. 2016; Zhou et al. 2019). Climatic changes such as the overall increasing temperatures tend to increase decomposition but this will be limited where soils become very dry (Keesstra et al. 2016), while in contrast, climate events such as extreme rainfall or higher temperature events may alter the micro-organisms/ biota and vegetation cover, leading to adverse effect over the mineralization processes (Lefévre et al. 2017). Hence, the magnitude of the effect of climate change over SOC stocks depends on the intricate interplay of local external factors, such as climate, and the ecosystem-specific composition of the organic matter itself that steers its interactions with the inorganic soil phase (Schmidt et al. 2011).
In the last years, some remote sensing techniques attempted to achieve SOC predictions with a high precision level (Angelopoulou et al. 2019) using high-spatial resolution spaceborne platforms such as Sentinel-2 (S2) (Vaudour et al. 2019). Through the European Union’s Copernicus programme free information services drawn from satellite Earth Observation and in-situ data can be obtained. Satellites from the Sentinel missions belong to a set of dedicated satellites developed and operated by the European Space Agency (ESA) and the European Organization for the Exploitation of Meteorological Satellites (EUMETSAT) (European Union 2015). Through its products, ESA aims to generate fully-calibrated estimates of at-land products quasi-independent of the original Earth Observation to offer aid and understanding of global carbon cycling (Plummer et al. 2006). The S2-A and S2-B satellites offer free multi-spectral decametric high-spatial resolution (10 to 60 m) optical imagery. From the S2 multi-spectral instrument (MSI) images biophysical products are routinely derived such as Fraction of Absorbed Photosynthetically Active Radiation (FAPAR), Leaf Area Index (LAI), Fractional Vegetation Cover (FVC), Leaf Chlorophyll Content (Cab) and Canopy Water Content (CWC), providing unique opportunities to exploit land products for complex ecosystems (ESA 2015; Hu et al. 2020). Due to the short revisit time and the spatial resolution of S2 it is possible to obtain a large availability of remote sensing data. Therefore, finding key S2 indicators that allows linking vegetation cover characteristics with SOC storage would open opportunities to improve the SOC variability description both at field and regional scales(Castaldi et al. 2019).
The estimation of SOC using remote sensing data would be a non-destructive and a non-invasive methodology. Important information can be obtained through the link between different environmental parameters and spectral information in the visible (400–700 nm) and near-infrared (700–2500 nm) regions (Sørensen and Dalsgaard 2005; Xie et al. 2011), where soil appears darker with increasing SOC content (Ladoni et al. 2010), and an increase of reflectance near 800 nm is identified with increasing SOM (Krishnan et al. 1980). Spectral absorption regions where C-rich plant constituents absorb can be related to the above soil characteristics (e.g. vegetation coverage and types of land uses) using various regression methods through a training Machine Learning Regression Algorithms (MLRA) (Verrelst et al. 2012).
Among the outstanding MLRA methods are: decision trees (Breiman 1996, 2001; Friedman 2001); geostatistics (Rasmussen 1996); Neural Networks (NN) (Haykin 1999); kernel-based methods (Suykens and Vandewalle 1999) and Gaussian Processes Regression (GPR) (Rasmussen and Williams 2006). GPR is a non-linear regression method that uses non-parametric Bayesian modelling (Rasmussen 1996; Rasmussen and Williams 2006) that is able to perform adaptive, nonlinear data fitting for complex datasets, using relevant bands during the development of a regression model. Therefore it is possible to explain the physical meaning of the model (Camps-Valls et al. 2009; Verrelst et al. 2012; Van Wittenberghe et al. 2014). GPR models can achieve high precision, as well as obtain confidence intervals in the predictions. Likewise, GPR methods have a great adaptability to data with low susceptibility to the presence of noise (Camps-Valls et al. 2009; Verrelst et al. 2013a). Hence, GPR methods applied to remote sensing of vegetation properties provide favorable results for vegetation traits such as LAI (Estévez et al. 2020; Verrelst et al. 2020; Zhang et al. 2020b). Also for the retrieval of soil properties, GPR methods obtained high accuracies to predict soil moisture content and soil water retention, while the retrieval of SOM is considered acceptable (Taneja et al. 2021; Rastgou et al. 2021). Models trained combining kriging and machine learning can also be used to reveal chemical soil properties related to the capacity of soils to store SOC (John et al. 2021). The quantification of global SOC storage and revealing of soils rich in SOC storage is a goal for mankind to generate preservations and monitoring strategies, carbon sequestration could potentially mitigate climate change (Crowther et al. 2016; Angelopoulou et al. 2019, 2020).
SOC is high at the upper soil layer and decreases with the depth (Jobbágy and Jackson 2000; Olson and Al-Kaisi 2015), but the topsoil layer is also more exposed to the effects of climatic changes, land uses changes, erosion, etc. Understanding the vertical distributions of SOC is key to predicting and simulating the influences of climate, global change, and human activities on the terrestrial carbon cycle (Wang et al. 2004). Differences in SOC accumulation also depend on the composition and age of the soil at different depths, therefore variables related with SOC storage would be more or less representative in the predicted models.
This study focuses on the largest herbaceous páramo ecosystem in Chimborazo province, Ecuador, (MAE 2013), and wherefore we aim to develop a regression model to predict SOC at different soil depths/layers with a high level of precision based on an in situ validated GPR method. The objectives are: (i) to assess several variables linked to SOC sequestration to calibrate a prediction model optimized with GPR for estimation of SOC at different soil depths (0–30 cm and 30–60 cm below ground), and (ii) to spatially predict and compare the SOC content at high spatial resolution at 0–30 cm and 30–60 cm soil profiles.
Materials and methods
Study area
The páramo ecosystem forms a North–South corridor of more than 2000 km between Venezuela and Peru, in one of the most dynamic mountain ranges, geologically and biogeographically speaking (Jorgense and Ulloa 1994; Hofstede 2004). With an elevation generally between 3000 and 4000 m a.s.l and topographic heterogeneity, it is the largest supplier of water to the Andes of Venezuela, Colombia and Ecuador and to extensive parts of the inter-Andean areas, from the Caribbean and Pacific coasts of Costa Rica and Panama, to the desert of northern Peru (Hofstede et al. 2014). In Ecuador, the páramo covers the upper part of the two mountain ranges that run north–south, where the soils principally correspond to the volcanic origin of different ages. The main difference between the soils of the páramo is due to the difference in bedrock between the north and the south of the country. The ecosystem is characterized by its low temperatures, a high humidity despite moderate rainfall and weak evaporation, and a high capacity of water retention and regulation. It has high concentrations of SOM and their soils are deep (up to several meters) with variations influenced by soil evolution (Mena et al. 2000). Consequently, the total amount of carbon stored per hectare of páramo can be greater than that of a tropical forest soil, hence the páramo soils are considered as SOC sink (Hofstede 2004). Studying this type of ecosystem and revealing the SOC reserves with a high level of precision could contribute to the understanding of global SOC distribution.
The study area is the herbaceous páramo ecosystem in the central Ecuadorian páramo region. It is a complex mountainous system with an elevation range between 2303–4501 m a.s.l. and covering 25.7% of the Chimborazo province (Fig. 1). Extended between 78º39' west longitude and 1º39' south latitude, the study area covers approximately 1667.6 km2 (Ayala Izurieta et al. 2021). The ecosystem boundary is the Western and Eastern Cordilleras arranged in meridian direction (Moreno et al. 2018).
The weather is cold-humid with a constant humidity, and a mean annual temperature of 11 ºC. The cloud cover in the study area is high most of the time due to the high level of humidity and the mountainous conditions. About the climate, it is quite stable along the year, with larger temperature changes during the day compared to the annual changes of the mean temperature. Therefore dry and wet seasons occur but without notable differences (Mena Vásconez et al. 2011).
The herbaceous páramo soils are characterized by a soil water retention and regulation capacity between 0.55 and 0.90 cm3 cm−3 (Ministerio de Ambiente del Ecuador 2012; Guio Blanco et al. 2018; Moreno et al. 2018). Additionally, these soils are resistant to erosion with a stable structure, micro-aggregation, high porosity, high humidity rate and permeability, stimulating the root development (Mena et al. 2000). The predominant vegetation is mainly based on tufted grasses larger than 50 cm in height. Within the dominant species, the genus Calamagrostis, Agrostis, Festuca, Cortaderia and Stipa stand out, shrub patches of the genus Diplostephium, Hypericum and Pentacalia are also differentiated, among others (MAE 2013).
Soil organic carbon data
A database with SOC data on two soil profiles (0–30 cm and 30–60 cm) collected in 2016 and 2017 was used. The initial database contains around 500 in situ SOC sampling points, with a random spatial sampling distribution taking into account the geological units, the taxonomy of the soil, and accessibility based on the steepness of the terrain and availability of ballast roads and trails (Ayala Izurieta et al. 2021).
The in situ soil samples were collected from the top layer (0 to 30 cm) and 30 to 60 cm below ground using a blast-hole, then these are stored and taken to the laboratory for SOC quantification processes. Soil samples were sieved (2-mm mesh), oven-dried at 105 °C, and ground prior to analysis. The total SOC in the collected soil samples were determined with an Elemental Analyzer (Flash 2000 Organic Elemental Analyzer type CHNS/O, ThermoFicher Scientific). Specifically, a soil aliquot, containing approximately 10 mg of organic carbon in silver capsules, was weighed (Tonon et al. 2010; Bateni et al. 2021). Additional samples were taken at each location of the sampling points of both profiles to determine the soil bulk density (in g/cm3). Cylinders of 88 cm3 were used, taking undisturbed soil samples (Al-Shammary et al. 2018) and then the common volume cylinder method was applied (Bongiorno et al. 2019). With the soil sample SOC weight % obtained (g C/100 g of soil) and the soil bulk density of the sample, the SOC content was expressed in Mg/ha (Lee et al. 2009; Tonon et al. 2010; Al-Shammary et al. 2018; Bongiorno et al. 2019; Ayala Izurieta et al. 2021; Bateni et al. 2021).
After identifying the anomalous data points (points extremely far from the tendency line and points without information or zero value), 493 and 464 SOC data points were kept in the 0–30 cm and 30–60 cm soil profile datasets, respectively (see Fig. 1). The geographical position (UTM coordinates, datum WGS84) of each data point was recorded (using a PGS-Trimble JUNO SB handheld with 2-to- 5-m positional accuracy in real time) to georeference the SOC data and to obtain a spatial relation with all variables evaluated on this study to be potential proxies of SOC content.
Meteorological and geophysical data
Considering the environmental factors, the following variables were analysed: temperature, elevation, soil taxonomy, geological unit, slope length and steepness (LS Factor), orientation and precipitation. In a previous study these variables were identified as being related to the storage of SOC in different levels of importance (Ayala Izurieta et al. 2021). The air temperature and precipitation are linked with SOC decomposition rates (Zhang et al. 2015). Also, soils with lower decomposition rates of carbon have high field water holding capacity (WHC) (Xu et al. 2016). The average annual temperature and the precipitation data in the herbaceous páramo ecosystem in 2015 were obtained with a kriging interpolation method (Oliver and Webster 2007; Mishra et al. 2009) using the data from 57 meteorological weather stations (21 stations belonging to Institute of Meteorology of Ecuador (INAMHI) located within Chimborazo province, 3 stations belonging to National University of Chimborazo (UNACH), to cover the southeast of Chimborazo province and 33 stations belonging to INAMHI located in surrounding areas out of the study area).
Topographic factors can be indicators of SOC due to their influence on the capacity of the soil to store SOC (Abebe et al. 2020). The uneven soil topography is typical of this complex geographic area (Podwojewski and Poulenard 2004), therefore elevation and orientation variables are evaluated. Also, due to the strong elevation gradients, land-use changes and soil erosion is considered an important factor for the SOC variability, as changes in the soil erosion can alter long-term stored SOC in the top soil layers and lead loss of SOC (Olson et al. 2016; Shi et al. 2020). The LS factor (Desmet and Govers 1996; Panagos et al. 2015; Lu et al. 2020) is one of the most important controlling factors of soil characteristics and geomorphic processes such as soil erosion (Khanifar and Khademalrasoul 2020). It is the product between the L and S factors (Eqs. 1 to 6, see below), where factor L correspond to the influence of impact of the slope length while the factor S explains the effect of the slope's inclination (Ozsoy et al. 2012; da Cunha et al. 2017). The LS factor map was created in ArcGis 10.2 software using the DEM corresponding to the study area with a spatial resolution of 30 m and applying the expressions (1) to (6), where Ai,j is the accumulation area with coordinates (i,j) [m2]; D is the length of the pixel size [m]; x is the shape coefficient [dimensionless]; m has values between 0 and 1 [dimensionless]; \(\theta\) is the slope angle [rad]; and \(\beta\) is the ratio of rill to interrill erosion [dimensionless] (Desmet and Govers 1996).
Due to topographic heterogeneity of this mountain geosystem and taking into account that the soil use of herbaceous páramo has only natural vegetation, the soil classification is also evaluated as a predictor based on the Soil Taxonomy and Geological Unit. These variables are highly relevant for the predictive modelling of SOC storage (Ayala Izurieta et al. 2021). The national geological mapping classification was used, where 31 geological units conform to the herbaceous páramo ecosystem (SNI 2011). On the other hand, the Soil Taxonomy is a more international system of classification, wherefore the USDA Soil Taxonomy is used (USDA and NRCS 2014).
Image processing and remote sensing data
Given its good spatial resolution, we selected multispectral S2 imagery for this study. Ten S2 bands were used, i.e., four bands at 10 m (B2-490 nm, B3-560 nm, B4-665 nm, B8-842 nm) and six bands at 20 m (B5-705 nm, B6-740 nm, B7-783 nm, B8a-865 nm, B11-1610 nm, B12-2190 nm) (ESA 2015). There are three bands in the visible spectrum, a band to the near infrared, four bands located in the red edge and two bands of SWIR. S2 has a temporal resolution of 5 days with both satellites operating (S2-A, S2-B), or 10 days referring to a single satellite. The S2 images used are Level-1C processing products (ESA 2015) (see Table 1), downloaded from Global Visualization Viewer (GloVis) web service of the United States Geological Survey (USGS 2017).
Level-1C products include radiometric and geometric corrections, and also orthorectification and spatial registration in a global reference system using a Digital Elevation Model (DEM) to project the image in cartographic coordinates (ESA 2015), leading to Top Of Atmosphere (TOA) reflectances. To obtain per-pixel Bottom Of Atmosphere (BOA) reflectances, i.e. processing Level 2A, the Sen2Cor Toolbox version 2.4 through the Sentinel Applications Platform (SNAP) software version 5.0 was used (see Fig. 2a). The complex study area is typically characterized by a high percent of cloud cover, which demands for a special treatment for encapsulation and removal of clouds. The clouds encapsulation was realized using the additional Cloud Mask products combined with manual identifying and removing of clouds. Finally, a BOA mosaic of the herbaceous páramo ecosystem was obtained with ArcGis 10.2 software (see Fig. 2b).
This study uses the S2 spectral indices NDVI, SAVI, WDRVI, EVI2, NDWI, VARIg, NDSI, BI, NDMI, NBR, NBR2 and adds a physiologically-based green leaf area index (LAIgreen index) known as the Sentinel-2 LAIgreen Index (SeLI) (Pasqualotto et al. 2019) (see Table 2). The SeLI index has the potential to be used in a unified algorithm for LAIgreen estimation or to identify bare areas, wherefore it can provide relevant information as a SOC storage indicator.
Punctual spectral information (i.e., individual S2-bands, spectral indices) is extracted from the final mosaic corresponding to the SOC sampling databases (493 SOC data points in the 0–30 cm profile and 464 SOC data points in the 30–60 cm profile); this database is used for training the regression model for SOC prediction. It contains spectral information of the input variables to be evaluated as predictors (S2-bands), spectral indices, meteorological and soil data, and also biophysical data). In the same way, the final mosaic is used to create an extensive geo-database of points, one point per pixel in the study area. The database contains information on the found SOC predictors, and it is used to obtain the SOC prediction values using the trained and optimized regression model.
Biophysical vegetation data
S2-derived vegetation biophysical variables were directly obtained from an automatic process implemented in SNAP 5.0 including FAPAR, LAI, FVC, LCC/Cab [µg.cm−2] and CWC [mg.cm−2] for each L2A product of the used scenes (Table 1). These biophysical parameters were evaluated as variables linked to the SOC storage. FAPAR is the fraction of incoming solar radiation in the range of 400–700 nm that is absorbed by vegetation. It characterizes the growth status of the vegetation and it can be used for tracing mass and energy exchanges (Chen et al. 2020). LAI parameter is the one-sided green leaf area per unit ground area (Chen and Black 1992), related to vegetation growth and productivity, as well as significant for modelling energy fluxes, water and carbon (Chen et al. 2002; Verrelst et al. 2015). FVC is the ratio of the vertically projected area of vegetation to the total surface area (Song et al. 2017) and is used to separate vegetation and soil in energy balance processes, including temperature and evapotranspiration. FCV is also used for monitoring of green vegetation, due to it does not depend on variables such as the geometry of illumination as compared to FAPAR (Weiss et al. 2000; Weiss and Baret 2016). LCC or Cab is a pivotal parameter in the monitoring of agriculture and carbon cycle modelling at regional and global scales as it can provide crucial information for understanding photosynthesis potential (Osborne and Raven 1986) plant stress (Carter 1994) and physiological status (Qian and Liu 2020; Zhang et al. 2020a). CWC is the product of LAI with leaf canopy content (LWC) (Verrelst et al. 2020), LWC is the Leaf water content and it has important physiological and ecological significance for plant growth (Wang et al. 2021).
Finally, the cloud removal step was also applied to the vegetation product images (see Fig. 2b). Information of biophysical variables was added to each pixel of the database for SOC prediction, as well as to the sampling point database for training the SOC regression model.
Machine learning regression method—Gaussian Process Regression
Unlike class labels obtained through a classification process, a regression produces predictions of continuous values. Considering that SOC database contains in situ sampling campaigns as well as meteorological information and the corresponding biophysical, spectral and soil surface characteristics of the study area, then a non-parametric regression model is applied. It allows to achieve a SOC prediction model adjusted with respect to the information of the input variables and existing SOC observations (see Fig. 3).
In order to probe the GPR capacity, multiple machine learning regression algorithms (MLRA) are used. For this purpose, several available methods were applied using the ARTMO software (Automated Radiative Transfer Model Operator) version 3.29 (Verrelst et al. 2019). Eight regression methods were tested (see Fig. 3b) using the same training data (input variables / bands, SOC data in situ) with a data percentage of 70% for training and 30% for validation.
SOC prediction with Gaussian Processes Regression (GPR)
Previous studies with GPR for the evaluation of biophysical parameters obtained results superior to those obtained with other non-parametric regression methods such as NN and KRR (Verrelst et al. 2012). GPR combining remote sensing data may be suitable for predicting physicochemical parameters of the soil (Rasmussen and Williams 2006; Verrelst et al. 2013a) as is the case of the SOC in this study, obtaining predictions with a resolution level of pixel size.
A Gaussian Process (GP) defines a distribution over functions. In other words, it generates a finite set of random variables, of which there is a joint Gaussian distribution; GPR is a non-linear regression method that uses non-parametric Bayesian modelling that considers the variance of the data set and a maximization of the probability margin in the training set using a scaled anisotropic Gaussian kernel function (see Table 3) (Pérez-Planells et al. 2015). GPR allows identifying the important characteristics of the input variables (Rasmussen and Williams 2006) and evaluating the relative contribution highlighting the most relevant bands or parameters to the prediction model. In this way, it is possible to find the input variables linked to SOC sequestration (SOC predictors) in order of importance for their contribution to the optimization of the resulting model.
Method parameters \(v, {\upsigma }_{b}\mathrm{ and }{\propto }_{i}\) are automatically optimized using marginal likelihood on the training set (Rasmussen and Williams 2006; Camps-Valls et al. 2009; Van Wittenberghe et al. 2014). The \({\propto }_{i}\) values indicate the relevance of each input band xi. The mean SOC prediction comes from the weighted average of the values of the SOC parameters of interest related to the training samples closest to the analysis sample. The inverse of \({\upsigma }_{b}\) indicates the relevance of each \(b\) band, i.e., low values of this parameter indicate that \(b\) band provides more information to the training function \(K\) (Verrelst et al. 2013b).
GPR model training, optimization and calibration process
After testing all input variables as SOC predictors (see Fig. 3) with eight MLRA models, the best model and input variables (b) are selected and used to calibrate the GPR final model. The procedural diagram followed for training of the SOC prediction model is observed in Fig. 4, the model is trained as well as used to predict the SOC content in the herbaceous páramo ecosystem using ARTMO 3.29 software. The SOC database used contains SOC information in two profiles, 493 SOC sampling points corresponding to profile 0–30 cm below ground, and 464 points from the profile 30–60 cm below ground, in % and Mg/ha units.
The training process uses 70% of in situ SOC samples database and 30% of data are reserved for model validation. When the R2 reaches a maximum value and also the RMSE is reduced, then input variables are defined as predictors. The \({\upsigma }_{b}\) value expresses the relationship for each of the input variables. A SOC predictor model is obtained for each profile or horizon (0–30 cm and 30–60 cm) and used for the prediction process (retrieval), as illustrated in Fig. 4. The spatial structure of S2-images is used to generate bands for all SOC predictors obtained in the GPR process with the same spatial resolution (10 m). Its file is used to predict SOC for each pixel in the 0–30 cm profile and 30–60 cm profile.
Spectral band relevance is obtained for each input band \({\upsigma }_{b}\) is observed to obtain a good model calibration. In order of comparing the importance variables for each profile (0–30 cm and 30–60 cm) in both units (SOC% and SOC Mg/ha), polar plots were obtained analyzing b. In the polar plots are possible obtain a positive representation of the spectral relevance with a conversion of lower values of \({\upsigma }_{b}\) into higher ones and calculating the relative \({\upsigma }_{b}\) (Peppo et al. 2021). In this study, the polar plots used also introduce a function of scaling in order to detect relevance differences between bands, this is detailed in expression (7) (Peppo et al. 2021) and for the graphical representation the expression (8) is used.
Results
Comparing the SOC sample data points of both depths, we note a decrease of SOC storage in the 30–60 cm profile (see Fig. 5). The SOC prediction models for 0–30 cm and 30–60 cm profiles were done, using the training data base with 493 SOC sampling data points from 0–30 cm profile and 494 SOC data points from 30–60 cm profile (in weight % and in Mg/ha). The importance level of meteorological variables, soil (topographic, geological-taxonomic) and S2 variables (10 bands, 12 indices and 5 biophysical variables (FAPAR, LAI, FVC, Cab, CWC)) for SOC prediction were founded using various regression methods in ARTMO-MLRA. With 70% of the data for training and 30% for validation, the goodness-of-fit statistics were shown in Table 4, observing that GPR has higher precision with a higher R2 between 3–7% with respect to the second most relevant method. Therefore, the GPR method was identified as the most suitable method for SOC prediction in the study zone.
Our results identified the most relevant variables for the prediction model in the 0–30 cm profile, being the variables Elevation, Soil Taxonomy, Geological Unit, LS Factor, Orientation, Precipitation, Average Annual Temperature, NBR2 Index, FVC, CWC, B5 the most significant in order to relevance. In the 0–60 cm profile they were: Elevation, Soil Taxonomy, Geological Unit, LS Factor, Orientation, Precipitation, Average Annual Temperature, VARIg Index, SeLI Index, LCC/Cab, CWC. Figure 6 shows the polar plots where the importance of the variables is shown in a comparative form through sigma value band (σb), based on their contribution within each SOC prediction model. From these results it is clear that Geological Unit, Soil Taxonomy, Precipitation and Elevation located near the edge of the polar plots were the most important variables for both profiles and in both units. Note that eight variables (i.e., Elevation, Soil Taxonomy, Geological Unit, LS Factor, Orientation, Precipitation, Average Annual Temperature, and CWC) define the SOC model in both soil profiles. Figure 7 shows the validation results of the models, with the distribution of the estimated SOC points vs. the measured ones being around the 1:1-line.
After calibrating the models with the most relevant variables, we obtained a high precision in SOC prediction, each model based on 11 variables or predictors. As a result, in the 0–30 cm profile, the SOC% prediction model has an R2 of 0.85, RMSE of 1.58% and the SOC model in Mg/ha reaches an R2 of 0.79, RMSE of 24.7 Mg/ha. The SOC model (in weight %) of 30–60 cm profile achieved an R2 of 0.86, RMSE of 1.24%, and the SOC model in Mg/ha reached an R2 of 0.79, RMSE of 20.83 Mg/ha.
Through the calibrated models, the prediction of the SOC values and the digital mapping were performed in both profiles, the results are shown in Fig. 8. It is found that, for the 0–30 cm profile, the values range between 3.8–16.4 SOC % and between 65.8–220.6 SOC Mg/ha. Regarding the 30–60 cm profile, the stored SOC is between 2.7–13 SOC % and between 32.5–178.2 SOC Mg/ha. The 30–60 cm profile shows lower SOC values compared to the upper profile, but, according to (Lefévre et al. 2017) and (Rumpel et al. 2012) the SOC in deep soil horizons would be more stable. SOC results in both profiles are added in order to obtain an accumulative SOC storage in the 0–60 cm Horizon, resulting in SOC values between 107.2 – 377.5 Mg/ha (see Fig. 8e). Also it is important to note that, in the zoom area (Fig. 8e) S2 predictors are able to detect local changes due to the spatial resolution of S2.
Due to the high importance of soil taxonomy variable the for SOC prediction, the results of SOC for all soil taxonomy type existing in the study zone (i.e., Andosols, Entisols, Histosols and Mollisols) were obtained and compared; we can detect variations over SOC distribution on the different soil types (see Fig. 9) (additional data are given in Online Resource 1). Andosols predominate in the study area with a share of 89%. Characterized by presenting a melanic epipedon, they are dark soils where SOM forms a stable complex with materials non-crystalline inorganic substances. Hence the decomposition of SOM is delayed, allowing the accumulation of SOC (Buytaert et al. 2005), which is observed in the results with high carbon concentrations between 190 and 200 Mg C/ha (see Fig. 9). Histosols represent a lower percentage of the study area (1.7%), and may have lower or higher carbon concentrations depending on the folic and histic epipedon values (Hribljan et al. 2016). Mollisols are soils with slightly acidic pH originating a mollic epipedon (Moreno et al. 2016), these conditions increase the mineralization of organic matter, predominantly values around 150 Mg C/ha. They cover approximately a 7.4% of the study area and are located in areas that have a higher temperature regime and an ustic moisture regime, their parent material is influenced by the Paute geological formation, which has the presence of basaltic rocks that originate soils. In the case of Entisols present in 1.9% of the study area, these soils show the lowest SOC concentrations of the results; located on higher slopes, they are young and in some cases they are subjected to erosion processes. Figure 9 shows high SOC values within the group of Entisols, which may be related to inclusions of Andosols since these landscapes have a complex spatial distribution. The results of SOC storage predicted seem to be in concordance with the soil type but it is important to note the existence of some SOC values outside the usual SOC values, especially in Entisols. We speculate that this might be due to problems of delimitation of the types of soils of the existing national cartography, where in the soil taxonomy map, the soils in analysis share limits, likewise in the cartography pedological units are allowed with up to 15% inclusions of other soils (SIGTIERRAS 2012). Figure 9 shows the complexity of the retrieval, even so, our SOC models are able to find almost gaussian-like distributions of the parameter for some soil types due to the appropriately sampling strategy.
Discussion
Earlier studies of SOC estimation for different study areas using S2 and other spaceborne platforms report a R2 below 0.7 (Nocita et al. 2013; Mirzaee et al. 2016; Steinberg et al. 2016; Vaudour et al. 2019). The comparative performance results of the regression methods analysed (see Table 4)—using the same variables to predict SOC, and using the same SOC database—show that GPR obtained better results compared to the other models, and this improvement corresponds only to the model. A previous study using Random Forest Regression for the same study area using Landsat 8 (OLI and TIRS sensors) obtained 0-30 cm profile obtained a R2 of 0.82 and RMSE of 1.72 (for SOC%) and a R2 of 0.77 and RMSE of 25.8 (for SOC in Mg/ha) (Ayala Izurieta et al. 2021); comparing RFTB with GPR in this study we obtain that GPR accuracy is better that RFTB in a 15% for SOC% and 26% for SOC Mg/ha in the 0–30 cm profile, and for the 30–60 cm profile, GPR model is better that RFTB in a 17% and 9% corresponding to SOC% and SOC in Mg/ha respectively. This study obtained similar results of SOC predicted in 0–30 profile, but with different spectral variables using S2 images, therefore we obtained an increase of the spatial resolution and also new SOC indicators for the model of SOC prediction were found (i.e., FVC, CWC, LCC/Cab and the SeLI index). It shows the importance of spectral indices into the models. The results indicate that topography has a higher influence on SOC at finer spatial scales (Adhikari et al. 2020), which is indicated by the high importance obtained with the elevation, orientation and LS Factor as SOC predictors. Soil formation conditions on páramo depend on three main factors which are climate, bedrock and the age of the soils (Mena et al. 2000). In general, the páramo soils are vast SOC reservoirs where the capacity of SOC storage into the soils depend on soil types and their morphologic, physical and chemical parameters. These parameters are approached with Geological Unit, Soil Taxonomy, Precipitation and Average Annual Temperature predictors (see Fig. 6), also the SOC mapping results revealed that the SOC distribution is strongly guided by Geological Unit and Soil Taxonomy predictors (see Fig. 8).
Our results are according to (Minaya et al. 2016), due to the higher SOC stocks are found in the low and mid catchments, which could be associated to higher content of soil silt fraction. Our SOC predicted values have variations comparing profile 0-30 cm and profile 30-60 cm. Thompson et al. (2021) found changes over bulk density occur with increase of soil depth, and also SOC changing in function of land use. The land use change is related with a change of soil properties to SOC stock and stabilization on Andean ecosystems of Ecuador (Tonneijck et al. 2010). Therefore, a possible linking of SOC stocks and the change in soil texture would be interesting to hold study, in order to detect the change patron to different soil profiles.
S2 variables also contributed to the improved prediction, such as the biophysical variables FVC, CWC and LCC/Cab, from reflectivity of S2-band 5 (705 nm) and SeLI index, considering them as new SOC predictors. These variables used at a resolution of 10 m, give more resolution to mapping results for zones with sudden alterations in SOC storage produced by mainly anthropic actions or sudden environmental changes, having obtained a fine scale on SOC mapping results. FVC variable results from normalized difference vegetation index (NDVI) values of highly dense vegetation (NDVIv) and bare soil (NDVIs) (Montandon and Small 2008; Jiapaer et al. 2011), this variable provides more information to the model than NDVI alone, which had no impact on the SOC prediction model. FVC can describe the quality of herbaceous páramo vegetation and its possible changes, also considering that anthropic intervention over the ecosystem affects the endemic vegetation. The páramo soils and its vegetation have high capacity to retain water (Podwojewski and Poulenard 2004), this seems to be proven with the CWC (Verrelst et al. 2020) as the biophysical variable that shows the highest importance to the SOC prediction models, giving more sensitivity mainly to the types of vegetation based on the soils conditions since, taking account the perennial vegetation (MAE 2013) owns of páramo ecosystem. LCC or Cab give important information for the results in the profile 30–60 cm, and these results are in line with studies that found potential for this variable to model the carbon cycle and the chemical process into the plants (Osborne and Raven 1986; Carter 1994; Qian and Liu 2020). An advantage of the MSI from Sentinel 2 is the location of Band 5 (705 nm) corresponding to the Near Infrared (NIR) edge (ESA 2015). Here, the reflectance produced by the vegetation varies by an effect to the leaf chlorophyll and leaf structure, the change is more subtle in the case of dry vegetation or soil, allowing a good discrimination for changes in land cover and vegetation cover. In the same way, the SeLI index exploits the red-edge region for low-saturating absorption sensitivity to photosynthetic vegetation (Pasqualotto et al. 2019) providing predictive power to the SOC model for the 30–60 cm profile and for the accumulative 0–60 cm profile.
Conclusions
SOC storage in both soil profiles (0–30 cm and 30–60) were estimated using GPR models with an accuracy of 85% for the 0–30 cm profile and 86% for the 30–60 cm profile, based on commonly used and novel SOC predictors. With the inclusion of five new S2-derived SOC predictors, i.e., FVC, CWC, LCC/Cab, S2 band 5 (705 nm) and the SeLI index, we were able to improve the estimation results and increase its spatial resolution. Products with high spatial resolution like S2, favor the SOC mapping resolution. Apart from classic SOC predictors (Geological Unit, Soil Taxonomy, Precipitation and Average Annual Temperature), CWC emerged as the most relevant biophysical variable. Therefore, SOC reserves are correlated with the aboveground biophysical characteristics and changes over it.
The retrieval works adequately, the results evidenced at high detail the high SOC storage in the vast and poorly accessible study area. The methodology presented here could be used and applied to obtain the digital SOC mapping over other areas of herbaceous páramo ecosystem and also over ecosystems with similar conditions and characteristics.
Between all MLRA, GPR stood out to predict SOC in complex areas, even outperforming methods such as RFR, KRR, NN, etc. A high precision was achieved, with the identification of the important characteristics to the SOC prediction model. Therefore, it was possible to find 14 SOC predictors linked to SOC sequestration in order of importance for their contribution to the optimization of the resulting model. In the 0–30 cm profile the order of relevance was: Elevation > Soil Taxonomy > Geological Unit > LS Factor > Orientation > Precipitation > Average Annual Temperature > NBR2 Index > FVC > CWC > B5. In the 0–60 cm profile: Elevation > Soil Taxonomy > Geological Unit > LS Factor > Orientation > Precipitation > Average Annual Temperature > VARIg Index > SeLI Index > LCC/Cab > CWC.
Data Availability
The datasets generated during and/or analysed during the current study are available as supplementary material.
Abbreviations
- SOC:
-
Soil organic carbon
- SOM:
-
Soil organic matter
- OM:
-
Organic matter
- S2:
-
Sentinel-2
- MSI:
-
Multi-spectral instrument
- ESA:
-
European Space Agency
- EUMETSAT:
-
European Organization for the Exploitation of Meteorological Satellites
- FAPAR:
-
Fraction of Absorbed Photosynthetically Active Radiation
- LAI:
-
Leaf Area Index
- FVC:
-
Fractional Vegetation Cover
- LCC/Cab :
-
Leaf Chlorophyll Content
- CWC:
-
Canopy Water Content
- MLRA:
-
Machine Learning Regression Algorithms
- RFFE:
-
Random Forest Fitensemble
- RFTB:
-
Random Forest TreeBagger
- GBT:
-
Gradient Boosting/Boosted Trees
- KNNR:
-
K-nearest neighbors regression
- NN:
-
Neural Network
- KRR:
-
Kernel ridge Regression
- KSNR:
-
Kernel signal to noise ratio
- VH-GPR:
-
Kernel-based method Variational Heteroscedastic Gaussian Processes Regression
- GP:
-
Gaussian Process
- GPR:
-
Gaussian Processes Regression
- GloVis:
-
Global Visualization Viewer
- DEM:
-
Digital Elevation Model
- TOA:
-
Top Of Atmosphere
- BOA:
-
Bottom Of Atmosphere
- NDMI:
-
Normalized Difference Moisture Index
- SAVI:
-
Soil-Adjusted Vegetation Index
- WDRVI:
-
Wide Dynamic Range Vegetation Index
- EVI2:
-
Enhanced Vegetation Index 2
- NDWI:
-
Normalized Difference Water Index
- VARIg:
-
Visible Atmospherically Resistant Vegetation Index green
- NDSI:
-
Normalized Difference Snow Index
- BI:
-
Bare Soil Index
- NDMI:
-
Normalized Difference Moisture Index
- NBR:
-
Normalized Burn Ratio
- NBR2:
-
Normalized Burn Ratio 2
- SeLI:
-
LAIgreen Index
- ARTMO:
-
Automated Radiative Transfer Model Operator
References
Abebe G, Tsunekawa A, Haregeweyn N et al (2020) Effects of Land Use and Topographic Position on Soil Organic Carbon and Total Nitrogen Stocks in Different Agro-Ecosystems of the Upper Blue Nile Basin. Sustainability 12:2425. https://doi.org/10.3390/su12062425
Adhikari K, Mishra U, Owens PR et al (2020) Importance and strength of environmental controllers of soil organic carbon changes with scale. Geoderma 375:114472. https://doi.org/10.1016/j.geoderma.2020.114472
Al-Shammary AAG, Kouzani AZ, Kaynak A et al (2018) Soil Bulk Density Estimation Methods: A Review. Pedosphere 28:581–596. https://doi.org/10.1016/S1002-0160(18)60034-7
Angelopoulou T, Balafoutis A, Zalidis G, Bochtis D (2020) From laboratory to proximal sensing spectroscopy for soil organic carbon estimation-A review. Sustain 12:1–24. https://doi.org/10.3390/su12020443
Angelopoulou T, Tziolas N, Balafoutis A et al (2019) Remote sensing techniques for soil organic carbon estimation: A review. Remote Sens 11:676. https://doi.org/10.3390/rs11060676
Ayala Izurieta JE, Márquez CO, García VJ et al (2021) Multi-predictor mapping of soil organic carbon in the alpine tundra: a case study for the central Ecuadorian páramo. Carbon Balance Manag 16:32. https://doi.org/10.1186/s13021-021-00195-2
Ayala JE, Márquez CO, García VJ, et al (2017) Land cover classification in an ecuadorian mountain geosystem using a random forest classifier, spectral vegetation indices, and ancillary geographic data. Geosci 7. https://doi.org/10.3390/geosciences7020034
Bateni C, Ventura M, Tonon G, Pisanelli A (2021) Soil carbon stock in olive groves agroforestry systems under different management and soil characteristics. Agrofor Syst 95:951–961. https://doi.org/10.1007/S10457-019-00367-7
Bongiorno G, Bünemann EK, Oguejiofor CU et al (2019) Sensitivity of labile carbon fractions to tillage and organic matter management and their potential as comprehensive soil quality indicators across pedoclimatic conditions in Europe. Ecol Indic 99:38–50. https://doi.org/10.1016/j.ecolind.2018.12.008
Breiman L (2001) Random Forests. Mach Learn 45:5–32. https://doi.org/10.1023/A:1010933404324
Breiman L (1996) Bagging Predictors. Mach Learn 24:123–140. https://doi.org/10.1007/BF00058655
Buytaert W, Sevink J, De Leeuw B, Deckers J (2005) Clay mineralogy of the soils in the south Ecuadorian páramo region. Geoderma 127:114–129. https://doi.org/10.1016/J.GEODERMA.2004.11.021
Cammarano D, Fitzgerald GJ, Casa R, Basso B (2014) Assessing the Robustness of Vegetation Indices to Estimate Wheat N in Mediterranean Environments. Remote Sens 6:2827–2844. https://doi.org/10.3390/rs6042827
Camps-Valls G, Gómez-Chova L, Muñoz-Marí J, et al (2009) Biophysical parameter estimation with adaptive gaussian processes Image Processing Laboratory ( IPL ). In: 2009 IEEE International Geoscience and Remote Sensing Symposium. pp IV-69-IV–72
Carter GA (1994) Ratios of leaf reflectances in narrow wavebands as indicators of plant stress. Int J Remote Sens 15:697–703. https://doi.org/10.1080/01431169408954109
Castaldi F, Hueni A, Chabrillat S et al (2019) Evaluating the capability of the Sentinel 2 data for soil organic carbon prediction in croplands. ISPRS J Photogramm Remote Sens 147:267–282. https://doi.org/10.1016/j.isprsjprs.2018.11.026
Chen JM, Black TA (1992) Defining leaf area index for non-flat leaves. Plant Cell Environ 15:421–429. https://doi.org/10.1111/j.1365-3040.1992.tb00992.x
Chen JM, Pavlic G, Brown L et al (2002) Derivation and validation of Canada-wide coarse-resolution leaf area index maps using high-resolution satellite imagery and ground measurements. Remote Sens Environ 80:165–184. https://doi.org/10.1016/S0034-4257(01)00300-5
Chen S, Liu L, He X et al (2020) Upscaling from Instantaneous to Daily Fraction of Absorbed Photosynthetically Active Radiation (FAPAR) for Satellite Products. Remote Sens 12:2083. https://doi.org/10.3390/rs12132083
Chen W, Liu L, Zhang C et al (2004) Monitoring the seasonal bare soil areas in Beijing using multi-temporal TM images. 1–5. https://doi.org/10.1109/IGARSS.2004.1370429
Corbeels M, Cardinael R, Naudin K et al (2019) The 4 per 1000 goal and soil carbon storage under agroforestry and conservation agriculture systems in sub-Saharan Africa. Soil Tillage Res 188:16–26. https://doi.org/10.1016/j.still.2018.02.015
Crowther TW, Todd-Brown KEO, Rowe CW et al (2016) Quantifying global soil carbon losses in response to warming. Nature 540:104–108. https://doi.org/10.1038/nature20150
da Cunha ER, Bacani VM, Panachuki E (2017) Modeling soil erosion using RUSLE and GIS in a watershed occupied by rural settlement in the Brazilian Cerrado. Nat Hazards 85:851–868. https://doi.org/10.1007/s11069-016-2607-3
Desmet PJJ, Govers G (1996) A GIS procedure for automatically calculating the USLE LS factor on topographically complex landscape units. J Soil Water Conserv 51:427–433
ESA (2015) Sentinel-2 User Handbook. User Guide. Eupean Space Agency (ESA) Estándar Document. Issue 1, rev. 2. Date 24/07/2015. https://sentinels.copernicus.eu/web/sentinel/user-guides/document-library/-/asset_publisher/xlslt4309D5h/content/sentinel-2-user-handbook
Estévez J, Vicent J, Rivera-caicedo JP, Morcillo-pallarés P (2020) Gaussian processes retrieval of LAI from Sentinel-2 top-of-atmosphere radiance data. ISPRS J Photogramm Remote Sens 167:289–304. https://doi.org/10.1016/j.isprsjprs.2020.07.004
European Union (2015) Copernicus. Belgium. https://www.eumetsat.int/copernicus
FAO, ITPS (2015) Status of the world’s soil resources. Rome. Main Report. Food and Agriculture Organization of the United Nations and Intergovernmental Technical Panel on Soils. Rome, FAO, p 650
Friedman JH (2001) Greedy function approximation: A gradient boosting machine. Ann Stat 29:1189–1232. https://doi.org/10.1214/aos/1013203451
Gitelson A (2004) Wide dynamic range vegetation index for remote quantification of biophysical characteristics of vegetation. J Plant Physiol 161:165–173. https://doi.org/10.1078/0176-1617-01176
Gitelson AA, Kaufman YJ, Stark R, Rundquist D (2002) Novel algorithms for remote estimation of vegetation fraction. Remote Sens Environ 80:76–87. https://doi.org/10.1016/S0034-4257(01)00289-9
Grace PR, Post WM, Hennessy K (2006) The potential impact of climate change on Australia’s soil organic carbon resources. Carbon Balance Manag 1:14. https://doi.org/10.1186/1750-0680-1-14
Guio Blanco CM, Brito Gomez VM, Crespo P, Ließ M (2018) Spatial prediction of soil water retention in a Páramo landscape: Methodological insight into machine learning using random forest. Geoderma 316:100–114. https://doi.org/10.1016/j.geoderma.2017.12.002
Haykin S (1999) Neural networks: a comprehensive foundation. Prentice Hall, Upper Saddle River
Hislop S, Jones S, Soto-berelov M et al (2018) Using Landsat Spectral Indices in Time-Series to Assess Wildfire Disturbance and Recovery. Remote Sens 10:460. https://doi.org/10.3390/rs10030460
Hofstede R (2004) Health state of Páramos: an effort to correlate science and practice. Lyonia 6(1):62–73. https://www.lyonia.org/downloadPDF-2.266.pdf?pdfID=2.266
Hofstede R, Calles J, López V, et al (2014) Los páramos Andinos ¿Qué Sabemos? Estado de conocimiento sobre el impacto del cambio climático en el ecosistema páramo
Hribljan JA, Suárez E, Heckman KA et al (2016) Peatland carbon stocks and accumulation rates in the Ecuadorian páramo. Wetl Ecol Manag 24:113–127. https://doi.org/10.1007/s11273-016-9482-2
Hu Q, Yang J, Xu B et al (2020) Evaluation of Global Decametric-Resolution LAI, FAPAR and FVC Estimates Derived from Sentinel-2 Imagery. Remote Sens 12:912. https://doi.org/10.3390/rs12060912
Huete AR (1988) A Soil-Adjusted Vegetation Index. Remote Sens Environ 25:295-309 (1). https://doi.org/10.1016/0034-4257(88)90106-X
Jandl R, Rodeghiero M, Martinez C et al (2014) Current status, uncertainty and future needs in soil organic carbon monitoring. Sci Total Environ 468–469:376–383. https://doi.org/10.1016/j.scitotenv.2013.08.026
Jiang Z, Huete AR, Didan K, Miura T (2008) Development of a two-band enhanced vegetation index without a blue band. Remote Sens Environ 112:3833–3845. https://doi.org/10.1016/j.rse.2008.06.006
Jiapaer G, Chen X, Bao A (2011) A comparison of methods for estimating fractional vegetation cover in arid regions. Agric for Meteorol 151:1698–1710. https://doi.org/10.1016/j.agrformet.2011.07.004
Jobbágy EG, Jackson RB (2000) The vertical distribution of soil organic carbon and its relation to climate and vegetation. Ecol Appl 10:423–436. https://doi.org/10.1890/1051-0761(2000)010[0423:TVDOSO]2.0.CO;2
John K, Agyeman PC, Kebonye NM et al (2021) Hybridization of cokriging and gaussian process regression modelling techniques in mapping soil sulphur. CATENA 206:105534. https://doi.org/10.1016/J.CATENA.2021.105534
Jorgense P, Ulloa U (1994) Seed plants of the high Andes of Ecuador: a checklist. AAU Reports No. 34. https://www.cabi.org/ISC/abstract/19946797950
Keesstra SD, Bouma J, Wallinga J et al (2016) The significance of soils and soil science towards realization of the United Nations Sustainable Development Goals. SOIL 2:111–128. https://doi.org/10.5194/soil-2-111-2016
Key CH, Benson NC (2006) Landscape Assessment: Ground measure of severity, the Composite Burn Index; and Remote sensing of severity, the Normalized Burn Ratio. In: Lutes D., Keane R., Caratti J., et al. (eds) FIREMON: Fire Effects Monitoring and Inventory System, RMRS-GTR-1. USDA Forest Service, Rocky Mountain Research Station, Ogden, UT, p LA 1-LA 51
Khanifar J, Khademalrasoul A (2020) Multiscale comparison of LS factor calculation methods based on different flow direction algorithms in Susa Ancient landscape. Acta Geophys 68:783–793. https://doi.org/10.1007/s11600-020-00432-1
Krishnan P, Alexander JD, Butler BJ, Hummel JW (1980) Reflectance Technique for Predicting Soil Organic Matter. Soil Sci Soc Am J 44:1282–1285. https://doi.org/10.2136/sssaj1980.03615995004400060030x
Ladoni M, Bahrami HA, Alavipanah SK, Norouzi AA (2010) Estimating soil organic carbon from soil reflectance: a review. Precis Agric 11:82–99. https://doi.org/10.1007/s11119-009-9123-3
Lal R (2005) Forest soils and carbon sequestration. For Ecol Manage 220:242–258. https://doi.org/10.1016/j.foreco.2005.08.015
Lal R (2021) Soil management for carbon sequestration. South African J Plant Soil 38:231–237. https://doi.org/10.1080/02571862.2021.1891474
Lee J, Hopmans JW, Rolston DE et al (2009) Determining soil carbon stock changes: Simple bulk density corrections fail. Agric Ecosyst Environ 134:251–256. https://doi.org/10.1016/j.agee.2009.07.006
Lefévre C, Rekik F, V A, Wiese L (2017) Soil Organic Carbon: the Hidden Potential. Rome. Italy. https://reliefweb.int/report/world/soil-organic-carbon-hidden-potential
Lu S, Liu B, Hu Y et al (2020) Soil erosion topographic factor (LS): Accuracy calculated from different data sources. CATENA 187:104334. https://doi.org/10.1016/j.catena.2019.104334
MAE (2013) Sistema de Clasificación de Ecosistemas del Ecuador Continental. Quito. http://app.sni.gob.ec/sni-link/sni/PDOT/NIVEL%20NACIONAL/MAE/ECOSISTEMAS/DOCUMENTOS/Sistema.pdf
Martin D, Lal T, Sachdev CB, Sharma JP (2010) Soil organic carbon storage changes with climate change, landform and land use conditions in Garhwal hills of the Indian Himalayan mountains. Agric Ecosyst Environ 138:64–73. https://doi.org/10.1016/J.AGEE.2010.04.001
McFeeters Sk (1996) The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features. Int J Remote Sens 17:1425–1432. https://doi.org/10.1080/01431169608948714
Meinshausen M, Meinshausen N, Hare W et al (2009) Greenhouse-gas emission targets for limiting global warming to 2°C. Nature 458:1158–1162. https://doi.org/10.1038/nature08017
Mena P, Josse C, Medina G (2000) Los suelos del Páramo. Serie Páramo 5. GTP/Abya Yala. Quito
Mena Vásconez P, Castillo A, Flores S, et al (2011) Páramo. Paisaje estudiado, habitado, manejado e institucionalizado
Minaya V, Corzo G, Romero-Saltos H et al (2016) Altitudinal analysis of carbon stocks in the Antisana páramo, Ecuadorian Andes. J Plant Ecol 9:553–563. https://doi.org/10.1093/jpe/rtv073
Ministerio de Ambiente del Ecuador (2012) Sistema de clasificación de los ecosistemas del Ecuador continental. Subsecretaría de Patrimonio Natural (a). Quito
Mirchooli F, Kiani-Harchegani M, Khaledi Darvishan A et al (2020) Spatial distribution dependency of soil organic carbon content to important environmental variables. Ecol Indic 116:106473. https://doi.org/10.1016/j.ecolind.2020.106473
Mirzaee S, Ghorbani-Dashtaki S, Mohammadi J et al (2016) Spatial variability of soil organic matter using remote sensing data. CATENA 145:118–127. https://doi.org/10.1016/j.catena.2016.05.023
Mishra U, Lal R, Slater B et al (2009) Predicting Soil Organic Carbon Stock Using Profile Depth Distribution Functions and Ordinary Kriging. Soil Sci Soc Am J 73:614–621. https://doi.org/10.2136/sssaj2007.0410
Montandon LM, Small EE (2008) The impact of soil reflectance on the quantification of the green vegetation fraction from NDVI. Remote Sens Environ 112:1835–1845. https://doi.org/10.1016/j.rse.2007.09.007
Moreno J, Yerovi F, Herrera M, et al (2018) The Soils of Ecuador. In: Espinosa J, Moreno J, Bernal G (eds) Soil from the Highlands, Primera. Springer US, pp 11–49. https://doi.org/10.1007/978-3-319-25319-0
Moreno J, Yerovi F, Herrera M et al (2016) The Soils of Ecuador. In: Espinosa J, Moreno J, Bernal G (eds) Soil fron the Highlands, Primera. Springer US, pp 11–49
Müller-Nedebock D, Chivenge P, Chaplot V (2016) Selective organic carbon losses from soils by sheet erosion and main controls. Earth Surf Process Landforms 41:1399–1408. https://doi.org/10.1002/esp.3916
Nocita M, Stevens A, Noon C, Van Wesemael B (2013) Prediction of soil organic carbon for different levels of soil moisture using Vis-NIR spectroscopy. Geoderma 199:37–42. https://doi.org/10.1016/j.geoderma.2012.07.020
Oliver MA, Webster R (2007) Kriging: a method of interpolation for geographical information systems. 101080/02693799008941549 4:313–332. https://doi.org/10.1080/02693799008941549
Olson KR, Al-Kaisi M, Lal R, Cihacek L (2016) Impact of soil erosion on soil organic carbon stocks. J Soil Water Conserv 71:61A-67A. https://doi.org/10.2489/jswc.71.3.61A
Olson KR, Al-Kaisi MM (2015) The importance of soil sampling depth for accurate account of soil organic carbon sequestration, storage, retention and loss. CATENA 125:33–37. https://doi.org/10.1016/J.CATENA.2014.10.004
Ontl TA, Schulte LA (2012) 2012-Ontl-and-Schulte-SOil carbon Storage.pdf. Nat Edication Knowl 3:35
Osborne BA, Raven JA (1986) Light absorption by plants and its implications for photosynthesis. Biol Rev 61:1–60. https://doi.org/10.1111/j.1469-185x.1986.tb00425.x
Ozsoy G, Aksoy E, Dirim MS, Tumsavas Z (2012) Determination of Soil Erosion Risk in the Mustafakemalpasa River Basin, Turkey, Using the Revised Universal Soil Loss Equation, Geographic Information System, and Remote Sensing. Environ Manage 50:679–694. https://doi.org/10.1007/s00267-012-9904-8
Panagos P, Borrelli P, Meusburger K (2015) A New European Slope Length and Steepness Factor (LS-Factor) for Modeling Soil Erosion by Water. Geosciences 5:117–126. https://doi.org/10.3390/geosciences5020117
Pasqualotto N, Delegido J, Van Wittenberghe S et al (2019) Multi-Crop Green LAI Estimation with a New Simple Sentinel-2 LAI Index (SeLI). Sensors 19:904. https://doi.org/10.3390/s19040904
Peppo M De, Taramelli A, Boschetti M et al (2021) Non-Parametric Statistical Approaches for Leaf Area Index Estimation from Sentinel-2 Data: A Multi-Crop Assessment. Remote Sens 13(2841 13):2841. https://doi.org/10.3390/RS13142841
Pérez-Planells L, Delegido J, Rivera-Caicedo J, Verrelst J (2015) Análisis de métodos de validación cruzada para la obtención robusta de parámetros biofísicos. Rev Teledetección 44:55–65. https://doi.org/10.4995/raet.2015.4153
Plummer S, Arino O, Simon M, Steffen W (2006) Establishing A Earth Observation Product Service For The Terrestrial Carbon Community: The Globcarbon Initiative. Mitig Adapt Strateg Glob Chang 11:97–111. https://doi.org/10.1007/s11027-006-1012-8
Podwojewski P, Poulenard J (2004) Paramos soils. Encycl Soil Sci 3068. CRC Press Taylor y Francis Group, US. 1239 – 1242
Pribyl DW (2010) A critical review of the conventional SOC to SOM conversion factor. Geoderma 156:75–83. https://doi.org/10.1016/J.GEODERMA.2010.02.003
Qian X, Liu L (2020) remote sensing Retrieving Crop Leaf Chlorophyll Content Using an Improved Look-Up-Table Approach by Combining Multiple Canopy Structures and Soil Backgrounds. Remote Sens 12:2139. https://doi.org/10.3390/rs12132139
Rasmussen CE (1996) Evaluation of Gaussian processes and other methods for non-linear regression. University of Toronto. http://mlg.eng.cam.ac.uk/pub/pdf/Ras96b.pdf
Rasmussen CE, Williams CKI (2006) Gaussian Processes for Machine Learning. The MIT Press, New York
Rastgou M, Bayat H, Mansoorizadeh M, Gregory AS (2021) Prediction of soil hydraulic properties by Gaussian process regression algorithm in arid and semiarid zones in Iran. Soil Tillage Res 210:104980. https://doi.org/10.1016/j.still.2021.104980
Rouse JW, Hass RH, Schell JA, Deering DW (1974) Monitoring vegetation systems in the great plains with ERTS. In: Third Earth Resources Technology Satellite (ERTS) symposium. NASA special publication. Washington, D.C., USA, p 309
Rumpel C, Chabbi A, Marschner B (2012) Carbon Storage and Sequestration in Subsoil Horizons: Knowledge, Gaps and Potentials BT - Recarbonization of the Biosphere: Ecosystems and the Global Carbon Cycle. In: Lal R, Lorenz K, Hüttl RF, et al. (eds). Springer Netherlands, Dordrecht, pp 445–464
Schmidt MWI, Torn MS, Abiven S et al (2011) Persistence of soil organic matter as an ecosystem property. Nature 478:49–56. https://doi.org/10.1038/NATURE10386
Shi P, Zhang Y, Zhang Y et al (2020) Land-use types and slope topography affect the soil labile carbon fractions in the Loess hilly-gully area of Shaanxi, China. Arch Agron Soil Sci 66:638–650. https://doi.org/10.1080/03650340.2019.1630824
SIGTIERRAS (2012) Memoria Técnica Cantón Guano Proyecto: “GENERACIÓN DE GEOINFORMACIÓN PARA LA GESTIÓN DEL TERRITORIO A NIVEL NACIONAL ESCALA 1: 25 000”
SNI (2011) INFOPLAN. In: Infraestruct. Ecuatoriana Datos Espac. http://sni.gob.ec/inicio. Accessed 9 July 2020
Song W, Mu X, Ruan G et al (2017) Estimating fractional vegetation cover and the vegetation index of bare soil and highly dense vegetation with a physically based method. Int J Appl Earth Obs Geoinf 58:168–176. https://doi.org/10.1016/j.jag.2017.01.015
Sørensen LK, Dalsgaard S (2005) Determination of Clay and Other Soil Properties by Near Infrared Spectroscopy. Soil Sci Soc Am J 69:159. https://doi.org/10.2136/sssaj2005.0159
Steinberg A, Chabrillat S, Stevens A et al (2016) Prediction of common surface soil properties based on Vis-NIR airborne and simulated EnMAP imaging spectroscopy data: Prediction accuracy and influence of spatial resolution. Remote Sens 8:613. https://doi.org/10.3390/rs8070613
Stokey EA, Stow DA, O’Leary JF (2016) Assessing postfire recovery of chamise chaparral using multi-temporal spectral vegetation index trajectories derived from Landsat imagery. Remote Sens Environ 183:53–64. https://doi.org/10.1016/j.rse.2016.05.018
Suykens JAK, Vandewalle J (1999) Least Squares Support Vector Machine Classifiers. Neural Process Lett 9:293–300. https://doi.org/10.1023/A:1018628609742
Takeuchi W, Yasuoka Y (2004) Development of normalized Vegetation, soil and water indices derived from satellite remote sensing data. J Japan Soc Photogramm Remote Sens 43:7–19. https://doi.org/10.4287/jsprs.43.6_7
Tan Q, Han W, Li X, Wang G (2020) Clarifying the response of soil organic carbon storage to increasing temperature through minimizing the precipitation effect. Geoderma 374:114398. https://doi.org/10.1016/j.geoderma.2020.114398
Taneja P, Vasava HK, Daggupati P, Biswas A (2021) Multi-algorithm comparison to predict soil organic matter and soil moisture content from cell phone images. Geoderma 385:114863. https://doi.org/10.1016/j.geoderma.2020.114863
Thompson JB, Zurita-Arthos L, Müller F et al (2021) Land use change in the Ecuadorian páramo: The impact of expanding agriculture on soil carbon storage. Arctic Antarct Alp Res 53:48–59. https://doi.org/10.1080/15230430.2021.1873055
Tonneijck FH, Jansen B, Nierop KGJ et al (2010) Towards understanding of carbon stocks and stabilization in volcanic ash soils in natural Andean ecosystems of northern Ecuador. Eur J Soil Sci 61:392–405. https://doi.org/10.1111/j.1365-2389.2010.01241.x
Tonon G, Monti B, Lenzi D et al (2010) Comparison of non-invasive indirect method and trenching to partitioning soil respiration in a hardwood temperate forest. In: EGU General Assembly Conference Abstracts. p 14300. https://ui.adsabs.harvard.edu/abs/2010EGUGA..1214300T/abstract
USDA, NRCS (2014) Claves para la Taxonomía de Suelos. https://www.nrcs.usda.gov/Internet/FSE_DOCUMENTS/nrcs142p2_051546.pdf
USGS U (2017) USGS GloVis. In: 2017. https://glovis.usgs.gov. Accessed 18 Nov 2020
Van Bemmelen JM (1890) Über die Bestimmung des Wassers, des Humus, des Schwefels, der in den colloïdalen Silikaten gebundenen Kieselsäure, des Mangans usw im Ackerboden
Van Wittenberghe S, Verrelst J, Rivera JP et al (2014) Gaussian processes retrieval of leaf parameters from a multi-species reflectance, absorbance and fluorescence dataset. J Photochem Photobiol B Biol 134:37–48. https://doi.org/10.1016/j.jphotobiol.2014.03.010
Vaudour E, Gomez C, Fouad Y, Lagacherie P (2019) Sentinel-2 image capacities to predict common topsoil properties of temperate and Mediterranean agroecosystems. Remote Sens Environ 223:21–33. https://doi.org/10.1016/j.rse.2019.01.006
Verrelst J, Alonso L, Rivera JP et al (2013a) Gaussian Process Retrieval of Chlorophyll Content From Imaging Spectroscopy Data. IEEE J Sel Top Appl EARTH Obs Remote Sens 6:867–874
Verrelst J, Malenovský Z, Van der Tol C, Camps-Valls G, Gastellu-Etchegorry JP, Lewis P, North P, Moreno J (2019) Quantifying vegetation biophysical variables from imaging spectroscopy data: A review on retrieval methods. Surv Geophys 40(3):589–629. https://doi.org/10.1007/s10712-018-9478-y
Verrelst J, Muñoz J, Alonso L et al (2012) Remote Sensing of Environment Machine learning regression algorithms for biophysical parameter retrieval : Opportunities for Sentinel-2 and -3. Remote Sens Environ 118:127–139. https://doi.org/10.1016/j.rse.2011.11.002
Verrelst J, Rivera J-P, Moreno J, Camps-valls G (2013b) Gaussian processes uncertainty estimates in experimental Sentinel-2 LAI and leaf chlorophyll content retrieval. ISPRS J Photogramm Remote Sens 86:157–167. https://doi.org/10.1016/j.isprsjprs.2013.09.012
Verrelst J, Rivera JP, Gitelson A et al (2020) Spectral band selection for vegetation properties retrieval using Gaussian processes regression. Int J Appl Earth Obs Geoinf 52:554–567. j.jag.2016.07.016
Verrelst J, Rivera JP, Veroustraete F et al (2015) Experimental Sentinel-2 LAI estimation using parametric, non-parametric and physical retrieval methods - A comparison. ISPRS J Photogramm Remote Sens 108:260–272. https://doi.org/10.1016/j.isprsjprs.2015.04.013
Wang R, He N, Li S et al (2021) Spatial variation and mechanisms of leaf water content in grassland plants at the biome scale: evidence from three comparative transects. Sci Rep 11:9281. https://doi.org/10.1038/s41598-021-88678-7
Wang S, Huang M, Shao X et al (2004) Vertical Distribution of Soil Organic Carbon in China. Environ Manage 33:S200–S209. https://doi.org/10.1007/s00267-003-9130-5
Weiss M, Baret F (2016) S2ToolBox Level 2 products : LAI, FAPAR, FCOVER. http://step.esa.int/docs/extra/ATBD_S2ToolBox_L2B_V1.1.pdf
Weiss M, Baret F, Myneni R et al (2000) Investigation of a model inversion technique to estimate canopy biophysical variables from spectral and directional reflectance data. Agronomie 20:3–22. https://doi.org/10.1051/agro:2000105
Wilson EH, Sader SA (2002) Detection of forest harvest type using multiple date of Landsat Tm imagery. Remote Sens Environ 80:385–396. https://doi.org/10.1016/S0034-4257(01)00318-2
Xie HT, Yang XM, Drury CF et al (2011) Predicting soil organic carbon and total nitrogen using mid- and near-infrared spectra for Brookston clay loam soil in Southwestern Ontario, Canada. Can J Soil Sci 91:53–63. https://doi.org/10.4141/CJSS10029
Xu X, Shi Z, Li D et al (2016) Geoderma Soil properties control decomposition of soil organic carbon: Results from data-assimilation analysis. Geoderma 262:235–242. https://doi.org/10.1016/j.geoderma.2015.08.038
Zahasky C, Krevor S (2020) Global geologic carbon storage requirements of climate change mitigation scenarios. Energy Environ Sci 13:1561–1567. https://doi.org/10.1039/d0ee00674b
Zhang K, Dang H, Zhang Q, Cheng X (2015) Soil carbon dynamics following land-use change varied with temperature and precipitation gradients: evidence from stable isotopes. Glob Chang Biol 21:2762–2772. https://doi.org/10.1111/gcb.12886
Zhang K, Liu X, Ma Y et al (2020a) A Comparative Assessment of Measures of Leaf Nitrogen in Rice Using Two Leaf-Clip Meters. Sensor 20:175. https://doi.org/10.3390/s20010175
Zhang Y, Yang J, Liu X et al (2020b) Estimation of Multi-Species Leaf Area Index Based on Chinese GF-1 Satellite Data Using Look-Up Table and Gaussian Process Regression Methods. Sensors 20:2460. https://doi.org/10.3390/s20092460
Zhou Y, Hartemink AE, Shi Z et al (2019) Land use and climate change effects on soil organic carbon in North and Northeast China. Sci Total Environ 647:1230–1238. https://doi.org/10.1016/j.scitotenv.2018.08.016
Acknowledgements
This work was supported by "Soil Organic Carbon Evaluation and Sequestration in Ecuadorian Páramo Ecosystems" (SOCEPE) project funding by the National University of Chimborazo, Ecuador, who provided the SOC dataset used on this study; and the Ministry of the Environment of Ecuador (MAE) who provide to the SOCEPE project the scientific research authorization Nº 017-2019-IC-FLO-FAU-DNB/MAE.
Funding
Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature. This work was supported by the European Research Council (ERC) under the ERC-2017-STG SENTIFLEX project (Grant Agreement 755617).
Author information
Authors and Affiliations
Contributions
Methodology, Formal analysis, Writing—Original Draft, Writing—Review & Editing: Johanna Elizabeth Ayala Izurieta; Methodology: Carlos Arturo Jara Santillán, Juan Pablo Rivera-Caicedo; Supervision: Carmen Omaira Márquez, Víctor Julio García; Supervición, Writing—Review & Editing; Shari Van Wittenberghe, Jesús Delegido; Writing—Review & Editing: Jochem Verrelst. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing Interests
The authors have no relevant financial or non-financial interests to disclose.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Responsible Editor: Zucong Cai.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Ayala Izurieta, J.E., Jara Santillán, C.A., Márquez, C.O. et al. Improving the remote estimation of soil organic carbon in complex ecosystems with Sentinel-2 and GIS using Gaussian processes regression. Plant Soil 479, 159–183 (2022). https://doi.org/10.1007/s11104-022-05506-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11104-022-05506-1