Elsevier

Remote Sensing of Environment

Volume 262, 1 September 2021, 112514
Remote Sensing of Environment

Geographically and temporally weighted neural network for winter wheat yield prediction

https://doi.org/10.1016/j.rse.2021.112514Get rights and content

Highlights

  • The existence of spatial and temporal non-stationarity was statistically tested.

  • A GTWNN model was developed for winter wheat yield prediction in US.

  • GTWNN outperformed three widely used models including GTWR, ANN and SVR.

  • Moran's I values showed GTWNN's ability to address spatiotemporal non-stationarity.

Abstract

Accurate prediction of crop yield is essential for agricultural trading, market risk management and food security. Although various statistical models and machine learning models have been developed to enhance prediction accuracy, spatial and temporal non-stationarity, an intrinsic attribute of many geographical processes, is still rarely considered in crop yield modeling. From a statistical point of view, this study respectively provided evidence for the existence of spatial non-stationarity and temporal non-stationarity in winter wheat yield prediction based on geographically weighted regression (GWR) and temporally weighted regression (TWR). Then, a geographically and temporally weighted neural network (GTWNN) model was proposed by integrating artificial neural network (ANN) into geographically and temporally weighted regression (GTWR) using publicly available data sources, including satellite imagery and climate data. For a more credible evaluation, the leave-one-year-out strategy was adopted to make out-of-sample prediction resulting in a total of 12 test years from 2008 to 2019. The experiment results showed that the proposed GTWNN outperformed ANN, GTWR and support vector regression (SVR) achieving the average coefficient of determination (R2) values of 0.766, 0.759 and 0.720 at the three prediction times of end of July, end of June and end of May. Moreover, an extended Moran's I was adopted to assess the degree of spatiotemporal autocorrelation of the prediction errors. The error aggregation of GTWNN was lower than other models, indicating that GTWNN is applicable to addressing spatial non-stationarity in modeling the relationship between predictors and yield response. The methodology proposed in this paper can be extended to handle spatiotemporal non-stationarity in other crop yield predictions and even other environmental phenomena.

Introduction

Wheat is one of the earliest cultivated crop in the history of agriculture (Bell, 1987), and it continues to be vital in providing calories and nutrients for human and livestock today (Shewry and Hey, 2015). In addition to diet and health, wheat is even associated with general economic and social conditions (Curtis and Halford, 2014). As a major wheat producer and explorer, the United States (US) accounted for 7% of the world's wheat production and 14% of the world's wheat exports in 2019 (USDA, 2020a). According to the growing season, wheat is primarily classified into winter wheat and spring wheat. Winter wheat, planted in fall and harvested in the next summer, represents around 70% of total wheat US production with 35 million metric tons of harvest from 24 million acres in 2019 (USDA, 2020b). Timely, accurate and spatially specific winter wheat yield estimation therefore is important for food security monitoring and marketing planning on the local, national, and international levels (Cai et al., 2019).

The availability of growth conditions during the growing season has made it possible for researchers to perform accurate yield estimates. The low cost and wide spatial coverage of satellite imagery has also made up for the lack of ground observations, so that the scale of yield estimation has been expanded from field to regional, national and even global scales. Several studies have combined satellite imagery and statistical approaches to perform crop yield estimation (Balaghi et al., 2008; Bolton and Friedl, 2013; Kern et al., 2018; Ma et al., 2021). For example, Maselli and Rembold (2001) built linear regression models with monthly normalized difference vegetation index (NDVI) derived from the Advanced Very High Resolution Radiometer (AVHRR) images, to forecast cereal crop yield in four North African Countries. Becker-Reshef et al. (2010) developed a winter wheat yield prediction approach with time series NDVI derived from the Moderate resolution Imaging Spectro-radiometer (MODIS) data, and the regression model achieved satisfactory performance with an error less than 10%. Similarly, NDVI derived from MODIS was also adopted to predict winter wheat yield in Shandong, China using a linear model (Ren et al., 2008). Despite moderate success, the models established in these studies are global regression models, which means that the relationship between yield and explanatory variables is assumed to be spatially constant. However, in fact the relationship can be highly variable over space especially in large areas, because it is always affected by natural environment such as topography and soil type, and human factors such as farming habits and management system.

The phenomenon that an explanatory variable has different effects on model output over space is known as spatial non-stationarity or spatial heterogeneity (Stewart Fotheringham et al., 1996). Non-stationarity has been recognized by many researchers and has been taken into account in different areas such as environmental analysis and epidemiological spreading (Brunton et al., 2017; Lin and Wen, 2011; Song et al., 2014; Zhai et al., 2018). Moreover, the assumption of non-stationarity has been demonstrated to be of great significance to the improvement of model accuracy in the research of agricultural yield estimation (Shen et al., 2018). Therefore, various local modeling approaches have been developed to deal with the spatial non-stationarity issue (Huang et al., 2013; Imran et al., 2013; Shiu and Chuang, 2019). A commonly used approach in large-scale yield estimation is partition modeling, which assumes that in each partition, the relationship between dependent and independent variables is spatially stationary, and does not follow a location-specific regression function. Therefore, in partition modeling, a number of global models are developed independently for the geographic partitions. For example, to achieve better modeling accuracy, Manjunath et al. (2002) developed a linear yield prediction model for each of the 15 wheat-growing regions in Rajasthan, India, by integrating spectral and meteorological data. Mishra et al. (2008) established multiple district-level linear regression models to make in season yield prediction of sorghum in Burkina Faso, a West African country. However, the separation of limited datasets brings a tradeoff between achieving generalization performance and modeling spatial non-stationarity. Another model designed to address the issue of spatial non-stationarity is geographically weighted regression (GWR), which has been widely applied in many fields, such as geography and ecology, and has been demonstrated to be effective for crop yield prediction in agricultural research. For instance, Imran et al. (2015) compared GWR with conditional autoregression, a global spatial regression, when simulating the yields of sorghum, millet and cotton in West Africa, and found that the average accuracy improvement of GWR in semiarid zone and subhumid zone was 0.1 and 0.3, respectively. Shiu and Chuang (2019) who compared GWR with least squares ordinary in paddy rice yield estimation also observed similar results. Despite success, GWR has limitations in constructing spatial weight matrix due to the heavy reliance on the predefined spatial kernel functions (e.g. Gaussian, Bi-square, Exponential functions, etc). Essentially, spatial kernel functions are predefined distance decay functions in which larger weights are assigned to closer observations based on the Tobler's First Law of Geography (Tobler, 1970), and they are the simplified and idealized expressions of spatial proximity, making it possible to quantify the spatial relationships in modeling process. However, these kernels are relatively simple in addressing complicated interactions inherent in geographical relationships (Du et al., 2020). Additionally, although the selection of optimal bandwidth requires data-driven approaches such as Akaike Information Criterion (AIC), the predefined kernels expressed in formulas may hinder the precise quantification of spatial relationships using the inherent information in the data (Hagenauer and Helbich, 2021).

In addition to space, time is also an important dimension that affects the relationship between predictors and response (Fotheringham et al., 2015). Extensive efforts have been made to incorporate temporal effects into spatial regression. Among these, geographically and temporally weighted regression (GTWR) is one of the successful attempts for modeling non-stationary spatiotemporal relationships. It was originally achieved by redefining the distances between observations in the construction of the weight matrix. For example, Huang et al. (2010) formed a spatiotemporal distance through the linear combination of spatial and temporal distance, which was then improved by adding the interaction term of time and space dimensions on the basis of the original distance formula (Wu et al., 2014). Instead of improving the expression of spatiotemporal distance, Fotheringham et al. (2015) realized the expansion of GWR to GTWR by constructing a spatiotemporal weighting function which was expressed by the product of a temporal kernel function with a unique temporal bandwidth, and a spatial kernel function with a set of segregated spatial bandwidths over time. The various structures of GTWR adapt the interpretability of GWR, that is, they can generate a set of coefficients of the independent variables corresponding to a specific spatiotemporal coordinates, and output bandwidth which can be interpreted as spatiotemporal effects. In agriculture, factors including soil fertility changes and farming technology innovations are all possible causes of temporal non-stationarity. However, although GTWR has been successfully applied in various fields (Guo et al., 2017; Huang et al., 2010), it has never been used for crop yield estimation.

In contrast to conventional statistical methods, machine learning allows the appliance of complex mathematical calculations to large dataset, and it has demonstrated its superior ability in numerous fields such as natural language processing and image recognition (Goldberg, 2016; Nadkarni et al., 2011; Svyrydov et al., 2018). Increasingly, machine learning has been used in agriculture, such as crop yield estimation. For instance, Matsumura et al. (2015) compared the performances between artificial neural network (ANN) and multiple linear regression in predicting maize yield in Jilin province, China, and the results showed that ANN significantly outperformed the linear one. Sakamoto (2020) built a random forest (RF) regression algorithm with time-series MODIS satellite data and environmental records for soybean and corn yield prediction in the US, and the approach accurately predicted the yields of corn and soybean with root mean square error (RMSE) of 0.539 t/ha 0.206 t/ha, respectively. In addition, several researchers also have tried to establish machine learning based models considering spatiotemporal non-stationarity. For example, Crane-Droesch (2018) developed a semiparametric ANN using weather, soil, geographic coordinates and year data to predict corn yield across multiple geographically contiguous states in the US. Shook et al. (2018) built a Long Short Term Memory based framework with weather and genotype variables for soybean yield estimation in North America, and the model achieved improved performance by incorporating GPS coordinates and predicting year. Similar ideas can be found from these studies that spatial and temporal information are added as independent and parallel factors with other variables into the machine learning method. However, in fact, each location in spatiotemporal dimension is characterized by a series of physical and human properties, and it is these properties, not the location, that directly affect the growth and yield of crops. Moreover, the machine learning model performance can potentially be further enhanced by incorporating prior knowledge of the crop.

In this study, we proposed a geographically and temporally weighted neural network (GTWNN) considering spatial and temporal non-stationarity for county-level winter wheat yield estimation in the US. Before modeling, we first performed statistical tests to provide evidence for the existence of spatial and temporal non-stationarity in winter wheat yield estimation based on GWR and temporally weighted regression (TWR) respectively. A GTWNN model was then developed with the smoothed sequential features extracted from satellite images and meteorological datasets from 2008 to 2019. Furthermore, the proposed model was compared with GTWR, ANN and support vector regression (SVR) for performance evaluation. Finally, the spatiotemporal patterns of the prediction errors derived from the four models were analyzed.

Section snippets

Study area and yield data

The modeling work was performed to estimate winter wheat yield in the US. County-level yield records over years 2008–2019 were collected from the United States Department of Agriculture (USDA) National Agricultural Statistics Service (NASS) database. Overall, the national annual yield of winter wheat exhibited an upward trend, increasing from 3.167 t/ha in 2008 to 3.605 t/ha in 2019 (Fig. 1b). However, there were also obvious fluctuations in the inter-annual variation of yield. Specifically,

OLS regression

Ordinary least squares (OLS) regression is a best known statistical method in identifying the relationships between the dependent variable yi and the independent variables xi1, xi2,⋯, xip. The OLS regression is expressed as:yi=β0+k=1pβkxik+εi,i=1,2,,nwhere β0 is the intercept term and β1, ⋯, βp are the regressive coefficients of the corresponding independent variables; n and p represent the size of the samples and the number of independent variables respectively; and ε denotes the error term

Important feature selection

The input features were determined based on the strategy described in Section 3.3.1, and the correlation between each selected feature and yield was also calculated using Pearson Correlation Coefficient (PCC). Table 1 provides the descriptions of the selected features. Three VIs including EVI, NDVI and GCI were selected in this study, and higher values indicate healthier growing status and lower levels of environmental stress. Therefore, all the three VIs were found to be positively correlated

Predictive factor analysis

To perform accurate winter wheat yield estimation, we selected seven time-series variables based on the strategy described in Section 3.3.1, including three VIs characterizing crop properties and four climate factors simulating the growth environment. In this section, the relationship between each variable and yield was further analyzed, and the difference of the variables between regions was also compared.

VIs are widely used to quantify the vegetation properties and indicate yield potential (

Conclusions

Accurate prediction of winter wheat yield is important for marketing, transportation and storage decisions, and helps manage the risks associated with them. In this study, we proposed a GTWNN model for county-level winter wheat yield prediction model. The developed model could address the spatiotemporal non-stationarity in identifying the relationship between predictors and the yield. Specifically, several sequential predictors were extracted from satellite imagery and climate data, and then

Funding

This work was supported by the USDA National Institute of Food and Agriculture, United States Department of Agriculture, Hatch project WIS03026; the University of Wisconsin–Madison, Office of the Vice Chancellor for Research and Graduate Education with funding from the Wisconsin Alumni Research Foundation; the China Scholarship Council (NO.201906270096); and the Wuhan University Graduates International Exchange Program (NO.201905).

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

The authors declare the following financial interests/personal relationships which may be considered as potential competing interests:

References (84)

  • T. Li et al.

    Geographically and temporally weighted neural networks for satellite-based mapping of ground-level PM2. 5

    ISPRS J. Photogramm. Remote Sens.

    (2020)
  • Y. Ma et al.

    Corn yield prediction and uncertainty analysis based on remotely sensed variables using a Bayesian neural network approach

    Remote Sens. Environ.

    (2021)
  • A. Mishra et al.

    Sorghum yield prediction from seasonal rainfall forecasts in Burkina Faso

    Agric. For. Meteorol.

    (2008)
  • M. Niazian et al.

    Modeling the seed yield of Ajowan (Trachyspermum ammi L.) using artificial neural network and multiple linear regression models

    Ind. Crop. Prod.

    (2018)
  • Y. Pan et al.

    Winter wheat area estimation from MODIS-EVI time series data using the crop proportion phenology index

    Remote Sens. Environ.

    (2012)
  • T. Pede et al.

    Improving corn yield prediction across the US Corn Belt by replacing air temperature with daily MODIS land surface temperature

    Agric. For. Meteorol.

    (2019)
  • J. Ren et al.

    Regional yield estimation for winter wheat with MODIS-NDVI data in Shandong, China

    Int. J. Appl. Earth Obs. Geoinf.

    (2008)
  • T. Sakamoto

    Incorporating environmental variables into a MODIS-based crop yield estimation method for United States corn and soybeans through the use of a random forest regression algorithm

    ISPRS J. Photogramm. Remote Sens.

    (2020)
  • R.A. Schwalbert et al.

    Satellite-based soybean yield forecast: integrating machine learning and weather data for improving crop yield prediction in southern Brazil

    Agric. For. Meteorol.

    (2020)
  • P.C. Sentelhas et al.

    Suitability of relative humidity as an estimator of leaf wetness duration

    Agric. For. Meteorol.

    (2008)
  • N.T. Son et al.

    A comparative analysis of multitemporal MODIS EVI and NDVI data for large-scale rice yield estimation

    Agric. For. Meteorol.

    (2014)
  • W. Song et al.

    A satellite-based geographically weighted regression model for regional PM2. 5 estimation over the Pearl River Delta region in China

    Remote Sens. Environ.

    (2014)
  • S.M. Vicente-Serrano et al.

    Diverse relationships between forest growth and the normalized difference vegetation index at a global scale

    Remote Sens. Environ.

    (2016)
  • G. Xiao et al.

    Impact of temperature increase on the yield of winter wheat at low and high altitudes in semiarid northwestern China

    Agric. Water Manag.

    (2010)
  • L. Zhai et al.

    An improved geographically weighted regression model for PM2. 5 concentration estimation in large areas

    Atmos. Environ.

    (2018)
  • B. Basso et al.

    Evapotranspiration in high-yielding maize and under increased vapor pressure deficit in the US Midwest

    Agric. Environ. Lett.

    (2018)
  • G.D.H. Bell

    The History of Wheat Cultivation, Wheat Breeding

    (1987)
  • J. Ben-Asher et al.

    Dew is a major factor affecting vegetation water use efficiency rather than a source of water in the eastern Mediterranean area

    Water Resour. Res.

    (2010)
  • C. Brunsdon et al.

    Geographically weighted regression

    J. Royal Statist. Soc.: Ser. D (Statist.)

    (1998)
  • L.A. Brunton et al.

    Using geographically weighted regression to explore the spatially heterogeneous spread of bovine tuberculosis in England and Wales

    Stoch. Env. Res. Risk A.

    (2017)
  • N.S. Chandel et al.

    Yield prediction in wheat (Triticum aestivum L.) using spectral reflectance indices

    Curr. Sci.

    (2019)
  • Climate Central

    It's Official: 2019 was the Nation's Second Wettest Year on Record

  • A. Crane-Droesch

    Machine learning methods for crop yield prediction and climate change impact assessment in agriculture

    Environ. Res. Lett.

    (2018)
  • T. Curtis et al.

    Food security: the challenge of increasing wheat yield and the importance of not compromising food safety

    Ann. Appl. Biol.

    (2014)
  • Z. Du et al.

    Geographically neural network weighted regression for the accurate estimation of spatial non-stationarity

    Int. J. Geogr. Inf. Sci.

    (2020)
  • A.S. Fotheringham et al.

    Geographically weighted regression: a natural evolution of the expansion method for spatial data analysis

    Environ. Plan. A

    (1998)
  • A.S. Fotheringham et al.

    Geographical and temporal weighted regression (GTWR)

    Geogr. Anal.

    (2015)
  • Y. Gao et al.

    Measuring spatio-temporal autocorrelation in time series data of collective human mobility

    Geo-spatial Inform. Sci.

    (2019)
  • R. Ghimire et al.

    Soil health response of cover crops in winter wheat–fallow system

    Agron. J.

    (2019)
  • Y. Goldberg

    A primer on neural network models for natural language processing

    J. Artif. Intell. Res.

    (2016)
  • Y. Gu et al.

    A five-year analysis of MODIS NDVI and NDWI for grassland drought assessment over the central Great Plains of the United States

    Geophys. Res. Lett.

    (2007)
  • J. Hagenauer et al.

    A geographically weighted artificial neural network

    Int. J. Geogr. Inf. Sci.

    (2021)
  • Cited by (0)

    1

    These two authors contributed equally to this work.

    2

    Corresponding author.

    View full text