Using machine learning with case studies to identify practices that reduce greenhouse gas emissions across Australian grain production regions

Meier, Elizabeth; Thorburn, Peter; Biggs, Jody; Palmer, Jeda; Dumbrell, Nikki; Kragt, Marit

doi:10.1007/s13593-023-00880-1

Research Article
Open access
Published: 20 March 2023

Volume 43, article number 29, (2023)

Agronomy for Sustainable Development

Elizabeth Meier¹,
Peter Thorburn¹,
Jody Biggs¹,
Jeda Palmer¹,
Nikki Dumbrell² &
…
Marit Kragt³

Abstract

It is difficult to identify farm management practices that consistently provide greenhouse gas (GHG) abatement at different locations because effectiveness of practices is greatly influenced by climates and soils. We address this knowledge gap by identifying practices that provide abatement in eight case studies located across diverse conditions in Australia’s grain-producing areas. The case studies focus on soil-based emissions of nitrous oxide (N₂O) and changes in soil organic carbon (SOC), simulated over 100 years for 15 cropping management scenarios. Average changes in the balance of GHG from both N₂O emissions and SOC sequestration (∆GHG balance) and gross margins compared to a high emissions baseline were determined over 25 and 100 simulated years. Because scenarios providing the greatest abatement varied across individual case studies, we aggregated the data over all case studies and analysed them with a random forest data mining approach to build models for predicting ∆GHG balance. Increased cropping intensity, achieved by including cover crops, additional grain crops, or crops with larger biomass in the rotation, was the leading predictor of ∆GHG balance across the scenarios and sites. Abatement from increased cropping intensity averaged 774 kg CO₂-e ha⁻¹ year⁻¹ (25 years) and 444 kg CO₂-e ha⁻¹ year⁻¹ (100 years) compared to the baseline, with reduced emissions from SOC sequestration offsetting increased N₂O emissions for both time frames. Increased cropping intensity decreased average gross margins, indicating that a carbon price would likely be needed to maximise GHG abatement from this management. To our knowledge, this is the first time that the random forest approach has been applied to assess management practice effectiveness for achieving GHG abatement over diverse environments. Doing so provided us with more general information about practices that provide GHG abatement than would have come from qualitative comparison of the variable results from the case studies.

1 Introduction

Warming of the global climate system has occurred in response to rising atmospheric concentrations of anthropogenic greenhouse gases (GHGs; IPCC 2014). Resulting climatic changes include an increased frequency of extreme weather events such as heat waves, droughts, floods, cyclones, and fires. These changes impact all aspects of our lives including agricultural systems and thus food security. A business-as-usual approach to global GHG emissions is projected to have “severe, widespread, and irreversible impacts” (IPCC 2014), and so management to reduce emissions is imperative. The agriculture, forestry, and other land use sectors generate about a quarter of anthropogenic GHG emissions (24%; Smith et al. 2014), and so have an important role to play in reducing emissions. The reductions from these sectors can be achieved by reducing the emissions from different management practices, as well as by removing atmospheric CO₂ through sequestering carbon (C) in soil and biomass.

Unfortunately, the effectiveness of different agricultural practices for reducing or abating GHG emissions can be variable (e.g., Lui et al., 2016; Du et al. 2017; Dumbrell et al. 2017; Feng et al. 2018; Huang et al. 2018; Meier et al. 2017), making it difficult for land managers and policy makers to identify widely applicable abatement actions. Practices that provide abatement in one location may not be effective in others (e.g., Lam et al. 2013; Powlson et al. 2016; Robertson et al. 2015; Sun et al. 2020; Trost et al. 2013). This can be caused by differences in local climate, through effects on biomass production and thus upon the amount of carbon returned to soil in residues (Meier et al. 2020; Robertson et al. 2015; Allen et al., 2013), or differences in local soil properties such as soil organic carbon (SOC), texture, and pH which affect processes of SOC sequestration and nitrous oxide (N₂O) emissions (Barton et al. 2016; Charles et al. 2017; Huang et al. 2018; Rosace et al. 2020; Trost et al. 2013). Another consideration is that some practices may have contrasting effects on different GHG emissions. A relevant example is that reducing CO₂ by increasing SOC can result in increased N₂O emissions (e.g., Bos et al. 2016; Gregorich et al. 2005; Kumara et al. 2020; Mei et al. 2018; Meier et al. 2020; Trost et al. 2013; Xia et al. 2018). Such interactions are especially important where GHG abatement comes from storing C in soil, because the rate of SOC increase declines over time as SOC stores saturate. So, while many studies have quantified GHG abatement potential achieved through a single mechanism (e.g., increasing SOC or decreasing N₂O emissions; Lui et al., 2016), there is little information on the net GHG abatement across multiple GHGs and a range of practices, each implemented at multiple sites with different soils and climates.

Gaining such information from experiments is a daunting endeavour because of the resources required to run appropriate field experiments over the time scales (e.g., 25 to 100 years) required to properly assess GHG abatement. In the face of these difficulties, simulation models are useful tools (Smith et al. 2020; Lui et al., 2016). One example of the application of these models is the set of simulation studies of the GHG abatement potential of multiple management practices (Fig. 1) possible at case study sites in some Australian grain production regions (Dumbrell et al. 2017; Meier et al. 2017; Palmer et al. 2017). Each of these studies determined the abatement potential of different practices for a small number of sites or a limited area. The strength of this approach was its relevance to each case study through using data on local climate and soils, and having management practices tailored to these conditions. However, the practices providing the greatest abatement varied across the different sites, reflecting the heterogeneous environments and practices. For example, conserving rather than burning crop stubble (scenario 2, Table 1) resulted in contrasting GHG abatement in the case study sites at Dalwallinu (Fig. 1g) and Wimmera (Fig. 1e) because of differences in SOC sequestration at the two sites (Meier et al. 2017). The challenge now is to generalise across case studies such as these, in order to provide comprehensive insights into the practices that provide abatement across the diverse environments.

Table 1 Management practice scenarios simulated across sites and soils and associated predictors of GHG emissions (defined in Table 2).

As computing power increases, it is becoming common to run large numbers of simulations with complex cropping system models, with statistical techniques used to explore and/or summarise the output from these simulations (Godde et al. 2016; Gladish et al. 2019; Shahhosseini et al. 2019). Machine learning algorithms are well suited to finding patterns in large datasets, and random forest is a popular machine learning method for application to heterogeneous data (Breiman, 2001; Paluszynska, 2020; SAS 2022; Shaikhina et al., 2019). It is effective for evaluating data that have many variables (termed “predictors” within this methodology, e.g., annual rainfall, soil texture, fertiliser rate), many classes within a predictor (e.g., loam, clay, and sandy for a soil texture predictor), heterogeneous data types (e.g., continuous, categorical), data interactions (e.g., records with more than one treatment implemented), and small or imbalanced datasets. Regression and classification trees exploit this variability to group classes of predictors that are important for a targeted outcome; random subsampling from a population permits many such trees to be calculated. In the random forest approach, the ensemble of trees is analysed to build a model that identifies the most important predictors for an outcome (Breiman, 2001; Fawagreh et al. 2014). The approach is highly regarded for its accuracy and power as a data mining tool (Fawagreh et al. 2014), and has been used in several recent studies for prediction of SOC (e.g., Nabiollahi et al. 2019; Payen et al. 2021).

In this study, we used the random forest method to determine management practices that consistently provide GHG abatement across diverse environments of Australian grain production areas. The method was applied to simulated changes in SOC sequestration and soil N₂O emissions from eight case study sites (described in Section 2.1.1). To our knowledge, the random forest methodology has had little or no previous application to the problem of identifying practices for net abatement from changes in SOC and from non-CO₂ GHGs such as N₂O (Fawagreh et al. 2014; Liaw and Wiener 2002) despite the importance of considering multiple GHGs in the search for effective practices and policy for GHG abatement (e.g., Nong et al. 2021). We found that increased cropping intensity (through incorporating cover crops, increasing frequency of grain crops, or selecting crops with higher biomass in the rotation) was markedly and consistently of greater importance for predicting ∆GHG balance than any other predictor, so we focus the discussion on this predictor.

2 Materials and methods

2.1 The case study sites

2.1.1 Description

This study was based on soil, crop, climate, GHG emissions (from changes in SOC and soil N₂O emissions), and gross margins from 15 GHG abatement scenarios (Table 1) simulated for eight case studies (Fig. 1e–g). Full details of the data and simulations are given by Dumbrell et al. (2017), Meier et al. (2017, 2020, 2022), and Palmer et al. (2017) and only an overview is provided here. The case study sites were located in different agro-ecological zones of Australia’s grain-producing areas, spanning diverse climates (subtropical, temperate, Mediterranean), soils (sands, texture contrast, clay), and crop management (summer and winter crops). There were up to 7 soil types at each of the sites.

The GHG abatement scenarios (Table 1) were chosen to (1) potentially provide GHG abatement and (2) fit within constraints to cropping system management at each site. Management practices in the scenarios included changes to crop stubble management, manure application and N fertiliser rates, as well as changes to the amount of crop biomass grown each year (referred to as changes to “cropping intensity”) achieved by using short-term cover crops or making alternative crop choices. The abatement scenarios were “tuned” for local implementation. For example, for sites with a Mediterranean climate where crops are grown in winter, an increase in cropping intensity could be achieved by planting a cover crop in summer (which would grow in years of adequate summer rainfall). By contrast, in northern Australian locations where rainfall is more evenly distributed throughout the year and both winter and summer crops can be grown, an increase in cropping intensity could include replacing bare fallows with a cover crop, increasing the frequency of grain crops to more than one per year, or selecting a crop potentially having larger biomass (e.g., replacing chickpeas with faba beans) in the crop rotation. The tailoring of scenarios to local practices meant that not all scenarios were simulated at all sites.

The management scenarios were simulated for all site-soil combinations with the Agricultural Production Systems sIMulator v7.5 (APSIM; https://www.apsim.info/). APSIM is a widely used and well-tested model that has been applied to and validated for diverse farming systems in many countries (Holzworth et al. 2014; Keating and Thorburn 2018) including Australian grain production systems (Hochman and Horan 2018). Specific case study site validations are described by Godde et al. (2016), Meier et al. (2017, 2020), and Palmer et al. (2017). All simulations were run for 100 years. Each 100-year simulation was repeated ten times, each starting in a different year (giving 526,000 records). Outputs from the simulations were used to calculate annual GHG emissions and gross margins for each site-soil-scenario-start year combination. Gross margins were calculated as crop income less direct costs of crop production (Dumbrell et al. 2017). The GHG emissions consisted of (a) N₂O emissions from the soil (0.0–1.0 m) and (b) CO₂ associated with the change in SOC (0.0–0.3 m) (Meier et al. 2017).

Changes in SOC and emissions of N₂O were converted to carbon dioxide equivalent (CO₂e) mass using a factor of 3.67 to convert C to CO₂ (CO₂e-SOC) and the 100-year global warming potential (GWP) of 298 for N₂O (CO₂e-N₂O), respectively (Myhre et al. 2013). The GHG balance in each year was calculated from the sum of CO₂e-SOC and CO₂e-N₂O. GHG abatement was determined by comparing the emissions from the scenarios (S2-15, Table 1) to a high emissions baseline (S1). The average annual change in GHG emissions (∆GHG balance) and gross margins of each scenario compared to the baseline was determined over 25 and 100 years after the simulation commenced, consistent with the time horizons identified by the Intergovernmental Panel on Climate Change (Solomon and Srinivasan, 1995) and aligning with permanence periods for sequestered carbon in Australia under the Carbon Credits (Carbon Farming Initiative) Act 2011 (Cth) s. 86A (Austl.).
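The calculation of the GHG balance and of ∆GHG balance described above can be sketched as follows. This is a minimal illustration and not the authors' code: the function names and sample values are hypothetical. It assumes the change in SOC is supplied as a mass of C and N₂O as a mass of N₂O (the text does not spell out the input units), and that an increase in SOC counts as a negative emission.

```python
# Conversion factors quoted in the text (Myhre et al. 2013).
CO2_PER_C = 3.67   # factor applied to changes in SOC
GWP_N2O = 298      # 100-year factor applied to N2O


def ghg_balance(delta_soc_kg_c, n2o_kg):
    """Annual GHG balance in kg CO2e/ha: CO2e-SOC plus CO2e-N2O.

    Assumed units (hypothetical, not stated in the source):
    delta_soc_kg_c is the yearly change in SOC in kg C/ha (positive means
    carbon was stored) and n2o_kg is the yearly emission in kg N2O/ha.
    """
    co2e_soc = -delta_soc_kg_c * CO2_PER_C  # stored carbon lowers the balance
    co2e_n2o = n2o_kg * GWP_N2O
    return co2e_soc + co2e_n2o


def mean_delta_ghg_balance(scenario_years, baseline_years, horizon):
    """Average annual change versus the baseline over the first `horizon` years.

    Each argument is a list of (delta_soc_kg_c, n2o_kg) tuples, one per year.
    A negative result indicates abatement relative to the baseline.
    """
    diffs = [
        ghg_balance(*s) - ghg_balance(*b)
        for s, b in zip(scenario_years[:horizon], baseline_years[:horizon])
    ]
    return sum(diffs) / len(diffs)


# Hypothetical two-year example: the scenario stores more carbon than the
# baseline but also emits more N2O.
baseline = [(0.0, 0.5), (0.0, 0.5)]
scenario = [(200.0, 0.75), (100.0, 0.75)]
result = mean_delta_ghg_balance(scenario, baseline, horizon=2)
```

In this made-up example the SOC gain outweighs the extra N₂O, so `result` is negative, which corresponds to net abatement under the assumed sign convention.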

2.1.2 Predictors for GHG emissions

Independent variables (termed “predictors” in the random forest terminology) that potentially predict the ∆GHG balance were identified from the dataset for management practice scenarios and location-based characteristics. Four management-based predictors were derived, namely crop residue management, manure application, change in cropping intensity, and N fertiliser rate (Table 2). Climate and soil properties known to influence GHG emissions were also selected as potential predictors for GHG abatement because they can override the effect of management practices when compared between locations. The predictors were derived from the climate and soil data used in the simulations.

Table 2 Management-based and location-based predictors of GHG emissions.

2.1.3 Correction of data imbalance

The combined dataset was imbalanced because there were differences in the number of soils and scenarios simulated at each case study site, yet each site was equally important for representing the soils, climates, and scenarios in the study. For example, 40% of the simulated data came from the Liebe site compared with 2% from the Wimmera site. These differences needed to be corrected to avoid bias toward sites that were more frequently represented in the data. The imbalance was corrected by additional sampling (“oversampling”) of records from the underrepresented sites. This technique was used because we wanted to ensure that no predictive value was lost from any of the data. Loss of predictive value could have occurred if problems with imbalanced data were overcome by removing (“undersampling”) records from the overrepresented sites. Oversampling was achieved by increasing the number of records from underrepresented sites through random sampling, with replacement, until equal proportions of each combination were achieved. The oversampling was performed using the caret package (Kuhn, 2022) within the R software environment (R Core Team 2020).
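The idea of oversampling with replacement can be illustrated with a short sketch. The authors used the caret package in R; the plain-Python function below is a stand-in and does not reproduce that package's interface. The function name, the choice of the site as the grouping key, and the sample records are all hypothetical.

```python
import random
from collections import defaultdict


def oversample_by_group(records, key, seed=0):
    """Resample with replacement so every group matches the largest group.

    `records` is a list of dicts and `key` names the grouping field (for
    example the case study site). All original records are kept; only
    randomly drawn duplicates are added to the smaller groups.
    """
    rng = random.Random(seed)
    groups = defaultdict(list)
    for rec in records:
        groups[rec[key]].append(rec)
    target = max(len(members) for members in groups.values())
    balanced = []
    for members in groups.values():
        balanced.extend(members)
        extra = target - len(members)
        balanced.extend(rng.choices(members, k=extra))  # draw with replacement
    return balanced


# Hypothetical imbalanced data: 8 records from site "A", 2 from site "B".
records = [{"site": "A", "id": i} for i in range(8)]
records += [{"site": "B", "id": i} for i in range(2)]
balanced = oversample_by_group(records, key="site")
counts = {s: sum(r["site"] == s for r in balanced) for s in ("A", "B")}
```

Because records are only duplicated and never dropped, no information from the overrepresented group is discarded, which matches the rationale given in the text for preferring oversampling to undersampling.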

2.2 Analyses

2.2.1 The random forest approach

The random forest approach (Breiman, 2001) was implemented using the randomForest package in R (Liaw and Wiener 2002; R Core Team 2020). In this analysis, two random forests were generated to predict ∆GHG balance averaged over the first 25 and 100 years after the simulation commenced (subsequently referred to as 25 and 100 years). The dataset was randomly divided into training (75%) and testing (25%) datasets, using the rsample package (Silge et al. 2022), to evaluate the predictive performance of the random forest models. The number of regression trees “grown” was set to 1000 and the importance of predictors then computed. Default arguments for the randomForest package in R were used including (a) the number of variables randomly sampled as candidates at each split, which was based on the number of predictors (12/3 = 4); (b) sampling of cases with replacement; and (c) setting the minimum size of terminal nodes to 5, to minimise computation time and avoid overfitting.
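The data split and the settings listed above can be written down compactly. The sketch below is in plain Python rather than R and calls neither randomForest nor rsample; the helper names and the settings dictionary are hypothetical. The rule for the number of candidate predictors follows the 12/3 = 4 calculation given in the text.

```python
import random


def train_test_split(records, train_fraction=0.75, seed=0):
    """Shuffle the records and split them into training and testing lists."""
    rng = random.Random(seed)
    shuffled = list(records)
    rng.shuffle(shuffled)
    cut = int(round(train_fraction * len(shuffled)))
    return shuffled[:cut], shuffled[cut:]


def default_mtry(n_predictors):
    """Predictors tried at each split for regression: floor(p / 3), at least 1."""
    return max(n_predictors // 3, 1)


# Settings quoted in the text; the dictionary itself is only illustrative.
settings = {
    "n_trees": 1000,
    "mtry": default_mtry(12),
    "min_terminal_node_size": 5,
    "sample_with_replacement": True,
}

# Hypothetical stand-in for the dataset: 100 record identifiers.
train, test = train_test_split(list(range(100)))
```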

2.2.2 Statistics

Statistics for evaluating model performance

The capacity of each random forest to predict ∆GHG balance was evaluated against the testing dataset using the coefficient of determination (r²) statistic, which indicates the proportion of variation in the test data that is explained by the model.
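For reference, one common way of computing this statistic is sketched below. The text does not say which formula was used; this sketch assumes the definition based on residual and total sums of squares (the squared correlation between observed and predicted values is another common choice). The function name and the numbers are hypothetical.

```python
def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_residual / SS_total."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


# Hypothetical test-set values and model predictions.
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.0, 2.0, 3.0, 5.0]
score = r_squared(observed, predicted)
```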

Statistics for ranking predictor importance within random forests

The randomForestExplainer package (Paluszynska et al. 2020) was used in the R programming environment (R Core Team 2020) to calculate the following statistics for ranking predictor importance:

(a) Increase in mean squared error

The increase in mean squared error statistic is the average increase of mean squared error of the random forest after the values of a predictor are randomly permuted, which effectively excludes that predictor's information from the model (Paluszynska et al. 2020). This statistic is a relative measure of importance, and larger values of this statistic indicate that the predictor is of greater importance to the outcome of interest.

(b) Times a root and node depth

This statistic refers to the number of the 1000 trees in the random forest for which a predictor was used to split the root (i.e., first) node in the tree (Paluszynska et al. 2020). Predictors that split the data first provide the greatest difference between observations in the dataset.

A related statistic also used here is node depth, which refers to the average number of splits (nodes) at which the predictor occurs in the forest (the first split is node “1” and indicates highest importance).

(c) Increase in node purity

Increase in node purity refers to the decrease in the sum of squared errors accumulated whenever a specific predictor is chosen to split the data (Paluszynska et al. 2020). This is a relative measure and predictors with larger values are more important for predicting the variable of interest.
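Of the statistics listed above, the increase in mean squared error is the simplest to illustrate outside a full random forest implementation. In the randomForest package it is obtained by permuting a predictor's values in the out-of-bag data of each tree; the sketch below simplifies this by permuting one column of a single dataset and comparing the error of any prediction function before and after. The function names, the toy data, and the stand-in model are hypothetical.

```python
import random


def mse(model, rows, targets):
    """Mean squared error of `model` (a callable taking one row) on the data."""
    return sum((model(r) - t) ** 2 for r, t in zip(rows, targets)) / len(rows)


def permutation_mse_increase(model, rows, targets, column, repeats=20, seed=0):
    """Average rise in MSE when one predictor column is shuffled across rows."""
    rng = random.Random(seed)
    base = mse(model, rows, targets)
    increases = []
    for _ in range(repeats):
        values = [r[column] for r in rows]
        rng.shuffle(values)
        shuffled_rows = [dict(r, **{column: v}) for r, v in zip(rows, values)]
        increases.append(mse(model, shuffled_rows, targets) - base)
    return sum(increases) / repeats


# Hypothetical data: the target depends on "intensity" and ignores "noise".
rows = [{"intensity": i % 5, "noise": (i * 7) % 3} for i in range(50)]
targets = [-100.0 * r["intensity"] for r in rows]


def model(row):
    """Stand-in for a fitted forest: predicts from "intensity" only."""
    return -100.0 * row["intensity"]


inc_intensity = permutation_mse_increase(model, rows, targets, "intensity")
inc_noise = permutation_mse_increase(model, rows, targets, "noise")
```

Shuffling the column the model relies on raises the error, while shuffling a column the model ignores leaves it unchanged, which is why larger values of this statistic are read as greater importance.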

3 Results and discussion

3.1 Predictive strength of the random forests for ∆GHG balance

The random forest models for prediction of ∆GHG balance explained a high proportion of variation in the test data over both 25 and 100 years (r² = 0.88 and r² = 0.86, respectively). These values were acceptable given the high diversity amongst case study farms, so the models were relied upon to identify important predictors of ∆GHG balance.

3.2 Ranking of emission predictors

All statistics identified CropIntensity (cropping intensity) as the most important predictor of ∆GHG balance. The mean squared error (IncMSE) for prediction of ∆GHG balance for 25 and 100 years increased by 95 and 99%, respectively, when cropping intensity was excluded from the model (Fig. 2). This increase was two to four times that which occurred when other predictors of ∆GHG balance were excluded from the model for both time periods. Other statistics also confirmed the importance of CropIntensity for predicting ∆GHG balance for both 25 and 100 years (Table 3): CropIntensity was the predictor used to split the root node (nRoot) 325 and 348 times. This rate of splitting was at least 25% greater than for all other predictors and resulted in an average node depth (Depth) of 1.3 for cropping intensity compared with 1.7 and 1.8 for the next most important predictors. In a related outcome, the node purity statistic (IncPurity) was 3.3 times greater than for the next most important predictor when cropping intensity was selected to split data in the random forest model.

Table 3 Increase in mean squared error (IncMSE), times a root (nRoot), node depth (Depth), and increase in node purity (IncPurity) statistics for ranking the cropping intensity (CropIntensity) predictor compared to the next most important predictor for the random forest models predicting ∆GHG balance. Statistics are defined in Section 2.2.2 and presented for both 25- and 100-year simulations.

3.3 Effect of cropping intensity on GHG emissions

Cropping intensity (CropIntensity) was markedly and consistently of greater importance for predicting ∆GHG balance than any other predictor (Fig. 2), so it is valuable to examine the variation in ∆GHG balance and the effects of cropping intensity on the individual GHGs that define ∆GHG balance. Mean ∆GHG balance decreased markedly (i.e., abatement occurred) over both 25 and 100 years when cropping intensity was increased (Fig. 3a), although there was a considerable range in ∆GHG balance across the different scenarios, sites, and environments (as indicated by the error bars). The abatement was driven by decreases in ∆CO₂e for SOC emissions (i.e., increases in stored SOC; Fig. 3b) that offset the increase in ∆CO₂e from N₂O emissions (Fig. 3c). Average annual CO₂e for SOC was lower for the 100- than the 25-year results (Fig. 3b) because the rates of increase in SOC caused by higher cropping intensity declined over time as soils became increasingly saturated in SOC. Nevertheless, even over 100 years, decreases in ∆CO₂e from SOC offset increased ∆CO₂e from N₂O emissions and resulted in net emissions reductions (negative ∆GHG balance). While increasing cropping intensity is a well-established practice for potentially abating GHG emissions (Basche et al. 2016; Kaye and Quemada 2017; Lui et al., 2016; Smith et al. 2014), we showed its contrasting effects on both SOC and N₂O and that it has the potential to provide GHG abatement across the diverse climates, soils, and crop management occurring in the Australian grain production areas.

The predictor CropIntensity was consistently more important for predicting ∆GHG balance across the case studies than Nfertiliser (Table 3), despite inorganic N fertiliser use being an important source of agricultural emissions in Australian grain production systems (Commonwealth of Australia 2021; Mielenz et al. 2016; Sevenster et al. 2022). One reason for the relatively better performance of CropIntensity than Nfertiliser as a predictor was that all soils had relatively low SOC (0.5–2.2%; Meier et al. 2022) so there was potential for the increased biomass carbon returned to soils to be stored. The low SOC of many Australian agricultural soils has arisen because the decomposition rate of the original SOC has exceeded the rate of inputs of carbon from crops after the commencement of crop production (Luo et al. 2010). These soils therefore had the potential to build SOC in response to increased carbon inputs from biomass when the cropping intensity increased (Fig. 3a).

In comparison, the Nfertiliser predictor was relatively less important for predicting ∆GHG balance of this dataset because conservative N application rates are already used by many grain producers (Hochman and Horan 2018), and these practices were reflected in the scenarios. So even the scenarios with increased N fertiliser inputs (S3 and S5, Table 1) were relatively conservative and there was little potential to alter GHG emissions by decreasing N fertiliser inputs. The importance of the Nfertiliser predictor could change, however, if a wider range of emission sources had been included when calculating GHG balances in the dataset, e.g., emissions embodied in the production and transport of N fertiliser (Sevenster et al. 2022).

Different results would also have been obtained if we considered some N fertiliser management scenarios well outside current common practice. The conservative N application rates common in Australian grain production limit yields, an outcome that farmers generally accept in order to limit the risk of not getting an economic return (through higher yields) on higher N rates in dry years (Hochman and Horan 2018). Higher N fertiliser rates will, on average, increase yields as well as N₂O emissions. However, GHG intensity (i.e., emissions relative to yields) of grain production is lower in this situation (Sevenster et al. 2022). The differences between that result and that of our study, which aimed to identify practices that will reduce GHG emissions, illustrate the need for a clear objective when assessing the GHG benefits of different crop management practices or designing policies to support adoption of practices.

3.4 Effect of cropping intensity on gross margins

The increasing cropping intensity predictor included planting a crop at every opportunity in the cropping sequence. Although this produced larger abatement compared to scenarios with lower crop intensity, it was less profitable across the majority of site-soil-start year combinations (Fig. 3d), more so for the 25- than 100-year simulations. The result occurred because the increase in cropping intensity did not consistently increase yield and thus crop income, resulting in the average crop gross margins decreasing by $57 and $28 AUD ha⁻¹ year⁻¹ over 25 and 100 years. Limiting the implementation of cropping intensity practices to favourable opportunities (e.g., only sowing a cover crop when soil water is adequate) can increase average annual gross margins (Whish et al. 2009; Rose et al. 2022), but doing so reduces the abatement achieved.

Maximising abatement from increased cropping intensity could be helped if farmers could access financial support from carbon markets to compensate for the generally reduced gross margins (Fig. 3d). In Australia, the Clean Energy Regulator administers the Emissions Reduction Fund to purchase carbon credits in exchange for evidence of GHG abatement (CER 2021). The “2021 soil carbon method” administered by the Fund explicitly includes cover crops in its methodology but does not include other ways of increasing cropping intensity, such as increasing the frequency of grain crops or growing crops with larger biomass in the rotation where the environment permits. Thus, a broader definition of cropping intensity to include both the size and number of crops could increase the options for increasing SOC and hence GHG abatement.
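To give a sense of scale for the compensation discussed above, a back-of-envelope break-even carbon price can be computed from the average figures reported in this article (abatement of 774 and 444 kg CO₂-e ha⁻¹ year⁻¹ and gross margin decreases of $57 and $28 AUD ha⁻¹ year⁻¹ over 25 and 100 years). This is an illustrative calculation and not a result of the study: pairing the two averages is an assumption, and the sketch ignores transaction costs, discounting, and variation between sites. The function name is hypothetical.

```python
def break_even_carbon_price(gross_margin_loss_aud, abatement_kg_co2e):
    """Carbon price (AUD per tonne CO2e) at which payments offset the loss.

    Both inputs are per hectare per year; abatement is converted from
    kilograms to tonnes before dividing.
    """
    return gross_margin_loss_aud / (abatement_kg_co2e / 1000.0)


# Average figures quoted in the article for increased cropping intensity.
price_25 = break_even_carbon_price(57, 774)    # 25-year horizon
price_100 = break_even_carbon_price(28, 444)   # 100-year horizon
```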

4 Conclusions

Achieving GHG abatement in cropping systems requires identification and adoption of practices that are both likely to reduce emissions from various sources of GHG and be widely effective (Nong et al. 2021). Coupling the random forest approach with the locally specific results of case studies enabled us to identify, for the first time, practices likely to provide consistent GHG abatement across the diverse environmental and management conditions of Australian grain production areas. This outcome was remarkable considering that conditions at the case study sites ranged from deep sandy soils in low rainfall Mediterranean climates, to heavy clay soils in subtropical climates with double the rainfall of the Mediterranean sites. The random forest methodology facilitated a redefinition of the management practice scenarios to more general predictors that were common to all case studies. The redefined scenarios could have been used to evaluate GHG abatement in the case studies without a random forest. However, this is not an obvious approach for analysing case study results because the usual purpose of a case study approach is to examine research questions in a local context. By comparison, the methodology used in this study could both make use of the diversity of the case study conditions and provide statistics for the general importance of the different management practice predictors.

The clear finding from our study that increasing cropping intensity could generally provide abatement is consistent with results of experimental studies on practices providing GHG abatement (Lui et al., 2016). It also provides a clearer comparison of the benefit of increasing cropping intensity with other practices than is likely to be the case with experiments. Further, the method used in this study accounts for the contrasting effects of SOC sequestration and N₂O emissions on GHG abatement, increasing the generality of the findings (Nong et al. 2021). We also found there was a trade-off between abatement and profitability from increasing cropping intensity, indicating that economic incentives are likely needed to support the widespread adoption of these practices. Current Australian carbon accounting methodologies do not recognise some of the ways to increase cropping intensity and changes to the methodology will be required for Australian grain farmers to earn carbon credits from these. This conclusion illustrates that our approach has the potential to benefit industry by providing information on abatement activities and policies. Similar application of the approach to other problems has clear potential to increase the return on research investment.

The study was based on simulations of the management practice scenarios at each of the case study sites and, as with all simulation studies, further testing is required to build confidence in the predictions. The results can guide efforts put into that testing. The study also only considered effects of SOC sequestration and N₂O emissions on GHG abatement. It would be useful to consider other sources of on-farm (e.g., lime application) and off-farm (e.g., emissions embodied in production of nitrogenous fertilisers) emissions in future studies, as is commonly done with life cycle assessments. Where those assessments are conducted through case studies (Sevenster et al. 2022), the machine learning approach used here, or similar, could help better generalise the case study results.

Data availability

The dataset used in the study is available from the CSIRO Data Collection https://doi.org/10.25919/21qh-m346.

Code availability

The random forest code follows the example from Paluszynska et al. 2020.

References

Allen DE, Pringle MJ, Bray S et al (2013) What determines soil organic carbon stocks in the grazing lands of north-eastern Australia? Soil Res. https://doi.org/10.1071/SR13041
Barton L, Hoyle FC, Stefanova KT, Murphy DV (2016) Incorporating organic matter alters soil greenhouse gas emissions and increases grain yield in a semi-arid climate. Agric Ecosyst Environ 231:320–330. https://doi.org/10.1016/j.agee.2016.07.004
Basche AD, Archontoulis SV, Kaspar TC, Jaynes DB, Parkin TB, Miguez FE (2016) Simulating long-term impacts of cover crops and climate change on crop production and environmental outcomes in the Midwestern United States. Agric Ecosyst Environ 218:95–106. https://doi.org/10.1016/j.agee.2015.11.011
Bos JFFP, ten Berge HFM, Verhagen J, van Ittersum MK (2016) Trade-offs in soil fertility management on arable farms. Agric Syst 157:292–302. https://doi.org/10.1016/j.agsy.2016.09.013
Breiman L (2001) Random forests. Mach Learn 45:5–32. https://doi.org/10.1023/A:1010933404324
CER (Clean Energy Regulator, Australian Government) (2021) Understanding your soil carbon project. http://www.cleanenergyregulator.gov.au/DocumentAssets/Documents/Understanding%20your%20soil%20carbon%20project%20-%20Simple%20method%20guide.pdf. Accessed 31 Mar 2022
Charles A, Rochette P, Whalen JK, Angers DA, Chantigny MH, Bertrand N (2017) Global nitrous oxide emission factors from agricultural soils after addition of organic amendments: a meta-analysis. Agric Ecosyst Environ 236:88–98. https://doi.org/10.1016/j.agee.2016.11.021
Commonwealth of Australia (2021) National Inventory Report 2019. https://www.industry.gov.au/sites/default/files/April%202021/document/national-inventory-report-2019-volume-1.pdf. Accessed 31 Mar 2022
Du Z, Angers DA, Ren T, Zhang Q, Li G (2017) The effect of no-till on organic C storage in Chinese soils should not be overemphasized: A meta-analysis. Agric Ecosyst Environ. https://doi.org/10.1016/j.agee.2016.11.007
Dumbrell NP, Kragt ME, Meier EA, Biggs JS, Thorburn PJ (2017) Greenhouse gas abatement costs are heterogeneous between Australian grain farms. Agron Sustain Dev 37:28. https://doi.org/10.1007/s13593-017-0438-6
Fawagreh K, Gaber MM, Elyan E (2014) Random forests: from early developments to recent advancements. Syst Sci Control Engineer 2:602–609. https://doi.org/10.1080/21642583.2014.956265
Feng J, Li F, Zhou X, Xu C, Ji L, Chen Z, Fang F (2018) Impact of agronomy practices on the effects of reduced tillage systems on CH₄ and N₂O emissions from agricultural fields: a global meta-analysis. PLoS ONE 13(5): e0196703. https://doi.org/10.1371/journal.pone.0196703
Gladish DW, Darnell R, Thorburn PJ, Haldankar B (2019) Emulated multivariate global sensitivity analysis for complex computer models applied to agricultural simulators. J Agric Biol Environ Stat 24:130–153. https://doi.org/10.1007/s13253-018-00346-y
Godde C, Thorburn P, Biggs J, Meier E (2016) Understanding the impacts of soil, climate, and farming practices on soil organic carbon sequestration: a simulation study in Australia. Front Plant Sci 7:661. https://doi.org/10.3389/fpls.2016.00661
Gregorich EG, Rochette P, VandenBygaart AJ, Angers DA (2005) Greenhouse gas contributions of agricultural soils and potential mitigation practices in Eastern Canada. Soil Tillage Res 83:53–72. https://doi.org/10.1016/j.still.2005.02.009
Hochman Z, Horan H (2018) Causes of wheat yield gaps and opportunities to advance the water-limited yield frontier in Australia. Field Crops Res 228:20–30. https://doi.org/10.1016/j.fcr.2018.08.023
Holzworth DP, Huth NI, deVoil PG et al (2014) APSIM – evolution towards a new generation of agricultural systems simulation. Environ Model Softw 62:327–350. https://doi.org/10.1016/j.envsoft.2014.07.009
Huang Y, Ren W, Wang L, Hui D, Grove JH, Yang X, Tao B, Goff B (2018) Greenhouse gas emissions and crop yield in no-tillage systems: a meta-analysis. Agric Ecosyst Environ 268:144–153. https://doi.org/10.1016/j.agee.2018.09.002
IPCC (2014) Climate Change 2014: synthesis report. IPCC, Geneva, Switzerland, 151 pp
Kaye JP, Quemada M (2017) Using cover crops to mitigate and adapt to climate change. A review. Agron Sustain Dev 37:4. https://doi.org/10.1007/s13593-016-0410-x
Keating BA, Thorburn PJ (2018) Modelling crops and cropping systems – evolving purpose, practice and prospects. Eur J Agron 100:163–176. https://doi.org/10.1016/j.eja.2018.04.007
Kumara TMK, Kandpal A, Pal S (2020) A meta-analysis of economic and environmental benefits of conservation agriculture in South Asia. J Environ Manage 269:110773. https://doi.org/10.1016/j.jenvman.2020.110773
Kuhn M (2022) Package ‘caret’. https://cran.r-project.org/web/packages/caret/caret.pdf. Accessed 1 Mar 2023
Lam SK, Chen D, Mosier AR, Roush R (2013) The potential for carbon sequestration in Australian agricultural soils is technically and economically limited. Sci Report 3:2179. https://doi.org/10.1038/srep02179
Liaw A, Wiener M (2002) Classification and regression by randomForest. R News 2(3):18–22. https://www.r-project.org/doc/Rnews/Rnews_2002-3.pdf. Accessed 31 Mar 2022
Liu C, Cutforth H, Chai Q, Gan Y (2016) Farming tactics to reduce the carbon footprint of crop cultivation in semiarid areas. A review. Agron Sustain Dev 36:69. https://doi.org/10.1007/s13593-016-0404-8
Luo Z, Wang E, Sun OJ (2010) Soil carbon change and its responses to agricultural practices in Australian agro-ecosystems: a review and synthesis. Geoderma 155:211–223. https://doi.org/10.1016/j.geoderma.2009.12.012
Mei K, Wang Z, Huang H, Zhang C, Shang X, Dahlgren RA, Zhang M, Xia F (2018) Stimulation of N₂O emission by conservation tillage management in agricultural lands: a meta-analysis. Soil Tillage Res 182:86–93. https://doi.org/10.1016/j.still.2018.05.006
Meier EA, Thorburn PJ, Kragt ME, Dumbrell NP, Biggs JS, Hoyle FC, van Rees H (2017) Greenhouse gas abatement on southern Australian grains farms: biophysical potential and financial impacts. Agric Syst 155:147–157. https://doi.org/10.1016/j.agsy.2017.04.012
Meier EA, Thorburn PJ, Bell LW, Harrison MT, Biggs JS (2020) Greenhouse gas emissions from cropping and grazed pastures are similar: a simulation analysis in Australia. Front Sustain Food Syst 3:121. https://doi.org/10.3389/fsufs.2019.00121
Meier E, Thorburn P, Biggs J, Palmer J, Dumbrell N, Kragt M (2022) Achieving least cost GHG abatement opportunities in Australian grain farms - case study simulation outputs. v1. CSIRO. Data Collection. https://doi.org/10.25919/21qh-m346
Mielenz H, Thorburn PJ, Harris RH, Officer SJ, Li G, Schwenke GD, Grace PR (2016) Nitrous oxide emissions from grain production systems across a wide range of environmental conditions in eastern Australia. Soil Res 54:659–674. https://doi.org/10.1071/SR15376
Myhre G, Shindell D, Breon F-M et al (2013) Anthropogenic and natural radiative forcing. In: Climate Change 2013: the physical science basis. Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA
Nabiollahi K, Eskandari Sh, Taghizadeh-Mehrjardi R, Kerry R, Triantafilis J (2019) Assessing soil organic carbon stocks under land-use change scenarios using random forest models. Carbon Manag 10(1):63–77. https://doi.org/10.1080/17583004.2018.1553434
Nong D, Simshauser P, Nguyen DB (2021) Greenhouse gas emissions vs CO2 emissions: comparative analysis of a global carbon tax. Appl Energy 298:117223. https://doi.org/10.1016/j.apenergy.2021.117223
Palmer J, Thorburn PJ, Meier EA, Biggs JS, Whelan B, Singh K, Eyre DN (2017) Can management practices provide greenhouse gas abatement in grain farms in New South Wales, Australia? Crop Pasture Sci 68:390–400. https://doi.org/10.1071/CP17026
Paluszynska A (2020) Understanding random forests with randomForestExplainer. DrWhy.AI. https://modeloriented.github.io/randomForestExplainer/articles/randomForestExplainer.html. Accessed 1 Mar 2023
Paluszynska A, Biecek P, Jiang Y (2020) randomForestExplainer: explaining and visualizing random forests in terms of variable importance. R package version 0.10.1. https://CRAN.R-project.org/package=randomForestExplainer. Accessed 31 Mar 2022
Payen FT, Sykes A, Aitkenhead M, Alexander P, Moran D, MacLeod M (2021) Predicting the abatement rates of soil organic carbon sequestration management in Western European vineyards using random forest regression. Clean Environ Syst 2:100024. https://doi.org/10.1016/j.cesys.2021.100024
Powlson DS, Stirling CM, Thierfelder C, White RP, Jat ML (2016) Does conservation agriculture deliver climate change mitigation through soil carbon sequestration in tropical agro-ecosystems? Agric Ecosyst Environ 220:164–174. https://doi.org/10.1016/j.agee.2016.01.005
R Core Team (2020) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/. Accessed 31 Mar 2022
Robertson F, Armstrong R, Partington D, Perris R, Oliver I, Aumann C, Crawford D, Rees D (2015) Effect of cropping practices on soil organic carbon: evidence from long-term field experiments in Victoria, Australia. Soil Res 53:636–646. https://doi.org/10.1071/SR14227
Rosace MC, Veronesi F, Briggs S, Cardenas LM, Jeffery S (2020) Legacy effects override soil properties for CO₂ and N₂O but not CH₄ emissions following digestate application to soil. Glob Change Biol Bioenergy 12:445–457. https://doi.org/10.1111/gcbb.12688
Rose TJ, Parvin S, Han E, Condon J, Flohr BM, Schefe C, Rose M, Kirkegaard JA (2022) Prospects for summer cover crops in southern Australian semi-arid cropping systems. Agric Syst 200:103415. https://doi.org/10.1016/j.agsy.2022.103415
SAS (2022) Machine learning, what it is and why it matters. https://www.sas.com/en_au/insights/analytics/machine-learning.html#:~:text=Machine%20learning%20is%20a%20method,decisions%20with%20minimal%20human%20intervention. Accessed 1 Mar 2023
Sevenster M, Bell L, Anderson B, Jamali H, Horan H, Simmons A, Cowie A, Hochman Z (2022) Australian grains baseline and mitigation assessment. Main report. CSIRO, Australia. https://grdc.com.au/about/our-industry/greenhouse-gas-emissions/GRDC_MainFinalReport_170122_CONFIDENTIAL.pdf. Accessed 31 May 2022
Shahhosseini M, Martinez-Feria RA, Hu G, Archontoulis SV (2019) Maize yield and nitrate loss prediction with machine learning algorithms. Environ Res Lett 14:124026. https://doi.org/10.1088/1748-9326/ab5268
Shaikhina T, Lowe D, Daga S, Briggs D, Higgins R, Khovanova N (2019) Decision tree and random forest models for outcome prediction in antibody incompatible kidney transplantation. Biomed Signal Proces. https://doi.org/10.1016/j.bspc.2017.01.012
Silge J, Chow F, Kuhn M, Wickham H (2022) rsample: general resampling infrastructure. https://rsample.tidymodels.org, https://github.com/tidymodels/rsample. Accessed 1 Mar 2023
Smith P, Bustamante M, Ahammad H et al (2014) Agriculture, Forestry and Other Land Use (AFOLU). In: Climate Change 2014: mitigation of climate change. Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA
Smith P, Soussana J-F, Angers D et al (2020) How to measure, report and verify soil carbon change to realize the potential of soil carbon sequestration for atmospheric greenhouse gas removal. Glob Change Biol 26:219–241. https://doi.org/10.1111/gcb.14815
Solomon S, Srinivasan J (1995) Radiative forcing. In: The science of climate change, second assessment report to the intergovernmental panel on climate change. Cambridge University Press, Cambridge, UK, pp 108–118
Sun W, Canadell JG, Yu L, Yu L, Zhang W, Smith P, Fischer T, Huang Y (2020) Climate drives global soil carbon sequestration and crop yield changes under conservation agriculture. Glob Change Biol 26:3325–3335. https://doi.org/10.1111/gcb.15001
Trost B, Prochnow A, Drastig K, Meyer-Aurich A, Ellmer F, Baumecker M (2013) Irrigation, soil organic carbon and N₂O emissions. A review. Agron Sustain Dev 33:733–749. https://doi.org/10.1007/s13593-013-0134-0
Whish JPM, Price L, Castor PA (2009) Do spring cover crops rob water and so reduce wheat yields in the northern grain zone of eastern Australia? Crop Pasture Sci 60:517–525. https://doi.org/10.1071/CP08397
Xia L, Lam SK, Wolf B, Kiese R, Chen D, Butterbach-Bahl K (2018) Trade-offs between soil carbon sequestration and reactive nitrogen losses under straw return in global agroecosystems. Glob Change Biol 24:5919–5932. https://doi.org/10.1111/gcb.14466

Funding

Open access funding provided by CSIRO Library Services. Development of the original dataset was funded by the Grains Research and Development Corporation project “Achieving least cost GHG abatement–opportunities in Australian grains farms”. The analysis reported in this paper was funded by the Commonwealth Scientific and Industrial Research Organisation (CSIRO).

Author information

Authors and Affiliations

Commonwealth Scientific and Industrial Research Organisation, 306 Carmody Road, St Lucia, QLD, Australia
Elizabeth Meier, Peter Thorburn, Jody Biggs & Jeda Palmer
Commonwealth Scientific and Industrial Research Organisation, 2-40 Clunies Ross Street, Acton, ACT, 2601, Australia
Nikki Dumbrell
Centre for Agricultural Economics and Development, UWA School of Agriculture and Environment, University of Western Australia, 35 Stirling Highway, Perth, WA, 6009, Australia
Marit Kragt

Contributions

EAM: analyses, methodology, writing, reviewing, editing.

PJT: conceptualization, methodology, writing, reviewing, editing, resources.

JSB: conceptualization, methodology, software, investigation.

JP: analyses, reviewing, editing.

ND: analyses, reviewing, editing.

MEK: conceptualization, analyses, reviewing, editing.

Corresponding author

Correspondence to Peter Thorburn.

Ethics declarations

Ethics approval

No ethics approval was required for use of the published dataset in which case study farms were not identified.

Consent to participate

No consent to participate was required for use of the published dataset in which case study farms were not identified.

Consent for publication

No consent for publication was required in respect to the published dataset in which case study farms were not identified.

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.

About this article

Cite this article

Meier, E., Thorburn, P., Biggs, J. et al. Using machine learning with case studies to identify practices that reduce greenhouse gas emissions across Australian grain production regions. Agron. Sustain. Dev. 43, 29 (2023). https://doi.org/10.1007/s13593-023-00880-1

Accepted: 16 February 2023
Published: 20 March 2023
DOI: https://doi.org/10.1007/s13593-023-00880-1
