Challenges of predicting gas transfer velocity from wind measurements over global lakes

Klaus, Marcus; Vachon, Dominic

doi:10.1007/s00027-020-00729-9

Challenges of predicting gas transfer velocity from wind measurements over global lakes

Research Article
Open access
Published: 01 May 2020

Volume 82, article number 53, (2020)
Cite this article

Download PDF

You have full access to this open access article

Aquatic Sciences Aims and scope Submit manuscript

Challenges of predicting gas transfer velocity from wind measurements over global lakes

Download PDF

3497 Accesses
33 Citations
4 Altmetric
Explore all metrics

Abstract

Estimating air–water gas transfer velocities (k) is integral to understand biogeochemical and ecological processes in aquatic systems. In lakes, k is commonly predicted using wind-based empirical models, however, their predictive performance under conditions that differ from their original calibration remains largely unassessed. Here, we collected 2222 published k estimates derived from various methods in 46 globally distributed lakes to (1) evaluate the predictions of a selection of six available wind-speed based k models for lakes and (2) explore and develop new empirical models to predict k over global lakes. We found that selected k models generally performed poorly in predicting k in lakes. Model predictions were more accurate than simply assuming a mean k in only 2–39% of all lakes, however, we could not identify with confidence the specific conditions in which some models outperformed others. We developed new wind-based models in which additional variables describing the spatial coverage of k estimates and the lake size and shape had a significant effect on the wind speed-k relationship. Although these new models did not fit the global dataset significantly better than previous k models, they generate overall less biased predictions for global lakes. We further provide explicit estimates of prediction errors that integrate methodological and lake-specific uncertainties. Our results highlight the potential limits when using wind-based models to predict k across lakes and urge scientists to properly account for prediction errors, or measure k directly in the field whenever possible.

Effective fetch and relative exposure index maps for the Laurentian Great Lakes

Article Open access 18 December 2018

Lacey A. Mason, Catherine M. Riseng, … Robert Jensen

Global lakes are warming slower than surface air temperature due to accelerated evaporation

Article 23 October 2023

Yan Tong, Lian Feng, … R. Iestyn Woolway

Global reconstruction of twentieth century lake surface water temperature reveals different warming trends depending on the climatic zone

Article 27 January 2020

Sebastiano Piccolroaz, R. Iestyn Woolway & Christopher J. Merchant

Introduction

Estimating gas fluxes across the air–water interface in lakes is fundamental for understanding their biogeochemical, environmental and ecological functioning. Accurate estimates of lakes carbon (CO₂ and CH₄), nitrogen (N₂ and N₂O) and oxygen (O₂) fluxes with the atmosphere are key to constrain their biogeochemical cycles (Likens 2010), quantify whole lake metabolism (Dugan et al. 2016) and evaluate greenhouse gas emissions at regional and global scales (Raymond et al. 2013; Soued et al. 2015; Hastie et al. 2017). Estimating air–water fluxes of environmental aquatic contaminants (Hornbuckle et al. 1994; Bidleman 1999; Poissant et al. 2000) and biogenic volatile organic compounds (VOC) in lakes (Fink 2007) is also critical for preserving lake ecosystem services. Gas flux across the air–water interface (F) is described by using Fick’s first law of diffusion as the difference in the surface water (C_wtr) and air-equilibrium (C_eq) gas concentrations, multiplied by the air–water gas transfer velocity (k):

$$F = k \cdot \left( {C_{wtr} - C_{eq} } \right),$$

(1)

where positive $F$ implies flux from the water to the atmosphere. There are methods to estimate gas fluxes such as the use of floating chambers (Engle and Melack 2000; Matthews et al. 2003) or by eddy covariance (Anderson et al. 1999). However, these methods can be time and/or cost consuming and may be difficult to be applied to capture potential spatial variation in multiple systems at the same time (Cole et al. 2010; Schilder et al. 2013; Erkkilä et al. 2018). Alternatively, fluxes can be modelled using Eq. 1, based on gas concentrations and k. However, while gas concentrations are usually relatively straightforward to measure, k is by far more difficult to estimate with high confidence.

The air–water gas transfer velocity of sparingly soluble gases is driven by near-surface water turbulence, which in lakes is mainly generated by wind stress over the lake surface and thermal convection when colder waters masses at the surface sink due to greater density (MacIntyre et al. 1995). Wind is the main source of near-surface turbulence in many lakes (Read et al. 2012), however, the efficiency of this wind-to-turbulence transfer may be modulated by several lake characteristics and other hydrodynamic processes. For a given wind speed, larger fetch lengths will result in larger wave heights, greater turbulence, and thus higher k values (Schilder et al. 2013; Vachon and Prairie 2013; Gålfalk et al. 2013). Surface heat flux, which determines whether a lake is warming or cooling, also affects near-surface turbulence. A warming surface water will stratify thermally and will suppress the wind-driven turbulence (negative buoyancy flux), while a cooling lake surface will generate additional turbulence from the convective movements of water masses (positive buoyancy flux) (MacIntyre et al. 2010). This effect is related to lake area and latitude (Read et al. 2012). Using wind to predict k in lakes is thus far from being direct, however, compared with turbulence measurements, wind speed is relatively easy to measure and therefore the most widely used predictor in empirical models of k for lakes.

Several empirical wind-based k models are currently available for lakes, with k often standardized to a Schmidt number of 600 (k₆₀₀) to characterize CO₂ transfer at 20 °C water temperature. However, each model was calibrated in a distinct system, under specific conditions, and using different methods to determine k₆₀₀. This results in a wide range of model parameterizations (Table 1). The most common methods to derive k₆₀₀ include floating chambers (e.g. Vachon and Prairie 2013), mass balance approaches (e.g. Cole and Caraco 1998) and the eddy covariance technique (e.g. MacIntyre et al. 2010). These methods are fundamentally different in their approach, in addition to having specific issues that may affect the resulting k₆₀₀ estimates (see MacIntyre et al. (1995), Jähne and Haussecker (1998), Wanninkhof et al. (2009) and Cole et al. (2010) for more detailed discussions). A key difference between methods, however, is related to their scales of spatial and temporal integration (SIN and TIN, respectively), ranging from centimeters and minutes (floating chambers); to meters and hours (eddy covariance technique) and the whole lake and days (mass balance approach) (Fig. 1a). The different SIN and TIN will have implications for the measured k₆₀₀ for a given wind speed. For example, when the relationship between k₆₀₀ and wind speed has a positive curvature, the average k₆₀₀ derived over a period of variable winds will be greater than over a period of steady winds of the same average wind speed (Wanninkhof et al. 1987; Livingstone and Imboden 1993). Longer TIN may thus increase its estimated value for a given averaged wind speed. The effect of SIN is less well understood and documented (Schilder et al. 2013; Paranaíba et al. 2018). Wind measured in the center of the lake may potentially record higher wind speeds than the average wind speed integrated over the whole lake area (Venäläinen et al. 2003; Docquier et al. 2016). This intra-lake spatial variability depends on the size and shape of the lake and whether the lake is sheltered by trees, mountains or buildings (Kwan and Taylor 1994; Markfort et al. 2010; Vachon and Prairie 2013). As a result, complex-shape lakes should, in theory, have greater within-lake spatial variability in wind-driven turbulence and thus k₆₀₀. The average k₆₀₀ derived from mass balances (whole-lake, i.e. SIN = 1) should thus be lower than the measured k₆₀₀ at the center of the lake from the eddy covariance technique (SIN < 1) and floating chambers (SIN << 1) (Fig. 1c).

Table 1 Models for predicting the air–water gas transfer velocity (k₆₀₀, in cm h⁻¹) based on wind speed at 10 m height (U₁₀, in m s⁻¹) alone or in combination with lake area (LA, in km²)

Full size table

Available wind-based k₆₀₀ models also differ in the number and geometrical dimensions of the lakes they are calibrated against, in particular the surface area (LA) and the shoreline development index (SDI), the latter describing the ratio of the shoreline length of a given lake relative to the shoreline length of a circular lake of the same area (Fig. 1b). Each model has been calibrated under specific conditions. How the models perform under different conditions still remains widely unknown. This uncertainty is typically dealt with by averaging across predictions from a number of different k₆₀₀ models (Raymond et al. 2013; Hastie et al. 2017). However, model averaging does not necessarily reduce prediction errors nor provide more accurate estimates (Dormann et al. 2018). Existing uncertainties in k₆₀₀ modelling call for a systematic evaluation of the context-dependence of the bias of currently used k₆₀₀ models and attempts to develop new models in order to test whether uncertainties can be reduced and quantified relative to environmental conditions.

Our first aim is to evaluate the predictions of a selection of currently available wind-based k₆₀₀ models using published k₆₀₀ measurements in global lakes. We selected six commonly used or recently published models that differ in their methods of k₆₀₀ measurements, their SIN and TIN and the geometry of the lake under which the models were calibrated (Fig. 1 and Table 1). Specifically, we assess context-dependent model prediction performance using multivariate regression tree analyses and discuss how the environmental conditions (average wind speed), the geometry (LA and SDI) and latitude (Lat) of the lakes studied, and the relevant scales of SIN and TIN related to the method affect the model performance. We hypothesized that none of the available models generally performs better than other models (H1), but that certain models outperform others under certain environmental conditions or system characteristics (H2). Our second aim is to explore new parametrizations using the existing lake k₆₀₀ data that allow more flexibility by accounting for lake-specific effects and explicitly provide prediction errors. We hypothesized that the wind speed effect on k₆₀₀ will vary among the different lakes due to their different LA, SDI, SIN or Lat (H3) and hence including these additional variables will generally improve k₆₀₀ predictions among a wide range of lakes (H4).

Materials and methods

Data compilation and standardization

We compiled data on 2297 simultaneous estimates of k and wind speed from 79 global lakes and 10 reservoirs (Online Resources Table S1), however, only a subset was used to accommodate our analyses. The data were retrieved from all peer-reviewed scientific papers (n = 46) we could find via the search engine “web of science” and papers cited therein. As keywords, we used “lake”, “pond” or “reservoir” in combination with “wind”, and either “k600”, “gas exchange”, “piston velocity”, “reaeration” or “gas transfer”. We only included k estimates that were based on sparingly soluble gases where air–water gas exchange is dominantly water-side controlled. Data were either provided by corresponding authors, extracted from tables, or digitized from figures using the web tool WebPlotDigitizer (Rohatgi 2019) as indicated in Online Resources Table S1. All k estimates were derived from the floating chamber, mass balance or eddy covariance approach and based on a variety of tracer gases (carbon dioxide, methane, oxygen, radon, helium, neon, krypton, sulfur hexafluoride, mercury and propane). The eddy covariance data were binned according to wind speed following the methodology described in the respective original papers. We documented the method-specific SIN and TIN as described in Online Resources Text S1. SIN was expressed relative to the total lake surface area. To make the data comparable across studies, we used k₆₀₀ and wind speed at 10 m height above ground (U₁₀). If k values were not reported as k₆₀₀, we converted the estimates of k for a given Schmidt number (Sc) to a Schmidt number of 600 $k_{600} = k\left( {\frac{600}{{Sc}}} \right)^{ - n}$ where n was 2/3 for low wind speeds (U₁₀ ≤ 3.7 m s⁻¹) and 1/2 for high wind speeds (U₁₀ > 3.7 m s⁻¹) as suggested by Jähne et al. (1987). We used Schmidt number parameterizations by Wanninkhof (2014) for carbon dioxide, methane, oxygen, radon, helium (³He), neon, krypton, and sulfur hexafluoride, by Kuss et al. (2009) for mercury and by Witherspoon and Saraf (1965) for propane. Parameterizations were chosen for fresh- or saltwater, depending on the salinity classification in the original paper. For each lake, we also compiled Lat, LA and SDI, calculated following Hutchinson (1957):$SDI = \frac{p}{{2\sqrt {\pi LA} }},$ where p is the lake perimeter. These measures were extracted from the HydroLakes dataset (for $LA$> 10 ha, Messager et al. (2016)), by on-screen digitization of lake surfaces in Google Maps, or, if available, as shown in maps provided in the original papers (for LA ≤10 ha). For Swedish lakes, we relied on the ViVaN dataset because of its higher accuracy and resolution relative to the HydroLakes dataset (Nisell et al. 2007).

General analytical approach

We first evaluated the predictions of each of the six k₆₀₀ models listed in Table 1 relative to observed k₆₀₀ in our dataset. We then analyzed context-dependent model biases using multivariate regression tree (MRT) analysis to identify experimental and method-specific conditions (average U₁₀, LA, SDI, SIN, TIN, absolute value of Lat) under which certain k₆₀₀ models would perform better than other models. Finally, we explored new k₆₀₀ parametrizations using multivariate regression analysis. For these analyses, we only included those 46 lakes with at least six k₆₀₀ observations each, and a total of 2222 observations. We chose this threshold to maximize the total number of observations and the number of lakes, but fulfill recommendations of a minimum larger than five observations per lake (Theall et al. 2011) and an average larger than 30 observations per lake (Scherbaum and Ferreter 2009, Online Resources Fig. S1).

Evaluating available wind-based k₆₀₀ models

We evaluated models by comparing observed and predicted k₆₀₀ based on four performance measures and summarized these in an integrated performance index. First, we calculated the root mean squared deviation (RMSD) of the predicted values relative to the 1:1 line of observed vs. predicted values (Piñeiro et al. 2008). The smaller the RMSD, the better is the model fit. Second, we calculated the coefficient of determination, $R^{2} = 1 - \frac{{SS_{res} }}{{SS_{tot} }}$, where $SS_{res} = \mathop \sum \nolimits_{i} \left( {y_{i} - f_{i} } \right)^{2}$ is the residual sum of square and $SS_{tot} = \mathop \sum \nolimits_{i} \left( {y_{i} - \overline{y}_{i} } \right)^{2}$ is the total sum of squares with observed values y_i, predicted values f_i and the mean of the observed data $\overline{y}_{i}$. To account for differences in model complexity, we further adjusted R² for the number of predictor variables v relative to the number of observations $m:R_{adj} ^{2} = 1 - (1 - R^{2} )(m - 1)/(m - v - 1). R_{adj} ^{2} = 1$ implies perfect fit, $R_{adj}^{2} > 0$ implies the model fits the data better than the arithmetic mean of the data, and $R_{adj}^{2} < 0$ implies that the arithmetic mean fits the data better than the model predictions. Third and fourth, we calculated the intercept and slope of the linear regression line of observed vs. predicted k₆₀₀. We tested whether the intercept was significantly different from zero based on the significance of the intercept of the linear regression of observed vs. predicted k₆₀₀. We also tested whether the slope was significantly different from 1 based on the significance of the slope of the linear regression of observed-minus-predicted vs. predicted k₆₀₀. The larger the p-value of the intercept and slope, the closer the regression line of observed vs. predicted k₆₀₀ is to the 1:1 line (intercept = 0, slope = 1). Linear regressions were fit using the ‘lm’ function in R 3.6.1 (R Core Team 2019). We ranked each model, with the highest rank assigned to the lowest RMSD, highest R²_adj, and highest p-values of the intercept and slope tests explained above. We used the median rank among all performance measures as an integrated performance index scaling from 1 (best performance) to 6 (worst performance).

We used MRT analysis (De’Ath 2002) to evaluate if any models perform better than others under certain conditions. MRT forms groups (leaves) of lakes by repeated splitting of the data. Each split is defined by a simple rule based on specific conditions and is selected to cluster together lakes for which the different model performance patterns are similar. For example, a cluster could be created if under specific conditions hypothetical models A and B perform better than hypothetical models C and D. In practice, MRT clusters a matrix of dependent variables under the constraints of a matrix of independent variables. As dependent variables, we chose the integrated performance index of the different k₆₀₀ models. As independent variables, we chose average U₁₀, LA, SDI, the method-specific SIN and average TIN, and the absolute value of Lat. To account for potential variability in tree structures depending on the threshold number of lakes to be included per leaf, we fitted a series of trees with threshold numbers ranging from 1 to 23. We fitted MRTs using the ‘mvpart’ function of the R package ‘mvpart’ (De’Ath 2014), using the Euclidean distance as a dissimilarity index. We selected the best tree within one standard error of the overall best using tenfold cross-validation. We evaluated the extent to which the MRT can explain variability in model performance among lakes based on its R² and tenfold cross-validated relative error. As an additional evaluation, we also performed an unconstrained cluster analysis using the same number of clusters and the same metric of dissimilarity as in the MRT analysis using the ‘kmeans’ function in R (Hartigan and Wong 1979). If the unconstrained cluster analysis yields a higher R² than the MRT analysis, then unobserved factors account for additional variation in model performance (De’Ath 2002). We also tested if certain models perform differently within each leaf relative to grand means using a Kruskal–Wallis one-way analysis of variance (‘kruskal.test’ function in R). In case of significant overall differences (p < 0.05), we tested model-specific differences using pairwise Wilcoxon rank sum tests (‘pairwise.wilcox.test’ function in R). We chose nonparametric hypothesis tests to account for the relatively small sample size.

Parametrizing new wind-based k₆₀₀ models

To explore new k₆₀₀ parameterizations, we fitted a series of regression models to the dataset of lakes with at least six k₆₀₀ observations. We fitted nonlinear mixed-effects models following the multilevel approach with cross-level interactions by Bryk and Raudenbush (1992). We used a 2-level model, where U₁₀ was included to explain variation in k₆₀₀ at the (within-lake) observation level and LA, Lat, SDI and SIN to explain variation in the U₁₀- k₆₀₀ relationships at the (among-) lake level. We tested three functional forms commonly used in the literature (Table 1): linear ($k_{600} = a \cdot U_{10} + c$), exponential ($k_{600} = a \cdot exp(U_{10} \cdot b)$) and power ($k_{600} = a \cdot U_{10}^{b} + c$). For each functional form we tested to what extent the slope (a) or shape (b) of the k₆₀₀-U₁₀ relationship is a function of either LA or SDI, based on previous evidence on the potential lake-specific transfer from wind to turbulence (Schilder et al. 2013; Vachon and Prairie 2013; Gålfalk et al. 2013). We also tested whether the offset (c) is a function of either LA or Lat, based on previous evidence of lake-specific additional wind-independent turbulence by e.g. heat fluxes (MacIntyre et al. 2010; Read et al. 2012). The effect of SIN has never been tested, therefore we allowed it to have an influence on all parameters (a, b and c). We fitted models that cover all combinations of these predictor variables, amounting to 16 (linear, exponential) or 64 (power) candidate models. We did not test for TIN effects because it would not be useful as a predictive variable and require additional but unavailable data on the frequency distribution of U₁₀ for each time integration interval (Wanninkhof et al. 1987; Livingstone and Imboden 1993). We evaluated the fits of all candidate models using the Akaike Information Criterion (AIC) and selected, for each of the three model types, the model with the lowest AIC and all parameters significant as the final model. For the final models, we report the RMSD, R²_adj, and slope and intercept of linear regressions of observed vs. predicted k₆₀₀ values and present 95% confidence intervals for mean predictions, representing a typical lake, following an approach by Bolker (2008). See Online Resources Text S2 for more details on the modelling procedure.

Results

Data set

The compiled data covered k₆₀₀ values from 0.01 to 57.62 cm h⁻¹, U₁₀ from 0 to 13 m s⁻¹, LA from 181 m² to 1342 km² and SDI from 1.0 to 22.5 (Fig. 2). k₆₀₀ increased with U₁₀ and was generally < 10 cm h⁻¹ for U₁₀ < 2 m s⁻¹ and > 10 cm h⁻¹ for U₁₀ > 8 m s⁻¹. U₁₀ increased generally with LA, but decreased for very large LA (Fig. 2b), because these had very high SDI (Online Resources Fig. S2). k₆₀₀ was not strongly related to SDI (Fig. 2c). TIN varied from 0.0024 to 122 days and SIN varied from 5·10^–11 to 1 (Online Resources Table S1).

General performance of wind-based k₆₀₀ models

Table 2 summarizes the general performance of each model in predicting k₆₀₀ for a specific lake. Model performance was similar for five of the six models with median RMSDs of 2.2–4.3 cm h⁻¹, R²_adj values of − 0.9 to − 0.2, intercepts of − 0.9 to 1.9 cm h⁻¹ and slopes of 0.7–1.7. Negative R²_adj suggest that predictions were not better than using mean k₆₀₀ observations. Only 2–39% of all cases showed positive R²_adj depending on the model. Intercepts and slopes of linear regressions of observed vs. predicted values were significantly different from 0 and 1 in 30–61% and 41–52% of all cases, respectively. The model L18 by Li (2018) performed generally very poorly outside its training domain. Complete results for each model and lake are reported in Online Resources Table S2.

Table 2 Performance of empirical wind-based models for predictions of air–water gas transfer velocities (k₆₀₀) in 46 lakes with at least 6 observations per lake

Full size table

Predicting variability in model performance

The available experimental conditions (average U₁₀, LA, SDI, SIN, TIN, absolute value of Lat) were poor predictors of k₆₀₀ model performance. This was indicated by the low explanatory power of the MRT analysis, and variable tree structures, depending on the threshold set for the minimum number of lakes included in each leaf. Depending on this threshold, two different trees were generated (Fig. 3, Online Resources Fig. S3). Cross-validated relative errors of these trees were between 1.14 and 1.17 and hence close to one, indicating poor prediction (De’Ath 2002). This was confirmed by the small proportion of variance in model performance ranks that was explained by the MRTs (R² = 0.10–0.11). The unconstrained cluster analyses explained a much higher proportion of variance (R² = 0.32–0.36), indicating that model performance ranks clustered relatively strongly and that k₆₀₀ models performed differently under different conditions. However, the lower R² of MRT relative to the unconstrained cluster analysis suggests that observed conditions could only partly account for the difference in model performance (De’Ath 2002).

Factors influencing wind-based k₆₀₀ model performance

The two MRTs suggest that model performance was a function of TIN or LA, depending on the threshold number of lakes chosen. Apart from L18 always being the worst model, we can observe the following structures. For a threshold of 1–4 lakes per leaf, model performance was determined by LA (Online Resources Fig. S3). For very large systems (LA ≥ 375 km²), the model VP13 by Vachon and Prairie (2013) performed slightly but not statistically significantly better than the other models. For systems smaller than 375 km², models performed equally. For a threshold of minimum 5 lakes per leaf, model performance was primarily determined by TIN (Fig. 3). For TIN < 105 min, VP13 performed slightly, but not statistically significantly better than the other models. For TIN ≥ 105 min, the model G07 by Guerin et al. (2007) performed significantly better than VP13.

Differences in model performance ranks were reflected by differences in the individual performance measures. For example, the higher ranks of G07 relative to VP13 for TIN ≥ 105 min were due to lower RMSDs and higher R²_adj (Online Resources Fig. S4). The generally poor performance of L18 was reflected by extremely high RMSD, low R²_adj and slopes almost always being significantly different from 1.

New k₆₀₀ model parametrizations

Among all the tested linear, exponential and power models, we identified a suite of potential candidates for new k₆₀₀ parameterizations in which all their parameters were statistically significant (p < 0.05), and which had similar statistical support (evidence ratio ≥ 0.05) as the respective models with the lowest AIC (Table 3). The candidate models all included lake- or method-specific characteristics in addition to U₁₀. Hence, including these variables significantly improved the models relative to models with U₁₀ alone. The slope of the linear model type was equivalently explained by LA, SDI or SIN, and the intercept of the power model type was equivalently explained by SIN, LA, or no predictor variable. These equivalences likely result, at least in part, from correlations between LA, SDI and SIN (Online Resources Fig. S2). The linear and power models with the lowest AIC showed a similar structure, where their slope component (a) increased with LA and their intercept component (c) decreased with SIN. We did not find any variable that significantly modulated the shape (b) component in the power model. The slope (a) and shape (b) component of the exponential model with the lowest AIC decreased with SIN and increased with SDI, respectively (Table 3).

Table 3 Performance characteristics of linear, exponential and power mixed-effects models to predict air–water gas transfer velocity (k₆₀₀) from wind speed at 10 m height (U₁₀), lake area (LA), shoreline development index (SDI) and/or space integration (SIN)

Full size table

For each functional shape, we exemplarily chose the model with the lowest AIC for further evaluation. Accordingly, the linear and power model predictions explained similar variability in k₆₀₀ (65%) and had similar RMSD (3.35 and 3.34 cm h⁻¹, Online Resources Table S3). The linear regressions of observed vs. predicted k₆₀₀ followed closely the 1:1 line, with intercepts and slopes not significantly different from 0 and 1, respectively (Fig. 4a,c). Accounting for lake-specific intercepts and slopes would not improve the fit of observed and predicted k₆₀₀ (Likelihood ratio test, Online Resources Table S6), suggesting a good fit among all lakes. Model residuals were homogeneous with a mean near zero across the whole range of predicted k₆₀₀, U₁₀, LA, SDI, SIN, and Lat (Online Resources Fig. S5–S6). Finally, the linear and power model parameters were robust against the threshold minimum number of k₆₀₀ observations per lake (Online Resources Fig. S8). In contrast, the exponential model with the lowest AIC explained only 35% of the variability in k₆₀₀ with a RMSD of 4.89 cm h⁻¹ (Online Resources Table S3). The linear regressions of observed and predicted k₆₀₀ diverged from the 1:1 line, with intercept and slope significantly different from 0 and 1, respectively (Fig. 4b). Accounting for lake-specific intercepts and slopes would improve the fit of observed and predicted k₆₀₀ (Likelihood ratio test, Online Resources Table S6), suggesting that the exponential model was significantly biased for some lakes. Model residuals were heterogeneous across the whole range of predicted k₆₀₀, U₁₀, LA, SDI, SIN, and Lat (Online Resources Fig. S7). The model parameter coefficients varied with the threshold minimum number of k₆₀₀ observations per lake (Online Resources Fig. S8). Overall, these evaluations suggest that the linear and power models fitted the data substantially better and with less bias than the exponential model.

The k₆₀₀ predictions based on our new parameterizations were surprisingly similar for the linear and power models with the lowest AIC values, because of their similar model structure and the power exponent is close to 1 (Fig. 5). Predictions by the exponential model tended to be higher for relatively low (< 1 m s⁻¹) and high (> 7 m s⁻¹) U₁₀. These mixed-effect models integrate a large variability in lake-specific model parameterizations (see grey lines in Fig. 5). For example, U₁₀-k₆₀₀ slopes (a) varied roughly between 0 and 5, and power exponents (b) varied from near 0 to up to 10 (Online Ressources Fig. S10). Our mixed-effects model predictions fell largely within the range of predictions by published models (Fig. 5). For example, the intercept of our linear model was similar to M10 and CW03 for whole-lake space integrations (SIN = 1) and similar to G07, CC98, and VP13 for the minimum space integration found in our dataset (SIN = 5·10^–11). The slopes were within the range of slopes in VP13.

The k₆₀₀ predictions showed large uncertainties, as exemplarily shown in Fig. 6 for the linear model with the lowest AIC value. The lower and upper bounds of 95% confidence intervals were 10–250% smaller or larger than the mean predictions, respectively. This ratio was relatively small (< 25%) for U₁₀ > 2 m s⁻¹ and LA > 1 km² but increased drastically towards smaller LA where the density of available data was relatively scarce, and towards lower U₁₀. The increase in prediction uncertainties towards lower U₁₀ was more pronounced for whole-lake integrations (SIN = 1, Fig. 6a–c) relative to smaller space integrations of, for example, 1 m² (SIN = 1 m²/LA, Fig. 6d–f).

Compared with previous k₆₀₀ models, none of our new models performed better. Our linear, exponential and power models with the lowest AIC had median RMSDs of 3.3, 3.6 and 3.4, R²_adj of − 0.1, − 0.2 and − 0.1 and intercepts and slopes of observed vs. predicted k₆₀₀ of − 0.3, − 0.7 and − 0.3, and 1.0, 1.2 and 0.9, respectively (c.f. Table 2). The proportion of positive R²_adj was 0.41, 0.39 and 0.37, the proportion of intercepts significantly different from 0 was 0.28, 0.46 and 0.28, and the proportion of slopes significantly different from 1 was 0.41, 0.46 and 0.41, respectively. Overall, the integrated performance indices of our new models were not significantly different from most previous models, except for L18 (pairwise Wilcoxon rank sum test, Online Resources Fig. S9).

Discussion

How well can wind speed predict k₆₀₀ over global lakes?

Previously published models showed that U₁₀ alone can explain a high share of variance in measured k₆₀₀ when applied within their calibration domain (R² = 0.68–0.93; Table 1), suggesting that U₁₀ can be a robust predictor under specific conditions. However, several studies have also failed to identify U₁₀ as a predictor of k₆₀₀ in specific lakes (Cole and Caraco 1998; Matthews et al. 2003; Xiao et al. 2014; Podgrajsek et al. 2015; Holgerson et al. 2017), and the number of unpublished unsuccessful attempts is unknown. Our global data synthesis allowed us to evaluate U₁₀ as a predictor of k₆₀₀ over a wide range of lakes. This dataset, i.e. 2222 simultaneous k₆₀₀ and U₁₀ measurements from 46 lakes and reservoirs, is by far more extensive than the database of previously published wind-based models (Table 1). We show that applying these wind-based models on new lakes and under new conditions results in poor and arbitrary k₆₀₀ predictability, suggesting that U₁₀ parameterizations derived from one or few systems cannot always be used to predict k₆₀₀ beyond the training data sets. The R²_adj of observed vs. predicted k₆₀₀ was on average negative, and only in a minority of cases, U₁₀-based predictions were more accurate than simply assuming a mean k₆₀₀ independent of U₁₀. (Table 2). These results signify that there is no wind-based model that predicts k₆₀₀ well in all types of lakes.

Among all lakes, none of the tested published models clearly performed better than the others, in line with our first hypothesis (H1). However, one model (L18) performed worse than all other models likely because it was developed in a reservoir with significant lateral water flow as an additional source of turbulence (Li 2018). Some models seemed to perform slightly better than others under specific conditions such as very large LA or short TIN (Fig. 3 and Online Resources Fig. S3). However, these differences were small (typically < 2 performance index ranks) and not statistically significant. Overall, we did not clearly identify conditions under which certain models performed better than others, and therefore reject our second hypothesis (H2). This result is further supported by the very low explanatory power of the MRTs (R² = 0.10–11) relative to the unconstrained cluster analysis (R² = 0.32–0.36), implying that there was little structure in model performance and that this structure was mainly determined by unobserved factors, or by secondary factors that were observed but remained unrevealed due to a lack of statistical power.

The generally poor and nearly random performance of published k₆₀₀ models suggests that k₆₀₀ predictions are associated with large errors, especially when models are used to extrapolate to new conditions. This finding emphasizes our second research question whether a more general model fitted to a wider range of training data, and including additional easy to obtain predictor variables could improve k₆₀₀ predictions or at least better account for prediction uncertainties.

Functional shape of wind-based k₆₀₀ parametrizations

To properly assess the new k₆₀₀ parameterizations and the effects of additional predictor variables, we first have to evaluate the shape of the U₁₀-k₆₀₀ relationship. The U₁₀-k₆₀₀ relationship can have many different shapes among lakes (Fig. 5), and this can be due to several factors. For example, non-linear U₁₀-k₆₀₀ relationships can be a result of k₆₀₀ being enhanced at low U₁₀ due to convection (MacIntyre et al. 2010; Polsenaere et al. 2013) or chemical enhancement of reactive gases (Wanninkhof et al. 1987). Our models were not flexible enough to accommodate such variability, which is reflected in high uncertainties at low U₁₀ (Fig. 6). k₆₀₀ can also be enhanced by bubbles formed by breaking waves, leading to an accelerating increase at U₁₀ > 12 m s⁻¹ (Broecker and Siems 1984). This process was negligible in our dataset because most observations were done below the U₁₀ threshold for breaking waves. Some studies hypothesized the presence of microbubbles affecting k₆₀₀ measurements from less soluble gases (e.g. CH₄; Prairie and Giorgio 2013; McGinnis et al. 2015). While the exact drivers of this phenomenon are still unknown, it could indeed affect the lake-specific U₁₀-k₆₀₀ shape. We accounted for potential variability among lakes in the shape of the U₁₀-k₆₀₀ relationships by the exponent b in our power model. Interestingly, b varied widely among lakes but none of the tested lake- or method-related characteristics could significantly explain this high variability. This highlights the need for future studies to identify other factors that could explain high between-lake variability in U₁₀-k₆₀₀ shapes. Our best estimate for the exponent b in the power model was near 1 (Table 3). Alternative parameterizations that assume an exponential shape resulted in poor model fits with strong biases. Therefore, we conclude that the U₁₀-k₆₀₀ relationship of a typical lake in our database is linear and recommend the use of our linear model parameterizations for applications across a wide range of lakes. We provide code implemented in the program R to estimate k₆₀₀ and associated uncertainties as a function of U₁₀, LA, and SIN based on our linear model with the lowest AIC value (Online Resources Code S1, Data S1, and Data S2). We emphasize that this model is one of several other linear models that fitted the data equally well.

Modulators of wind-speed effect on k₆₀₀

Our new multilevel modelling procedure showed that LA, SDI and/or SIN can significantly explain between-lake differences in the U₁₀-k₆₀₀—relationships, which supports our third hypothesis (H3). The U₁₀ effect on k₆₀₀ increased with LA likely as a result of progressively larger fetch length, wave build-up and hence turbulence (Schilder et al. 2013; Vachon and Prairie 2013). The modulating effect of LA in our dataset was less strong than the effect found by Vachon and Prairie (2013) and Guerin et al. (2007) (Online Resources Fig. S11, but note absence of effects in Wanninkhof et al. (1987)). This variability in the effect of LA on U₁₀ can arise from differences in shoreline sheltering of the lakes considered. Relatively many small lakes included in our dataset (Sebacher et al. 1983; Leuning et al. 1984; Boyd and Teichert-Coddington 1992; Denmead and Freney 1992) had only little shoreline sheltering which allowed k₆₀₀ to respond relatively strongly to U₁₀. As a result, the LA effect on U₁₀-k₆₀₀ relationships was rather weak (Online Resources Fig. S11). In contrast, the very large LA effects reported by Guérin et al. (2007) may be due to the long effective fetch length in the elongated estuaries included in this study, allowing larger wave heights for a given U₁₀. The larger LA effects reported by Vachon and Prairie (2013) and Guérin et al. (2007) could also be a result of estimating k₆₀₀ locally in the lake center, for which fetch length may matter more than for whole-lake integrated data. Hence, the effect of LA on U₁₀-k₆₀₀ relationships is far from universal and may need further investigations. Our dataset covered both sheltered and unsheltered systems, and also a wide range of SDI. We argue that all these effects were partly captured by the addition of LA as a modulator of the U₁₀ effect, making our model less specific and more generic than previous models.

Our modelling procedure quantified, for the first time, the effect of SIN on k₆₀₀ estimates. This scale is strongly related to the method of estimating k₆₀₀. Predicted k₆₀₀ was around 2.4 cm h⁻¹ higher for small-scale integrations (10^–6 km²/LA) in the lake center (typical for floating chambers) relative to integrations over the whole lake (typical for mass balances). With the mass balance approach, k₆₀₀ are integrated over the whole lake surface while floating chambers and the eddy covariance technique usually integrate smaller areas near the lake center (Fig. 1c). Here, U₁₀ and hence k₆₀₀ is usually higher than near-shore (Venäläinen et al. 2003; Schilder et al. 2013; Vachon and Prairie 2013; Docquier et al. 2016). Therefore smaller SIN resulted in higher k₆₀₀ in our model. It is important to note, however, that our models predict local k₆₀₀ only for areas around the lake center and that higher or lower k₆₀₀ should be expected in near-shore areas, depending on the wind direction and lake shape (Vachon and Prairie 2013). The SIN effect should raise the awareness among k₆₀₀ model users about which proportion of the lake the anticipated k₆₀₀ values should integrate (Fig. 1c). This finding has implications for calculations of local vs. whole-lake gas fluxes where the scale of concentration and k₆₀₀ estimates should match. For example, flux calculations should be based on SIN << 1 if gas concentrations are measured in the center of the lake, and on SIN = 1, if gas concentrations are measured at (multiple) points that are representative for the whole lake surface.

Some of the selected best candidate models included SDI, suggesting the U₁₀ effect on k₆₀₀ to increase in lakes with more complex shoreline geometry. This finding is rather counter-intuitive as one would think that k₆₀₀ should decrease with SDI, given the increased shoreline sheltering simulating the effect of a small lake (Schilder et al. 2013; Vachon and Prairie 2013; Gålfalk et al. 2013). The positive effect of SDI likely resulted from SDI being highly correlated with LA (Online Resources Fig. S2A) and LA having a positive effect on k₆₀₀. We, therefore, conclude that sheltering due to complex shorelines is not a dominant modulator of the U₁₀-k₆₀₀ relationship and do not recommend using our models that include SDI.

Performance and applicability of new parametrization on global lakes

Accounting for the effects of LA or SIN on lake-level U₁₀-k₆₀₀ relationships improved model fits relative to single-level models based on U₁₀ alone. However, despite this improvement, even our best k₆₀₀ models did not clearly outperform the previously published k₆₀₀ models most of which only included U₁₀ (Online Resources Fig. S9), which provides support to falsify our fourth hypothesis (H4). However, even if including additional variables in addition to U₁₀ did not improve the statistical model fit, it may contribute to a more accurate geographic explanation of variations in k₆₀₀ among a wide range of lakes.

Our best linear model is designed to fit average conditions across a wide range of global lakes and to account for their variability by estimating the error. We regard our model to be the globally least biased k₆₀₀ model available, as it averages out method and system-specific errors across a wide range of conditions and explicitly predicts these errors. This approach fills an important gap, relative to previous k₆₀₀ models which were developed under limited conditions (Table 1). Our new model also greatly expands previous limits of the calibration data sets in terms of U₁₀ and LA and should include most conditions encountered in the world (U₁₀ = 0–13 m s⁻¹, LA = 183 m² to 1342 km², SDI = 1–22.5; (c.f. Verpoorter et al. 2014)). Hence, our model yields predictions to represent, to the best possible, an average global lake, and error predictions to account for potential spatiotemporal variability. With these characteristics, our new model is suitable for large-scale applications such as upscaling gas fluxes to regional and global scales.

Within our prediction domain, prediction errors were large (up to 10 cm h⁻¹ or > 200% of mean predictions) and must be accounted for in large-scale or global gas flux estimates. Interestingly, prediction errors varied non-linearly with U₁₀ and LA. While k₆₀₀ was rather well constrained under conditions that have previously been relatively commonly sampled (intermediate U₁₀ and LA), large uncertainties still exist in relative terms in small lakes (LA < 1 km²) and for low wind speeds (U₁₀ < 2 m s⁻¹), conditions that are common globally (Verpoorter et al. 2014, https://globalwindatlas.info/). Under these conditions, k₆₀₀ can be dominantly driven by many other factors in addition to U₁₀ (Crusius and Wanninkhof 2003; Read et al. 2012; Holgerson et al. 2017). Future data collections should focus on low wind speeds and small lakes, to identify underlying drivers of high variability in k₆₀₀ and by that reduce prediction uncertainties.

Limitations and way forward for better wind-based k₆₀₀ models

Several factors that are known to potentially influence k₆₀₀ were not accounted for in our new k₆₀₀ models. Such factors include surface films, turbulence due to convective cooling, bubble-mediated gas transfer, gas-specific behavior (e.g. chemical enhancement), boundary layer stability, stratification, variability in wind stress and the wave field for given U₁₀ levels due to shoreline sheltering, or method-specific issues beyond spatial and temporal integrations (Wanninkhof et al. 1987; MacIntyre et al. 1995; Jähne and Haussecker 1998). To capture these drivers is often difficult and labor-intensive, hence, relevant data are only available for a limited number of systems. Accounting for environmental factors beyond wind may improve k₆₀₀ predictions in specific lakes (MacIntyre et al. 2010; Polsenaere et al. 2013; Heiskanen et al. 2014). Here, a way forward is to develop new models based on mechanistic first principles, relating environmental factors to turbulent kinetic energy dissipation as the primary driver of k₆₀₀ (Zappa et al. 2007; Vachon et al. 2010). However, it remains to be assessed whether these factors would matter and to what extent mechanistic models that account for these factors could improve k₆₀₀ predictions at larger spatial scales. If turning out important, simple proxies of otherwise difficult to measure processes would need to be found to develop k₆₀₀ models that are applicable over many lakes.

Our analysis indicates that our ability to predict k₆₀₀ based on empirical wind-based models could approach an upper limit that is not necessarily determined by a lack of understanding of the controls of k₆₀₀, but by a lack of methodological consistency. First, models with additional predictor variables or relatively many lakes included (e.g. CC98, VP13), did not perform significantly better than single lake/single variable models (e.g. G07, M10). Second, there was also only little structure in the model’s performance relative to the experimental or environmental conditions under which the data were collected (Fig. 3 and Online Resources Fig. S3). These observations could be either explained by a true lack of structure or that this structure is masked by high noise in the collected k₆₀₀ data due to measurement errors or inconsistencies in methodologies.

One important methodological inconsistency with consequences for the predictability of k₆₀₀ is the way U₁₀ is measured. In our data set, U₁₀ was mainly measured on or within 3 km of the lake surface (76% or 89% of observations from all lakes and 42% or 85% of observations from lakes with at least 6 observations, Online Resources Table S1), but some were measured even farther inland. Measurements were also scaled to 10 m height from different measurement heights. Wind speed point measurements are not always representative of the whole lake mean U₁₀ (Fig. 1c; Venäläinen et al. 2003; Docquier et al. 2016). Wind height scaling is also associated with uncertainties (Large and Pond 1981). To reduce these uncertainties, the spatial resolution of wind speed measurements should be increased to match the scale and extent of k₆₀₀ estimates [e.g. several anemometers over the lake, (Wanninkhof et al. 1987)].

Many other methodological and environmental issues that could lead to noise in U₁₀-k₆₀₀ relationships have been discussed in the literature. In essence, every approach addresses air–water gas transfer from a different angle, with characteristic scales and with method-specific advantages and disadvantages (MacIntyre et al. 1995; Jähne and Haussecker 1998). Our new k₆₀₀ models account for most of these uncertainties as they integrate the variability in previous studies carried out under widely different conditions. However, despite the wide variety of studies included, more efforts are needed to measure k₆₀₀ in lakes types that go beyond our dataset and to assess how representative our data collection is for global conditions of lake-atmosphere gas transfer.

Implications and conclusions

With the currently available set of k₆₀₀ models with their limited calibration domain, researchers have been in need to extrapolate k₆₀₀ to their system of interest without being able to properly quantify and account for resulting uncertainties. This practice would strongly limit their ability to gain insights into ecological and biogeochemical processes in specific systems (Dugan et al. 2016; Kiuru et al. 2019) and to upscale the lakes’ air–water gas fluxes to the globe (Raymond et al. 2013). Based on an evaluation of existing wind-based k₆₀₀ models against an extensive set of published U₁₀ and k₆₀₀ estimates, we conclude that extrapolation can lead to significant biases in k₆₀₀ predictions.

Building on the growing awareness that “the gas transfer velocity is not simply a function of the wind speed” (Jähne and Haußecker 1998), we found here that U₁₀ is generally a poor universal predictor of k₆₀₀ over lakes or reservoirs, no matter which of the existing model parameterizations are applied. Prediction errors remain high even in new parameterizations calibrated against the global data set. Therefore, we conclude (in agreement with Cole et al. 2010) that wind-based models are currently very limited in their use to scale k₆₀₀ across lakes and advise better measurements rather than models of k₆₀₀ when accurate estimations for specific lakes are needed. Similar challenges of scaling k₆₀₀ across systems remain in streams and rivers (Hall and Ulseth 2019).

For the future development of lake models (c.f. Tan and Zhuang 2015; Stepanenko et al. 2016) or larger scale applications (c.f. Zwart et al. 2018), we emphasize accounting for large uncertainties when modelling k₆₀₀. To do so, we propose a new k₆₀₀ model that provides estimates of means and uncertainties in k₆₀₀ as a function of U₁₀, LA, and SIN. Large uncertainties in k₆₀₀ models may be overcome by developing mechanistic models from first principles. Until this is achieved, the best approach remains to calibrate empirical constants against extensive data sets. Progress on these lines will improve the development of coupled atmosphere–land surface–lake models and incorporation of lakes in earth system models (MacKay et al. 2009).

The results from this study emphasize careful thought about the strength and limitations, in particular the calibration domain, and spatial scale of integration of the anticipated means to estimate k₆₀₀, no matter if k₆₀₀ is estimated empirically or modelled. By an informed choice of the most suitable k₆₀₀ model, researchers should be able to limit uncertainties in k₆₀₀ predictions within acceptable boundaries, or as George Box phrased it: “Essentially, all models are wrong, but some are useful”.

References

Anderson DE, Striegl RG, Stannard DI et al (1999) Estimating lake–atmosphere CO₂ exchange. Limnol Oceanogr 44:988–1001. https://doi.org/10.4319/lo.1999.44.4.0988
Article CAS Google Scholar
Bidleman TF (1999) Atmospheric transport and air-surface exchange of pesticides. In: Van Dijk HFG, Van Pul WAJ, De Voogt P (eds) Fate of pesticides in the atmosphere: implications for environmental risk assessment. Springer, Dordrecht
Google Scholar
Bolker B (2008) Ecological models and data in R. Princeton University Press, Princeton
Book Google Scholar
Boyd CE, Teichert-Coddington D (1992) Relationship between wind speed and reaeration in small aquaculture ponds. Aquac Eng 11:121–131. https://doi.org/10.1016/0144-8609(92)90014-O
Article Google Scholar
Broecker HC, Siems W (1984) The role of bubbles for gas transfer from water to air at higher windspeeds. Experiments in the wind-wave facility in Hamburg. In: Brutsaert W, Jirka GH (eds) Gas transfer at water surfaces. Springer, New York, pp 229–236
Chapter Google Scholar
Bryk AS, Raudenbush SW (1992) Hierarchical linear models: applications and data analysis methods. Sage Publications, New York
Google Scholar
Clark JF, Schlosser P, Wanninkhof R, Simpson HJ, Schuster WSF, Ho DT (1995) Gas transfer velocities for SF6 and ³He in a small pond at low wind speeds. Geophys Res Lett 22(2):93–96. https://doi.org/10.1029/94GL02410
Article CAS Google Scholar
Cole J, Bade D, Bastviken D (2010) Multiple approaches to estimating air-water gas exchange in small lakes. Limnol Oceanogr Methods. https://doi.org/10.4319/lom.2010.8.285
Article Google Scholar
Cole JJ, Caraco NF (1998) Atmospheric exchange of carbon dioxide in a low-wind oligotrophic lake measured by the addition of SF6. Limnol Oceanogr 43:647–656. https://doi.org/10.4319/lo.1998.43.4.0647
Article CAS Google Scholar
Crusius J, Wanninkhof R (2003) Gas transfer velocities measured at low wind speed over a lake. Limnol Oceanogr 48:1010–1017. https://doi.org/10.4319/lo.2003.48.3.1010
Article Google Scholar
De’Ath G (2002) Multivariate regression trees : a new technique for modelling species-environment relationships. Ecology 83:1105–1117. https://doi.org/10.1890/0012-9658(2002)083[1105:MRTANT]2.0.CO;2
Article Google Scholar
De’Ath G (2014) mvpart 1.6–2 Multivariate regression trees. https://mran.microsoft.com/snapshot/2014-12-11/web/packages/mvpart/index.html. Accessed 06 Apr 2020).
Denmead OT, Freney JR (1992) Transfer coefficients for water-air exchange of ammonia, carbon dioxide and methane. Ecol Bull 42:31–41
CAS Google Scholar
Docquier D, Thiery W, Lhermitte S, van Lipzig N (2016) Multi-year wind dynamics around Lake Tanganyika. Clim Dyn 47:3191–3202. https://doi.org/10.1007/s00382-016-3020-z
Article Google Scholar
Dormann CF, Calabrese JM, Guillera-Arroita G et al (2018) Model averaging in ecology: a review of Bayesian, information-theoretic, and tactical approaches for predictive inference. Ecol Monogr 88:485–504. https://doi.org/10.1002/ecm.1309
Article Google Scholar
Dugan HA, Iestyn Woolway R, Santoso AB et al (2016) Consequences of gas flux model choice on the interpretation of metabolic balance across 15 lakes. Inl Waters 6:581–592. https://doi.org/10.5268/IW-6.4.836
Article CAS Google Scholar
Emerson S (1975) Gas exchange rates in small Canadian Shield lakes. Limnol Oceanogr 20:754–761. https://doi.org/10.4319/lo.1975.20.5.0754
Article CAS Google Scholar
Emerson S, Broecker W, Schindle DW (1973) Gas-exchange rates in a small lake as determined by radon method. J Fish Res Board Canada 30:1475–1484. https://doi.org/10.1139/f73-237
Article CAS Google Scholar
Engle D, Melack JM (2000) Methane emissions from an Amazon floodplain lake: enhanced release during episodic mixing and during falling water. Biogeochemistry 51:71–90. https://doi.org/10.1023/A:1006389124823
Article CAS Google Scholar
Erkkilä K-M, Ojala A, Bastviken D et al (2018) Methane and carbon dioxide fluxes over a lake: comparison between eddy covariance, floating chambers and boundary layer method. Biogeosciences 15:429–445. https://doi.org/10.5194/bg-15-429-2018
Article CAS Google Scholar
Fink P (2007) Ecological functions of volatile organic compounds in aquatic systems. Mar Freshw Behav Physiol 40:155–168. https://doi.org/10.1080/10236240701602218
Article CAS Google Scholar
Frost T, Upstill-Goddard RC (2002) Meteorological controls of gas exchange at a small English lake. Limnol Oceanogr 47:1165–1174. https://doi.org/10.4319/lo.2002.47.4.1165
Article CAS Google Scholar
Gålfalk M, Bastviken D, Fredriksson S, Arneborg L (2013) Determination of the piston velocity for water-air interfaces using flux chambers, acoustic Doppler velocimetry, and IR imaging of the water surface. J Geophys Res Biogeosci 118:770–782. https://doi.org/10.1002/jgrg.20064
Article Google Scholar
Gelda RK, Auer MT, Effler SW et al (1996) Determination of reaeration coefficients: whole-lake approach. J Environ Eng. https://doi.org/10.1061/(ASCE)0733-9372(1996)122
Article Google Scholar
Guérin F, Abril G, Serça D et al (2007) Gas transfer velocities of CO₂ and CH₄ in a tropical reservoir and its river downstream. J Mar Syst 66:161–172. https://doi.org/10.1016/j.jmarsys.2006.03.019
Article Google Scholar
Hall RO, Ulseth AJ (2019) Gas exchange in streams and Rivers. WIREs Water 1391:1–18. https://doi.org/10.1002/wat2.1391
Article Google Scholar
Hartigan JA, Wong MA (1979) Algorithm AS 136: a K-means clustering algorithm. J R Stat Soc Ser C 28:100–108. https://doi.org/10.2307/2346830(Applied Stat)
Article Google Scholar
Hastie A, Lauerwald R, Weyhenmeyer G et al (2017) CO₂evasion from boreal lakes: revised estimate, drivers of spatial variability, and future projections. Glob Chang Biol 2:711–728. https://doi.org/10.1111/gcb.13902
Article Google Scholar
Heiskanen JJ, Mammarella I, Haapanala S et al (2014) Effects of cooling and internal wave motions on gas transfer coefficients in a boreal lake. Tellus, Ser B Chem Phys Meteorol 66:1–16. https://doi.org/10.3402/tellusb.v66.22827
Article Google Scholar
Hesslein RH, Broecker WS, Quay PD, Schindler DW (1980) Whole-Lake radiocarbon experiment in an oligotrophic lake at the experimental Lakes Area, Northwestern Ontario. Can J Fish Aquat Sci 37:454–463. https://doi.org/10.1139/f80-059
Article CAS Google Scholar
Ho DT, Engel VC, Ferrón S et al (2018) On factors influencing air-water gas exchange in emergent Wetlands. J Geophys Res Biogeosciences 123:178–192. https://doi.org/10.1002/2017JG004299
Article Google Scholar
Holgerson MA, Farr ER, Raymond PA (2017) Gas transfer velocities in small forested ponds. J Geophys Res Biogeosci 122:1011–1021. https://doi.org/10.1002/2016JG003734
Article Google Scholar
Hornbuckle KC, Jeremiason JD, Sweet CW, Elsenreich SJ (1994) Seasonal variations in air-water exchange of polychlorinated biphenyls in Lake Superior. Environ Sci Technol 28:1491–1501. https://doi.org/10.1021/es00057a018
Article CAS PubMed Google Scholar
Howard EM, Forbrich I, Giblin AE et al (2018) Using noble gases to compare parameterizations of air–water gas exchange and to constrain oxygen losses by ebullition in a shallow aquatic environment. J Geophys Res Biogeosci 123:2711–2726. https://doi.org/10.1029/2018jg004441
Article CAS Google Scholar
Hutchinson GE (1957) A treatise on limnology geography, physics, and chemistry. Part. 1. Geography and physics of lakes, vol 1. Wiley, New York
Google Scholar
Jähne B, Fischer KH, Imberger J, et al (1984) Parametrization of air/lake gas exchange. In: Brutsaert W, Jirka GH (eds) Gas Transfer at Water Surfaces. D. Reidel Publishing Company, pp 459–466
Jähne B, Haussecker H (1998) Air-water gas exchange. Annu Rev Fluid Mech 30:443–468. https://doi.org/10.1146/annurev.fluid.30.1.443
Article Google Scholar
Jähne BJ, Münnich KOM, Bösinger R et al (1987) On the parameters influencing air-water gas exchange. J Geophys Res 92:1937–1949. https://doi.org/10.1029/JC092iC02p01937
Article Google Scholar
Jean-Baptiste P, Poisson A (2000) Gas transfer experiment on a lake (Kerguelen Islands) using 3He and SF6. J Geophys Res 105:1177–1186. https://doi.org/10.1029/1999JC900088
Article CAS Google Scholar
Jonsson A, Åberg J, Lindroth A, Jansson M (2008) Gas transfer rate and CO₂ flux between an unproductive lake and the atmosphere in northern Sweden. J Geophys Res 113:G04006. https://doi.org/10.1029/2008JG000688
Article CAS Google Scholar
Kiuru P, Ojala A, Mammarella I et al (2019) Applicability and consequences of the integration of alternative models for CO₂ transfer velocity into a process-based lake model. Biogeosci Discuss 16:3297–3317. https://doi.org/10.5194/bg-2019-95
Article CAS Google Scholar
Kuss J, Holzmann J, Ludwig R (2009) An elemental mercury diffusion coefficient for natural waters determined by molecular dynamics simulation. Environ Sci Technol 43:3183–3186. https://doi.org/10.1021/es8034889
Article CAS PubMed Google Scholar
Kwan J, Taylor PA (1994) On gas fluxes from small lakes and ponds. Boundary-Layer Meteorol 68:339–356
Article Google Scholar
Large WG, Pond S (1981) Open ocean momentum flux measurements in moderate to strong winds. J Phys Oceanogr 11:324–336. https://doi.org/10.1175/1520-0485(1981)011%3c0324:OOMFMI%3e2.0.CO;2
Article Google Scholar
Leuning R, Denmead O, Simpson J, Freney J (1984) Processes of ammonia loss from shallow floodwater. Atmos Environ 18:1583–1592. https://doi.org/10.1016/0004-6981(84)90380-9
Article CAS Google Scholar
Li S (2018) CO₂ oversaturation and degassing using chambers and a new gas transfer velocity model from the Three Gorges Reservoir surface. Sci Total Environ 640–641:908–920. https://doi.org/10.1016/j.scitotenv.2018.05.345
Article CAS PubMed Google Scholar
Likens GE (ed) (2010) Biogeochemistry of Inland Waters, 1st Edition. Academic Press, Cambridge
Livingstone DM, Imboden DM (1993) The non-linear influence of wind-speed variability on gas transfer in lakes. Tellus B 45:275–295. https://doi.org/10.1034/j.1600-0889.1993.t01-2-00005.x
Article Google Scholar
López Bellido J, Tulonen T, Kankaala P, Ojala A (2009) CO₂ and CH₄ fluxes during spring and autumn mixing periods in a boreal lake (Pääjärvi, southern Finland). J Geophys Res Biogeosci 114:1–12. https://doi.org/10.1029/2009JG000923
Article CAS Google Scholar
MacIntyre S, Jonsson A, Jansson M et al (2010) Buoyancy flux, turbulence, and the gas transfer coefficient in a stratified lake. Geophys Res Lett 37:L24604. https://doi.org/10.1029/2010GL044164
Article Google Scholar
MacIntyre S, Wanninkhof R, Chanton JP (1995) Trace gas exchange across the air-water interface in freshwater and coastal marine environments. In: Matson PA, Harriss RC (eds) Biogenic trace gases: measuring emissions from soil and water. Wiley, New York, pp 52–97
Google Scholar
MacKay MD, Neale PJ, Arp CD et al (2009) Modeling lakes and reservoirs in the climate system. Limnol Oceanogr 54:2315–2329. https://doi.org/10.4319/lo.2009.54.6_part_2.2315
Article CAS Google Scholar
Markfort CD, Perez ALS, Thill JW et al (2010) Wind sheltering of a lake by a tree canopy or bluff topography. Water Resour Res. https://doi.org/10.1029/2009WR007759
Article Google Scholar
Martinsen KT, Kragh T, Sand-Jensen K (2020) Carbon dioxide efflux and ecosystem metabolism of small forest lakes. Aquat Sci 82:1–17. https://doi.org/10.1007/s00027-019-0682-8
Article CAS Google Scholar
Matthews CJD, St Louis VL, Hesslein RH (2003) Comparison of three techniques used to measure diffusive gas exchange from sheltered aquatic surfaces. Environ Sci Technol 37:772–780. https://doi.org/10.1021/es0205838
Article CAS PubMed Google Scholar
McGinnis DF, Kirillin G, Tang KW et al (2015) Enhancing surface methane fluxes from an oligotrophic lake: exploring the microbubble hypothesis. Environ Sci Technol 49:873–880. https://doi.org/10.1021/es503385d
Article CAS PubMed Google Scholar
Messager ML, Lehner B, Grill G et al (2016) Estimating the volume and age of water stored in global lakes using a geo-statistical approach. Nat Commun 7:13603. https://doi.org/10.1038/ncomms13603
Article CAS PubMed PubMed Central Google Scholar
Nisell J, Lindsjö A, Temnerud J (2007) Rikstäckande virtuellt vattendrags nätverk för flödesbaserad modellering ViVaN (in Swedish with an English summary). Department of Aquatic Science and Assessment. Swedish University of Agricultural Sciences. Uppsala. Sveriges Lantbruksuniversitet Rep 17
Paranaíba JR, Barros N, Mendonça R et al (2018) Spatially resolved measurements of CO₂ and CH₄ concentration and gas-exchange velocity highly influence carbon-emission estimates of reservoirs. Environ Sci Technol 52:607–615. https://doi.org/10.1021/acs.est.7b05138
Article CAS PubMed Google Scholar
Piñeiro G, Perelman S, Guerschman JP, Paruelo JM (2008) How to evaluate models: observed vs. predicted or predicted vs. observed? Ecol Modell 216:316–322. https://doi.org/10.1016/j.ecolmodel.2008.05.006
Article Google Scholar
Podgrajsek E, Sahlée E, Rutgersson A (2015) Diel cycle of lake-air CO₂ flux from a shallow lake and the impact of waterside convection on the transfer velocity. J Geophys Res Biogeosci. https://doi.org/10.1002/2014JG002781.Received
Article Google Scholar
Poissant L, Amyot M, Pilote M, Lean D (2000) Mercury water–air exchange over the upper St. Lawrence River and Lake Ontario. Environ Sci Technol 34:3069–3078. https://doi.org/10.1021/es990719a
Article CAS Google Scholar
Polsenaere P, Deborde J, Detandt G et al (2013) Thermal enhancement of gas transfer velocity of CO₂ in an Amazon floodplain lake revealed by eddy covariance measurements. Geophys Res Lett 40:1734–1740. https://doi.org/10.1002/grl.50291
Article CAS Google Scholar
Poulain AJ, Orihel DM, Amyot M et al (2006) Relationship between the loading rate of inorganic mercury to aquatic ecosystems and dissolved gaseous mercury production and evasion. Chemosphere 65:2199–2207. https://doi.org/10.1016/j.chemosphere.2006.05.066
Article CAS PubMed Google Scholar
Prairie YT, Giorgio PA (2013) A new pathway of freshwater methane emissions and the putative importance of microbubbles. Inl waters 3:311–320. https://doi.org/10.5268/IW-3.3.542
Article CAS Google Scholar
R Core Team (2019) R: a language and environment for statistical computing. https://www.r-project.org. Accessed 03 Apr 2020
Rantakari M, Heiskanen J, Mammarella I et al (2015) Different apparent gas exchange coefficients for CO₂ and CH₄: comparing a brown-water and a clear-water lake in the boreal zone during the whole growing season. Environ Sci Technol 49:11388–11394. https://doi.org/10.1021/acs.est.5b01261
Article CAS PubMed Google Scholar
Raymond PAP, Hartmann J, Lauerwald R et al (2013) Global carbon dioxide emissions from inland waters. Nature 503:355–359. https://doi.org/10.1038/nature12760
Article CAS PubMed Google Scholar
Read JS, Hamilton DP, Desai AR et al (2012) Lake-size dependency of wind shear and convection as controls on gas exchange. Geophys Res Lett 39:L09405. https://doi.org/10.1029/2012GL051886
Article Google Scholar
Rohatgi A (2019) WebPlotDigitizer 4.2. https://automeris.io/WebPlotDigitizer Accessed 16 Apr 2019
Rosentreter JA, Maher DT, Ho DT et al (2017) Spatial and temporal variability of CO₂ and _CH4 gas transfer velocities and quantification of the CH₄ microbubble flux in mangrove dominated estuaries. Limnol Oceanogr 62:561–578. https://doi.org/10.1002/lno.10444
Article CAS Google Scholar
Scherbaum CA, Ferreter JM (2009) Estimating statistical power and required sample sizes for organizational research using multilevel modeling. Organ Res Methods 12:347–367. https://doi.org/10.4135/9780857028228
Article Google Scholar
Schilder J, Bastviken D, Van Hardenbroek M et al (2013) Spatial heterogeneity and lake morphology affect diffusive greenhouse gas emission estimates of lakes. Geophys Res Lett 40:5752–5756. https://doi.org/10.1002/2013GL057669
Article CAS Google Scholar
Schilder J, van Hardenbroek M, Bastviken D, Heiri O (2016) Spatio-temporal patterns in methane flux and piston velocity at low wind speed: implications for upscaling studies on small lakes. J Geophys Res Biogeosci. https://doi.org/10.1002/2016JG003346
Article Google Scholar
Sebacher DI, Harriss RC, Bartlett KB (1983) Methane flux across the air-water interface: air velocity effects. Tellus B 35B:103–109. https://doi.org/10.1111/j.1600-0889.1983.tb00014.x
Article CAS Google Scholar
Sollberger S, Wehrli B, Schubert CJ et al (2017) Minor methane emissions from an Alpine hydropower reservoir based on monitoring of diel and seasonal variability. Environ Sci Process Impacts 19:1278–1291. https://doi.org/10.1039/c7em00232g
Article CAS PubMed Google Scholar
Soued C, del Giorgio PA, Maranger R (2015) Nitrous oxide sinks and emissions in boreal aquatic networks in Québec. Nat Geosci. https://doi.org/10.1038/ngeo2611
Article Google Scholar
Soumis N, Canuel R, Lucotte M (2008) Evaluation of two current approaches for the measurement of carbon dioxide diffusive fluxes from lentic ecosystems. Environ Sci Technol 42:2964–2969. https://doi.org/10.1021/es702361s
Article CAS PubMed Google Scholar
Southworth G, Lindberg S, Hintelmann H et al (2007) Evasion of added isotopic mercury from a northern temperate lake. Environ Toxicol Chem 26:53–60. https://doi.org/10.1897/06-148r.1
Article CAS PubMed Google Scholar
Stepanenko V, Mammarella I, Ojala A et al (2016) LAKE 2.0: a model for temperature, methane, carbon dioxide and oxygen dynamics in lakes. Geosci Model Dev 9:1977–2006. https://doi.org/10.5194/gmd-9-1977-2016
Article CAS Google Scholar
Tan Z, Zhuang Q (2015) Methane emissions from pan-Arctic lakes during the 21st century: an analysis with process-based models of lake evolution and biogeochemistry. J Geophys Res Biogeosci 120:1–13. https://doi.org/10.1002/2015JG003184
Article CAS Google Scholar
Theall KP, Scribner R, Broyles S et al (2011) Impact of small group size on neighbourhood influences in multilevel models. J Epidemiol Community Heal 65:688–695. https://doi.org/10.1136/jech.2009.097956
Article Google Scholar
Torgersen T, Mathieu G, Hesslein RH, Broecker WS (1982) Gas exchange dependency on diffusion coefficient: direct 222Rn and 3He comparisons in a small lake. J Geophys Res 87:546–556. https://doi.org/10.1029/JC087iC01p00546
Article Google Scholar
Upstill-Goddard RC, Watson AJ, Liss PS, Liddicoat MI (1990) Gas transfer velocities in lakes measured with SF6. Tellus B 42:364–377. https://doi.org/10.1034/j.1600-0889.1990.t01-3-00006.x
Article Google Scholar
Vachon D, Langenegger T, Donis D et al (accepted) Methane emission offsets carbon dioxide uptake in a small productive lake. Limnol Oceanogr Lett
Vachon D, Prairie YT (2013) The ecosystem size and shape dependence of gas transfer velocity versus wind speed relationships in lakes. Can J Fish Aquat Sci 70:1757–1764. https://doi.org/10.1139/cjfas-2013-0241
Article Google Scholar
Vachon D, Prairie YT, Cole JJ (2010) The relationship between near-surface turbulence and gas transfer velocity in freshwater systems and its implications for floating chamber measurements of gas exchange. Limnol Oceanogr 55:1723–1732. https://doi.org/10.4319/lo.2010.55.4.1723
Article CAS Google Scholar
Venäläinen A, Sahlgren V, Podsechin V, Huttula T (2003) Small-scale variability of the wind field over a typical Scandinavian lake. Boreal Environ Res 8:71–81
Google Scholar
Verpoorter C, Kutser T, Seekell DA, Tranvik LJ (2014) A global inventory of lakes based on high-resolution satellite imagery. Geophys Res Lett. https://doi.org/10.1002/2014GL060641
Article Google Scholar
Wanninkhof R (2014) Relationship between wind speed and gas exchange over the ocean revisited. Limnol Oceanogr Methods 12:351–362. https://doi.org/10.4319/lom.2014.12.351
Article Google Scholar
Wanninkhof R, Asher WE, Ho DT et al (2009) Advances in quantifying air-sea gas exchange and environmental forcing. Ann Rev Mar Sci 1:213–244. https://doi.org/10.1146/annurev.marine.010908.163742
Article PubMed Google Scholar
Wanninkhof R, Ledwell J, Crusius J (1991) Gas transfer velocities on lakes measured with sulfur hexafluoride. In: Wilhelms SC, Gulliver JS (eds) Proceedings of the Second International Symposium on Gas Transfer at Water Surfaces. American Society of Civil Engineers, New York, pp 441–458
Wanninkhof R, Ledwell JR, Broecker WS (1985) Gas exchange-wind speed relation measured with sulfur hexafluoride on a lake. Science (80- ) 227:1224–1226. https://doi.org/10.1126/science.227.4691.1224
Article CAS Google Scholar
Wanninkhof R, Ledwell JR, Broecker WS, Hamilton M (1987) Gas exchange on Mono Lake and Crowley Lake, California. J Geophys Res Ocean 92:14567–14580. https://doi.org/10.1029/JC092iC13p14567
Article CAS Google Scholar
Witherspoon PA, Saraf DN (1965) Diffusion of methane, ethane, propane, and n-butane in water from 25 to 43°. J Phys Chem 69:3752–3755. https://doi.org/10.1021/j100895a017
Article CAS Google Scholar
Xiao S, Yang H, Liu D et al (2014) Gas transfer velocities of methane and carbon dioxide in a subtropical shallow pond. Tellus, Ser B Chem Phys Meteorol. https://doi.org/10.3402/tellusb.v66.23795
Article Google Scholar
Yu SL, Tuffey TJ, Lee D-S (1977) Atmospheric reaeration in a lake. Water problems in an urbanizing state. Rutgers. The State University of New Jersey. Water Resources Research Institute. 1–50
Zappa CJ, McGillis WR, Raymond PA et al (2007) Environmental turbulent mixing controls on air-water gas exchange in marine and aquatic systems. Geophys Res Lett 34:1–6. https://doi.org/10.1029/2006GL028790
Article CAS Google Scholar
Zwart JA, Hanson ZJ, Vanderwall J et al (2018) Spatially explicit, regional-scale simulation of lake carbon fluxes. Global Biogeochem Cycles 32:1276–1293. https://doi.org/10.1002/2017GB005843
Article CAS Google Scholar

Download references

Acknowledgements

Open access funding provided by Umea University. This paper is based on research supported by the Knut and Alice Wallenberg Foundation (dnr: 2016.0083). Dominic Vachon was also supported by postdoctoral fellowships from the Natural Sciences and Engineering Research Council of Canada. We thank Anders Jonsson, Werner Eugster, Erik Sahlee, Eva Podgrajsek, and Kenneth Thoro-Martinsen for providing original published data. We thank Rik Wanninkhof, Peter Liss and one anonymous reviewer for constructive comments on a previous version of this manuscript.

Author information

Authors and Affiliations

Department of Ecology and Environmental Science, Umeå University, Umeå, Sweden
Marcus Klaus & Dominic Vachon
Department of Forest Ecology and Management, Swedish University of Agricultural Sciences, Umeå, Sweden
Marcus Klaus

Authors

Marcus Klaus
View author publications
You can also search for this author in PubMed Google Scholar
Dominic Vachon
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marcus Klaus.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary file1 (Docx 831 kb)

Supplementary file2 (txt 2 kb)

Supplementary file3 (R 10 kb)

Supplementary file4 (RData 95 kb)

Supplementary file5. Online Resources Table S1 (XLSX 395 kb)

Supplementary file6. Online Resources Table S2 (XLSX 40 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Klaus, M., Vachon, D. Challenges of predicting gas transfer velocity from wind measurements over global lakes. Aquat Sci 82, 53 (2020). https://doi.org/10.1007/s00027-020-00729-9

Download citation

Received: 18 October 2019
Accepted: 25 April 2020
Published: 01 May 2020
DOI: https://doi.org/10.1007/s00027-020-00729-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Challenges of predicting gas transfer velocity from wind measurements over global lakes

Abstract

Similar content being viewed by others

Introduction

Materials and methods

Data compilation and standardization

General analytical approach

Evaluating available wind-based k600 models

Parametrizing new wind-based k600 models

Results

Data set

General performance of wind-based k600 models

Predicting variability in model performance

Factors influencing wind-based k600 model performance

New k600 model parametrizations

Discussion

How well can wind speed predict k600 over global lakes?

Functional shape of wind-based k600 parametrizations

Modulators of wind-speed effect on k600

Performance and applicability of new parametrization on global lakes

Limitations and way forward for better wind-based k600 models

Implications and conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

Evaluating available wind-based k₆₀₀ models

Parametrizing new wind-based k₆₀₀ models

General performance of wind-based k₆₀₀ models

Factors influencing wind-based k₆₀₀ model performance

New k₆₀₀ model parametrizations

How well can wind speed predict k₆₀₀ over global lakes?

Functional shape of wind-based k₆₀₀ parametrizations

Modulators of wind-speed effect on k₆₀₀

Limitations and way forward for better wind-based k₆₀₀ models