Predicting the ripening time of ‘Hass’ and ‘Shepard’ avocado fruit by hyperspectral imaging

Han, Yifei; Bai, Shahla Hosseini; Trueman, Stephen J.; Khoshelham, Kourosh; Kämper, Wiebke

doi:10.1007/s11119-023-10022-y

Open access
Published: 18 April 2023

Volume 24, pages 1889–1905, (2023)

Precision Agriculture

Yifei Han¹,²,
Shahla Hosseini Bai³,
Stephen J. Trueman³,
Kourosh Khoshelham⁴ &
Wiebke Kämper³ (ORCID: orcid.org/0000-0002-8646-4492)

Abstract

Predicting the ripening time of avocado fruit accurately could improve fruit storage and decrease food waste. No reasonable method exists for predicting the postharvest ripening time of avocado fruit during transport, storage or retail display. Here, hyperspectral imaging ranging from 388 to 1005 nm with 462 bands was applied to 316 ‘Hass’ and 160 ‘Shepard’ mature, unripe avocado fruit to predict how many days it took for individual fruit to become ripe. Three models were developed using partial least squares regression (PLSR), deep convolutional neural network (DCNN) regression and DCNN classification. Our PLSR models provided coefficients of determination (R²) of 0.76 and 0.50 and root mean squared errors (RMSE) of 1.20 and 1.13 days for ‘Hass’ and ‘Shepard’ fruit, respectively. The DCNN-based regression models produced similar results with R² of 0.77 and 0.59, and RMSEs of 1.43 and 0.94 days for ‘Hass’ and ‘Shepard’ fruit, respectively. The prediction accuracies and RMSEs from DCNN classification models, respectively, were 67.28% and 1.52 days for ‘Hass’ and 64.06% and 1.03 days for ‘Shepard’. Our study demonstrates that the spectral reflectance of the skin of mature, unripe ‘Hass’ and ‘Shepard’ fruit provides adequate information to predict ripening time and, thus, has the potential to improve postharvest processing and reduce postharvest losses of avocado fruit.

Introduction

The quality of food can change along the supply chain from harvest through to processing, quality assessment, and retail display for consumers (Parfitt et al., 2010). Shelf life and ripening time vary among individual fruit and vegetables, and so techniques that evaluate the ripening time of individual fruit are valuable for determining postharvest storage and handling strategies. A total of 14% of food is lost between harvest and retail globally (FAO, 2019). Sorting fresh fruit into homogeneous classes with a similar ripening time could help to reduce food waste and increase customer satisfaction.

Hyperspectral imaging (HSI) has been popular as a non-destructive technology for predicting shelf life or estimating the maturity of many fruits, where partial least squares regression (PLSR) models have been commonly applied to meet these goals (Rajkumar et al., 2012; Wei et al., 2014; Pu et al., 2016; Li et al., 2018). However, HSI generates high-dimensional data that are difficult to analyse because of their high computational complexity (Han et al., 2020; Huang et al., 2014). Deep learning techniques such as deep convolutional neural networks (DCNN) can deal with this complexity and have been used in feature learning and feature extraction from images (Bengio et al., 2013; Schmidhuber, 2015). Various deep learning techniques applied to hyperspectral imagery have been used to detect disease, recognise food and drink images, and estimate nut and meat quality (Mezgec & Koroušić Seljak, 2017; Ma et al., 2018; Han et al., 2020; Liu et al., 2020). However, DCNN approaches have found little application, thus far, in determining the ripening time of fruit (Steinbrener et al., 2019; Garillos-Manliguez & Chiang, 2021).

Global market demand for avocado products has increased exponentially in recent years (New Zealand Avocado, 2022). However, avocado fruit are highly perishable when compared with many other fruits, and the inability to predict avocado ripening time is one factor that leads to consumer dissatisfaction and fruit loss (Gamble et al., 2010; Perkins et al., 2020; Kämper et al., 2020). The maturity of avocado fruit can be assessed based on dry matter concentration. Fruit with low dry matter concentration do not ripen into edible fruit whereas fruit with high dry matter concentration ripen quickly (Sivakumar et al., 2011). Dry matter concentrations of individual fruit can vary greatly within the same farm or the same tree depending on, for example, the location of the fruit in the tree canopy (Alcobendas et al., 2013; Carvalho et al., 2015), affecting the ripening time of mature fruit after harvest.

This study aimed to use two avocado cultivars, ‘Hass’ and ‘Shepard’, to determine the potential of hyperspectral imaging for rapidly predicting the ripening time of mature, unripe avocado fruit. Specifically, PLSR models, DCNN regression models, and DCNN classification models were developed to predict the ripening time of ‘Hass’ and ‘Shepard’ avocado fruit. The performances of DCNN regression and classification models were compared with traditional PLSR models to identify the best approach. This is the first study, to our knowledge, to examine the combination of DCNN and HSI for predicting the ripening time of avocado fruit. Rapid fruit ripening assessment can help to decrease postharvest fruit loss.

Materials and methods

Sample collection and preparation

Avocado fruit were harvested in dry weather from a commercial avocado orchard (25°08′19″ S 152°15′48″ E) near Childers, Queensland, Australia. A total of 160 ‘Hass’ and 160 ‘Shepard’ fruit were harvested on 15 April 2019 and another 156 ‘Hass’ fruit on 10 June 2019. All fruit were harvested from trees in a large ‘Hass’ block that included a single row of ‘Shepard’ trees. On each collection date, the fruit were harvested from three to five different trees per sampled cultivar and kept in the shade until they were moved, on the same day, into a cold room at 4 °C for ‘Hass’ and 7 °C for ‘Shepard’, according to the recommended temperature standards for these two cultivars (Kämper et al., 2020; Ledger et al., 2016). ‘Hass’ and ‘Shepard’ fruit were stored in the cold room for 8 days and 7 days, respectively. All fruit were then moved to room temperature (21 °C), imaged, and kept at room temperature to allow the onset of ripening. The ripeness of each fruit was inspected daily and confirmed by measuring skin firmness with a handheld sclerometer (8 mm head; Lutron Electronic Model: FR-5120). A fruit was considered ripe when the maximum force needed to press the sclerometer tip 1 mm deep was < 15 N (Smith et al., 1997; Flitsanov et al., 2000; Hofman et al., 2013). The day of full ripeness was recorded.

Imaging system and spectral profile extraction

An image of the skin on one side of the avocado fruit was captured using a hyperspectral imaging system. All images were captured with a 12-bit line scanner camera (Pika XC2, USA) containing a lens with 23 mm focal length and four current-controlled wide-spectrum quartz-halogen lights. The spectral resolution of the camera was 1.36 nm, resulting in 462 bands between 388 and 1005 nm. Each fruit was placed on a black tray on a translation stage moving at 1.23 mm s⁻¹ and the exposure time was set to 19.4 ms. The DCNN regression and classification were based on the full images but, for PLSR, the HSI data were then exported using Spectronon Pro software package (Version 2.112) by Resonon. The raw reflectance (${R}_{0}$) of each hyperspectral image was extracted by marking a region of interest (ROI), which excluded the fruit centre that had intense light reflection. The corrected relative reflectance $\left(R\right)$ was calculated within Spectronon using Eq. 1:

$$\begin{array}{c}R=\frac{R_0-D}{W-D}\end{array}$$

(1)

where $D$ is the reflectance of a reference dark image (camera lens covered) and $W$ is the reflectance of a white Teflon sheet that reflects around 99% of incident light (Ariana et al., 2006). This method helped to correct the spectral curve of the fruit surface. The 100% reflectivity was scaled to 10,000 (integers) by default.
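The correction in Eq. 1 can be sketched in a few lines of NumPy. This is only an illustration of the formula, not the Spectronon implementation: the function name, the toy three-band values and the guard against a zero denominator are assumptions added here, while the scaling of 100% reflectance to 10,000 follows the convention stated above.

```python
import numpy as np

def correct_reflectance(raw, dark, white, scale=10_000):
    """Apply Eq. 1, R = (R0 - D) / (W - D), element by element.

    `scale` maps 100% reflectance to 10,000, as described in the text.
    """
    raw = np.asarray(raw, dtype=float)
    dark = np.asarray(dark, dtype=float)
    white = np.asarray(white, dtype=float)
    denom = white - dark
    # Assumed safeguard: mark pixels where white == dark as missing.
    denom = np.where(denom == 0, np.nan, denom)
    return scale * (raw - dark) / denom

# Toy three-band spectrum (hypothetical sensor counts).
dark = np.array([100.0, 100.0, 100.0])
white = np.array([4100.0, 4100.0, 4100.0])
raw = np.array([100.0, 2100.0, 4100.0])
corrected = correct_reflectance(raw, dark, white)
print(corrected)
```

In this toy case a raw value equal to the dark reference maps to 0 and a raw value equal to the white reference maps to 10,000.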

Partial least squares regression (PLSR)

The ROI of each fruit was treated as a sample when PLSR was applied. The spectral outliers in the samples, if any, were detected and removed using a Hotelling’s T² test within a 95% confidence level (Morellos et al., 2016). The reflectance data were divided into calibration sets and test sets using the Kennard-Stone algorithm (‘Hass’: N = 256 for the calibration set and N = 60 for the test set; ‘Shepard’: N = 128 for the calibration set and N = 32 for the test set). Spectral data transformations such as Savitzky-Golay first, second and third derivatives, standard normal variate (SNV), orthogonal signal correction (OSC) and multiplicative scatter correction (MSC) were performed on the calibration sets. These transformations aim to reduce undesired effects such as light scattering or uncontrolled external factors (Rinnan et al., 2009; Bai et al., 2018). PLSR models were developed using both the raw and transformed spectral data to correlate the number of days until ripe with the relative reflectance measured in the full spectral range (Wold et al., 2001). Models were developed with the raw number of days and the log-transformed number of days until ripe, and the model with the better fit was selected. The root mean square error (RMSE) values were exponentiated if the best model was based on the log-transformed number of days until ripe, so that the RMSE values could be interpreted and compared more easily with the DCNN regression and classification models. The models were cross-validated using a 10-fold cross-validation technique. The coefficient of determination (R²) and RMSE were used as assessment metrics (Guo et al., 2021). For more detail on the PLSR specifics and calculations, see Tahmasbian et al. (2017, 2018a). After finding the best transformation for the data set, the number of spectral wavelengths was reduced stepwise by leaving the wavelengths with low β-coefficients out of the model (Tahmasbian et al., 2017). 
Removing wavelengths with low β-coefficients was continued until the model fit decreased. Removing unimportant wavelengths can facilitate the computation of the model and increase its accuracy (Wold et al., 1996; Kamruzzaman et al., 2012; Tahmasbian et al., 2018b). Then the ratio of prediction to deviation (RPD) was calculated using Eq. 2:

$$\begin{array}{c}RPD=\frac{{SD}_t}{{RMSE}_t}\end{array}$$

(2)

where ${SD}_{t}$ is the standard deviation of the observed values and ${RMSE}_{t}$ is the root mean square error of the prediction, both from the test set (subscript $t$). Transformations, outlier detection and removal, and all parts of model development were performed with Unscrambler software (CAMO, Norway, Version: 10.5.1).
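Three of the steps described in this section (the SNV transformation, the Kennard-Stone split and the RPD of Eq. 2) can be illustrated with a short NumPy sketch. It is not the Unscrambler implementation: the function names and toy data are hypothetical, and the use of the sample standard deviation (`ddof=1`) is an assumption, because the text does not state which form was used.

```python
import numpy as np

def snv(spectrum):
    """Standard normal variate: centre and scale a single spectrum."""
    s = np.asarray(spectrum, dtype=float)
    return (s - s.mean()) / s.std(ddof=1)

def kennard_stone(X, n_select):
    """Return indices of n_select rows of X chosen by the Kennard-Stone rule."""
    X = np.asarray(X, dtype=float)
    # Pairwise Euclidean distances between all samples.
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    # Start with the two most distant samples.
    i, j = np.unravel_index(np.argmax(dist), dist.shape)
    selected = [int(i), int(j)]
    remaining = [k for k in range(len(X)) if k not in selected]
    while len(selected) < n_select:
        # Distance from each candidate to its nearest selected sample.
        min_d = dist[np.ix_(remaining, selected)].min(axis=1)
        pick = remaining[int(np.argmax(min_d))]
        selected.append(pick)
        remaining.remove(pick)
    return selected

def rpd(observed, predicted):
    """Eq. 2: standard deviation of observations divided by RMSE."""
    observed = np.asarray(observed, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    rmse = np.sqrt(np.mean((observed - predicted) ** 2))
    return observed.std(ddof=1) / rmse

X = [[0.0], [1.0], [2.0], [10.0]]          # toy one-band "spectra"
calibration_idx = kennard_stone(X, 3)
ratio = rpd([2, 4, 6, 8], [3, 4, 5, 8])
print(calibration_idx, round(ratio, 3))
```

The Kennard-Stone rule spreads the calibration samples across the spectral space, so the samples left for the test set lie inside the range covered by the calibration set.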

Deep convolutional neural network (DCNN)

Separate regression and classification models based on DCNN were developed for ‘Hass’ and ‘Shepard’ fruit. One model per cultivar was designed for each method because the two cultivars have different physical and chemical properties. All DCNNs used in our study were trained on the University of Melbourne’s SPARTAN HPC system (Lafayette et al., 2016).

Data augmentation and preprocessing

The datasets were preprocessed due to size limitations before training the DCNNs. The size of each HSI was over 1000 × 1000 pixels and each HSI had 462 bands, which was too large as input for a DCNN. Principal component analysis (PCA) was performed to reduce the dimensionality of the data (Khoshelham & Oude Elberink, 2012; Sifre & Mallat, 2013; Sun et al., 2019). The first six principal components (PC), which contained over 99.5% of the overall information of each HSI, were selected (Fig. 1).
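The dimensionality reduction described above can be sketched as a PCA over the band axis of a single hyperspectral cube. This is a minimal illustration under stated assumptions: the function name, the synthetic random cube and the eigen-decomposition route are choices made here, and the software actually used for PCA is not specified in the text.

```python
import numpy as np

def pca_reduce(cube, n_components=6):
    """Project an (H, W, B) cube onto its first principal components."""
    h, w, b = cube.shape
    pixels = cube.reshape(-1, b).astype(float)
    pixels -= pixels.mean(axis=0)            # centre each band
    # Eigen-decomposition of the B x B band covariance matrix.
    cov = np.cov(pixels, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1][:n_components]
    scores = pixels @ eigvecs[:, order]
    explained = eigvals[order].sum() / eigvals.sum()
    return scores.reshape(h, w, n_components), explained

rng = np.random.default_rng(0)
cube = rng.normal(size=(20, 20, 30))         # synthetic stand-in for an HSI
reduced, explained = pca_reduce(cube)
print(reduced.shape, explained)
```

On real reflectance data, where neighbouring bands are strongly correlated, the explained fraction of the leading components is far higher than on this random cube.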

The reflectivity of the whole surface of each fruit was not homogenous due to the illumination condition (Fig. 1). Therefore, each HSI was segmented into smaller sub-images to ensure that the DCNN was trained with sample images exhibiting some variance, and to enable a robust and reliable prediction. An image segmentation method was applied to exclude the tray as background because the spectral reflectance of the tray did not differ significantly from those of the fruit surface in some wavelength bands (Supplementary Fig. 1). The top left corner point of each sub-image was fixed and then these sub-images were cropped out of the raw HSI to eliminate background interruption. A total of 12 sub-images from each ‘Hass’ image and 8 sub-images from each ‘Shepard’ image were finally cropped out due to the different shapes of the cultivars (Fig. 2). The size of each sub-image was set to 150 × 150 pixels. The spectral reflectance of all sub-images was normalized to the interval of [0, 1] based on the maximum (150,000) and minimum (− 150,000) pixel values. This batch processing method avoided the interruption of inconsistent normalization standards between different sub-images. In total, 3792 sub-images for ‘Hass’ and 1280 sub-images for ‘Shepard’ were obtained, and all of these sub-images were randomly split into training (70%), validation (20%), and test (10%) sets (Supplementary Table 1).
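The cropping, fixed-limit normalization and random split can be sketched as follows. The corner positions, image size and function names are hypothetical; only the 150 × 150 window, the limits of ±150,000 and the 70/20/10 proportions come from the text, and the actual set sizes are those reported in Supplementary Table 1.

```python
import numpy as np

PIXEL_MIN, PIXEL_MAX = -150_000.0, 150_000.0   # limits quoted in the text

def normalize(sub_image):
    """Map values from [PIXEL_MIN, PIXEL_MAX] onto [0, 1] with fixed limits."""
    arr = np.asarray(sub_image, dtype=float)
    return (arr - PIXEL_MIN) / (PIXEL_MAX - PIXEL_MIN)

def crop_sub_images(image, corners, size=150):
    """Cut size x size windows given their top-left (row, col) corners."""
    return [image[r:r + size, c:c + size, :] for r, c in corners]

def split_indices(n, rng, fractions=(0.7, 0.2, 0.1)):
    """Randomly split n sample indices into training, validation and test."""
    order = rng.permutation(n)
    n_train = int(round(fractions[0] * n))
    n_val = int(round(fractions[1] * n))
    return order[:n_train], order[n_train:n_train + n_val], order[n_train + n_val:]

rng = np.random.default_rng(0)
image = rng.uniform(PIXEL_MIN, PIXEL_MAX, size=(400, 400, 6))  # toy 6-PC image
corners = [(0, 0), (0, 150), (150, 0), (150, 150)]             # hypothetical
subs = [normalize(s) for s in crop_sub_images(image, corners)]
train_idx, val_idx, test_idx = split_indices(1280, rng)
print(len(subs), subs[0].shape, len(train_idx), len(val_idx), len(test_idx))
```

Using the same fixed limits for every sub-image, rather than per-image minima and maxima, keeps the scaled values comparable across sub-images.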

DCNN regression models

A DCNN for regression was developed to predict the number of days until each avocado fruit became fully ripe. The DCNNs for ‘Hass’ and ‘Shepard’ sub-images shared the same architecture except that their fully connected layers differed slightly (Fig. 3). All convolutional kernels were set to 1 × 1 to ensure that the DCNN could be trained sufficiently when the training samples were limited (Han et al., 2020). Selecting a large kernel size would extract and emphasize the edge, corner, or other spatial features of each sub-image, which may lead to inaccurate predictions because these visual properties of an avocado fruit surface cannot reflect the quality accurately (Vega Díaz et al., 2020).
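The effect of a 1 × 1 kernel can be illustrated without a deep learning framework: it applies the same linear map across channels at every pixel, so no neighbourhood information enters the output. The sketch below is not the network in Fig. 3. The 16 output channels, the Leaky ReLU slope of 0.01 and the random weights are assumptions made for illustration.

```python
import numpy as np

def conv1x1(x, weights, bias):
    """1 x 1 convolution: x is (H, W, C_in), weights is (C_in, C_out)."""
    return x @ weights + bias

def leaky_relu(x, alpha=0.01):
    """Leaky ReLU keeps a small non-zero output for negative inputs."""
    return np.where(x > 0, x, alpha * x)

rng = np.random.default_rng(0)
x = rng.uniform(size=(150, 150, 6))       # one normalized sub-image
w = rng.normal(size=(6, 16))              # 16 output channels is arbitrary
b = np.zeros(16)                          # biases initialised to zero
y = leaky_relu(conv1x1(x, w, b))

# Shuffle the pixel positions, apply the same layer, and compare:
# the outputs are the shuffled originals, so no spatial context is used.
perm = rng.permutation(150 * 150)
shuffled = x.reshape(-1, 6)[perm].reshape(150, 150, 6)
y_shuffled = leaky_relu(conv1x1(shuffled, w, b))
same = np.allclose(y_shuffled.reshape(-1, 16), y.reshape(-1, 16)[perm])
print(y.shape, same)
```

Because each output pixel depends only on the spectral values at that pixel, the layer has few parameters to fit, which is consistent with the stated aim of training on a limited number of samples.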

Each convolutional layer in the DCNN for regression was followed by a batch normalization layer except for the last layer of each cultivar. Batch normalization of layers helps to keep the input distribution consistent and accelerates the convergence (Cooijmans et al., 2017). Two dropout layers followed by fully connected layers were also inserted before the output layers to avoid overfitting (Bisong, 2019). All activation functions were set to Leaky ReLU instead of the widely used ReLU because Leaky ReLU performs better on small datasets and assigns the non-zero output to retain the feature information of the input (Xu et al., 2015; Zhang et al., 2017). The two DCNN regression models built for ‘Hass’ and ‘Shepard’ sets were both trained using the same hyperparameters but different learning rates and batch sizes, which were experimentally set to 0.0002 and 128 for ‘Hass’ and 0.0005 and 96 for ‘Shepard’ avocado fruit, respectively (Supplementary Table 2). The initialized biases of our DCNNs were both set to zero and our initialized weights were set to be ‘Glorot_uniform’.

The Sigmoid function was chosen as the activation function for the output layer because all the inputs were normalized to the range of 0–1. Mean squared error (MSE) (Eq. 3) was chosen as the loss function for both regression models. A higher MSE means the prediction deviates more from the actual values.

$$\begin{array}{c}MSE=\frac{1}{n}\sum_{i=1}^{n}{(y_i-\widehat{y_i})}^2\end{array}$$

(3)

where $n$ is the number of samples, ${y}_{i}$ is the measured value of the $i$th sample, and $\widehat{{y}_{i}}$ is the predicted value of the $i$th sample.

For further assessment, R² and RMSE of DCNN regression results were also computed to consistently compare with PLSR results.
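The three regression metrics can be computed as in the sketch below. The toy ripening times are hypothetical, and R² is computed here as one minus the ratio of residual to total sum of squares, which is an assumption because the text does not state which definition was used.

```python
import math

def mse(y_true, y_pred):
    """Mean squared error over n samples (Eq. 3)."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root mean squared error, in the same unit as the labels (days)."""
    return math.sqrt(mse(y_true, y_pred))

def r_squared(y_true, y_pred):
    """Coefficient of determination as 1 - SS_res / SS_tot (assumed form)."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

days_true = [4, 6, 8, 10, 12]     # toy observed days until ripe
days_pred = [5, 6, 7, 10, 13]     # toy predictions
print(mse(days_true, days_pred), rmse(days_true, days_pred),
      r_squared(days_true, days_pred))
```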

DCNN classification models

A classification model would be appropriate if the end users prefer an estimate in the form of a discrete variable or category label such as ‘unripe’ and ‘fully ripe’. One DCNN classification model was designed for each cultivar (Fig. 4) to test whether classification performs better than regression. All inputs were normalized to the interval [0, 1] as in the DCNNs for regression. Each of the convolutional layers was followed by a batch normalization layer and all activation functions were chosen to be Leaky ReLU except for the last output layer, which used the Softmax function instead. These two DCNNs were also trained using different batch sizes (Supplementary Table 3) and all ‘Hass’ and ‘Shepard’ fruit were classified into 14 and 7 categories, respectively, representing the number of days until ripe (Supplementary Table 4).

For both DCNN classification models, categorical cross-entropy (CCE) was used as the loss function. CCE represents the difference between two probability distributions and is defined by Eq. 4 (West & O’Shea, 2017):

$$\begin{array}{c}CCE=-\frac{1}{n}\sum\nolimits_{i=1}^{n}\sum\nolimits_{j=1}^{m}\left(y_{\left(i,j\right)}\times \log\left({\widehat y}_{\left(i,j\right)}\right)\right)\end{array}$$

(4)

where $n$ is the number of samples, $m$ is the number of one-hot codes for each class, which equals the number of categories for each dataset, ${y}_{\left(i,j\right)}$ refers to the $j$th binary value of the $i$th sample and ${\widehat{y}}_{\left(i,j\right)}$ refers to the predicted value of ${y}_{\left(i,j\right)}$.
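A small worked example of Eq. 4 may help. With one-hot labels, only the probability assigned to the true class contributes to the loss, so the two hypothetical predictions below receive the same CCE although one places its remaining probability on the neighbouring class and the other on a distant class. The function names, the seven-class toy case and the clipping constant `eps` are assumptions.

```python
import math

def one_hot(label, n_classes):
    """Encode a class index as a one-hot list of length n_classes."""
    return [1.0 if j == label else 0.0 for j in range(n_classes)]

def cce(y_true, y_prob, eps=1e-12):
    """Mean categorical cross-entropy over samples (Eq. 4)."""
    total = 0.0
    for truth, prob in zip(y_true, y_prob):
        # eps avoids log(0); terms with a zero label contribute nothing.
        total -= sum(t * math.log(max(p, eps)) for t, p in zip(truth, prob))
    return total / len(y_true)

truth = [one_hot(2, 7)]                               # true class index 2
near = [[0.0, 0.0, 0.6, 0.4, 0.0, 0.0, 0.0]]          # rest on class 3
far = [[0.0, 0.0, 0.6, 0.0, 0.0, 0.0, 0.4]]           # rest on class 6
print(cce(truth, near), cce(truth, far))
```

Both calls return −log(0.6), which is why a loss of this form does not distinguish small from large errors in days.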

Metrics such as overall accuracy, precision, recall, and F1-score are commonly used to evaluate classification performance, but they cannot properly reflect the discrepancy (in days) between the predicted ripening time and the ground truth value (GT). RMSE was therefore used in addition to accuracy to measure the size of the prediction error, which also enabled a better comparison between our classification models and the regression models.
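The difference between accuracy and RMSE for class predictions expressed in days can be seen in a toy case. The ripening times below are hypothetical: both sets of predictions contain exactly one misclassified fruit, so their accuracies are equal, but the RMSE separates a miss of 1 day from a miss of 4 days.

```python
import math

def accuracy(true_days, pred_days):
    """Fraction of fruit assigned to exactly the right class."""
    hits = sum(t == p for t, p in zip(true_days, pred_days))
    return hits / len(true_days)

def rmse_days(true_days, pred_days):
    """RMSE between predicted and true class labels, in days."""
    squared = sum((t - p) ** 2 for t, p in zip(true_days, pred_days))
    return math.sqrt(squared / len(true_days))

true_days = [6, 7, 8, 9]
pred_close = [6, 7, 9, 9]     # one miss, off by 1 day
pred_far = [6, 7, 12, 9]      # one miss, off by 4 days

print(accuracy(true_days, pred_close), rmse_days(true_days, pred_close))
print(accuracy(true_days, pred_far), rmse_days(true_days, pred_far))
```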

Results

Partial least squares regression (PLSR)

PLSR models that predicted the ripening time of ‘Hass’ avocado fruit with RPD ≥ 1.4 (Fig. 5a) were fitted. Models with an RPD value above 1.4 provide good predictions (Bellon-Maurel et al., 2010). The best-fit PLSR model for the ripening time of ‘Hass’ fruit provided R²_cal = 0.74 and RMSE_cal = 1.18, followed by R²_val = 0.68 and RMSE_val = 1.20 in the cross-validation using the OSC transformed dataset (Fig. 6). This model provided prediction abilities for the test set of R² = 0.76 and RPD = 1.82 after wavelength reduction. We could not fit a model with high prediction accuracy of the ripening time for ‘Shepard’ fruit (Fig. 5b). The best-fit PLSR model for ripening time of ‘Shepard’ provided R²_cal = 0.47 and RMSE_cal = 1.13 for calibration, and R²_val = 0.43 and RMSE_val = 1.13 in the cross-validation after the spectra were OSC transformed (Fig. 6). This model provided prediction abilities for the test set of R² = 0.50 and RPD = 1.13 after wavelength reduction.

Deep convolutional neural network (DCNN)

DCNN regression

The training loss and the validation loss curves converged after 5000 epochs in the DCNN regression models for both cultivars (Supplementary Fig. 2). The DCNN regression models provided R² of 0.77 and 0.59, and RMSE of 1.43 and 0.94 days for the ‘Hass’ and ‘Shepard’ test sets, respectively (Table 1, Fig. 6).

Table 1 Root mean square error (RMSE) and the coefficient of determination (R²) of predictions by two regression methods for ‘Hass’ and ‘Shepard’ avocado fruit

DCNN classification

The training and validation accuracies of DCNN classification stabilized after 5000 epochs for ‘Hass’ and ‘Shepard’ avocado fruit (Supplementary Fig. 3). Most samples were correctly classified although there was no correct prediction for some classes, like for the ground truth class label of 4 days to ripen in the ‘Hass’ test sets (Fig. 7). A similar problem occurred for ‘Shepard’ fruit that took more than 11 days to ripen (Fig. 7), with the RMSE of this class reaching a maximum of 3.00 (Table 2). However, this classification result was still acceptable because the mean RMSEs of both cultivars were below 2 days (Table 3).

Table 2 Root mean square error (RMSE) of each ripening time category for ‘Hass’ and ‘Shepard’ test sets

Table 3 Accuracies, categorical cross-entropy (CCE), and root mean square error (RMSE) of the DCNN classification model for ‘Hass’ and ‘Shepard’ avocado-fruit test sets

Discussion

PLSR and DCNN regressions showed different capabilities of prediction but both performed well, having low RMSE and high R² and, thus, providing high accuracy in predicting the ripening time of avocado fruit. Predicting the ripening time of ‘Hass’ and ‘Shepard’ avocado fruit is, therefore, possible using HSI and the spectral recognition based upon it. It was also found that DCNN-based classification can be used as an alternative approach to predict the ripening time of avocado fruit, although this method requires improvement.

The accuracy of predicting the ripening time of ‘Hass’ and ‘Shepard’ fruit was model-dependent. The R² in predicting the ripening time of ‘Hass’ fruit was similar using either PLSR or DCNN regression, but PLSR provided lower RMSE than did the DCNN regression model. In contrast, the DCNN regression model outperformed PLSR for the ‘Shepard’ fruit: its R² was about 0.1 higher and its RMSE about 0.2 days lower than those of PLSR (Table 1). Convolutional neural networks (CNN) provided higher accuracy than PLSR for predicting dry matter concentrations of mango fruit and for estimating the moisture and soluble solids concentrations of pear fruit (Mishra et al., 2021; Mishra & Passos, 2022). Foliar N concentrations of tomatoes and the geographical origins of narrow-leaved oleaster fruits have been predicted with similar accuracies using both CNN and PLSR on hyperspectral images (Gao et al., 2019; Pourdarbani et al., 2021). In our study, the prediction accuracy of DCNN regression was higher than that of PLSR only when applied to ‘Shepard’ avocado fruit. PLSR applied to ‘Hass’ provided a slightly more precise prediction than that from DCNN regression. Hence, our study revealed that regression performance was cultivar specific in avocados. Moreover, our best prediction of ripening time was more accurate than that of another study, which predicted ‘Hass’ avocado ripeness based on Vis-NIR spectroscopy (R² of 0.77 vs. 0.63; Melado-Herreros et al., 2021). Our approach also provided the specific time to ripen directly rather than through indirect indices.

R² and RMSE values for the ‘Hass’ test sets calculated by either PLSR or DCNN regression were higher than for ‘Shepard’. A larger sample size enables deep learning models to better fit nonlinear relationships (Nasir & Sassani, 2021). In our study, the sample size and the number of sub-images for ‘Hass’ were higher than for ‘Shepard’, potentially explaining the higher R² for ‘Hass’ compared with ‘Shepard’. Although a balanced dataset usually leads to superior regression performance, the wider range of labels of the ‘Hass’ training sets can explain the higher RMSE values for ‘Hass’ compared to ‘Shepard’. For example, the maximum error possible for a prediction on ‘Hass’ was 14 days (2–16 days) but the maximum error for ‘Shepard’ was only 6 days (6–12 days), and thus the influence of a balanced dataset was reduced due to the differences in data ranges. Each individual prediction could lead to higher RMSE values in ‘Hass’ compared to ‘Shepard’. Despite the poor performance on ‘Shepard’ avocados, the prediction on ‘Hass’ avocados was reliable, with the R² of all methods above the acceptable standard value of 0.66 (Williams et al., 2019; Posom et al., 2021; Wei et al., 2022).

Our classification results showed similar prediction accuracies between ‘Hass’ and ‘Shepard’, with accuracies for the test sets of 67.28% and 64.06%, respectively. The models can be improved in future studies, and will need to include samples from various locations, seasons and cultivars. Some studies have shown that, when the sample size is small and the distribution of values is unbalanced, the classification model provided superior prediction performance when compared with regression (Han et al., 2020). In our study, however, the sample size was large and the distribution of classes was rather balanced, which could explain why the prediction of ripening time by classification was less accurate than by regression. Additionally, the larger number of classes may have increased the complexity of the classification tasks, causing inaccurate results in the classification models (Mezgec & Koroušić Seljak, 2017). More importantly, the classification loss did not truly reflect the accuracy of prediction, either in training or in inference. For example, for a ground truth value of x days, a prediction of x + 1 and a prediction of x + 10 both had the same classification error. This resulted in poor classification performance because the model learned to minimize the loss rather than to make a prediction close to the true number of days. In our study, 14 and 7 classes were used for the ‘Hass’ and ‘Shepard’ cultivars, respectively. Hence, our classification study was more complex than previous studies (Han et al., 2020; Liu et al., 2020), which only had 3 or 4 categories. Our study confirmed that using regression would lead to higher prediction accuracy than would classification when high numbers of samples and classes exist.

Conclusion

Our study showed that combining HSI technologies with PLSR, DCNN regression, and DCNN classification is useful to predict the ripening time of ‘Hass’ and ‘Shepard’ avocado fruit. Our prediction straightforwardly presented the days to ripen, unlike most previous studies that focused on various indirect indices to estimate ripening stages. The predictions of ripening time had RMSEs between 0.94 and 1.52 days, regardless of method and cultivar. DCNN was applied for the first time to predict the days to ripen for avocado fruit and it provided acceptable prediction accuracy. Deep learning approaches were proven better compared with PLSR to predict avocado ripening time, and the prediction on ‘Hass’ avocado was more accurate than that of ‘Shepard’ avocado. A larger and more balanced dataset can help build a more reliable system to predict the ripening time of avocado. Our study highlights the strong potential of HSI to predict the ripening time of mature, unripe avocado fruit, which would allow processors and retailers to optimize the duration of fruit storage, select the most suitable timing for retail display, and minimize losses throughout the food supply chain.

Data availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

References

Alcobendas, R., Mirás-Avalos, J. M., Alarcón, J. J., & Nicolás, E. (2013). Effects of irrigation and fruit position on size, colour, firmness and sugar contents of fruits in a mid-late maturing peach cultivar. Scientia Horticulturae, 164, 340–347. https://doi.org/10.1016/j.scienta.2013.09.048.
Ariana, D. P., Lu, R., & Guyer, D. E. (2006). Near-infrared hyperspectral reflectance imaging for detection of bruises on pickling cucumbers. Computers and Electronics in Agriculture, 53(1), 60–70. https://doi.org/10.1016/j.compag.2006.04.001.
Bai, S. H., Tahmasbian, I., Zhou, J., Nevenimo, T., Hannet, G., Walton, D., Randall, B., Gama, T., & Wallace, H. M. (2018). A non-destructive determination of peroxide values, total nitrogen and mineral nutrients in an edible tree nut using hyperspectral imaging. Computers and Electronics in Agriculture, 151, 492–500. https://doi.org/10.1016/j.compag.2018.06.029.
Bellon-Maurel, V., Fernandez-Ahumada, E., Palagos, B., Roger, J. M., & Mcbratney, A. (2010). Critical review of chemometric indicators commonly used for assessing the quality of the prediction of soil attributes by NIR spectroscopy. Trends in Analytical Chemistry, 29(9), 1073–1081. https://doi.org/10.1016/j.trac.2010.05.006.
Bengio, Y., Courville, A., & Vincent, P. (2013). Representation learning: A review and new perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(8), 1798–1828. https://doi.org/10.1109/TPAMI.2013.50.
Bisong, E. (2019). Regularization for deep learning. In E. Bisong (Ed.), Building machine learning and deep learning models on google cloud platform (pp. 415–421). Apress. https://doi.org/10.1007/978-1-4842-4470-8_34
Carvalho, C. P., Bernal, E. J., Velásquez, M. A., & Cartagena, V. J. R. (2015). Fatty acid content of avocados (Persea americana Mill. cv. Hass) in relation to orchard altitude and fruit maturity stage. Agronomía Colombiana, 33(2), 220–227. https://doi.org/10.15446/agron.colomb.v33n2.49902.
Cooijmans, T., Ballas, N., Laurent, C., Gülçehre, Ç., & Courville, A. (2017). Recurrent batch normalization. 5th International Conference on Learning Representations, ICLR 2017—Conference Track Proceedings. https://openreview.net/pdf?id=r1VdcHcxx
FAO. Food and Agriculture Organization of the United Nations (2019). The State of Food and Agriculture https://www.fao.org/state-of-food-agriculture/2019/en
Flitsanov, U., Mizrach, A., Liberzon, A., Akerman, M., & Zauberman, G. (2000). Measurement of avocado softening at various temperatures using ultrasound. Postharvest Biology and Technology, 20(3), 279–286. https://doi.org/10.1016/S0925-5214(00)00138-1.
Gamble, J., Harker, F. R., Jaeger, S. R., White, A., Bava, C., Beresford, M., Stubbings, B., Wohlers, M., Hofman, P. J., & Marques, R. (2010). The impact of dry matter, ripeness and internal defects on consumer perceptions of avocado quality and intentions to purchase. Postharvest Biology and Technology, 57(1), 35–43. https://doi.org/10.1016/j.postharvbio.2010.01.001.
Gao, P., Xu, W., Yan, T., Zhang, C., Lv, X., & He, Y. (2019). Application of near-infrared hyperspectral imaging with machine learning methods to identify geographical origins of dry narrow-leaved oleaster (Elaeagnus angustifolia) fruits. Foods, 8(12), 620. https://doi.org/10.3390/foods8120620.
Garillos-Manliguez, C. A., & Chiang, J. Y. (2021). Multimodal deep learning and visible-light and hyperspectral imaging for fruit maturity estimation. Sensors (Basel, Switzerland), 21(4), 1–18. https://doi.org/10.3390/s21041288.
Guo, J., Zhang, J., Xiong, S., Zhang, Z., Wei, Q., Zhang, W., Feng, W., & Ma, X. (2021). Hyperspectral assessment of leaf nitrogen accumulation for winter wheat using different regression modeling. Precision Agriculture, 22(5), 1634–1658. https://doi.org/10.1007/s11119-021-09804-z.
Han, Y., Liu, Z., Khoshelham, K., & Bai, S. H. (2020). Quality estimation of nuts using deep learning classification of hyperspectral imagery. Computers and Electronics in Agriculture, 180, 105868. https://doi.org/10.1016/j.compag.2020.105868.
Hofman, P., Bower, J., & Woolf, A. (2013). Harvesting, packing, postharvest technology, transport and processing. In B. A. Schaffer, B. N. Wolstenholme, & A. W. Whiley (Eds.), The avocado: Botany, production and uses (2nd ed., pp. 489–540). CABI. https://doi.org/10.1079/9781845937010.0489
Huang, H., Liu, L., & Ngadi, M. O. (2014). Recent developments in hyperspectral imaging for assessment of food quality and safety. Sensors (Basel, Switzerland), 14(4), 7248–7276. https://doi.org/10.3390/s140407248.
Kämper, W., Trueman, S. J., Tahmasbian, I., & Bai, S. H. (2020). Rapid determination of nutrient concentrations in Hass avocado fruit by Vis/NIR hyperspectral imaging of flesh or skin. Remote Sensing, 12(20), 3409. https://doi.org/10.3390/rs12203409.
Kamruzzaman, M., ElMasry, G., Sun, D. W., & Allen, P. (2012). Prediction of some quality attributes of lamb meat using near-infrared hyperspectral imaging and multivariate analysis. Analytica Chimica Acta, 714, 57–67. https://doi.org/10.1016/j.aca.2011.11.037.
Khoshelham, K., & Oude Elberink, S. (2012). Role of dimensionality reduction in segment-based classification of damaged building roofs in airborne laser scanning data. In R.Q. Feitosa (Ed), Proceedings of GEOBIA 2012: 4th International Conference on Geographic Object-Based Image Analysis, Rio de Janeiro, Brazil (pp. 372–377). http://mtc-m16c.sid.inpe.br/col/sid.inpe.br/mtc-m18/2012/05.18.17.24/doc/103.pdf?linktype=relative
Lafayette, L., Sauter, G., Vu, L., & Meade, B. (2016). Spartan performance and flexibility: An hpc-cloud chimera. OpenStack Summit, Barcelona, Spain, 27. https://doi.org/10.4225/49/58ead90dceaaa
Ledger, S., Barker, L., Campbell, T., Hofman, P., & Marques, R. (2016). Avocado ripening manual. Retrieved February 27, 2023, from https://avocado.org.au/wp-content/uploads/2016/12/Avocado-Ripening-Manual.pdf
Li, X., Wei, Y., Xu, J., Feng, X., Wu, F., Zhou, R., Jin, J., Xu, K., Yu, X., & He, Y. (2018). SSC and pH for sweet assessment and maturity classification of harvested cherry fruit based on NIR hyperspectral imaging technology. Postharvest Biology and Technology, 143, 112–118. https://doi.org/10.1016/j.postharvbio.2018.05.003.
Liu, Z., Jiang, J., Qiao, X., Qi, X., Pan, Y., & Pan, X. (2020). Using convolution neural network and hyperspectral image to identify moldy peanut kernels. LWT – Food Science and Technology, 132, 109815. https://doi.org/10.1016/j.lwt.2020.109815.
Ma, J., Pu, H., & Sun, D. W. (2018). Predicting intramuscular fat content variations in boiled pork muscles by hyperspectral imaging using a novel spectral pre-processing technique. LWT – Food Science and Technology, 94, 119–128. https://doi.org/10.1016/j.lwt.2018.04.030.
Melado-Herreros, A., Nieto-Ortega, S., Olabarrieta, I., Gutiérrez, M., Villar, A., Zufía, J., Gorretta, N., & Roger, J. M. (2021). Postharvest ripeness assessment of ‘Hass’ avocado based on development of a new ripening index and Vis-NIR spectroscopy. Postharvest Biology and Technology, 181, 111683. https://doi.org/10.1016/j.postharvbio.2021.111683.
Mezgec, S., & Koroušić Seljak, B. (2017). NutriNet: A deep learning food and drink image recognition system for dietary assessment. Nutrients, 9(7), 657. https://doi.org/10.3390/nu9070657.
Mishra, P., & Passos, D. (2022). Multi-output 1-dimensional convolutional neural networks for simultaneous prediction of different traits of fruit based on near-infrared spectroscopy. Postharvest Biology and Technology, 183, 111741. https://doi.org/10.1016/j.postharvbio.2021.111741.
Mishra, P., Rutledge, D. N., Roger, J., Wali, K., & Khan, H. A. (2021). Chemometric pre-processing can negatively affect the performance of near-infrared spectroscopy models for fruit quality prediction. Talanta, 229, 122303. https://doi.org/10.1016/j.talanta.2021.122303.
Morellos, A., Pantazi, X. E., Moshou, D., Alexandridis, T., Whetton, R., Tziotzios, G., Wiebensohn, J., Bill, R., & Mouazen, A. M. (2016). Machine learning based prediction of soil total nitrogen, organic carbon and moisture content by using VIS-NIR spectroscopy. Biosystems Engineering, 152, 104–116. https://doi.org/10.1016/j.biosystemseng.2016.04.018.
Nasir, V., & Sassani, F. (2021). A review on deep learning in machining and tool monitoring: Methods, opportunities, and challenges. International Journal of Advanced Manufacturing Technology, 115(9), 2683–2709. https://doi.org/10.1007/s00170-021-07325-7.
New Zealand Avocado (2022). World avocado market. https://industry.nzavocado.co.nz/world-avocado-market/
Parfitt, J., Barthel, M., & Macnaughton, S. (2010). Food waste within food supply chains: Quantification and potential for change to 2050. Philosophical Transactions of the Royal Society B: Biological Sciences, 365(1554), 3065–3081. https://doi.org/10.1098/rstb.2010.0126.
Perkins, M. L., Usanase, D., Zhang, B., Joyce, D. C., & Coates, L. M. (2020). Impact injury at harvest promotes body rots in ‘Hass’ avocado fruit upon ripening. Horticulturae, 6(1), 11. https://doi.org/10.3390/horticulturae6010011.
Posom, J., Maraphum, K., & Phuphaphud, A. (2021). Rapid evaluation of biomass properties used for energy purposes using near-infrared spectroscopy. In Renewable Energy - Technologies and Applications. IntechOpen. https://doi.org/10.5772/intechopen.90828
Pourdarbani, R., Sabzi, S., Rohban, M., García-Mateos, G., & Arribas, J. (2021). Nondestructive nitrogen content estimation in tomato plant leaves by Vis-NIR hyperspectral imaging and regression data models. Applied Optics, 60(30), 9560. https://doi.org/10.1364/ao.431886.
Pu, H., Liu, D., Wang, L., & Sun, D. W. (2016). Soluble solids content and pH prediction and maturity discrimination of lychee fruits using visible and near infrared hyperspectral imaging. Food Analytical Methods, 9(1), 235–244. https://doi.org/10.1007/s12161-015-0186-7.
Rajkumar, P., Wang, N., ElMasry, G., Raghavan, G., & Gariepy, Y. (2012). Studies on banana fruit quality and maturity stages using hyperspectral imaging. Journal of Food Engineering, 108(1), 194–200. https://doi.org/10.1016/j.jfoodeng.2011.05.002.
Rinnan, Å., Van Den Berg, F., & Engelsen, S. B. (2009). Review of the most common pre-processing techniques for near-infrared spectra. TrAC - Trends in Analytical Chemistry, 28(10), 1201–1222. https://doi.org/10.1016/j.trac.2009.07.007.
Schmidhuber, J. (2015). Deep learning in neural networks: An overview. Neural Networks, 61, 85–117. https://doi.org/10.1016/j.neunet.2014.09.003.
Sifre, L., & Mallat, S. (2013). Rotation, scaling and deformation invariant scattering for texture discrimination. Proceedings of 2013 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Portland, OR (pp. 1233–1240). https://doi.org/10.1109/CVPR.2013.163
Sivakumar, D., Jiang, Y., & Yahia, E. M. (2011). Maintaining mango (Mangifera indica L.) fruit quality during the export chain. Food Research International, 44(5), 1254–1263. https://doi.org/10.1016/j.foodres.2010.11.022.
Smith, T. E., Hofman, P. J., Stephenson, R. A., Asher, C. J., & Hetherington, S. E. (1997). Improving boron nutrition improves ‘Hass’ avocado fruit size and quality. In J. G. Cutting (Ed), Proceedings from Conference ’97: Searching for Quality. Joint Conference of the Australian Avocado Grower’s Federation and the New Zealand Avocado Growers’ Association, Tauranga, New Zealand (pp. 131–137). https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.539.2467&rep=rep1&type=pdf
Steinbrener, J., Posch, K., & Leitner, R. (2019). Hyperspectral fruit and vegetable classification using convolutional neural networks. Computers and Electronics in Agriculture, 162, 364–372. https://doi.org/10.1016/j.compag.2019.04.019.
Sun, Y., Li, L., Zheng, L., Hu, J., Li, W., Jiang, Y., & Yan, C. (2019). Image classification base on PCA of multi-view deep representation. Journal of Visual Communication and Image Representation, 62, 253–258. https://doi.org/10.1016/j.jvcir.2019.05.016.
Tahmasbian, I., Bai, S. H., Wang, Y., Boyd, S., Zhou, J., Esmaeilani, R., & Xu, Z. (2018a). Using laboratory-based hyperspectral imaging method to determine carbon functional group distributions in decomposing forest litterfall. Catena, 167, 18–27. https://doi.org/10.1016/j.catena.2018.04.023.
Tahmasbian, I., Xu, Z., Abdullah, K., Zhou, J., Esmaeilani, R., Nguyen, T. T. N., & Bai, S. H. (2017). The potential of hyperspectral images and partial least square regression for predicting total carbon, total nitrogen and their isotope composition in forest litterfall samples. Journal of Soils and Sediments, 17(8), 2091–2103. https://doi.org/10.1007/s11368-017-1751-z.
Tahmasbian, I., Xu, Z., Boyd, S., Zhou, J., Esmaeilani, R., Che, R., & Bai, S. H. (2018b). Laboratory-based hyperspectral image analysis for predicting soil carbon, nitrogen and their isotopic compositions. Geoderma, 330, 254–263. https://doi.org/10.1016/j.geoderma.2018.06.008.
Vega Díaz, J. J., Sandoval Aldana, A. P., & Reina Zuluaga, D. V. (2020). Prediction of dry matter content of recently harvested ‘Hass’ avocado fruit using hyperspectral imaging. Journal of the Science of Food and Agriculture, 101(3), 897–906. https://doi.org/10.1002/jsfa.10697.
Wei, X., Liu, F., Qiu, Z., Shao, Y., & He, Y. (2014). Ripeness classification of astringent persimmon using hyperspectral imaging technique. Food and Bioprocess Technology, 7(5), 1371–1380. https://doi.org/10.1007/s11947-013-1164-y
Wei, X., Wu, L., Ge, D., Yao, M., & Bai, Y. (2022). Prediction of the maturity of greenhouse grapes based on imaging technology. Plant Phenomics. https://doi.org/10.34133/2022/9753427
West, N. E., & O’Shea, T. (2017). Deep architectures for modulation recognition. Proceedings of 2017 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN), Baltimore, MD (pp. 1–6). https://doi.org/10.1109/DySPAN.2017.7920754
Williams, P., Manley, M., & Antoniszyn, J. (2019). Near-infrared technology: Getting the best out of light. African Sun Media. https://doi.org/10.18820/9781928480310.
Wold, J. P., Jakobsen, T., & Krane, L. (1996). Atlantic salmon average fat content estimated by near-infrared transmittance spectroscopy. Journal of Food Science, 61(1), 74–77. https://doi.org/10.1111/j.1365-2621.1996.tb14728.x.
Wold, S., Sjöström, M., & Eriksson, L. (2001). PLS-regression: A basic tool of chemometrics. Chemometrics and Intelligent Laboratory Systems, 58(2), 109–130. https://doi.org/10.1016/S0169-7439(01)00155-1.
Xu, B., Wang, N., Chen, T., & Li, M. (2015). Empirical Evaluation of Rectified Activations in Convolutional Network. arXiv. https://arxiv.org/abs/1505.00853v2
Zhang, X., Zou, Y., & Shi, W. (2017). Dilated convolution neural network with LeakyReLU for environmental sound classification. Proceedings of 2017 22nd International Conference on Digital Signal Processing, London, UK (pp. 1–5). https://doi.org/10.1109/ICDSP.2017.8096153

Acknowledgements

The study was undertaken using the LIEF HPC-GPGPU Facility, established with the assistance of LIEF Grant LE170100200, at the University of Melbourne. We thank Costa Farms for assistance and access to their orchards.

Funding

Open Access funding enabled and organized by CAUL and its Member Institutions. This work was supported by Project PH16001 of the Hort Frontiers Pollination Fund, part of the Hort Frontiers strategic partnership initiative developed by Hort Innovation, with co-investment from Griffith University, University of the Sunshine Coast, Plant and Food Research Ltd, and contributions from the Australian Government.

Author information

Authors and Affiliations

Key Laboratory of Monitoring and Estimate for Environment and Disaster of Hubei Province, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan, 430077, China
Yifei Han
University of Chinese Academy of Sciences, Beijing, 100049, China
Yifei Han
Centre for Planetary Health and Food Security, School of Environment and Science, Griffith University, Nathan, QLD, 4111, Australia
Shahla Hosseini Bai, Stephen J. Trueman & Wiebke Kämper
Department of Infrastructure Engineering, The University of Melbourne, Parkville, VIC, 3010, Australia
Kourosh Khoshelham

Contributions

Conceptualization: SHB, SJT, WK; Methodology: YH, KK, WK; Formal analysis: YH, WK; Software: YH, KK, WK; Resources: SHB, WK; Validation: YH, KK, WK; Data curation: WK; Project administration: YH; Writing—Original Draft: YH; Writing—Review & Editing: YH, SHB, SJT, KK, WK; Funding acquisition: SHB, SJT; Supervision: SHB, KK.

Corresponding author

Correspondence to Wiebke Kämper.

Ethics declarations

Conflict of interest

None.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary material 1 (DOCX 529.7 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Han, Y., Bai, S.H., Trueman, S.J. et al. Predicting the ripening time of ‘Hass’ and ‘Shepard’ avocado fruit by hyperspectral imaging. Precision Agric 24, 1889–1905 (2023). https://doi.org/10.1007/s11119-023-10022-y

Accepted: 31 March 2023
Published: 18 April 2023
Issue Date: October 2023
DOI: https://doi.org/10.1007/s11119-023-10022-y
