Plant species recognition based on global–local maximum margin discriminant projection

doi:10.1016/j.knosys.2020.105998

Knowledge-Based Systems

Volume 200, 20 July 2020, 105998

https://doi.org/10.1016/j.knosys.2020.105998

Abstract

Plant species recognition using leaves is an important and challenging research topic because plant leaves are diverse and irregular, with very large within-class differences and between-class similarities. Considering that different leaves differ in their discriminant performance and contribution to the plant recognition task, we propose a global–local maximum margin discriminant projection (GLMMDP) algorithm for plant recognition, based on maximum neighborhood margin discriminant projection (MNMDP). GLMMDP uses the local information, the class information and the global structure of the data to model the intra-class and inter-class neighborhood scatters and a global scatter. It obtains the projection matrix by minimizing the local intra-class scatter while maximizing both the local inter-class scatter and the global between-class scatter. Compared with MNMDP, GLMMDP can not only detect the true intrinsic manifold structure of the data but also enhance the pattern discrimination between different classes by incorporating the global between-class scatter, which indicates the difference and similarity between classes. Experimental results on the ICL (Intelligent Computing Laboratory) leaf datasets and the Leafsnap leaf image datasets demonstrate the effectiveness of the proposed plant recognition method: the recognition accuracy is more than 95% on the ICL datasets and more than 90% on the Leafsnap datasets.

Introduction

Nature, the leading international journal, published a paper online under the headline “the world’s largest plant survey reveals astonishing extinction rates”, revealing a startling subject [1]. Global ecological diversity is rapidly declining because of human-driven habitat changes such as excessive logging, hunting, excessive use of pesticides, construction of water conservancy projects and invasion by alien species. This disappearance is a loss not only of species diversity in the ecosystem but also of the diversity of genetic resources on Earth, and its impact may be neither intuitive nor easy to measure. Plant species diversity plays a crucial role in the ecology of the Earth, and the extinction of a large number of plant species has prompted efforts to protect it. Conserving plant species requires recognizing them, which is useful to botanists, industrialists, food engineers and physicians. A plant species can be recognized by its different organs, such as the leaf, stem, flower and fruit. Botanists can readily identify plant species by examining the leaf shape, tip, base, margin and veins, as well as the leaf texture and the arrangement of leaflets in compound leaves. Wäldchen et al. [2] systematically reviewed the plant species identification approaches in 120 peer-reviewed studies published in the ten years from 2005 to 2015 and classified these methods into two classes, according to the plant organs studied, such as leaf, flower and fruit, and the features studied, such as shape, texture, color, margin and vein structure. They also compared the classification accuracy that these methods achieved on publicly available datasets. Purohit et al. [3] reviewed image-based plant species identification methods and pointed out that different recognition methods are used depending on which part of the plant is imaged.
A plant leaf is approximately two-dimensional, and its shape and texture are two important features for characterizing plant species. Plant recognition can therefore be achieved by extracting features from the leaf, and many leaf-image-based plant recognition methods have been presented [4], [5]. Handa et al. [6] compared various plant recognition algorithms and reviewed the main computational, morphological and image processing methods of recent years. They concluded that plant recognition can be done by extracting various features from leaves and that there is still room to improve recognition performance by designing a new digital plant recognition system. Jana et al. [7] reviewed computer-vision-based approaches for plant species identification, highlighted the main research challenges to overcome in providing feasible tools, and concluded with a discussion of open questions and future research directions. Wang et al. [8] extracted more than 30 leaf features, including 16 shape features, 11 texture features and 4 color features, and introduced 8 classifiers for plant recognition. Wu et al. [9] presented a fast and robust leaf recognition method based on rotation invariant shape context (RISC) and summed squared differences (SSD) color matching. Unlike traditional leaf shape methods that are invariant only to scale and translation, this method is also rotation invariant and can recognize leaves at different rotation angles. Wu et al. [10] presented a leaf recognition method that combines feature extraction and machine learning. To overcome the weaknesses of the classical algorithms, the binary Gabor pattern (BGP), in an offline manner, and the extreme learning machine (ELM) are applied to recognizing plant leaves.
Unlike the traditional back-propagation (BP) neural network and the support vector machine (SVM), the ELM-based method requires only one parameter to be set, without additional fine-tuning during leaf recognition. Jyotismita et al. [11] proposed a plant leaf recognition method that combines texture and shape features. Medicinal plants in particular are the main source of traditional Chinese medicine, which can provide basic protection of human health. Kan et al. [12] proposed an automatic classification method based on leaf images of medicinal plants to address the limitations of manual classification in identifying medicinal plants. In this method, 10 shape features and 5 texture features are extracted and an SVM is adopted to classify the leaves of medicinal plants. Lavania et al. [13] presented a leaf-based plant recognition algorithm using the scale-invariant feature transform (SIFT) and principal component analysis (PCA) with a probabilistic neural network (PNN). Jin et al. [14] proposed an automatic species classification method using a sparse representation of leaf tooth features. In this method, 4 leaf tooth features (Leaf-num, Leaf-rate, Leaf-sharpness and Leaf-obliqueness) are extracted and concatenated into a feature vector to identify plant species. Chaki et al. [15] proposed a method that recognizes plant leaves by combining texture and shape features, where leaf texture is modeled using a Gabor filter and the gray level co-occurrence matrix (GLCM), while leaf shape is captured using the curvelet transform together with invariant moments. Zhang et al. [16] addressed the difficult problem of plant leaf recognition on large-scale databases and proposed a two-stage local similarity based classification learning method that combines a local mean-based clustering method with local sparse representation based classification (SRC). Zeng et al. [17] presented a shape descriptor, the periodic wavelet descriptor (PWD), to extract plant leaf features, and constructed a database of PWDs for plant recognition. Zhang et al. [18] proposed a discriminant weighted SRC (DWSRC) algorithm for large-scale plant species classification. Unlike traditional SRC and its improved variants, DWSRC represents the test sample sparsely on a sub-dictionary whose basic elements are the training samples of the selected similar classes, instead of using the generic over-complete dictionary built from the entire training set. Thyagharajan et al. [19] reviewed several image processing methods for leaf feature extraction and indicated that feature extraction is a crucial technique in computer vision.

From the above methods, it is found that classical leaf-image-based plant recognition methods generally include the following distinct steps: (1) Acquiring leaf images. Leaf images are easy to collect with cameras, smartphones and other IoT camera devices, so that analysis for classification can be performed. (2) Preprocessing. Each original leaf image is preprocessed to enhance its discriminant performance, which typically includes image denoising, image content enhancement and segmentation. (3) Feature extraction. Various classification features are extracted to describe the leaf image. (4) Classifying plant leaves. All extracted features are concatenated into a feature vector for plant species recognition.

Plant leaf recognition has been an active research topic in recent years, with improvements in both recognition accuracy and speed. However, many existing methods extract only shape and texture features from the leaf image and adopt traditional neural network or SVM classifiers to recognize it. These methods are limited in recognition accuracy and speed, especially on large leaf databases. From the above analysis, we can conclude that the recognition results rely mainly on the features extracted from leaves. However, plant leaf images are diverse, complex and irregular, with large intra-class differences and inter-class similarities, as shown in Fig. 1. It is difficult to determine which features are optimal, and using all possible kinds of features to classify plant species is also ineffective. Thus, many existing plant recognition methods based on classical leaf image feature extraction cannot achieve satisfactory results.

In recent years, many manifold-based learning algorithms, such as the maximum margin criterion (MMC) and maximum neighborhood margin discriminant projection (MNMDP) [20], [21], [22], have been proposed to discover the intrinsic low-dimensional embedding of the original image, and they have yielded impressive results on artificial and real-world data recognition, including plant leaf recognition [23]. There are several supervised variants of linear discriminant analysis (LDA) and locality preserving projection (LPP) [24], [25]. LDA uses the class information to find global discriminant information for classification by maximizing the ratio between the inter-class and intra-class scatters. MMC is more efficient than LDA at calculating the discriminant vectors because it does not need to compute the inverse of the within-class scatter matrix. LPP tries to find an embedding that preserves the local neighborhood information. MNMDP uses the class label information to discover the inherent manifold structure. Shao proposed a supervised global-locality preserving projection (SGLPP) algorithm for plant leaf recognition [26]. SGLPP uses the local information and class information of the training samples to construct the global weighted inter-scatter matrix, which can enlarge the distance between different classes in the data and thus effectively reveal the intrinsic manifold structure for classification. Based on MMC, MNMDP and SGLPP, we propose a global–local maximum margin discriminant projection (GLMMDP) algorithm for plant species classification. Unlike the classical manifold learning methods in how the neighborhood weights and the objective function are constructed, the objective function of GLMMDP combines the global and local information of the intra-class samples with the discriminating information of the inter-class neighbors of a given sample. It can not only preserve the intrinsic sub-manifold structure of the data well but also enhance the discrimination among different classes, which helps improve the classification performance. The experimental results show that GLMMDP is effective in improving classification performance. The contributions of this paper are as follows:

(1) A global–local maximum margin discriminant projection (GLMMDP) algorithm is proposed for plant leaf recognition.

(2) The proposed GLMMDP not only can detect the true intrinsic manifold structure of the data, but also can enhance the pattern discrimination between different classes by incorporating a global between-class scatter.

(3) The experimental results on the ICL (Intelligent Computing Laboratory) leaf datasets and Leafsnap leaf image datasets demonstrate the effectiveness of the proposed method. The classification rate is more than 95% on ICL leaf datasets and more than 90% on Leafsnap datasets.
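The contrast drawn earlier in this introduction between LDA and MMC can be made concrete with a small sketch. It assumes the standard textbook forms of the two criteria (LDA as the leading eigenvectors of the inverse within-class scatter times the between-class scatter, MMC as the leading eigenvectors of their difference). The function names, the toy data and the use of a pseudo-inverse are illustrative choices, not taken from the paper.

```python
import numpy as np

def class_scatters(X, y):
    """Return the global between-class (S_b) and within-class (S_w) scatter matrices.

    X has shape (n_samples, n_features); y holds integer class labels.
    """
    mean_all = X.mean(axis=0)
    d = X.shape[1]
    S_b = np.zeros((d, d))
    S_w = np.zeros((d, d))
    for c in np.unique(y):
        Xc = X[y == c]
        mean_c = Xc.mean(axis=0)
        diff = (mean_c - mean_all)[:, None]
        S_b += len(Xc) * (diff @ diff.T)   # class mean vs. overall mean
        centered = Xc - mean_c
        S_w += centered.T @ centered       # spread around the class mean
    return S_b, S_w

def mmc_projection(X, y, n_components):
    """MMC: top eigenvectors of the symmetric matrix S_b - S_w (no inverse needed)."""
    S_b, S_w = class_scatters(X, y)
    eigvals, eigvecs = np.linalg.eigh(S_b - S_w)
    order = np.argsort(eigvals)[::-1][:n_components]
    return eigvecs[:, order]

def lda_projection(X, y, n_components):
    """LDA: leading eigenvectors of pinv(S_w) @ S_b.

    The pseudo-inverse stands in for the inverse so the sketch also runs
    when S_w is singular (an assumption of this sketch).
    """
    S_b, S_w = class_scatters(X, y)
    eigvals, eigvecs = np.linalg.eig(np.linalg.pinv(S_w) @ S_b)
    order = np.argsort(eigvals.real)[::-1][:n_components]
    return eigvecs[:, order].real

# Toy data: three Gaussian classes in 4-D (rows are samples).
rng = np.random.default_rng(0)
means = np.array([[0.0, 0, 0, 0], [4, 0, 0, 0], [0, 4, 0, 0]])
X = np.vstack([m + rng.normal(size=(30, 4)) for m in means])
y = np.repeat(np.arange(3), 30)

A_mmc = mmc_projection(X, y, 2)
A_lda = lda_projection(X, y, 2)
print(A_mmc.shape, A_lda.shape)
```

Because `S_b - S_w` is symmetric, the MMC step reduces to one symmetric eigendecomposition and returns orthonormal projection vectors, whereas the LDA step has to invert (or pseudo-invert) the within-class scatter first.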

The remainder of this paper is organized as follows. Section 2 reviews MNMDP. Section 3 proposes GLMMDP for plant recognition. Section 4 presents the experimental results and comparisons. Section 5 concludes the paper.

Section snippets

Maximum neighborhood margin discriminant projection

Maximum neighborhood margin discriminant projection (MNMDP) is a linear graph embedding method, which can not only detect the underlying intrinsic sub-manifold structure of the data, but also strengthen the pattern discrimination among different classes [24].

Suppose we have $n$ samples from $C$ classes, $X = [x_{1}, x_{2}, \dots, x_{n}]$, where $c_{i}$ is the class label of the $i$-th point $x_{i}$, and $Y = [y_{1}, y_{2}, \dots, y_{n}]$ is the projection of $X$, i.e., $y_{i} = A^{T} x_{i}$. In MNMDP, the between-class scatter matrix $S^{b}$ and within-class scatter matrix $S^{w}$
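The snippet above breaks off before the scatter matrices are defined. As a general aid, and not as MNMDP's actual definition, the following identity shows why a linear projection $y_{i} = A^{T} x_{i}$ leads to scatter matrices in the first place. The weights $w_{ij}$ are a placeholder for whatever neighborhood weights a method chooses; they are an assumption of this sketch.

$$
\sum_{i,j} w_{ij} \, \| y_{i} - y_{j} \|^{2}
= \sum_{i,j} w_{ij} \, (x_{i} - x_{j})^{T} A A^{T} (x_{i} - x_{j})
= \mathrm{tr}\!\left( A^{T} \left[ \sum_{i,j} w_{ij} \, (x_{i} - x_{j})(x_{i} - x_{j})^{T} \right] A \right)
$$

The bracketed term is a scatter matrix computed once in the original space, so minimizing or maximizing weighted distances between projected points becomes a trace criterion in $A$.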

Idea

In fact, leaf images of different species can be similar to each other, which makes the species difficult to classify, as shown in Fig. 1B. For the plant recognition task, enlarging the distance between any two leaf images of different classes can enhance the ability to classify plant species. We therefore add a global between-class scatter to MNMDP to enlarge the distances between inter-class leaf images, and then propose a novel global–local maximum margin
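The idea of adding a global between-class scatter to a local margin criterion can be illustrated with a minimal sketch. The paper's exact weights and objective are not shown in this excerpt, so everything specific here is an assumption: binary k-nearest-neighbor weights for the local scatters, an LDA-style class-mean scatter as the global term, and an additive trade-off parameter `beta`. All function names are hypothetical.

```python
import numpy as np

def local_scatters(X, y, k):
    """Local inter-class (S_b) and intra-class (S_w) scatters from k nearest neighbors.

    Binary neighbor weights are an assumption of this sketch.
    """
    n, d = X.shape
    sq = np.sum(X ** 2, axis=1)
    dist = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)  # squared Euclidean distances
    S_b = np.zeros((d, d))
    S_w = np.zeros((d, d))
    idx_all = np.arange(n)
    for i in range(n):
        same = idx_all[(y == y[i]) & (idx_all != i)]
        other = idx_all[y != y[i]]
        near_same = same[np.argsort(dist[i, same])[:k]]
        near_other = other[np.argsort(dist[i, other])[:k]]
        dw = X[near_same] - X[i]
        db = X[near_other] - X[i]
        S_w += dw.T @ dw
        S_b += db.T @ db
    return S_b, S_w

def global_between_scatter(X, y):
    """LDA-style global between-class scatter (a stand-in for the paper's global term)."""
    mean_all = X.mean(axis=0)
    d = X.shape[1]
    S_g = np.zeros((d, d))
    for c in np.unique(y):
        Xc = X[y == c]
        diff = (Xc.mean(axis=0) - mean_all)[:, None]
        S_g += len(Xc) * (diff @ diff.T)
    return S_g

def margin_matrix(X, y, k, beta):
    """Local margin S_b - S_w plus beta times the global between-class scatter."""
    S_b, S_w = local_scatters(X, y, k)
    return S_b - S_w + beta * global_between_scatter(X, y)

def glmmdp_like_projection(X, y, k, beta, n_components):
    """Columns of A maximize tr(A^T M A) subject to A^T A = I."""
    M = margin_matrix(X, y, k, beta)
    M = (M + M.T) / 2.0                       # guard against round-off asymmetry
    eigvals, eigvecs = np.linalg.eigh(M)
    return eigvecs[:, np.argsort(eigvals)[::-1][:n_components]]

# Toy data: three Gaussian classes in 5-D (rows are samples).
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0, 0, 0, 0], [3, 0, 0, 0, 0], [0, 3, 0, 0, 0]])
X = np.vstack([c + rng.normal(size=(20, 5)) for c in centers])
y = np.repeat(np.arange(3), 20)

A = glmmdp_like_projection(X, y, k=5, beta=1.0, n_components=2)
Y = X @ A   # each row is y_i = A^T x_i
print(Y.shape)
```

Setting `beta=0` in this sketch drops the global term and leaves only the local margin, which makes it easy to compare the two settings on the same data.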

Experiments and analysis

In this section, we evaluate the effectiveness of GLMMDP for leaf-based plant species classification and compare it with four state-of-the-art plant species classification algorithms: texture and shape features with neural classifiers (TSNC) [11], the SIFT and probabilistic neural network algorithm (SIFT + PNN) [13], orthogonal locally discriminant spline embedding (OLDSE) [22], and the most recent method, supervised global-locality preserving projection (SGLPP) [28]. To further test the

Conclusions

Because leaf images are diverse and irregular, with large within-class differences and between-class similarities, many classical feature-extraction-based plant classification algorithms are often vague about which features need to be extracted and selected from each plant leaf image, and why. This paper proposes a novel plant recognition method based on GLMMDP, which seeks an optimal projection matrix to preserve the global–local neighborhood relationship and improve the discriminant

CRediT authorship contribution statement

Shanwen Zhang: Writing - original draft. Chuanlei Zhang: Conceptualization. Xuqi Wang: Methodology.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This work is supported by the Key Project of the Tianjin Natural Science Foundation [http://dx.doi.org/10.13039/501100006606] (No. 18JCZDJC32100) and the Tianjin Science and Technology Commissioner Project (No. 19JCTPJC51100).

References (28)

Du J.X. et al. Leaf shape based plant species recognition. Appl. Math. Comput. (2007)

Chaki Jyotismita et al. Plant leaf recognition using texture and shape features with neural classifiers. Pattern Recognit. Lett. (2015)

Chaki J. et al. Plant leaf recognition using texture and shape features with neural classifiers. Pattern Recognit. Lett. (2015)

Lu G.F. et al. Face recognition using discriminant locality preserving projections based on maximum margin criterion. Pattern Recognit. (2010)

Lei Y.K. et al. Orthogonal locally discriminant spline embedding for plant leaf recognition. Comput. Vis. Image Underst. (2014)

Li B. et al. Constrained Discriminant Neighborhood Embedding for High Dimensional Data Feature Extraction, 173 (P2) (2016)

Shao Yu. Supervised global-locality preserving projection for plant leaf recognition. Comput. Electron. Agric. (2019)

Humphreys A.M. et al. Global dataset shows geography and life form predict modern plant extinction and rediscovery. Nat. Ecol. Evol. (2019)

Wäldchen J. et al. Plant species identification using computer vision techniques: A systematic literature review. Arch. Comput. Methods Eng. (2018)

Purohit S. et al. Automatic plant species recognition technique using machine learning approaches.

Anant Bhardwaj. A review on plant recognition and classification techniques using leaf images. Int. J. Eng. Trends Technol. (2013)

Handa A. et al. A review and a comparative study of various plant recognition and classification techniques using leaf images. Int. J. Comput. Appl. (2015)

Jana W. et al. Automated plant species identification—Trends and future directions. PLoS Comput. Biol. (2018)

Wang Z. et al. Review of plant recognition based on image processing. Arch. Comput. Methods Eng. (2016)

Cited by (13)

P2S distance induced locally conjugated orthogonal subspace learning for feature extraction
2024, Expert Systems with Applications
Data classification tasks often run into the curse of dimensionality. To address this issue, this paper puts forward a manifold learning method termed locally conjugated orthogonal subspace (LCOS) for dimensionality reduction or feature extraction. Because the point to feature space (P2S) distance helps mine local geometry information, both a local margin characterizing data apartness and a locally conjugated orthogonal constraint that helps remove data redundancy are derived from the P2S distance metric, and both are used to model the proposed LCOS. A low-dimensional subspace can then be found by maximizing the P2S-distance-induced local margin under the constraint. Compared with other related dimensionality reduction methods, experimental results on benchmark face and object data sets validate the performance of the proposed method.
Deep convolutional feature aggregation for fine-grained cultivar recognition
2023, Knowledge-Based Systems
Fine-grained cultivar recognition has recently attracted considerable attention from researchers in pattern recognition and botany. However, this problem is highly challenging because the differences between cultivated species are so subtle that it is difficult to distinguish them effectively. This article presents a novel deep convolutional feature aggregation approach for fine-grained cultivar recognition. First, we propose a description method of the regional convolution covariance feature (RCCF), which describes the subtle changes in cultivated species by accumulating low-level convolution features and has a strong discriminating ability. Second, we also improve the regional maximum activation of convolutions (RMAC) and present a multiresolution RMAC high-level convolutional feature. Finally, we combine complementary RCCF with multiresolution RMAC features for fine-grained cultivar recognition, which significantly improves the recognition accuracy of fine-grained cultivated plants. We have carried out extensive tests on four benchmark cultivar plant datasets. The results show that our approach achieves state-of-the-art recognition performance on four benchmark cultivar plant datasets, surpassing other plant species identification methods.
Discriminative and Geometry-Preserving Adaptive Graph Embedding for dimensionality reduction
2023, Neural Networks
Citation excerpt:
Unlike PCA and LDA, LPP aims to learn the non-linear local manifold structure of a high-dimensional data via a simple linear manifold learning with the constructed adjacency graphs. Due to the property of preserving the intrinsic manifold structure of data contained in the used adjacency graphs, a great many of LPP extensions have been proposed by either the ways of generating graph embedding or the ways of constructing adjacency graphs from data (Gou et al., 2020; Long et al., 2020; Zhang, Zhang, & Wang, 2020). Furthermore, those LPP variants varying in the generation of graph embedding can often be clustered into three categories, i.e., one-dimensional graph embedding (Gou et al., 2020; Gou & Zhang, 2013; Long et al., 2020; Zhang et al., 2017; Zhang, Zhang, & Wang, 2020), two-dimensional graph embedding (Chen et al., 2019, 2017; Li & You, 2019; Lu et al., 2012; Yu, 2009), and kernel graph embedding (Kong et al., 2021; Li, Pan, & Chen, 2011; Li et al., 2008; Peng, Shi, et al., 2011).
Learning graph embeddings for high-dimensional data is an important technology for dimensionality reduction. The learning process is expected to preserve the discriminative and geometric information of high-dimensional data in a new low-dimensional subspace via either manual or automatic graph construction. Although both manual and automatic graph constructions can capture the geometry and discrimination of data to a certain degree, neither can fully explore the underlying data structure when working alone. To learn and preserve as much discriminative and geometric information of the high-dimensional data in the low-dimensional subspace as possible, we develop a novel Discriminative and Geometry-Preserving Adaptive Graph Embedding (DGPAGE). It systematically integrates manual and adaptive graph constructions in one unified graph embedding framework, which is able to effectively inject the essential information of data involved in predefined graphs into the learning of an adaptive graph, in order to achieve both adaptability and specificity of data. Learning the adaptive graph jointly with the optimized projections, DGPAGE can generate an embedded subspace that has better pattern discrimination for image classification. Results derived from extensive experiments on image data sets have shown that DGPAGE outperforms the state-of-the-art graph-based dimensionality reduction methods. The ablation studies show that it is beneficial to have an integrated framework, like DGPAGE, that brings together the advantages of manual/adaptive graph construction.
Fault diagnosis of rotor based on Local-Global Balanced Orthogonal Discriminant Projection
2021, Measurement: Journal of the International Measurement Confederation
Citation excerpt:
Subsequently, Zhang et al. [22,23] proposed the modified orthogonal discriminant projection (Modified ODP) algorithm and the semi-supervised orthogonal discrimination projection (SSODP) algorithm, both based on the ODP algorithm, in 2011 and 2016 respectively; these were well verified in face recognition and plant leaf recognition. Then in 2020, Zhang [24] proposed a global–local maximum margin discriminant projection (GLMMDP) algorithm based on maximum neighborhood margin discriminant projection (MNMDP). GLMMDP makes effective use of the local and global structure information and the category information of the data.
The rotor is the most important part of rotating machinery, and whether the rotor is normal directly determines whether the whole machine operates normally. To address the classification difficulty caused by the multi-class and high-dimensional complex characteristics of rotor fault data, a fault data set reduction method based on Local-Global Balanced Orthogonal Discriminant Projection (LGBODP) is proposed. The algorithm comprehensively considers the intra-class local information, intra-class non-local information, inter-class local information and inter-class non-local information of the data, so as to avoid the loss of structure information in the dimension reduction process. By maximizing the inter-class distance and minimizing the intra-class distance, the intrinsic manifold structure information of the fault feature data set is effectively extracted while maintaining the global feature information. First, mixed features of the rotor vibration signal are extracted from multiple angles in the time domain, frequency domain and time-frequency domain, and a high-dimensional feature set is constructed. Low-dimensional fault-sensitive feature subsets are then extracted by the proposed LGBODP algorithm, and the K-nearest neighbor (KNN) method is used as a fault feature classifier to recognize different fault types of rotors. The effectiveness of the proposed algorithm is verified on the vibration signal sets of two different types of double-span rotor systems. Application examples show that this method can comprehensively extract the global and local discriminant information of rotor vibration signals and effectively diagnose rotor faults.
Intelligent detection for sustainable agriculture: A review of IoT-based embedded systems, cloud platforms, DL, and ML for plant disease detection
2024, Multimedia Tools and Applications
Effective shape features for leaf classification
2023, Journal of Electronic Imaging
