Extreme learning machine for a new hybrid morphological/linear perceptron
Introduction
Along with morphological associative memories (MAMs), morphological perceptrons (MPs) were among the first morphological neural network (MNN) models that appeared in the literature (Ritter et al., 1999, Ritter and Sussner, 1996, Ritter and Sussner, 1997, Ritter et al., 1998, Sussner, 1998, Sussner and Esmi, 2011, Sussner and Valle, 2006). The commonality shared by all MNNs is that their nodes, called “morphological neurons”, apply a “morphological operator” before applying an activation function. Unfortunately, there is no precise definition of a morphological operator as far as applications in image processing and analysis are concerned. According to Henk Heijmans’ seminal book entitled “Morphological Image Operators” (Heijmans, 1994): “Any attempt to find a formal definition of a morphological operator, however, would lead inevitably to the following dilemma: either it would be too restrictive, excluding operators that should not be excluded a priori, or it would be too general, leading to a ‘theory of everything’”.
However, when resorting to the algebraic framework of mathematical morphology (MM), one finds exact definitions of the four types of elementary operators of MM on complete lattices. These four operators are erosion, dilation, anti-erosion, and anti-dilation (Banon & Barrera, 1993). Banon and Barrera have shown that every mapping between complete lattices can be expressed both as a supremum of pair-wise infima of erosions and anti-dilations and as an infimum of pair-wise suprema of dilations and anti-erosions. In view of this fact, it is interesting to observe that the overall computations at every node in the output layer of an MP can be represented as a supremum of pair-wise infima of gray-scale erosions and anti-dilations (Sussner & Esmi, 2011). The level curve of an infimum of a gray-scale erosion and an anti-dilation represents a hyperbox. Several constructive (hyperbox) learning algorithms for training MPs (Sussner, 1998), in particular competitive training algorithms (Sussner and Esmi, 2009a, Sussner and Esmi, 2009b, Sussner and Esmi, 2011), have appeared in the literature. Other hyperbox learning algorithms were used by Ritter, Urcid, Sossa and Guevara to train parameters of dendritic MNNs (Ritter and Urcid, 2003, Sossa and Guevara, 2014). In classification problems, each resulting hyperbox encloses patterns, all or most of which belong to a single class. Note that hyperbox-based algorithms were also independently developed by Kaburlasos et al. for training fuzzy lattice neurocomputing models (Kaburlasos et al., 2007, Kaburlasos and Petridis, 2000).
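To make the hyperbox connection concrete, the following is a minimal NumPy sketch of a unit computing the infimum of a gray-scale erosion and an anti-dilation. The weight vectors `a` and `b` and the exact parameterization are illustrative assumptions (formulations differ across MP models); the point is that the unit is non-negative exactly on the hyperbox [a, b]:

```python
import numpy as np

def erosion(x, a):
    """Gray-scale erosion of pattern x: min_j (x_j - a_j)."""
    return np.min(x - a)

def anti_dilation(x, b):
    """Anti-dilation of pattern x: min_j (b_j - x_j); order-reversing in x."""
    return np.min(b - x)

def hyperbox_unit(x, a, b):
    """Infimum of an erosion and an anti-dilation.
    Non-negative exactly when a_j <= x_j <= b_j for all j,
    i.e., when x lies in the hyperbox [a, b]."""
    return min(erosion(x, a), anti_dilation(x, b))

a, b = np.array([0.0, 0.0]), np.array([1.0, 2.0])
print(hyperbox_unit(np.array([0.5, 1.0]), a, b) >= 0)  # inside the box -> True
print(hyperbox_unit(np.array([1.5, 1.0]), a, b) >= 0)  # outside the box -> False
```

A classifier built from such units assigns a pattern to a class when it falls inside (one of) the class hyperboxes, which is the geometric intuition behind the constructive training algorithms cited above.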
One of the main reasons why researchers devised constructive training algorithms in order to train morphological perceptrons or dendritic MNNs, instead of resorting to standard methods of non-linear optimization, is the fact that the morphological operations used in these models are not differentiable. In their recent approaches towards training dendritic morphological and hybrid morphological/linear neural networks via stochastic gradient descent (Hernández et al., 2018, Zamora and Sossa, 2017), Zamora, Sossa, et al. apparently dealt with locations where the partial derivatives do not exist by setting the search directions equal to 0. In any case, the vector of search directions is not continuous, which is certainly a disadvantage. Also note that at locations where the function described by a dendrite is differentiable, all but one of its partial derivatives with respect to the weights are equal to 0.
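This sparsity of gradient information can be seen in a toy example (not the authors' code): for a max-plus (dilation) unit, wherever the function is differentiable, only the maximizing weight receives a non-zero partial derivative, so the gradient is a one-hot vector:

```python
import numpy as np

def dilation(x, w):
    """Max-plus (dilation) unit: max_j (x_j + w_j)."""
    return np.max(x + w)

def dilation_grad_w(x, w):
    """Gradient of the dilation with respect to w where it exists:
    1 at the maximizing index, 0 everywhere else."""
    g = np.zeros_like(w)
    g[np.argmax(x + w)] = 1.0
    return g

x = np.array([0.2, 1.0, -0.5])
w = np.array([0.0, 0.3, 0.1])
print(dilation_grad_w(x, w))  # one-hot: [0. 1. 0.]
```

At ties (two or more arguments attaining the maximum) the function is not differentiable at all, which is precisely where a gradient-based method must fall back on an arbitrary choice of search direction.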
An alternative for training the weights of morphological neurons without encountering these problems consists in smoothing the morphological operations. This approach was employed by Pessoa and Maragos in their hybrid morphological/rank/linear neural network (MRL-NN) (Pessoa & Maragos, 2000) as well as in Araújo et al.’s morphological and hybrid morphological/linear neural networks (Araújo, 2012, Araújo et al., 2017, Araújo and Sussner, 2010). Finally, note that MRL-NNs, other hybrid models, and dendritic MNNs can also be trained by using methods of evolutionary computation (Araújo and Ferreira, 2013, Arce et al., 2018).
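As an illustration of the smoothing idea, one common way to approximate a maximum by a differentiable function is the log-sum-exp with a sharpness parameter (here called `beta`, a hypothetical name); note that Pessoa and Maragos work with smoothed rank functions, of which max and min are special cases, so this is a sketch of the general principle rather than their exact construction:

```python
import numpy as np

def smooth_max(z, beta=10.0):
    """Log-sum-exp approximation of max(z); tends to max(z) as beta -> inf.
    The shift by max(z) avoids numerical overflow."""
    m = np.max(z)
    return m + np.log(np.sum(np.exp(beta * (z - m)))) / beta

z = np.array([0.0, 1.0, 0.9])
print(np.max(z), smooth_max(z, beta=50.0))  # the two values nearly coincide
```

Unlike the exact maximum, `smooth_max` has continuous, everywhere-defined partial derivatives, which is what makes gradient-based training of the smoothed networks feasible.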
Our approach for training hybrid morphological/linear neural networks differs greatly from all aforementioned approaches since it is based on Huang et al.’s extreme learning machine. According to Huang, Zhu, and Siew (2006), training a network using an extreme learning machine (ELM) is computationally inexpensive compared to evolutionary optimization (Simon, 2013) and classical neural network training algorithms, and generally leads to a good generalization performance without requiring some form of regularization in order to avoid “overfitting” (Bishop, 1995, Bishop, 2006, Ripley, 1996). The hybrid morphological/linear perceptron (HMLP) introduced in this paper is a feedforward neural network model that includes hidden morphological units taken from the previous MP models. The outputs of these morphological units and traditional semi-linear neurons are then combined using linear aggregation functions in the output neurons. Therefore, an ELM-based approach can effectively be employed in order to learn the weights of these linear combinations. In this paper, we apply our HMLP approach to a number of classification problems from the literature and compare the resulting classification rates with those of several related models.
The paper is organized as follows: Section 2 reviews some relevant concepts of lattice theory and mathematical morphology, including a few pertinent comments on gray-scale mathematical morphology. After discussing morphological perceptrons and their relationship with some related models from the literature, as well as providing some motivation, we introduce a new hybrid morphological/linear perceptron (HMLP) in Section 3. Then we present an extreme learning machine approach for HMLP training in Section 4 and compare the classification performance achieved by our model with related models from the recent literature in Section 5. We finish the paper with some concluding remarks.
Section snippets
Some relevant concepts of lattice theory and mathematical morphology
Lattice-theoretical concepts have played an important role in mathematical morphology ever since complete lattices were established as an appropriate mathematical framework for MM (Heijmans, 1994, Heijmans and Ronse, 1990, Ronse, 1990, Serra, 1988). Therefore, MNNs can be viewed as a lattice computing approach towards computational intelligence (Kaburlasos & Kehagias, 2014). Let us proceed by reviewing some basic concepts.
A partially ordered set or poset is a non-empty set together with a binary relation, called a partial order, that is reflexive, antisymmetric, and transitive.
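Spelled out, the standard axioms of a partial order $\le$ on a non-empty set $P$ read:

```latex
% A binary relation \le on P is a partial order when, for all x, y, z \in P:
x \le x                                                   \quad\text{(reflexivity)}
x \le y \;\wedge\; y \le x \;\Longrightarrow\; x = y      \quad\text{(antisymmetry)}
x \le y \;\wedge\; y \le z \;\Longrightarrow\; x \le z    \quad\text{(transitivity)}
```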
From morphological to hybrid morphological/linear perceptrons: Some background and motivation
Note that the previous section refers to computations in the set of extended real numbers since this set, together with the product partial order and addition, represents a complete lattice-ordered group extension (Sussner & Esmi, 2011), which is a suitable mathematical structure in order to establish links with mathematical morphology and minimax algebra (Cuninghame-Green, 1979, Heijmans, 1994). This said, both the weight vectors and the input patterns of morphological and hybrid morphological/linear neural networks have
Training hybrid morphological/linear perceptrons using extreme learning machine
The technical term “extreme learning machine”, coined by Huang et al. (2006), essentially refers to a class of feedforward computational models whose first layer of weights can be randomly selected and whose second layer of weights can be determined as the global minimum of a certain objective function. Notwithstanding some controversies regarding the novelty and the naming of the method, ELM has proven to be very useful for training single and multiple hidden layer feedforward neural networks.
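The two-step recipe described above can be sketched in a few lines of NumPy. The following toy example (synthetic data, and dilation units rather than the erosion/anti-dilation units of the HMLP; it is not the authors' implementation) draws a random hidden layer mixing sigmoid and max-plus units and then computes the output weights as the global least-squares minimizer:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: two Gaussian blobs, binary targets in {-1, +1}.
X = np.vstack([rng.normal(-1, 0.5, (50, 2)), rng.normal(1, 0.5, (50, 2))])
y = np.concatenate([-np.ones(50), np.ones(50)])

# Step 1: randomly chosen hidden-layer weights -- half classical
# sigmoid units, half morphological (max-plus dilation) units.
n_sig, n_mor = 10, 10
W = rng.normal(size=(2, n_sig)); b = rng.normal(size=n_sig)
M = rng.normal(size=(n_mor, 2))

H_sig = 1.0 / (1.0 + np.exp(-(X @ W + b)))             # sigmoid units
H_mor = np.max(X[:, None, :] + M[None, :, :], axis=2)  # dilation units
H = np.hstack([H_sig, H_mor])

# Step 2 (the ELM step): output weights beta minimize ||H beta - y||_2,
# a convex objective whose global minimum is the least-squares solution.
beta, *_ = np.linalg.lstsq(H, y, rcond=None)
acc = np.mean(np.sign(H @ beta) == y)
print(acc)
```

Because only the output weights are learned, and learned in closed form, training amounts to a single linear solve, which is the source of ELM's low computational cost.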
Experimental results in classification
The purpose of this section is to compare the classification performance of an HMLP-EL, trained using the strategies described in the previous section, with those of similar models from the recent literature, namely:
1. Dendritic morphological neural network trained by stochastic gradient descent (DMNN) (Zamora & Sossa, 2017);
2. Dilation/erosion linear perceptron (DELP) (Araújo, Oliveira, & Meira, 2012);
3. Morphological/linear neural network (MLNN) (Hernández et al., 2018);
4. Morphological perceptron
Concluding remarks
In this paper, we introduced an artificial neural network model called the hybrid morphological/linear perceptron that is equipped with classical neurons as well as morphological units. Each of these morphological units or components computes the infimum of two elementary morphological operators, namely an erosion and an anti-dilation. Computational units of this type were already employed in previous models of morphological perceptrons that were trained using constructive algorithms
Acknowledgments
This work was supported in part by CNPq under Grant No. 313145/2017-2 and by FAPESP under Grant Nos. 2018/13657-1 and 2017/10224-4. We would like to thank the anonymous reviewers, whose comments and suggestions helped to improve the paper.
References (57)
- et al. (2013). A morphological-rank-linear evolutionary method for stock market prediction. Information Sciences.
- et al. (2017). A morphological neural network for binary classification problems. Engineering Applications of Artificial Intelligence.
- et al. (2018). Differential evolution training algorithm for dendrite morphological neural networks. Applied Soft Computing.
- et al. (1993). Decomposition of mappings between complete lattices by mathematical morphology, Part I: General lattices. Signal Processing.
- et al. (2008). Robust path-based spectral clustering. Pattern Recognition.
- et al. (1990). The algebraic basis of mathematical morphology: I. Dilations and erosions. Computer Vision, Graphics, and Image Processing.
- (1991). Approximation capabilities of multilayer feedforward networks. Neural Networks.
- et al. (2006). Extreme learning machine: Theory and applications. Neurocomputing.
- et al. (2007). Fuzzy lattice reasoning (FLR) classifier and its application for ambient ozone estimation. International Journal of Approximate Reasoning.
- et al. (2000). Fuzzy lattice neurocomputing (FLN) models. Neural Networks.
- Neural networks with hybrid morphological/rank/linear nodes: A unifying framework with applications to handwritten character recognition. Pattern Recognition.
- Morphological bidirectional associative memories. Neural Networks.
- Why mathematical morphology needs complete lattices. Signal Processing.
- Deep learning in neural networks. Neural Networks.
- Efficient training for dendrite morphological neural networks. Neurocomputing.
- Lattice fuzzy transforms from the perspective of mathematical morphology. Fuzzy Sets and Systems.
- Morphological perceptrons with competitive learning: Lattice-theoretical framework and constructive learning algorithm. Information Sciences.
- Dendrite morphological neurons trained by stochastic gradient descent. Neurocomputing.
- A morphological perceptron with gradient-based learning for Brazilian stock market forecasting. Neural Networks.
- A dilation-erosion-linear perceptron for Bovespa index prediction.
- Minimal representations for translation-invariant set mappings by mathematical morphology. SIAM Journal on Applied Mathematics.
- The sample complexity of pattern classification with neural networks: The size of the weights is more important than the size of the network. IEEE Transactions on Information Theory.
- Random search for hyper-parameter optimization. Journal of Machine Learning Research.
- Lattice theory.
- Neural networks for pattern recognition.
- Pattern recognition and machine learning (Information Science and Statistics).
Cited by (31)
- A multilayered bidirectional associative memory model for learning nonlinear tasks (2023, Neural Networks)
- Generalized morphological components based on interval descriptors and n-ary aggregation functions (2022, Information Sciences)
- Smooth dendrite morphological neurons (2021, Neural Networks). Citation excerpt: “Both approaches are trained with SGD, and they have the capability of extracting features. The Hybrid Morphological/Linear Perceptron (HMLP), proposed by Sussner and Campiotti (2020), is a two-layer model whose hidden layer includes morphological units and classical semi-linear neurons, and linear nodes in the output layer. HMLP is trained via an extreme learning machine approach.”