Abstract

This paper proposes a perceptual medical image fusion framework based on morphological component analysis combined with convolutional sparsity and a pulse-coupled neural network, called MCA-CS-PCNN for short. Source images are first decomposed into cartoon components and texture components by morphological component analysis, and convolutional sparse representations of the cartoon and texture layers are produced with prelearned dictionaries. Then, convolutional sparsity is used as the stimulus that motivates the PCNN to process the cartoon and texture layers. Finally, the fused medical image is computed by combining the fused cartoon and texture layers. Experimental results verify that the MCA-CS-PCNN model is superior to state-of-the-art fusion strategies.

1. Introduction

In clinical applications, medical images include anatomical images and functional images. Anatomical images provide information about dense structures [1], for instance, X-ray computed tomography (CT) and magnetic resonance imaging (MRI). Functional images reflect information about blood flow and metabolic activity [2], for instance, positron emission tomography (PET) and single-photon emission computed tomography (SPECT). A medical image of a single modality does not provide sufficient information for diagnosing diseases; medical image fusion (MIF) technology offers an effective remedy by merging medical images of different modalities into one comprehensive MIF image that aids radiologists in making better diagnoses [3–5].

Many MIF algorithms have been proposed over the last dozen years. These methods include multiscale decomposition- (MSD-) based fusion strategies [6–10], sparse representation- (SR-) based fusion strategies [11], and pulse-coupled neural network- (PCNN-) based fusion strategies [12, 13]. To pursue satisfactory fusion performance, attempts were made to combine the PCNN with multiscale transforms [14–16]. The PCNN is a biologically inspired neural network modeled on the cat visual cortex and has been applied to medical image fusion. Huang et al. [17] integrated the non-subsampled contourlet transform (NSCT) with the PCNN for SPECT and CT image fusion, and the non-subsampled shearlet transform (NSST) was combined with the PCNN to fuse medical images [18]. However, NSCT- and NSST-based fusion strategies incur high computational complexity in capturing accurate contours, which may limit fusion performance. Furthermore, normalized coefficient values are employed to stimulate the PCNN, which may cause detail loss and blurring in the fused image. Electrophysiological experiments have shown that neuron responses to complex stimuli in the cat visual cortex are represented by sparse coding [19–21]. Morphological component analysis (MCA) has been widely studied as an effective image decomposition method; combining MCA with SR yields sparse representations of the cartoon and texture components of an image [22, 23]. To overcome the drawbacks of patch-based coding, convolutional sparse representation (CSR) has been shown to be more effective than standard sparse representation in extracting features [24], since it operates on the whole image instead of local image patches.

Based on the above considerations, this paper presents a medical image fusion algorithm that uses convolutional sparsity to stimulate the PCNN on top of morphological component analysis (MCA-CS-PCNN). Source images are first decomposed into cartoon components and texture components by MCA, and CSRs of the cartoon and texture layers are obtained with prelearned dictionaries. Then, convolutional sparsity is employed to stimulate the PCNN for processing the cartoon and texture layers. The MIF image is computed by combining the fused cartoon and texture layers. We test the performance of the proposed MCA-CS-PCNN fusion method, and the experimental results verify the advantages of our fusion strategy.

2.1. Convolutional Sparsity Based on Morphological Component Analysis (CSMCA)

Convolutional sparsity is a sparse representation model in convolutional form [24], built on the entire image rather than on overlapped patches. The CSR is defined as

$$\min_{\{x_m\}} \frac{1}{2}\left\| \sum_{m=1}^{M} d_m \ast x_m - s \right\|_2^2 + \lambda \sum_{m=1}^{M} \left\| x_m \right\|_1, \qquad (1)$$

where $s$ denotes an image, $\{x_m\}$ and $\{d_m\}$ denote the global sparse coefficient maps and the dictionary filters, respectively, $\ast$ represents the convolution operator, and $\lambda$ is the regularization parameter.
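To make equation (1) concrete, the following is a minimal sketch (not the authors' implementation) that solves the CSR problem by iterative soft thresholding (ISTA); the function name `csr_ista`, the fixed step size, and the use of `scipy.signal.fftconvolve` are our assumptions, and a practical solver would typically use ADMM in the frequency domain.

```python
import numpy as np
from scipy.signal import fftconvolve

def csr_ista(s, filters, lam=0.05, step=0.1, n_iter=100):
    """ISTA sketch for the CSR model of equation (1):
    min_x 0.5 * ||sum_m d_m * x_m - s||_2^2 + lam * sum_m ||x_m||_1.
    `filters` is a list of small 2-D dictionary filters d_m (assumed prelearned);
    the fixed step size is an illustrative choice, not a tuned value."""
    maps = [np.zeros_like(s) for _ in filters]  # coefficient maps x_m
    for _ in range(n_iter):
        recon = sum(fftconvolve(x, d, mode="same") for x, d in zip(maps, filters))
        resid = recon - s  # residual of the data-fitting term
        for m, d in enumerate(filters):
            # gradient of the data term w.r.t. x_m: correlate residual with d_m
            grad = fftconvolve(resid, d[::-1, ::-1], mode="same")
            z = maps[m] - step * grad
            # soft thresholding enforces the l1 sparsity penalty
            maps[m] = np.sign(z) * np.maximum(np.abs(z) - step * lam, 0.0)
    return maps
```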

Morphological component analysis regards an image as a linear combination of different morphological components, defined as [23]

$$s = s_c + s_t, \qquad (2)$$

where $s_c$ and $s_t$ denote the cartoon component and the texture component, respectively. According to CSR theory, the model of convolutional sparsity based on morphological component analysis (CSMCA) is expressed as

$$\min_{\{x_{c,m}\},\{x_{t,m}\}} \frac{1}{2}\left\| \sum_{m} d_{c,m} \ast x_{c,m} + \sum_{m} d_{t,m} \ast x_{t,m} - s \right\|_2^2 + \lambda \sum_{m} \left\| x_{c,m} \right\|_1 + \lambda \sum_{m} \left\| x_{t,m} \right\|_1, \qquad (3)$$

where $d_{c,m}$ and $x_{c,m}$ denote the dictionary filters and convolution sparse coefficient maps corresponding to $s_c$, respectively, and $d_{t,m}$ and $x_{t,m}$ represent the dictionary filters and convolution sparse coefficient maps corresponding to $s_t$, respectively. The image is reconstructed as

$$s = \sum_{m} d_{c,m} \ast x_{c,m} + \sum_{m} d_{t,m} \ast x_{t,m}. \qquad (4)$$
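One simple way to approximate equation (3), sketched below under our own assumptions (the name `csmca`, the outer-iteration count, and a shared `lam`), is block coordinate descent: alternately fit the cartoon maps to the residual left by the current texture estimate and vice versa, reusing `csr_ista` from above.

```python
def csmca(s, d_cartoon, d_texture, lam=0.05, n_outer=5):
    """Block-coordinate-descent sketch of the CSMCA model of equation (3)."""
    xc = [np.zeros_like(s) for _ in d_cartoon]
    xt = [np.zeros_like(s) for _ in d_texture]
    for _ in range(n_outer):
        st = sum(fftconvolve(x, d, mode="same") for x, d in zip(xt, d_texture))
        xc = csr_ista(s - st, d_cartoon, lam)  # update cartoon coefficient maps
        sc = sum(fftconvolve(x, d, mode="same") for x, d in zip(xc, d_cartoon))
        xt = csr_ista(s - sc, d_texture, lam)  # update texture coefficient maps
    return xc, xt
```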

2.2. Pulse-Coupled Neural Network

The schematic diagram of the simplified PCNN is shown in Figure 1. There are three modules in the simplified PCNN model [12]: the dendritic tree, the linking modulation, and the pulse generator. The feeding input $F_{ij}$ and the linking input $L_{ij}$ are built into the dendritic tree, while $U_{ij}$ and $Y_{ij}$ denote the linking modulation (internal activity) and the pulse generator output, respectively. The simplified PCNN model is denoted by

$$F_{ij}(n) = S_{ij}, \qquad (5)$$

$$L_{ij}(n) = V_L \sum_{k,l} W_{ij,kl} Y_{kl}(n-1), \qquad (6)$$

$$U_{ij}(n) = F_{ij}(n)\left(1 + \beta L_{ij}(n)\right), \qquad (7)$$

$$Y_{ij}(n) = \begin{cases} 1, & U_{ij}(n) > \theta_{ij}(n-1), \\ 0, & \text{otherwise}, \end{cases} \qquad (8)$$

$$\theta_{ij}(n) = e^{-\alpha_\theta} \theta_{ij}(n-1) + V_\theta Y_{ij}(n), \qquad (9)$$

where $(i, j)$ denotes a pixel location, $(k, l)$ represents the dislocation in the symmetric neighborhood around a pixel, $W$ and $S$ denote the synaptic weight matrix and the external stimulus, respectively, $V_L$ and $V_\theta$ are normalizing constants, and $\beta$, the linking parameter, varies the weight of the linking field. The threshold magnitude coefficient and attenuation coefficient are represented by $V_\theta$ and $\alpha_\theta$, respectively.
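The sketch below runs equations (5)–(9) on a stimulus image and accumulates a firing time matrix, which Section 3.1 uses as the fusion activity measure. The parameter values, the 8-neighborhood weight matrix, and the normalization of the stimulus to [0, 1] are all illustrative assumptions, not the paper's settings.

```python
from scipy.ndimage import convolve

def simplified_pcnn(S, n_iter=200, beta=0.2, V_L=1.0, V_theta=20.0, alpha_theta=0.2):
    """Run the simplified PCNN of equations (5)-(9) on a stimulus S (assumed
    in [0, 1]) and return the firing time matrix T (per-neuron firing counts)."""
    W = np.array([[0.5, 1.0, 0.5],
                  [1.0, 0.0, 1.0],
                  [0.5, 1.0, 0.5]])  # synaptic weights of the 8-neighborhood
    Y = np.zeros_like(S)             # pulse output
    theta = np.ones_like(S)          # dynamic threshold
    T = np.zeros_like(S)             # firing time matrix
    for _ in range(n_iter):
        L = V_L * convolve(Y, W, mode="constant")           # linking input, eq. (6)
        U = S * (1.0 + beta * L)                            # F = S (eq. (5)) modulated, eq. (7)
        Y = (U > theta).astype(float)                       # pulse generator, eq. (8)
        theta = np.exp(-alpha_theta) * theta + V_theta * Y  # threshold update, eq. (9)
        T += Y                                              # accumulate firing counts
    return T
```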

3. Proposed MIF Framework

3.1. MCA-CS-PCNN

The flowchart of the MCA-CS-PCNN framework is shown in Figure 2. Images A and B denote different source images, each of which is decomposed into cartoon components and texture components by applying MCA. According to equations (1)–(4), the CSR of the cartoon and texture components is computed as

$$\left\{ x_{c,m}^{A}, x_{t,m}^{A} \right\} = \mathrm{CSMCA}(A), \quad \left\{ x_{c,m}^{B}, x_{t,m}^{B} \right\} = \mathrm{CSMCA}(B), \qquad (10)$$

where $\mathrm{CSMCA}(\cdot)$ represents the decomposition described in Section 2.1, $x_{c,m}^{A}$ and $x_{t,m}^{A}$ denote the convolution sparse coefficient maps of $A$, and $x_{c,m}^{B}$ and $x_{t,m}^{B}$ denote the convolution sparse coefficient maps of $B$.

Next, the convolutional sparse representation is used to stimulate the PCNN, because complex stimulation in the cat visual cortex is encoded by sparse coding. The coefficient maps $x_{c,m}$ and $x_{t,m}$ are employed to stimulate the PCNN for processing the cartoon layers and texture layers, respectively:

$$T_{c}^{X} = \mathrm{PCNN}\left( x_{c,m}^{X} \right), \quad T_{t}^{X} = \mathrm{PCNN}\left( x_{t,m}^{X} \right), \quad X \in \{A, B\}, \qquad (11)$$

where $\mathrm{PCNN}(\cdot)$ denotes the network defined by equations (5)–(9); the firing time matrices $T_{c}^{A}$, $T_{c}^{B}$ of the cartoon layers and $T_{t}^{A}$, $T_{t}^{B}$ of the texture layers are accumulated by iterating equations (5)–(9) until the iteration number $n$ reaches $N$, where $N$ denotes the maximum number of iterations, and then the iteration stops.

Then, the fused coefficients of the convolution sparse coefficient maps in the cartoon components are computed by

$$x_{c,m}^{F}(i, j) = \begin{cases} x_{c,m}^{A}(i, j), & T_{c}^{A}(i, j) \geq T_{c}^{B}(i, j), \\ x_{c,m}^{B}(i, j), & \text{otherwise}. \end{cases} \qquad (12)$$

The fused coefficients of the convolution sparse coefficient maps in the texture components are computed by

$$x_{t,m}^{F}(i, j) = \begin{cases} x_{t,m}^{A}(i, j), & T_{t}^{A}(i, j) \geq T_{t}^{B}(i, j), \\ x_{t,m}^{B}(i, j), & \text{otherwise}. \end{cases} \qquad (13)$$
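In code, the selection rule of equations (12) and (13) is a per-pixel `where`; the helper below is a hypothetical name of ours and assumes the coefficient maps and firing time matrices share the image's shape.

```python
def fuse_maps(maps_a, maps_b, T_a, T_b):
    """Per-pixel selection rule of equations (12)-(13): keep the coefficient
    from whichever source fired more often at that location."""
    mask = T_a >= T_b
    return [np.where(mask, a, b) for a, b in zip(maps_a, maps_b)]
```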

The fused cartoon component $s_{c}^{F}$ and fused texture component $s_{t}^{F}$ are computed as

$$s_{c}^{F} = \sum_{m} d_{c,m} \ast x_{c,m}^{F}, \quad s_{t}^{F} = \sum_{m} d_{t,m} \ast x_{t,m}^{F}, \qquad (14)$$

where $d_{c,m}$ and $d_{t,m}$ are the prelearned dictionaries.

Finally, the medical fused image is acquired as

$$F = s_{c}^{F} + s_{t}^{F}. \qquad (15)$$
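Putting the steps together, the sketch below chains the earlier functions into an end-to-end pipeline for equations (10)–(15). How the paper turns the per-filter coefficient maps into a single PCNN stimulus is not spelled out here, so collapsing them via normalized summed magnitudes (`stim`) is our assumption.

```python
def mca_cs_pcnn_fuse(A, B, d_cartoon, d_texture):
    """End-to-end sketch of the MCA-CS-PCNN pipeline, eqs. (10)-(15)."""
    def stim(maps):
        a = sum(np.abs(x) for x in maps)  # aggregate coefficient activity
        return a / (a.max() + 1e-12)      # normalize to [0, 1] for the PCNN
    # Step 1: CSMCA decomposition of both sources, eq. (10)
    xcA, xtA = csmca(A, d_cartoon, d_texture)
    xcB, xtB = csmca(B, d_cartoon, d_texture)
    # Step 2: PCNN firing times driven by coefficient activity, eq. (11)
    TcA, TcB = simplified_pcnn(stim(xcA)), simplified_pcnn(stim(xcB))
    TtA, TtB = simplified_pcnn(stim(xtA)), simplified_pcnn(stim(xtB))
    # Step 3: firing-time selection of coefficients, eqs. (12)-(13)
    xcF = fuse_maps(xcA, xcB, TcA, TcB)
    xtF = fuse_maps(xtA, xtB, TtA, TtB)
    # Step 4: reconstruct the fused layers and combine them, eqs. (14)-(15)
    scF = sum(fftconvolve(x, d, mode="same") for x, d in zip(xcF, d_cartoon))
    stF = sum(fftconvolve(x, d, mode="same") for x, d in zip(xtF, d_texture))
    return scF + stF
```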

3.2. Extension to Anatomical and Functional Image Fusion Based on MCA-CS-PCNN

The proposed MCA-CS-PCNN is extended to anatomical and functional image fusion. Since functional images are pseudo-color images, the YUV color space transform has been shown to be effective in processing them [10, 16]. Specifically, a functional RGB image is first transformed into the Y, U, and V channels. Then, a new Y channel is produced by fusing the original Y channel with the grayscale anatomical image based on MCA-CS-PCNN, and the new YUV image is obtained by merging the new Y channel with the original U and V channels. Finally, the YUV image is converted back to RGB, yielding the color medical fused image. The flowchart of the anatomical and functional image fusion strategy based on MCA-CS-PCNN is shown in Figure 3.
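The color extension reduces to a few lines once the grayscale pipeline exists. The paper does not state its exact conversion matrix, so the BT.601 YUV matrix below, and RGB values as floats in [0, 1], are our assumptions.

```python
# BT.601 RGB-to-YUV matrix (assumed; the paper's exact transform is unstated).
RGB2YUV = np.array([[ 0.299,  0.587,  0.114],
                    [-0.147, -0.289,  0.436],
                    [ 0.615, -0.515, -0.100]])

def fuse_color(gray, rgb, d_cartoon, d_texture):
    """Fuse a grayscale anatomical image with a pseudo-color functional image:
    fuse only the luminance channel, keep chrominance, convert back to RGB."""
    yuv = rgb @ RGB2YUV.T  # per-pixel color transform
    yuv[..., 0] = mca_cs_pcnn_fuse(yuv[..., 0], gray, d_cartoon, d_texture)
    return np.clip(yuv @ np.linalg.inv(RGB2YUV).T, 0.0, 1.0)
```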

4. Experiments

4.1. Experimental Settings

To test and verify the performance of the MCA-CS-PCNN fusion algorithm, ten pairs of medical images, each of size 256 × 256 pixels, are used in the experiments: five anatomical–functional image pairs and five anatomical–anatomical image pairs (Figures 4 and 5). Five representative medical image fusion algorithms are selected for comparison: convolutional sparse representation (CSR) [24], NSCT-based modified spatial frequency and PCNN (NSCT-MSF-PCNN) [14], guided filtering (GFF) [25], cross-scale coefficient selection (CSCS) [26], and sparse representation based on the Laplacian pyramid (LP-SR) [11]. Objective quality evaluation is important for assessing image quality [27–31]. The fusion quality metrics used here are the human perception quality metric [32], the feature mutual information quality metric [33], the spatial frequency quality metric [34], the standard deviation quality metric [11], the nonlinear correlation information entropy metric [35], and the mutual information metric [36]. For all six metrics, higher values indicate better fusion performance.
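Two of these metrics are simple enough to state in a few lines. The sketch below uses the common definitions of spatial frequency and standard deviation; the exact formulations in [34] and [11] may differ in normalization details.

```python
def spatial_frequency(img):
    """SF metric: combined row/column gradient energy of the fused image."""
    rf = np.sqrt(np.mean(np.diff(img, axis=1) ** 2))  # row frequency
    cf = np.sqrt(np.mean(np.diff(img, axis=0) ** 2))  # column frequency
    return float(np.sqrt(rf ** 2 + cf ** 2))

def standard_deviation(img):
    """SD metric: intensity spread around the mean; larger means higher contrast."""
    return float(np.std(img))
```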

4.2. Analysis of Experimental Results

In the example of anatomical–anatomical image fusion, the anatomical information of the bones and soft tissues is retained in the fused images of all six algorithms; still, differences between the fused images can be clearly distinguished, such as blurred focal regions (Figures 6(a), 6(c), and 6(d)) and missing soft-tissue information (Figures 6(b) and 6(e)). Our method obtains better performance than the other methods. The example of anatomical–functional image fusion shows that the fused images obtained by CSR, GFF, and CSCS lose color information (Figures 7(a), 7(c), and 7(d)), while the NSCT-MSF-PCNN and LP-SR algorithms produce poor visual effects, for instance, losing details of the anatomical image (Figures 7(b) and 7(e)). From these comparisons, our proposed algorithm demonstrates clear advantages over the existing algorithms.

Tables 1 and 2 give the objective evaluation results of the proposed MCA-CS-PCNN fusion algorithm and the five comparison methods under the objective fusion quality metrics, with the best result in each row marked in boldface. Table 1 shows the objective evaluation of the fused images for anatomical and functional image fusion. Our values are only slightly lower than those of LP-SR on one metric for the second image pair of Figure 4; on the whole, our method achieves significant superiority. From Table 2, it can be seen that our values are only slightly lower than those of GFF on one metric for the second image pair of Figure 5, while the remaining metrics demonstrate the advantage of our proposed algorithm.

5. Conclusion

This paper proposes a perceptual medical image fusion framework based on morphological component analysis combined with convolutional sparsity and a pulse-coupled neural network, called MCA-CS-PCNN for short. It rests on the visual system property that neuron responses to complex stimuli in the cat visual cortex are represented by sparse coding. To this end, source images are first decomposed into cartoon components and texture components by morphological component analysis, and convolutional sparse representations of the cartoon and texture layers are obtained with prelearned dictionaries. Then, convolutional sparsity is employed to stimulate the PCNN for processing the cartoon and texture layers. Finally, the fused medical image is computed by combining the fused cartoon and texture layers. The experimental results verify that the proposed model produces high performance, superior to state-of-the-art fusion strategies.

Data Availability

The data used to support the findings of this study can be downloaded from http://www.med.harvard.edu/AANLIB/home.html.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (no. 82001912), the China Postdoctoral Science Foundation (2018M642325), and the Xuzhou Science and Technology Program, China (KC19146).