Abstract

Brain tumors are one of the most deadly diseases with a high mortality rate. The shape and size of the tumor are random during the growth process. Brain tumor segmentation is a brain tumor assisted diagnosis technology that separates different brain tumor structures such as edema and active and tumor necrosis tissues from normal brain tissue. Magnetic resonance imaging (MRI) technology has the advantages of no radiation impact on the human body, good imaging effect on structural tissues, and an ability to realize tomographic imaging of any orientation. Therefore, doctors often use MRI brain tumor images to analyze and process brain tumors. In these images, the tumor structure is only characterized by grayscale changes, and the developed images obtained by different equipment and different conditions may also be different. This makes it difficult for traditional image segmentation methods to deal well with the segmentation of brain tumor images. Considering that the traditional single-mode MRI brain tumor images contain incomplete brain tumor information, it is difficult to segment the single-mode brain tumor images to meet clinical needs. In this paper, a sparse subspace clustering (SSC) algorithm is introduced to process the diagnosis of multimodal MRI brain tumor images. In the absence of added noise, the proposed algorithm has better advantages than traditional methods. Compared with the top 15 in the Brats 2015 competition, the accuracy is not much different, being basically stable between 10 and 15. In order to verify the noise resistance of the proposed algorithm, this paper adds 5%, 10%, 15%, and 20% Gaussian noise to the test image. Experimental results show that the proposed algorithm has better noise immunity than a comparable algorithm.

1. Introduction

Tumor is one of the common malignant diseases that endanger human health. According to origin, tumors are generally divided into primary and secondary. Compared with breast, lung, and esophageal tumors, the incidence of brain tumors is relatively low. Compared with the overall incidence of human tumors, it accounts for about 1.4%; however, the mortality rate reaches 2.4% of human tumors [1]. Glioma is the most common primary brain tumor in adults. It is mainly distributed in glial cells and the tissues it infiltrates, and it is the most common malignant brain tumor. According to the nature of tumors, gliomas are generally divided into benign and malignant. Benign gliomas generally grow relatively slowly, patients have a longer survival period, and the long course of disease is the main manifestation of benign gliomas. Malignant gliomas generally grow faster, and the short course is a prominent manifestation of malignant gliomas. If the intracranial lesions can be detected as soon as possible, and the corresponding treatments can be implemented, the health hazards of brain tumors to humans can be reduced. CT or MRI imaging to analyze the pathological state of brain tissue is currently the mainstream method for examining brain tumors. Different imaging techniques have different advantages for tumor diagnosis. Compared with CT imaging, MRI uses a noninvasive imaging method, which can provide the observer with high-quality images without damage and skull artifacts, with clear anatomical structure, and with very good soft tissue resolution. At the same time, intracranial images in any direction can be obtained by adjusting the relevant parameters. In addition, using different imaging sequences, MRI of different angles or modalities of the same tissue can be obtained. This type of image is generally referred to as a multimodal MRI image.

The effective diagnosis of brain tumors requires the successful segmentation of tumors in brain images. Based on the results of the segmentation, the doctor can determine the shape, size, and specific location of the tumor. According to the segmentation results of the tumor in the image, a corresponding treatment plan is given. Due to the increase in the number of patients with brain diseases and the development of intelligent diagnostic technology, the research work based on brain tumors continues to increase. The International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI) began in 2012 and has organized competitions based on multimodal brain tumor segmentation for four consecutive years, greatly promoting the development of brain tumor segmentation technology. It is of great research value and practical significance to improve the diagnosis efficiency by mining potential pathological information of MRI brain tumor images through image processing technology and machine learning methods. However, the tumor is only characterized by gray-scale information on the MRI image, and the edge of the tumor structure and the normal tissue have significant gray-scale similarity. Simultaneously, the size, location, shape, and corresponding expansion of the tumor in the brain tissue will show different states with different patients. These characteristics pose challenges to the development of tumor segmentation technology.

The so-called brain tumor segmentation refers to the process of segmenting various tumor tissues from a variety of conventional brain tissues. In general, the segmentation methods of brain tumor images can be summarized into three categories [2]: purely artificial, semiautomatic, and fully automatic segmentation. Manual segmentation refers to manually drawing the outline of the target tissue. Figure 1 is a schematic diagram of manual segmentation. Manual segmentation is boring and time consuming, so it cannot meet the growing demand for segmentation. In addition, each segmenter has a different segmentation style, which leads to deviations in segmentation results. Although manual segmentation has many disadvantages, manual segmentation has the highest segmentation accuracy so far, and is often used as the ground truth for automatic segmentation. Semiautomatic segmentation is sensitive to initialization. Users need to input certain initialization data to get the final segmentation result. Fully automatic segmentation does not need to set any parameters manually and can automatically locate and segment the tumor area.

There is still a lack of a general method that can process all brain tumor images and obtain satisfactory results currently. Usually, the segmentation method is aimed at specific image data. Reviewing related literature, tumor image segmentation methods can be summarized as follows: (1)Threshold based method. The practicability and segmentation effect of this method are very good. The histogram in the global threshold can be expressed as a bimodal model, and a single threshold can be used to distinguish tumor from background. Reference [3] proposes an unsupervised method to enhance pixel grayscale and utilize it to segment brain tumors in T1c images. If there are multiple types of regions in the image, a multithreshold strategy needs to be added to the segmentation method, called local threshold. For the local mean, it can be obtained by estimating the local statistical characteristics, such as gray average [4] and data Gaussian distribution [5]. Generally, the threshold-based method cannot use all the information of the MRI image, and the segmentation result is relatively rough. Therefore, the threshold-based method in brain tumor segmentation is first applied(2)Area-based approach. Through predefined similarity criteria, in the way of merging neighboring pixels in the intersecting areas, the target MRI brain image is divided into the required subareas. Reference [6] applied region growth to MRI tumor segmentation image segmentation with good results. Reference [7] proposed an improved method of region growth. This method obtains a more exact boundary message by reducing the volume effect. The leak gap that may be generated after the division is also filled to a certain extent. As a morphological method, watershed segmentation represents the target contour edge as a partial watershed, which is widely used in brain tumor segmentation. References [8, 9] proposed a multiscale watershed transformation method. Reference [10] constructed an artificially assisted segmentation method by the hierarchical watershed method. From the principle of the watershed segmentation method, this kind of image edge and region watershed conversion easily produces oversegmentation. In order to solve this problem, some related processing methods have been excavated one after another(3)Pixel classification method. The collected MRI brain tumor data generally has two formats, namely, 2D slices and 3D volume. If it is a brain tumor segmentation based on slice format, its essence is the same as traditional image segmentation. The pixel-based method mainly uses the pixel characteristics of the image, and uses some related classifiers to classify all the pixels in the brain tissue image, so as to achieve the effect of segmentation. Unsupervised classification is mainly represented by clustering [11, 12]. The core idea is to measure the relationship between tumor tissue and other tissues in the tumor image. The supervised classifier [13, 14] mainly uses those labeled training samples to train the relevant parameters in the model, which has reached the optimal tumor segmentation effect [15].(4)Model-based method. Model-based tumor segmentation methods are mainly 3D-oriented volume data, followed by 2D slice data. The most typical are the active contour model [16] and the level set method [17]. On the basis of these two models, tumor segmentation has formed two schools: the segmentation methods based on the generative algorithm and the discriminant algorithm. The generation algorithm uses the unique information of various organizations to predict the information of brain tissue that cannot be captured in the image [1820]. In some generative models, in order to solve the problem of difficult coding of a priori knowledge of tumors, the diseased tissue of the tumor can be modeled as the desired shape [2123], or it can be inferred using the given patient image and the tumor growth model’s possible location of the tumor structure [24]. Discriminant methods generally require a certain size of training samples [2527]. After many trainings, the processing effect of the discriminant model is more robust to the effects of MRI image artifacts and grayscale information. Figure 2 shows the basic flow of model-based tumor segmentation. For effective training, the first step of this type of method is generally to extract local gray-scale differences [28] or gray-scale distribution and other voxel-wise features [29], and then send these features to the discriminant classifier of the model. In order to combine the advantages of discriminant models and generative models, a method called generative discriminant model [30, 31] was proposed

In this paper, the BRATS 2015 competition database is used as the experimental object, and the traditional segmentation method and the sparse subspace clustering method based on sparse representation are used to segment the brain tumor images. The main innovations of this article are as follows: (1)Introduce the sparse subspace clustering algorithm to achieve brain tumor image segmentation. The advantage of this algorithm is to use low-dimensional data to recover and approximate high-dimensional data, effectively reducing the dimension of high-dimensional data while retaining the correlation between the data. The introduction of this algorithm can solve the problem of excessive data dimension(2)This article focuses on the segmentation of MRI brain tumor images under multimodality. In the single-modality image fusion strategy, a simple and fast linear fusion strategy is selected. Before segmenting multimodal images, the image is preprocessed by superpixel segmentation, feature vectors are extracted, and the data dimension is reduced. Experimental results show that for brain tumor segmentation, multimodal brain tumor information can be used as much as possible to obtain more accurate segmentation results

2.1. Multimodal MRI Brain Tumor Image Introduction

Multimodal MRI images are images of the same tissue under different contrasts obtained through different MR development sequences. When tumors and other lesions occur in brain tissue, water molecules existing in free form in brain tumors begin to undergo lesion reactions, such as tissue edema. In Flair and T2 images, the water molecules in the bound state are displayed in the form of high signals. Therefore, it is theoretically feasible to use Flair modal MRI images as the main basis for segmenting the entire tumor. However, due to some special circumstances, the tumor will also show irregular changes in the Flair image. At this time, the image data of the T2 mode can provide additional reference. Figure 3 depicts three different sets of Flair and T2 images. Among them is (1) the Flair image, (2) the T2 image, and (3) the artificially labeled tumor structure image. The data used in this paper are all from the BRATS 2015 [32] database, and the database includes images in four modes: T1, T1c, T2, and Flair.

2.2. Difficulties in MRI Brain Tumor Image Segmentation

There are many difficulties in the segmentation of MRI brain tumor images. These difficulties can be summarized as follows: (1)The most typical problem of MRI comes from the different nonstandard intensity ranges obtained by different scanners. Because of different magnetic field strengths and acquisition protocols, for the same patient, the brain MRI strength values are also different between hospitals(2)The brain tumor itself has no fixed shape or prior knowledge. Brain pathology can appear anywhere in the brain and can have any shape. In addition, the gray value range of this pathology may overlap with the gray value range of healthy tissues, making segmentation of brain tumors more complicated(3)MRI has nonnegligible white Rician noise during the acquisition process [33](4)Uniform organization is often affected by changes in the spatial intensity of each dimension. This is caused by the bias field effect. The MRI bias affects the smoothed low-frequency signal of the image intensity. This problem requires an offset field correction preprocessing step, which usually increases the intensity value around the brain(5)Large tumors or lesions in the brain may distort the overall structure of the brain, making some procedures impossible to perform. For example, a larger tumor may affect the overall symmetry of the brain, making it impossible to calculate the left-right symmetry feature. In addition, brains with large tumors are difficult to register with healthy brain templates

3. Brain Tumor Image Segmentation Based on Sparse Subspace Clustering Algorithm

Sparse representations are widely used in image segmentation algorithms. Sparse representations can effectively reduce the complexity of data operations and bring convenience to the subsequent processing of data. SSC is a clustering algorithm based on sparse representation and subspace clustering [34]. Before segmentation, the target image needs to be preprocessed.

3.1. Image Preprocessing

Before SSC splits an image, the image needs to be split into superpixels. Superpixels are irregular image blocks composed of a series of adjacent pixels with similar characteristics, such as texture, color, and brightness. It replaces a large number of pixels with a few superpixels, which effectively reduces the amount of data that expresses the features of the picture, thereby reducing the complexity of image postprocessing. Superpixel segmentation algorithms are currently divided into two types, one is based on graph theory, and the other is based on gradient descent, such as Simple Linear Iterative Clustering (SLIC) [35]. The segmentation method based on gradient descent belongs to an iterative segmentation method. First, an initial clustering is given, and then the gradient clustering method is used to modify the result of the previous clustering, and iterate continuously until the convergence condition is satisfied. The superpixel rendering using SLIC segmentation is shown in Figure 4.

3.2. Basic Model

The algorithm is to assume that the data is composed of high-dimensional spatial data, and each data can be represented in a low-dimensional subspace. That is, by letting the data in the high-dimensional space be expressed linearly with the data in the low-dimensional subspace, the low-dimensional subspace to which the data belongs can be clearly known, which is beneficial to the clustering operation. The basic framework of sparse subspace clustering is shown in Figure 5.

The SSC model building process is as follows.

Given a set of datasets , the dimension is , located in linear subspaces . The dimensions of the linear subspace are . Then define the matrix where is a matrix of rank composed of the th subspace data. is the permutation matrix. Subspace clustering is essentially to obtain the matrix.

Subspace representation means that every data in matrix can be linearly represented by data in the same subspace except for itself: where . Formula (1) can be written in matrix form as follows: where is a sparse matrix. In order to make the sparse matrix the most sparse, that is, the nonzero values in matrix are minimized, by obtaining the norm to minimize, we use convex optimization to perform the following process:

However, the solution of the norm is an NP-Hard problem in practical problems. Usually the norm is used to replace the norm to solve, so as to convert the subspace representation model to

3.3. Brain Tumor Image Segmentation Based on Sparse Subspace Clustering

Image segmentation is the process of segmenting images into nonoverlapping regions and extracting ROI from them, while sparse subspace clustering is a process used to cluster data of the same class into the same subspace. An image contains multiple target images with a complex texture structure, but the features on the image are composed of multiple low-dimensional subspace data. Therefore, the sparse subspace clustering algorithm can be used to segment the image. First, divide the image to be divided into multiple superpixel blocks, and divide the superpixel blocks of the same target image into the same subspace, so as to achieve the purpose of extracting the target image. The process is shown in Figure 6.

A variety of modal image fusion strategies use linear fusion. Linear fusion is the simplest multimodal MRI brain tumor image fusion method. It is a pixel-level fusion method, and the processing object is pixels. It is mainly to operate the pixel unit in each modal brain image, so as to comprehensively process the pixel information in each modal brain tumor image. Through the linear fusion operation, multimodal brain images can be converted into single-modal brain images containing multimodal brain tumor tissue information. Thus, multimodal image segmentation is converted into single-modal image segmentation, and the operation of multimodal processing is simplified. The specific operation of linear fusion is as follows: where is the fused image; , , and are the pixel values of , , and at position ; and , , and are the weights of each modal image, and meets . Figure 7 is a fusion image of multimodal images. Using the linear fusion operation, we use the following Flair ratio to obtain the fusion image in the figure: . After preprocessing the fused image, SSC can be used to complete the multimodal image segmentation.

The steps of the SSC-based multimodal image segmentation algorithm are as follows: (1)Input image and use the preprocessing algorithm described in Section 3.1 to divide the fused image into superpixel blocks(2)Extract -dimensional feature vectors from superpixel blocks to form a feature matrix (3)Use the basic model of sparse subspace clustering to obtain the sparse coefficient matrix (4)Calculate the similarity matrix , where(5)The clustering result is obtained by using the spectral clustering algorithm

4. Simulation Experiment Analysis

4.1. Experiment-Related Settings

The comparison algorithms mainly include FCM, SVM, and the top 15 results of the Brats 2015 challenge. The experimental data of this paper is Brats 2015 [28]. The database contains data of two types of patients, those with benign tumors and those with malignant tumors, and contains brain image data of 274 patients. Each patient’s brain image data contains Flair images, T1 images, T1c images, T2 images, and golden section results. The size of each modal image is . We randomly selected data from 25 patients with brain tumors. Each patient’s data includes five parts, namely, the Flair mode, the T1 mode, the T1c mode, the T2 mode, and the golden section results. The data size of each mode is . Because the two-dimensional tumor pictures of the same patient are similar, a set of two-dimensional multimodal brain tumor images is extracted from the data of each patient. There was a total of 25 sets of multimodal brain tumor image data. Among them, there are 15 groups of malignant tumor data and 10 groups of benign tumor data.

The performance of the algorithm in this paper mainly depends on the quality of the superpixels. The quality of the superpixels is controlled by the number of the superpixels and the compact factor . In this paper, the SLIC superpixel segmentation method needs to consider the density factor and the number of target superpixel blocks [36]. In order to study the influence of the density factor , the numberof predefined superpixel blocks is 1000 at first. Then, we explore the impact of the change of the compact factor size on the segmentation results. The compact factor leads to a more rigid boundary, while will produce a very flexible boundary, but it will increase the shape and irregularity of the superpixel. Figure 7 is the result of FLAIR image segmentation when the value of is different. By visually checking the superpixel boundary and area, when , the boundary can obtain a better segmentation result.

The next step is to determine the number of target superpixel blocks. Figure 8 shows the result of the FLAIR image segmentation when the value of is 20 and the number of the target superpixel blocks is different. When the compaction factor is fixed at , by changing the numberof the target superpixel blocks, the Dice measure is used to evaluate the formation performance of the superpixels.

Based on the above experimental results, the compact factor in this experiment and the number of superpixels . The fuzzy factor in FCM is 2, and the parameter in SVM .

4.2. Evaluation Index

There are four evaluation indicators commonly used in objective evaluation criteria, namely, the Dice coefficient, the Jaccard coefficient, the false positive rate (Precision), and the true positive rate (Recall). The four evaluation indicators are shown in Table 1.

4.3. Simulation Results and Analysis

Table 2 shows the evaluation index results of the algorithm for different groups of multimodal image segmentation results, and Table 3 shows the top 15 segmentation results of the Brats 2015 challenge. It can be seen from the comparison of the data in the table that the average Dice index of this algorithm is 0.8577. Compared with the top 15 of the Brats 2015 competition, the accuracy is not much different, and it can even exceed the results of several of the rankings. The average Precision index is as high as 0.9615, which is a big advantage compared with the top 15 data. Compared with the top 15, the true positive rate is slightly inadequate. This is because the top 15 competition algorithms use a deep learning algorithm to segment the tumor in three dimensions and use the three-dimensional information of the brain tumor. Considering comprehensively, the algorithm in this paper can use the two-dimensional information of brain tumors to obtain a segmentation accuracy similar to the top 15 algorithms in the competition. It can be seen that the algorithm in this paper has certain value.

Table 4 shows the comparison of the average evaluation indexes of these three algorithms in 25 sets of data tests. From the comparison of the data in the table, we can see that the SSC algorithm used in each index is greatly improved compared to the other two algorithms.

Figure 9 is a comparison of the histograms of the various evaluation methods on the four evaluation indicators. From the figure, the greater advantages of the SSC algorithm can be clearly found.

In order to verify the noise resistance of the SSC algorithm, this paper adds 5%, 10%, 15%, and 20% Gaussian noise to the original image. The segmentation results after noise addition are shown in Tables 58. From the changing trends of the values of the four evaluation indicators in Tables 58, it can be analyzed that the tumor segmentation effect decreases with increasing noise. The greater the noise content, the worse the segmentation effect. This is completely consistent with theory.

Table 9 gives a comparison of the segmentation performance of the three algorithms under different noise ratios. Each data in the table is the average value of the above 25 sample data after division. Figure 10 shows the changing trend of the segmentation accuracy of the three algorithms with increasing noise. It can be concluded from Table 9 and Figure 10 that the SSC algorithm is relatively better in terms of the antinoise performance of the three algorithms. As the amount of noise increases, the performance of FCM declines the fastest, followed by SVM, and the relative decline of SSC is smaller. This further illustrates the feasibility and reference value of the SSC algorithm selected in this paper.

5. Conclusion

Different features have different effects on tumor segmentation results. In order to make better use of multimodal brain tumor image information, this paper proposes an SSC-based multimodal brain tumor image segmentation method. First, linear fusion is used to fuse multiple single-modality brain MRI images into one image to be processed; secondly, superpixel features are extracted to construct a feature matrix; and finally, a sparse subspace clustering algorithm based on sparse representation is used to complete the segmentation. Using Brats 2015 competition data to experiment with the proposed method, the results show that the method used can well integrate the tumor information of the multimodal images and obtain good segmentation results. After adding different proportions of noise, the segmentation performance of the proposed algorithm decreases significantly slower than that of the comparison algorithm, which also verifies that the proposed algorithm has good noise resistance. However, the method used in this paper has certain limitations. It needs to optimize the weights of various modal data fusions, which is very time consuming.

Data Availability

The labeled dataset used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

Acknowledgments

This work was supported by the Natural Science Foundation of Jiangsu Province (BK20131097), the Jiangsu Province Higher Vocational Education Industry-Education Integration Platform Construction Project, the Jiangsu Province High-Level Backbone Professional Construction Project, the Jiangsu Province 333 High-Level Talent Cultivation Project ((2016) III-2887), and the Jiangsu Province Key Education Reform Project “Internet of Things Application Technology” Cross-Border Integration “Exploration and Practice of Project-Based Curriculum System Optimization” (2017JSJG077).