Introduction

Stroke is one of the life-threatening cerebrovascular diseases that occurs due to the abrupt termination of blood supply to the brain. As the brain tissues cannot receive oxygen and nutrients, they begin to die, which can result in substantial lifetime disability and even death [1, 2]. Stroke is the second leading cause of lifetime disability as well as death worldwide [3]. Intracerebral haemorrhage (ICH) is one of the devastating stroke subtypes with poor outcome and high mortality rate within the first year. Hence, early diagnosis and management of ICH are essential for improved functional outcomes and to save patient lives [4].

Computed tomography (CT) is the preferred modality of choice for assessment of haemorrhage due to its wide availability, lesser cost, rapidity, and high sensitivity for haemorrhage [5]. In CT imagery, haemorrhagic lesions can be characterised as brighter areas as compared to the surroundings. Manual segmentation of haemorrhage from CT imagery remains challenging due to its uneven boundaries, overlapping pixel intensities, noise, and artefacts in CT scans. Even though manual demarcation and estimation appears accurate, it is a time-consuming task, heavily dependent on the expertise of clinicians, and subject to intraobserver and interobserver variability [6,7,8,9]. Furthermore, the irregularities and complexities associated with varied shapes and sizes of haemorrhagic lesions with time will also make the process more difficult and strenuous. Moreover, the process can become a laborious and daunting task, particularly in large clinical settings, which can introduce inadvertent error and delay [10,11,12,13,14]. This can cause additional morbidity and even mortality to the patient. Thus, development of automated methods which will support clinicians for making rapid, reliable, and efficient diagnosis of haemorrhage from CT images would be highly beneficial.

Computer aided diagnosis (CAD) for haemorrhage detection

CAD systems can play an important role in early diagnosis of haemorrhage. CAD offers quick and consistent results with improved accuracy, thus enabling clinicians to make better treatment planning and strategic decisions [15, 16]. The triage workflow can also be streamlined and optimized by automating the routine scanning of CT images with little or no human intervention. The following subsection discusses the existing works related to detection of haemorrhage using CAD. Several semi-automated and automated methods have been developed to detect ICH from CT images. Prakash et al. [17] applied the modified distance regularized level set (MDLRSE) for the detection of ICH and intraventricular haemorrhage (IVH) in CT scans. This two-step method involves shrinking and expansion and is highly sensitive to several DRLSE parameters; it requires a long computation time. Shahangian and Pourghassem [18] presented another modified version of the DRLSE method to detect small and uncertain haemorrhages. A hierarchical classifier was further used to distinguish the segmented regions into three different types of haemorrhage. However, control parameters require proper adjustment for accurate segmentation. Gillebert et al. [19] developed a voxel-based outlier detection procedure using a set of control images to classify infarct and haemorrhage in stroke CT slices. They reported a dice similarity index (DSI) of 0.52–0.89 for 500 CT scans. Muschelli et al. [20] proposed a fully automated method based on voxel-level probabilities to segment the haemorrhage. They obtained a higher median DSI of 0.899 using the random forest algorithm on 112 3D CT scan images. Al-Ayoob et al. [21] presented an automated segmentation method using Otsu’s thresholding, morphological operations, and a region-growing technique. The method was tested on 76 CT images, and achieved a 92% accuracy for classification. Zhang et al. [22] applied adaptive thresholding and case-based reasoning to detect ICH and IVH in CT head scans. They achieved a segmentation accuracy of 95% for a set of ten CT images with different haemorrhage sizes ranging from small to large. Kumar et al. [23] combined fuzzy c-mean (FCM) clustering and DRLSE for haemorrhage segmentation using CT images. An entropy-based thresholding is applied to automatically selected FCM clustered imagery, and DRLSE is applied to the resultant images for ICH segmentation. They reported an average sensitivity of 87.06% for 35 brain CT images. Phan et al. [24] developed another method using Hounsfield values and k-nearest neighbour to classify four types of haemorrhage. The method obtained an accuracy of 93.33% on 150 images. Gautam and Raman [25] have used white matter fuzzy c-means (WMFCM) and wavelet based thresholding to detect and localise haemorrhage in 20 CT images. Nag et al. [26] has delineated haemorrhagic regions in 48 CT images by combining an autoencoder and the Chan Vese model. Foo et al. [27] developed a CAD system using multiple thresholding and symmetry detection to identify acute ICH in 108 CT volumes.

From the above studies, it is evident that a limited dataset of CT images has been utilized for ICH detection and classification. Furthermore, a few of the existing techniques involves intricate engineering techniques such as skull stripping, rigid body transformation and feature extraction based on intensity of voxels and information on local moments. It can also be noted that some of the segmentation methods are time consuming and complex due to the intricacies associated with the initialisation of mathematical models and also require manual intervention. In addition, some of these methods are based on hardcoded logic and presumptive rules, which will limit their application to certain specific scenarios. Hence, there is a need for a quick, robust, efficient, and cost-effective automated method which can accurately detect and classify ICH for a large set of CT images. The main goal of this paper is to identify the ICH very quickly in the CT images using efficient feature extraction method. The contributions of the paper are two-fold:

  • Entropy based non-linear features are efficiently extracted on a significantly large dataset to characterise the haemorrhagic brain stroke using CT images.

  • ICH risk index is formulated using only three significant features. This helps to perform the categorisation using a unique number.

The remainder of the paper is prepared as follows: “Materials” designates the used materials and the acquisition method. The framework of the proposed assessment tool is described in “Framework of our approach”. The experimental results and its analysis is discussed in “Experimental results and discussions”. Finally, we conclude the paper in “Conclusion”.

Materials

The CT images were collected from ICH patients who are admitted for the CT scan in the Department of Radio-diagnosis of Kalinga Institute of Medical Science (KIMS), Bhubaneswar, India. The images were retrospectively acquired on a Digital p64-Slice CT scan machine in consultation with the radiologist. Based on visual inspection by the experts, the presence or absence of lesions was considered for the image collection. A total of 1603 non-contrast head CT axial images were collected from 48 patients, of which 784 were normal and 819 were ICH. The CT dataset was properly anonymized by an individual who was not part of the study. The CT slice which includes the largest area of haemorrhage was selected and saved in JPEG format. Figure 1 displays utilized samples of CT images in the current study.

Fig. 1
figure 1

Specimens of brain CT images used in the current study

Framework of our approach

We propose a new automated technique that can classify normal and haemorrhagic CT imagery using a minimal set of highly discriminating non-linear features along with normal classifiers. The proposed brain pathology identification system utilizes the following stages: (1) Feature generation, (2) feature organization, and (3) classification. In the first stage, brain imagery is pre-processed and nonlinear features are extracted with seven different types of entropy measures. Thereafter, features are rearranged using Student's t test. Finally, the significant features are categorized using the probabilistic neural network classifier. The details of the proposed technique are explained in the following sections. The overview of the proposed technique is illustrated in Fig. 2.

Fig. 2
figure 2

Overview of the proposed technique

Feature generation

This stage generates the features to characterise brain CT imagery. It involves pre-processing and nonlinear feature extraction stages, which are briefly explained below.

Pre-processing

The main aim of the initial pre-processing phase is to improve image quality for further processing. This phase involves removal of noise and unwanted regions from the input images, which will contribute to an overall improvement in the performance of the automated system. The following steps are applied to each input image as part of pre-processing. (a) A mask is initially created to remove the skull from the CT image [28], (b) contrast limited adaptive histogram equalisation (CLAHE) is applied to enhance the contrast and intensity of the image [29], and (c) the mask is then applied to the thresholded image to extract the brain region [28]. Figure 3 depicts the output of the pre-processing stage.

Fig. 3
figure 3

Pre-processed brain CT images

Non-linear feature extraction

Morphological alteration in brain tissue can be captured by automated extraction of key non-linear texture-based features. In this work, textural features are extracted using statistical moments of the image gray level histogram. Entropy is one such robust textural descriptor which can be used to accurately classify normal versus abnormal images. Entropy indicates the degree of randomness in the intensity distribution of a grayscale image. Consider an image f(x, y) with a gray level histogram hi where i = 0,1,…L − 1 denotes the various gray levels present. Then the normalised histogram Hi of image f with dimensions M and N in the X and Y directions is given as:

$${H}_{i}= \frac{{h}_{i}}{M\times N}.$$
(1)

Various entropy measures for textural feature extraction are as follows:

Shannon entropy (Shan.entropy) [30]: It is defined as:

$${S}_{e}= -\sum_{i=0}^{L-1}{H}_{i}{\mathrm{log}}_{2}{H}_{i}.$$
(2)

Rényi entropy (Ren. entropy) [30, 31]: It is defined as:

$${R}_{e}= \frac{1}{1-\propto } {\mathrm{log}}_{2}\sum_{i=0}^{L-1}{H}_{i}^{\propto } \quad \mathrm{where} \; \propto \ne 1, \propto >0,$$
(3)

where \(\propto \) is the diversity index, and it is considered that \(\propto =3\).

Kapur entropy (Kap. entropy) [30, 31]: It is defined as:

$${K}_{e}= \frac{1}{\beta - \alpha }{\mathrm{log}}_{2}\frac{\sum_{i=0}^{L-1}{H}_{i}^{\alpha }}{\sum_{i=0}^{L-1}{H}_{i}^{\beta }} \quad \mathrm{where} \; \alpha \ne \beta , \alpha >0, \beta >0,$$
(4)

where \(\alpha and \beta \) are diversity indices with \(\alpha \ne 1\), \(\alpha + \beta -1 >0\), and \(\alpha \) = 0.5 and \(\beta \) = 0.7.

Yager entropy (Yag. entropy) [31]: It is defined as:

$${Y}_{e}=1- \frac{\sum_{i=0}^{L-1}\left|2{H}_{i}- 1\right|}{\left| M \times N\right|}.$$
(5)

Vajda entropy (Vaj. entropy) [32]: It is defined as:

$${V}_{e}= \frac{1}{1- \alpha }{\mathrm{log}}_{2}\frac{\sum_{i=0}^{L-1}{H}_{i}^{\alpha }}{\sum_{i=0}^{L-1}{H}_{i}}.$$
(6)

Maximum entropy (Max. entropy) [33]:

Assume (i, j) where i is the gray level of a pixel and j is the average gray level of its neighborhood in the image f(x, y) and p(i, j) is the gray level probability. Supposing that the threshold of maximum entropy is set at [s, t] gray level pair, then the target segmented areas will consist of pairs (i1, j1) where i1 = 0,1,..,s and j1 = 0,1,..,t and background areas will have pairs (i2, j2) where i2 = s + 1,..,L − 1 and j2 = t + 1,..,L − 1. Then the total entropy H(s, t) can be computed as,

$$H\left(s,t\right)=\mathrm{lg}{p}_{1}\left(s, t\right){p}_{2}\left(s,t\right)+\frac{{h}_{1}\left(s, t\right) }{{p}_{1}\left(s, t\right)} +\frac{{h}_{2}\left(s, t\right) }{{p}_{2}\left(s, t\right)},$$
(7)
$${h}_{1}\left(s, t\right)= -\sum_{i=0}^{s}\sum_{j=0}^{t}p\left(i, j\right)\mathrm{lg}p\left(i,j\right) , {h}_{2}\left(s, t\right)= -\sum_{i=s+1}^{L-1}\sum_{j=t+1}^{L-1}p\left(i, j\right)\mathrm{lg}p\left(i,j\right) ,$$
$${p}_{1}\left(s,t\right)= \sum_{i=0}^{s}\sum_{j=0}^{t}p\left(i, j\right) , { p}_{2}\left(s,t\right)= \sum_{i=s+1}^{L-1}\sum_{j=t+1}^{L-1}p\left(i, j\right).$$

The pair (s*, t*) with highest entropy will be selected as the optimal threshold [33], such that:

$$ \left( {s*,t*} \right)\, = \,\max \, \left( {H\left( {s,t} \right)} \right) \, \quad {\text{where}}\; \, s\, = \,0,1,..L - 1\quad {\text{and}}\quad t\, = \,0,1,..L - 1. $$
(8)

Log energy entropy: It is defined as:

$${L}_{e}= \sum_{i=0}^{L-1}\mathrm{log}\left({H}_{i}^{2}\right).$$
(9)

Feature organization

Student's t test is used for optimal feature selection. It measures the statistical difference between two sets. This method helps to compute a ratio between the difference of the two class means and the variability of the two classes [34]. The t value indicates the significance of the difference between the two groups, and a significant p value denotes that the sample data does not occur by chance. The t value and p value will be calculated for the features of both classes, and better ranking of features can be done by selecting a higher t value with corresponding lesser p value [35].

Classification

To classify normal versus ICH images, we have used three conventional classifiers, namely, k-nearest neighbour (k-NN), probabilistic neural network (PNN) and support vector machine (SVM) to evaluate feature performance.

k-Nearest neighbour

k-NN is a supervised learning method which assigns a class label to the unknown sample based on its resemblance with the known sample [36]. The k-nearest points in proximity to the unknown data are evaluated, and the most common class among k-nearest neighbours will be the class label for the unknown data [37, 38]. The distance criteria used to measure the resemblance of new test data with the known data is the Euclidean distance. The set of samples with accurate classification should be selected as neighbors to predict the class for the new test data [38].

Probabilistic neural network

PNN is a multiclass classifier which uses a kernel discriminant analysis algorithm to classify the new test data into one of the various classes [39]. This multi-layered feedforward neural network architecture consists of four layers, namely the input layer, pattern layer, summation layer, and output layer [40].

Support vector machine

This supervised classifier is aimed at determining the hyperplane which maximises the margin among the different input classes, which are viewed in n-dimensional space [41]. The hyperplane that forms the maximum separation between the two input classes is selected for classification. Initially, SVM is designed for two-class problems and was extended to address multi-class problems. The Radial Basis Function (RBF) and polynomial kernel are the most frequently used kernel functions utilized by the classifier to solve non-linear boundary problems [42].

ICH indexing

Indexing is employed by several research groups to demonstrate the implication of generated features. As a result, a single number is composed to discriminate healthy versus pathology images [43,44,45,46,47,48,49,50,51]. Hence, we have developed a novel ICH index, which is formulated using only three clinically significant features. The mathematical equation for the ICH index is given as

$$\mathrm{ICH index}=\left(0.2{\mathrm{log}}_{10}{F}_{1}\right)-\left(2.75 \times {F}_{2}\right)+\left(5\times {F}_{3}\right),$$
(10)

where \({{\varvec{F}}}_{1}\) is Max. entropy, \({{\varvec{F}}}_{2}\) is Yag. entropy, and \({{\varvec{F}}}_{3}\) is Kap. entropy. The above equation is developed with the assistance of mathematical simulation. All the constant values and the equation itself are formulated empirically to obtain a maximum separation between the normal and ICH classes using a single index value.

Experimental results and discussion

The proposed model is tested on a set of 1603 axial CT images (normal: 784, ICH: 819). Initially, the brain tissue was separated from non-brain tissues (such as skull, fat, etc.) from all the images by thresholding in the pre-processing stage to obtain better classification results. Then, the entropies of each contrast enhanced image are calculated, i.e., in total seven entropy features. Hence, a complete feature set for these brain CT images is generated, and the significance is tested using t values, as shown in Fig. 4. These features are arranged in order of decreasing t value for classification. A further set of seven features are input to the classifier, such as {Max. entropy and Yag. entropy}, {Max. entropy, Yag. entropy, and Kap. entropy},…….., and {Max. entropy, Yag. entropy, Kap. entropy, Ren. entropy, Shan. entropy, Log entropy, and Vaj. entropy}. The classifiers employed in this study are k-NN, PNN, and SVM. The parameters for system evaluation are computed using true positive and negative (TP and TN), and false positive and negative (FP and FN) statistical metrics. For generality, tenfold cross-validation is used. The average value of the tenfold is taken to assess the system performance. k-NN achieved an accuracy of 97.06% for k = 5, PNN achieved an accuracy of 97.37% for \(\sigma =0.03\), and SVM with RBF (i.e., radial basis function) kernel achieved an accuracy of 97% using only six features. Table 1 shows the obtained maximum results for various classifiers using only six features. The complete algorithm is executed under the MATLAB environment using a standalone personal computer.

Fig. 4
figure 4

The significance level of extracted features

Table 1 The performance of our approach using six features

Discussion

The main goal of this study was to develop a quick-assessment tool for ICH estimation with CT brain imagery. In the current study, the identification of ICH is explored, with an accuracy of 97.37%. It was observed that the entropies are efficiently used to analyse medical data, i.e., 1D signals and 2D signals [52,53,54,55]. Hence, seven entropies were used in our study. These entropies were able to capture the sudden changes in the pixular distribution. They quantify these variations so as to suitably discern the presence of intracerebral haemorrhage in brain CT imagery. It is observed that the system achieved a maximum performance using only six features. Table 2 shows the means and standard deviations (SD) of these features.

Table 2 Features are arranged based upon p value

It is noted that the first six features are statistically significant with a p value < 0.05, and these features are arranged in descending order using t value. These entropies powerfully distinguish brain CT imagery, and can be analysed using a box plot as shown in Fig. 5. It is noted from Fig. 5 that the range of distributions of the entropy features to characterise brain CT imagery differ. Thus, the features are able to provide clear structural information about imagery and play a key role in performing accurate classification. Therefore, the combination of all six features produces a remarkable performance, shown in Fig. 6. As observed in Table 1, PNN clearly distinguishes normal versus ICH imagery efficiently, with an accuracy of 97.37%, an achieved sensitivity of 96.94%, and a specificity of 97.83%. Both k-NN and SVM classifies the normal versus abnormal group with a sensitivity of 97.43 and 96.21% as compared to PNN. It is also observed that the system reached a PPV of 97.90%. Figure 6 reveals that the performance of the proposed algorithm increases monotonically with the number of features incorporated. To the best of our knowledge, this is the first method which deploys a minimal set of highly discriminative entropy-based non-linear features to classify a large dataset of 1603 CT images. The sensitivity of the system showed that it is robust and can support doctor’s decisions at a high level. Hence, we expect such tools will be incorporated in hospital settings for improved interpretation.

Fig. 5
figure 5

Box plot of all features having p value < 0.05

Fig. 6
figure 6

Feature-by-feature performance using the PNN classifier

To perform improved discrimination between normal and ICH classes, we have introduced an ICH index using Max. entropy, Yag. entropy, and Vaj. entropy. Figure 7 shows the plot of ICH index for the two classes. It can be observed from Fig. 7 that the range of ICH indexing values provides a clear distinction between normal versus ICH classes. This helps to provide quick and accurate results using a single quantity. Hence, we can conclude as follows. From the result, it is evident that the false positive and negative values are 17 and 25 samples, respectively. It is noted that the specificity approaches 97.83%, which reduces doctor workload by approximately 50%. Furthermore, only three features are employed to develop ICH risk indexing. It is noted that for ICH, mean values of Max. entropy and Kap. entropy are high when compared to normal brain images. This helps to formulate a unique equation to enhance the level of discernment between the two classes. Furthermore, the proposed methodology can be used to analyse medical images with various modalities. In the future, we would like to develop a generalised risk index using a greater number of images.

Fig. 7
figure 7

Error plot of ICH indexing for normal and ICH categories

Several studies have designed a CAD tool for assessment of ICH using brain CT imagery [56,57,58]. A summary of these methods is shown in Table 3. Most of the approaches are segment-based; however the proposed approach is feature-based. Hence, it avoids localization of the region of interest. Our proposed method requires minimal features to model a large set of data. After validating the algorithm with a greater number of subjects, the proposed method can be used as an assistive tool for clinicians in hospital and polyclinic settings. In the future, we intend to extend this work by incorporating deep learning models with an even larger data set, as deep learning models outperform all conventional machine-learning models [59,60,61,62]. Figure 8 shows the proposed architecture for future digital healthcare using the Internet of Things (IoT) for detecting ICH, wherein patients will receive an instant diagnostic result to their mobile phone numbers through cloud-based services. This will automatically reduce the overall turnaround time for diagnosis and decision-making.

Table 3 Performance comparison of different techniques
Fig. 8
figure 8

Proposed architecture of future digital healthcare for detecting ICH

To summarize the advantages of the proposed model:

  • It is a feature-based method requiring a minimal number of features, and can be implemented in systems with limited resources. It avoids the localization of clinical features for brain CT images.

  • Speedup of the estimation is done using ICH indexing, which helps to categorise brain CT imagery with a unique value.

Conclusion

We have developed a fully automated system for early diagnosis and management of intracerebral haemorrhage in CT imagery. The technique, which consists of extraction of a minimal set of entropy features and supervised learning methods (k-NN, PNN and SVM), can clearly discern normal versus ICH imagery. Four evaluation metrics were used to assess system performance, and the PNN was found to successfully classify CT imagery with an accuracy of 97.37%. This shows that our approach can perform better as compared to existing state-of-the-art methods. Furthermore, the developed ICH indexing offers a swift, cost effective, and highly accurate solution for identification of ICH lesions. Moreover, this method simplifies the entire diagnostic procedure, and enables doctors to analyse a large number of CT scans in quick succession with highly accurate and consistent results. We would like to extend this work by incorporating more subjects, as well as by including additional features and deeper convolutional neural network techniques.