Abstract
Nowadays, with the growth of public network usage, medical information is transmitted among hospitals. A watermarking system can help protect the confidentiality of medical information distributed over the Internet. In medical images, regions of interest (ROI) contain diagnostic information. The watermark should be embedded only into non-region-of-interest (NROI) areas so that diagnostically important details remain undistorted. Recently, ROI-based watermarking has attracted the attention of the medical research community. The ROI map can serve as an embedding key that improves confidentiality protection. However, in most existing works, the ROI map used for embedding must be sent as side information along with the watermarked image. This side information is a disadvantage and makes the extraction process non-blind. Also, most existing algorithms do not recover the NROI of the original cover image after the watermark is extracted. In this paper, we propose a framework for blind diagnostically-lossless watermarking, which iteratively embeds only into the NROI. The significance of the proposed framework is that it satisfies the confidentiality of patient information through a blind watermarking system while preserving the diagnostic/medical information of the image throughout the watermarking process. A deep neural network recognizes the ROI map in the embedding, extraction, and recovery processes. In the extraction process, the same ROI map used for embedding is recognized without requiring any additional information. Hence, the watermark is blindly extracted from the NROI. Furthermore, a three-layer fully connected neural network detects distorted NROI blocks in the recovery process so that they can be restored to their original form. The proposed framework is compared with one lossless watermarking algorithm.
Experimental results demonstrate the superiority of the proposed framework in terms of side information.
1 Introduction
Transmission of Electronic Patient Records (EPR) among medical research centers, schools, and hospitals is necessary for educational purposes and e-healthcare applications. Many applications, such as teleconsulting, telediagnosis, and remote surgery, require EPR. An EPR encompasses confidential information about the patient, the medical situation, diagnosis, and treatment [8]. Since EPR is transmitted over a public network such as the Internet, confidentiality protection of EPR is a real concern in the Health Information System (HIS). Watermarking is one solution to the confidentiality protection problem in HIS. Besides watermarking, various other methods have been proposed to improve the confidentiality protection of EPR. One approach is to embed encrypted EPR into a cover image using encryption algorithms such as chaotic encryption [25] and compressive sensing [34]. Another scheme is to insert the EPR into selected areas of the cover image rather than the whole image and use the binary location map as a key for embedding [9, 36]. In the extraction process, the binary key used for embedding is required to extract the embedded EPR.
In watermarking systems, capacity and imperceptibility are two vital concerns [10]. One goal is to maximize the embedding capacity. Another is to prevent the watermarked image from suffering perceptible distortions after the embedding process; imperceptibility means that the watermarked image should be perceptually indistinguishable from the original cover image. A further concern in watermarking systems is the concept of blindness [10]. Blind systems have been thoroughly investigated [6, 12, 23, 30], and they generally incur higher complexity. However, blind systems are practically superior to non-blind systems, which require various kinds of side information for extraction or recovery on the receiver side [3, 17, 31].
Due to the rapid growth of machine learning and artificial intelligence in the last decades, several research works have used machine-learning tools for watermarking [2, 13, 18, 20, 22, 33]. Some studies propose prediction-based watermarking methods for natural images [22, 33]. In natural images, pixels are highly correlated with their neighbors and can be predicted from neighboring values. The prediction process may use various machine-learning tools such as the Extreme Learning Machine (ELM) [33] and Lagrangian Support Vector Regression (LSVR) [22]. Heidari et al. [13] embed the watermark data redundantly in multiple spectral zones; for extraction, they use an SVM to detect the zone with minimum distortion, from which the watermark is extracted. Abdelhakim et al. [2] use K-Nearest Neighbor (K-NN) regression to find the optimum value of the embedding parameter. Among machine-learning tools, deep Convolutional Neural Networks (CNNs) have become popular in many computer vision applications, such as object detection [27], pattern recognition [39], image classification [15], and watermarking [18, 20]. For instance, Kandi et al. [18] use auto-encoders for feature extraction; in their method, watermark data is embedded in the feature space generated by the auto-encoder, rather than in a transform domain such as DCT.
Medical images are divided into two regions, ROI and NROI. Since the ROI includes critical information for diagnostic purposes, even small distortions of the ROI may cause problems in medical diagnosis. On the other hand, the NROI contains nonessential medical information; hence, small deformations of the NROI are tolerable and do not cause serious issues. In the watermarking systems introduced by [2, 3, 13], the whole cover image is considered for embedding, and the distortion caused during embedding is irreversible. Two approaches are used in medical watermarking to ensure that diagnostic information is either not distorted during watermarking or that undesired distortion can be restored. The first approach is ROI-based watermarking, in which the cover image is divided into ROI and NROI, and the watermark is embedded only in the NROI. Thus, the ROI remains intact, and only the NROI is distorted during embedding. The second approach is lossless watermarking, in which the whole cover image is considered for embedding with lossless methods, so that in the recovery process the original cover image is recovered without any loss of information.
One limitation of ROI-based watermarking approaches is that they cannot detect the ROI map on the receiver side. Consequently, the ROI map needs to be sent as side information to the receiver for the extraction of the watermark [11, 32]. Hence, regardless of their segmentation method, they are non-blind algorithms, which require side information for the extraction of watermark data or the recovery of the original cover image. On the other hand, some other studies [7, 26, 37] do not send an ROI map as side information. Therefore, they are blind algorithms, and the watermark data can be extracted solely from the watermarked image. Yang et al. [37] and Chaitanya et al. [7] embed the ROI map inside the watermarked image to make the scheme blind. However, the watermarking capacity is reduced, as the embedded ROI map occupies part of the data capacity. The most relevant research to our work is [19]. Their algorithm recognizes the ROI map from the watermarked image on the receiver side and does not need to send an ROI map as additional information. They segment the image with Otsu's method and then utilize histogram shifting for embedding the watermark. This embedding method processes only pixel values close to the peak bin; hence, the ROI map is not changed during the embedding process and can be recognized before extraction. Despite the elegant idea suggested in this work, it is only applicable to the specific watermarking method utilized there and cannot be generalized to other watermarking approaches. Furthermore, the method of [19] lacks a wide variety of experimental evaluations, as it is only tested on three medical image samples.
In lossless watermarking approaches, the whole cover image is distorted during embedding, since all regions of the image are considered for embedding. However, due to the use of lossless methods, the original cover image is entirely recovered after watermark extraction [4, 5, 14, 16, 21, 35, 36, 38]. Despite complete recovery and extraction on the receiver side, some of these methods are non-blind, meaning that some side information from the embedding module is required by the extraction or recovery modules on the receiver side [4, 5, 21, 36, 38].
In this paper, we propose a recursive method for blind diagnostically-lossless watermarking in medical applications, named BlessMark. We use a deep network for segmentation and generation of the ROI map. The watermark is embedded only in NROI blocks within a novel iterative scheme. Therefore, sensitive medical information remains intact, which leads to a diagnostically-lossless watermarking system in which only the NROI is distorted during the embedding process. The proposed framework is blind, as neither the ROI map, the original cover image, nor any information about the watermark is transmitted to the receiver side. The utilized segmentation procedure further improves the confidentiality of the embedded information, since the proprietary network weights are not publicly known. In other words, the ROI map may be considered a key for confidentiality, since watermark data is extracted solely from NROI blocks.
Furthermore, we train a classifier to detect distorted NROI regions. Hence, distorted areas of NROI are mostly recovered to the original form.
To test our proposed framework, we use a simple embedding method in the DCT domain. The choice of the DCT domain for embedding is just a convenient option for the proof of concept. Hence, different embedding methods in other transform domains, such as DWT and Hadamard, may be applied. Also, to evaluate the framework, we use a CNN for ROI segmentation and a three-layer fully connected neural network for the detection of distorted NROI blocks.
The main contributions of this work are as follows:
1) Introducing a framework that can be used as a platform for applying a desirable watermarking method to any medical image by utilizing various network structures in different transform domains.
2) In the extraction process, the same ROI map, which is used for the embedding process, is generated without any additional information. Therefore, the watermark is blindly extracted from NROI blocks.
3) NROI blocks, which are modified due to the embedding process, are recovered to their original state wherever possible.
The remainder of the paper is organized as follows: details of the proposed framework are explained in Section 2, and the experimental results are presented in Section 3. Finally, Section 4 concludes the paper.
2 General Watermarking Framework
In this section, we go through the technical details of the proposed watermarking framework, BlessMark. As shown in the block diagram of Fig. 1, the framework is composed of three separate modules for the embedding of the watermark, the extraction of the watermark, and the recovery of the original cover image. A common core block, the ROI segmentation network, is responsible for segmenting the ROI regions and generating the ROI map for the embedding, extraction, and recovery modules. The problem is that embedding the watermark data into the image may cause distortions that affect the segmentation results. We embed the watermark data into the NROI, so the ROI map is the key for the extraction process. It is crucial to obtain the same segmentation on the watermarked image; otherwise, accurate extraction of the watermark data is not possible. To this end, we propose a novel iterative embedding scheme based on the ROI segmentation function, designed so that the ROI segmentation network detects the same ROI map on both the transmitter and receiver sides. The ROI map is automatically recognized on the receiver side without requiring any additional information. Therefore, the ROI map works as a key for confidentiality, as a third party cannot recognize it without access to our proprietary segmentation tool.
On the other hand, because the watermark data is embedded in the NROI, crucial medical information in the ROI remains intact. Another core block of the framework is the distortion detection network used in the recovery module. This network is responsible for detecting the blocks distorted by the embedding process, so that they can be recovered in the next step.
In the following sections, we elaborate on the details of each block separately. We first introduce the proposed ROI segmentation network. In Sections 2.2 and 2.3, embedding and extraction modules are described. Details of the distortion detection network and the recovery module are presented in Section 2.4.
2.1 ROI segmentation network
The ROI segmentation unit is responsible for distinguishing ROI pixels from NROI pixels and for generating the ROI block map based on the detected ROI pixels. A block is considered NROI if all of its pixels are detected as NROI; otherwise, the block is considered ROI. In this work, we use a CNN structure inspired by U-Net [29], without max-pooling, up-sampling, and concatenation layers, for the segmentation of the ROI pixels. Figure 2 demonstrates the structure of the utilized network. As shown in Fig. 2, the ROI segmentation network is composed of 11 convolutional layers. The input to the network is a small m × m image block, and the block size remains constant across all the convolutional layers. Hidden layers consist of 3 × 3 convolutions followed by the ReLU (Rectified Linear Unit) activation function. At the final layer, a 1 × 1 convolution is applied, followed by the same ReLU activation function. The number of channels in each layer is shown on top of each layer in Fig. 2. The network output represents a segmentation probability map for the pixels of the input block. This probability map is binarized using a 0.5 threshold.
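The block-map rule above (a block is NROI only if every one of its pixels is NROI) can be sketched as follows; `roi_block_map` is a hypothetical helper, assuming image dimensions are multiples of the block size m:

```python
import numpy as np

def roi_block_map(pixel_map: np.ndarray, m: int) -> np.ndarray:
    """Derive the ROI block map from a binary ROI pixel map.

    pixel_map: 2-D array of 0/1 values, where 1 marks an ROI pixel.
    A block is NROI (0) only if all of its pixels are NROI;
    otherwise it is ROI (1). Image dimensions are assumed to be
    multiples of m.
    """
    h, w = pixel_map.shape
    blocks = pixel_map.reshape(h // m, m, w // m, m)
    # Max over the two within-block axes: 1 if any pixel is ROI.
    return blocks.max(axis=(1, 3))
```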
2.2 Embedding module
We propose a novel iterative approach for the embedding process. The block diagram of the embedding module is shown in Fig. 3. The ROI block map of the cover image is generated based on the strategy discussed in Section 2.1. The watermark is embedded into the NROI blocks of the cover image. Then the NROI and ROI blocks are merged to construct a tentative watermarked image, whose ROI block map is generated again based on the strategy of Section 2.1. A block that is detected as NROI before embedding may be detected as ROI after embedding. If the watermarking process changes the ROI block map, a modified cover image is constructed, and the watermark is embedded into its NROI blocks: the changed versions of the NROI blocks that switched to ROI are placed in the cover image, and from then on they are treated as ROI. We repeat this embedding process on the modified cover image until the ROI block map remains unchanged, yielding the final watermarked image. Thus, we do not need to send the ROI block map to the receiver side with the watermarked image, and the proposed framework is blind.
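A minimal sketch of this iterative loop is shown below. The segmentation CNN is replaced by a toy mean-intensity rule, and the DCT embedding by a unit pixel offset; both are stand-ins for illustration only, not the paper's actual components, chosen to keep the convergence logic self-contained:

```python
import numpy as np

def segment(img, m):
    """Stand-in ROI detector: a block is ROI (1) if its mean intensity
    exceeds 128. The paper uses the CNN of Section 2.1 instead."""
    h, w = img.shape
    means = img.reshape(h // m, m, w // m, m).mean(axis=(1, 3))
    return (means > 128).astype(int)

def embed_nroi(img, block_map, bits, m):
    """Toy embedding: add a unit offset to each NROI block that carries
    a 1-bit. The paper swaps DCT coefficients instead (Algorithm 1)."""
    out = img.copy()
    k = 0
    for bi in range(block_map.shape[0]):
        for bj in range(block_map.shape[1]):
            if block_map[bi, bj] == 0 and k < len(bits):
                if bits[k]:
                    blk = out[bi*m:(bi+1)*m, bj*m:(bj+1)*m]
                    out[bi*m:(bi+1)*m, bj*m:(bj+1)*m] = np.clip(blk + 1, 0, 255)
                k += 1
    return out

def iterative_embed(cover, bits, m, max_iter=20):
    """Repeat embedding until the ROI block map is stable, so that the
    receiver recomputes the identical map from the watermarked image."""
    img = cover.copy()
    for _ in range(max_iter):
        bmap = segment(img, m)
        wm = embed_nroi(img, bmap, bits, m)
        new_map = segment(wm, m)
        if np.array_equal(new_map, bmap):
            return wm  # map unchanged: blind extraction is possible
        # NROI blocks that switched to ROI keep their modified pixels
        # and are treated as ROI in the next pass (Section 2.2).
        switched = (bmap == 0) & (new_map == 1)
        for bi, bj in zip(*np.nonzero(switched)):
            img[bi*m:(bi+1)*m, bj*m:(bj+1)*m] = wm[bi*m:(bi+1)*m, bj*m:(bj+1)*m]
    raise RuntimeError("ROI block map did not converge")
```

The loop terminates once the segmentation of the watermarked image matches the map used for embedding, which is exactly the blindness condition the module relies on.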
We use a simple embedding method in the DCT domain for proof of concept. However, the proposed framework is not limited to this specific domain, and watermarking can be performed in other known transform domains. A pseudo-code of the embedding module is presented in Algorithm 1. In the first step, m × m NROI blocks are detected. Then one bit of watermark is embedded in every NROI block. The embedding process is continued until the whole watermark is embedded. When two DCT coefficients are swapped, we add a constant threshold to secure a minimum distance between the two coefficients. Changing DCT coefficients may lead to under/overflow in the spatial domain, i.e., when they are transformed back to the spatial domain, the pixel values may exceed the valid range [0, 255]. In this situation, the under/overflowed pixels are assigned 0 and 255, respectively.
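The coefficient-swapping step can be sketched as below. This is a simplified reading of Algorithm 1, not the authors' exact code: one bit is carried by the ordering of the coefficient pair c(i, i+1) / c(i+1, i), with th enforcing the minimum gap mentioned above:

```python
import numpy as np

def dct_matrix(m):
    """Orthonormal DCT-II basis matrix (C @ x applies a 1-D DCT)."""
    n = np.arange(m)
    C = np.cos(np.pi * (2 * n[None, :] + 1) * n[:, None] / (2 * m))
    C[0] /= np.sqrt(2)
    return C * np.sqrt(2.0 / m)

def embed_bit(block, bit, i=1, th=0.01):
    """Embed one bit in an m x m NROI block by ordering the DCT
    coefficient pair c(i, i+1) and c(i+1, i)."""
    C = dct_matrix(block.shape[0])
    c = C @ block @ C.T                       # 2-D DCT
    lo, hi = sorted((c[i, i + 1], c[i + 1, i]))
    if hi - lo < th:                          # secure a minimum distance
        hi = lo + th
    # bit 1 -> c(i, i+1) > c(i+1, i); bit 0 -> the opposite ordering.
    c[i, i + 1], c[i + 1, i] = (hi, lo) if bit else (lo, hi)
    out = C.T @ c @ C                         # inverse 2-D DCT
    return np.clip(np.rint(out), 0, 255)      # clip under/overflow pixels

def extract_bit(block, i=1):
    """Blind extraction: compare the order of the two coefficients."""
    C = dct_matrix(block.shape[0])
    c = C @ block @ C.T
    return int(c[i, i + 1] > c[i + 1, i])
```

With a sufficiently large th the ordering survives rounding to integer pixels; a very small threshold trades robustness for lower distortion.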
2.3 Extraction module
In the extraction module, we blindly extract the watermark data from the watermarked image. The block diagram of the extraction module is shown in Fig. 4. The ROI map of the watermarked image is generated based on the strategy discussed in Section 2.1. Because of the iterative embedding, this is exactly the ROI map of the final embedding iteration. The watermark is extracted from the NROI blocks of the watermarked image. The pseudo-code of the extraction module is presented in Algorithm 2. In the first step, the m × m NROI blocks are detected. Then one bit of the watermark is extracted from each NROI block, and the extraction process continues until the whole watermark is extracted.
2.4 Recovery module
The proposed recovery module restores distorted NROI blocks of the cover image to the extent feasible. We introduce a distortion detection classifier that distinguishes NROI blocks altered during the embedding process from original NROI blocks. Detected distorted blocks are then recovered to the closest estimate of the original block. The block diagram of the recovery module is shown in Fig. 5. This module takes the NROI blocks of the watermarked image and the extracted watermark as its inputs and recovers the NROI blocks of the original cover image to the extent feasible. Recovered NROI blocks are merged with ROI blocks to construct the mostly recovered cover image. The pseudo-code for the recovery module is presented in Algorithm 3. In the first step, the m × m NROI blocks that have been distorted by the embedding process are detected. The changed blocks are recovered toward their original state by reversing the embedding operation. Under/overflow pixels are assigned 0 and 255, respectively.
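The reversal step can be sketched as follows, using the same orthonormal DCT as in our simplified coefficient-swap embedding. This illustrates the idea of Algorithm 3 under that simplification; recovery is approximate because the threshold shift and under/overflow clipping of the embedding are irreversible:

```python
import numpy as np

def dct_matrix(m):
    """Orthonormal DCT-II basis matrix."""
    n = np.arange(m)
    C = np.cos(np.pi * (2 * n[None, :] + 1) * n[:, None] / (2 * m))
    C[0] /= np.sqrt(2)
    return C * np.sqrt(2.0 / m)

def recover_block(block, i=1):
    """Undo the coefficient swap on an NROI block that the distortion
    detection classifier has flagged as modified (cf. Algorithm 3)."""
    C = dct_matrix(block.shape[0])
    c = C @ block @ C.T
    c[i, i + 1], c[i + 1, i] = c[i + 1, i], c[i, i + 1]   # swap back
    return np.clip(np.rint(C.T @ c @ C), 0, 255)
```

Since the swap is an involution, applying `recover_block` twice returns the original block up to rounding.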
3 Experimental Results
The proposed framework is implemented in TensorFlow [1] and executed on an NVIDIA GeForce GTX 1080 Ti GPU. The framework is evaluated independently on two datasets, Retina [28] and X-ray Angiography [24], composed of 40 color images of size 584 × 565 and 44 grayscale images of size 512 × 512, respectively. We divided each dataset into equal sets for training and testing. The CNN performing ROI segmentation and the three-layer fully connected neural network performing distortion detection are trained separately on each dataset. We generate binary random sequences to be used as watermark data. For color images, watermark data is embedded in all three color channels; accordingly, the distortion detection classifier is applied to each channel separately to detect the distorted blocks.
To evaluate the embedding method, we use two standard measures, PSNR (Peak Signal-to-Noise Ratio) and SSIM (Structural Similarity Index). We use the Dice score to evaluate the segmentation network and the accuracy metric to evaluate the distortion detection classifier. Definitions of all the metrics are presented in Table 1. In the PSNR and SSIM formulas, w and h are the image dimensions, and I and IW are the cover and watermarked images, respectively. In the SSIM formula, μ and σ2 represent the mean and variance, \( {\sigma}_{I,{I}_w} \) is the covariance between I and IW, and c1 and c2 are constants. In the Dice score and Accuracy measures, TP and FP are cases predicted as positive whose actual labels are positive and negative, respectively; TN and FN are cases predicted as negative whose actual labels are negative and positive, respectively.
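These metrics have standard definitions, which the descriptions above and Table 1 presumably follow; for completeness, the usual forms are:

```latex
\mathrm{PSNR} = 10\log_{10}\frac{255^{2}}{\frac{1}{wh}\sum_{x=1}^{w}\sum_{y=1}^{h}\bigl(I(x,y)-I_{W}(x,y)\bigr)^{2}}
\qquad
\mathrm{SSIM} = \frac{\bigl(2\mu_{I}\mu_{I_{W}}+c_{1}\bigr)\bigl(2\sigma_{I,I_{W}}+c_{2}\bigr)}
                     {\bigl(\mu_{I}^{2}+\mu_{I_{W}}^{2}+c_{1}\bigr)\bigl(\sigma_{I}^{2}+\sigma_{I_{W}}^{2}+c_{2}\bigr)}
```

```latex
\mathrm{Dice} = \frac{2\,TP}{2\,TP+FP+FN}
\qquad
\mathrm{Accuracy} = \frac{TP+TN}{TP+TN+FP+FN}
```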
All the networks are evaluated in Section 3.1. Performance of the whole framework is analyzed in Section 3.2. Finally, we compare BlessMark with state-of-the-art watermarking systems in Section 3.3.
3.1 Independent evaluation of networks
3.1.1 Segmentation network
As discussed in previous sections, the segmentation network is responsible for attaining the ROI pixel map. This network takes a block as input and produces an ROI pixel map for that block. The network input is a grayscale image block; therefore, color images are converted to grayscale, and one ROI map is generated for all channels. A softmax activation function is used for creating a probability map in the last layer. We use a cross-entropy loss function with the stochastic gradient descent (SGD) optimizer for training. Table 2 presents the Dice scores of the trained networks on the two datasets with different block sizes. Two training sets, constructed from 20 Retina images and 22 Angiography images, are used separately for training each network within 150 epochs. The other half of each dataset is used as the test set. As shown in Table 2, the trained networks recognize 73% of the ROI pixels for the Retina test set and 61% for the Angiography test set when the block size is 6 × 6. Dice scores usually increase with larger block sizes. Also, for the same block size, the network trained for Retina reaches a higher Dice score than the one trained for Angiography. As an example, the produced ROI pixel maps for Retina and Angiography are shown in Fig. 6.
3.1.2 Distortion detection network
As discussed in previous sections, the distortion detection network is responsible for detecting the distorted blocks, i.e., the blocks that have been distorted during the embedding process. In this work, we train a three-layer fully connected neural network for distortion detection. The structure of this network is shown in Fig. 7. The network input is a small m × m NROI block. In the first step, the block is flattened by a flatten layer. The two hidden layers are dense layers with m2 nodes each, and each is followed by a ReLU activation function. The final layer is a dense layer with one neuron, followed by a sigmoid activation function. We use Adam with default parameters as the optimizer and cross-entropy as the loss function. The network output represents a classification probability, which is binarized by thresholding at 0.5.
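As an illustration of the architecture in Fig. 7, the forward pass can be written directly with placeholder weights; the actual weights come from training with Adam and cross-entropy, which is not reproduced here:

```python
import numpy as np

def detect_distortion(block, W1, b1, W2, b2, w3, b3):
    """Forward pass of the three-layer detector of Fig. 7:
    flatten -> dense(m^2) + ReLU -> dense(m^2) + ReLU -> dense(1) + sigmoid.
    Returns the binary decision and the underlying probability."""
    x = block.reshape(-1)                        # flatten layer
    h1 = np.maximum(W1 @ x + b1, 0.0)            # dense m^2, ReLU
    h2 = np.maximum(W2 @ h1 + b2, 0.0)           # dense m^2, ReLU
    p = 1.0 / (1.0 + np.exp(-(w3 @ h2 + b3)))    # dense 1, sigmoid
    return p > 0.5, p                            # binarize at 0.5
```

For an m × m block, W1 and W2 are m² × m² weight matrices and w3 is a length-m² vector; the names are hypothetical placeholders for the trained parameters.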
Every block has an intrinsic embedded value. Hence, to construct a training set for the network, we take all the NROI blocks of all channels and invert the intrinsic embedded value inside them. The training set is composed of all the NROI blocks and their inverted versions. Thus, the three-layer fully connected neural network is trained to classify these two sets of blocks for all channels. It is worth mentioning that we can embed different watermark data in different channels; consequently, the classifier conducts independent analyses on separate channels.
The accuracy of the classifier on the two datasets is presented in Table 3. The training sets of the Retina and Angiography datasets are used to train the two networks for 100 epochs. Test sets are constructed in the same manner as the training sets, using the test images of the Retina and Angiography datasets. The index i in Table 3 refers to the two DCT coefficients c(i, i + 1) and c(i + 1, i), which are swapped for embedding. As shown in Table 3, the DCT coefficients swapped for embedding differ across block sizes (6 × 6, 8 × 8, 10 × 10). The parameter th = 0.01 is the threshold used in our embedding algorithm (Algorithm 1). As shown in Table 3, the trained networks detect the distorted blocks with an accuracy of 94% for Retina and 97% for Angiography when the block size is 6 × 6.
3.2 Evaluation of the whole framework
In Table 4, we evaluate our framework on the 20 Retina and 22 Angiography test images in terms of imperceptibility and variations in the ROI map. Since the image is segmented by the CNN, misclassifications are possible, i.e., a pixel may be segmented as NROI by the CNN while it is labeled as ROI in the ground truth, or vice versa. Therefore, we use the ground truth ROI map for calculating the PSNR of the images.
In Table 4, the PSNR of the whole image, as well as the PSNR of the ROI and NROI regions, is presented. For Retina images with block size 6 × 6, the PSNR of the watermarked image NROI is 54.96 dB at the maximum capacity of 0.021 BPP. The NROI imperceptibility is enhanced to 59.81 dB after recovery, i.e., the imperceptibility of the NROI is improved by 4.85 dB. The recovery process is not perfect due to distortion detection classifier errors, irreversible distortions caused by under/overflow clipping in the embedding operation, and switching of segmentation results after embedding. We also report the average percentage of NROI blocks that switch to ROI blocks during the embedding process. The results show that distortions caused by the iterative embedding process are minimal and decrease with larger block sizes.
The boxplots of Fig. 8 show the distribution of PSNR and SSIM for the watermarked images across the Retina and Angiography test sets; each box corresponds to one block size. For block sizes 6 × 6, 8 × 8, and 10 × 10, the maximum experimental capacities are 0.021, 0.010, and 0.006 BPP for the Retina test set and 0.022, 0.012, and 0.007 BPP for the Angiography test set.
In Table 5, the training durations of the segmentation and classifier networks are shown for the Retina training set with a 6 × 6 block size. We used 2 million blocks to train the segmentation network for 150 epochs, while the classifier network was trained on 820,950 blocks for 100 epochs. The segmentation and classification networks are trained in 23 h and 4.5 h, respectively.
In Table 6, the embedding, extraction, and recovery times with and without GPU are evaluated for the Retina test set. Average results for the 20 Retina test images with block size 6 × 6 and capacity 0.021 BPP are reported. The embedding time with GPU is 7 s, and the extraction and recovery time with GPU is 3 s. These times increase to 78 s and 18 s, respectively, without GPU.
In Fig. 9, the visual quality of the embedding results with block size 6 × 6 is shown for two test images. The left column shows the original cover image, the middle column the embedded watermark, and the right column the watermarked image with its corresponding PSNR and SSIM. No distinguishable difference is observed between the watermarked image and the cover image.
3.3 Comparison with State-of-the-art Methods
Properties of various watermarking algorithms [4, 5, 7, 11, 14, 16, 19, 21, 32, 35, 36, 37, 38] are compared with our framework in Table 7. The eight lossless watermarking methods [4, 5, 14, 16, 21, 35, 36, 38] use the whole cover image for embedding, causing distortions across all image regions, including the ROI. Therefore, they are not suitable for medical watermarking, since diagnostic information in the ROI may be corrupted. Also, the methods of [4, 5, 21, 36, 38] are non-blind.
The two non-blind ROI-based watermarking methods [11, 32] send an ROI map with the watermarked image to the receiver side. The two blind ROI-based watermarking methods [7, 37] embed the ROI map inside the watermarked image; hence, extracting the ROI map from the watermarked image is a prerequisite for extracting the watermark data. The ROI-based watermarking method of [19] utilizes the same segmentation method for recognizing the ROI map in the embedding and extraction modules of their system. In our framework, the watermark is embedded in NROI blocks to keep the sensitive ROI information intact during the embedding process, and the ROI map is recognized from the watermarked image in the extraction module by the trained segmentation network.
In Table 8, the imperceptibility and side information of one lossless method [38] are compared with our system. Similar to Table 4, we use the ground truth ROI map for calculating the PSNR of the ROI and NROI. Note that misclassification by the CNN is possible, i.e., a pixel may be segmented as NROI by the CNN while it is labeled as ROI in the ground truth, or vice versa.
The lossless method [38] embeds the watermark into the 1-level IWT of the whole image in two iterations, such that iteration 2 compensates for the distortion produced in iteration 1.
In Table 8, the PSNR and side-information results are calculated over the 20 Retina test images. For a fair comparison, a single watermark is embedded in all methods. The infinity values of PSNR in the table indicate that the original cover image has been completely recovered. The proposed framework does not produce any side information, while the method of [38] produces 16,516.5 bits of side information at a capacity of 0.020 BPP. However, the lossless method [38] can completely recover the original cover image. Since the NROI does not contain valuable information for medical diagnosis, perfect recovery of the NROI is not critical in medical applications.
Figures 2 and 7 show the general structure of the ROI segmentation network and the distortion detection network. Details of layers of these two networks are listed in Table 9.
4 Conclusion
In this paper, we presented BlessMark, a framework for blind diagnostically-lossless watermarking. The proposed watermarking scheme is used as a means for the simultaneous improvement of confidentiality and preservation of diagnostic medical information. BlessMark consists of a deep neural network for image segmentation and one fully connected neural network for classification.
The BlessMark ROI segmentation network generates an ROI map for the embedding, extraction, and recovery modules. The ROI segmentation network is applied within an iterative scheme to accurately produce the same ROI map on both the transmitter and receiver sides. Hence, in the proposed blind watermarking framework, the ROI map is automatically generated on the receiver side without requiring any additional information. The ROI map is critical for improving confidentiality protection in our system, since third parties cannot create it without access to our proprietary segmentation tool. Since the watermark is embedded only in NROI blocks, the ROI remains intact, leading to a diagnostically-lossless watermarking system. The distortion detection classifier used in the recovery module helps detect blocks that have been distorted during the embedding process; distorted blocks are mostly recovered to their original form.
The choice of a simple embedding method in the DCT domain is just a convenient option to prove the concept of the proposed framework. Hence, different embedding methods in other transform domains such as DWT and Hadamard may be applied.
Furthermore, we used a CNN and a three-layer fully connected neural network for the ROI segmentation and detection of distorted NROI blocks. However, other structures can be investigated for these purposes. Our watermarking method is non-robust since the ROI map of the watermarked image can be changed as a result of attacks. Since the ROI map is vital for embedding and extraction, it is necessary to detect the same ROI map on both sides.
We evaluated our framework on the Retina and Angiography datasets. In contrast to the compared lossless watermarking method, the proposed framework produces no side information. Our proposed method is also advantageous over recent methods such as the work of [19]: we offer a general framework, while other research works implement a specific embedding routine.
One future research is to explore the capabilities of the proposed framework by using it as a platform for testing other networks and embedding domains. We are also planning to work on the robustness of the proposed framework such that the watermark can be extracted accurately in the presence of attacks. Furthermore, the new robust framework needs to be compared against more recent watermarking schemes, some of which are reviewed in the introduction section.
References
Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Kudlur M (2016) Tensorflow: a system for large-scale machine learning. In: OSDI, vol 16, pp 265–283
Abdelhakim AM, Abdelhakim M (2018) A time-efficient optimization for robust image watermarking using machine learning. Expert Syst Appl 100:197–210
Anbarjafari G, Ozcinar C (2018) Imperceptible non-blind watermarking and robustness against tone mapping operation attacks for high dynamic range images. Multimed Tools Appl:1–15
Ansari IA, Pant M, Ahn CW (2017) Artificial bee colony optimized robust-reversible image watermarking. Multimed Tools Appl 76(17):18001–18025
Bamal R, Kasana SS (2018) Slantlet based hybrid watermarking technique for medical images. Multimed Tools Appl 77(10):12493–12518
Benrhouma O, Hermassi H, El-Latif AAA, Belghith S (2016) Chaotic watermark for blind forgery detection in images. Multimed Tools Appl 75(14):8695–8718
Chaitanya K, Rao KG (2018) A novel approach to medical image watermarking for tamper detection and recovery of region of interest using predictive coding and hashing. J Theoretical Appl Inf Technol 96(7)
Chao HM, Hsu CM, Miaou SG (2002) A data-hiding technique with authentication, integration, and confidentiality for electronic patient records. IEEE Trans Inf Technol Biomed 6(1):46–53
Chauhan DS, Singh AK, Kumar B, Saini JP (2019) Quantization based multiple medical information watermarking for secure e-health. Multimed Tools Appl 78(4):3911–3923
Cox I, Miller M, Bloom J, Fridrich J, Kalker T (2007) Digital watermarking and steganography. Morgan Kaufmann
Eswaraiah R, Reddy ES (2015) Robust medical image watermarking technique for accurate detection of tampers inside region of interest and recovering original region of interest. IET Image Process 9(8):615–625
Etemad E, Samavi S, Soroushmehr SR, Karimi N, Etemad M, Shirani S, Najarian K (2018) Robust image watermarking scheme using bit-plane of Hadamard coefficients. Multimed Tools Appl 77(2):2033–2055
Heidari M, Samavi S, Soroushmehr SMR, Shirani S, Karimi N, Najarian K (2017) Framework for robust blind image watermarking based on classification of attacks. Multimed Tools Appl 76(22):23459–23479
Hou D, Zhang W, Chen K, Lin SJ, Yu N (2018) Reversible data hiding in color image with grayscale invariance. IEEE Transactions on Circuits and Systems for Video Technology 29(2):363–374
Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: CVPR, vol 1, no 2, p 3
Jaiswal SP, Au OC, Jakhetiya V, Guo Y, Tiwari AK, Yue K (2013) Efficient adaptive prediction based reversible image watermarking. In: Image processing (ICIP), 2013 20th IEEE international conference on. IEEE, pp 4540–4544
Jane O, Elbaşi E (2014) Hybrid non-blind watermarking based on DWT and SVD. Journal of applied research and technology 12(4):750–761
Kandi H, Mishra D, Gorthi SRS (2017) Exploring the learning capabilities of convolutional neural networks for robust image watermarking. Computers & Security 65:247–268
Kelkar V, Tuckley K, Nemade H (2017) Novel variants of a histogram shift-based reversible watermarking technique for medical images to improve hiding capacity. J Healthcare Eng
Kim WH, Hou JU, Mun SM, Lee HK (2018) Convolutional neural network architecture for recovering watermark synchronization. arXiv preprint arXiv:1805.06199
Liu X, Lou J, Fang H, Chen Y, Ouyang P, Wang Y, Wang L (2019) A novel robust reversible watermarking scheme for protecting authenticity and integrity of medical images. IEEE Access 7:76580–76598
Mehta R, Rajpal N, Vishwakarma VP (2017) A robust and efficient image watermarking scheme based on Lagrangian SVR and lifting wavelet transform. Int J Mach Learn Cybern 8(2):379–395
Naik K, Trivedy S, Pal AK (2018) An IWT based blind and robust image watermarking scheme using secret key matrix. Multimed Tools Appl 77(11):13721–13752
Nasr-Esfahani E, Karimi N, Jafari MH, Soroushmehr SMR, Samavi S, Nallamothu BK, Najarian K (2018) Segmentation of vessels in angiograms using convolutional neural networks. Biomedical Signal Processing and Control 40:240–251
Parah SA, Ahad F, Sheikh JA, Bhat GM (2017) Hiding clinical information in medical images: a new high capacity and reversible data hiding technique. J Biomed Inform 66:214–230
Parah SA, Sheikh JA, Ahad F, Loan NA, Bhat GM (2017) Information hiding in medical images: a robust medical image watermarking system for E-healthcare. Multimed Tools Appl 76(8):10599–10633
Ren S, He K, Girshick R, Sun J (2017) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 39(6):1137–1149
Retina-unet [Online]. Available: https://github.com/orobix/retina-unet
Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. In: International conference on medical image computing and computer-assisted intervention. Springer, Cham, pp 234–241
Sadreazami H, Amini M (2018) A robust image watermarking scheme using local statistical distribution in the Contourlet domain. IEEE Transactions on Circuits and Systems II: Express Briefs
Savakar DG, Ghuli A (2017) Non-blind digital watermarking with enhanced image embedding capacity using DMeyer wavelet decomposition, SVD, and DFT. Pattern Recognition and Image Analysis 27(3):511–517
Shih FY, Zhong X, Chang IC, Satoh SI (2018) An adjustable-purpose image watermarking technique by particle swarm optimization. Multimed Tools Appl 77(2):1623–1642
Singh RP, Dabas N, Chaudhary V (2016) Online sequential extreme learning machine for watermarking in DWT domain. Neurocomputing 174:238–249
Thanki R, Borra S (2019) Fragile watermarking for copyright authentication and tamper detection of medical images using compressive sensing (CS) based encryption and contourlet domain processing. Multimed Tools Appl 78(10):13905–13924
Tian J (2002) Reversible watermarking by difference expansion. In: Proceedings of workshop on multimedia and security, vol 19
Turuk MP, Dhande AP (2016) A novel reversible multiple medical image watermarking for health information system. J Med Syst 40(12):269
Yang Y, Zhang W, Liang D, Yu N (2018) A ROI-based high capacity reversible data hiding scheme with contrast enhancement for medical images. Multimed Tools Appl 77(14):18043–18065
Zarrabi H, Hajabdollahi M, Soroushmehr SMR, Karimi N, Samavi S, Najarian K (2018) Reversible image watermarking for health informatics systems using distortion compensation in wavelet domain. In: Annual International Conference of the IEEE Engineering in Medicine and Biology Society
Zheng Z, Zheng L, Yang Y (2017) A discriminatively learned CNN embedding for person re-identification. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 14(1):13
Zarrabi, H., Emami, A., Khadivi, P. et al. BlessMark: a blind diagnostically-lossless watermarking framework for medical applications based on deep neural networks. Multimed Tools Appl 79, 22473–22495 (2020). https://doi.org/10.1007/s11042-020-08698-9