Deformable MR-CT image registration using an unsupervised, dual-channel network for neurosurgical guidance
Introduction
Navigation relative to preoperative 3D imaging is prevalent in a wide spectrum of neurosurgical treatments, including tumor biopsy (Oppido et al., 2011), cyst resection (Sribnick et al., 2014), hydrocephalus (Spennato et al., 2007), and deep brain stimulation (Groiss et al., 2009; Laxton et al., 2010). Such surgeries are commonly performed through a cranial burr hole and/or endoscopically via the lateral or third ventricles for access to deep brain structures. Magnetic resonance (MR) imaging (commonly T1-weighted MR) offers clear delineation of white and gray matter, cerebrospinal fluid (CSF), and subcortical structures and is the basis for preoperative planning (e.g., segmentation of the target, eloquent brain, and vessels as well as definition of desired electrode trajectories). Intraoperative CT provides high-resolution visualization of bone and instrumentation during surgery but offers limited soft-tissue contrast.
Even with a minimally invasive approach, deep brain deformations induced by egress of CSF and introduction of instrumentation present a challenge to accurate navigation. Conventional neuro-navigation using stereotactic frames and rigid registration between preoperative MR and intraoperative CT does not address such nonrigid motion, and deformation of deep brain targets of up to 10 mm (Nowell et al., 2014) is associated with inaccurate targeting and device placement (Nabavi et al., 2001). Deformable registration solves for the nonlinear transformation that establishes anatomical correspondence between MR and CT images and carries preoperative planning into the intraoperative coordinates. A number of methods (Denis de Senneville et al., 2016; Han et al., 2018; Modat et al., 2010; Reaungamornrat et al., 2016; Rueckert et al., 2006) solve multi-modality deformable registration via iterative numerical optimization, but their high computational load tends to carry long runtimes that limit utility within the intraoperative workflow. Recent deep learning-based registration methods demonstrate robustness and fast runtime compared to conventional methods, making them important candidates for further development and translation to clinical application.
Deep learning-based deformable registration methods often use convolutional neural networks (CNNs) to predict either a set of deformation parameters or a full deformation field. Depending on the type of annotation available in training data, deep learning registration approaches can be broadly categorized as:
- (i)
Supervised learning. Supervised learning requires the training dataset to include ground-truth deformation fields. Since the performance of registration depends on the quality of the ground-truth definition, this approach can be limited by the accuracy of the conventional registration used to obtain ground truth (Cao et al., 2018b; Sokooti et al., 2017; Yang et al., 2017). Alternatively, ground-truth can be defined via simulated deformations (Eppenhof and Pluim, 2019; Mahapatra et al., 2018; Sun et al., 2018).
- (ii)
Weakly supervised learning. Weakly supervised learning methods perform optimization on image surrogates, such as segmentation maps or landmarks. For example, Hu et al. (2018) and Xu and Niethammer (2019) demonstrated networks trained to maximize the alignment between tissue labels. Alternatively, Blendowski et al. (2020) used a shape encoder-decoder network to extract cardiac shape representations as a basis for registration. The time-consuming nature of tissue labeling and the dependence of the resulting network's performance on the accuracy of those labels are well recognized.
- (iii)
Unsupervised learning. To overcome the limitations of supervised and weakly supervised learning, unsupervised learning methods have been developed that learn to minimize a loss between the fixed and registered images. Loss functions are often based on either similarity metrics such as sum of squared differences (SSD) and normalized cross-correlation (NCC) (Balakrishnan et al., 2018; Cao et al., 2018a; Dalca et al., 2018), or neural network-based “deep metrics” (Haskins et al., 2019; Niethammer et al., 2019).
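For intuition, the two classical similarity terms can be sketched in a few lines of NumPy. This is a minimal illustration of their global forms (VoxelMorph and related methods typically use a locally windowed NCC; the function names here are illustrative, not from the paper):

```python
import numpy as np

def ncc(fixed, warped, eps=1e-8):
    """Global normalized cross-correlation between two images.

    Returns a value in [-1, 1]; an unsupervised registration network
    would minimize the negative of this as its similarity loss.
    """
    f = fixed - fixed.mean()
    w = warped - warped.mean()
    return float((f * w).sum() / (np.sqrt((f * f).sum() * (w * w).sum()) + eps))

def ssd(fixed, warped):
    """Sum of squared differences -- the simplest mono-modality loss."""
    return float(((fixed - warped) ** 2).sum())

# a perfectly registered pair correlates perfectly
img = np.random.rand(8, 8, 8)
print(round(ncc(img, img), 4))  # -> 1.0
print(ssd(img, img))            # -> 0.0
```

Note that both metrics assume the two images share an intensity relationship, which is why they suit mono-modality registration but fail directly across MR and CT.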
While a considerable amount of previous work has focused on deep learning-based deformable registration within a single imaging modality (e.g., MR-to-MR registration), multi-modality registration presents a challenging problem. Multi-modality registration commonly relies on some degree of supervision, either ground-truth deformation fields or labeled landmarks/segmentations. Unsupervised, multi-modality deformable registration approaches have been demonstrated that optimize a multi-modality similarity metric, such as mutual information (MI) (Che et al., 2019; Guo, 2019). Such metrics, however, can be insensitive to local spatial information, which can diminish registration accuracy compared to mono-modality metrics.
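The insensitivity of MI to local spatial information follows from its construction: it is estimated from the global joint intensity histogram, so any rearrangement of voxels that preserves that histogram leaves MI unchanged. A hypothetical histogram-based sketch:

```python
import numpy as np

def mutual_information(a, b, bins=32):
    """Mutual information estimated from a joint intensity histogram.

    Robust across modalities, but computed from global intensity
    statistics only -- it carries no local spatial information.
    """
    joint, _, _ = np.histogram2d(a.ravel(), b.ravel(), bins=bins)
    pxy = joint / joint.sum()
    px = pxy.sum(axis=1)          # marginal of a
    py = pxy.sum(axis=0)          # marginal of b
    nz = pxy > 0                  # avoid log(0)
    return float((pxy[nz] * np.log(pxy[nz] / np.outer(px, py)[nz])).sum())

rng = np.random.default_rng(0)
mr = rng.random(10000)
ct = 1.0 - mr                     # deterministically related "modalities"
noise = rng.random(10000)         # unrelated intensities
print(mutual_information(mr, ct) > mutual_information(mr, noise))  # -> True
```

The functional, intensity-to-intensity relationship between the simulated "modalities" yields high MI despite the inverted contrast, which is exactly why MI is the default choice for multi-modality registration.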
To mitigate challenges associated with multi-modality similarity metrics, a popular approach is to convert multi-modality registration to mono-modality registration via image synthesis, allowing optimization according to a mono-modality metric. For example, Liu et al. (2019), Tanner et al. (2018), Wei et al. (2019), and Yang et al. (2020a, 2020b) used Generative Adversarial Networks (GANs) to generate synthetic CT from MR images and perform mono-modality CT registration. Similarly, Xu et al. (2020) further fused the multi-modality MR-CT and mono-modality CT registration into a single prediction. Such methods, however, use only MR-to-CT synthesis, omitting the inverse (CT-to-MR) direction. Alternatively, Qin et al. (2019) used disentangled networks to decouple images into shape and appearance representations, and mono-modality registration was performed on the resulting shape representations.
The method reported below extends previous work using image synthesis for unsupervised, multi-modality MR-CT registration. Inspired by multi-modality and mono-modality fusion (Xu et al., 2020) and multi-channel registration (Chen et al., 2017; Fan et al., 2019), this work utilizes MR-CT synthesis to reduce the problem to two mono-modality registrations in the MR and CT domains, which are subsequently fused for the final estimate of the deformation. Contributions of this work include:
- (i)
A novel unsupervised, deformable registration network is proposed for MR-CT registration to provide guidance in minimally invasive neurosurgery. The network contains two subnetworks: (1) an image synthesis subnetwork to generate synthetic MR/CT images from the input image pairs; and (2) a dual-channel registration subnetwork that predicts the deformations in MR and CT channels and fuses the two into a final diffeomorphic deformation field.
- (ii)
The image synthesis subnetwork implements a novel probabilistic CycleGAN that generates both the synthetic images and the associated uncertainty. Instead of global averaging of the dual-channel registration loss functions as in conventional dual-channel registration (Chen et al., 2017), the uncertainties are used to provide a principled, spatially varying weighting of the dual channels.
- (iii)
An end-to-end training strategy is employed to jointly optimize image synthesis and registration subnetworks, which guides the synthesis subnetwork in generating intermediate representations that are advantageous to the task of deformable registration.
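The spatially varying weighting in contribution (ii) can be sketched as inverse-variance fusion of the two channel residuals. This is an illustrative reimplementation under assumed conventions (inverse-variance weights, hypothetical array names), not the paper's exact loss:

```python
import numpy as np

def fused_loss(res_mr, res_ct, var_mr, var_ct):
    """Spatially varying fusion of MR- and CT-channel residuals.

    res_*: per-voxel registration residuals (e.g., squared differences).
    var_*: per-voxel synthesis uncertainty (aleatoric variance) maps.
    Voxels where a synthetic image is unreliable (high variance)
    contribute less in that channel -- unlike a global average,
    which weights both channels equally everywhere.
    """
    w_mr = 1.0 / (var_mr + 1e-6)
    w_ct = 1.0 / (var_ct + 1e-6)
    per_voxel = (w_mr * res_mr + w_ct * res_ct) / (w_mr + w_ct)
    return float(per_voxel.mean())

res_mr = np.full((4, 4, 4), 2.0)
res_ct = np.full((4, 4, 4), 6.0)
same = fused_loss(res_mr, res_ct, np.ones_like(res_mr), np.ones_like(res_ct))
ct_unreliable = fused_loss(res_mr, res_ct, np.ones_like(res_mr),
                           1e6 * np.ones_like(res_ct))
print(round(same, 3))           # -> 4.0 (equal weighting = global average)
print(round(ct_unreliable, 3))  # -> 2.0 (falls back to the MR channel)
```

With equal variances the fusion reduces to the conventional global average; as one channel's synthesis uncertainty grows, the loss degrades gracefully toward the reliable channel.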
The paper is organized as follows: in Section 2, the details of the proposed method are described along with an end-to-end training strategy; Sections 3 and 4 present the experimental methods, ablation studies (variations of the algorithm with and without dual-channel fusion and uncertainty weighting), and results comparing the proposed method to two baseline algorithms (symmetric normalization (Avants et al., 2008) and VoxelMorph (Balakrishnan et al., 2018)); and Section 5 demonstrates the effects of dual-channel fusion and end-to-end training. The proposed deformable registration method is tested on a spectrum of datasets, including datasets with a broad variety of simulated deformations, real deformations from longitudinal studies acquired over long time intervals, and real deformations induced by neurosurgical intervention.
Section snippets
Algorithmic methods
An unsupervised deformable registration framework is proposed for registering preoperative MR images to intraoperative CT images. Let m be the moving preoperative MR image and f be the fixed intraoperative CT image, defined over a 3D spatial domain Ω. The two images are first rigidly aligned (alternatively, affine registration if scaling or skew difference is observed) as a preprocessing initialization step, such that the network only learns the nonlinear local deformation – an essential
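The composition of a rigid initialization with a learned residual displacement can be sketched as follows (hypothetical shapes and names; nearest-neighbor resampling for brevity, where a real implementation would interpolate trilinearly):

```python
import numpy as np

def warp(moving, rigid, disp):
    """Resample `moving` through a rigid pre-alignment composed with a
    learned local displacement field.

    rigid: 3x4 matrix [R | t] from the preprocessing initialization.
    disp:  per-voxel displacement of shape (3, D, H, W); only this
           residual, nonlinear part has to be learned by the network.
    """
    shape = moving.shape
    grid = np.stack(np.meshgrid(*[np.arange(s) for s in shape],
                                indexing="ij"), axis=0).astype(float)
    # rigid part: x' = R x + t, applied to every voxel coordinate
    coords = np.einsum("ij,jdhw->idhw", rigid[:, :3], grid) \
             + rigid[:, 3, None, None, None]
    coords = coords + disp                 # nonlinear residual on top
    idx = np.clip(np.rint(coords).astype(int), 0,
                  np.array(shape)[:, None, None, None] - 1)
    return moving[idx[0], idx[1], idx[2]]

# identity rigid transform + zero displacement leaves the image unchanged
vol = np.random.rand(5, 5, 5)
identity = np.hstack([np.eye(3), np.zeros((3, 1))])
out = warp(vol, identity, np.zeros((3, 5, 5, 5)))
print(np.allclose(out, vol))  # -> True
```

Factoring the transformation this way keeps the global pose out of the learning problem, so the network's output space is restricted to local deformations near the identity.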
Image datasets
Three datasets were used in training, validation, and testing of the proposed method. The first dataset contained 50 paired T1-weighted MR and CT images acquired on the same day without neuro-intervention or evidence of deformation. These images were used to create a large dataset with simulated deformations as detailed below. A second dataset consisted of 9 MR images with the same MR scan protocols as the first dataset along with 9 corresponding CT images with real deformations that were
MR-CT image synthesis
The performance of the intermediate MR-CT synthesis from probabilistic CycleGAN was first examined. A series of CycleGAN models were trained in this work according to Table 1, including sequential (SEQ) training and the end-to-end (E2E) training variations (E2E:CT, E2E:MR, and E2E:2CH+U). This section details the results from SEQ training, providing a baseline evaluation for the case in which probabilistic CycleGAN is trained on its own (separate from registration). Performance comparison of
Single-channel vs. dual-channel registration
The performance of the dual-channel registration with uncertainty weighting compared to the ablation variations (SEQ:CT, SEQ:MR, SEQ:2CH, and SEQ:2CH+U) demonstrated several findings with respect to the effects of MR, CT, and the combination of MR and CT on deformable image registration. First, using the MR channel alone (SEQ:MR) showed higher performance than using the CT channel alone (SEQ:CT) except for registration of the lateral ventricles, where comparable performance was achieved from
Conclusions
An unsupervised, dual-channel network for MR-CT deformable registration was reported. The method uses a probabilistic CycleGAN for MR-CT image synthesis and a dual-channel registration to predict and fuse the deformation field in both MR and CT channels. The image synthesis uncertainties, a representation of the aleatoric uncertainty, are used as spatially varying weights to balance the contributions of the MR and CT channel registration loss functions. In addition to a conventional sequential
CRediT authorship contribution statement
R. Han: Conceptualization, Methodology, Software, Investigation, Writing – original draft. C.K. Jones: Validation, Writing – review & editing. J. Lee: Data curation, Supervision, Validation, Writing – review & editing. P. Wu: Resources, Writing – review & editing. P. Vagdargi: Resources, Writing – review & editing. A. Uneri: Resources, Writing – review & editing. P.A. Helm: Supervision. M. Luciano: Supervision. W.S. Anderson: Validation, Supervision. J.H. Siewerdsen: Supervision, Writing –
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgment
This research was supported by NIH Grant U01-NS-107133 and academic-industry partnership with Medtronic Inc. (Littleton, MA).
References (56)
- et al., Symmetric diffeomorphic image registration with cross-correlation: evaluating automated labeling of elderly and neurodegenerative brain, Med. Image Anal. (2008)
- et al., Cross contrast multi-channel image registration using image synthesis for MR brain images, Med. Image Anal. (2017)
- et al., A deep learning framework for unsupervised affine and deformable image registration, Med. Image Anal. (2019)
- et al., BIRNet: brain image registration using dual-supervised fully convolutional networks, Med. Image Anal. (2019)
- et al., 3D Slicer as an image computing platform for the quantitative imaging network, Magn. Reson. Imaging (2012)
- et al., Weakly-supervised convolutional neural networks for multimodal image registration, Med. Image Anal. (2018)
- et al., Evaluation of 14 nonlinear deformation algorithms applied to human brain MRI registration, Neuroimage (2009)
- et al., Robust whole-brain segmentation: application to traumatic brain injury, Med. Image Anal. (2015)
- et al., Exploring uncertainty measures in deep networks for multiple sclerosis lesion detection and segmentation, Med. Image Anal. (2020)
- et al., Neuroendoscopic colloid cyst resection: a case cohort with follow-up and patient satisfaction, World Neurosurg. (2014)
- Quicksilver: fast predictive image registration – a deep learning approach, Neuroimage
- A log-Euclidean framework for statistics on diffeomorphisms, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
- VoxelMorph: a learning framework for deformable medical image registration, IEEE Trans. Med. Imaging
- Multimodal 3D medical image registration guided by shape encoder–decoder networks, Int. J. Comput. Assist. Radiol. Surg.
- Deep learning based inter-modality image registration supervised by intra-modality similarity, MLMI 2018: Machine Learning in Medical Imaging
- Deformable image registration using a cue-aware deep regression network, IEEE Trans. Biomed. Eng.
- Deep group-wise registration for multi-spectral images from fundus images, IEEE Access
- Unsupervised learning for fast probabilistic diffeomorphic registration
- EVolution: an edge-based variational method for non-rigid multi-modal image registration, Phys. Med. Biol.
- Pulmonary CT registration through supervised learning with convolutional neural networks, IEEE Trans. Med. Imaging
- Deep brain stimulation in Parkinson's disease, Ther. Adv. Neurol. Disord.
- Multi-Modal Image Registration With Unsupervised Deep Learning
- Deformable MR-CT image registration using an unsupervised synthesis and registration network for neuro-endoscopic surgery, Medical Imaging 2021: Image-Guided Procedures
- Learning deep similarity metric for 3D MR–TRUS image registration, Int. J. Comput. Assist. Radiol. Surg.
- Difficulty-aware hierarchical convolutional neural networks for deformable registration of brain MR images, Med. Image Anal.
- Image-to-image translation with conditional adversarial networks, Proceedings – 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017
Cited by (22)
- Deformable registration of preoperative MR and intraoperative long-length tomosynthesis images for guidance of spine surgery via image synthesis, Computerized Medical Imaging and Graphics (2024)
- Real-time motion management in MRI-guided radiotherapy: current status and AI-enabled prospects, Radiotherapy and Oncology (2024)
- Few-shot multi-modal registration with mono-modal knowledge transfer, Biomedical Signal Processing and Control (2023)
- NCCT-CECT image synthesizers and their application to pulmonary vessel segmentation, Computer Methods and Programs in Biomedicine (2023)
- QACL: Quartet attention aware closed-loop learning for abdominal MR-to-CT synthesis via simultaneous registration, Medical Image Analysis (2023). Citation excerpt: "To this end, if sufficient paired abdominal MR-CT images are available, it would be more valuable to overcome the challenges in the supervised mode. Recently, some innovative studies have used the MR-to-CT synthesis to convert the MR-CT registration into pCT-CT registration (Han et al., 2022; Cao et al., 2017; Fu et al., 2020a; McKenzie et al., 2020; Wei et al., 2020) for neurosurgical guidance. For example, Wei et al. (2020) first used a CycleGAN model with a mutual information constraint to generate the pCT images, and then the registration of MR and intra-procedural CT images was carried out using a classical tool of the ANTS software and an unsupervised registration network."
- CDFRegNet: a cross-domain fusion registration network for CT-to-CBCT image registration, Computer Methods and Programs in Biomedicine (2022). Citation excerpt: "However, Liang et al. [34] only synthesized images in a single direction (CBCT-to-CT), and the inverse direction was omitted. Han et al. [36] proposed probabilistic CycleGAN for multi-modal diffeomorphic registration, in which bi-direction image synthesis and dual-channel registration network were employed. These domain-translation-based methods did reduce the difficulty of CT-CBCT registration by image synthesis, as the intensity of the synthetic CT is more similar to CT than CBCT."