当前位置: X-MOL 学术Phys. Med. Biol. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
A comparison of methods for fully automatic segmentation of tumors and involved nodes in PET/CT of head and neck cancers
Physics in Medicine & Biology ( IF 3.5 ) Pub Date : 2021-03-04 , DOI: 10.1088/1361-6560/abe553
Aurora Rosvoll Groendahl 1 , Ingerid Skjei Knudtsen , Bao Ngoc Huynh , Martine Mulstad , Yngve Mardal Moe , Franziska Knuth , Oliver Tomic , Ulf Geir Indahl , Turid Torheim , Einar Dale , Eirik Malinen , Cecilia Marie Futsaether
Affiliation  

Target volume delineation is a vital but time-consuming and challenging part of radiotherapy, where the goal is to deliver sufficient dose to the target while reducing risks of side effects. For head and neck cancer (HNC) this is complicated by the complex anatomy of the head and neck region and the proximity of target volumes to organs at risk. The purpose of this study was to compare and evaluate conventional PET thresholding methods, six classical machine learning algorithms and a 2D U-Net convolutional neural network (CNN) for automatic gross tumor volume (GTV) segmentation of HNC in PET/CT images. For the latter two approaches the impact of single versus multimodality input on segmentation quality was also assessed. 197 patients were included in the study. The cohort was split into training and test sets (157 and 40 patients, respectively). Five-fold cross-validation was used on the training set for model comparison and selection. Manual GTV delineations represented the ground truth. Tresholding, classical machine learning and CNN segmentation models were ranked separately according to the cross-validation Srensen–Dice similarity coefficient (Dice). PET thresholding gave a maximum mean Dice of 0.62, whereas classical machine learning resulted in maximum mean Dice scores of 0.24 (CT) and 0.66 (PET; PET/CT). CNN models obtained maximum mean Dice scores of 0.66 (CT), 0.68 (PET) and 0.74 (PET/CT). The difference in cross-validation Dice between multimodality PET/CT and single modality CNN models was significant (p≤0.0001). The top-ranked PET/CT-based CNN model outperformed the best-performing thresholding and classical machine learning models, giving significantly better segmentations in terms of cross-validation and test set Dice, true positive rate, positive predictive value and surface distance-based metrics (p≤0.0001). Thus, deep learning based on multimodality PET/CT input resulted in superior target coverage and less inclusion of surrounding normal tissue.



中文翻译:

头颈癌PET/CT肿瘤及受累淋巴结全自动分割方法比较

目标体积描绘是放射治疗中一个重要但耗时且具有挑战性的部分,其目标是向目标提供足够的剂量,同时降低副作用的风险。对于头颈癌 (HNC),头颈部区域的复杂解剖结构以及目标体积与风险器官的接近度使情况变得复杂。本研究的目的是比较和评估传统的 PET 阈值方法、六种经典机器学习算法和 2D U-Net 卷积神经网络 (CNN),用于 PET/CT 图像中 HNC 的自动大体肿瘤体积 (GTV) 分割。对于后两种方法,还评估了单模态输入与多模态输入对分割质量的影响。197 名患者被纳入研究。该队列分为训练集和测试集(分别为 157 名和 40 名患者)。在训练集上使用五折交叉验证进行模型比较和选择。手动 GTV 描述代表了基本事实。根据交叉验证 Srensen–Dice 相似系数(骰子)。PET 阈值给出的最大平均Dice值为 0.62,而经典机器学习导致最大平均Dice分数为 0.24 (CT) 和 0.66 (PET; PET/CT)。CNN 模型获得的最大平均Dice分数为 0.66 (CT)、0.68 (PET) 和 0.74 (PET/CT)。多模态 PET/CT 和单模态 CNN 模型之间的交叉验证Dice差异显着 ( p ≤ 0.0001)。排名靠前的基于 PET/CT 的 CNN 模型优于性能最佳的阈值和经典机器学习模型,在交叉验证和测试集Dice方面提供了更好的分割、真阳性率、阳性预测值和基于表面距离的指标 ( p ≤0.0001)。因此,基于多模态 PET/CT 输入的深度学习导致更好的目标覆盖和更少的周围正常组织包含。

更新日期:2021-03-04
down
wechat
bug