VINet: A Visually Interpretable Image Diagnosis Network,IEEE Transactions on Multimedia

当前位置： X-MOL 学术 › IEEE Trans. Multimedia › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

VINet: A Visually Interpretable Image Diagnosis Network
IEEE Transactions on Multimedia ( IF 7.3 ) Pub Date : 2020-07-01 , DOI: 10.1109/tmm.2020.2971170
Donghao Gu , Yaowei Li , Feng Jiang , Zhaojing Wen , Shaohui Liu , Wuzhen Shi , Guangming Lu , Changsheng Zhou

Recently, due to the black box characteristics of deep learning techniques, the deep network-based computer-aided diagnosis (CADx) systems have encountered many difficulties in practical applications. The crux of the problem is that these models should be explainable the model should give doctors rationales that can explain the diagnosis. In this paper, we propose a visually interpretable network (VINet) which can generate diagnostic visual interpretations while making accurate diagnoses. VINet is an end-to-end model consisting of an importance estimation network and a classification network. The former produces a diagnostic visual interpretation for each case, and the classifier diagnoses the case. In the classifier, by exploring the information in the diagnostic visual interpretation, the irrelevant information in the feature maps is eliminated by our proposed feature destruction process. This allows the classification network to concentrate on the important features and use them as the primary references for classification. Through a joint optimization of higher classification accuracy and eliminating as many irrelevant features as possible, a precise, fine-grained diagnostic visual interpretation, along with an accurate diagnosis, can be produced by our proposed network simultaneously. Based on a computed tomography image dataset (LUNA16) on pulmonary nodule, extensive experiments have been conducted, demonstrating that the proposed VINet can produce state-of-the-art diagnostic visual interpretations compared with all baseline methods.

中文翻译：

VINet：视觉上可解释的图像诊断网络

近年来，由于深度学习技术的黑匣子特性，基于深度网络的计算机辅助诊断（CADx）系统在实际应用中遇到了很多困难。问题的关键是这些模型应该是可解释的，模型应该给医生提供可以解释诊断的基本原理。在本文中，我们提出了一种视觉可解释网络（VINet），它可以在进行准确诊断的同时生成诊断性视觉解释。VINet 是一个端到端的模型，由重要性估计网络和分类网络组成。前者为每个病例产生诊断视觉解释，分类器诊断病例。在分类器中，通过探索诊断视觉解释中的信息，我们提出的特征破坏过程消除了特征图中不相关的信息。这使得分类网络可以专注于重要的特征，并将它们用作分类的主要参考。通过对更高分类精度的联合优化和尽可能多地消除不相关的特征，我们提出的网络可以同时产生精确、细粒度的诊断视觉解释以及准确的诊断。基于肺结节的计算机断层扫描图像数据集（LUNA16），已经进行了大量实验，表明与所有基线方法相比，所提出的 VINet 可以产生最先进的诊断视觉解释。这使得分类网络可以专注于重要的特征，并将它们用作分类的主要参考。通过对更高分类精度的联合优化和尽可能多地消除不相关的特征，我们提出的网络可以同时产生精确、细粒度的诊断视觉解释以及准确的诊断。基于肺结节的计算机断层扫描图像数据集（LUNA16），已经进行了大量实验，表明与所有基线方法相比，所提出的 VINet 可以产生最先进的诊断视觉解释。这使得分类网络可以专注于重要的特征，并将它们用作分类的主要参考。通过对更高分类精度的联合优化和尽可能多地消除不相关的特征，我们提出的网络可以同时产生精确、细粒度的诊断视觉解释以及准确的诊断。基于肺结节的计算机断层扫描图像数据集（LUNA16），已经进行了大量实验，表明与所有基线方法相比，所提出的 VINet 可以产生最先进的诊断视觉解释。通过对更高分类精度的联合优化和尽可能多地消除不相关的特征，我们提出的网络可以同时产生精确、细粒度的诊断视觉解释以及准确的诊断。基于肺结节的计算机断层扫描图像数据集（LUNA16），已经进行了大量实验，表明与所有基线方法相比，所提出的 VINet 可以产生最先进的诊断视觉解释。通过对更高分类精度的联合优化和尽可能多地消除不相关的特征，我们提出的网络可以同时产生精确、细粒度的诊断视觉解释以及准确的诊断。基于肺结节的计算机断层扫描图像数据集（LUNA16），已经进行了大量实验，表明与所有基线方法相比，所提出的 VINet 可以产生最先进的诊断视觉解释。

更新日期：2020-07-01

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>