当前位置: X-MOL 学术Pattern Recogn. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Automatic medical image interpretation: State of the art and future directions
Pattern Recognition ( IF 8 ) Pub Date : 2021-01-29 , DOI: 10.1016/j.patcog.2021.107856
Hareem Ayesha , Sajid Iqbal , Mehreen Tariq , Muhammad Abrar , Muhammad Sanaullah , Ishaq Abbas , Amjad Rehman , Muhammad Farooq Khan Niazi , Shafiq Hussain

Automatic Natural language interpretation of medical images is an emerging field of Artificial Intelligence (AI). The task combines two fields of AI; computer vision and natural language processing. This is a challenging task that goes beyond object detection, segmentation, and classification because it also requires the understanding of the relationship between different objects of an image and the actions performed by these objects as visual representations. Image interpretation is helpful in many tasks like helping visually impaired persons, information retrieval, early childhood learning, producing human like natural interaction between robots, and many more applications. Recently this work fascinated researchers to use the same approach by using more complex biomedical images. It has been applied from generating single sentence captions to multi sentence paragraph descriptions. Medical image captioning can assist and speed up the diagnosis process of medical professionals and generated report can be used for many further tasks. This is a comprehensive review of recent years’ research of medical image captioning published in different international conferences and journals. Their common parameters are extracted to compare their methods, performance, strengths, limitations, and our recommendations are discussed. Further publicly available datasets and evaluation measures used for deep-learning based captioning of medical images are also discussed.



中文翻译:

自动医学图像解释:最新技术和未来方向

医学图像的自动自然语言解释是人工智能(AI)的新兴领域。该任务结合了AI的两个领域:计算机视觉和自然语言处理。这是一项具有挑战性的任务,超出了对象检测,分割和分类的范围,因为它还需要了解图像的不同对象之间的关系以及这些对象作为视觉表示所执行的操作。图像解释在许多任务中很有帮助,例如帮助视力障碍者,信息检索,儿童早期学习,在机器人之间产生类似于人类的自然互动以及更多应用程序。最近,这项工作使研究人员着迷于通过使用更复杂的生物医学图像来使用相同的方法。它已从生成单句标题应用于多句段落描述。医学图像字幕可以帮助并加快医学专业人员的诊断过程,并且生成的报告可以用于许多其他任务。这是对近年来在不同国际会议和期刊上发表的医学图像字幕研究的全面综述。提取它们的常用参数以比较它们的方法,性能,优势,局限性,并讨论我们的建议。还讨论了用于基于深度学习的医学图像字幕的其他公共可用数据集和评估措施。医学图像字幕可以帮助并加快医学专业人员的诊断过程,并且生成的报告可以用于许多其他任务。这是对近年来在不同国际会议和期刊上发表的医学图像字幕研究的全面综述。提取它们的常用参数以比较它们的方法,性能,优势,局限性,并讨论我们的建议。还讨论了用于基于深度学习的医学图像字幕的其他公共可用数据集和评估措施。医学图像字幕可以帮助并加快医学专业人员的诊断过程,并且生成的报告可以用于许多其他任务。这是对近年来在不同国际会议和期刊上发表的医学图像字幕研究的全面综述。提取它们的常用参数以比较它们的方法,性能,优势,局限性,并讨论我们的建议。还讨论了用于基于深度学习的医学图像字幕的其他公共可用数据集和评估措施。和我们的建议进行了讨论。还讨论了用于基于深度学习的医学图像字幕的其他公共可用数据集和评估措施。和我们的建议进行了讨论。还讨论了用于基于深度学习的医学图像字幕的其他公共可用数据集和评估措施。

更新日期:2021-02-05
down
wechat
bug