Skip to main content
Log in

Multi-Attention Mechanism Medical Image Segmentation Combined with Word Embedding Technology

  • Published:
Automatic Control and Computer Sciences Aims and scope Submit manuscript

Abstract

In order to solve the problems of low gray scale contrast and blurred organ boundaries in some medical images, we proposed a joint algorithm of Multi-Attention Parallel CNNs and Independent Recurrent Neural Networks (MACIR) with word embedding technique combined. First, the word embedding technique is used to map the sparse spatial relation matrix into a real dense vector, which is combined with gray scale and edge matrix as input features. The multi -attention mechanism is used to add weight information to capture the importance of each feature more sensitive. Then, the Parallel Convolutional Neural Networks are used to fully exploit the deep semantic information, and IndRNN is introduced to avoid the loss of pixel hierarchy information and realize the integration of information flow. Finally, the Softmax classifier is used to complete the medical image segmentation task. Experiments showed that word embedding MACIR algorithm could effectively improve the segmentation performance of medical images on the data sets of lung X-ray and cervical CT images.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1.
Fig. 2.
Fig. 3.
Fig. 4.
Fig. 5.

Similar content being viewed by others

REFERENCES

  1. Hwang, S. and Park, S., Accurate lung segmentation via network-wise training of convolutional networks, in Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support, Cham: Springer, 2017.

    Google Scholar 

  2. Tian Juanxiu, Liu Guocai, Gu Shanshan, et al., Research and challenge of medical image analysis deep learning method, J. Autom., 2018, vol. 44, no. 3, pp. 401–424.

    Google Scholar 

  3. LeCun, Y., Bengio, Y., and Hinton, G., Deep learning, Nature, 2015, vol. 521, no. 7553, p. 436.

    Article  Google Scholar 

  4. Ma Chao, Liu Yashu, Luo Gongning, et al., 3D MR image segmentation based on cascaded random forest and active contours, J. Autom., 2019, vol. 45, no. 5, pp. 1004–1014.

    Google Scholar 

  5. Li Xiangxia, Li Bin, Tian Lianfang, et al., Segmentation of ground glass-type pulmonary nodules based on sparse representation and random walk, J. Autom., 2018, vol. 44, no. 9, pp. 1637–1647.

    Google Scholar 

  6. Onoma, D.P., Ruan, S., Thureau, S., et al., Segmentation of heterogeneous or small FDG PET positive tissue based on a 3D-locally adaptive random walk algorithm, Comput. Med. Imaging Graphics, 2014, vol. 38, no. 8, pp. 753–763.

    Article  Google Scholar 

  7. Garnavi, R., Aldeen, M., Celebi, M.E., et al., Border detection in dermoscopy images using hybrid thresholding on optimized color channels, Comput. Med. Imaging Graphics, 2011, vol. 35, no. 2, pp. 105–115.

    Article  Google Scholar 

  8. Ge, Z., Demyanov, S., Bozorgtabar, B., et al., Exploiting local and generic features for accurate skin lesions classification using clinical and dermoscopy imaging, 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017), 2017, pp. 986–990.

  9. Abuzaghleh, O., Barkana, B.D., and Faezipour, M., Automated skin lesion analysis based on color and shape geometry feature set for melanoma early detection and prevention, IEEE Long Island Systems, Applications and Technology (LISAT) Conference 2014, 2014, pp. 1–6.

    Google Scholar 

  10. Akbulut, Y., Guo, Y., Sengür, A., et al., An effective color texture image segmentation algorithm based on hermite transform, Appl. Soft Comput., 2018, vol. 67, pp. 494–504.

    Article  Google Scholar 

  11. Tahir, B., Iqbal, S., Usman, Ghani., Khan, M., et al., Feature enhancement framework for brain tumor segmentation and classification, Microsc. Res. Tech., 2019, vol. 82, no. 6, pp. 803–811.

    Article  Google Scholar 

  12. Barui, S., Latha, S., Samiappan, D., et al., SVM pixel classification on colour image segmentation, J. Phys.: Conf. Ser., 2018, vol. 1000.

  13. Chan, Y.H., Zeng, Y.Z., Wu, H.C., et al., Effective pneumothorax detection for chest X-ray images using local binary pattern and support vector machine, J. Healthcare Eng., 2018, vol. 2018.

  14. Long, J., Shelhamer, E., and Darrell, T., Fully convolutional networks for semantic segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015, pp. 3431–3440.

  15. Ronneberger, O., Fischer, P., and Brox, T., U-net: Convolutional networks for biomedical image segmentation, International Conference on Medical Image Computing and Computer-Assisted Intervention, Cham: Springer, 2015, pp. 234–241.

  16. Li, Z., Gan, Y., Liang, X., et al., LSTM-CF: Unifying context modeling and fusion with LSTMS for RGB-d scene labeling, European Conference on Computer Vision, Cham: Springer, 2016, pp. 541–557.

  17. Wang, X., Girshick, R., Gupta, A., et al., Non-local neural networks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 7794–7803.

  18. Wang, S., Zhou, M., Liu, Z., et al., Central focused convolutional neural networks: Developing a data-driven model for lung nodule segmentation, Med. Image Anal., 2017, vol. 40, pp. 172–183.

    Article  Google Scholar 

  19. Goldberg, Y. and Levy, O., Word2vec explained: Deriving Mikolov et al.'s negative-sampling word-embedding method, 2014. arXiv:1402.3722.

  20. Henry, S., Cuffy, C., and McInnes, B.T., Vector representations of multi-word terms for semantic relatedness, J. Biomed. Inf., 2018, p. 77.

  21. Bamler, R. and Mandt, S., Dynamic word embeddings, Proceedings of the 34th International Conference on Machine Learning, 2017, vol. 70, pp. 380–389.

  22. Mikolov, T., Karafiát, M., Burget, L., et al., Recurrent neural network based language model, Eleventh Annual Conference of the International Speech Communication Association, 2010.

  23. Li, S., Li, W., Cook, C., et al., Independently recurrent neural network (INDRNN): Building a longer and deeper RNN, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 5457–5466.

  24. Van Ginneken, B., Stegmann, M.B., and Loog, M., Segmentation of anatomical structures in chest radiographs using supervised methods: A comparative study on a public database, Med. Image Anal., 2006, vol. 10, no. 1, pp. 19–40.

    Article  Google Scholar 

  25. Clark, K., Vendt, B., Smith, K., et al., The Cancer Imaging Archive (TCIA): Maintaining and operating a public information repository, J. Digital Imaging, 2013, vol. 26, no. 6, pp. 1045–1057.

    Article  Google Scholar 

Download references

Funding

This work was supported by the National Natural Science Foundation of China (NSFC) (no. 61765014); Reserve Talents Project of National High-level Personnel of Special Support Program (QN2016YX0324); Urumqi Science and Technology Project (nos. P161310002 and Y161010025); and Reserve Talents Project of National High-level Personnel of Special Support Program (Xinjiang [2014]22).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shengwei Tian.

Ethics declarations

The authors declare no conflicts of interest.

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Junlong Cheng, Tian, S., Yu, L. et al. Multi-Attention Mechanism Medical Image Segmentation Combined with Word Embedding Technology. Aut. Control Comp. Sci. 54, 560–571 (2020). https://doi.org/10.3103/S0146411620060024

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.3103/S0146411620060024

Keywords:

Navigation