Radon Cumulative Distribution Transform Subspace Modeling for Image Classification

Shifat-E-Rabbi, Mohammad; Yin, Xuwang; Rubaiyat, Abu Hasnat Mohammad; Li, Shiying; Kolouri, Soheil; Aldroubi, Akram; Nichols, Jonathan M.; Rohde, Gustavo K.

doi:10.1007/s10851-021-01052-0

Radon Cumulative Distribution Transform Subspace Modeling for Image Classification

Published: 05 August 2021

Volume 63, pages 1185–1203, (2021)
Cite this article

Journal of Mathematical Imaging and Vision Aims and scope Submit manuscript

Mohammad Shifat-E-Rabbi ORCID: orcid.org/0000-0002-0972-5353¹,
Xuwang Yin²^na1,
Abu Hasnat Mohammad Rubaiyat²^na1,
Shiying Li¹,
Soheil Kolouri³,
Akram Aldroubi⁴,
Jonathan M. Nichols⁵ &
…
Gustavo K. Rohde^6,7

1329 Accesses
10 Citations
1 Altmetric
Explore all metrics

Abstract

We present a new supervised image classification method applicable to a broad class of image deformation models. The method makes use of the previously described Radon Cumulative Distribution Transform (R-CDT) for image data, whose mathematical properties are exploited to express the image data in a form that is more suitable for machine learning. While certain operations such as translation, scaling, and higher-order transformations are challenging to model in native image space, we show the R-CDT can capture some of these variations and thus render the associated image classification problems easier to solve. The method—utilizing a nearest-subspace algorithm in the R-CDT space—is simple to implement, non-iterative, has no hyper-parameters to tune, is computationally efficient, label efficient, and provides competitive accuracies to state-of-the-art neural networks for many types of classification problems. In addition to the test accuracy performances, we show improvements (with respect to neural network-based methods) in terms of computational efficiency (it can be implemented without the use of GPUs), number of training samples needed for training, as well as out-of-distribution generalization. The Python code for reproducing our results is available at Shifat-E-Rabbi et al. (Python code implementing the Radon cumulative distribution transform subspace model for image classification. https://github.com/rohdelab/rcdt_ns_classifier).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

U-Net: Convolutional Networks for Biomedical Image Segmentation

A survey on Image Data Augmentation for Deep Learning

Article Open access 06 July 2019

Connor Shorten & Taghi M. Khoshgoftaar

ImageNet Large Scale Visual Recognition Challenge

Article 11 April 2015

Olga Russakovsky, Jia Deng, … Li Fei-Fei

Notes

We are using a slightly different definition of the CDT than in [31]. The properties of the CDT outlined here hold in both definitions.
Rigorously speaking, if \(\widehat{\mathbb {V}}^{(p)}\) is a closed subspace, then \(d^2( \widehat{s},\widehat{\mathbb {V}}^{(p)})>0\) if and only if \(\widehat{s}\notin \widehat{\mathbb {V}}^{(p)} \). In practice, \(\widehat{\mathbb {V}}^{(p)}\) will be a finite dimensional space and hence, the closedness condition is satisfied.
The same grid is chosen for all images. m, n are positive integers.

References

Shifat-E-Rabbi, M., Yin, X., Rubaiyat, A.H.M., Li, S., Kolouri, S., Aldroubi, A., Nichols, J.M., Rohde, G.K: Python code implementing the Radon cumulative distribution transform subspace model for image classification. https://github.com/rohdelab/rcdt_ns_classifier
Sertel, O., Kong, J., Shimada, H., Catalyurek, U.V., Saltz, J.H., Gurcan, M.N.: Computer-aided prognosis of neuroblastoma on whole-slide images: classification of stromal development. Pattern Recognit. 42(6), 1093–1103 (2009)
Article Google Scholar
Basu, S., Kolouri, S., Rohde, G.K.: Detecting and visualizing cell phenotype differences from microscopy images using transport-based morphometry. Proc. Natl. Acad. Sci. 111(9), 3448–3453 (2014)
Article Google Scholar
Kundu, S., Kolouri, S., Erickson, K.I., Kramer, A.F., McAuley, E., Rohde, G.K.: Discovery and visualization of structural biomarkers from MRI using transport-based morphometry. Neuroimage 167, 256–275 (2018)
Article Google Scholar
Schulz, J.B., Borkert, J., Wolf, S., Schmitz-Hübsch, T., Rakowicz, M., Mariotti, C., Schoels, L., Timmann, D., Warrenburg, B., Dürr, A., Pandolfo, M., Kang, J., Mandly, A.G., Nagele, T., Grisoli, M., Boguslawska, R., Bauer, P., Klockgether, T., Hauser, T.: Visualization, quantification and correlation of brain atrophy with clinical symptoms in spinocerebellar ataxia types 1, 3 and 6. Neuroimage 49(1), 158–168 (2010)
Article Google Scholar
Hadid, A., Heikkila, J.Y., Silvén, O., Pietikainen, M.: Face and eye detection for person authentication in mobile phones. In: 2007 First ACM/IEEE International Conference on Distributed Smart Cameras, pp. 101–108 (2007)
Shifat-E-Rabbi, M., Yin, X., Fitzgerald, C.E., Rohde, G.K.: Cell image classification: a comparative overview. Cytometry A 97A(4), 347–362 (2020)
Article Google Scholar
Rawat, W., Wang, Z.: Deep convolutional neural networks for image classification: a comprehensive review. Neural Comput. 29(9), 2352–2449 (2017)
Article MathSciNet Google Scholar
Lu, D., Weng, Q.: A survey of image classification methods and techniques for improving classification performance. Int. J. Remote Sens. 28(5), 823–870 (2007)
Article Google Scholar
Prewitt, J.M.S., Mendelsohn, M.L.: The analysis of cell images. Ann. N. Y. Acad. Sci. 128(3), 1035–1053 (1966)
Article Google Scholar
Orlov, N., Shamir, L., Macura, T., Johnston, J., Eckley, D.M., Goldberg, I.G.: WND-CHARM: multi-purpose image classification using compound image transforms. Pattern Recognit. Lett. 29(11), 1684–1693 (2008)
Article Google Scholar
Ponomarev, G.V., Arlazarov, V.L., Gelfand, M.S., Kazanov, M.D.: Ana hep-2 cells image classification using number, size, shape and localization of targeted cell regions. Pattern Recognit. 47(7), 2360–2366 (2014)
Article Google Scholar
Bandos, T.V., Bruzzone, L., Camps-Valls, G.: Classification of hyperspectral images with regularized linear discriminant analysis. IEEE Trans. Geosci. Remote Sens. 47(3), 862–873 (2009)
Article Google Scholar
Muldoon, T.J., Thekkek, N., Roblyer, D.M., Maru, D., Harpaz, N., Potack, J., Anandasabapathy, S., Richards-Kortum, R.R.: Evaluation of quantitative image analysis criteria for the high-resolution microendoscopic detection of neoplasia in Barrett’s esophagus. J. Biomed. Opt. 15(2), 026027 (2010)
Article Google Scholar
Zhang, J., Marszałek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classification of texture and object categories: a comprehensive study. Int. J. Comput. Vis. 73(2), 213–238 (2007)
Article Google Scholar
Perronnin, F., Sánchez, J., Mensink, T.: Improving the fisher kernel for large-scale image classification. In: European Conference on Computer Vision, pp. 143–156 (2010)
Bosch, A., Zisserman, A., Munoz, X.: Image classification using random forests and ferns. In: 2007 IEEE 11th International Conference on Computer Vision, pp. 1–8. IEEE (2007)
Du, P., Samat, A., Waske, B., Liu, S., Li, Z.: Random forest and rotation forest for fully polarized SAR image classification using polarimetric and spatial features. ISPRS J. Photogramm. Remote Sens. 105, 38–53 (2015)
Article Google Scholar
LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436–444 (2015)
Article Google Scholar
Shin, H.-C., Roth, H.R., Gao, M., Lu, L., Xu, Z., Nogues, I., Yao, J., Mollura, D., Summers, R.M.: Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning. IEEE Trans. Med. Imaging 35(5), 1285–1298 (2016)
Article Google Scholar
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2818–2826 (2016)
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)
Wolberg, G.: Image morphing: a survey. Vis. Comput. 14(8), 360–372 (1998)
Article Google Scholar
Kolouri, S., Park, S.R., Rohde, G.K.: The radon cumulative distribution transform and its application to image classification. IEEE Trans. Image Process. 25(2), 920–934 (2016)
Article MathSciNet Google Scholar
Kolouri, S., Park, S.R., Thorpe, M., Slepcev, D., Rohde, G.K.: Optimal mass transport: signal processing and machine-learning applications. IEEE Signal Process. Mag. 34(4), 43–59 (2017)
Article Google Scholar
Villani, C.: Optimal Transport: Old and New, vol. 338. Springer, Berlin (2008)
MATH Google Scholar
Wang, W., Slepčev, D., Basu, S., Ozolek, J.A., Rohde, G.K.: A linear optimal transportation framework for quantifying and visualizing variations in sets of images. Int. J. Comput. Vis. 101(2), 254–269 (2013)
Article MathSciNet Google Scholar
Kolouri, S., Zou, Y., Rohde, G.K.: Sliced Wasserstein kernels for probability distributions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5258–5267 (2016)
Park, S.R., Cattell, L., Nichols, J.M., Watnik, A., Doster, T., Rohde, G.K.: De-multiplexing vortex modes in optical communications using transport-based pattern recognition. Opt. Express 26(4), 4004–4022 (2018)
Article Google Scholar
Fitzgerald, C.E., Cattell, L., Rohde, G.K.: Training classifiers with limited data using the Radon cumulative distribution transform. Med. Imaging Image Process. 10574, 105742 (2018)
Google Scholar
Park, S.R., Kolouri, S., Kundu, S., Rohde, G.K.: The cumulative distribution transform and linear pattern classification. Appl. Comput. Harmon. Anal. 45(3), 616–641 (2018)
Article MathSciNet Google Scholar
Bracewell, R.N.: The Fourier Transform and Its Applications, vol. 31999. McGraw-Hill, New York (1986)
MATH Google Scholar
Yang, I.: A convex optimization approach to distributionally robust Markov decision processes with Wasserstein distance. IEEE Control Syst. Lett. 1(1), 164–9 (2017)
Article MathSciNet Google Scholar
Quinto, E.T.: An introduction to x-ray tomography and radon transforms. In: Proceedings of Symposia in Applied Mathematics, vol. 63, p. 1 (2006)
Natterer, F.: The Mathematics of Computerized Tomography. SIAM, Philadelphia (2001)
Book Google Scholar
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
Article Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition (2014). arXiv:1409.1556
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization (2014). arXiv:1412.6980
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–30 (2011)
MathSciNet MATH Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2005)
Lowe, D.G.: Object recognition from local scale-invariant features. In: Proceedings of the Seventh IEEE International Conference on Computer Vision, vol. 2, pp. 1150–1157 (1999)
Lee, G.R., Gommers, R., Waselewski, F., Wohlfahrt, K., O’Leary, A.: PyWavelets: a Python package for wavelet analysis. J. Open Source Softw. 4(36), 1237 (2019)
Article Google Scholar
Kaggle: Sign Language MNIST. https://www.kaggle.com/datamunge/sign-language-mnist. Accessed 10 Mar 2020
Vondrick, C., Khosla, A., Malisiewicz, T., Torralba, A.: Hoggles: Visualizing object detection features. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1–8 (2013)
Marcus, D.S., Wang, T.H., Parker, J., Csernansky, J.G., Morris, J.C., Buckner, R.L.: Open access series of imaging studies (OASIS): cross-sectional MRI data in young, middle aged, nondemented, and demented older adults. J. Cogn. Neurosci. 19(9), 1498–1507 (2007)
Article Google Scholar
Krizhevsky, A., Hinton, G.: Learning multiple layers of features from tiny images (2009)
Gardner, M.W., Dorling, S.R.: Artificial neural networks (the multilayer perceptron)—a review of applications in the atmospheric sciences. Atmos. Environ. 32(14–15), 2627–2636 (1998)
Article Google Scholar
Pampel, F.C.: Logistic Regression: A Primer. SAGE Publications Incorporated, Thousand Oaks (2020)
MATH Google Scholar
Rubaiyat, A.H., Hallam, K.M., Nichols, J.M., Hutchinson, M.N., Li, S., Rohde, G.K.: Parametric signal estimation using the cumulative distribution transform. IEEE Trans. Signal Process. 68, 3312–24 (2020)
Article MathSciNet Google Scholar
Nichols, J.M., Emerson, T.H., Cattell, L., Park, S., Kanaev, A., Bucholtz, F., Watnik, A., Doster, T., Rohde, G.K.: Transport-based model for turbulence-corrupted imagery. Appl. Opt. 57(16), 4524–36 (2018)
Article Google Scholar

Download references

Author information

Xuwang Yin and Abu Hasnat Mohammad Rubaiyat have contributed equally to this work.

Authors and Affiliations

Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, 22908, USA
Mohammad Shifat-E-Rabbi & Shiying Li
Department of Electrical and Computer Engineering, University of Virginia, Charlottesville, VA, 22904, USA
Xuwang Yin & Abu Hasnat Mohammad Rubaiyat
Department of Computer Science, Vanderbilt University, Nashville, TN, 37212, USA
Soheil Kolouri
Department of Mathematics, Vanderbilt University, Nashville, TN, 37212, USA
Akram Aldroubi
U.S. Naval Research Laboratory, Washington, DC, 20375, USA
Jonathan M. Nichols
Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, 22908, USA
Gustavo K. Rohde
Department of Electrical and Computer Engineering, University of Virginia, Charlottesville, VA, 22908, USA
Gustavo K. Rohde

Authors

Mohammad Shifat-E-Rabbi
View author publications
You can also search for this author in PubMed Google Scholar
Xuwang Yin
View author publications
You can also search for this author in PubMed Google Scholar
Abu Hasnat Mohammad Rubaiyat
View author publications
You can also search for this author in PubMed Google Scholar
Shiying Li
View author publications
You can also search for this author in PubMed Google Scholar
Soheil Kolouri
View author publications
You can also search for this author in PubMed Google Scholar
Akram Aldroubi
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan M. Nichols
View author publications
You can also search for this author in PubMed Google Scholar
Gustavo K. Rohde
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mohammad Shifat-E-Rabbi.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This work was supported in part by NIH Grants GM130825, GM090033.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 226 KB)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Shifat-E-Rabbi, M., Yin, X., Rubaiyat, A.H.M. et al. Radon Cumulative Distribution Transform Subspace Modeling for Image Classification. J Math Imaging Vis 63, 1185–1203 (2021). https://doi.org/10.1007/s10851-021-01052-0

Download citation

Received: 16 October 2020
Accepted: 16 July 2021
Published: 05 August 2021
Issue Date: November 2021
DOI: https://doi.org/10.1007/s10851-021-01052-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Radon Cumulative Distribution Transform Subspace Modeling for Image Classification

Abstract

Access this article

Similar content being viewed by others

U-Net: Convolutional Networks for Biomedical Image Segmentation

A survey on Image Data Augmentation for Deep Learning

ImageNet Large Scale Visual Recognition Challenge

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Supplementary Information

Supplementary material 1 (pdf 226 KB)

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Abstract

Access this article

Similar content being viewed by others

U-Net: Convolutional Networks for Biomedical Image Segmentation

A survey on Image Data Augmentation for Deep Learning

ImageNet Large Scale Visual Recognition Challenge

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Supplementary Information

Supplementary material 1 (pdf 226 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation