A review of scientific advancements in datasets derived from big data for monitoring the Sustainable Development Goals

  • Review Article
  • Sustainability Science

Abstract

The Sustainable Development Goals (SDGs) suffer from a lack of national data needed for effective monitoring and implementation. Almost half of the SDG indicators are not regularly produced, and available datasets are often out-of-date. New monitoring approaches using big data are advancing rapidly and can complement official statistics to help fill critical data gaps. However, there is poor information-sharing on the latest innovations and research collaborations across different thematic areas, and limited evaluation of strengths and weaknesses for supporting national monitoring. This paper provides a systematic review of the academic literature over the past 5 years relating to the use of big data to support monitoring of the SDGs. It reviews the state-of-the-art research using big data and advanced analytics to produce new datasets, the alignment of these datasets with the official SDG indicators, the main types and sources of big data used, and the analytical methods applied. We developed a set of evaluation criteria and applied it to highlight some of the strengths and limitations of these datasets derived from big data. We find that recent research has developed a considerable range of new datasets that could contribute to monitoring 15 goals, 51 targets, and 69 indicators. Dominant focal areas of research include land and biodiversity, health, water, cities and settlements, and poverty. Satellite and Earth Observation data were the primary sources used, most commonly applied with machine learning methods and cloud computing. However, several challenges remain, including ensuring the relevance of new datasets for monitoring SDG indicators, cost and accessibility considerations, sustainability aspects, and linking global datasets to nationally owned monitoring processes.

Notes

  1. Tier 1: Indicator is conceptually clear, has an internationally established methodology and standards are available, and data are regularly produced by countries for at least 50% of countries and of the population in every region where the indicator is relevant.

  2. Tier 2: Indicator is conceptually clear, has an internationally established methodology and standards are available, but data are not regularly produced by countries.

  3. (ALL = (SDG* OR “sustainable development goals” OR “official statistics”) AND ALL = (“big data” OR geospatial OR “remote sensing” OR “satellite imagery”) AND ALL = (monitor* OR indicator*) / PUBYEAR > 2015 > 150).

  4. https://worldbank.github.io/OpenNightLights/wb-light-every-night-readme.html.

  5. SDG 6.6.1 (sdg661.app).

  6. https://unstats.un.org/bigdata/inventory/?selectID=GlobalPulse6.

  7. https://marketplace.officialstatistics.org/.

  8. http://ghdx.healthdata.org/.

  9. https://www.thelancet.com/lancet/visualisations/gbd-SDGs.

Acknowledgements

This research was led by the Sustainable Development Solutions Network (SDSN) Thematic Research Network on Data and Statistics (TReNDS) with project funding provided by GIZ (Partners for Review).

Author information

Corresponding author

Correspondence to Cameron Allen.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Handled by Ram Avtar, Hokkaido Daigaku, Japan.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 160 KB)

About this article

Cite this article

Allen, C., Smith, M., Rabiee, M. et al. A review of scientific advancements in datasets derived from big data for monitoring the Sustainable Development Goals. Sustain Sci 16, 1701–1716 (2021). https://doi.org/10.1007/s11625-021-00982-3

  • DOI: https://doi.org/10.1007/s11625-021-00982-3
