Skip to main content
Log in

Disk storage failure prediction in datacenter using machine learning models

  • Original Article
  • Published:
Applied Nanoscience Aims and scope Submit manuscript

Abstract

Data centers are located centralized to do computation and accessing huge amount of data by the network devices which are interconnected to form the network path. Servers are stacked, data storage is placed in them. Data server backup and server redundancies are the recovery mechanisms implemented. Data centers compute, store, distribute the data by processing them and the data center controls all the interconnected network equipment in the distributed network. In current, RAID system is implemented to avoid the service disruptions due to disk failures, the availability of system and services are achieved with this expensive model. But still the availability is lost, and service disruptions happen due to disk failures, the machine learnings models to be used to predict the disk failures well in advance. Data center has increased usage of system with increased data storage, the failure in disc makes the system failed and down time increases. Analysis on the methods of problems in disk and methods of disk availability and predict the disk failure is the main goal. Various machine learning models are identified and discussed along with the SMART parameters for measuring the failure of the disk. Improved method of Ensembling of trees, random forest and boosting techniques are also discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24
Fig. 25
Fig. 26
Fig. 27
Fig. 28
Fig. 29
Fig. 30
Fig. 31
Fig. 32
Fig. 33
Fig. 34
Fig. 35

Similar content being viewed by others

References

  • Anantharaman P, Qiao M, Ja-dav D (2018) Large scale predictive analytics for hard disk remaining useful life estimation. In Proceedings of the 2018 IEEE International Congress on Big Data (BigData Congress), pages 251–254. IEEE

  • Cristianini N, Shawe-Taylor J (2000) An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press, Cambridge

    Book  Google Scholar 

  • Dally W, Towles B (2004) Principles and practices of interconnection networks. Morgan Kaufmann

  • Greenberg A, Hamilton J, Maltz DA, Patel P (2009a) The cost of a cloud: research problems in data center networks. SIGCOMM Comput Commun Rev 39(1):68–73

    Article  Google Scholar 

  • Greenberg A, Hamilton JR, Jain N, Kandula S, Kim C, Lahiri P, Maltz DA, Patel P, Sengupta S (2009b) Vl2: a scalable and flexible data center network. SIGCOMM Comput Commun Rev 39(4):51–62

    Article  Google Scholar 

  • Harsh V, Jyothi SA, Singh I, Godfrey P (2018) Expander datacenters: from theory to practice., in ArXiv 2018

  • Kim C, Caesar M, Rexford J (2008) “Floodless in seattle: a scalable ethernet architecture for large enterprises,” in ACM SIGCOMM Computer

  • Liu Y, Muppala JK, Veeraraghavan M, Lin D, Hamdi M (2013) Data Center Networks, SpringerBriefs in Computer Science

  • Lu S, Luo B, Patel T, Yao Y, Tiwari D (2020a) Making disk failure predictions SMARTer!, 18th USENIX Conference on File and Storage Technologies

  • Lu S, Luo B, Patel T, Yao Y, Tiwari D, Shi W (2020b) Making disk failure predictions SMARTer!, Proceedings of 18th USENIX Conference on File and Storage Technologies (FAST ’20)

  • Murray JF, Hughes GF, Kreutz-Delgado K, Jolla L (2005) Machine learning methods for predicting failures in hard drives: a multiple-instance application, J Mach Learn Res

  • Patterson D, et al. (1988a) A case for redundant arrays of inexpensive disks (RAID), ACM SIGMOD conference

  • Patterson D, et al. (1988b) Introduction to redundant arrays of inexpensive disks (RAID), Report No. UCB/CSD 88/479, University of California Berkeley

  • Pinheiro E, Weber W-D, Barroso LA (2007a) “Failure trends in a large disk drive population,” in Proceedings of the 5th USENIX Conference disk drive population,” in Proceedings of the 5th USENIX Conference on File and Storage Technologies, FAST ’07, (Berkeley, CA, USA), pp. 17–29, USENIX Association

  • Pinheiro E, Weber W-D, Barroso LA (2007b) Google Inc. 1600 Amphitheatre Pkwy Mountain View, CA 94043 on Failure Trends in a Large Disk Drive Population (Conclusion section)

  • Schroeder B, Gibson GA (2007) “Disk failures in the real world: what does an MTTF of 1,000,000 hours mean to you?,” in Proceedings of the 5th USENIX Conference on File and Storage Technologies, FAST ’07, (Berkeley, CA, USA), pp. 1–16, USENIX Association

  • Schulze M (1988) Considerations on the Design of a RAID Prototype, Report No. UCB/CSD 88/448, University of California Berkley

  • Wu K, Xiao J, Ni L (2012) Rethinking the architecture design of data center networks. Front Comput Sci 6:596–603. https://doi.org/10.1007/s11704-012-1155-6

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Manikandan Ramanathan.

Ethics declarations

Conflict of interest

Author declares that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ramanathan, M., Narayanan, K. Disk storage failure prediction in datacenter using machine learning models. Appl Nanosci 13, 1569–1590 (2023). https://doi.org/10.1007/s13204-021-02039-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13204-021-02039-4

Keywords

Navigation