Commercial hard drive failures in a data center application and the role of SMART attribute information
Circuit World (IF 0.8), Pub Date: 2020-09-24, DOI: 10.1108/cw-07-2020-0127
Michael Pecht, Edmond Elburn

Purpose

The reliability of hard disk drives (HDDs) depends on the drive construction, as well as on the operational and environmental conditions in which the drive is used. Self-monitoring, analysis and reporting technology (SMART) continuously provides attribute information on HDD usage and degradation characteristics.

Design/methodology/approach

This paper analyzes the failures reported in the Backblaze data set for ST3000DM001 HDDs, which were intended for desktop applications but were used in a data center application. The SMART attributes used for predicting failure are discussed and analyzed over the life of many hard drives. A case study on the actual use of SMART is also presented, together with the limitations of the SMART attribute information, of the data center's information and of the use of desktop drives in a commercial application.

Findings

The analysis showed that when Backblaze started to record the data, the hard disk drives had already been in operation for some time, with a mean and standard deviation of power-on hours of 6,683 and 365 h, respectively. It is therefore possible that some SMART attributes reached critical values that Backblaze did not record. Additionally, 8% of all ST3000DM001 drives that Backblaze labeled as failed did not have raw values above zero for any of the five attributes that were considered critical. Backblaze recorded 25 SMART attributes in total across all hard disk drive brands; the ST3000DM001, with 83.3% of those attributes, ranked as the drive with the most attributes recorded. Having more recorded attributes with critical values leads to more ST3000DM001 drives being labeled as failed, while hard drives from other brands or part numbers may have experienced more critical SMART attributes without being labeled as failed because those attributes were not recorded.
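
The two headline statistics above (power-on hours at the first recorded day, and the share of failed drives that never showed a critical raw value above zero) can be reproduced in outline from daily drive records. The following is a minimal sketch, not the authors' method. It assumes Backblaze-style column names (`date`, `serial_number`, `model`, `failure`, `smart_9_raw` for power-on hours) and assumes the five critical attributes are SMART 5, 187, 188, 197 and 198, which the abstract does not name. The rows in `TOY_ROWS` are invented purely for illustration and are not taken from the data set.

```python
from statistics import mean, stdev

# Assumption: the abstract does not list the five critical attributes;
# SMART 5, 187, 188, 197 and 198 are used here as a plausible stand-in.
CRITICAL = ("smart_5_raw", "smart_187_raw", "smart_188_raw",
            "smart_197_raw", "smart_198_raw")


def summarize(rows, model="ST3000DM001"):
    """Summarize daily records (one dict per drive per day) for one model."""
    first_date = {}   # serial -> earliest record date (ISO string)
    first_poh = {}    # serial -> power-on hours on that earliest date
    failed = set()    # serials ever labeled failure == 1
    flagged = set()   # serials that ever had a critical raw value > 0
    for r in rows:
        if r["model"] != model:
            continue
        sn = r["serial_number"]
        # ISO dates (YYYY-MM-DD) sort correctly as plain strings.
        if sn not in first_date or r["date"] < first_date[sn]:
            first_date[sn] = r["date"]
            first_poh[sn] = float(r["smart_9_raw"])
        if int(r["failure"]) == 1:
            failed.add(sn)
        # Empty or missing cells are treated as zero.
        if any(float(r.get(c) or 0) > 0 for c in CRITICAL):
            flagged.add(sn)
    poh = list(first_poh.values())
    silent = failed - flagged  # failed without any critical value recorded
    return {
        "poh_mean": mean(poh),
        "poh_std": stdev(poh) if len(poh) > 1 else 0.0,
        "failed": len(failed),
        "silent_failed_share": len(silent) / len(failed) if failed else 0.0,
    }


def _row(date, sn, model, failure, poh, **crit):
    """Hypothetical helper that builds one toy record with string values."""
    values = {c: "0" for c in CRITICAL}
    values.update({k: str(v) for k, v in crit.items()})
    return {"date": date, "serial_number": sn, "model": model,
            "failure": str(failure), "smart_9_raw": str(poh), **values}


# Invented toy data: drive A fails after reporting reallocated sectors,
# drive B fails with all critical attributes at zero, drive C is another model.
TOY_ROWS = [
    _row("2013-04-10", "A", "ST3000DM001", 0, 6000),
    _row("2013-04-11", "A", "ST3000DM001", 1, 6024, smart_5_raw=8),
    _row("2013-04-10", "B", "ST3000DM001", 0, 7000),
    _row("2013-04-11", "B", "ST3000DM001", 1, 7024),
    _row("2013-04-10", "C", "OTHER", 1, 100),
]

if __name__ == "__main__":
    print(summarize(TOY_ROWS))
```

On real data, `rows` could come from `csv.DictReader` over the daily files. A non-zero `silent_failed_share` corresponds to the kind of failed drive the findings describe, one labeled as failed although none of the critical attributes was ever recorded above zero.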

Originality/value

It is an original work carried out at the Center for Advanced Life Cycle Engineering, University of Maryland.




Updated: 2020-09-24