Layerwise Perturbation-Based Adversarial Training for Hard Drive Health Degree Prediction

Zhang, Jianguo; Wang, Ji; He, Lifang; Li, Zhao; Yu, Philip S.

Computer Science > Machine Learning

arXiv:1809.04188 (cs)

[Submitted on 11 Sep 2018 (v1), last revised 28 Sep 2018 (this version, v4)]

Title:Layerwise Perturbation-Based Adversarial Training for Hard Drive Health Degree Prediction

Authors:Jianguo Zhang, Ji Wang, Lifang He, Zhao Li, Philip S. Yu

View PDF

Abstract:With the development of cloud computing and big data, the reliability of data storage systems becomes increasingly important. Previous researchers have shown that machine learning algorithms based on SMART attributes are effective methods to predict hard drive failures. In this paper, we use SMART attributes to predict hard drive health degrees which are helpful for taking different fault tolerant actions in advance. Given the highly imbalanced SMART datasets, it is a nontrivial work to predict the health degree precisely. The proposed model would encounter overfitting and biased fitting problems if it is trained by the traditional methods. In order to resolve this problem, we propose two strategies to better utilize imbalanced data and improve performance. Firstly, we design a layerwise perturbation-based adversarial training method which can add perturbations to any layers of a neural network to improve the generalization of the network. Secondly, we extend the training method to the semi-supervised settings. Then, it is possible to utilize unlabeled data that have a potential of failure to further improve the performance of the model. Our extensive experiments on two real-world hard drive datasets demonstrate the superiority of the proposed schemes for both supervised and semi-supervised classification. The model trained by the proposed method can correctly predict the hard drive health status 5 and 15 days in advance. Finally, we verify the generality of the proposed training method in other similar anomaly detection tasks where the dataset is imbalanced. The results argue that the proposed methods are applicable to other domains.

Comments:	The 2018 IEEE International Conference on Data Mining (ICDM'18)
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1809.04188 [cs.LG]
	(or arXiv:1809.04188v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1809.04188

Submission history

From: Jianguo Zhang [view email]
[v1] Tue, 11 Sep 2018 22:43:19 UTC (1,284 KB)
[v2] Thu, 13 Sep 2018 02:31:51 UTC (1,284 KB)
[v3] Wed, 19 Sep 2018 01:41:21 UTC (1,284 KB)
[v4] Fri, 28 Sep 2018 20:00:03 UTC (1,284 KB)

Computer Science > Machine Learning

Title:Layerwise Perturbation-Based Adversarial Training for Hard Drive Health Degree Prediction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Layerwise Perturbation-Based Adversarial Training for Hard Drive Health Degree Prediction

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators