Stage-based Hyper-parameter Optimization for Deep Learning

Shin, Ahnjae; Shin, Dong-Jin; Cho, Sungwoo; Kim, Do Yoon; Jeong, Eunji; Yu, Gyeong-In; Chun, Byung-Gon

arXiv:1911.10504 (cs)

[Submitted on 24 Nov 2019]

Abstract: As deep learning techniques advance, hyper-parameter optimization has become a major workload in deep learning clusters. Although hyper-parameter optimization is crucial for training deep learning models to high performance, executing such a computation-heavy workload efficiently remains a challenge. We observe that many trials issued by existing hyper-parameter optimization algorithms share common hyper-parameter sequence prefixes, which implies redundant computation, because the same hyper-parameter sequence is trained multiple times. We propose a stage-based execution strategy for efficient execution of hyper-parameter optimization algorithms. Our strategy removes this redundancy from the training process by splitting the hyper-parameter sequences of trials into homogeneous stages and generating a tree of stages by merging the common prefixes. Our preliminary experimental results show that applying stage-based execution to hyper-parameter optimization algorithms outperforms the original trial-based method, reducing required GPU-hours by up to 6.60 times and end-to-end training time by up to 4.13 times.
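The prefix-merging idea in the abstract can be illustrated with a small Python sketch. This is not the authors' implementation. It assumes each trial is given as a per-epoch list of a single hyper-parameter value (here a learning rate), that two trials can share work only while their sequences are identical from the start, and that a stage is a run of consecutive epochs with the same value and no branching. The names `Node`, `build_prefix_tree`, `count_steps`, and `extract_stages` are hypothetical, and the trial schedules are made up for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One training step (e.g. one epoch) run with a single hyper-parameter value."""
    children: dict = field(default_factory=dict)
    trial_ends_here: bool = False


def build_prefix_tree(trials):
    """Merge trials that share a common hyper-parameter sequence prefix."""
    root = Node()
    for sequence in trials:
        node = root
        for value in sequence:
            node = node.children.setdefault(value, Node())
        node.trial_ends_here = True
    return root


def count_steps(node):
    """Training steps needed when every shared prefix is trained only once."""
    return sum(1 + count_steps(child) for child in node.children.values())


def extract_stages(root):
    """Collapse unbranched runs of one value into (value, length) stages."""
    stages = []
    stack = list(root.children.items())
    while stack:
        value, node = stack.pop()
        length = 1
        # Extend the stage while no trial ends here and the only
        # continuation keeps the same hyper-parameter value.
        while (not node.trial_ends_here
               and len(node.children) == 1
               and value in node.children):
            node = node.children[value]
            length += 1
        stages.append((value, length))
        stack.extend(node.children.items())
    return stages


# Hypothetical learning-rate schedules, one value per epoch (illustrative only).
trials = [
    [0.1] * 4 + [0.01] * 2,
    [0.1] * 4 + [0.001] * 2,
    [0.1] * 2 + [0.01] * 4,
]

tree = build_prefix_tree(trials)
trial_based_steps = sum(len(t) for t in trials)  # each trial trained from scratch
stage_based_steps = count_steps(tree)            # shared prefixes trained once
stages = extract_stages(tree)

print(trial_based_steps, stage_based_steps)
print(sorted(stages))
```

In this toy example the three trials need 18 epochs when run independently and 12 epochs after merging, because the first two epochs are shared by all three trials and the next two by the first two trials. These figures only illustrate the mechanism and say nothing about the savings reported in the paper.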

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1911.10504 [cs.LG]
(or arXiv:1911.10504v1 [cs.LG] for this version)
https://doi.org/10.48550/arXiv.1911.10504
Journal reference: Workshop on Systems for ML at NeurIPS 2019

Submission history

From: Ahnjae Shin
[v1] Sun, 24 Nov 2019 11:24:33 UTC (467 KB)
