An effective algorithm for hyperparameter optimization of neural networks

Diaz, Gonzalo; Fokoue, Achille; Nannicini, Giacomo; Samulowitz, Horst

Computer Science > Artificial Intelligence

arXiv:1705.08520 (cs)

[Submitted on 23 May 2017]

Title:An effective algorithm for hyperparameter optimization of neural networks

Authors:Gonzalo Diaz, Achille Fokoue, Giacomo Nannicini, Horst Samulowitz

View PDF

Abstract:A major challenge in designing neural network (NN) systems is to determine the best structure and parameters for the network given the data for the machine learning problem at hand. Examples of parameters are the number of layers and nodes, the learning rates, and the dropout rates. Typically, these parameters are chosen based on heuristic rules and manually fine-tuned, which may be very time-consuming, because evaluating the performance of a single parametrization of the NN may require several hours. This paper addresses the problem of choosing appropriate parameters for the NN by formulating it as a box-constrained mathematical optimization problem, and applying a derivative-free optimization tool that automatically and effectively searches the parameter space. The optimization tool employs a radial basis function model of the objective function (the prediction accuracy of the NN) to accelerate the discovery of configurations yielding high accuracy. Candidate configurations explored by the algorithm are trained to a small number of epochs, and only the most promising candidates receive full training. The performance of the proposed methodology is assessed on benchmark sets and in the context of predicting drug-drug interactions, showing promising results. The optimization tool used in this paper is open-source.

Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1705.08520 [cs.AI]
	(or arXiv:1705.08520v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1705.08520

Submission history

From: Horst Samulowitz [view email]
[v1] Tue, 23 May 2017 20:17:44 UTC (530 KB)

Computer Science > Artificial Intelligence

Title:An effective algorithm for hyperparameter optimization of neural networks

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:An effective algorithm for hyperparameter optimization of neural networks

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators