Multi-Objective Evolutionary Design of Deep Convolutional Neural Networks for Image Classification

Lu, Zhichao; Whalen, Ian; Dhebar, Yashesh; Deb, Kalyanmoy; Goodman, Erik; Banzhaf, Wolfgang; Boddeti, Vishnu Naresh

arXiv:1912.01369 (cs)

[Submitted on 3 Dec 2019 (v1), last revised 15 Sep 2020 (this version, v3)]

Abstract: Early advancements in convolutional neural network (CNN) architectures were primarily driven by human expertise and by elaborate design processes. Recently, neural architecture search was proposed with the aim of automating the network design process and generating task-dependent architectures. While existing approaches have achieved competitive performance in image classification, they are not well suited to problems where the computational budget is limited, for two reasons: (1) the obtained architectures are either optimized solely for classification performance or for only one deployment scenario; (2) the search process requires vast computational resources in most approaches. To overcome these limitations, we propose an evolutionary algorithm for searching neural architectures under multiple objectives, such as classification performance and floating-point operations (FLOPs). The proposed method addresses the first shortcoming by populating a set of architectures to approximate the entire Pareto frontier through genetic operations that recombine and modify architectural components progressively. Our approach improves computational efficiency by carefully down-scaling the architectures during the search as well as reinforcing the patterns commonly shared among past successful architectures through Bayesian model learning. The integration of these two main contributions allows an efficient design of architectures that are competitive with, and in most cases outperform, both manually and automatically designed architectures on benchmark image classification datasets: CIFAR, ImageNet, and human chest X-ray. The flexibility provided by simultaneously obtaining multiple architecture choices for different compute requirements further differentiates our approach from other methods in the literature. Code is available at this https URL
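
To make the notion of a Pareto frontier over classification performance and FLOPs concrete, the following is a minimal sketch of non-dominated filtering for two objectives that are both minimized. It is not taken from the authors' code: the names `dominates` and `pareto_front` and the candidate values are hypothetical, chosen only for illustration, and the paper's actual search procedure may differ.

```python
from typing import List, Tuple

# Each candidate is (classification error, FLOPs); both are minimized.
Objectives = Tuple[float, float]


def dominates(a: Objectives, b: Objectives) -> bool:
    """True if a is no worse than b in every objective and strictly better in one."""
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def pareto_front(points: List[Objectives]) -> List[Objectives]:
    """Keep the points that no other point dominates (input order preserved)."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Hypothetical (error %, MFLOPs) pairs, made up for illustration only.
candidates = [
    (5.0, 600.0),
    (4.0, 900.0),
    (6.0, 300.0),
    (5.5, 700.0),   # dominated by (5.0, 600.0)
    (4.0, 1200.0),  # dominated by (4.0, 900.0)
]
front = pareto_front(candidates)
print(front)
```

Each surviving point represents a different trade-off between accuracy and compute, which is the sense in which a single search can yield multiple architecture choices for different compute requirements.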

Comments:	Published in IEEE Transactions on Evolutionary Computation, 23 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1912.01369 [cs.CV]
	(or arXiv:1912.01369v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1912.01369

Submission history

From: Vishnu Naresh Boddeti
[v1] Tue, 3 Dec 2019 13:57:25 UTC (4,459 KB)
[v2] Fri, 11 Sep 2020 18:38:21 UTC (24,361 KB)
[v3] Tue, 15 Sep 2020 13:35:27 UTC (23,916 KB)
