When less is more: evolving large neural networks from small ones

Radhakrishnan, Anil; Lindner, John F.; Miller, Scott T.; Sinha, Sudeshna; Ditto, William L.

arXiv:2501.18012 (cs)

[Submitted on 29 Jan 2025]

Abstract: In contrast to conventional artificial neural networks, which are large and structurally static, we study feed-forward neural networks that are small and dynamic, whose nodes can be added (or subtracted) during training. A single neuronal weight in the network controls the network's size, while the weight itself is optimized by the same gradient-descent algorithm that optimizes the network's other weights and biases, but with a size-dependent objective or loss function. We train and evaluate such Nimble Neural Networks on nonlinear regression and classification tasks where they outperform the corresponding static networks. Growing networks to minimal, appropriate, or optimal sizes while training elucidates network dynamics and contrasts with pruning large networks after training but before deployment.
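
The abstract's central mechanism, one trainable weight that sets the network's size and is updated by the same gradient descent as the other weights under a size-dependent loss, can be illustrated with a small sketch. This is not the authors' implementation: the soft sigmoid mask, the linear size penalty, the constants `H_MAX`, `SHARPNESS`, and `SIZE_PENALTY`, and every function name below are assumptions chosen only to make the idea concrete in plain Python.

```python
import math
import random

# All constants are hypothetical choices for this sketch, not values from the paper.
H_MAX = 8            # upper bound on the number of hidden neurons
SHARPNESS = 4.0      # steepness of the soft on/off gate
SIZE_PENALTY = 1e-3  # weight of the size term in the loss


def mask(n, i):
    """Soft gate: near 1 when neuron index i is well below n, near 0 well above.

    Equal to sigmoid(SHARPNESS * (n - i)), written with tanh to avoid overflow.
    """
    return 0.5 * (1.0 + math.tanh(0.5 * SHARPNESS * (n - i)))


def init_params(seed=0):
    """Random weights w, biases b, output weights v, and a size weight n = 1."""
    rng = random.Random(seed)
    draw = lambda: [rng.uniform(-1.0, 1.0) for _ in range(H_MAX)]
    return draw(), draw(), draw(), 1.0


def predict(params, x):
    """One hidden tanh layer whose neurons are gated by the size weight n."""
    w, b, v, n = params
    return sum(mask(n, i) * v[i] * math.tanh(w[i] * x + b[i]) for i in range(H_MAX))


def loss(params, data):
    """Mean squared error plus a term that grows with the size weight (assumed form)."""
    total = 0.0
    for x, y in data:
        d = predict(params, x) - y
        total += d * d
    return total / len(data) + SIZE_PENALTY * params[3]


def gradients(params, data):
    """Analytic gradients of the loss with respect to w, b, v, and n."""
    w, b, v, n = params
    gw, gb, gv = [0.0] * H_MAX, [0.0] * H_MAX, [0.0] * H_MAX
    gn = SIZE_PENALTY
    m = [mask(n, i) for i in range(H_MAX)]
    for x, y in data:
        h = [math.tanh(w[i] * x + b[i]) for i in range(H_MAX)]
        err = sum(m[i] * v[i] * h[i] for i in range(H_MAX)) - y
        scale = 2.0 * err / len(data)
        for i in range(H_MAX):
            dh = 1.0 - h[i] * h[i]  # derivative of tanh
            gv[i] += scale * m[i] * h[i]
            gw[i] += scale * m[i] * v[i] * dh * x
            gb[i] += scale * m[i] * v[i] * dh
            # The size weight receives a gradient through every gate.
            gn += scale * v[i] * h[i] * SHARPNESS * m[i] * (1.0 - m[i])
    return gw, gb, gv, gn


def train(data, params, steps=200, lr=0.02):
    """Plain gradient descent; n is updated exactly like the other parameters."""
    w, b, v, n = params
    w, b, v = list(w), list(b), list(v)
    for _ in range(steps):
        gw, gb, gv, gn = gradients((w, b, v, n), data)
        for i in range(H_MAX):
            w[i] -= lr * gw[i]
            b[i] -= lr * gb[i]
            v[i] -= lr * gv[i]
        n -= lr * gn
    return w, b, v, n


def make_data(points=20):
    """Toy nonlinear regression target: y = sin(x) on [-2, 2]."""
    xs = [-2.0 + 4.0 * k / (points - 1) for k in range(points)]
    return [(x, math.sin(x)) for x in xs]
```

Calling `train(make_data(), init_params())` returns the updated parameters, and the last entry of the tuple is the learned size weight. In this sketch the data term can pull the size weight up to switch on more neurons while the penalty term pulls it down, which is one plausible reading of a "size-dependent objective"; the paper may define the gate and the loss differently.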

Comments:	8 pages, 7 figures
Subjects:	Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn)
Cite as:	arXiv:2501.18012 [cs.LG]
	(or arXiv:2501.18012v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2501.18012

Submission history

From: John Lindner
[v1] Wed, 29 Jan 2025 21:56:38 UTC (1,472 KB)
