Function Gradient Approximation with Random Shallow ReLU Networks with Control Applications

Lamperski, Andrew; Salapaka, Siddharth

Computer Science > Machine Learning

arXiv:2410.05071 (cs)

[Submitted on 7 Oct 2024]

Title:Function Gradient Approximation with Random Shallow ReLU Networks with Control Applications

Authors:Andrew Lamperski, Siddharth Salapaka

View PDF HTML (experimental)

Abstract:Neural networks are widely used to approximate unknown functions in control. A common neural network architecture uses a single hidden layer (i.e. a shallow network), in which the input parameters are fixed in advance and only the output parameters are trained. The typical formal analysis asserts that if output parameters exist to approximate the unknown function with sufficient accuracy, then desired control performance can be achieved. A long-standing theoretical gap was that no conditions existed to guarantee that, for the fixed input parameters, required accuracy could be obtained by training the output parameters. Our recent work has partially closed this gap by demonstrating that if input parameters are chosen randomly, then for any sufficiently smooth function, with high-probability there are output parameters resulting in $O((1/m)^{1/2})$ approximation errors, where $m$ is the number of neurons. However, some applications, notably continuous-time value function approximation, require that the network approximates the both the unknown function and its gradient with sufficient accuracy. In this paper, we show that randomly generated input parameters and trained output parameters result in gradient errors of $O((\log(m)/m)^{1/2})$, and additionally, improve the constants from our prior work. We show how to apply the result to policy evaluation problems.

Comments:	Under Review for American Control Conference, 2025
Subjects:	Machine Learning (cs.LG); Systems and Control (eess.SY); Optimization and Control (math.OC); Statistics Theory (math.ST)
Cite as:	arXiv:2410.05071 [cs.LG]
	(or arXiv:2410.05071v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.05071

Submission history

From: Andrew Lamperski [view email]
[v1] Mon, 7 Oct 2024 14:26:49 UTC (227 KB)

Computer Science > Machine Learning

Title:Function Gradient Approximation with Random Shallow ReLU Networks with Control Applications

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Function Gradient Approximation with Random Shallow ReLU Networks with Control Applications

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators