Derivative-Free Optimization via Finite Difference Approximation: An Experimental Study

Du-Yi, Wang; Guo, Liang; Guangwu, Liu; Kun, Zhang

Computer Science > Machine Learning

arXiv:2411.00112 (cs)

[Submitted on 31 Oct 2024 (v1), last revised 18 Feb 2025 (this version, v2)]

Title:Derivative-Free Optimization via Finite Difference Approximation: An Experimental Study

Authors:Wang Du-Yi, Liang Guo, Liu Guangwu, Zhang Kun

View PDF HTML (experimental)

Abstract:Derivative-free optimization (DFO) is vital in solving complex optimization problems where only noisy function evaluations are available through an oracle. Within this domain, DFO via finite difference (FD) approximation has emerged as a powerful method. Two classical approaches are the Kiefer-Wolfowitz (KW) and simultaneous perturbation stochastic approximation (SPSA) algorithms, which estimate gradients using just two samples in each iteration to conserve samples. However, this approach yields imprecise gradient estimators, necessitating diminishing step sizes to ensure convergence, often resulting in slow optimization progress. In contrast, FD estimators constructed from batch samples approximate gradients more accurately. While gradient descent algorithms using batch-based FD estimators achieve more precise results in each iteration, they require more samples and permit fewer iterations. This raises a fundamental question: which approach is more effective -- KW-style methods or DFO with batch-based FD estimators? This paper conducts a comprehensive experimental comparison among these approaches, examining the fundamental trade-off between gradient estimation accuracy and iteration steps. Through extensive experiments in both low-dimensional and high-dimensional settings, we demonstrate a surprising finding: when an efficient batch-based FD estimator is applied, its corresponding gradient descent algorithm generally shows better performance compared to classical KW and SPSA algorithms in our tested scenarios.

Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC)
MSC classes:	90-05
ACM classes:	I.6.1; I.6.6
Cite as:	arXiv:2411.00112 [cs.LG]
	(or arXiv:2411.00112v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2411.00112

Submission history

From: Kun Zhang [view email]
[v1] Thu, 31 Oct 2024 18:07:44 UTC (1,443 KB)
[v2] Tue, 18 Feb 2025 01:29:21 UTC (1,523 KB)

Computer Science > Machine Learning

Title:Derivative-Free Optimization via Finite Difference Approximation: An Experimental Study

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Derivative-Free Optimization via Finite Difference Approximation: An Experimental Study

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators