Better Approximations of High Dimensional Smooth Functions by Deep Neural Networks with Rectified Power Units

Li, Bo; Tang, Shanshan; Yu, Haijun

doi:10.4208/cicp.OA-2019-0168

Mathematics > Numerical Analysis

arXiv:1903.05858 (math)

[Submitted on 14 Mar 2019 (v1), last revised 3 Nov 2019 (this version, v4)]

Title:Better Approximations of High Dimensional Smooth Functions by Deep Neural Networks with Rectified Power Units

Authors:Bo Li, Shanshan Tang, Haijun Yu

View PDF

Abstract:Deep neural networks with rectified linear units (ReLU) are getting more and more popular due to their universal representation power and successful applications. Some theoretical progress regarding the approximation power of deep ReLU network for functions in Sobolev space and Korobov space have recently been made by [D. Yarotsky, Neural Network, 94:103-114, 2017] and [H. Montanelli and Q. Du, SIAM J Math. Data Sci., 1:78-92, 2019], etc. In this paper, we show that deep networks with rectified power units (RePU) can give better approximations for smooth functions than deep ReLU networks. Our analysis bases on classical polynomial approximation theory and some efficient algorithms proposed in this paper to convert polynomials into deep RePU networks of optimal size with no approximation error. Comparing to the results on ReLU networks, the sizes of RePU networks required to approximate functions in Sobolev space and Korobov space with an error tolerance $\varepsilon$, by our constructive proofs, are in general $\mathcal{O}(\log\frac{1}{\varepsilon})$ times smaller than the sizes of corresponding ReLU networks constructed in most of the existing literature. Comparing to the classical results of Mhaskar [Mhaskar, Adv. Comput. Math. 1:61-80, 1993], our constructions use less number of activation functions and numerically more stable, they can be served as good initials of deep RePU networks and further trained to break the limit of linear approximation theory. The functions represented by RePU networks are smooth functions, so they naturally fit in the places where derivatives are involved in the loss function.

Comments:	28 pages, 4 figures
Subjects:	Numerical Analysis (math.NA)
MSC classes:	65D15, 65M12, 65M15
Cite as:	arXiv:1903.05858 [math.NA]
	(or arXiv:1903.05858v4 [math.NA] for this version)
	https://doi.org/10.48550/arXiv.1903.05858
Journal reference:	Communications in Computational Physics 27(2):379--411, 2020
Related DOI:	https://doi.org/10.4208/cicp.OA-2019-0168

Submission history

From: Haijun Yu [view email]
[v1] Thu, 14 Mar 2019 08:45:13 UTC (29 KB)
[v2] Thu, 28 Mar 2019 07:24:33 UTC (31 KB)
[v3] Mon, 1 Apr 2019 03:40:00 UTC (31 KB)
[v4] Sun, 3 Nov 2019 00:26:43 UTC (247 KB)

Mathematics > Numerical Analysis

Title:Better Approximations of High Dimensional Smooth Functions by Deep Neural Networks with Rectified Power Units

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Numerical Analysis

Title:Better Approximations of High Dimensional Smooth Functions by Deep Neural Networks with Rectified Power Units

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators