-
Deep Symmetric Autoencoders from the Eckart-Young-Schmidt Perspective
Authors:
Simone Brivio,
Nicola Rares Franco
Abstract:
Deep autoencoders have become a fundamental tool in various machine learning applications, ranging from dimensionality reduction and reduced order modeling of partial differential equations to anomaly detection and neural machine translation. Despite their empirical success, a solid theoretical foundation for their expressiveness remains elusive, particularly when compared to classical projection-…
▽ More
Deep autoencoders have become a fundamental tool in various machine learning applications, ranging from dimensionality reduction and reduced order modeling of partial differential equations to anomaly detection and neural machine translation. Despite their empirical success, a solid theoretical foundation for their expressiveness remains elusive, particularly when compared to classical projection-based techniques. In this work, we aim to take a step forward in this direction by presenting a comprehensive analysis of what we refer to as symmetric autoencoders, a broad class of deep learning architectures ubiquitous in the literature. Specifically, we introduce a formal distinction between different classes of symmetric architectures, analyzing their strengths and limitations from a mathematical perspective. For instance, we show that the reconstruction error of symmetric autoencoders with orthonormality constraints can be understood by leveraging the well-renowned Eckart-Young-Schmidt (EYS) theorem. As a byproduct of our analysis, we end up developing the EYS initialization strategy for symmetric autoencoders, which is based on an iterated application of the Singular Value Decomposition (SVD). To validate our findings, we conduct a series of numerical experiments where we benchmark our proposal against conventional deep autoencoders, discussing the importance of model design and initialization.
△ Less
Submitted 13 June, 2025;
originally announced June 2025.
-
Handling geometrical variability in nonlinear reduced order modeling through Continuous Geometry-Aware DL-ROMs
Authors:
Simone Brivio,
Stefania Fresca,
Andrea Manzoni
Abstract:
Deep Learning-based Reduced Order Models (DL-ROMs) provide nowadays a well-established class of accurate surrogate models for complex physical systems described by parametrized PDEs, by nonlinearly compressing the solution manifold into a handful of latent coordinates. Until now, design and application of DL-ROMs mainly focused on physically parameterized problems. Within this work, we provide a n…
▽ More
Deep Learning-based Reduced Order Models (DL-ROMs) provide nowadays a well-established class of accurate surrogate models for complex physical systems described by parametrized PDEs, by nonlinearly compressing the solution manifold into a handful of latent coordinates. Until now, design and application of DL-ROMs mainly focused on physically parameterized problems. Within this work, we provide a novel extension of these architectures to problems featuring geometrical variability and parametrized domains, namely, we propose Continuous Geometry-Aware DL-ROMs (CGA-DL-ROMs). In particular, the space-continuous nature of the proposed architecture matches the need to deal with multi-resolution datasets, which are quite common in the case of geometrically parametrized problems. Moreover, CGA-DL-ROMs are endowed with a strong inductive bias that makes them aware of geometrical parametrizations, thus enhancing both the compression capability and the overall performance of the architecture. Within this work, we justify our findings through a thorough theoretical analysis, and we practically validate our claims by means of a series of numerical tests encompassing physically-and-geometrically parametrized PDEs, ranging from the unsteady Navier-Stokes equations for fluid dynamics to advection-diffusion-reaction equations for mathematical biology.
△ Less
Submitted 8 November, 2024;
originally announced November 2024.
-
On latent dynamics learning in nonlinear reduced order modeling
Authors:
Nicola Farenga,
Stefania Fresca,
Simone Brivio,
Andrea Manzoni
Abstract:
In this work, we present the novel mathematical framework of latent dynamics models (LDMs) for reduced order modeling of parameterized nonlinear time-dependent PDEs. Our framework casts this latter task as a nonlinear dimensionality reduction problem, while constraining the latent state to evolve accordingly to an (unknown) dynamical system. A time-continuous setting is employed to derive error an…
▽ More
In this work, we present the novel mathematical framework of latent dynamics models (LDMs) for reduced order modeling of parameterized nonlinear time-dependent PDEs. Our framework casts this latter task as a nonlinear dimensionality reduction problem, while constraining the latent state to evolve accordingly to an (unknown) dynamical system. A time-continuous setting is employed to derive error and stability estimates for the LDM approximation of the full order model (FOM) solution. We analyze the impact of using an explicit Runge-Kutta scheme in the time-discrete setting, resulting in the $Δ\text{LDM}$ formulation, and further explore the learnable setting, $Δ\text{LDM}_θ$, where deep neural networks approximate the discrete LDM components, while providing a bounded approximation error with respect to the FOM. Moreover, we extend the concept of parameterized Neural ODE - recently proposed as a possible way to build data-driven dynamical systems with varying input parameters - to be a convolutional architecture, where the input parameters information is injected by means of an affine modulation mechanism, while designing a convolutional autoencoder neural network able to retain spatial-coherence, thus enhancing interpretability at the latent level. Numerical experiments, including the Burgers' and the advection-reaction-diffusion equations, demonstrate the framework's ability to obtain, in a multi-query context, a time-continuous approximation of the FOM solution, thus being able to query the LDM approximation at any given time instance while retaining a prescribed level of accuracy. Our findings highlight the remarkable potential of the proposed LDMs, representing a mathematically rigorous framework to enhance the accuracy and approximation capabilities of reduced order modeling for time-dependent parameterized PDEs.
△ Less
Submitted 28 November, 2024; v1 submitted 27 August, 2024;
originally announced August 2024.
-
PTPI-DL-ROMs: pre-trained physics-informed deep learning-based reduced order models for nonlinear parametrized PDEs
Authors:
Simone Brivio,
Stefania Fresca,
Andrea Manzoni
Abstract:
The coupling of Proper Orthogonal Decomposition (POD) and deep learning-based ROMs (DL-ROMs) has proved to be a successful strategy to construct non-intrusive, highly accurate, surrogates for the real time solution of parametric nonlinear time-dependent PDEs. Inexpensive to evaluate, POD-DL-ROMs are also relatively fast to train, thanks to their limited complexity. However, POD-DL-ROMs account for…
▽ More
The coupling of Proper Orthogonal Decomposition (POD) and deep learning-based ROMs (DL-ROMs) has proved to be a successful strategy to construct non-intrusive, highly accurate, surrogates for the real time solution of parametric nonlinear time-dependent PDEs. Inexpensive to evaluate, POD-DL-ROMs are also relatively fast to train, thanks to their limited complexity. However, POD-DL-ROMs account for the physical laws governing the problem at hand only through the training data, that are usually obtained through a full order model (FOM) relying on a high-fidelity discretization of the underlying equations. Moreover, the accuracy of POD-DL-ROMs strongly depends on the amount of available data. In this paper, we consider a major extension of POD-DL-ROMs by enforcing the fulfillment of the governing physical laws in the training process -- that is, by making them physics-informed -- to compensate for possible scarce and/or unavailable data and improve the overall reliability. To do that, we first complement POD-DL-ROMs with a trunk net architecture, endowing them with the ability to compute the problem's solution at every point in the spatial domain, and ultimately enabling a seamless computation of the physics-based loss by means of the strong continuous formulation. Then, we introduce an efficient training strategy that limits the notorious computational burden entailed by a physics-informed training phase. In particular, we take advantage of the few available data to develop a low-cost pre-training procedure; then, we fine-tune the architecture in order to further improve the prediction reliability. Accuracy and efficiency of the resulting pre-trained physics-informed DL-ROMs (PTPI-DL-ROMs) are then assessed on a set of test cases ranging from non-affinely parametrized advection-diffusion-reaction equations, to nonlinear problems like the Navier-Stokes equations for fluid flows.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Error estimates for POD-DL-ROMs: a deep learning framework for reduced order modeling of nonlinear parametrized PDEs enhanced by proper orthogonal decomposition
Authors:
Simone Brivio,
Stefania Fresca,
Nicola Rares Franco,
Andrea Manzoni
Abstract:
POD-DL-ROMs have been recently proposed as an extremely versatile strategy to build accurate and reliable reduced order models (ROMs) for nonlinear parametrized partial differential equations, combining (i) a preliminary dimensionality reduction obtained through proper orthogonal decomposition (POD) for the sake of efficiency, (ii) an autoencoder architecture that further reduces the dimensionalit…
▽ More
POD-DL-ROMs have been recently proposed as an extremely versatile strategy to build accurate and reliable reduced order models (ROMs) for nonlinear parametrized partial differential equations, combining (i) a preliminary dimensionality reduction obtained through proper orthogonal decomposition (POD) for the sake of efficiency, (ii) an autoencoder architecture that further reduces the dimensionality of the POD space to a handful of latent coordinates, and (iii) a dense neural network to learn the map that describes the dynamics of the latent coordinates as a function of the input parameters and the time variable. Within this work, we aim at justifying the outstanding approximation capabilities of POD-DL-ROMs by means of a thorough error analysis, showing how the sampling required to generate training data, the dimension of the POD space, and the complexity of the underlying neural networks, impact on the solution accuracy. This decomposition, combined with the constructive nature of the proofs, allows us to formulate practical criteria to control the relative error in the approximation of the solution field of interest, and derive general error estimates. Furthermore, we show that, from a theoretical point of view, POD-DL-ROMs outperform several deep learning-based techniques in terms of model complexity. Finally, we validate our findings by means of suitable numerical experiments, ranging from parameter-dependent operators analytically defined to several parametrized PDEs.
△ Less
Submitted 8 May, 2023;
originally announced May 2023.
-
Higher-rank Brill-Noether loci on nodal reducible curves
Authors:
Sonia Brivio,
Filippo F. Favale
Abstract:
In this paper we deal with Brill-Noether theory for higher-rank sheaves on a polarized nodal reducible curve $(C,\underline{w})$ following the ideas of [arXiv:alg-geom/9511003v1]. We study the Brill-Noether loci of $\underline{w}$-stable depth one sheaves on $C$ having rank $r$ on all irreducible components and having small slope. In analogy with what happens in the smooth case, we prove that thes…
▽ More
In this paper we deal with Brill-Noether theory for higher-rank sheaves on a polarized nodal reducible curve $(C,\underline{w})$ following the ideas of [arXiv:alg-geom/9511003v1]. We study the Brill-Noether loci of $\underline{w}$-stable depth one sheaves on $C$ having rank $r$ on all irreducible components and having small slope. In analogy with what happens in the smooth case, we prove that these loci are closely related to BGN extensions. Moreover, we produce irreducible components of the expected dimension for these Brill-Noether loci.
△ Less
Submitted 27 April, 2022;
originally announced April 2022.
-
Coherent systems and BGN extensions on nodal reducible curves
Authors:
Sonia Brivio,
Filippo F. Favale
Abstract:
Let $(C,\underline{w})$ be a polarized nodal reducible curve. In this paper we consider coherent systems of type $(r,d,k)$ on $C$ with $k < r$. We prove that the moduli spaces of $(\underline{w},α)$-stable coherent systems stabilize for large $α$ and we generalize several results known for the irreducible case when we chose a good polarization. Then, we study in details the components of moduli sp…
▽ More
Let $(C,\underline{w})$ be a polarized nodal reducible curve. In this paper we consider coherent systems of type $(r,d,k)$ on $C$ with $k < r$. We prove that the moduli spaces of $(\underline{w},α)$-stable coherent systems stabilize for large $α$ and we generalize several results known for the irreducible case when we chose a good polarization. Then, we study in details the components of moduli spaces containing coherent systems arising from locally free sheaves.
△ Less
Submitted 14 April, 2021;
originally announced April 2021.
-
Nodal curves and polarizations with good properties
Authors:
S. Brivio,
F. F. Favale
Abstract:
In this paper we deal with polarizations on a nodal curve $C$ with smooth components. Our aim is to study and characterize a class of polarizations, which we call "good", for which depth one sheaves on $C$ reflect some properties that hold for vector bundles on smooth curves. We will concentrate, in particular, on the relation between the $\underline{w}$-stability of $\mathcal{O}_C$ and the goodne…
▽ More
In this paper we deal with polarizations on a nodal curve $C$ with smooth components. Our aim is to study and characterize a class of polarizations, which we call "good", for which depth one sheaves on $C$ reflect some properties that hold for vector bundles on smooth curves. We will concentrate, in particular, on the relation between the $\underline{w}$-stability of $\mathcal{O}_C$ and the goodness of $\underline{w}$. We prove that these two concepts agree when $C$ is of compact type and we conjecture that the same should hold for all nodal curves.
△ Less
Submitted 23 July, 2021; v1 submitted 3 August, 2020;
originally announced August 2020.
-
Coherent systems on curves of compact type
Authors:
Sonia Brivio,
Filippo F. Favale
Abstract:
Let $C$ be a polarized nodal curve of compact type. In this paper we study coherent systems $(E,V)$ on $C$ given by a depth one sheaf $E$ having rank $r$ on each irreducible component of $C$ and a subspace $V \subset H^0(E)$ of dimension $k$. Moduli spaces of stable coherent systems have been introduced by King and Newstead and depend on a real parameter $α$. We show that when $k \geq r$, these mo…
▽ More
Let $C$ be a polarized nodal curve of compact type. In this paper we study coherent systems $(E,V)$ on $C$ given by a depth one sheaf $E$ having rank $r$ on each irreducible component of $C$ and a subspace $V \subset H^0(E)$ of dimension $k$. Moduli spaces of stable coherent systems have been introduced by King and Newstead and depend on a real parameter $α$. We show that when $k \geq r$, these moduli spaces coincide for $α$ big enough. Then we deal with the case $k=r+1$: when the degrees of the restrictions of $E$ are big enough we are able to describe an irreducible component of this moduli space by using the dual span construction.
△ Less
Submitted 6 April, 2020;
originally announced April 2020.
-
On kernel bundles over reducible curves with a node
Authors:
S. Brivio,
F. F. Favale
Abstract:
Given a vector bundle $E$ on a complex reduced curve $C$ and a subspace $V$ of $H^0(E)$ which generates $E$, one can consider the kernel of the evaluation map $ev_V:V\otimes \mathcal{O}_C\to E$, i.e. the {\it kernel bundle } $M_{E,V}$ associated to the pair $(E,V)$. Motivated by a well known conjecture of Butler about the semistability of $M_{E,V}$ and by the results obtained by several authors wh…
▽ More
Given a vector bundle $E$ on a complex reduced curve $C$ and a subspace $V$ of $H^0(E)$ which generates $E$, one can consider the kernel of the evaluation map $ev_V:V\otimes \mathcal{O}_C\to E$, i.e. the {\it kernel bundle } $M_{E,V}$ associated to the pair $(E,V)$. Motivated by a well known conjecture of Butler about the semistability of $M_{E,V}$ and by the results obtained by several authors when the ambient space is a smooth curve, we investigate the case of a curve with one node. Unexpectedly, we are able to prove results which goes in the opposite direction with respect to what is known in the smooth case. For example, $M_{E,H^0(E)}$ is actually quite never $w$-semistable. Conditions which gives the $w$-semistability of $M_{E,V}$ when $V\subset H^0(E)$ or when $E$ is a line bundle are then given.
△ Less
Submitted 14 April, 2020; v1 submitted 22 July, 2019;
originally announced July 2019.
-
On vector bundles over reducible curves with a node
Authors:
Sonia Brivio,
Filippo F. Favale
Abstract:
Let $C$ be a curve with two smooth components and a single node. Let $\mathcal{U}_C(r,w,χ)$ be the moduli space of $w$-semistable classes of depth one sheaves on $C$ having rank $r$ on both components and Euler characteristic $χ$. In this paper, under suitable assumptions, we produce a projective bundle over the product of the moduli spaces of semistable vector bundles of rank $r$ on each componen…
▽ More
Let $C$ be a curve with two smooth components and a single node. Let $\mathcal{U}_C(r,w,χ)$ be the moduli space of $w$-semistable classes of depth one sheaves on $C$ having rank $r$ on both components and Euler characteristic $χ$. In this paper, under suitable assumptions, we produce a projective bundle over the product of the moduli spaces of semistable vector bundles of rank $r$ on each components and we show that it is birational to an irreducible component of $\mathcal{U}_C(r,w,χ)$. Then we prove the rationality of the closed subset containing vector bundles with given fixed determinant.
△ Less
Submitted 28 July, 2020; v1 submitted 25 March, 2019;
originally announced March 2019.
-
Genus 2 curves and generalized theta divisors
Authors:
Sonia Brivio,
Filippo F. Favale
Abstract:
In this paper we investigate generalized theta divisors $Θ_r$ in the moduli spaces $\mathcal{U}_C(r,r)$ of semistable vector bundles on a curve $C$ of genus $2$. We provide a desingularization $Φ$ of $Θ_r$ in terms of a projective bundle $π:\mathbb{P}(\mathcal{V})\to\mathcal{U}_C(r-1,r)$ which parametrizes extensions of stable vector bundles on the base by $\mathcal{O}_C$. Then, we study the compo…
▽ More
In this paper we investigate generalized theta divisors $Θ_r$ in the moduli spaces $\mathcal{U}_C(r,r)$ of semistable vector bundles on a curve $C$ of genus $2$. We provide a desingularization $Φ$ of $Θ_r$ in terms of a projective bundle $π:\mathbb{P}(\mathcal{V})\to\mathcal{U}_C(r-1,r)$ which parametrizes extensions of stable vector bundles on the base by $\mathcal{O}_C$. Then, we study the composition of $Φ$ with the well known theta map $θ$. We prove that, when it is restricted to the general fiber of $π$, we obtain a linear embedding.
△ Less
Submitted 14 May, 2019; v1 submitted 24 July, 2018;
originally announced July 2018.
-
Coherent systems and modular subvarieties of SU_C(r)
Authors:
Michele Bolognesi,
Sonia Brivio
Abstract:
Let $C$ be an algebraic smooth complex curve of genus $g>1$. The object of this paper is the study of the birational structure of certain moduli spaces of vector bundles and of coherent systems on $C$ and the comparison of different type of notions of stability arising in moduli theory. Notably we show that in certain cases these moduli spaces are birationally equivalent to fibrations over simple…
▽ More
Let $C$ be an algebraic smooth complex curve of genus $g>1$. The object of this paper is the study of the birational structure of certain moduli spaces of vector bundles and of coherent systems on $C$ and the comparison of different type of notions of stability arising in moduli theory. Notably we show that in certain cases these moduli spaces are birationally equivalent to fibrations over simple projective varieties, whose fibers are GIT quotients $(\PP^{r-1})^{rg}//PGL(r)$, where $r$ is the rank of the considered vector bundles. This allows us to compare different definitions of (semi-)stability (slope stability, $α$-stability, GIT stability) for vector bundles, coherent systems and point sets, and derive relations between them. In certain cases of vector bundles of low rank when $C$ has small genus, our construction produces families of classical modular varieties contained in the Coble hypersurfaces.
△ Less
Submitted 26 September, 2011;
originally announced September 2011.
-
Modular subvarieties and birational geometry of $SU_C(r)$
Authors:
Michele Bolognesi,
Sonia Brivio
Abstract:
Let $C$ be an algebraic smooth complex genus $g>1$ curve. The object of this paper is the study of the birational structure of the coarse moduli space $U_C(r,0)$ of semi-stable rank r vector bundles on $C$ with degree 0 determinant and of its moduli subspace $SU_C(r)$ given by the vector bundles with trivial determinant. Notably we prove that $U_C(r,0)$ (resp. $SU_C(r)$) is birational to a fibra…
▽ More
Let $C$ be an algebraic smooth complex genus $g>1$ curve. The object of this paper is the study of the birational structure of the coarse moduli space $U_C(r,0)$ of semi-stable rank r vector bundles on $C$ with degree 0 determinant and of its moduli subspace $SU_C(r)$ given by the vector bundles with trivial determinant. Notably we prove that $U_C(r,0)$ (resp. $SU_C(r)$) is birational to a fibration over the symmetric product $C^(rg)$ (resp. over $P^{(r-1)g}$) whose fibres are GIT quotients $(P^{r-1})^{rg}//PGL(r)$. In the cases of low rank and genus our construction produces families of classical modular varieties contained in the Coble hypersurfaces.
△ Less
Submitted 10 March, 2010; v1 submitted 23 February, 2010;
originally announced February 2010.
-
Plucker forms and the theta map
Authors:
Sonia Brivio,
Alessandro Verra
Abstract:
In this paper we introduce the elementary notion of Plücker form of a pair $(E,S)$, where $E$ is a vector bundle of rank $r$ on a smooth, irreducible, complex projective variety $X$ and $S \subset H^0(E)$ is a subspace of dimension $rm$. We apply this notion to the study of theta map $θ_r$ on the moduli space $SU_X(r,0)$ of semistable vector bundles of rank $r$ and trivial determinant on a curve…
▽ More
In this paper we introduce the elementary notion of Plücker form of a pair $(E,S)$, where $E$ is a vector bundle of rank $r$ on a smooth, irreducible, complex projective variety $X$ and $S \subset H^0(E)$ is a subspace of dimension $rm$. We apply this notion to the study of theta map $θ_r$ on the moduli space $SU_X(r,0)$ of semistable vector bundles of rank $r$ and trivial determinant on a curve $X$ of genus $g$. We prove that $θ_r$ is generically injective if $X$ is general and $g >> r$.
△ Less
Submitted 6 February, 2011; v1 submitted 29 October, 2009;
originally announced October 2009.
-
The Brill-Noether Curve of a Stable Vector Bundle on a Genus Two Curve
Authors:
Sonia Brivio,
Alessandro Verra
Abstract:
Let U(r) be the moduli space of rank r vector bundles with trivial determinant on a smooth curve of genus 2. The map theta_r: U(r) -> |r Theta|, which associates to a general bundle its theta divisor, is generically finite. In this paper we give a geometric interpretation of the generic fibre of theta_r.
Let U(r) be the moduli space of rank r vector bundles with trivial determinant on a smooth curve of genus 2. The map theta_r: U(r) -> |r Theta|, which associates to a general bundle its theta divisor, is generically finite. In this paper we give a geometric interpretation of the generic fibre of theta_r.
△ Less
Submitted 12 October, 2005;
originally announced October 2005.
-
Alternating groups and rational functions on surfaces
Authors:
Sonia Brivio,
Gian Pietro Pirola
Abstract:
Let X be a smooth complex projective surface. We prove that for any sufficiently big m there exists a rational dominant map f from X into a complex rational ruled surface Y, such that f is generically finite of degree m and has monodromy the alternating group Am.
Let X be a smooth complex projective surface. We prove that for any sufficiently big m there exists a rational dominant map f from X into a complex rational ruled surface Y, such that f is generically finite of degree m and has monodromy the alternating group Am.
△ Less
Submitted 19 April, 2005; v1 submitted 11 January, 2004;
originally announced January 2004.
-
On the theta divisor of SU(2,1)
Authors:
Sonia Brivio,
Alessandro Verra
Abstract:
Let SU(2,1) be the moduli space of stable rank two vector bundles having fixed determinant of odd degree over a compact Riemann surface C. In this paper it is shown that the Theta divisor of SU(2,1) is very ample for every C. The proof is related to the study of the base locus of the pencil of divisors 2-theta in the Jacobian of C which is naturally associated to a point of SU(2,1).
Let SU(2,1) be the moduli space of stable rank two vector bundles having fixed determinant of odd degree over a compact Riemann surface C. In this paper it is shown that the Theta divisor of SU(2,1) is very ample for every C. The proof is related to the study of the base locus of the pencil of divisors 2-theta in the Jacobian of C which is naturally associated to a point of SU(2,1).
△ Less
Submitted 13 November, 1997;
originally announced November 1997.
-
The Theta Divisor of $SU_C(2,2d)^s$ is very Ample if $C$ is not Hyperelliptic
Authors:
S. Brivio,
A. Verra
Abstract:
Let $X$ be the moduli space of semistable rank 2 vector bundles over a smooth curve C of genus $g \ge 2$ and $θ: X \to PH^0(L)^*$ be the map associated to the generalized theta divisor L on X. We prove that for C not hyperelliptic, the map $θ$ is injective and the differential of $θ$ is injective at smooth points of X.
Let $X$ be the moduli space of semistable rank 2 vector bundles over a smooth curve C of genus $g \ge 2$ and $θ: X \to PH^0(L)^*$ be the map associated to the generalized theta divisor L on X. We prove that for C not hyperelliptic, the map $θ$ is injective and the differential of $θ$ is injective at smooth points of X.
△ Less
Submitted 21 October, 1994;
originally announced October 1994.