Search | arXiv e-print repository

Weighted Proper Orthogonal Decomposition for High-Dimensional Optimization

Authors: Sebastiaan P. C. van Schie, Boris Kramer, John T. Hwang

Abstract: While proper orthogonal decomposition (POD) is widely used for model reduction, its standard form does not take into account any parametric model structure. Extensions to POD have been proposed to address this, but these either require large amounts of solution data, lack online adaptivity, or have limited approximation accuracy. We circumvent these limitations by instead assigning weights to the… ▽ More While proper orthogonal decomposition (POD) is widely used for model reduction, its standard form does not take into account any parametric model structure. Extensions to POD have been proposed to address this, but these either require large amounts of solution data, lack online adaptivity, or have limited approximation accuracy. We circumvent these limitations by instead assigning weights to the snapshot matrix columns, and updating these whenever the model is evaluated at a new point in the parameter space. We derive an a posteriori error bound that depends on these snapshot weights, show how these weights can be chosen to tighten the error bound, and present an algorithm to compute the corresponding reduced basis efficiently. We show how this weighted POD approach can be used to naturally generalize the calculation of reduced basis derivatives to situations with multidimensional parameter spaces and snapshots at multiple locations in the parameter space. Lastly, we cover how these approaches can be implemented within an optimization algorithm, without the need for an offline training phase. The proposed weighted POD methods with and without reduced basis derivatives are applied to a gradient-based shell thickness optimization problem with 105 design parameters and a time-dependent partial differential equation. The numerical solutions obtained for this problem attain errors that are several orders of magnitude smaller when using weighted POD than those computed with regular POD and Grassmann manifold interpolation, while having comparable wall times per query and requiring fewer high-dimensional model snapshots to reach an optimal solution. △ Less

Submitted 12 August, 2025; originally announced August 2025.

Comments: 26 pages, 7 figures

MSC Class: 15A18; 49M41; 65F55; 65M15; 65M60

arXiv:2507.10884 [pdf, ps, other]

Learning from Imperfect Data: Robust Inference of Dynamic Systems using Simulation-based Generative Model

Authors: Hyunwoo Cho, Hyeontae Jo, Hyung Ju Hwang

Abstract: System inference for nonlinear dynamic models, represented by ordinary differential equations (ODEs), remains a significant challenge in many fields, particularly when the data are noisy, sparse, or partially observable. In this paper, we propose a Simulation-based Generative Model for Imperfect Data (SiGMoID) that enables precise and robust inference for dynamic systems. The proposed approach int… ▽ More System inference for nonlinear dynamic models, represented by ordinary differential equations (ODEs), remains a significant challenge in many fields, particularly when the data are noisy, sparse, or partially observable. In this paper, we propose a Simulation-based Generative Model for Imperfect Data (SiGMoID) that enables precise and robust inference for dynamic systems. The proposed approach integrates two key methods: (1) physics-informed neural networks with hyper-networks that constructs an ODE solver, and (2) Wasserstein generative adversarial networks that estimates ODE parameters by effectively capturing noisy data distributions. We demonstrate that SiGMoID quantifies data noise, estimates system parameters, and infers unobserved system components. Its effectiveness is validated validated through realistic experimental examples, showcasing its broad applicability in various domains, from scientific research to engineered systems, and enabling the discovery of full system dynamics. △ Less

Submitted 14 July, 2025; originally announced July 2025.

MSC Class: 68T07; 68T05; 70G60

arXiv:2506.20085 [pdf, ps, other]

Deformations of the tangent bundle of a projective hypersurface

Authors: Insong Choe, Kiryong Chung, Jun-Muk Hwang

Abstract: For a nonsingular hypersurface $X \subset \mathbb{P}^n, n \geq 4,$ of degree $d \geq 2$, we show that the space $H^1(X, \End(T_X))$ of infinitesimal deformations of the tangent bundle $T_X$ has dimension ${n+d-1 \choose d} (d-1)$ and all infinitesimal deformations are unobstructed even though $H^2(X, \End(T_X))$ can be nonzero. Furthermore, we prove that the irreducible component of the moduli spa… ▽ More For a nonsingular hypersurface $X \subset \mathbb{P}^n, n \geq 4,$ of degree $d \geq 2$, we show that the space $H^1(X, \End(T_X))$ of infinitesimal deformations of the tangent bundle $T_X$ has dimension ${n+d-1 \choose d} (d-1)$ and all infinitesimal deformations are unobstructed even though $H^2(X, \End(T_X))$ can be nonzero. Furthermore, we prove that the irreducible component of the moduli space of stable bundles containing the tangent bundle is a rational variety, by constructing an explicit birational model. △ Less

Submitted 24 June, 2025; originally announced June 2025.

Comments: 10 pages

MSC Class: 14J60; 14J70; 14D20

arXiv:2504.20408 [pdf, other]

FourierSpecNet: Neural Collision Operator Approximation Inspired by the Fourier Spectral Method for Solving the Boltzmann Equation

Authors: Jae Yong Lee, Gwang Jae Jung, Byung Chan Lim, Hyung Ju Hwang

Abstract: The Boltzmann equation, a fundamental model in kinetic theory, describes the evolution of particle distribution functions through a nonlinear, high-dimensional collision operator. However, its numerical solution remains computationally demanding, particularly for inelastic collisions and high-dimensional velocity domains. In this work, we propose the Fourier Neural Spectral Network (FourierSpecNet… ▽ More The Boltzmann equation, a fundamental model in kinetic theory, describes the evolution of particle distribution functions through a nonlinear, high-dimensional collision operator. However, its numerical solution remains computationally demanding, particularly for inelastic collisions and high-dimensional velocity domains. In this work, we propose the Fourier Neural Spectral Network (FourierSpecNet), a hybrid framework that integrates the Fourier spectral method with deep learning to approximate the collision operator in Fourier space efficiently. FourierSpecNet achieves resolution-invariant learning and supports zero-shot super-resolution, enabling accurate predictions at unseen resolutions without retraining. Beyond empirical validation, we establish a consistency result showing that the trained operator converges to the spectral solution as the discretization is refined. We evaluate our method on several benchmark cases, including Maxwellian and hard-sphere molecular models, as well as inelastic collision scenarios. The results demonstrate that FourierSpecNet offers competitive accuracy while significantly reducing computational cost compared to traditional spectral solvers. Our approach provides a robust and scalable alternative for solving the Boltzmann equation across both elastic and inelastic regimes. △ Less

Submitted 29 April, 2025; originally announced April 2025.

Comments: 27 pages, 11 figures

MSC Class: 68T20; 35Q20; 35B40; 82C40

arXiv:2501.08633 [pdf, ps, other]

Symmetrizer group of a projective hypersurface

Authors: Jun-Muk Hwang

Abstract: To each projective hypersurface which is not a cone, we associate an abelian linear algebraic group called the symmetrizer group of the corresponding symmetric form. This group describes the set of homogeneous polynomials with the same Jacobian ideal and gives a conceptual explanation of results by Ueda--Yoshinaga and Wang. In particular, the diagonalizable part of the symmetrizer group detects Se… ▽ More To each projective hypersurface which is not a cone, we associate an abelian linear algebraic group called the symmetrizer group of the corresponding symmetric form. This group describes the set of homogeneous polynomials with the same Jacobian ideal and gives a conceptual explanation of results by Ueda--Yoshinaga and Wang. In particular, the diagonalizable part of the symmetrizer group detects Sebastiani-Thom property of the hypersurface and its unipotent part is related to the singularity of the hypersurface. △ Less

Submitted 15 January, 2025; originally announced January 2025.

Comments: to appear in J. Math. Soc. Japan

MSC Class: 14J70; 14J17

arXiv:2412.19517 [pdf, other]

Estimation of System Parameters Including Repeated Cross-Sectional Data through Emulator-Informed Deep Generative Model

Authors: Hyunwoo Cho, Sung Woong Cho, Hyeontae Jo, Hyung Ju Hwang

Abstract: Differential equations (DEs) are crucial for modeling the evolution of natural or engineered systems. Traditionally, the parameters in DEs are adjusted to fit data from system observations. However, in fields such as politics, economics, and biology, available data are often independently collected at distinct time points from different subjects (i.e., repeated cross-sectional (RCS) data). Convent… ▽ More Differential equations (DEs) are crucial for modeling the evolution of natural or engineered systems. Traditionally, the parameters in DEs are adjusted to fit data from system observations. However, in fields such as politics, economics, and biology, available data are often independently collected at distinct time points from different subjects (i.e., repeated cross-sectional (RCS) data). Conventional optimization techniques struggle to accurately estimate DE parameters when RCS data exhibit various heterogeneities, leading to a significant loss of information. To address this issue, we propose a new estimation method called the emulator-informed deep-generative model (EIDGM), designed to handle RCS data. Specifically, EIDGM integrates a physics-informed neural network-based emulator that immediately generates DE solutions and a Wasserstein generative adversarial network-based parameter generator that can effectively mimic the RCS data. We evaluated EIDGM on exponential growth, logistic population models, and the Lorenz system, demonstrating its superior ability to accurately capture parameter distributions. Additionally, we applied EIDGM to an experimental dataset of Amyloid beta 40 and beta 42, successfully capturing diverse parameter distribution shapes. This shows that EIDGM can be applied to model a wide range of systems and extended to uncover the operating principles of systems based on limited data. △ Less

Submitted 27 December, 2024; originally announced December 2024.

MSC Class: 62F30; 65Z05; 68T09 ACM Class: G.1.7; I.2.m; J.2

arXiv:2410.12942 [pdf, other]

modOpt: A modular development environment and library for optimization algorithms

Authors: Anugrah Jo Joshy, John T. Hwang

Abstract: Recent advances in computing hardware and modeling software have given rise to new applications for numerical optimization. These new applications occasionally uncover bottlenecks in existing optimization algorithms and necessitate further specialization of the algorithms. However, such specialization requires expert knowledge of the underlying mathematical theory and the software implementation o… ▽ More Recent advances in computing hardware and modeling software have given rise to new applications for numerical optimization. These new applications occasionally uncover bottlenecks in existing optimization algorithms and necessitate further specialization of the algorithms. However, such specialization requires expert knowledge of the underlying mathematical theory and the software implementation of existing algorithms. To address this challenge, we present modOpt, an open-source software framework that facilitates the construction of optimization algorithms from modules. The modular environment provided by modOpt enables developers to tailor an existing algorithm for a new application by only altering the relevant modules. modOpt is designed as a platform to support students and beginner developers in quickly learning and developing their own algorithms. With that aim, the entirety of the framework is written in Python, and it is well-documented, well-tested, and hosted open-source on GitHub. Several additional features are embedded into the framework to assist both beginner and advanced developers. In addition to providing stock modules, the framework also includes fully transparent implementations of pedagogical optimization algorithms in Python. To facilitate testing and benchmarking of new algorithms, the framework features built-in visualization and recording capabilities, interfaces to modeling frameworks such as OpenMDAO and CSDL, interfaces to general-purpose optimization algorithms such as SNOPT and SLSQP, an interface to the CUTEst test problem set, etc. In this paper, we present the underlying software architecture of modOpt, review its various features, discuss several educational and performance-oriented algorithms within modOpt, and present numerical studies illustrating its unique benefits. △ Less

Submitted 16 October, 2024; originally announced October 2024.

Comments: 37 pages with 13 figures. For associated code, see https://github.com/LSDOlab/modopt

ACM Class: D.2.2; D.2.13; G.1.6; G.4; J.2

arXiv:2410.02225 [pdf, other]

Open-source shape optimization for isogeometric shells using FEniCS and OpenMDAO

Authors: Han Zhao, John T. Hwang, Jiun-Shyan Chen

Abstract: We present an open-source Python framework for the shape optimization of complex shell structures using isogeometric analysis (IGA). IGA seamlessly integrates computer-aided design (CAD) and analysis models by employing non-uniform rational B-splines (NURBS) as basis functions, enabling the natural implementation of the Kirchhoff--Love shell model due to their higher order of continuity. We levera… ▽ More We present an open-source Python framework for the shape optimization of complex shell structures using isogeometric analysis (IGA). IGA seamlessly integrates computer-aided design (CAD) and analysis models by employing non-uniform rational B-splines (NURBS) as basis functions, enabling the natural implementation of the Kirchhoff--Love shell model due to their higher order of continuity. We leverage the recently developed FEniCS-based analysis framework, PENGoLINS, for the direct structural analysis of shell structures consisting of a collection of NURBS patches through a penalty-based formulation. This contribution introduces the open-source implementation of gradient-based shape optimization for isogeometric Kirchhoff--Love shells with a modular architecture. Complex shell structures with non-matching intersections are handled using a free-form deformation (FFD) approach and a moving intersections formulation. The symbolic differentiation and code generation capabilities in FEniCS are utilized to compute the analytical derivatives. By integrating FEniCS with OpenMDAO, we build modular components that facilitate gradient-based shape optimization of shell structures. The modular architecture in this work supports future extensions and integration with other disciplines and solvers, making it highly customizable and suitable for a wide range of applications. We validate the design-analysis-optimization workflow through several benchmark problems and demonstrate its application to aircraft wing design optimization. The framework is implemented in a Python library named GOLDFISH (Gradient-based Optimization and Large-scale Design Framework for Isogeometric SHells) and the source code will be maintained at https://github.com/hanzhao2020/GOLDFISH. △ Less

Submitted 4 February, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

Comments: 39 pages, 14 figures

arXiv:2408.15537 [pdf, ps, other]

Generalized Tanaka prolongation and convergence of formal equivalence between embeddings

Authors: Jaehyun Hong, Jun-Muk Hwang

Abstract: The works of Commichau--Grauert and Hirschowitz showed that a formal equivalence between embeddings of a compact complex manifold is convergent, if the embeddings have sufficiently positive normal bundles in a suitable sense. We show that the convergence still holds under the weaker assumption of semi-positive normal bundles if some geometric conditions are satisfied. Our result can be applied to… ▽ More The works of Commichau--Grauert and Hirschowitz showed that a formal equivalence between embeddings of a compact complex manifold is convergent, if the embeddings have sufficiently positive normal bundles in a suitable sense. We show that the convergence still holds under the weaker assumption of semi-positive normal bundles if some geometric conditions are satisfied. Our result can be applied to many examples of general minimal rational curves, including general lines on a smooth hypersurface of degree less than $n$ in the $(n+1)$-dimensional projective space. As a key ingredient of our arguments, we formulate and prove a generalized version of Tanaka's prolongation procedure for geometric structures subordinate to vector distributions, a result of independent interest. When applied to the universal family of the deformations of the compact submanifolds satisfying our geometric conditions, the generalized Tanaka prolongation gives a natural absolute parallelism on a suitable fiber space. A formal equivalence of embeddings must preserve these absolute parallelisms, which implies its convergence. △ Less

Submitted 28 August, 2024; originally announced August 2024.

MSC Class: 32K07; 58A30; 32C22

arXiv:2408.13420 [pdf, other]

PySLSQP: A transparent Python package for the SLSQP optimization algorithm modernized with utilities for visualization and post-processing

Authors: Anugrah Jo Joshy, John T. Hwang

Abstract: PySLSQP is a seamless interface for using the SLSQP algorithm from Python. It wraps the original SLSQP Fortran code sourced from the SciPy repository and provides a host of new features to improve the research utility of the original algorithm. Some of the additional features offered by PySLSQP include auto-generation of unavailable derivatives using finite differences, independent scaling of the… ▽ More PySLSQP is a seamless interface for using the SLSQP algorithm from Python. It wraps the original SLSQP Fortran code sourced from the SciPy repository and provides a host of new features to improve the research utility of the original algorithm. Some of the additional features offered by PySLSQP include auto-generation of unavailable derivatives using finite differences, independent scaling of the problem variables and functions, access to internal optimization data, live-visualization, saving optimization data from each iteration, warm/hot restarting of optimization, and various other utilities for post-processing. △ Less

Submitted 23 August, 2024; originally announced August 2024.

Comments: 9 pages with 2 figures. For associated code, see https://github.com/anugrahjo/PySLSQP

ACM Class: G.1.6; J.2

arXiv:2408.11459 [pdf, ps, other]

Symmetries of $(2,3,5)$-distributions and associated Legendrian cone structures

Authors: Jun-Muk Hwang, Dennis The

Abstract: We exploit a natural correspondence between holomorphic $(2,3,5)$-distributions and nondegenerate lines on holomorphic contact manifolds of dimension $5$ to present a new perspective in the study of symmetries of $(2,3,5)$-distributions. This leads to a number of new results in this classical subject, including an unexpected relation between the multiply-transitive families of models having $7$- a… ▽ More We exploit a natural correspondence between holomorphic $(2,3,5)$-distributions and nondegenerate lines on holomorphic contact manifolds of dimension $5$ to present a new perspective in the study of symmetries of $(2,3,5)$-distributions. This leads to a number of new results in this classical subject, including an unexpected relation between the multiply-transitive families of models having $7$- and $6$-dimensional symmetries, and a one-to-one correspondence between equivalence classes of nontransitive $(2,3,5)$-distributions with $6$-dimensional symmetries and nonhomogeneous nondegenerate Legendrian curves in $\mathbb{P}^3$. An ingredient for establishing the former is an explicit classification of homogeneous nondegenerate Legendrian curves in $\mathbb{P}^3$, which we present. △ Less

Submitted 21 August, 2024; originally announced August 2024.

Comments: 33 pages

MSC Class: Primary: 58A30; 58J70; Secondary: 32L25; 34C41; 53A55

arXiv:2407.16263 [pdf, ps, other]

Characteristic conic connections and torsion-free principal connections

Authors: Jun-Muk Hwang, Qifeng Li

Abstract: We study the relation between torsion tensors of principal connections on G-structures and characteristic conic connections on associated cone structures. We formulate sufficient conditions under which the existence of a characteristic conic connection implies the existence of a torsion-free principal connection. We verify these conditions for adjoint varieties of simple Lie algebras, excluding th… ▽ More We study the relation between torsion tensors of principal connections on G-structures and characteristic conic connections on associated cone structures. We formulate sufficient conditions under which the existence of a characteristic conic connection implies the existence of a torsion-free principal connection. We verify these conditions for adjoint varieties of simple Lie algebras, excluding those of type $\textsf{A}_{\ell \neq 2}$ or $\textsf{C}_{\ell}$. As an application, we give a complete classification of the germs of minimal rational curves whose VMRT at a general point is such an adjoint variety: nontrivial ones come from lines on hyperplane sections of certain Grassmannians or minimal rational curves on wonderful group compactifications. △ Less

Submitted 23 July, 2024; originally announced July 2024.

Comments: To appear in Journal de Mathématiques Pures et Appliquées

arXiv:2407.07438 [pdf, ps, other]

Near-order relation of power means

Authors: Jinmi Hwang, Sejong Kim

Abstract: On the setting of positive definite operators we study the near-order properties of power means such as the quasi-arithmetic mean (Hölder mean) and Rényi power mean. We see the monotonicity of spectral geometric mean and Wasserstein mean on parameters with respect to the near-order and the near-order relationship between the spectral geometric mean and Wasserstein mean. Furthermore, the monotonici… ▽ More On the setting of positive definite operators we study the near-order properties of power means such as the quasi-arithmetic mean (Hölder mean) and Rényi power mean. We see the monotonicity of spectral geometric mean and Wasserstein mean on parameters with respect to the near-order and the near-order relationship between the spectral geometric mean and Wasserstein mean. Furthermore, the monotonicity of quasi-arithmetic mean on parameters and the convergence of Rényi power mean to the log-Euclidean mean with respect to the near-order have been established. △ Less

Submitted 10 July, 2024; originally announced July 2024.

arXiv:2407.00185 [pdf, other]

Shape optimization of non-matching isogeometric shells with moving intersections

Authors: Han Zhao, John T. Hwang, J. S. Chen

Abstract: While shape optimization using isogeometric shells exhibits appealing features by integrating design geometries and analysis models, challenges arise when addressing computer-aided design (CAD) geometries comprised of multiple non-uniform rational B-splines (NURBS) patches, which are common in practice. The intractability stems from surface intersections within these CAD models. In this paper, we… ▽ More While shape optimization using isogeometric shells exhibits appealing features by integrating design geometries and analysis models, challenges arise when addressing computer-aided design (CAD) geometries comprised of multiple non-uniform rational B-splines (NURBS) patches, which are common in practice. The intractability stems from surface intersections within these CAD models. In this paper, we develop an approach for shape optimization of non-matching isogeometric shells incorporating intersection movement. Separately parametrized NURBS surfaces are modeled using Kirchhoff--Love shell theory and coupled using a penalty-based formulation. The optimization scheme allows shell patches to move without preserving relative location with other members during the shape optimization. This flexibility is achieved through an implicit state function, and analytical sensitivities are derived for the relative movement of shell patches. The introduction of differentiable intersections expands the design space and overcomes challenges associated with large mesh distortion, particularly when optimal shapes involve significant movement of patch intersections in physical space. Throughout optimization iterations, all members within the shell structures maintain the NURBS geometry representation, enabling efficient integration of analysis and design models. The optimization approach leverages the multilevel design concept by selecting a refined model for accurate analysis from a coarse design model while maintaining the same geometry. We adopt several example problems to verify the effectiveness of the proposed scheme and demonstrate its applicability to the optimization of the internal stiffeners of an aircraft wing. △ Less

Submitted 28 June, 2024; originally announced July 2024.

Comments: 41 pages, 18 figures

arXiv:2404.19376 [pdf, ps, other]

Characterizing subadjoint varieties among Legendrian varieties

Authors: Jun-Muk Hwang

Abstract: For a symplectic vector space $V$, a projective subvariety $Z \subset {\bf P} V$ is a Legendrian variety if its affine cone $\widehat{Z} \subset V$ is Lagrangian. In addition to the classical examples of subadjoint varieties associated to simple Lie algebras, many examples of nonsingular Legendrian varieties have been discovered which have positive-dimensional automorphism groups. We give a charac… ▽ More For a symplectic vector space $V$, a projective subvariety $Z \subset {\bf P} V$ is a Legendrian variety if its affine cone $\widehat{Z} \subset V$ is Lagrangian. In addition to the classical examples of subadjoint varieties associated to simple Lie algebras, many examples of nonsingular Legendrian varieties have been discovered which have positive-dimensional automorphism groups. We give a characterization of subadjoint varieties among such Legendrian varieties in terms of the isotropy representation. Our proof uses some special features of the projective third fundamental forms of Legendrian varieties and their relation to the lines on the Legendrian varieties. △ Less

Submitted 30 April, 2024; originally announced April 2024.

Comments: to appear in Ann. Inst. Fourier (Grenoble)

MSC Class: 14M15; 53A20; 53D10

arXiv:2404.15570 [pdf, other]

Air-taxi trajectory optimization with aerodynamic and motor models

Authors: Nicholas C. Orndorff, John T. Hwang

Abstract: To fulfill the vision for large-scale urban air mobility, air-taxi concepts must be carefully designed and optimized for their intended mission. Proposed air-taxi missions contain dynamic segments that are dominated by nonlinear dynamics. One such segment is the transition to and from hover and cruise that occurs at the start and end of the mission. Because this transition involves low-altitude an… ▽ More To fulfill the vision for large-scale urban air mobility, air-taxi concepts must be carefully designed and optimized for their intended mission. Proposed air-taxi missions contain dynamic segments that are dominated by nonlinear dynamics. One such segment is the transition to and from hover and cruise that occurs at the start and end of the mission. Because this transition involves low-altitude and high-power flight, analyzing transition trajectories is critical for safe and economical urban air mobility. Optimization of the transition maneuver requires an optimal control approach that characterizes the trajectories of the system states through time. In this paper we solve this optimal control problem for air-taxi transition within a large-scale design-optimization framework. This framework allows us to include five physics-based models that describe flight dynamics, rotor aerodynamics, wing aerodynamics, motor performance, and acoustics with which we create a low-fidelity model of NASA's Lift-plus-Cruise air-taxi concept. We use this optimization problem formulation to compute transition trajectories that minimize time or minimize energy. Our results show that the Lift-plus-Cruise aircraft completes a minimum-energy transition in 80s with an energy expenditure of 13.3MJ and a minimum-time transition in 28s with an energy expenditure of 16.4MJ. We find that these trajectories contain large pitch angles and high sound pressure levels which are both undesirable for practical urban air mobility. Consequently, we explore trajectories that include pitch angle and acoustic constraints, and find that minimum time trajectories are significantly more affected by these constraints than minimum energy trajectories. △ Less

Submitted 5 April, 2025; v1 submitted 23 April, 2024; originally announced April 2024.

arXiv:2404.14873 [pdf, ps, other]

Estimating the Distribution of Parameters in Differential Equations with Repeated Cross-Sectional Data

Authors: Hyeontae Jo, Sung Woong Cho, Hyung Ju Hwang

Abstract: Differential equations are pivotal in modeling and understanding the dynamics of various systems, offering insights into their future states through parameter estimation fitted to time series data. In fields such as economy, politics, and biology, the observation data points in the time series are often independently obtained (i.e., Repeated Cross-Sectional (RCS) data). With RCS data, we found tha… ▽ More Differential equations are pivotal in modeling and understanding the dynamics of various systems, offering insights into their future states through parameter estimation fitted to time series data. In fields such as economy, politics, and biology, the observation data points in the time series are often independently obtained (i.e., Repeated Cross-Sectional (RCS) data). With RCS data, we found that traditional methods for parameter estimation in differential equations, such as using mean values of time trajectories or Gaussian Process-based trajectory generation, have limitations in estimating the shape of parameter distributions, often leading to a significant loss of data information. To address this issue, we introduce a novel method, Estimation of Parameter Distribution (EPD), providing accurate distribution of parameters without loss of data information. EPD operates in three main steps: generating synthetic time trajectories by randomly selecting observed values at each time point, estimating parameters of a differential equation that minimize the discrepancy between these trajectories and the true solution of the equation, and selecting the parameters depending on the scale of discrepancy. We then evaluated the performance of EPD across several models, including exponential growth, logistic population models, and target cell-limited models with delayed virus production, demonstrating its superiority in capturing the shape of parameter distributions. Furthermore, we applied EPD to real-world datasets, capturing various shapes of parameter distributions rather than a normal distribution. These results effectively address the heterogeneity within systems, marking a substantial progression in accurately modeling systems using RCS data. △ Less

Submitted 23 April, 2024; originally announced April 2024.

Comments: 16 pages, 10 figures

MSC Class: 65L08; 65D17; 68U07

arXiv:2404.05941 [pdf, ps, other]

Formal principle with convergence for rational curves of Goursat type

Authors: Jun-Muk Hwang

Abstract: We propose a conjecture that a general member of a bracket-generating family of rational curves in a complex manifold satisfies the formal principle with convergence, namely, any formal equivalence between such curves is convergent. If the normal bundles of the rational curves are positive, the conjecture follows from the results of Commichau-Grauert and Hirschowitz. We prove the conjecture for th… ▽ More We propose a conjecture that a general member of a bracket-generating family of rational curves in a complex manifold satisfies the formal principle with convergence, namely, any formal equivalence between such curves is convergent. If the normal bundles of the rational curves are positive, the conjecture follows from the results of Commichau-Grauert and Hirschowitz. We prove the conjecture for the opposite case when the normal bundles are furthest from positive vector bundles among bracket-generating families, namely, when the families of rational curves are of Goursat type. The proof uses natural ODEs associated to rational curves of Goursat type and corresponding Cartan connections constructed by Doubrov-Komrakov-Morimoto. As an example, we see that a general line on a smooth cubic fourfold satisfies the formal principle with convergence. △ Less

Submitted 8 April, 2024; originally announced April 2024.

Comments: to appear in Algebraic Geometry and Physics

MSC Class: 32K07; 58A30; 32C22

arXiv:2404.05931 [pdf, ps, other]

Lagrangian loci in moduli of abelian surfaces

Authors: Jun-Muk Hwang

Abstract: We show that any smooth surface germ in the moduli of abelian surfaces arises from a Lagrangian fibration of abelian surfaces. By Donagi-Markman's cubic condition, the key issue of the proof is to find a suitable affine structure with a compatible cubic form on the base space of the family. We achieve this by analyzing the properties of cubic forms in two variables and proving the existence of the… ▽ More We show that any smooth surface germ in the moduli of abelian surfaces arises from a Lagrangian fibration of abelian surfaces. By Donagi-Markman's cubic condition, the key issue of the proof is to find a suitable affine structure with a compatible cubic form on the base space of the family. We achieve this by analyzing the properties of cubic forms in two variables and proving the existence of the solution of the resulting partial differential equations by Cauchy-Kowalewski Theorem. Modifying the argument, we show also that a smooth curve germ in the moduli of abelian surfaces arises from a Lagrangian fibration if and only if the curve is a null curve with respect to the natural holomorphic conformal structure on the moduli of abelian surfaces. △ Less

Submitted 8 April, 2024; originally announced April 2024.

MSC Class: 14K20; 32G20

Journal ref: Eur. J. Math. 8 (2022) no. 3

arXiv:2402.08187 [pdf, other]

Learning time-dependent PDE via graph neural networks and deep operator network for robust accuracy on irregular grids

Authors: Sung Woong Cho, Jae Yong Lee, Hyung Ju Hwang

Abstract: Scientific computing using deep learning has seen significant advancements in recent years. There has been growing interest in models that learn the operator from the parameters of a partial differential equation (PDE) to the corresponding solutions. Deep Operator Network (DeepONet) and Fourier Neural operator, among other models, have been designed with structures suitable for handling functions… ▽ More Scientific computing using deep learning has seen significant advancements in recent years. There has been growing interest in models that learn the operator from the parameters of a partial differential equation (PDE) to the corresponding solutions. Deep Operator Network (DeepONet) and Fourier Neural operator, among other models, have been designed with structures suitable for handling functions as inputs and outputs, enabling real-time predictions as surrogate models for solution operators. There has also been significant progress in the research on surrogate models based on graph neural networks (GNNs), specifically targeting the dynamics in time-dependent PDEs. In this paper, we propose GraphDeepONet, an autoregressive model based on GNNs, to effectively adapt DeepONet, which is well-known for successful operator learning. GraphDeepONet exhibits robust accuracy in predicting solutions compared to existing GNN-based PDE solver models. It maintains consistent performance even on irregular grids, leveraging the advantages inherited from DeepONet and enabling predictions on arbitrary grids. Additionally, unlike traditional DeepONet and its variants, GraphDeepONet enables time extrapolation for time-dependent PDE solutions. We also provide theoretical analysis of the universal approximation capability of GraphDeepONet in approximating continuous operators across arbitrary time intervals. △ Less

Submitted 12 February, 2024; originally announced February 2024.

Comments: 25 pages, 11 figures

MSC Class: 65D17; 68U07

arXiv:2312.15949 [pdf, other]

HyperDeepONet: learning operator with complex target function space using the limited resources via hypernetwork

Authors: Jae Yong Lee, Sung Woong Cho, Hyung Ju Hwang

Abstract: Fast and accurate predictions for complex physical dynamics are a significant challenge across various applications. Real-time prediction on resource-constrained hardware is even more crucial in real-world problems. The deep operator network (DeepONet) has recently been proposed as a framework for learning nonlinear mappings between function spaces. However, the DeepONet requires many parameters a… ▽ More Fast and accurate predictions for complex physical dynamics are a significant challenge across various applications. Real-time prediction on resource-constrained hardware is even more crucial in real-world problems. The deep operator network (DeepONet) has recently been proposed as a framework for learning nonlinear mappings between function spaces. However, the DeepONet requires many parameters and has a high computational cost when learning operators, particularly those with complex (discontinuous or non-smooth) target functions. This study proposes HyperDeepONet, which uses the expressive power of the hypernetwork to enable the learning of a complex operator with a smaller set of parameters. The DeepONet and its variant models can be thought of as a method of injecting the input function information into the target function. From this perspective, these models can be viewed as a particular case of HyperDeepONet. We analyze the complexity of DeepONet and conclude that HyperDeepONet needs relatively lower complexity to obtain the desired accuracy for operator learning. HyperDeepONet successfully learned various operators with fewer computational resources compared to other benchmarks. △ Less

Submitted 26 December, 2023; originally announced December 2023.

Comments: 26 pages, 13 figures. Published as a conference paper at Eleventh International Conference on Learning Representations (ICLR 2023)

MSC Class: 65D17; 68U07

arXiv:2308.03781 [pdf, other]

doi 10.1007/s00366-024-01947-7

Automated shape and thickness optimization for non-matching isogeometric shells using free-form deformation

Authors: Han Zhao, David Kamensky, John T. Hwang, Jiun-Shyan Chen

Abstract: Isogeometric analysis (IGA) has emerged as a promising approach in the field of structural optimization, benefiting from the seamless integration between the computer-aided design (CAD) geometry and the analysis model by employing non-uniform rational B-splines (NURBS) as basis functions. However, structural optimization for real-world CAD geometries consisting of multiple non-matching NURBS patch… ▽ More Isogeometric analysis (IGA) has emerged as a promising approach in the field of structural optimization, benefiting from the seamless integration between the computer-aided design (CAD) geometry and the analysis model by employing non-uniform rational B-splines (NURBS) as basis functions. However, structural optimization for real-world CAD geometries consisting of multiple non-matching NURBS patches remains a challenging task. In this work, we propose a unified formulation for shape and thickness optimization of separately-parametrized shell structures by adopting the free-form deformation (FFD) technique, so that continuity with respect to design variables is preserved at patch intersections during optimization. Shell patches are modeled with isogeometric Kirchhoff--Love theory and coupled using a penalty-based method in the analysis. We use Lagrange extraction to link the control points associated with the B-spline FFD block and shell patches, and we perform IGA using the same extraction matrices by taking advantage of existing finite element assembly procedures in the FEniCS partial differential equation (PDE) solution library. Moreover, we enable automated analytical derivative computation by leveraging advanced code generation in FEniCS, thereby facilitating efficient gradient-based optimization algorithms. The framework is validated using a collection of benchmark problems, demonstrating its applications to shape and thickness optimization of aircraft wings with complex shell layouts. △ Less

Submitted 2 August, 2023; originally announced August 2023.

arXiv:2305.13998 [pdf, other]

doi 10.1016/j.advengsoft.2023.103571

SMT 2.0: A Surrogate Modeling Toolbox with a focus on Hierarchical and Mixed Variables Gaussian Processes

Authors: Paul Saves, Remi Lafage, Nathalie Bartoli, Youssef Diouane, Jasper Bussemaker, Thierry Lefebvre, John T. Hwang, Joseph Morlier, Joaquim R. R. A. Martins

Abstract: The Surrogate Modeling Toolbox (SMT) is an open-source Python package that offers a collection of surrogate modeling methods, sampling techniques, and a set of sample problems. This paper presents SMT 2.0, a major new release of SMT that introduces significant upgrades and new features to the toolbox. This release adds the capability to handle mixed-variable surrogate models and hierarchical varia… ▽ More The Surrogate Modeling Toolbox (SMT) is an open-source Python package that offers a collection of surrogate modeling methods, sampling techniques, and a set of sample problems. This paper presents SMT 2.0, a major new release of SMT that introduces significant upgrades and new features to the toolbox. This release adds the capability to handle mixed-variable surrogate models and hierarchical variables. These types of variables are becoming increasingly important in several surrogate modeling applications. SMT 2.0 also improves SMT by extending sampling methods, adding new surrogate models, and computing variance and kernel derivatives for Kriging. This release also includes new functions to handle noisy and use multifidelity data. To the best of our knowledge, SMT 2.0 is the first open-source surrogate library to propose surrogate models for hierarchical and mixed inputs. This open-source software is distributed under the New BSD license. △ Less

Submitted 23 January, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

Comments: 10.1016/j.advengsoft.2023.103571

Journal ref: Advances in Engineering Software Volume 188, February 2024, 103571

arXiv:2304.14889 [pdf, other]

Large-scale multidisciplinary design optimization of the NASA lift-plus-cruise concept using a novel aircraft design framework

Authors: Marius L. Ruh, Darshan Sarojini, Andrew Fletcher, Isaac Asher, John T. Hwang

Abstract: The conceptual design of eVTOL aircraft is a high-dimensional optimization problem that involves large numbers of continuous design parameters. Therefore, eVTOL design method would benefit from numerical optimization algorithms capable of systematically searching these high-dimensional parameters spaces, using comprehensive and multidisciplinary models of the aircraft. By leveraging recent progres… ▽ More The conceptual design of eVTOL aircraft is a high-dimensional optimization problem that involves large numbers of continuous design parameters. Therefore, eVTOL design method would benefit from numerical optimization algorithms capable of systematically searching these high-dimensional parameters spaces, using comprehensive and multidisciplinary models of the aircraft. By leveraging recent progress in sensitivity analysis methods, a computational framework called the Comprehensive Aircraft high-Dimensional DEsign Environment (CADDEE) has been developed for large-scale multidisciplinary design optimization (MDO) of electric air taxis. CADDEE uses a geometry-centric approach that propagates geometry changes in a differentiable manner to meshes for physics-based models of arbitrary fidelity level. The paper demonstrates the capabilities of this new aircraft design tool, by presenting large-scale MDO results for NASA's Lift+Cruise eVTOL concept. MDO with over 100 design variables, 17 constraints, and low-fidelity predictive models for key disciplines is demonstrated with an optimization time of less than one hour with a desktop computer. The results show a reduction in gross weight of 11.4% and suggest that CADDEE can be valuable in the conceptual design and optimization of eVTOL aircraft. △ Less

Submitted 25 April, 2023; originally announced April 2023.

Comments: 21 pages, 15 figures

ACM Class: J.2

arXiv:2303.14936 [pdf, ps, other]

TALOS: A toolbox for spacecraft conceptual design

Authors: Victor Gandarillas, John T. Hwang

Abstract: We present the Toolbox for Analysis and Large-scale Optimization of Spacecraft (TALOS), a framework designed for applying large-scale multidisciplinary design optimization (MDO) to spacecraft design problems. The framework is built using the Computational System Design Language (CSDL), with abstractions for users to describe systems at a high level. CSDL is a compiled, embedded domain-specific lan… ▽ More We present the Toolbox for Analysis and Large-scale Optimization of Spacecraft (TALOS), a framework designed for applying large-scale multidisciplinary design optimization (MDO) to spacecraft design problems. The framework is built using the Computational System Design Language (CSDL), with abstractions for users to describe systems at a high level. CSDL is a compiled, embedded domain-specific language that fully automates derivative computation using the adjoint method. CSDL provides a unified interface for defining MDO problems, separating model definition from low-level program implementation details. TALOS provides discipline models for spacecraft mission designers to perform analyses, optimizations, and trade studies early in the design process. TALOS also provides interfaces for users to provide high-level system descriptions without the need to use CSDL directly, which simplifies the exploration of different spacecraft configurations. We describe the interfaces in TALOS available to users and run analyses on selected spacecraft subsystem disciplines to demonstrate the current capabilities of TALOS. △ Less

Submitted 27 March, 2023; originally announced March 2023.

arXiv:2302.13295 [pdf, ps, other]

Persistence of the solution to the Euler equations in the end-point critical Triebel-Lizorkin space $F^{d+1}_{1, \infty}(\mathbb{R}^d)$

Authors: Hee Chul Pak, Jun Seok Hwang

Abstract: Local stay of the solutions to the Euler equations for an ideal incompressible fluid in the end-point Triebel-Lizorkin spaces $F^s_{1, \infty}(\mathbb{R}^d)$ with $s \geq d + 1$ is clarified. Local stay of the solutions to the Euler equations for an ideal incompressible fluid in the end-point Triebel-Lizorkin spaces $F^s_{1, \infty}(\mathbb{R}^d)$ with $s \geq d + 1$ is clarified. △ Less

Submitted 26 February, 2023; originally announced February 2023.

Comments: 15 pages

MSC Class: 76B03; 35Q31

arXiv:2301.12622 [pdf, ps, other]

Lines on holomorphic contact manifolds and a generalization of $(2,3,5)$-distributions to higher dimensions

Authors: Jun-Muk Hwang, Qifeng Li

Abstract: Since the celebrated work by Cartan, distributions with \nobreak{small} growth vector $(2,3,5)$ have been studied extensively. In the holomorphic setting, there is a natural correspondence between holomorphic $(2,3,5)$-distributions and nondegenerate lines on holomorphic contact manifolds of dimension 5. We generalize this correspondence to higher dimensions by studying nondegenerate lines on holo… ▽ More Since the celebrated work by Cartan, distributions with \nobreak{small} growth vector $(2,3,5)$ have been studied extensively. In the holomorphic setting, there is a natural correspondence between holomorphic $(2,3,5)$-distributions and nondegenerate lines on holomorphic contact manifolds of dimension 5. We generalize this correspondence to higher dimensions by studying nondegenerate lines on holomorphic contact manifolds and the corresponding class of distributions of small growth vector $(2m, 3m, 3m+2)$ for any positive integer $m$. △ Less

Submitted 29 January, 2023; originally announced January 2023.

Comments: To appear in Nagoya Mathematical Journal

arXiv:2212.09226 [pdf, ps, other]

Recognizing the ${\rm G}_2$-horospherical manifold of Picard number 1 by varieties of minimal rational tangents

Authors: Jun-Muk Hwang, Qifeng Li

Abstract: Pasquier and Perrin discovered that the ${\rm G}_2$-horospherical manifold ${\bf X}$ of Picard number 1 can be realized as a smooth specialization of the rational homogeneous space parameterizing the lines on the 5-dimensional hyperquadric, in other words, it can be deformed nontrivially to the rational homogeneous space. We show that ${\bf X}$ is the only smooth projective variety with this prope… ▽ More Pasquier and Perrin discovered that the ${\rm G}_2$-horospherical manifold ${\bf X}$ of Picard number 1 can be realized as a smooth specialization of the rational homogeneous space parameterizing the lines on the 5-dimensional hyperquadric, in other words, it can be deformed nontrivially to the rational homogeneous space. We show that ${\bf X}$ is the only smooth projective variety with this property. This is obtained as a consequence of our main result that ${\bf X}$ can be recognized by its VMRT, namely, a Fano manifold of Picard number 1 is biregular to ${\bf X}$ if and only if its VMRT at a general point is projectively isomorphic to that of ${\bf X}$. We employ the method the authors developed to solve the corresponding problem for symplectic Grassmannians, which constructs a flat Cartan connection in a neighborhood of a general minimal rational curve. In adapting this method to ${\bf X}$, we need an intricate study of the positivity/negativity of vector bundles with respect to a family of rational curves, which is subtler than the case of symplectic Grassmannians because of the nature of the differential geometric structure on ${\bf X}$ arising from VMRT. △ Less

Submitted 18 December, 2022; originally announced December 2022.

Comments: To appear in Transformation Groups

MSC Class: 14M17; 32G05; 53C15

arXiv:2211.15880 [pdf, other]

Mirror descent of Hopfield model

Authors: Hyungjoon Soh, Dongyeob Kim, Juno Hwang, Junghyo Jo

Abstract: Mirror descent is an elegant optimization technique that leverages a dual space of parametric models to perform gradient descent. While originally developed for convex optimization, it has increasingly been applied in the field of machine learning. In this study, we propose a novel approach for utilizing mirror descent to initialize the parameters of neural networks. Specifically, we demonstrate t… ▽ More Mirror descent is an elegant optimization technique that leverages a dual space of parametric models to perform gradient descent. While originally developed for convex optimization, it has increasingly been applied in the field of machine learning. In this study, we propose a novel approach for utilizing mirror descent to initialize the parameters of neural networks. Specifically, we demonstrate that by using the Hopfield model as a prototype for neural networks, mirror descent can effectively train the model with significantly improved performance compared to traditional gradient descent methods that rely on random parameter initialization. Our findings highlight the potential of mirror descent as a promising initialization technique for enhancing the optimization of machine learning models. △ Less

Submitted 9 May, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

Comments: 3 figures

arXiv:2207.08383 [pdf, ps, other]

doi 10.1016/j.chaos.2022.112055

A New Necessary and Sufficient Condition for the Existence of Global Solutions to Semilinear Parabolic Equations on Bounded Domains

Authors: Soon-Yeong Chung, Jaeho Hwang

Abstract: The purpose of this paper is to give a necessary and sufficient condition for the existence and non-existence of global solutions of the following semilinear parabolic equations \[ u_{t}=Δu+ψ(t)f(u),\,\,\mbox{ in }Ω\times (0,t^{*}), \] under the Dirichlet boundary condition on a bounded domain. In fact, this has remained as an open problem for a few decades, even for the case $f(u)=u^{p}$.… ▽ More The purpose of this paper is to give a necessary and sufficient condition for the existence and non-existence of global solutions of the following semilinear parabolic equations \[ u_{t}=Δu+ψ(t)f(u),\,\,\mbox{ in }Ω\times (0,t^{*}), \] under the Dirichlet boundary condition on a bounded domain. In fact, this has remained as an open problem for a few decades, even for the case $f(u)=u^{p}$. As a matter of fact, we prove: \[ \begin{aligned} &\mbox{there is no global solution for any initial data if and only if } &\mbox{the function } f \mbox{ satisfies} &\hspace{20mm}\int_{0}^{\infty}ψ(t)\frac{f\left(\lVert S(t)u_{0}\rVert_{\infty}\right)}{\lVert S(t)u_{0}\rVert_{\infty}}dt=\infty &\mbox{for every }\,ε>0\,\mbox{ and nonnegative nontrivial initial data }\,u_{0}\in C_{0}(Ω). \end{aligned} \] Here, $(S(t))_{t\geq 0}$ is the heat semigroup with the Dirichlet boundary condition. △ Less

Submitted 17 September, 2022; v1 submitted 18 July, 2022; originally announced July 2022.

Comments: 12 pages

MSC Class: 35F31; 35K91; 35K57

arXiv:2207.01765 [pdf, other]

doi 10.1016/j.jcp.2023.112031

opPINN: Physics-Informed Neural Network with operator learning to approximate solutions to the Fokker-Planck-Landau equation

Authors: Jae Yong Lee, Juhi Jang, Hyung Ju Hwang

Abstract: We propose a hybrid framework opPINN: physics-informed neural network (PINN) with operator learning for approximating the solution to the Fokker-Planck-Landau (FPL) equation. The opPINN framework is divided into two steps: Step 1 and Step 2. After the operator surrogate models are trained during Step 1, PINN can effectively approximate the solution to the FPL equation during Step 2 by using the pr… ▽ More We propose a hybrid framework opPINN: physics-informed neural network (PINN) with operator learning for approximating the solution to the Fokker-Planck-Landau (FPL) equation. The opPINN framework is divided into two steps: Step 1 and Step 2. After the operator surrogate models are trained during Step 1, PINN can effectively approximate the solution to the FPL equation during Step 2 by using the pre-trained surrogate models. The operator surrogate models greatly reduce the computational cost and boost PINN by approximating the complex Landau collision integral in the FPL equation. The operator surrogate models can also be combined with the traditional numerical schemes. It provides a high efficiency in computational time when the number of velocity modes becomes larger. Using the opPINN framework, we provide the neural network solutions for the FPL equation under the various types of initial conditions, and interaction models in two and three dimensions. Furthermore, based on the theoretical properties of the FPL equation, we show that the approximated neural network solution converges to the a priori classical solution of the FPL equation as the pre-defined loss function is reduced. △ Less

Submitted 4 July, 2022; originally announced July 2022.

Comments: 28 pages, 12 figures

MSC Class: 68T20; 35Q84; 35B40; 82C40

arXiv:2205.01059 [pdf, other]

Enhanced Physics-Informed Neural Networks with Augmented Lagrangian Relaxation Method (AL-PINNs)

Authors: Hwijae Son, Sung Woong Cho, Hyung Ju Hwang

Abstract: Physics-Informed Neural Networks (PINNs) have become a prominent application of deep learning in scientific computation, as they are powerful approximators of solutions to nonlinear partial differential equations (PDEs). There have been numerous attempts to facilitate the training process of PINNs by adjusting the weight of each component of the loss function, called adaptive loss-balancing algori… ▽ More Physics-Informed Neural Networks (PINNs) have become a prominent application of deep learning in scientific computation, as they are powerful approximators of solutions to nonlinear partial differential equations (PDEs). There have been numerous attempts to facilitate the training process of PINNs by adjusting the weight of each component of the loss function, called adaptive loss-balancing algorithms. In this paper, we propose an Augmented Lagrangian relaxation method for PINNs (AL-PINNs). We treat the initial and boundary conditions as constraints for the optimization problem of the PDE residual. By employing Augmented Lagrangian relaxation, the constrained optimization problem becomes a sequential max-min problem so that the learnable parameters $λ$ adaptively balance each loss component. Our theoretical analysis reveals that the sequence of minimizers of the proposed loss functions converges to an actual solution for the Helmholtz, viscous Burgers, and Klein--Gordon equations. We demonstrate through various numerical experiments that AL-PINNs yield a much smaller relative error compared with that of state-of-the-art adaptive loss-balancing algorithms. △ Less

Submitted 30 May, 2023; v1 submitted 29 April, 2022; originally announced May 2022.

arXiv:2204.09927 [pdf, ps, other]

Partial compactification of metabelian Lie groups with prescribed varieties of minimal rational tangents

Authors: Jun-Muk Hwang

Abstract: We study minimal rational curves on a complex manifold that are tangent to a distribution. In this setting, the variety of minimal rational tangents (VMRT) has to be isotropic with respect to the Levi tensor of the distribution. Our main result is a converse of this: any smooth projective variety isotropic with respect to a vector-valued anti-symmetric form can be realized as VMRT of minimal ratio… ▽ More We study minimal rational curves on a complex manifold that are tangent to a distribution. In this setting, the variety of minimal rational tangents (VMRT) has to be isotropic with respect to the Levi tensor of the distribution. Our main result is a converse of this: any smooth projective variety isotropic with respect to a vector-valued anti-symmetric form can be realized as VMRT of minimal rational curves tangent to a distribution on a complex manifold. The complex manifold is constructed as a partial equivariant compactification of a metabelian group, which is a result of independent interest. △ Less

Submitted 21 April, 2022; originally announced April 2022.

Comments: 19 pages, to appear in IMRN

MSC Class: 58A30; 32J05; 14L30

arXiv:2204.02529 [pdf, ps, other]

Minimal rational curves and 1-flat irreducible G-structures

Authors: Jun-Muk Hwang, Qifeng Li

Abstract: 1-flat irreducible G-structures, equivalently, irreducible G-structures admitting torsion-free affine connections, have been studied extensively in differential geometry, especially in connection with the theory of affine holonomy groups. We propose to study them in a setting in algebraic geometry, where they arise from varieties of minimal rational tangents (VMRT) associated to families of minima… ▽ More 1-flat irreducible G-structures, equivalently, irreducible G-structures admitting torsion-free affine connections, have been studied extensively in differential geometry, especially in connection with the theory of affine holonomy groups. We propose to study them in a setting in algebraic geometry, where they arise from varieties of minimal rational tangents (VMRT) associated to families of minimal rational curves on uniruled projective manifolds. We prove that such a structure is locally symmetric when the dimension of the uniruled projective manifold is at least 5. By the classification result of Merkulov and Schwachhöfer on irreducible affine holonomy, the problem is reduced to the case when the VMRT at a general point of the uniruled projective manifold is isomorphic to a subadjoint variety. In the latter situation, we prove a stronger result that, without the assumption of 1-flatness, the structure arising from VMRT is always locally flat. The proof employs the method of Cartan connections. An interesting feature is that Cartan connections are considered not for the G-structures themselves, but for certain geometric structures on the spaces of minimal rational curves. △ Less

Submitted 5 April, 2022; originally announced April 2022.

Comments: 36 pages, to appear in Journal of Geometric Analysis in the collection in memory of Nessim Sibony

MSC Class: 53C10; 14M17; 14M22

arXiv:2201.11967 [pdf, other]

Pseudo-Differential Neural Operator: Generalized Fourier Neural Operator for Learning Solution Operators of Partial Differential Equations

Authors: Jin Young Shin, Jae Yong Lee, Hyung Ju Hwang

Abstract: Learning the mapping between two function spaces has garnered considerable research attention. However, learning the solution operator of partial differential equations (PDEs) remains a challenge in scientific computing. Fourier neural operator (FNO) was recently proposed to learn solution operators, and it achieved an excellent performance. In this study, we propose a novel \textit{pseudo-differe… ▽ More Learning the mapping between two function spaces has garnered considerable research attention. However, learning the solution operator of partial differential equations (PDEs) remains a challenge in scientific computing. Fourier neural operator (FNO) was recently proposed to learn solution operators, and it achieved an excellent performance. In this study, we propose a novel \textit{pseudo-differential integral operator} (PDIO) to analyze and generalize the Fourier integral operator in FNO. PDIO is inspired by a pseudo-differential operator, which is a generalized differential operator characterized by a certain symbol. We parameterize this symbol using a neural network and demonstrate that the neural network-based symbol is contained in a smooth symbol class. Subsequently, we verify that the PDIO is a bounded linear operator, and thus is continuous in the Sobolev space. We combine the PDIO with the neural operator to develop a \textit{pseudo-differential neural operator} (PDNO) and learn the nonlinear solution operator of PDEs. We experimentally validate the effectiveness of the proposed model by utilizing Darcy flow and the Navier-Stokes equation. The obtained results indicate that the proposed PDNO outperforms the existing neural operator approaches in most experiments. △ Less

Submitted 4 March, 2024; v1 submitted 28 January, 2022; originally announced January 2022.

Comments: 23 pages, 13 figures

MSC Class: 35S05; 47G30; 68U07

arXiv:2111.04941 [pdf, other]

doi 10.1609/aaai.v36i4.20373

Solving PDE-constrained Control Problems Using Operator Learning

Authors: Rakhoon Hwang, Jae Yong Lee, Jin Young Shin, Hyung Ju Hwang

Abstract: The modeling and control of complex physical systems are essential in real-world problems. We propose a novel framework that is generally applicable to solving PDE-constrained optimal control problems by introducing surrogate models for PDE solution operators with special regularizers. The procedure of the proposed framework is divided into two phases: solution operator learning for PDE constraint… ▽ More The modeling and control of complex physical systems are essential in real-world problems. We propose a novel framework that is generally applicable to solving PDE-constrained optimal control problems by introducing surrogate models for PDE solution operators with special regularizers. The procedure of the proposed framework is divided into two phases: solution operator learning for PDE constraints (Phase 1) and searching for optimal control (Phase 2). Once the surrogate model is trained in Phase 1, the optimal control can be inferred in Phase 2 without intensive computations. Our framework can be applied to both data-driven and data-free cases. We demonstrate the successful application of our method to various optimal control problems for different control variables with diverse PDE constraints from the Poisson equation to Burgers' equation. △ Less

Submitted 26 December, 2023; v1 submitted 8 November, 2021; originally announced November 2021.

Comments: 15 pages, 12 figures. Published as a conference paper at Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI 2022)

MSC Class: 68U07

arXiv:2109.14851 [pdf, other]

doi 10.1016/j.jcp.2023.112518

The Deep Minimizing Movement Scheme

Authors: Min Sue Park, Cheolhyeong Kim, Hwijae Son, Hyung Ju Hwang

Abstract: Solutions of certain partial differential equations (PDEs) are often represented by the steepest descent curves of corresponding functionals. Minimizing movement scheme was developed in order to study such curves in metric spaces. Especially, Jordan-Kinderlehrer-Otto studied the Fokker-Planck equation in this way with respect to the Wasserstein metric space. In this paper, we propose a deep learni… ▽ More Solutions of certain partial differential equations (PDEs) are often represented by the steepest descent curves of corresponding functionals. Minimizing movement scheme was developed in order to study such curves in metric spaces. Especially, Jordan-Kinderlehrer-Otto studied the Fokker-Planck equation in this way with respect to the Wasserstein metric space. In this paper, we propose a deep learning-based minimizing movement scheme for approximating the solutions of PDEs. The proposed method is highly scalable for high-dimensional problems as it is free of mesh generation. We demonstrate through various kinds of numerical examples that the proposed method accurately approximates the solutions of PDEs by finding the steepest descent direction of a functional even in high dimensions. △ Less

Submitted 25 September, 2023; v1 submitted 30 September, 2021; originally announced September 2021.

Comments: 29 pages, 17 figures

Journal ref: J. Comput. Phys. 494 (2023) 112518

arXiv:2106.12147 [pdf, other]

Lagrangian dual framework for conservative neural network solutions of kinetic equations

Authors: Hyung Ju Hwang, Hwijae Son

Abstract: In this paper, we propose a novel conservative formulation for solving kinetic equations via neural networks. More precisely, we formulate the learning problem as a constrained optimization problem with constraints that represent the physical conservation laws. The constraints are relaxed toward the residual loss function by the Lagrangian duality. By imposing physical conservation properties of t… ▽ More In this paper, we propose a novel conservative formulation for solving kinetic equations via neural networks. More precisely, we formulate the learning problem as a constrained optimization problem with constraints that represent the physical conservation laws. The constraints are relaxed toward the residual loss function by the Lagrangian duality. By imposing physical conservation properties of the solution as constraints of the learning problem, we demonstrate far more accurate approximations of the solutions in terms of errors and the conservation laws, for the kinetic Fokker-Planck equation and the homogeneous Boltzmann equation. △ Less

Submitted 23 June, 2021; originally announced June 2021.

arXiv:2102.07331 [pdf, ps, other]

Unbendable rational curves of Goursat type and Cartan type

Authors: Jun-Muk Hwang, Qifeng Li

Abstract: We study unbendable rational curves, i.e., nonsingular rational curves in a complex manifold of dimension $n$ with normal bundles isomorphic to $\mathcal{O}_{\mathbb{P}^1}(1)^{\oplus p} \oplus \mathcal{O}_{\mathbb{P}^1}^{\oplus (n-1-p)}$ for some nonnegative integer $p$. Well-known examples arise from algebraic geometry as general minimal rational curves of uniruled projective manifolds. After des… ▽ More We study unbendable rational curves, i.e., nonsingular rational curves in a complex manifold of dimension $n$ with normal bundles isomorphic to $\mathcal{O}_{\mathbb{P}^1}(1)^{\oplus p} \oplus \mathcal{O}_{\mathbb{P}^1}^{\oplus (n-1-p)}$ for some nonnegative integer $p$. Well-known examples arise from algebraic geometry as general minimal rational curves of uniruled projective manifolds. After describing the relations between the differential geometric properties of the natural distributions on the deformation spaces of unbendable rational curves and the projective geometric properties of their varieties of minimal rational tangents, we concentrate on the case of $p=1$ and $n \leq 5$, which is the simplest nontrivial situation. In this case, the families of unbendable rational curves fall essentially into two classes: Goursat type or Cartan type. Those of Goursat type arise from ordinary differential equations and those of Cartan type have special features related to contact geometry. We show that the family of lines on any nonsingular cubic 4-fold is of Goursat type, whereas the family of lines on a general quartic 5-fold is of Cartan type, in the proof of which the projective geometry of varieties of minimal rational tangents plays a key role. △ Less

Submitted 14 February, 2021; originally announced February 2021.

Comments: To appear in Journal de Mathématiques Pures et Appliquées

arXiv:2101.08932 [pdf, other]

Sobolev Training for Physics Informed Neural Networks

Authors: Hwijae Son, Jin Woo Jang, Woo Jin Han, Hyung Ju Hwang

Abstract: Physics Informed Neural Networks (PINNs) is a promising application of deep learning. The smooth architecture of a fully connected neural network is appropriate for finding the solutions of PDEs; the corresponding loss function can also be intuitively designed and guarantees the convergence for various kinds of PDEs. However, the rate of convergence has been considered as a weakness of this approa… ▽ More Physics Informed Neural Networks (PINNs) is a promising application of deep learning. The smooth architecture of a fully connected neural network is appropriate for finding the solutions of PDEs; the corresponding loss function can also be intuitively designed and guarantees the convergence for various kinds of PDEs. However, the rate of convergence has been considered as a weakness of this approach. This paper proposes Sobolev-PINNs, a novel loss function for the training of PINNs, making the training substantially efficient. Inspired by the recent studies that incorporate derivative information for the training of neural networks, we develop a loss function that guides a neural network to reduce the error in the corresponding Sobolev space. Surprisingly, a simple modification of the loss function can make the training process similar to \textit{Sobolev Training} although PINNs is not a fully supervised learning task. We provide several theoretical justifications that the proposed loss functions upper bound the error in the corresponding Sobolev spaces for the viscous Burgers equation and the kinetic Fokker--Planck equation. We also present several simulation results, which show that compared with the traditional $L^2$ loss function, the proposed loss function guides the neural network to a significantly faster convergence. Moreover, we provide the empirical evidence that shows that the proposed loss function, together with the iterative sampling techniques, performs better in solving high dimensional PDEs. △ Less

Submitted 8 December, 2021; v1 submitted 21 January, 2021; originally announced January 2021.

arXiv:2101.08520 [pdf, other]

Traveling Wave Solutions of Partial Differential Equations via Neural Networks

Authors: Sung Woong Cho, Hyung Ju Hwang, Hwijae Son

Abstract: This paper focuses on how to approximate traveling wave solutions for various kinds of partial differential equations via artificial neural networks. A traveling wave solution is hard to obtain with traditional numerical methods when the corresponding wave speed is unknown in advance. We propose a novel method to approximate both the traveling wave solution and the unknown wave speed via a neural… ▽ More This paper focuses on how to approximate traveling wave solutions for various kinds of partial differential equations via artificial neural networks. A traveling wave solution is hard to obtain with traditional numerical methods when the corresponding wave speed is unknown in advance. We propose a novel method to approximate both the traveling wave solution and the unknown wave speed via a neural network and an additional free parameter. We proved that under a mild assumption, the neural network solution converges to the analytic solution and the free parameter accurately approximates the wave speed as the corresponding loss tends to zero for the Keller-Segel equation. We also demonstrate in the experiments that reducing loss through training assures an accurate approximation of the traveling wave solution and the wave speed for the Keller-Segel equation, the Allen-Cahn model with relaxation, and the Lotka-Volterra competition model. △ Less

Submitted 28 June, 2021; v1 submitted 21 January, 2021; originally announced January 2021.

arXiv:2101.05409 [pdf, ps, other]

Varieties of minimal rational tangents of unbendable rational curves subordinate to contact structures

Authors: Jun-Muk Hwang

Abstract: A nonsingular rational curve $C$ in a complex manifold $X$ whose normal bundle is isomorphic to $${\mathcal O}_{{\mathbb P}^1}(1)^{\oplus p} \oplus {\mathcal O}_{{\mathbb P}^1}^{\oplus q}$$ for some nonnegative integers $p$ and $q$ is called an unbendable rational curve on $X$. Associated with it is the variety of minimal rational tangents (VMRT) at a point $x \in C,$ which is the germ of submanif… ▽ More A nonsingular rational curve $C$ in a complex manifold $X$ whose normal bundle is isomorphic to $${\mathcal O}_{{\mathbb P}^1}(1)^{\oplus p} \oplus {\mathcal O}_{{\mathbb P}^1}^{\oplus q}$$ for some nonnegative integers $p$ and $q$ is called an unbendable rational curve on $X$. Associated with it is the variety of minimal rational tangents (VMRT) at a point $x \in C,$ which is the germ of submanifolds ${\mathcal C}^C_x \subset {\mathbb P} T_x X$ consisting of tangent directions of small deformations of $C$ fixing $x$. Assuming that there exists a distribution $D \subset TX$ such that all small deformations of $C$ are tangent to $D$, one asks what kind of submanifolds of projective space can be realized as the VMRT ${\mathcal C}^C_x \subset {\mathbb P} D_x$. When $D \subset TX$ is a contact distribution, a well-known necessary condition is that ${\mathcal C}_x^C$ should be Legendrian with respect to the induced contact structure on ${\mathbb P} D_x$. We prove that this is also a sufficient condition: we construct a complex manifold $X$ with a contact structure $D \subset TX$ and an unbendable rational curve $C \subset X$ such that all small deformations of $C$ are tangent to $D$ and the VMRT ${\mathcal C}^C_x \subset {\mathbb P} D_x$ at some point $x\in C$ is projectively isomorphic to an arbitrarily given Legendrian submanifold. Our construction uses the geometry of contact lines on the Heisenberg group and a technical ingredient is the symplectic geometry of distributions the study of which has originated from geometric control theory. △ Less

Submitted 13 January, 2021; originally announced January 2021.

Comments: 20 pages, to appear in J. Math. Soc. Japan

MSC Class: 58A30; 32C25; 14L40

arXiv:2010.14958 [pdf, other]

Cone structures and parabolic geometries

Authors: Jun-Muk Hwang, Katharina Neusser

Abstract: A cone structure on a complex manifold $M$ is a closed submanifold $\mathcal C \subset \mathbb P TM$ of the projectivized tangent bundle which is submersive over $M$. A conic connection on $\mathcal C$ specifies a distinguished family of curves on $M$ in the directions specified by $\mathcal C$. There are two common sources of cone structures and conic connections, one in differential geometry and… ▽ More A cone structure on a complex manifold $M$ is a closed submanifold $\mathcal C \subset \mathbb P TM$ of the projectivized tangent bundle which is submersive over $M$. A conic connection on $\mathcal C$ specifies a distinguished family of curves on $M$ in the directions specified by $\mathcal C$. There are two common sources of cone structures and conic connections, one in differential geometry and another in algebraic geometry. In differential geometry, we have cone structures induced by the geometric structures underlying holomorphic parabolic geometries, a classical example of which is the null cone bundle of a holomorphic conformal structure. In algebraic geometry, we have the cone structures consisting of varieties of minimal rational tangents (VMRT) given by minimal rational curves on uniruled projective manifolds. The local invariants of the cone structures in parabolic geometries are given by the curvature of the parabolic geometries, the nature of which depend on the type of the parabolic geometry, i.e., the type of the fibers of $\mathcal C \to M$. For the VMRT-structures, more intrinsic invariants of the conic connections which do not depend on the type of the fiber play important roles. We study the relation between these two different aspects for the cone structures induced by parabolic geometries associated with a long simple root of a complex simple Lie algebra. As an application, we obtain a local differential-geometric version of the global algebraic-geometric recognition theorem due to Mok and Hong--Hwang. In our local version, the role of rational curves is completely replaced by appropriate torsion conditions on the conic connection. △ Less

Submitted 28 October, 2020; originally announced October 2020.

Comments: 39 pages

MSC Class: 53B99 (Primary) 53C56; 14M15 (Secondary)

arXiv:2010.10818 [pdf, ps, other]

Legendrian cone structures and contact prolongations

Authors: Jun-Muk Hwang

Abstract: We study a cone structure ${\mathcal C} \subset {\mathbb P} D$ on a holomorphic contact manifold $(M, D \subset T_M)$ such that each fiber ${\mathcal C}_x \subset {\mathbb P} D_x$ is isomorphic to a Legendrian submanifold of fixed isomorphism type. By characterizing subadjoint varieties among Legendrian submanifolds in terms of contact prolongations, we prove that the canonical distribution on the… ▽ More We study a cone structure ${\mathcal C} \subset {\mathbb P} D$ on a holomorphic contact manifold $(M, D \subset T_M)$ such that each fiber ${\mathcal C}_x \subset {\mathbb P} D_x$ is isomorphic to a Legendrian submanifold of fixed isomorphism type. By characterizing subadjoint varieties among Legendrian submanifolds in terms of contact prolongations, we prove that the canonical distribution on the associated contact G-structure admits a holomorphic horizontal splitting. △ Less

Submitted 21 October, 2020; originally announced October 2020.

Comments: 14 pages, to appear in the proceedings volume of the Abel Symposium 2019

MSC Class: 53B99; 14J45

arXiv:2009.13280 [pdf, other]

The model reduction of the Vlasov-Poisson-Fokker-Planck system to the Poisson-Nernst-Planck system via the Deep Neural Network Approach

Authors: Jae Yong Lee, Jin Woo Jang, Hyung Ju Hwang

Abstract: The model reduction of a mesoscopic kinetic dynamics to a macroscopic continuum dynamics has been one of the fundamental questions in mathematical physics since Hilbert's time. In this paper, we consider a diagram of the diffusion limit from the Vlasov-Poisson-Fokker-Planck (VPFP) system on a bounded interval with the specular reflection boundary condition to the Poisson-Nernst-Planck (PNP) system… ▽ More The model reduction of a mesoscopic kinetic dynamics to a macroscopic continuum dynamics has been one of the fundamental questions in mathematical physics since Hilbert's time. In this paper, we consider a diagram of the diffusion limit from the Vlasov-Poisson-Fokker-Planck (VPFP) system on a bounded interval with the specular reflection boundary condition to the Poisson-Nernst-Planck (PNP) system with the no-flux boundary condition. We provide a Deep Learning algorithm to simulate the VPFP system and the PNP system by computing the time-asymptotic behaviors of the solution and the physical quantities. We analyze the convergence of the neural network solution of the VPFP system to that of the PNP system via the Asymptotic-Preserving (AP) scheme. Also, we provide several theoretical evidence that the Deep Neural Network (DNN) solutions to the VPFP and the PNP systems converge to the a priori classical solutions of each system if the total loss function vanishes. △ Less

Submitted 28 September, 2020; originally announced September 2020.

Comments: 49 pages, 16 figures

MSC Class: 68T20; 35Q84; 35B40; 82C40

arXiv:2009.01391 [pdf, ps, other]

$L^2$ decay for the linearized Landau equation with the specular boundary condition

Authors: Yan Guo, Hyung Ju Hwang, Jin Woo Jang, Zhimeng Ouyang

Abstract: In this paper, we develop an alternative approach to establish the $L^2$ decay estimate for the linearized Landau equation in a bounded domain with specular boundary condition. The proof is based on the methodology of proof by contradiction motivated by [Guo, Comm. Pure Appl. Math., 55(9):1104-1135, 2002] and [Guo, Arch. Ration. Mech. Anal., 197(3):713-809, 2010]. In this paper, we develop an alternative approach to establish the $L^2$ decay estimate for the linearized Landau equation in a bounded domain with specular boundary condition. The proof is based on the methodology of proof by contradiction motivated by [Guo, Comm. Pure Appl. Math., 55(9):1104-1135, 2002] and [Guo, Arch. Ration. Mech. Anal., 197(3):713-809, 2010]. △ Less

Submitted 28 September, 2020; v1 submitted 2 September, 2020; originally announced September 2020.

Comments: 19 pages. More details have been added

arXiv:2003.03885 [pdf, ps, other]

Extending Nirenberg-Spencer's question on holomorphic embeddings to families of holomorphic embeddings

Authors: Jun-Muk Hwang

Abstract: Nirenberg and Spencer posed the question whether the germ of a compact complex submanifold in a complex manifold is determined by its infinitesimal neighborhood of finite order when the normal bundle is sufficiently positive. To study the problem for a larger class of submanifolds, including free rational curves, we reformulate the question in the setting of families of submanifolds and their infi… ▽ More Nirenberg and Spencer posed the question whether the germ of a compact complex submanifold in a complex manifold is determined by its infinitesimal neighborhood of finite order when the normal bundle is sufficiently positive. To study the problem for a larger class of submanifolds, including free rational curves, we reformulate the question in the setting of families of submanifolds and their infinitesimal neighborhoods. When the submanifolds have no nonzero vector fields, we prove that it is sufficient to consider only first-order neighborhoods to have an affirmative answer to the reformulated question. When the submanifolds do have nonzero vector fields, we obtain an affirmative answer to the question under the additional assumption that submanifolds have certain nice deformation properties, which is applicable to free rational curves. As an application, we obtain a stronger version of the Cartan-Fubini type extension theorem for Fano manifolds of Picard number 1. We also propose a potential application on hyperplane sections of projective K3 surfaces. △ Less

Submitted 13 January, 2021; v1 submitted 8 March, 2020; originally announced March 2020.

Comments: 24 pages. The result in Section 4 has been replaced by a weaker statement, because the argument in the first version had a gap, which was pointed out by Yong Hu. To appear in Duke Math. J

MSC Class: 32C22; 58A15; 14J45; 14J28

arXiv:1911.09843 [pdf, other]

doi 10.1016/j.jcp.2020.109665

Trend to Equilibrium for the Kinetic Fokker-Planck Equation via the Neural Network Approach

Authors: Hyung Ju Hwang, Jin Woo Jang, Hyeontae Jo, Jae Yong Lee

Abstract: The issue of the relaxation to equilibrium has been at the core of the kinetic theory of rarefied gas dynamics. In the paper, we introduce the Deep Neural Network (DNN) approximated solutions to the kinetic Fokker-Planck equation in a bounded interval and study the large-time asymptotic behavior of the solutions and other physically relevant macroscopic quantities. We impose the varied types of bo… ▽ More The issue of the relaxation to equilibrium has been at the core of the kinetic theory of rarefied gas dynamics. In the paper, we introduce the Deep Neural Network (DNN) approximated solutions to the kinetic Fokker-Planck equation in a bounded interval and study the large-time asymptotic behavior of the solutions and other physically relevant macroscopic quantities. We impose the varied types of boundary conditions including the inflow-type and the reflection-type boundaries as well as the varied diffusion and friction coefficients and study the boundary effects on the asymptotic behaviors. These include the predictions on the large-time behaviors of the pointwise values of the particle distribution and the macroscopic physical quantities including the total kinetic energy, the entropy, and the free energy. We also provide the theoretical supports for the pointwise convergence of the neural network solutions to the \textit{a priori} analytic solutions. We use the library \textit{PyTorch}, the activation function \textit{tanh} between layers, and the \textit{Adam} optimizer for the Deep Learning algorithm. △ Less

Submitted 21 November, 2019; originally announced November 2019.

Comments: 31 pages, 13 figures

MSC Class: 68T20; 35Q84; 35B40; 82C40; 97R40

arXiv:1908.09261 [pdf, ps, other]

Tensor product and Hadamard product for the Wasserstein means

Authors: Jinmi Hwang, Sejong Kim

Abstract: As one of the least squares mean, we consider the Wasserstein mean of positive definite Hermitian matrices. We verify in this paper the inequalities of the Wasserstein mean related with a strictly positive and unital linear map, the identity of the Wasserstein mean for tensor product, and some inequalities of the Wasserstein mean for Hadamard product. As one of the least squares mean, we consider the Wasserstein mean of positive definite Hermitian matrices. We verify in this paper the inequalities of the Wasserstein mean related with a strictly positive and unital linear map, the identity of the Wasserstein mean for tensor product, and some inequalities of the Wasserstein mean for Hadamard product. △ Less

Submitted 25 August, 2019; originally announced August 2019.

Comments: 14 pages

MSC Class: 15B48; 15A69

arXiv:1907.12925 [pdf, other]

Deep Neural Network Approach to Forward-Inverse Problems

Authors: Hyeontae Jo, Hwijae Son, Hyung Ju Hwang, Eunheui Kim

Abstract: In this paper, we construct approximated solutions of Differential Equations (DEs) using the Deep Neural Network (DNN). Furthermore, we present an architecture that includes the process of finding model parameters through experimental data, the inverse problem. That is, we provide a unified framework of DNN architecture that approximates an analytic solution and its model parameters simultaneously… ▽ More In this paper, we construct approximated solutions of Differential Equations (DEs) using the Deep Neural Network (DNN). Furthermore, we present an architecture that includes the process of finding model parameters through experimental data, the inverse problem. That is, we provide a unified framework of DNN architecture that approximates an analytic solution and its model parameters simultaneously. The architecture consists of a feed forward DNN with non-linear activation functions depending on DEs, automatic differentiation, reduction of order, and gradient based optimization method. We also prove theoretically that the proposed DNN solution converges to an analytic solution in a suitable function space for fundamental DEs. Finally, we perform numerical experiments to validate the robustness of our simplistic DNN architecture for 1D transport equation, 2D heat equation, 2D wave equation, and the Lotka-Volterra system. △ Less

Submitted 27 July, 2019; originally announced July 2019.

Showing 1–50 of 123 results for author: Hwang, J