-
Non-commutative resolutions and pre-quotients of Calabi-Yau double covers
Authors:
Tsung-Ju Lee,
Bong H. Lian,
Mauricio Romo,
Leonardo Santilli
Abstract:
Following an earlier proposal arXiv:2307.02038 to apply the GLSM formalism to understand the so-called non-commutative resolution, this paper takes one important step further to extend this formalism to a much larger class of non-commutative resolutions. The proposal was initially motivated by the discovery of a new class of mirror pairs singular Calabi-Yau varieties arXiv:2003.07148, given by cer…
▽ More
Following an earlier proposal arXiv:2307.02038 to apply the GLSM formalism to understand the so-called non-commutative resolution, this paper takes one important step further to extend this formalism to a much larger class of non-commutative resolutions. The proposal was initially motivated by the discovery of a new class of mirror pairs singular Calabi-Yau varieties arXiv:2003.07148, given by certain branched double covers over toric varieties of MPCP type. The overarching problem was to understand these mirror pairs from the viewpoint of homological mirror symmetry arXiv:alg-geom/9411018. In the present paper, we propose two main results along this line. First, one new insight is that the `gauge-fixing' condition on the branching locus of the double cover used in arXiv:2003.07148 can be relaxed in an interesting way. This turns out to produce GLSMs that describe a much larger class of non-commutative resolutions, leading to $A$-periods for a larger class of non-commutative resolutions, as well as the GKZ systems for their $A$-periods. Second, we show that the $A$-periods can also be realized as $A$-periods of a certain smooth CICY family in a toric variety of MPCP type, such that a suitable finite quotient of this family recovers the double cover CY we have started with. We call this CICY family the `pre-quotient' of the double cover CY. This realization strongly suggests that pre-quotient may provide an important approach for understanding homological mirror symmetry for singular double cover CY varieties and non-commutative resolutions.
△ Less
Submitted 1 July, 2025;
originally announced July 2025.
-
An Intersection Principle for Mean Curvature Flow
Authors:
Tang-Kai Lee,
Alec Payne
Abstract:
The avoidance principle says that mean curvature flows of hypersurfaces remain disjoint if they are disjoint at the initial time. We prove several generalizations of the avoidance principle that allow for intersections of hypersurfaces. First, we prove that the Hausdorff dimension of the intersection of two mean curvature flows is non-increasing over time, and we find precise information on how th…
▽ More
The avoidance principle says that mean curvature flows of hypersurfaces remain disjoint if they are disjoint at the initial time. We prove several generalizations of the avoidance principle that allow for intersections of hypersurfaces. First, we prove that the Hausdorff dimension of the intersection of two mean curvature flows is non-increasing over time, and we find precise information on how the dimension changes. We then show that the self-intersection of an immersed mean curvature flow has non-increasing dimension over time. Next, we extend the intersection dimension monotonicity to Brakke flows and level set flows which satisfy a localizability condition, and we provide examples showing that the monotonicity fails for general weak solutions. We find a localization result for level set flows with finitely many singularities, and as a consequence, we obtain a fattening criterion for these flows which depends on the behavior of intersections with smooth flows.
△ Less
Submitted 16 May, 2025;
originally announced May 2025.
-
Alternating Methods for Large-Scale AC Optimal Power Flow with Unit Commitment
Authors:
Matthew Brun,
Thomas Lee,
Dirk Lauinger,
Xin Chen,
Xu Andy Sun
Abstract:
Security-constrained unit commitment with alternating current optimal power flow (SCUC-ACOPF) is a central problem in power grid operations that optimizes commitment and dispatch of generators under a physically accurate power transmission model while encouraging robustness against component failures. SCUC-ACOPF requires solving large-scale problems that involve multiple time periods and networks…
▽ More
Security-constrained unit commitment with alternating current optimal power flow (SCUC-ACOPF) is a central problem in power grid operations that optimizes commitment and dispatch of generators under a physically accurate power transmission model while encouraging robustness against component failures. SCUC-ACOPF requires solving large-scale problems that involve multiple time periods and networks with thousands of buses within strict time limits. In this work, we study a detailed SCUC-ACOPF model with a rich set of features of modern power grids, including price-sensitive load, reserve products, transformer controls, and energy-limited devices. We propose a decomposition scheme and a penalty alternating direction method to find high-quality solutions to this model. Our methodology leverages spatial and temporal decomposition, separating the problem into a set of mixed-integer linear programs for each bus and a set of continuous nonlinear programs for each time period. To improve the performance of the algorithm, we introduce a variety of heuristics, including restrictions of temporal linking constraints, a second-order cone relaxation, and a contingency screening algorithm. We quantify the quality of feasible solutions through a dual bound from a convex second-order cone program. To evaluate our algorithm, we use large-scale test cases from Challenge 3 of the U.S. Department of Energy's Grid Optimization Competition that resemble real power grid data under a variety of operating conditions and decision horizons. The experiments yield feasible solutions with an average optimality gap of 1.33%, demonstrating that this approach generates near-optimal solutions within stringent time limits.
△ Less
Submitted 9 May, 2025;
originally announced May 2025.
-
Planarity and convexity for pinched ancient solutions of mean curvature flow
Authors:
Tang-Kai Lee,
Keaton Naff,
Jingze Zhu
Abstract:
We prove a parabolically scale-invariant variation of the planarity estimate in \cite{Na22} for higher codimension mean curvature flow, borrowing ideas from work of Brendle--Huisken--Sinestrari \cite{BHS}. Additionally, we prove convexity for pinched complete ancient solutions of the mean curvature flow in codimension one. Then we put these estimates together to characterize certain pinched comple…
▽ More
We prove a parabolically scale-invariant variation of the planarity estimate in \cite{Na22} for higher codimension mean curvature flow, borrowing ideas from work of Brendle--Huisken--Sinestrari \cite{BHS}. Additionally, we prove convexity for pinched complete ancient solutions of the mean curvature flow in codimension one. Then we put these estimates together to characterize certain pinched complete ancient solutions and shrinkers in higher codimension. We include some discussion of future research directions in this area of mean curvature flow.
△ Less
Submitted 24 April, 2025;
originally announced April 2025.
-
Equivariant Reinforcement Learning Frameworks for Quadrotor Low-Level Control
Authors:
Beomyeol Yu,
Taeyoung Lee
Abstract:
Improving sampling efficiency and generalization capability is critical for the successful data-driven control of quadrotor unmanned aerial vehicles (UAVs) that are inherently unstable. While various reinforcement learning (RL) approaches have been applied to autonomous quadrotor flight, they often require extensive training data, posing multiple challenges and safety risks in practice. To address…
▽ More
Improving sampling efficiency and generalization capability is critical for the successful data-driven control of quadrotor unmanned aerial vehicles (UAVs) that are inherently unstable. While various reinforcement learning (RL) approaches have been applied to autonomous quadrotor flight, they often require extensive training data, posing multiple challenges and safety risks in practice. To address these issues, we propose data-efficient, equivariant monolithic and modular RL frameworks for quadrotor low-level control. Specifically, by identifying the rotational and reflectional symmetries in quadrotor dynamics and encoding these symmetries into equivariant network models, we remove redundancies of learning in the state-action space. This approach enables the optimal control action learned in one configuration to automatically generalize into other configurations via symmetry, thereby enhancing data efficiency. Experimental results demonstrate that our equivariant approaches significantly outperform their non-equivariant counterparts in terms of learning efficiency and flight performance.
△ Less
Submitted 27 February, 2025;
originally announced February 2025.
-
A Supplement to the anticanonical Volumes of weak $\mathbb{Q}$-Fano threefolds of Picard rank two
Authors:
Ching-Jui Lai,
Tsung-Ju Lee
Abstract:
We show that for a weak $\mathbb{Q}$-Fano threefold $X$ ($\mathbb{Q}$-factorial with terminal singularities and $-K_X$ is nef and big) of Picard rank $ρ(X)\leq 2$, either $-K_X^3\leq 64$ or $-K_X^3=72$ and $X=\mathbb{P}_{\mathbb{P}^2}(\mathcal{O}_{\mathbb{P}^2}\oplus\mathcal{O}_{\mathbb{P}^2}(3))$. This is supplementary to the previous work in arXiv:2501.12555.
We show that for a weak $\mathbb{Q}$-Fano threefold $X$ ($\mathbb{Q}$-factorial with terminal singularities and $-K_X$ is nef and big) of Picard rank $ρ(X)\leq 2$, either $-K_X^3\leq 64$ or $-K_X^3=72$ and $X=\mathbb{P}_{\mathbb{P}^2}(\mathcal{O}_{\mathbb{P}^2}\oplus\mathcal{O}_{\mathbb{P}^2}(3))$. This is supplementary to the previous work in arXiv:2501.12555.
△ Less
Submitted 23 January, 2025;
originally announced February 2025.
-
Ancient caloric functions and parabolic frequency on graphs
Authors:
Tang-Kai Lee,
Archana Mohandas
Abstract:
We study ancient solutions to discrete heat equations on some weighted graphs. On a graph of the form of a product with $\bb Z,$ we show that there are no non-trivial ancient solutions with polynomial growth. This result is parallel to the case of finite graphs, which is also discussed. Along the way, we prove a backward uniqueness result for solutions with appropriate decaying rate based on a mon…
▽ More
We study ancient solutions to discrete heat equations on some weighted graphs. On a graph of the form of a product with $\bb Z,$ we show that there are no non-trivial ancient solutions with polynomial growth. This result is parallel to the case of finite graphs, which is also discussed. Along the way, we prove a backward uniqueness result for solutions with appropriate decaying rate based on a monotonicity formula of parabolic frequency.
△ Less
Submitted 19 December, 2024;
originally announced December 2024.
-
Mean Field Game and Control for Switching Hybrid Systems
Authors:
Tejaswi K. C.,
Taeyoung Lee
Abstract:
Mean field games and controls involve guiding the behavior of large populations of interacting agents, where each individual's influence on the group is negligible but collectively impacts overall dynamics. Hybrid systems integrate continuous dynamics with discrete transitions, effectively modeling the complex interplay between continuous flows and instantaneous jumps in a unified framework. This…
▽ More
Mean field games and controls involve guiding the behavior of large populations of interacting agents, where each individual's influence on the group is negligible but collectively impacts overall dynamics. Hybrid systems integrate continuous dynamics with discrete transitions, effectively modeling the complex interplay between continuous flows and instantaneous jumps in a unified framework. This paper formulates mean field game and control strategies for switching hybrid systems and proposes computational methods to solve the resulting integro-partial differential equation. This approach enables scalable, decentralized decision-making in large-scale switching systems, which is illustrated through numerical examples in an emergency evacuation scenario with sudden changes in the surrounding environment.
△ Less
Submitted 13 December, 2024;
originally announced December 2024.
-
Invariant Kalman Filter for Relative Dynamics
Authors:
Tejaswi K. C.,
Maneesha Wickramasuriya,
Taeyoung Lee
Abstract:
This paper presents an invariant Kalman filter for estimating the relative trajectories between two dynamic systems. Invariant Kalman filters formulate the estimation error in terms of the group operation, ensuring that the error state does not depend on the current state estimate - a property referred to as state trajectory independence. This is particularly advantageous in extended Kalman filter…
▽ More
This paper presents an invariant Kalman filter for estimating the relative trajectories between two dynamic systems. Invariant Kalman filters formulate the estimation error in terms of the group operation, ensuring that the error state does not depend on the current state estimate - a property referred to as state trajectory independence. This is particularly advantageous in extended Kalman filters, as it makes the propagation of the error covariance robust to large estimation errors. In this work, we construct invariant Kalman filters to the trajectory of one system relative to another. Specifically, we show that if the relative dynamics can be described solely by relative state variables, they automatically satisfy state trajectory independence, allowing for the development of an invariant Kalman filter. The corresponding relative invariant Kalman filter is formulated in an abstract fashion and is demonstrated numerically for the attitude dynamics of a rigid body.
△ Less
Submitted 13 December, 2024;
originally announced December 2024.
-
Uncertainty propagation of stochastic hybrid systems: a case study for types of jump
Authors:
Tejaswi K. C.,
William Clark,
Taeyoung Lee
Abstract:
Stochastic hybrid systems are dynamic systems that undergo both random continuous-time flows and random discrete jumps. Depending on how randomness is introduced into the continuous dynamics, discrete transitions, or both, stochastic hybrid systems exhibit distinct characteristics. This paper investigates the role of uncertainties in the interplay between continuous flows and discrete jumps by stu…
▽ More
Stochastic hybrid systems are dynamic systems that undergo both random continuous-time flows and random discrete jumps. Depending on how randomness is introduced into the continuous dynamics, discrete transitions, or both, stochastic hybrid systems exhibit distinct characteristics. This paper investigates the role of uncertainties in the interplay between continuous flows and discrete jumps by studying probability density propagation. Specifically, we formulate stochastic Koopman/Frobenius-Perron operators for three types of one-dimensional stochastic hybrid systems to uncover their unique dynamic characteristics and verify them using Monte Carlo simulations.
△ Less
Submitted 13 December, 2024;
originally announced December 2024.
-
Prime rings having nontrivial centralizers of (skew) traces of Lie ideals
Authors:
Tsiu-Kwen Lee,
Jheng-Huei Lin
Abstract:
Let $R$ be a prime ring with center $Z(R)$ and with involution $*$. Given an additive subgroup $A$ of $R$, let $T(A):=\{x+x^*\mid x\in A\}$ and $K_0(A):=\{x-x^*\mid x\in A\}$. Let $L$ be a non-abelian Lie ideal of $R$. It is proved that if $d$ is a nonzero derivation of $R$ satisfying $d(T(L))=0$ (resp. $d(K_0(L))=0$), then $T(R)^2\subseteq Z(R)$ (resp. $K_0(R)^2\subseteq Z(R)$). These results are…
▽ More
Let $R$ be a prime ring with center $Z(R)$ and with involution $*$. Given an additive subgroup $A$ of $R$, let $T(A):=\{x+x^*\mid x\in A\}$ and $K_0(A):=\{x-x^*\mid x\in A\}$. Let $L$ be a non-abelian Lie ideal of $R$. It is proved that if $d$ is a nonzero derivation of $R$ satisfying $d(T(L))=0$ (resp. $d(K_0(L))=0$), then $T(R)^2\subseteq Z(R)$ (resp. $K_0(R)^2\subseteq Z(R)$). These results are applied to the study of $d(T(M))=0$ and $d(K_0(M))=0$ for noncentral $*$-subrings $M$ of a division ring $R$ such that $M$ is invariant under all inner automorphisms of $R$, and for noncentral additive subgroups $M$ of a prime ring $R$ containing a nontrivial idempotent such that $M$ is invariant under all special inner automorphisms of $R$. The obtained theorems also generalize some recent results on simple artinian rings with involution due to M. Chacron.
△ Less
Submitted 24 December, 2024; v1 submitted 7 December, 2024;
originally announced December 2024.
-
Commutators and products of Lie ideals of prime rings
Authors:
Tsiu-Kwen Lee,
Jheng-Huei Lin
Abstract:
Motivated by some recent results on Lie ideals, it is proved that if $L$ is a Lie ideal of a simple ring $R$ with center $Z(R)$, then $L\subseteq Z(R)$, $L=Z(R)a+Z(R)$ for some noncentral $a\in L$, or $[R, R]\subseteq L$, which gives a generalization of a classical theorem due to Herstein. We also study commutators and products of noncentral Lie ideals of prime rings. Precisely, let $R$ be a prime…
▽ More
Motivated by some recent results on Lie ideals, it is proved that if $L$ is a Lie ideal of a simple ring $R$ with center $Z(R)$, then $L\subseteq Z(R)$, $L=Z(R)a+Z(R)$ for some noncentral $a\in L$, or $[R, R]\subseteq L$, which gives a generalization of a classical theorem due to Herstein. We also study commutators and products of noncentral Lie ideals of prime rings. Precisely, let $R$ be a prime ring with extended centroid $C$. We completely characterize Lie ideals $L$ and elements $a$ of $R$ such that $L+aL$ contains a nonzero ideal of $R$. Given noncentral Lie ideals $K, L$ of $R$, it is proved that $[K, L]=0$ if and only if $KC=LC=Ca+C$ for any noncentral element $a\in L$. As a consequence, we characterize noncentral Lie ideals $K_1,\ldots,K_m$ with $m\geq 2$ such that $K_1K_2\cdots K_m$ contains a nonzero ideal of $R$. Finally, we characterize noncentral Lie ideals $K_j$'s and $L_k$'s satisfying $\big[K_1K_2\cdots K_m, L_1L_2\cdots L_n\big]=0$ from the viewpoint of centralizers.
△ Less
Submitted 7 February, 2025; v1 submitted 1 October, 2024;
originally announced October 2024.
-
Fully noncentral Lie ideals and invariant additive subgroups in rings
Authors:
Eusebio Gardella,
Tsiu-Kwen Lee,
Hannes Thiel
Abstract:
We prove conditions ensuring that a Lie ideal or an invariant additive subgroup in a ring contains all additive commutators. A crucial assumption is that the subgroup is fully noncentral, that is, its image in every quotient is noncentral.
For a unital algebra over a field of characteristic $\neq 2$ where every additive commutator is a sum of square-zero elements, we show that a fully noncentral…
▽ More
We prove conditions ensuring that a Lie ideal or an invariant additive subgroup in a ring contains all additive commutators. A crucial assumption is that the subgroup is fully noncentral, that is, its image in every quotient is noncentral.
For a unital algebra over a field of characteristic $\neq 2$ where every additive commutator is a sum of square-zero elements, we show that a fully noncentral subspace is a Lie ideal if and only if it is invariant under all inner automorphisms. This applies in particular to zero-product balanced algebras.
△ Less
Submitted 2 March, 2025; v1 submitted 5 September, 2024;
originally announced September 2024.
-
Effectiveness of Social Distancing under Partial Compliance of Individuals
Authors:
Hyelim Shin,
Taesik Lee
Abstract:
Social distancing reduces infectious disease transmission by limiting contact frequency and proximity within a community. However, compliance varies due to its impact on daily life. This paper explores the effects of compliance on social distancing effectiveness through a "social distancing game," where community members make decisions based on personal utility. We conducted numerical experiments…
▽ More
Social distancing reduces infectious disease transmission by limiting contact frequency and proximity within a community. However, compliance varies due to its impact on daily life. This paper explores the effects of compliance on social distancing effectiveness through a "social distancing game," where community members make decisions based on personal utility. We conducted numerical experiments to evaluate how different policy settings for social distancing affect disease transmission.
Our findings suggest several key points for developing effective social distancing policies. Firstly, while generally effective, overly strict policies may lead to noncompliance and reduced effectiveness. Secondly, the public health benefits of social distancing need to be balanced against social costs, emphasizing policy efficiency. Lastly, for diseases with low reinfection risk, a segmented policy exempting immune individuals could lessen both infections and socioeconomic costs.
△ Less
Submitted 26 August, 2024;
originally announced August 2024.
-
Change Point Detection in Pairwise Comparison Data with Covariates
Authors:
Yi Han,
Thomas C. M. Lee
Abstract:
This paper introduces the novel piecewise stationary covariate-assisted ranking estimation (PS-CARE) model for analyzing time-evolving pairwise comparison data, enhancing item ranking accuracy through the integration of covariate information. By partitioning the data into distinct, stationary segments, the PS-CARE model adeptly detects temporal shifts in item rankings, known as change points, whos…
▽ More
This paper introduces the novel piecewise stationary covariate-assisted ranking estimation (PS-CARE) model for analyzing time-evolving pairwise comparison data, enhancing item ranking accuracy through the integration of covariate information. By partitioning the data into distinct, stationary segments, the PS-CARE model adeptly detects temporal shifts in item rankings, known as change points, whose number and positions are initially unknown. Leveraging the minimum description length (MDL) principle, this paper establishes a statistically consistent model selection criterion to estimate these unknowns. The practical optimization of this MDL criterion is done with the pruned exact linear time (PELT) algorithm. Empirical evaluations reveal the method's promising performance in accurately locating change points across various simulated scenarios. An application to an NBA dataset yielded meaningful insights that aligned with significant historical events, highlighting the method's practical utility and the MDL criterion's effectiveness in capturing temporal ranking changes. To the best of the authors' knowledge, this research pioneers change point detection in pairwise comparison data with covariate information, representing a significant leap forward in the field of dynamic ranking analysis.
△ Less
Submitted 24 August, 2024;
originally announced August 2024.
-
Arnold-Thom conjecture for the arrival time of surfaces
Authors:
Tang-Kai Lee,
Jingze Zhu
Abstract:
Following Łojasiewicz's uniqueness theorem and Thom's gradient conjecture, Arnold proposed a stronger version about the existence of limit tangents of gradient flow lines for analytic functions. We prove Łojasiewicz's theorem and Arnold's conjecture in the context of arrival time functions for mean curvature flows in $\mathbb R^{n+1}$ with neck or non-degenerate cylindrical singularities. In parti…
▽ More
Following Łojasiewicz's uniqueness theorem and Thom's gradient conjecture, Arnold proposed a stronger version about the existence of limit tangents of gradient flow lines for analytic functions. We prove Łojasiewicz's theorem and Arnold's conjecture in the context of arrival time functions for mean curvature flows in $\mathbb R^{n+1}$ with neck or non-degenerate cylindrical singularities. In particular, we prove the conjectures for all mean convex mean curvature flows of surfaces, including the cases when the arrival time functions are not $C^2.$ The results also apply to mean curvature flows starting from two-spheres or generic closed surfaces.
△ Less
Submitted 25 June, 2025; v1 submitted 29 May, 2024;
originally announced May 2024.
-
Closed mean curvature flows with asymptotically conical singularities
Authors:
Tang-Kai Lee,
Xinrui Zhao
Abstract:
In this paper, we prove that for any asymptotically conical self-shrinker, there exists an embedded closed hypersurface such that the mean curvature flow starting from it develops a singularity modeled on the given shrinker. The main technique is the Ważewski box argument, used by Stolarski in the proof of the corresponding theorem in the Ricci flow case. As a corollary, our construction, combined…
▽ More
In this paper, we prove that for any asymptotically conical self-shrinker, there exists an embedded closed hypersurface such that the mean curvature flow starting from it develops a singularity modeled on the given shrinker. The main technique is the Ważewski box argument, used by Stolarski in the proof of the corresponding theorem in the Ricci flow case. As a corollary, our construction, combined with the works of Angenent--Ilmanen--Velázquez and Chodosh--Daniels-Holgate--Schulze, implies the existence of fattening level set flows starting from smooth embedded closed hypersurfaces. These provide examples related to a question asked by Evans--Spruck.
△ Less
Submitted 13 August, 2024; v1 submitted 24 May, 2024;
originally announced May 2024.
-
Ancient mean curvature flows with finite total curvature
Authors:
Kyeongsu Choi,
Jiuzhou Huang,
Taehun Lee
Abstract:
We construct an $I$-family of ancient graphical mean curvature flows over a minimal hypersurface in $\mathbb{R}^{n+1}$ of finite total curvature with the Morse index $I$ by establishing exponentially fast convergence in terms of $|x|^2-t$. As a corollary, we show that these ancient flows have finite total curvature and finite mass drop. Moreover, one family of these flows is mean convex by a point…
▽ More
We construct an $I$-family of ancient graphical mean curvature flows over a minimal hypersurface in $\mathbb{R}^{n+1}$ of finite total curvature with the Morse index $I$ by establishing exponentially fast convergence in terms of $|x|^2-t$. As a corollary, we show that these ancient flows have finite total curvature and finite mass drop. Moreover, one family of these flows is mean convex by a pointwise estimate.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Finite distance problem on the moduli of non-Kähler Calabi--Yau $\partial\bar{\partial}$-threefolds
Authors:
Tsung-Ju Lee
Abstract:
In this article, we study the finite distance problem with respect to the period-map metric on the moduli of non-Kähler Calabi--Yau $\partial\bar{\partial}$-threefolds via Hodge theory. We extended C.-L. Wang's finite distance criterion for one-parameter degenerations to the present setting. As a byproduct, we also obtained a sufficient condition for a non-Kähler Calabi--Yau to support the…
▽ More
In this article, we study the finite distance problem with respect to the period-map metric on the moduli of non-Kähler Calabi--Yau $\partial\bar{\partial}$-threefolds via Hodge theory. We extended C.-L. Wang's finite distance criterion for one-parameter degenerations to the present setting. As a byproduct, we also obtained a sufficient condition for a non-Kähler Calabi--Yau to support the $\partial\bar{\partial}$-lemma which generalizes the results by Friedman and Li. We also proved that the non-Kähler Calabi--Yau threefolds constructed by Hashimoto and Sano support the $\partial\bar{\partial}$-lemma.
△ Less
Submitted 29 April, 2024;
originally announced April 2024.
-
Improving the Bit Complexity of Communication for Distributed Convex Optimization
Authors:
Mehrdad Ghadiri,
Yin Tat Lee,
Swati Padmanabhan,
William Swartworth,
David Woodruff,
Guanghao Ye
Abstract:
We consider the communication complexity of some fundamental convex optimization problems in the point-to-point (coordinator) and blackboard communication models. We strengthen known bounds for approximately solving linear regression, $p$-norm regression (for $1\leq p\leq 2$), linear programming, minimizing the sum of finitely many convex nonsmooth functions with varying supports, and low rank app…
▽ More
We consider the communication complexity of some fundamental convex optimization problems in the point-to-point (coordinator) and blackboard communication models. We strengthen known bounds for approximately solving linear regression, $p$-norm regression (for $1\leq p\leq 2$), linear programming, minimizing the sum of finitely many convex nonsmooth functions with varying supports, and low rank approximation; for a number of these fundamental problems our bounds are nearly optimal, as proven by our lower bounds.
Among our techniques, we use the notion of block leverage scores, which have been relatively unexplored in this context, as well as dropping all but the ``middle" bits in Richardson-style algorithms. We also introduce a new communication problem for accurately approximating inner products and establish a lower bound using the spherical Radon transform. Our lower bound can be used to show the first separation of linear programming and linear systems in the distributed model when the number of constraints is polynomial, addressing an open question in prior work.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
Certain functional identities on division rings of characteristic two
Authors:
Münevver Pınar Eroğlu,
Tsiu-Kwen Lee,
Jheng-Huei Lin
Abstract:
Let $D$ be a noncommutative division ring. In a recent paper, Lee and Lin proved that if $\text{char}\, D\ne 2$, the only solution of additive maps $f, g$ on $D$ satisfying the identity $f(x) = x^n g(x^{-1})$ on $D\setminus \{0\}$ with $n\ne 2$ a positive integer is the trivial case, that is, $f=0$ and $g=0$. Applying Hua's identity and the theory of functional and generalized polynomial identitie…
▽ More
Let $D$ be a noncommutative division ring. In a recent paper, Lee and Lin proved that if $\text{char}\, D\ne 2$, the only solution of additive maps $f, g$ on $D$ satisfying the identity $f(x) = x^n g(x^{-1})$ on $D\setminus \{0\}$ with $n\ne 2$ a positive integer is the trivial case, that is, $f=0$ and $g=0$. Applying Hua's identity and the theory of functional and generalized polynomial identities, we give a complete solution of the same identity for any nonnegative integer $n$ if $\text{char}\, D=2$.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
Branching algebras for the general linear Lie superalgebra
Authors:
Soo Teck Lee,
Ruibin Zhang
Abstract:
We develop an algebraic approach to the branching of representations of the general linear Lie superalgebra $\mathfrak{gl}_{p|q}({\mathbb C})$, by constructing certain super commutative algebras whose structure encodes the branching rules. Using this approach, we derive the branching rules for restricting any irreducible polynomial representation $V$ of $\mathfrak{gl}_{p|q}({\mathbb C})$ to a regu…
▽ More
We develop an algebraic approach to the branching of representations of the general linear Lie superalgebra $\mathfrak{gl}_{p|q}({\mathbb C})$, by constructing certain super commutative algebras whose structure encodes the branching rules. Using this approach, we derive the branching rules for restricting any irreducible polynomial representation $V$ of $\mathfrak{gl}_{p|q}({\mathbb C})$ to a regular subalgebra isomorphic to $\mathfrak{gl}_{r|s}({\mathbb C})\oplus \mathfrak{gl}_{r'|s'}({\mathbb C})$, $\mathfrak{gl}_{r|s}({\mathbb C})\oplus\mathfrak{gl}_1({\mathbb C})^{r'+s'}$ or $\mathfrak{gl}_{r|s}({\mathbb C})$, with $r+r'=p$ and $s+s'=q$. In the case of $\mathfrak{gl}_{r|s}({\mathbb C})\oplus\mathfrak{gl}_1({\mathbb C})^{r'+s'}$ with $s=0$ or $s=1$ but general $r$, we also construct a basis for the space of $\mathfrak{gl}_{r|s}({\mathbb C})$ highest weight vectors in $V$; when $r=s=0$, the branching rule leads to explicit expressions for the weight multiplicities of $V$ in terms of Kostka numbers.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
The $X$-semiprimeness of Rings
Authors:
Grigore Călugăreanu,
Tsiu-Kwen Lee,
Jerzy Matczuk
Abstract:
For a nonempty subset $X$ of a ring $R$, the ring $R$ is called $X$-semiprime if, given $a\in R$, $aXa=0$ implies $a=0$. This provides a proper class of semiprime rings. First, we clarify the relationship between idempotent semiprime and unit-semiprime rings. Secondly, given a Lie ideal $L$ of a ring $R$, we offer a criterion for $R$ to be $L$-semiprime. For a prime ring $R$, we characterizes Lie…
▽ More
For a nonempty subset $X$ of a ring $R$, the ring $R$ is called $X$-semiprime if, given $a\in R$, $aXa=0$ implies $a=0$. This provides a proper class of semiprime rings. First, we clarify the relationship between idempotent semiprime and unit-semiprime rings. Secondly, given a Lie ideal $L$ of a ring $R$, we offer a criterion for $R$ to be $L$-semiprime. For a prime ring $R$, we characterizes Lie ideals $L$ of $R$ such that $R$ is $L$-semiprime. Moreover, $X$-semiprimeness of matrix rings, prime rings (with a nontrivial idempotent), semiprime rings, regular rings, and subdirect products are studied.
△ Less
Submitted 9 April, 2024; v1 submitted 29 February, 2024;
originally announced February 2024.
-
Certain functional identities on division rings
Authors:
Tsiu-Kwen Lee,
Jheng-Huei Lin
Abstract:
We study the functional identity $G(x)f(x)=H(x)$ on a division ring $D$, where $f \colon D\to D$ is an additive map and $G(X)\ne 0, H(X)$ are generalized polynomials in the variable $X$ with coefficients in $D$. Precisely, it is proved that either $D$ is finite-dimensional over its center or $f$ is an elementary operator. Applying the result and its consequences, we prove that if $D$ is a noncommu…
▽ More
We study the functional identity $G(x)f(x)=H(x)$ on a division ring $D$, where $f \colon D\to D$ is an additive map and $G(X)\ne 0, H(X)$ are generalized polynomials in the variable $X$ with coefficients in $D$. Precisely, it is proved that either $D$ is finite-dimensional over its center or $f$ is an elementary operator. Applying the result and its consequences, we prove that if $D$ is a noncommutative division ring of characteristic not $2$, then the only solution of additive maps $f, g$ on $D$ satisfying the identity $f(x) = x^n g(x^{-1})$ with $n\ne 2$ a positive integer is the trivial case, that is, $f=0$ and $g=0$. This extends Catalano and Merchán's result in 2023 to get a complete solution.
△ Less
Submitted 5 January, 2024;
originally announced January 2024.
-
Neural Network with Local Converging Input (NNLCI) for Supersonic Flow Problems with Unstructured Grids
Authors:
Weiming Ding,
Haoxiang Huang,
Tzu Jung Lee,
Yingjie Liu,
Vigor Yang
Abstract:
In recent years, surrogate models based on deep neural networks (DNN) have been widely used to solve partial differential equations, which were traditionally handled by means of numerical simulations. This kind of surrogate models, however, focuses on global interpolation of the training dataset, and thus requires a large network structure. The process is both time consuming and computationally co…
▽ More
In recent years, surrogate models based on deep neural networks (DNN) have been widely used to solve partial differential equations, which were traditionally handled by means of numerical simulations. This kind of surrogate models, however, focuses on global interpolation of the training dataset, and thus requires a large network structure. The process is both time consuming and computationally costly, thereby restricting their use for high-fidelity prediction of complex physical problems. In the present study, we develop a neural network with local converging input (NNLCI) for high-fidelity prediction using unstructured data. The framework utilizes the local domain of dependence with converging coarse solutions as input, which greatly reduces computational resource and training time. As a validation case, the NNLCI method is applied to study inviscid supersonic flows in channels with bumps. Different bump geometries and locations are considered to benchmark the effectiveness and versability of the proposed approach. Detailed flow structures, including shock-wave interactions, are examined systematically.
△ Less
Submitted 23 October, 2023;
originally announced October 2023.
-
Gauss curvature flow with shrinking obstacle
Authors:
Ki-Ahm Lee,
Taehun Lee
Abstract:
We consider a flow by powers of Gauss curvature under the obstruction that the flow cannot penetrate a prescribed region, so called an obstacle. For all dimensions and positive powers, we prove the optimal curvature bounds of solutions and all time existence with its long time behavior. We also prove the $C^1$ regularity of free boundaries under a uniform thickness assumption.
We consider a flow by powers of Gauss curvature under the obstruction that the flow cannot penetrate a prescribed region, so called an obstacle. For all dimensions and positive powers, we prove the optimal curvature bounds of solutions and all time existence with its long time behavior. We also prove the $C^1$ regularity of free boundaries under a uniform thickness assumption.
△ Less
Submitted 4 October, 2023;
originally announced October 2023.
-
A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale
Authors:
Hao-Jun Michael Shi,
Tsung-Hsien Lee,
Shintaro Iwasaki,
Jose Gallego-Posada,
Zhijing Li,
Kaushik Rangadurai,
Dheevatsa Mudigere,
Michael Rabbat
Abstract:
Shampoo is an online and stochastic optimization algorithm belonging to the AdaGrad family of methods for training neural networks. It constructs a block-diagonal preconditioner where each block consists of a coarse Kronecker product approximation to full-matrix AdaGrad for each parameter of the neural network. In this work, we provide a complete description of the algorithm as well as the perform…
▽ More
Shampoo is an online and stochastic optimization algorithm belonging to the AdaGrad family of methods for training neural networks. It constructs a block-diagonal preconditioner where each block consists of a coarse Kronecker product approximation to full-matrix AdaGrad for each parameter of the neural network. In this work, we provide a complete description of the algorithm as well as the performance optimizations that our implementation leverages to train deep networks at-scale in PyTorch. Our implementation enables fast multi-GPU distributed data-parallel training by distributing the memory and computation associated with blocks of each parameter via PyTorch's DTensor data structure and performing an AllGather primitive on the computed search directions at each iteration. This major performance enhancement enables us to achieve at most a 10% performance reduction in per-step wall-clock time compared against standard diagonal-scaling-based adaptive gradient methods. We validate our implementation by performing an ablation study on training ImageNet ResNet50, demonstrating Shampoo's superiority over standard training recipes with minimal hyperparameter tuning.
△ Less
Submitted 12 September, 2023;
originally announced September 2023.
-
Oriented embedding functors of tori as homogeneous spaces
Authors:
Philippe Gille,
Ting-Yu Lee
Abstract:
We provide a characterization of homogeneous spaces under a reductive group scheme such that the geometric stabilizers are maximal tori. The quasi-split case over a semilocal base is of special interest and permits to answer a question raised by Marc Levine on homogeneous SL$_n$-spaces. At the end, we provide an application to the local-global principles for embeddings of étale algebras wit…
▽ More
We provide a characterization of homogeneous spaces under a reductive group scheme such that the geometric stabilizers are maximal tori. The quasi-split case over a semilocal base is of special interest and permits to answer a question raised by Marc Levine on homogeneous SL$_n$-spaces. At the end, we provide an application to the local-global principles for embeddings of étale algebras with involution into central simple algebras with involution.
△ Less
Submitted 3 February, 2025; v1 submitted 31 July, 2023;
originally announced July 2023.
-
Non-commutative resolutions as mirrors of singular Calabi--Yau varieties
Authors:
Tsung-Ju Lee,
Bong H. Lian,
Mauricio Romo
Abstract:
It has been conjectured that the hemisphere partition function arXiv:1308.2217, arXiv:1308.2438 in a gauged linear sigma model (GLSM) computes the central charge arXiv:math/0212237 of an object in the bounded derived category of coherent sheaves for Calabi--Yau (CY) manifolds. There is also evidence in arXiv:alg-geom/ 9511001, arXiv:hep-th/0007071. On the other hand, non-commutative resolutions of…
▽ More
It has been conjectured that the hemisphere partition function arXiv:1308.2217, arXiv:1308.2438 in a gauged linear sigma model (GLSM) computes the central charge arXiv:math/0212237 of an object in the bounded derived category of coherent sheaves for Calabi--Yau (CY) manifolds. There is also evidence in arXiv:alg-geom/ 9511001, arXiv:hep-th/0007071. On the other hand, non-commutative resolutions of singular CY varieties have been studied in the context of abelian GLSMs arXiv:0709.3855. In this paper, we study an analogous construction of abelian GLSMs for non-commutative resolutions and propose they can be used to study a class of recently discovered mirror pairs of singular CY varieties. Our main result shows that the hemisphere partition functions (a.k.a.~$A$-periods) in the new GLSM are in fact period integrals (a.k.a.~$B$-periods) of the singular CY varieties. We conjecture that the two are completely equivalent: $B$-periods are the same as $A$-periods. We give some examples to support this conjecture and formulate some expected homological mirror symmetry (HMS) relation between the GLSM theory and the CY. As shown in arXiv:2003.07148, the $B$-periods in this case are precisely given by a certain fractional version of the $B$-series of arXiv:alg-geom/9511001. Since a hemisphere partition function is defined as a contour integral in a cone in the complexified secondary fan (or FI-theta parameter space) arXiv:1308.2438, it can be reduced to a sum of residues (by theorems of Passare-Tsikh-Zhdanov and Tsikh-Zhdanov). Our conjecture shows that this residue sum may now be amenable to computations in terms of the $B$-series.
△ Less
Submitted 5 July, 2023;
originally announced July 2023.
-
On the uniqueness of energy-minimizing curves in constrained spaces
Authors:
Ki-Ahm Lee,
Taehun Lee
Abstract:
In this paper, we investigate energy-minimizing curves with fixed endpoints $p$ and $q$ in a constrained space. We prove that when one of the endpoints, say $p$, is fixed, the set of points $q$ for which the energy-minimizing curve is not unique has no interior points.
In this paper, we investigate energy-minimizing curves with fixed endpoints $p$ and $q$ in a constrained space. We prove that when one of the endpoints, say $p$, is fixed, the set of points $q$ for which the energy-minimizing curve is not unique has no interior points.
△ Less
Submitted 20 July, 2023; v1 submitted 12 June, 2023;
originally announced June 2023.
-
Curvature bound for $L_p$ Minkowski problem
Authors:
Kyeongsu Choi,
Minhyun Kim,
Taehun Lee
Abstract:
We establish curvature estimates for anisotropic Gauss curvature flows. By using this, we show that given a measure $μ$ with a positive smooth density $f$, any solution to the $L_p$ Minkowski problem in $\mathbb{R}^{n+1}$ with $p \le -n+2$ is a hypersurface of class $C^{1,1}$. This is a sharp result because for each $p\in [-n+2,1)$ there exists a convex hypersurface of class…
▽ More
We establish curvature estimates for anisotropic Gauss curvature flows. By using this, we show that given a measure $μ$ with a positive smooth density $f$, any solution to the $L_p$ Minkowski problem in $\mathbb{R}^{n+1}$ with $p \le -n+2$ is a hypersurface of class $C^{1,1}$. This is a sharp result because for each $p\in [-n+2,1)$ there exists a convex hypersurface of class $C^{1,\frac{1}{n+p-1}}$ which is a solution to the $L_p$ Minkowski problem for a positive smooth density $f$. In particular, the $C^{1,1}$ regularity is optimal in the case $p=-n+2$ which includes the logarithmic Minkowski problem in $\mathbb{R}^3$.
△ Less
Submitted 17 September, 2024; v1 submitted 23 April, 2023;
originally announced April 2023.
-
Regularization of the inverse Laplace transform by Mollification
Authors:
Pierre Maréchal,
Faouzi Triki,
Walter C. Simo Tao Lee
Abstract:
In this paper we study the inverse Laplace transform. We first derive a new global logarithmic stability estimate that shows that the inversion is severely ill-posed. Then we propose a regularization method to compute the inverse Laplace transform using the concept of mollification. Taking into account the exponential instability we derive a criterion for selection of the regularization parameter.…
▽ More
In this paper we study the inverse Laplace transform. We first derive a new global logarithmic stability estimate that shows that the inversion is severely ill-posed. Then we propose a regularization method to compute the inverse Laplace transform using the concept of mollification. Taking into account the exponential instability we derive a criterion for selection of the regularization parameter. We show that by taking the optimal value of this parameter we improve significantly the convergence of the method. Finally, making use of the holomorphic extension of the Laplace transform, we suggest a new PDEs based numerical method for the computation of the solution. The effectiveness of the proposed regularization method is demonstrated through several numerical examples.
△ Less
Submitted 17 April, 2023;
originally announced April 2023.
-
An eigenvalue problem for prescribed curvature equations
Authors:
Taehun Lee
Abstract:
We study an eigenvalue problem for prescribed $σ_k$-curvature equations of star-shaped, $k$-convex, closed hypersurfaces. We establish the existence of a unique eigenvalue and its associated hypersurface, which is also unique, provided that the given data is even. Moreover, we show that the hypersurface must be strictly convex. A crucial aspect of our proof involves deriving uniform estimates in…
▽ More
We study an eigenvalue problem for prescribed $σ_k$-curvature equations of star-shaped, $k$-convex, closed hypersurfaces. We establish the existence of a unique eigenvalue and its associated hypersurface, which is also unique, provided that the given data is even. Moreover, we show that the hypersurface must be strictly convex. A crucial aspect of our proof involves deriving uniform estimates in $p$ for $L_p$-type prescribed curvature equations.
△ Less
Submitted 22 September, 2023; v1 submitted 15 April, 2023;
originally announced April 2023.
-
Convex Minimization with Integer Minima in $\widetilde O(n^4)$ Time
Authors:
Haotian Jiang,
Yin Tat Lee,
Zhao Song,
Lichen Zhang
Abstract:
Given a convex function $f$ on $\mathbb{R}^n$ with an integer minimizer, we show how to find an exact minimizer of $f$ using $O(n^2 \log n)$ calls to a separation oracle and $O(n^4 \log n)$ time. The previous best polynomial time algorithm for this problem given in [Jiang, SODA 2021, JACM 2022] achieves $O(n^2\log\log n/\log n)$ oracle complexity. However, the overall runtime of Jiang's algorithm…
▽ More
Given a convex function $f$ on $\mathbb{R}^n$ with an integer minimizer, we show how to find an exact minimizer of $f$ using $O(n^2 \log n)$ calls to a separation oracle and $O(n^4 \log n)$ time. The previous best polynomial time algorithm for this problem given in [Jiang, SODA 2021, JACM 2022] achieves $O(n^2\log\log n/\log n)$ oracle complexity. However, the overall runtime of Jiang's algorithm is at least $\widetildeΩ(n^8)$, due to expensive sub-routines such as the Lenstra-Lenstra-Lovász (LLL) algorithm [Lenstra, Lenstra, Lovász, Math. Ann. 1982] and random walk based cutting plane method [Bertsimas, Vempala, JACM 2004]. Our significant speedup is obtained by a nontrivial combination of a faster version of the LLL algorithm due to [Neumaier, Stehlé, ISSAC 2016] that gives similar guarantees, the volumetric center cutting plane method (CPM) by [Vaidya, FOCS 1989] and its fast implementation given in [Jiang, Lee, Song, Wong, STOC 2020].
For the special case of submodular function minimization (SFM), our result implies a strongly polynomial time algorithm for this problem using $O(n^3 \log n)$ calls to an evaluation oracle and $O(n^4 \log n)$ additional arithmetic operations. Both the oracle complexity and the number of arithmetic operations of our more general algorithm are better than the previous best-known runtime algorithms for this specific problem given in [Lee, Sidford, Wong, FOCS 2015] and [Dadush, Végh, Zambelli, SODA 2018, MOR 2021].
△ Less
Submitted 14 November, 2023; v1 submitted 6 April, 2023;
originally announced April 2023.
-
Twisted GKZ hypergeometric functions and relative cohomology
Authors:
Tsung-Ju Lee,
Dingxin Zhang
Abstract:
We investigate the GKZ $A$-hypergeometric $\mathscr{D}$-modules, introduced by Gel'fand, Kapranov, and Zelevinskii, arising from cyclic covers of toric varieties and find its Riemann--Hilbert partner. This extends our earlier results in arXiv:1902.01536.
We investigate the GKZ $A$-hypergeometric $\mathscr{D}$-modules, introduced by Gel'fand, Kapranov, and Zelevinskii, arising from cyclic covers of toric varieties and find its Riemann--Hilbert partner. This extends our earlier results in arXiv:1902.01536.
△ Less
Submitted 16 February, 2023;
originally announced February 2023.
-
Algorithmic Aspects of the Log-Laplace Transform and a Non-Euclidean Proximal Sampler
Authors:
Sivakanth Gopi,
Yin Tat Lee,
Daogao Liu,
Ruoqi Shen,
Kevin Tian
Abstract:
The development of efficient sampling algorithms catering to non-Euclidean geometries has been a challenging endeavor, as discretization techniques which succeed in the Euclidean setting do not readily carry over to more general settings. We develop a non-Euclidean analog of the recent proximal sampler of [LST21], which naturally induces regularization by an object known as the log-Laplace transfo…
▽ More
The development of efficient sampling algorithms catering to non-Euclidean geometries has been a challenging endeavor, as discretization techniques which succeed in the Euclidean setting do not readily carry over to more general settings. We develop a non-Euclidean analog of the recent proximal sampler of [LST21], which naturally induces regularization by an object known as the log-Laplace transform (LLT) of a density. We prove new mathematical properties (with an algorithmic flavor) of the LLT, such as strong convexity-smoothness duality and an isoperimetric inequality, which are used to prove a mixing time on our proximal sampler matching [LST21] under a warm start. As our main application, we show our warm-started sampler improves the value oracle complexity of differentially private convex optimization in $\ell_p$ and Schatten-$p$ norms for $p \in [1, 2]$ to match the Euclidean setting [GLL22], while retaining state-of-the-art excess risk bounds [GLLST23]. We find our investigation of the LLT to be a promising proof-of-concept of its utility as a tool for designing samplers, and outline directions for future exploration.
△ Less
Submitted 22 February, 2023; v1 submitted 12 February, 2023;
originally announced February 2023.
-
Uniqueness of conical singularities for mean curvature flows
Authors:
Tang-Kai Lee,
Xinrui Zhao
Abstract:
In this paper, we prove the uniqueness of asymptotically conical tangent flows in all codimensions. This is based on an early work of Chodosh-Schulze, who proved the uniqueness in the hypersurface case.
In this paper, we prove the uniqueness of asymptotically conical tangent flows in all codimensions. This is based on an early work of Chodosh-Schulze, who proved the uniqueness in the hypersurface case.
△ Less
Submitted 2 February, 2023; v1 submitted 25 January, 2023;
originally announced January 2023.
-
ReSQueing Parallel and Private Stochastic Convex Optimization
Authors:
Yair Carmon,
Arun Jambulapati,
Yujia Jin,
Yin Tat Lee,
Daogao Liu,
Aaron Sidford,
Kevin Tian
Abstract:
We introduce a new tool for stochastic convex optimization (SCO): a Reweighted Stochastic Query (ReSQue) estimator for the gradient of a function convolved with a (Gaussian) probability density. Combining ReSQue with recent advances in ball oracle acceleration [CJJJLST20, ACJJS21], we develop algorithms achieving state-of-the-art complexities for SCO in parallel and private settings. For a SCO obj…
▽ More
We introduce a new tool for stochastic convex optimization (SCO): a Reweighted Stochastic Query (ReSQue) estimator for the gradient of a function convolved with a (Gaussian) probability density. Combining ReSQue with recent advances in ball oracle acceleration [CJJJLST20, ACJJS21], we develop algorithms achieving state-of-the-art complexities for SCO in parallel and private settings. For a SCO objective constrained to the unit ball in $\mathbb{R}^d$, we obtain the following results (up to polylogarithmic factors). We give a parallel algorithm obtaining optimization error $ε_{\text{opt}}$ with $d^{1/3}ε_{\text{opt}}^{-2/3}$ gradient oracle query depth and $d^{1/3}ε_{\text{opt}}^{-2/3} + ε_{\text{opt}}^{-2}$ gradient queries in total, assuming access to a bounded-variance stochastic gradient estimator. For $ε_{\text{opt}} \in [d^{-1}, d^{-1/4}]$, our algorithm matches the state-of-the-art oracle depth of [BJLLS19] while maintaining the optimal total work of stochastic gradient descent. Given $n$ samples of Lipschitz loss functions, prior works [BFTT19, BFGT20, AFKT21, KLL21] established that if $n \gtrsim d ε_{\text{dp}}^{-2}$, $(ε_{\text{dp}}, δ)$-differential privacy is attained at no asymptotic cost to the SCO utility. However, these prior works all required a superlinear number of gradient queries. We close this gap for sufficiently large $n \gtrsim d^2 ε_{\text{dp}}^{-3}$, by using ReSQue to design an algorithm with near-linear gradient query complexity in this regime.
△ Less
Submitted 27 October, 2023; v1 submitted 1 January, 2023;
originally announced January 2023.
-
Learning threshold neurons via the "edge of stability"
Authors:
Kwangjun Ahn,
Sébastien Bubeck,
Sinho Chewi,
Yin Tat Lee,
Felipe Suarez,
Yi Zhang
Abstract:
Existing analyses of neural network training often operate under the unrealistic assumption of an extremely small learning rate. This lies in stark contrast to practical wisdom and empirical studies, such as the work of J. Cohen et al. (ICLR 2021), which exhibit startling new phenomena (the "edge of stability" or "unstable convergence") and potential benefits for generalization in the large learni…
▽ More
Existing analyses of neural network training often operate under the unrealistic assumption of an extremely small learning rate. This lies in stark contrast to practical wisdom and empirical studies, such as the work of J. Cohen et al. (ICLR 2021), which exhibit startling new phenomena (the "edge of stability" or "unstable convergence") and potential benefits for generalization in the large learning rate regime. Despite a flurry of recent works on this topic, however, the latter effect is still poorly understood. In this paper, we take a step towards understanding genuinely non-convex training dynamics with large learning rates by performing a detailed analysis of gradient descent for simplified models of two-layer neural networks. For these models, we provably establish the edge of stability phenomenon and discover a sharp phase transition for the step size below which the neural network fails to learn "threshold-like" neurons (i.e., neurons with a non-zero first-layer bias). This elucidates one possible mechanism by which the edge of stability can in fact lead to better generalization, as threshold neurons are basic building blocks with useful inductive bias for many tasks.
△ Less
Submitted 19 October, 2023; v1 submitted 14 December, 2022;
originally announced December 2022.
-
Parabolic frequency for the mean curvature flow
Authors:
Julius Baldauf,
Tang-Kai Lee
Abstract:
This paper defines a parabolic frequency for solutions of the heat equation along homothetically shrinking mean curvature flows and proves its monotonicity along such flows. As a corollary, frequency monotonicity provides a proof of backwards uniqueness. Additionally, for solutions of more general parabolic equations on mean curvature flow shrinkers, this paper provides bounds on the derivative of…
▽ More
This paper defines a parabolic frequency for solutions of the heat equation along homothetically shrinking mean curvature flows and proves its monotonicity along such flows. As a corollary, frequency monotonicity provides a proof of backwards uniqueness. Additionally, for solutions of more general parabolic equations on mean curvature flow shrinkers, this paper provides bounds on the derivative of the frequency, which similarly imply backwards uniqueness.
△ Less
Submitted 25 October, 2022;
originally announced October 2022.
-
Condition-number-independent convergence rate of Riemannian Hamiltonian Monte Carlo with numerical integrators
Authors:
Yunbum Kook,
Yin Tat Lee,
Ruoqi Shen,
Santosh S. Vempala
Abstract:
We study the convergence rate of discretized Riemannian Hamiltonian Monte Carlo on sampling from distributions in the form of $e^{-f(x)}$ on a convex body $\mathcal{M}\subset\mathbb{R}^{n}$. We show that for distributions in the form of $e^{-α^{\top}x}$ on a polytope with $m$ constraints, the convergence rate of a family of commonly-used integrators is independent of…
▽ More
We study the convergence rate of discretized Riemannian Hamiltonian Monte Carlo on sampling from distributions in the form of $e^{-f(x)}$ on a convex body $\mathcal{M}\subset\mathbb{R}^{n}$. We show that for distributions in the form of $e^{-α^{\top}x}$ on a polytope with $m$ constraints, the convergence rate of a family of commonly-used integrators is independent of $\left\Vert α\right\Vert _{2}$ and the geometry of the polytope. In particular, the implicit midpoint method (IMM) and the generalized Leapfrog method (LM) have a mixing time of $\widetilde{O}\left(mn^{3}\right)$ to achieve $ε$ total variation distance to the target distribution. These guarantees are based on a general bound on the convergence rate for densities of the form $e^{-f(x)}$ in terms of parameters of the manifold and the integrator. Our theoretical guarantee complements the empirical results of [KLSV22], which shows that RHMC with IMM can sample ill-conditioned, non-smooth and constrained distributions in very high dimension efficiently in practice.
△ Less
Submitted 10 February, 2023; v1 submitted 13 October, 2022;
originally announced October 2022.
-
Period domains for gravitational instantons
Authors:
Tsung-Ju Lee,
Yu-Shen Lin
Abstract:
Based on the uniformization theorems of gravitation instantons by Chen--Chen arXiv:1505.01790, Chen--Viaclovsky arXiv:2110.06498, Collins--Jacob--Lin arXiv:2111.09260, and Hein--Sun--Viaclovsky--Zhang arXiv:2111.09287, we prove that the period maps for the ALH*, ALG, and ALG* gravitational instantons are surjective.
Based on the uniformization theorems of gravitation instantons by Chen--Chen arXiv:1505.01790, Chen--Viaclovsky arXiv:2110.06498, Collins--Jacob--Lin arXiv:2111.09260, and Hein--Sun--Viaclovsky--Zhang arXiv:2111.09287, we prove that the period maps for the ALH*, ALG, and ALG* gravitational instantons are surjective.
△ Less
Submitted 29 December, 2022; v1 submitted 27 August, 2022;
originally announced August 2022.
-
A Slightly Improved Bound for the KLS Constant
Authors:
Arun Jambulapati,
Yin Tat Lee,
Santosh S. Vempala
Abstract:
We refine the recent breakthrough technique of Klartag and Lehec to obtain an improved polylogarithmic bound for the KLS constant.
We refine the recent breakthrough technique of Klartag and Lehec to obtain an improved polylogarithmic bound for the KLS constant.
△ Less
Submitted 6 October, 2022; v1 submitted 24 August, 2022;
originally announced August 2022.
-
Diameter estimate for planar $L_p$ dual Minkowski problem
Authors:
Minhyun Kim,
Taehun Lee
Abstract:
In this paper, given a prescribed measure on $\mathbb{S}^1$ whose density is bounded and positive, we establish a uniform diameter estimate for solutions to the planar $L_p$ dual Minkowski problem when $0<p<1$ and $q\ge 2$. We also prove the uniqueness and positivity of solutions to the $L_p$ Minkowski problem when the density of the measure is sufficiently close to a constant in $C^α$.
In this paper, given a prescribed measure on $\mathbb{S}^1$ whose density is bounded and positive, we establish a uniform diameter estimate for solutions to the planar $L_p$ dual Minkowski problem when $0<p<1$ and $q\ge 2$. We also prove the uniqueness and positivity of solutions to the $L_p$ Minkowski problem when the density of the measure is sufficiently close to a constant in $C^α$.
△ Less
Submitted 12 August, 2022;
originally announced August 2022.
-
Decomposable Non-Smooth Convex Optimization with Nearly-Linear Gradient Oracle Complexity
Authors:
Sally Dong,
Haotian Jiang,
Yin Tat Lee,
Swati Padmanabhan,
Guanghao Ye
Abstract:
Many fundamental problems in machine learning can be formulated by the convex program \[ \min_{θ\in R^d}\ \sum_{i=1}^{n}f_{i}(θ), \] where each $f_i$ is a convex, Lipschitz function supported on a subset of $d_i$ coordinates of $θ$. One common approach to this problem, exemplified by stochastic gradient descent, involves sampling one $f_i$ term at every iteration to make progress. This approach cr…
▽ More
Many fundamental problems in machine learning can be formulated by the convex program \[ \min_{θ\in R^d}\ \sum_{i=1}^{n}f_{i}(θ), \] where each $f_i$ is a convex, Lipschitz function supported on a subset of $d_i$ coordinates of $θ$. One common approach to this problem, exemplified by stochastic gradient descent, involves sampling one $f_i$ term at every iteration to make progress. This approach crucially relies on a notion of uniformity across the $f_i$'s, formally captured by their condition number. In this work, we give an algorithm that minimizes the above convex formulation to $ε$-accuracy in $\widetilde{O}(\sum_{i=1}^n d_i \log (1 /ε))$ gradient computations, with no assumptions on the condition number. The previous best algorithm independent of the condition number is the standard cutting plane method, which requires $O(nd \log (1/ε))$ gradient computations. As a corollary, we improve upon the evaluation oracle complexity for decomposable submodular minimization by Axiotis et al. (ICML 2021). Our main technical contribution is an adaptive procedure to select an $f_i$ term at every iteration via a novel combination of cutting-plane and interior-point methods.
△ Less
Submitted 7 August, 2022;
originally announced August 2022.
-
Private Convex Optimization in General Norms
Authors:
Sivakanth Gopi,
Yin Tat Lee,
Daogao Liu,
Ruoqi Shen,
Kevin Tian
Abstract:
We propose a new framework for differentially private optimization of convex functions which are Lipschitz in an arbitrary norm $\|\cdot\|$. Our algorithms are based on a regularized exponential mechanism which samples from the density $\propto \exp(-k(F+μr))$ where $F$ is the empirical loss and $r$ is a regularizer which is strongly convex with respect to $\|\cdot\|$, generalizing a recent work o…
▽ More
We propose a new framework for differentially private optimization of convex functions which are Lipschitz in an arbitrary norm $\|\cdot\|$. Our algorithms are based on a regularized exponential mechanism which samples from the density $\propto \exp(-k(F+μr))$ where $F$ is the empirical loss and $r$ is a regularizer which is strongly convex with respect to $\|\cdot\|$, generalizing a recent work of [Gopi, Lee, Liu '22] to non-Euclidean settings. We show that this mechanism satisfies Gaussian differential privacy and solves both DP-ERM (empirical risk minimization) and DP-SCO (stochastic convex optimization) by using localization tools from convex geometry. Our framework is the first to apply to private convex optimization in general normed spaces and directly recovers non-private SCO rates achieved by mirror descent as the privacy parameter $ε\to \infty$. As applications, for Lipschitz optimization in $\ell_p$ norms for all $p \in (1, 2)$, we obtain the first optimal privacy-utility tradeoffs; for $p = 1$, we improve tradeoffs obtained by the recent works [Asi, Feldman, Koren, Talwar '21, Bassily, Guzman, Nandi '21] by at least a logarithmic factor. Our $\ell_p$ norm and Schatten-$p$ norm optimization frameworks are complemented with polynomial-time samplers whose query complexity we explicitly bound.
△ Less
Submitted 10 November, 2022; v1 submitted 17 July, 2022;
originally announced July 2022.
-
Mirror duality between Calabi-Yau fractional complete intersections
Authors:
Tsung-Ju Lee
Abstract:
This is an expanded version of the author's talk at the third annual meeting of International Consortium of Chinese Mathematicians held at USTC in December 2020. In this expository article, we give a survey on joint works with Hosono, Lian, and Yau in arXiv:2003.07148 and arXiv:2008.04039. We also carry out some explicit examples to illustrate our results in enumerative geometry which will appear…
▽ More
This is an expanded version of the author's talk at the third annual meeting of International Consortium of Chinese Mathematicians held at USTC in December 2020. In this expository article, we give a survey on joint works with Hosono, Lian, and Yau in arXiv:2003.07148 and arXiv:2008.04039. We also carry out some explicit examples to illustrate our results in enumerative geometry which will appear in our forthcoming papers.
△ Less
Submitted 13 June, 2022;
originally announced June 2022.
-
SYZ mirror symmetry for del Pezzo surfaces and affine structures
Authors:
Siu-Cheong Lau,
Tsung-Ju Lee,
Yu-Shen Lin
Abstract:
We prove that the Landau--Ginzburg superpotential of del Pezzo surfaces can be realized as a limit of their hyperKähler rotation toward the large complex structure limit point. As a corollary, we compute the limit of the complex affine structure of the special Lagrangian fibrations constructed by Collins--Jacob--Lin in $\mathbf{P}^1\times \mathbf{P}^1$ arXiv:1904.08363 and compare it with the inte…
▽ More
We prove that the Landau--Ginzburg superpotential of del Pezzo surfaces can be realized as a limit of their hyperKähler rotation toward the large complex structure limit point. As a corollary, we compute the limit of the complex affine structure of the special Lagrangian fibrations constructed by Collins--Jacob--Lin in $\mathbf{P}^1\times \mathbf{P}^1$ arXiv:1904.08363 and compare it with the integral affine structures used in the work of Carl--Pumperla--Siebert arXiv:2205.07753. We also construct the Floer-theoretical Landau--Ginzburg mirrors of smoothing of $A_n$-singularities and monotone del Pezzo surfaces, by using the gluing method of Cho--Hong--Lau arXiv:1810.02045 and Hong--Kim--Lau arXiv:1805.11738. They agree with the result of hyperKähler rotation.
△ Less
Submitted 3 April, 2024; v1 submitted 3 June, 2022;
originally announced June 2022.
-
Equivariant Reinforcement Learning for Quadrotor UAV
Authors:
Beomyeol Yu,
Taeyoung Lee
Abstract:
This paper presents an equivariant reinforcement learning framework for quadrotor unmanned aerial vehicles. Successful training of reinforcement learning often requires numerous interactions with the environments, which hinders its applicability especially when the available computational resources are limited, or when there is no reliable simulation model. We identified an equivariance property o…
▽ More
This paper presents an equivariant reinforcement learning framework for quadrotor unmanned aerial vehicles. Successful training of reinforcement learning often requires numerous interactions with the environments, which hinders its applicability especially when the available computational resources are limited, or when there is no reliable simulation model. We identified an equivariance property of the quadrotor dynamics such that the dimension of the state required in the training is reduced by one, thereby improving the sampling efficiency of reinforcement learning substantially. This is illustrated by numerical examples with popular reinforcement learning techniques of TD3 and SAC.
△ Less
Submitted 25 February, 2023; v1 submitted 2 June, 2022;
originally announced June 2022.
-
Closed-Form Solution of the Unit Normal Loss Integral in Two-Dimensions
Authors:
Tae Yoon Lee,
Paul Gustafson,
Mohsen Sadatsafavi
Abstract:
In Value of Information (VoI) analysis, the unit normal loss integral (UNLI) frequently emerges as a solution for the computation of various VoI metrics. However, one limitation of the UNLI has been that its closed-form solution is available for only one dimension, and thus can be used for comparisons involving only two strategies (where it is applied to the scalar incremental net benefit). We der…
▽ More
In Value of Information (VoI) analysis, the unit normal loss integral (UNLI) frequently emerges as a solution for the computation of various VoI metrics. However, one limitation of the UNLI has been that its closed-form solution is available for only one dimension, and thus can be used for comparisons involving only two strategies (where it is applied to the scalar incremental net benefit). We derived a closed-form solution for the two-dimensional UNLI, enabling closed-form VoI calculations for three strategies. We verified the accuracy of this method via simulation studies. A case study based on a three-arm clinical trial was used as an example. VoI methods based on the closed-form solutions for the UNLI can now be extended to three-decision comparisons, taking a fraction of a second to compute and not being subject to Monte Carlo error. An R implementation of this method is provided as part of the predtools package (https://github.com/resplab/predtools/).
△ Less
Submitted 23 July, 2022; v1 submitted 12 May, 2022;
originally announced May 2022.