-
Analysis of A Mixed Finite Element Method for Poisson's Equation with Rough Boundary Data
Authors:
Huadong Gao,
Yuhui Huang,
Wen Xie
Abstract:
This paper is concerned with finite element methods for Poisson's equation with rough boundary data. Conventional methods require that the boundary data $g$ of the problem belongs to $H^{1/2} (\partial Ω)$. However, in many applications one has to consider the case when $g$ is in $L^2(\partial Ω)$ only. To this end, very weak solutions are considered to establish the well-posedness of the problem. Most previously proposed numerical methods use regularizations of the boundary data. The main purpose of this paper is to use the Raviart--Thomas mixed finite element method to solve the Poisson equation with rough boundary data directly. We prove that the solution to the proposed mixed method converges to the very weak solution. In particular, we prove that the convergence rate of the numerical solution is $O(h^{1/2})$ in convex domains and $O(h^{s-1/2})$ in nonconvex domains, where $s > 1/2$ depends on the geometry of the domain. The analysis is based on a regularized approach and a rigorous estimate for the corresponding dual problem. Numerical experiments confirm the theoretically predicted convergence rates for the proposed mixed method for Poisson's equation with rough boundary data.
Submitted 1 July, 2025;
originally announced July 2025.
-
A residual driven multiscale method for Darcy's flow in perforated domains
Authors:
Wei Xie,
Shubin Fu,
Yin Yang,
Yunqing Huang
Abstract:
In this paper, we present a residual-driven multiscale method for simulating Darcy flow in perforated domains, where complex geometries and highly heterogeneous permeability make direct simulations computationally expensive. To address this, we introduce a velocity elimination technique that reformulates the mixed velocity-pressure system into a pressure-only formulation, significantly reducing complexity by focusing on the dominant pressure variable. Our method is developed within the Generalized Multiscale Finite Element Method (GMsFEM) framework. For each coarse block, we construct offline basis functions from local spectral problems that capture key geometric and physical features. Online basis functions are then adaptively enriched using residuals, allowing the method to incorporate global effects such as source terms and boundary conditions, thereby improving accuracy. We provide detailed error analysis demonstrating how the offline and online spaces contribute to the accuracy and efficiency of the solution. Numerical experiments confirm the method's effectiveness, showing substantial reductions in computational cost while maintaining high accuracy, particularly through adaptive online enrichment. These results highlight the method's potential for efficient and accurate simulation of Darcy flow in complex, heterogeneous perforated domains.
Submitted 29 June, 2025;
originally announced June 2025.
-
Robust space-time multiscale upscaling via multicontinuum homogenization for evolving perforated media
Authors:
Wei Xie,
Viet Ha Hoang,
Yin Yang,
Yunqing Huang
Abstract:
Time-evolving perforated domains arise in many engineering and geoscientific applications, including reactive transport, particle deposition, and structural degradation in porous media. Accurately capturing the macroscopic behavior of such systems poses significant computational challenges due to the dynamic fine-scale geometries. In this paper, we develop a robust and generalizable multiscale modeling framework based on multicontinuum homogenization to derive effective macroscopic equations in shrinking domains. The method distinguishes multiple continua according to the physical characteristics (e.g., channel widths), and couples them via space-time local cell problems formulated on representative volume elements. These local problems incorporate temporal derivatives and domain evolution, ensuring consistency with underlying fine-scale dynamics. The resulting upscaled system yields computable macroscopic coefficients and is suitable for large-scale simulations. Several numerical experiments are presented to validate the accuracy, efficiency, and potential applicability of the method to complex time-dependent engineering problems.
Submitted 26 June, 2025;
originally announced June 2025.
-
Dynamic Investment Strategies Through Market Classification and Volatility: A Machine Learning Approach
Authors:
Jinhui Li,
Wenjia Xie,
Luis Seco
Abstract:
This study introduces a dynamic investment framework to enhance portfolio management in volatile markets, offering clear advantages over traditional static strategies. It evaluates four conventional approaches (equal weighted, minimum variance, maximum diversification, and equal risk contribution) under dynamic conditions. Using K-means clustering, the market is segmented into ten volatility-based states, with transitions forecasted by a Bayesian Markov switching model employing Dirichlet priors and Gibbs sampling. This enables real-time asset allocation adjustments. Tested across two asset sets, the dynamic portfolio consistently achieves significantly higher risk-adjusted returns and substantially higher total returns, outperforming most static methods. By integrating classical optimization with machine learning and Bayesian techniques, this research provides a robust strategy for optimizing investment outcomes in unpredictable market environments.
Submitted 19 March, 2025;
originally announced April 2025.
-
Efficient QR-Based CP Decomposition Acceleration via Dimension Tree and Extrapolation
Authors:
Wenchao Xie,
Jiawei Xu,
Zheng Peng,
Qingsong Wang
Abstract:
The canonical polyadic (CP) decomposition is one of the most widely used tensor decomposition techniques. The conventional CP decomposition algorithm combines alternating least squares (ALS) with the normal equation. However, the normal equation is susceptible to numerical ill-conditioning, which can adversely affect the decomposition results. To mitigate this issue, ALS combined with QR decomposition has been proposed as a more numerically stable alternative. Although this method enhances stability, its iterative process involves tensor-times-matrix (TTM) operations, which typically result in higher computational costs. To reduce this cost, we propose branch reutilization of the dimension tree, which increases the reuse of intermediate tensors and reduces the number of TTM operations. This strategy achieves a $33\%$ reduction in computational complexity for third- and fourth-order tensors. Additionally, we introduce a specialized extrapolation method in the CP-ALS-QR algorithm, leveraging the unique structure of the matrix $\mathbf{Q}_0$ to further enhance convergence. By integrating both techniques, we develop a novel CP decomposition algorithm that significantly improves efficiency. Numerical experiments on five real-world datasets show that our proposed algorithm reduces iteration costs and enhances fitting accuracy compared to the CP-ALS-QR algorithm.
Submitted 24 March, 2025;
originally announced March 2025.
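Illustrative note (not from the paper): the abstract's remark that the normal equation is susceptible to ill-conditioning can be seen on a tiny example. Forming $A^T A$ squares the 2-norm condition number of $A$, so information that is present in $A$ can be lost before any solve begins. The sketch below uses a hypothetical 3x2 Läuchli-type matrix and plain Python floats; the matrix, the helper names `gram_matrix` and `det2`, and the value of `eps` are assumptions chosen for illustration, not the authors' algorithm.

```python
# Hypothetical 3x2 test matrix (Läuchli-type), not taken from the paper:
#     A = [[1, 1], [eps, 0], [0, eps]]
# Its columns are linearly independent for any eps != 0.

def gram_matrix(eps):
    """Form A^T A entry by entry in double precision."""
    diag = 1.0 + eps * eps  # exact value is 1 + eps^2
    return [[diag, 1.0], [1.0, diag]]

def det2(m):
    """Determinant of a 2x2 matrix given as nested lists."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

eps = 1e-8
G = gram_matrix(eps)
gram_det = det2(G)

# In exact arithmetic det(A^T A) = 2*eps^2 + eps^4 > 0, but eps^2 is
# below the rounding threshold near 1.0, so the computed Gram matrix
# is exactly singular.
print(G[0][0] == 1.0)
print(gram_det)
```

With `eps = 1e-8` the computed Gram matrix has all entries equal to 1.0, so the normal equation is singular in floating point even though `A` has full column rank. A QR factorization works on `A` directly and avoids this squaring, which is the motivation the abstract gives for the QR-based variant.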
-
A hierarchical approach for multicontinuum homogenization in high contrast media
Authors:
Wei Xie,
Viet Ha Hoang,
Yin Yang,
Yunqing Huang
Abstract:
A recently developed upscaling technique, the multicontinuum homogenization method, has gained significant attention for its effectiveness in modeling complex multiscale systems. This method defines multiple continua based on distinct physical properties and solves a series of constrained cell problems to capture localized information for each continuum. However, solving all these cell problems on very fine grids at every macroscopic point is computationally expensive, which is a common limitation of most homogenization approaches for non-periodic problems. To address this challenge, we propose a hierarchical multicontinuum homogenization framework. The core idea is to define hierarchical macroscopic points and solve the constrained problems on grids of varying resolutions. We assume that the local solutions can be represented as a combination of a linear interpolation of local solutions from preceding levels and an additional correction term. This combination is substituted into the original constrained problems, and the correction term is resolved using finite element (FE) grids of varying sizes, depending on the level of the macropoint. By normalizing the computational cost of fully resolving the local problem to $\mathcal{O}(1)$, we establish that our approach incurs a cost of $\mathcal{O}(L η^{(1-L)d})$, highlighting substantial computational savings across hierarchical layers $L$, coarsening factor $η$, and spatial dimension $d$. Numerical experiments validate the effectiveness of the proposed method in media with slowly varying properties, underscoring its potential for efficient multiscale modeling.
Submitted 9 June, 2025; v1 submitted 3 March, 2025;
originally announced March 2025.
-
Multicontinuum Modeling of Time-Fractional Diffusion-Wave Equation in Heterogeneous Media
Authors:
Huiran Bai,
Dmitry Ammosov,
Yin Yang,
Wei Xie,
Mohammed Al Kobaisi
Abstract:
This paper considers a time-fractional diffusion-wave equation with a high-contrast heterogeneous diffusion coefficient. A numerical solution to this problem can present great computational challenges due to its multiscale nature. Therefore, in this paper, we derive a multicontinuum time-fractional diffusion-wave model using the multicontinuum homogenization method. For this purpose, we formulate constraint cell problems considering various homogenized effects. These cell problems are implemented in oversampled regions to avoid boundary effects. By solving the cell problems, we obtain multicontinuum expansions of fine-scale solutions. Then, using these multicontinuum expansions and supposing the smoothness of the macroscopic variables, we rigorously derive the corresponding multicontinuum model. Finally, we present numerical results for two-dimensional model problems with different time-fractional derivatives to verify the accuracy of our proposed approach.
Submitted 13 February, 2025;
originally announced February 2025.
-
Training Deep Learning Models with Norm-Constrained LMOs
Authors:
Thomas Pethick,
Wanyun Xie,
Kimon Antonakopoulos,
Zhenyu Zhu,
Antonio Silveti-Falls,
Volkan Cevher
Abstract:
In this work, we study optimization methods that leverage the linear minimization oracle (LMO) over a norm-ball. We propose a new stochastic family of algorithms that uses the LMO to adapt to the geometry of the problem and, perhaps surprisingly, show that they can be applied to unconstrained problems. The resulting update rule unifies several existing optimization methods under a single framework. Furthermore, we propose an explicit choice of norm for deep architectures, which, as a side benefit, leads to the transferability of hyperparameters across model sizes. Experimentally, we demonstrate significant speedups on nanoGPT training using our algorithm, Scion, without any reliance on Adam. The proposed method is memory-efficient, requiring only one set of model weights and one set of gradients, which can be stored in half-precision. The code is available at https://github.com/LIONS-EPFL/scion .
Submitted 6 June, 2025; v1 submitted 11 February, 2025;
originally announced February 2025.
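Illustrative note (not from the paper): a linear minimization oracle over a norm-ball returns a point of the ball that minimizes the inner product with a given vector, typically a gradient. For the $\ell_\infty$ and $\ell_2$ balls the minimizer has a closed form. The sketch below shows only these two textbook oracles; the function names `lmo_linf` and `lmo_l2` are hypothetical, and this is not the Scion update rule or the norm choice proposed by the authors.

```python
import math

def lmo_linf(grad, radius):
    """Return a minimizer of <grad, x> over the l-infinity ball of given radius.

    Each coordinate moves to the face opposite the gradient sign;
    zero coordinates are left at zero (any value in the ball is optimal there).
    """
    return [-radius * ((g > 0) - (g < 0)) for g in grad]

def lmo_l2(grad, radius):
    """Return a minimizer of <grad, x> over the l2 ball of given radius."""
    norm = math.sqrt(sum(g * g for g in grad))
    if norm == 0.0:
        # Every point of the ball is optimal; return the center.
        return [0.0 for _ in grad]
    return [-radius * g / norm for g in grad]

g = [3.0, 4.0]
print(lmo_linf(g, 1.0))
print(lmo_l2(g, 5.0))
```

Both oracles depend only on the direction of the gradient, not its magnitude, which is one reason LMO-based steps are said to adapt to the geometry of the chosen norm.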
-
The Global Sections of Chiral de Rham Complexes on Closed Complex Curves
Authors:
Bailin Song,
Wujie Xie
Abstract:
The space of global sections of the chiral de Rham complex on any closed complex curve with genus $g \ge2$ is calculated.
Submitted 6 February, 2025; v1 submitted 15 January, 2025;
originally announced January 2025.
-
On the ReLU Lagrangian Cuts for Stochastic Mixed Integer Programming
Authors:
Haoyun Deng,
Weijun Xie
Abstract:
We study stochastic mixed integer programs with both first-stage and recourse decisions involving mixed integer variables. A new family of Lagrangian cuts, termed ``ReLU Lagrangian cuts,'' is introduced by reformulating the nonanticipativity constraints using ReLU functions. These cuts can be integrated into scenario decomposition methods. We show that including ReLU Lagrangian cuts is sufficient to achieve optimality in the original stochastic mixed integer programs. Without solving the Lagrangian dual problems, we derive closed-form expressions for these cuts. Furthermore, to speed up the cut-generating procedures, we introduce linear programming-based methods to enhance the cut coefficients. Numerical studies demonstrate the effectiveness of the proposed cuts compared to existing cut families.
Submitted 9 November, 2024; v1 submitted 2 November, 2024;
originally announced November 2024.
-
Double star arrangement and the pointed multinet
Authors:
Yongqiang Liu,
Wentao Xie
Abstract:
Let $\mathcal{A}$ be a hyperplane arrangement in a complex projective space. It is an open question whether the degree one cohomology jump loci (with complex coefficients) are determined by the combinatorics of $\mathcal{A}$. By the work of Falk and Yuzvinsky \cite{FY}, all the irreducible components passing through the origin are determined by the multinet structure, which is combinatorially determined. Denham and Suciu introduced the pointed multinet structure to obtain examples of arrangements with translated positive-dimensional components in the degree one cohomology jump loci \cite{DS}. Suciu asked whether all translated positive-dimensional components appear in this manner \cite{Suc14}. In this paper, we show that the double star arrangement introduced by Ishibashi, Sugawara and Yoshinaga \cite[Example 3.2]{ISY22} gives a negative answer to this question.
Submitted 7 May, 2025; v1 submitted 6 September, 2024;
originally announced September 2024.
-
The Blessing of Strategic Customers in Personalized Pricing
Authors:
Zhi Chen,
Bradley Sturt,
Weijun Xie
Abstract:
We consider a feature-based personalized pricing problem in which the buyer is strategic: given the seller's pricing policy, the buyer can augment the features that they reveal to the seller to obtain a low price for the product. We model the seller's pricing problem as a stochastic program over an infinite-dimensional space of pricing policies where the radii by which the buyer can perturb the features are strictly positive. We establish that the sample average approximation of this problem is asymptotically consistent; that is, we prove that the objective value of the sample average approximation converges almost surely to the objective value of the stochastic problem as the number of samples tends to infinity under mild technical assumptions. This consistency guarantee thus shows that incorporating strategic consumer behavior into a data-driven pricing problem can, in addition to making the pricing problem more realistic, also help prevent overfitting.
Submitted 16 August, 2024;
originally announced August 2024.
-
Multicontinuum homogenization in perforated domains
Authors:
Wei Xie,
Yalchin Efendiev,
Yunqing Huang,
Wing Tat Leung,
Yin Yang
Abstract:
In this paper, we develop a general framework for multicontinuum homogenization in perforated domains. The simulations of problems in perforated domains are expensive and, in many applications, coarse-grid macroscopic models are developed. Previous approaches include homogenization, multiscale finite element methods, and so on. In our paper, we design multicontinuum homogenization based on our recently proposed framework. In this setting, we distinguish different spatial regions in perforations based on their sizes. For example, very thin perforations are considered as one continuum, while larger perforations are considered as another continuum. By differentiating perforations in this way, we are able to predict flows in each of them more accurately. We present a framework by formulating cell problems for each continuum using appropriate constraints for the solution averages and their gradients. These cell problem solutions are used in a multiscale expansion and in deriving novel macroscopic systems for multicontinuum homogenization. Our proposed approaches are designed for problems without scale separation. We present numerical results for two-continuum problems and demonstrate the accuracy of the proposed methods.
Submitted 26 April, 2024;
originally announced April 2024.
-
CEM-GMsFEM for Poisson equations in heterogeneous perforated domains
Authors:
Wei Xie,
Yin Yang,
Eric Chung,
Yunqing Huang
Abstract:
In this paper, we propose a novel multiscale model reduction strategy tailored to address the Poisson equation within heterogeneous perforated domains. The numerical simulation of this intricate problem is impeded by its multiscale characteristics, necessitating an exceptionally fine mesh to adequately capture all relevant details. To overcome the challenges inherent in the multiscale nature of the perforations, we introduce a coarse space constructed using the Constraint Energy Minimizing Generalized Multiscale Finite Element Method (CEM-GMsFEM). This involves constructing basis functions through a sequence of local energy minimization problems over eigenspaces containing localized information pertaining to the heterogeneities. Through our analysis, we demonstrate that the oversampling layers depend on the local eigenvalues, thereby implicating the local geometry as well. Additionally, we provide numerical examples to illustrate the efficacy of the proposed scheme.
Submitted 26 April, 2024;
originally announced April 2024.
-
Regularized MIP Model for Integrating Energy Storage Systems and its Application for Solving a Trilevel Interdiction Problem
Authors:
Dahye Han,
Nan Jiang,
Santanu S. Dey,
Weijun Xie
Abstract:
Incorporating energy storage systems (ESS) into power systems has been studied in many recent works, where binary variables are often introduced to model the complementary nature of battery charging and discharging. A conventional approach for these ESS optimization problems is to relax binary variables and convert the problem into a linear program. However, such linear programming relaxation models can yield unrealistic fractional solutions, such as simultaneous charging and discharging. In this paper, we develop a regularized Mixed-Integer Programming (MIP) model for the ESS optimal power flow (OPF) problem. We prove that under mild conditions, the proposed regularized model admits a zero integrality gap with its linear programming relaxation; hence, it can be solved efficiently. By studying the properties of the regularized MIP model, we show that its optimal solution is also near-optimal to the original ESS OPF problem, thereby providing a valid and tight upper bound for the ESS OPF problem. The use of the regularized MIP model allows us to solve a trilevel min-max-min network contingency problem which is otherwise intractable to solve.
Submitted 9 January, 2025; v1 submitted 6 February, 2024;
originally announced February 2024.
-
Distributionally Fair Stochastic Optimization using Wasserstein Distance
Authors:
Qing Ye,
Grani A. Hanasusanto,
Weijun Xie
Abstract:
A traditional stochastic program under a finite population typically seeks to optimize efficiency by maximizing the expected profits or minimizing the expected costs, subject to a set of constraints. However, implementing such optimization-based decisions can have varying impacts on individuals, and when assessed using the individuals' utility functions, these impacts may differ substantially across demographic groups delineated by sensitive attributes, such as gender, race, age, and socioeconomic status. As each group comprises multiple individuals, a common remedy is to enforce group fairness, which necessitates the measurement of disparities in the distributions of utilities across different groups. This paper introduces the concept of Distributionally Fair Stochastic Optimization (DFSO) based on the Wasserstein fairness measure. The DFSO aims to minimize distributional disparities among groups, quantified by the Wasserstein distance, while adhering to an acceptable level of inefficiency. Our analysis reveals that: (i) the Wasserstein fairness measure recovers the demographic parity fairness prevalent in binary classification literature; (ii) this measure can approximate the well-known Kolmogorov-Smirnov fairness measure with considerable accuracy; and (iii) despite DFSO's biconvex nature, the epigraph of the Wasserstein fairness measure is generally Mixed-Integer Convex Programming Representable (MICP-R). Additionally, we introduce two distinct lower bounds for the Wasserstein fairness measure: the Jensen bound, applicable to the general Wasserstein fairness measure, and the Gelbrich bound, specific to the type-2 Wasserstein fairness measure. We establish the exactness of the Gelbrich bound and quantify the theoretical difference between the Wasserstein fairness measure and the Gelbrich bound.
Submitted 8 February, 2024; v1 submitted 2 February, 2024;
originally announced February 2024.
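Illustrative note (not from the paper): the disparity between two groups' utility distributions can be measured with a Wasserstein distance. In one dimension, for two empirical distributions with the same number of equally weighted samples, the type-1 Wasserstein distance reduces to the mean absolute difference of the sorted samples. The sketch below computes that special case only; the function name `wasserstein1_equal_size` and the sample utilities are hypothetical, and it does not reproduce the DFSO model or its MICP-R reformulation.

```python
def wasserstein1_equal_size(xs, ys):
    """Type-1 Wasserstein distance between two 1-D empirical distributions.

    Assumes both samples have the same size and uniform weights, in which
    case the optimal coupling matches the i-th smallest values of each sample.
    """
    if len(xs) != len(ys) or not xs:
        raise ValueError("samples must be non-empty and of equal size")
    pairs = zip(sorted(xs), sorted(ys))
    return sum(abs(a - b) for a, b in pairs) / len(xs)

# Hypothetical utilities for two demographic groups.
group_a = [1.0, 2.0, 3.0]
group_b = [4.0, 2.0, 3.0]
print(wasserstein1_equal_size(group_a, group_b))
```

A value of zero means the two empirical utility distributions coincide; larger values indicate that more utility "mass" must be moved to turn one group's distribution into the other's.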
-
On Tractability, Complexity, and Mixed-Integer Convex Programming Representability of Distributionally Favorable Optimization
Authors:
Nan Jiang,
Weijun Xie
Abstract:
Distributionally Favorable Optimization (DFO) is an important framework for decision-making under uncertainty, with applications across fields such as reinforcement learning, online learning, robust statistics, chance-constrained programming, and two-stage stochastic optimization without relatively complete recourse. In contrast to the traditional Distributionally Robust Optimization (DRO) paradigm, DFO presents a unique challenge -- the application of the inner infimum operator often fails to retain the convexity. In light of this challenge, we study the tractability and complexity of DFO. We establish sufficient and necessary conditions for determining when DFO problems are tractable or intractable. Despite the typical nonconvex nature of DFO problems, our findings show that they are mixed-integer convex programming representable (MICP-R), thereby enabling solutions via standard optimization solvers. Finally, we numerically validate the efficacy of our MICP-R formulations.
Submitted 31 January, 2024;
originally announced January 2024.
-
On Sparse Canonical Correlation Analysis
Authors:
Yongchun Li,
Santanu S. Dey,
Weijun Xie
Abstract:
The classical Canonical Correlation Analysis (CCA) identifies the correlations between two sets of multivariate variables based on their covariance, which has been widely applied in diverse fields such as computer vision, natural language processing, and speech analysis. Despite its popularity, CCA can encounter challenges in explaining correlations between two variable sets within high-dimensional data contexts. Thus, this paper studies Sparse Canonical Correlation Analysis (SCCA) that enhances the interpretability of CCA. We first show that SCCA generalizes three well-known sparse optimization problems, sparse PCA, sparse SVD, and sparse regression, which are all classified as NP-hard problems. This result motivates us to develop strong formulations and efficient algorithms. Our main contributions include (i) the introduction of a combinatorial formulation that captures the essence of SCCA and allows the development of approximation algorithms; (ii) the derivation of an equivalent mixed-integer semidefinite programming model that facilitates a specialized branch-and-cut algorithm with analytical cuts; and (iii) the establishment of the complexity results for two low-rank special cases of SCCA. The effectiveness of our proposed formulations and algorithms is validated through numerical experiments.
Submitted 30 December, 2023;
originally announced January 2024.
-
Learning Fair Policies for Multi-stage Selection Problems from Observational Data
Authors:
Zhuangzhuang Jia,
Grani A. Hanasusanto,
Phebe Vayanos,
Weijun Xie
Abstract:
We consider the problem of learning fair policies for multi-stage selection problems from observational data. This problem arises in several high-stakes domains such as company hiring, loan approval, or bail decisions where outcomes (e.g., career success, loan repayment, recidivism) are only observed for those selected. We propose a multi-stage framework that can be augmented with various fairness constraints, such as demographic parity or equal opportunity. This problem is a highly intractable infinite chance-constrained program involving the unknown joint distribution of covariates and outcomes. Motivated by the potential impact of selection decisions on people's lives and livelihoods, we propose to focus on interpretable linear selection rules. Leveraging tools from causal inference and sample average approximation, we obtain an asymptotically consistent solution to this selection problem by solving a mixed binary conic optimization problem, which can be solved using standard off-the-shelf solvers. We conduct extensive computational experiments on a variety of datasets adapted from the UCI repository on which we show that our proposed approaches can achieve an 11.6% improvement in precision and a 38% reduction in the measure of unfairness compared to the existing selection policy.
Submitted 20 December, 2023;
originally announced December 2023.
-
The homology groups of finite cyclic covering of line arrangement complement
Authors:
Yongqiang Liu,
Wentao Xie
Abstract:
In this paper, we study the first homology group of finite cyclic coverings of complex line arrangement complements. We show that this first integral homology group is torsion-free under a certain condition similar to the one used by Cohen-Dimca-Orlik. In particular, this includes the case of the Milnor fiber, which generalizes the previous results obtained by Williams for complexified line arrangements to any complex line arrangement.
Submitted 18 December, 2023;
originally announced December 2023.
-
Torsion energy with boundary mean zero condition
Authors:
Qinfeng Li,
Weihong Xie,
Hang Yang
Abstract:
Motivated by the goal of establishing Neumann Talenti-type comparison results, we consider the minimization of the following shape functional under a volume constraint:
\begin{align*}
T(Ω):=\inf\left\{\frac12 \int_Ω |\nabla u|^2\,dx -\int_Ωu\,dx: u\in H^1(Ω),\ \int_{\partial Ω}udσ=0 \right\}.
\end{align*} We prove that the ball is a local minimizer of $T(\cdot)$ under smooth perturbations but, quite surprisingly, is not locally minimal for $T(\cdot)$ under Lipschitz perturbations. In fact, let $P_N$ be the regular polygon in $\mathbb{R}^2$ with $N$ sides and area $π$; then we prove that $T(P_N)$ is a strictly increasing function of $N$ and $\lim_{N\rightarrow \infty}T(P_N)=T(B)$, where $B$ is the unit disk.
As another side result, we prove that in dimension three or higher, the rigidity result for Serrin's seminal overdetermined system is not stable under Dirichlet perturbations, in contrast to the stability of rigidity under Neumann perturbations.
Submitted 6 November, 2023; v1 submitted 2 November, 2023;
originally announced November 2023.
-
Stable Nonconvex-Nonconcave Training via Linear Interpolation
Authors:
Thomas Pethick,
Wanyun Xie,
Volkan Cevher
Abstract:
This paper presents a theoretical analysis of linear interpolation as a principled method for stabilizing (large-scale) neural network training. We argue that instabilities in the optimization process are often caused by the nonmonotonicity of the loss landscape and show how linear interpolation can help by leveraging the theory of nonexpansive operators. We construct a new optimization scheme called relaxed approximate proximal point (RAPP), which is the first explicit method without anchoring to achieve last iterate convergence rates for $ρ$-comonotone problems while only requiring $ρ> -\tfrac{1}{2L}$. The construction extends to constrained and regularized settings. By replacing the inner optimizer in RAPP we rediscover the family of Lookahead algorithms for which we establish convergence in cohypomonotone problems even when the base optimizer is taken to be gradient descent ascent. The range of cohypomonotone problems in which Lookahead converges is further expanded by exploiting that Lookahead inherits the properties of the base optimizer. We corroborate the results with experiments on generative adversarial networks, which demonstrate the benefits of the linear interpolation present in both RAPP and Lookahead.
Submitted 14 March, 2024; v1 submitted 20 October, 2023;
originally announced October 2023.
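The stabilizing effect of linear interpolation described in the abstract above can be seen on a toy problem. The sketch below is not the paper's RAPP scheme or its experiments: it runs Lookahead with gradient descent ascent (GDA) as the base optimizer on the bilinear game $\min_x \max_y xy$, which is monotone (the $ρ=0$ case) and on which plain GDA is known to spiral outward. The function names and the parameter values (`eta=0.1`, `k=31`, `alpha=0.5`) are illustrative assumptions, with `k` chosen so that the inner loop rotates the iterate by roughly half a turn.

```python
import numpy as np

def gda_step(z, eta):
    """One simultaneous gradient descent-ascent step on f(x, y) = x * y."""
    x, y = z
    return np.array([x - eta * y, y + eta * x])

def plain_gda(z0, eta, steps):
    """Run GDA alone for a fixed number of steps."""
    z = np.array(z0, dtype=float)
    for _ in range(steps):
        z = gda_step(z, eta)
    return z

def lookahead(z0, eta, k, alpha, outer_steps):
    """Run k inner GDA steps, then interpolate between the start and end points."""
    z = np.array(z0, dtype=float)
    for _ in range(outer_steps):
        w = z.copy()
        for _ in range(k):
            w = gda_step(w, eta)
        # Linear interpolation: keep (1 - alpha) of the old iterate.
        z = (1.0 - alpha) * z + alpha * w
    return z

# Same total number of gradient evaluations for both runs (50 * 31 = 1550).
z_gda = plain_gda([1.0, 1.0], eta=0.1, steps=50 * 31)
z_la = lookahead([1.0, 1.0], eta=0.1, k=31, alpha=0.5, outer_steps=50)
print(np.linalg.norm(z_gda), np.linalg.norm(z_la))
```

With these settings the GDA iterate moves away from the saddle point at the origin while the Lookahead iterate contracts toward it. Other choices of `k` and `alpha` can fail to converge on this game, so the values here should be read as one working configuration rather than a recommendation.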
-
MSAT: Matrix stability analysis tool for shock-capturing schemes
Authors:
Weijie Ren,
Wenjia Xie,
Ye Zhang,
Hang Yu,
Zhengyu Tian
Abstract:
The simulation of supersonic or hypersonic flows often suffers from numerical shock instabilities if the flow field contains strong shocks, limiting the further application of shock-capturing schemes. In this paper, we develop the unified matrix stability analysis method for schemes with three-point stencils and present MSAT, an open-source tool to quantitatively analyze the shock instability problem. Based on the finite-volume approach on the structured grid, MSAT can be employed to investigate the mechanism of the shock instability problem, evaluate the robustness of numerical schemes, and then help to develop robust schemes. Also, MSAT has the ability to analyze the practical simulation of supersonic or hypersonic flows, evaluate whether it will suffer from shock instabilities, and then assist in selecting appropriate numerical schemes accordingly. As a result, MSAT is a helpful tool that can investigate the shock instability problem and help to cure it.
Submitted 15 August, 2023;
originally announced August 2023.
-
Numerical stability analysis of shock-capturing methods for strong shocks II: high-order finite-volume schemes
Authors:
Weijie Ren,
Wenjia Xie,
Ye Zhang,
Hang Yu,
Zhengyu Tian
Abstract:
The shock instability problem commonly arises in flow simulations involving strong shocks, particularly when employing high-order schemes, limiting their applications in hypersonic flow simulations. This study focuses on exploring the numerical characteristics and underlying mechanisms of shock instabilities in fifth-order finite-volume WENO schemes. To this end, for the first time, we have established the matrix stability analysis method for the fifth-order scheme. By predicting the evolution of perturbation errors in the exponential growth stage, this method provides quantitative insights into the behavior of shock-capturing and helps elucidate the mechanisms that cause shock instabilities. Results reveal that even dissipative solvers suffer from shock instabilities when the spatial accuracy is increased to fifth order. Further investigation indicates that this is due to the excessively high spatial accuracy of the WENO scheme near the numerical shock structure. Moreover, the shock instability problem of fifth-order schemes is demonstrated to be a multidimensional coupling problem. To stably capture strong shocks, it is crucial to have sufficient dissipation on transverse faces and ensure at least two points within the numerical shock structure in the direction perpendicular to the shock. The source location of instability is also clarified by the matrix stability analysis method, revealing that the instability arises from the numerical shock structure. Additionally, stability analysis demonstrates that local characteristic decomposition helps mitigate shock instabilities in high-order schemes, although the instability still persists. These conclusions pave the way for a better understanding of the shock instability in fifth-order schemes and provide guidance for the development of more reliable high-order shock-capturing methods for compressible flows with high Mach numbers.
Submitted 7 August, 2023;
originally announced August 2023.
-
On the Partial Convexification for Low-Rank Spectral Optimization: Rank Bounds and Algorithms
Authors:
Yongchun Li,
Weijun Xie
Abstract:
A Low-rank Spectral Optimization Problem (LSOP) minimizes a linear objective subject to multiple two-sided linear matrix inequalities intersected with a low-rank and spectral constrained domain set. Although solving LSOP is, in general, NP-hard, its partial convexification (i.e., replacing the domain set by its convex hull) termed "LSOP-R," is often tractable and yields a high-quality solution. This motivates us to study the strength of LSOP-R. Specifically, we derive rank bounds for any extreme point of the feasible set of LSOP-R and prove their tightness for the domain sets with different matrix spaces. The proposed rank bounds recover two well-known results in the literature from a fresh angle and also allow us to derive sufficient conditions under which the relaxation LSOP-R is equivalent to the original LSOP. To effectively solve LSOP-R, we develop a column generation algorithm with a vector-based convex pricing oracle, coupled with a rank-reduction algorithm, which ensures the output solution satisfies the theoretical rank bound. Finally, we numerically verify the strength of the LSOP-R and the efficacy of the proposed algorithms.
Submitted 20 June, 2023; v1 submitted 12 May, 2023;
originally announced May 2023.
-
Numerical stability analysis of shock-capturing methods for strong shocks I: second-order MUSCL schemes
Authors:
Weijie Ren,
Wenjia Xie,
Ye Zhang,
Hang Yu,
Zhengyu Tian
Abstract:
Modern shock-capturing schemes often suffer from numerical shock anomalies if the flow field contains strong shocks, which may limit their further application in hypersonic flow computations. In the current study, we devote our efforts to exploring the primary numerical characteristics and the underlying mechanism of shock instability for second-order finite-volume schemes. To this end, we, for the first time, develop the matrix stability analysis method for the finite-volume MUSCL approach. Such a linearized analysis method allows one to investigate the shock instability problem of the finite-volume shock-capturing schemes in a quantitative and efficient manner. Results of the stability analysis demonstrate that the shock stability of second-order schemes is strongly related to the Riemann solver, Mach number, limiter function, numerical shock structure, and computational grid. Unique stability characteristics associated with these factors for second-order methods are revealed quantitatively with the established method. The source location of instability is also clarified by the matrix stability analysis method. Results show that the shock instability originates from the numerical shock structure. Such conclusions pave the way to a better understanding of the shock instability problem and may shed new light on developing more reliable shock-capturing methods for compressible flows with high Mach numbers.
Submitted 5 May, 2023;
originally announced May 2023.
-
ALSO-X#: Better Convex Approximations for Distributionally Robust Chance Constrained Programs
Authors:
Nan Jiang,
Weijun Xie
Abstract:
This paper studies distributionally robust chance constrained programs (DRCCPs), where the uncertain constraints must be satisfied with at least a probability of a prespecified threshold for all probability distributions from the Wasserstein ambiguity set. As DRCCPs are often nonconvex and challenging to solve optimally, researchers have been developing various convex inner approximations. Recently, ALSO-X has been proven to outperform the conditional value-at-risk (CVaR) approximation of a regular chance constrained program when the deterministic set is convex. In this work, we relax this assumption by introducing a new ALSO-X\# method for solving DRCCPs. Namely, in the bilevel reformulations of ALSO-X and CVaR approximation, we observe that the lower-level ALSO-X is a special case of the lower-level CVaR approximation and the upper-level CVaR approximation is more restricted than the one in ALSO-X. This observation motivates us to propose the ALSO-X\#, which still resembles a bilevel formulation -- in the lower-level problem, we adopt the more general CVaR approximation, and for the upper-level one, we choose the less restricted ALSO-X. We show that ALSO-X\# can always be better than the CVaR approximation and can outperform ALSO-X under regular chance constrained programs and type $\infty-$Wasserstein ambiguity set. We also provide new sufficient conditions under which ALSO-X\# outputs an optimal solution to a DRCCP. We apply the proposed ALSO-X\# to a wireless communication problem and numerically demonstrate that the solution quality can be even better than the exact method.
Submitted 3 February, 2023;
originally announced February 2023.
-
Quantum State Transfer in Graphs with Tails
Authors:
Pierre-Antoine Bernard,
Christino Tamon,
Luc Vinet,
Weichen Xie
Abstract:
We consider quantum state transfer on finite graphs which are attached to infinite paths. The finite graph represents an operational quantum system for performing useful quantum information tasks. In contrast, the infinite paths represent external infinite-dimensional systems which have limited (but nontrivial) interaction with the finite quantum system. We show that {\em perfect} state transfer can surprisingly still occur on the finite graph even in the presence of the infinite tails. Our techniques are based on a decoupling theorem for eventually-free Jacobi matrices, equitable partitions, and standard Lie theoretic arguments. Through these methods, we rehabilitate the notion of a dark subspace which had been so far viewed in an unflattering light.
Submitted 26 November, 2022;
originally announced November 2022.
-
On the Exactness of Dantzig-Wolfe Relaxation for Rank Constrained Optimization Problems
Authors:
Yongchun Li,
Weijun Xie
Abstract:
The rank-constrained optimization problem (RCOP) minimizes a linear objective function over a prespecified closed rank-constrained domain set and $m$ generic two-sided linear matrix inequalities. Motivated by the Dantzig-Wolfe (DW) decomposition, a popular approach to solving many nonconvex optimization problems, we investigate the strength of the DW relaxation (DWR) of the RCOP, which admits the same formulation as RCOP except that the domain set is replaced by its closed convex hull. Notably, our goal is to characterize conditions under which the DWR matches RCOP for any $m$ two-sided linear matrix inequalities. From the primal perspective, we develop the first-known simultaneously necessary and sufficient conditions that achieve: (i) extreme point exactness -- all the extreme points of the DWR feasible set belong to that of the RCOP; (ii) convex hull exactness -- the DWR feasible set is identical to the closed convex hull of RCOP feasible set; and (iii) objective exactness -- the optimal values of the DWR and RCOP coincide. The proposed conditions unify, refine, and extend the existing exactness results in the quadratically constrained quadratic program (QCQP) and fair unsupervised learning. These conditions can be very useful to identify new results, including the extreme point exactness for a QCQP problem that admits an inhomogeneous objective function with two homogeneous two-sided quadratic constraints and the convex hull exactness for fair SVD.
Submitted 14 June, 2023; v1 submitted 28 October, 2022;
originally announced October 2022.
-
A note on quadratic constraints with indicator variables: Convex hull description and perspective relaxation
Authors:
Andres Gomez,
Weijun Xie
Abstract:
In this paper, we study the mixed-integer nonlinear set given by a separable quadratic constraint on continuous variables, where each continuous variable is controlled by an additional indicator. This set occurs pervasively in optimization problems with uncertainty and in machine learning. We show that optimization over this set is NP-hard. Despite this negative result, we characterize the structure of the convex hull, and show that it can be formally studied using polyhedral theory. Moreover, we show that although perspective relaxation in the literature for this set fails to match the structure of its convex hull, it is guaranteed to be a close approximation.
Submitted 3 September, 2022;
originally announced September 2022.
-
D-optimal Data Fusion: Exact and Approximation Algorithms
Authors:
Yongchun Li,
Marcia Fampa,
Jon Lee,
Feng Qiu,
Weijun Xie,
Rui Yao
Abstract:
We study the D-optimal Data Fusion (DDF) problem, which aims to select new data points, given an existing Fisher information matrix, so as to maximize the logarithm of the determinant of the overall Fisher information matrix. We show that the DDF problem is NP-hard and has no constant-factor polynomial-time approximation algorithm unless P $=$ NP. Therefore, to solve the DDF problem effectively, we propose two convex integer-programming formulations and investigate their corresponding complementary and Lagrangian-dual problems. We also develop scalable randomized-sampling and local-search algorithms with provable performance guarantees. Leveraging the concavity of the objective functions in the two proposed formulations, we design an exact algorithm, aimed at solving the DDF problem to optimality. We further derive a family of submodular valid inequalities and optimality cuts, which can significantly enhance the algorithm performance. Finally, we test our algorithms using real-world data on the new phasor-measurement-units placement problem for modern power grids, considering the existing conventional sensors. Our numerical study demonstrates the efficiency of our exact algorithm and the scalability and high-quality outputs of our approximation algorithms.
Submitted 6 August, 2022;
originally announced August 2022.
-
Of Shadows and Gaps in Spatial Search
Authors:
Ada Chan,
Chris Godsil,
Christino Tamon,
Weichen Xie
Abstract:
Spatial search occurs in a connected graph if a continuous-time quantum walk on the adjacency matrix of the graph, suitably scaled, plus a rank-one perturbation induced by any vertex will unitarily map the principal eigenvector of the graph to the characteristic vector of the vertex. This phenomenon is a natural continuous-time analogue of Grover search. The spatial search is said to be optimal if it occurs with constant fidelity and in time inversely proportional to the shadow of the target vertex on the principal eigenvector. Extending a result of Chakraborty et al. (Physical Review A, 102:032214, 2020), we prove a simpler characterization of optimal spatial search. Based on this characterization, we observe that some families of distance-regular graphs, such as Hamming and Grassmann graphs, have optimal spatial search. We also show a matching lower bound on time for spatial search with constant fidelity, which extends a bound due to Farhi and Gutmann for perfect fidelity. Our elementary proofs employ standard tools, such as Weyl inequalities and Cauchy determinant formula.
Submitted 30 August, 2022; v1 submitted 8 April, 2022;
originally announced April 2022.
-
Improved Beckner's inequality for axially symmetric functions on $\mathbb{S}^n$
Authors:
Changfeng Gui,
Yeyao Hu,
Weihong Xie
Abstract:
In this article we present various uniqueness and existence results for Q-curvature type equations with a Paneitz operator on $\mathbb{S}^n$ in axially symmetric function spaces. In particular, we show uniqueness results for $n=6, 8$ and improve the best constant of Beckner's inequality in these dimensions for axially symmetric functions under the constraint that their centers of mass are at the origin. As a consequence, the associated first Szegö limit theorem is also proven for axially symmetric functions.
Submitted 27 September, 2021;
originally announced September 2021.
-
Improved Beckner's inequality for axially symmetric functions on $\mathbb{S}^4$
Authors:
Changfeng Gui,
Yeyao Hu,
Weihong Xie
Abstract:
We show that axially symmetric solutions on $\mathbb{S}^4$ to a constant $Q$-curvature type equation (it may also be called fourth order mean field equation) must be constant, provided that the parameter $α$ in front of the Paneitz operator belongs to $[\frac{473 + \sqrt{209329}}{1800}\approx0.517, 1)$. This is in contrast to the case $α=1$, where a family of solutions exists, known as standard bubbles. The phenomenon resembles the Gaussian curvature equation on $ \mathbb{S}^2$. As a consequence, we prove an improved Beckner's inequality on $\mathbb{S}^4$ for axially symmetric functions with their centers of mass at the origin. Furthermore, we show uniqueness of axially symmetric solutions when $α=\frac15$ by exploiting Pohozaev-type identities, and prove existence of a non-constant axially symmetric solution for $α\in (\frac15, \frac12)$ via a bifurcation method.
Submitted 27 September, 2021;
originally announced September 2021.
-
Second-Order Conic and Polyhedral Approximations of the Exponential Cone: Application to Mixed-Integer Exponential Conic Programs
Authors:
Qing Ye,
Weijun Xie
Abstract:
Exponents and logarithms are fundamental components in many important applications such as logistic regression, maximum likelihood, relative entropy, and so on. Since the exponential cone can be viewed as the epigraph of perspective of the natural exponential function or the hypograph of perspective of the natural logarithm function, many mixed-integer convex programs involving exponential or logarithm functions can be recast as mixed-integer exponential conic programs (MIECPs). However, unlike mixed-integer linear programs (MILPs) and mixed-integer second-order conic programs (MISOCPs), MIECPs are still under development. To harvest the past efforts on MILPs and MISOCPs, this paper presents second-order conic (SOC) and polyhedral approximation schemes for the exponential cone with application to MIECPs. To do so, we first extend and generalize existing SOC approximation approaches in the extended space, propose new scaling and shifting methods, prove approximation accuracies, and derive lower bounds of approximations. We then study the polyhedral outer approximation of the exponential cones in the original space using gradient inequalities, show its approximation accuracy, and derive a lower bound of the approximation. When implementing SOC approximations, we suggest learning the approximation pattern by testing smaller cases and then applying it to the large-scale ones; and for the polyhedral approximation, we suggest using the branch and cut method for MIECPs. Our numerical study shows that the proposed methods show speed-ups over solver MOSEK for MIECPs, and the scaling, shifting, and polyhedral outer approximation methods work very well.
Submitted 18 March, 2022; v1 submitted 16 June, 2021;
originally announced June 2021.
-
Infinitely many solutions for Schrödinger-Newton equations
Authors:
Yeyao Hu,
Aleks Jevnikar,
Weihong Xie
Abstract:
We prove the existence of infinitely many non-radial positive solutions for the Schrödinger-Newton system $$
\left\{\begin{array}{ll}
Δu- V(|x|)u + Ψu=0, &x\in\mathbb{R}^3,\newline
ΔΨ+\frac12 u^2=0, &x\in\mathbb{R}^3, \end{array}\right. $$ provided that $V(r)$ has the following behavior at infinity: $$
V(r)=V_0+\frac{a}{r^m}+O\left(\frac{1}{r^{m+θ}}\right)
\quad\mbox{ as } r\rightarrow\infty, $$ where $\frac12\le m<1$ and $a, V_0, θ$ are some positive constants. In particular, for any $s$ large we use a reduction method to construct $s-$bump solutions lying on a circle of radius $r\sim (s\log s)^{\frac{1}{1-m}}$.
Submitted 8 June, 2021;
originally announced June 2021.
-
On the Stratification of Product Portfolios
Authors:
Vikram Govindan,
Wei Xie
Abstract:
Stratifying commercial product portfolios into multiple classes of decreasing priority, ABCD analysis, is a common supply chain tool. Key planning parameters that drive strategic and execution priorities are tied to the resulting segmentation. These priorities in turn drive supply chain performance. For large product assortments, manual segmentation is infeasible so an automated algorithm is needed. We therefore advocate that careful attention be paid to the design of such an ABCD algorithm and present three key features that can be incorporated into such a calculation to improve its quality and commercial utility.
Submitted 5 June, 2021;
originally announced June 2021.
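As a point of reference for the ABCD analysis mentioned in the abstract above, here is a minimal sketch of a baseline stratification by cumulative revenue share. It does not implement the three features the paper proposes; the function name `abcd_classify` and the cutoffs (50%, 80%, 95%) are hypothetical choices for illustration, and total revenue is assumed to be positive.

```python
def abcd_classify(revenue, cutoffs=(0.50, 0.80, 0.95)):
    """Assign each product a class A-D by cumulative share of total revenue.

    revenue: dict mapping product id -> revenue (total assumed positive).
    cutoffs: hypothetical cumulative-share boundaries between A|B, B|C, C|D.
    """
    total = sum(revenue.values())
    # Rank products from highest to lowest revenue.
    ranked = sorted(revenue.items(), key=lambda kv: kv[1], reverse=True)
    labels = {}
    cumulative = 0.0
    for product, value in ranked:
        # Use the cumulative share *before* adding this product,
        # so the top seller always lands in class A.
        share_before = cumulative / total
        labels[product] = "ABCD"[sum(share_before >= c for c in cutoffs)]
        cumulative += value
    return labels

print(abcd_classify({"p1": 60, "p2": 25, "p3": 11, "p4": 4}))
```

A fixed-cutoff rule like this is the kind of calculation that becomes necessary once the assortment is too large to segment by hand.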
-
Beyond Symmetry: Best Submatrix Selection for the Sparse Truncated SVD
Authors:
Yongchun Li,
Weijun Xie
Abstract:
Truncated singular value decomposition (SVD), also known as the best low-rank matrix approximation, has been successfully applied to many domains such as biology, healthcare, and others, where high-dimensional datasets are prevalent. To enhance the interpretability of the truncated SVD, sparse SVD (SSVD) is introduced to select a few rows and columns of the original matrix along with the low rank approximation. Different from the literature, this paper presents a novel SSVD formulation that can select the best submatrix precisely up to a given size to maximize its truncated Ky Fan norm. The fact that the SSVD problem is NP-hard motivates us to study effective algorithms with provable performance guarantees. To do so, we first reformulate SSVD as a mixed-integer semidefinite program, which can be solved exactly for small- or medium-sized instances by a customized branch and cut algorithm with closed-form cuts, and is extremely useful to evaluate the quality of approximation algorithms. We next develop three selection algorithms based on different selection criteria and two searching algorithms -- greedy and local search. We prove the approximation ratios for all the approximation algorithms and show that all the ratios are tight, i.e., we demonstrate that these approximation ratios are unimprovable. Finally, our numerical study demonstrates the high solution quality and computational efficiency of the proposed algorithms.
Submitted 6 August, 2022; v1 submitted 7 May, 2021;
originally announced May 2021.
-
Generalized fractional grey system models: Memory effects perspective
Authors:
Wanli Xie,
Wen-Ze Wu,
Chong Liu,
Mark Goh
Abstract:
As an essential characteristic of fractional calculus, the memory effect serves as one of the key factors in dealing with diverse practical issues and has therefore received extensive attention since its introduction. By combining the fractional derivative with memory effects and grey modeling theory, this paper aims to construct a unified framework for the commonly used fractional grey models already in place. In particular, by taking different kernel and normalization functions, this framework can deduce some other new fractional grey models. To further improve the prediction performance, four popular intelligent algorithms are employed to determine the emerging coefficients for the UFGM(1,1) model. Two published cases are then utilized to verify the validity of the UFGM(1,1) model and explore the effects of fractional accumulation order and initial value on the prediction accuracy, respectively. Finally, this model is also applied to two real examples so as to further explain its efficacy and show how to use the unified framework in practical applications.
Submitted 9 July, 2021; v1 submitted 11 March, 2021;
originally announced March 2021.
-
A unified construction of all-speed HLL-type schemes for hypersonic heating computations
Authors:
Wenjia Xie,
Zhengyu Tian,
Ye Zhang,
Hang Yu
Abstract:
In this paper, a unified framework to develop all-speed HLL-type schemes for hypersonic heating computations is constructed. Such a unified construction method combines two effective improving techniques: a shock robustness improvement and a low-Mach number fix. It is implemented by properly modifying the approximate solutions of the local Riemann problem in the HLL framework, resulting in two all-speed HLL-type schemes, namely ASHLLC and ASHLLEM solvers. Results from both numerical analysis and experiments demonstrate that the newly proposed schemes not only preserve desirable properties of their original versions, but are also able to provide accurate and robust solutions for complex flows ranging from low-Mach number incompressible to hypersonic compressible regimes. Thus, both the ASHLLC and ASHLLEM schemes can be used as reliable methods for hypersonic heating computations.
Submitted 20 February, 2021;
originally announced March 2021.
-
ALSO-X and ALSO-X+: Better Convex Approximations for Chance Constrained Programs
Authors:
Nan Jiang,
Weijun Xie
Abstract:
In a chance constrained program (CCP), the decision-makers aim to seek the best decision whose probability of violating the uncertainty constraints is within the prespecified risk level. As a CCP is often nonconvex and is difficult to solve to optimality, much effort has been devoted to developing convex inner approximations for a CCP, among which the conditional value-at-risk (CVaR) has been known to be the best for more than a decade. This paper studies and generalizes the ALSO-X, originally proposed by Ahmed, Luedtke, SOng, and Xie (2017), for solving a CCP. We first show that the ALSO-X resembles a bilevel optimization, where the upper-level problem is to find the best objective function value and enforce the feasibility of a CCP for a given decision from the lower-level problem, and the lower-level problem is to minimize the expectation of constraint violations subject to the upper bound of the objective function value provided by the upper-level problem. This interpretation motivates us to prove that when uncertain constraints are convex in the decision variables, ALSO-X always outperforms the CVaR approximation. We further show (i) sufficient conditions under which ALSO-X can recover an optimal solution to a CCP; (ii) an equivalent bilinear programming formulation of a CCP, inspiring us to enhance ALSO-X with a convergent alternating minimization method (ALSO-X+); (iii) extensions of ALSO-X and ALSO-X+ to solve distributionally robust chance constrained programs (DRCCPs) under $\infty$-Wasserstein ambiguity set. Our numerical study demonstrates the effectiveness of the proposed methods.
Submitted 14 October, 2021; v1 submitted 8 December, 2020;
originally announced December 2020.
-
Exact and Approximation Algorithms for Sparse PCA
Authors:
Yongchun Li,
Weijun Xie
Abstract:
Sparse PCA (SPCA) is a fundamental model in machine learning and data analytics, with applications in areas such as finance, manufacturing, biology, and healthcare. By selecting a principal submatrix of prespecified size from a covariance matrix to maximize its largest eigenvalue, for better interpretability, SPCA advances the conventional PCA with both feature selection and dimensionality reduction. This paper proposes two exact mixed-integer SDPs (MISDPs) by exploiting the spectral decomposition of the covariance matrix and the properties of the largest eigenvalues. We then analyze the theoretical optimality gaps of their continuous relaxation values and prove that they are stronger than that of the state-of-the-art one. We further show that the continuous relaxations of the two MISDPs can be recast as saddle point problems without involving semi-definite cones, and thus can be effectively solved by first-order methods such as the subgradient method. Since off-the-shelf solvers, in general, have difficulty in solving MISDPs, we approximate SPCA with arbitrary accuracy by a mixed-integer linear program (MILP) of a size similar to that of the MISDPs. To be more scalable, we also analyze greedy and local search algorithms, prove their first-known approximation ratios, and show that the approximation ratios are tight. Our numerical study demonstrates that the continuous relaxation values of the proposed MISDPs are quite close to optimality, the proposed MILP model can solve small and medium-size instances to optimality, and the approximation algorithms work very well for all the instances. Finally, we extend the analyses to Rank-one Sparse SVD (R1-SSVD) with non-symmetric matrices and Sparse Fair PCA (SFPCA) when there are multiple covariance matrices, each corresponding to a protected group.
Submitted 27 August, 2020;
originally announced August 2020.
-
Continuous grey model with conformable fractional derivative
Authors:
Wanli Xie,
Caixia Liu,
Weidong Li,
Wenze Wu,
Chong Liu
Abstract:
Existing fractional grey prediction models mainly use discrete fractional-order difference and accumulation, but in actual modeling, continuous fractional-order calculus has been shown to have many excellent properties, such as heredity. Grey models based on continuous fractional-order calculus have since been established and have achieved good results. However, these models are computationally complicated, which hinders their practical application. To further simplify and improve grey prediction models with continuous fractional-order derivatives, we propose a simple and effective grey model based on conformable fractional derivatives, and two practical cases are used to demonstrate the validity of the proposed model.
Submitted 13 August, 2020; v1 submitted 22 June, 2020;
originally announced August 2020.
-
Distributionally Robust Bottleneck Combinatorial Problems: Uncertainty Quantification and Robust Decision Making
Authors:
Weijun Xie,
Jie Zhang,
Shabbir Ahmed
Abstract:
This paper studies data-driven distributionally robust bottleneck combinatorial problems (DRBCP) with stochastic costs, where the probability distribution of the cost vector is contained in a ball of distributions centered at the empirical distribution specified by the Wasserstein distance. We study two distinct versions of DRBCP from different applications: (i) Motivated by the multi-hop wireless network application, we first study the uncertainty quantification of DRBCP (denoted by DRBCP-U), where decision-makers would like to have an accurate estimation of the worst-case value of DRBCP. The difficulty of DRBCP-U is to handle its max-min-max form. Fortunately, the alternative forms of the bottleneck combinatorial problems from their blockers allow us to derive equivalent deterministic reformulations, which can be computed via mixed-integer programs. In addition, by drawing the connection between DRBCP-U and its sampling average approximation counterpart under empirical distribution, we show that the Wasserstein radius can be chosen in the order of negative square root of sample size, improving the existing known results; and (ii) Next, motivated by the ride-sharing application, decision-makers choose the best service-and-passenger matching that minimizes the unfairness. This gives rise to the decision-making DRBCP (denoted by DRBCP-D). For DRBCP-D, we show that its optimal solution is also optimal to its sampling average approximation counterpart, and the Wasserstein radius can be chosen in a similar order as DRBCP-U. When the sample size is small, we propose to use the optimal value of DRBCP-D to construct an indifferent solution space and propose an alternative decision-robust model, which finds the best indifferent solution to minimize the empirical variance. We further show that the decision robust model can be recast as a mixed-integer program.
Submitted 21 February, 2021; v1 submitted 1 March, 2020;
originally announced March 2020.
-
Best Principal Submatrix Selection for the Maximum Entropy Sampling Problem: Scalable Algorithms and Performance Guarantees
Authors:
Yongchun Li,
Weijun Xie
Abstract:
This paper studies a classic maximum entropy sampling problem (MESP), which aims to select the most informative principal submatrix of a prespecified size from a covariance matrix. MESP has been widely applied to many areas, including healthcare, power system, manufacturing and data science. By investigating its Lagrangian dual and primal characterization, we derive a novel convex integer program for MESP and show that its continuous relaxation yields a near-optimal solution. The results motivate us to study an efficient sampling algorithm and develop its approximation bound for MESP, which improves the best-known bound in literature. We then provide an efficient deterministic implementation of the sampling algorithm with the same approximation bound. By developing new mathematical tools for the singular matrices and analyzing the Lagrangian dual of the proposed convex integer program, we investigate the widely-used local search algorithm and prove its first-known approximation bound for MESP. The proof techniques further inspire us with an efficient implementation of the local search algorithm. Our numerical experiments demonstrate that these approximation algorithms can efficiently solve medium-sized and large-scale instances to near-optimality. Our proposed algorithms are coded and released as open-source software. Finally, we extend the analyses to the A-Optimal MESP (A-MESP), where the objective is to minimize the trace of the inverse of the selected principal submatrix.
Submitted 1 May, 2023; v1 submitted 23 January, 2020;
originally announced January 2020.
-
A Note on Quantum Markov Models
Authors:
Christino Tamon,
Weichen Xie
Abstract:
The study of Markov models is central to control theory and machine learning. A quantum analogue of the partially observable Markov decision process was studied in (Barry, Barry, and Aaronson, Phys. Rev. A, 90, 2014). It was proved that goal-state reachability is undecidable in the quantum setting, whereas it is decidable classically. In contrast to this classical-to-quantum transition from decidable to undecidable, we observe that the problem of approximating the optimal policy which maximizes the average discounted reward over an infinite horizon remains decidable in the quantum setting. Given that most relevant problems related to Markov decision processes are undecidable classically (which immediately implies undecidability in the quantum case), this provides one of the few examples where the quantum problem is tractable.
Submitted 5 November, 2019;
originally announced November 2019.
-
Global-Local Metamodel Assisted Two-Stage Optimization via Simulation
Authors:
Wei Xie,
Yuan Yi,
Hua Zheng
Abstract:
To integrate strategic, tactical and operational decisions, the two-stage optimization has been widely used to guide dynamic decision making. In this paper, we study the two-stage stochastic programming for complex systems with unknown response estimated by simulation. We introduce the global-local metamodel assisted two-stage optimization via simulation that can efficiently employ the simulation resource to iteratively solve for the optimal first- and second-stage decisions. Specifically, at each visited first-stage decision, we develop a local metamodel to simultaneously solve a set of scenario-based second-stage optimization problems, which also allows us to estimate the optimality gap. Then, we construct a global metamodel accounting for the errors induced by: (1) using a finite number of scenarios to approximate the expected future cost occurring in the planning horizon, (2) second-stage optimality gap, and (3) finite visited first-stage decisions. Assisted by the global-local metamodel, we propose a new simulation optimization approach that can efficiently and iteratively search for the optimal first- and second-stage decisions. Our framework can guarantee the convergence of optimal solution for the discrete two-stage optimization with unknown objective, and the empirical study indicates that it achieves substantial efficiency and accuracy.
Submitted 13 October, 2019;
originally announced October 2019.
-
Tractable Reformulations of Distributionally Robust Two-stage Stochastic Programs with $\infty-$Wasserstein Distance
Authors:
Weijun Xie
Abstract:
In optimization under uncertainty, decision-makers first select a wait-and-see policy before any realization of uncertainty and then place a here-and-now decision after the uncertainty has been observed. Two-stage stochastic programming is a popular modeling paradigm for optimization under uncertainty in which the decision-makers first specify a probability distribution and then seek the best decisions to jointly optimize the deterministic wait-and-see and expected here-and-now costs. In practice, such a probability distribution may not be fully available but is probably observable through an empirical dataset. Therefore, this paper studies the distributionally robust two-stage stochastic program (DRTSP), which jointly optimizes the deterministic wait-and-see and worst-case expected here-and-now costs, where the probability distribution comes from a family of distributions centered at the empirical distribution under the $\infty-$Wasserstein metric. There have been successful developments on deriving tractable approximations of the worst-case expected here-and-now cost in DRTSP. Unfortunately, there are limited results on exact tractable reformulations of DRTSP. This paper fills this gap by providing sufficient conditions under which the worst-case expected here-and-now cost in DRTSP can be efficiently computed via a tractable convex program. By exploring the properties of binary variables, the developed reformulation techniques are extended to DRTSP with binary random parameters. The main tractable reformulations in this paper are projected into the original decision space and thus can be interpreted as conventional two-stage stochastic programs under discrete support with extra penalty terms enforcing the robustness. These tractable results are further demonstrated to be sharp through complexity analysis.
Submitted 22 August, 2019;
originally announced August 2019.
-
Tetradic motif profiles of horizontal visibility graphs
Authors:
Wen-Jie Xie,
Rui-Qi Han,
Wei-Xing Zhou
Abstract:
Network motif analysis is a useful tool for the investigation of complex networks. We study the profiles of tetradic motifs in horizontal visibility graphs (HVGs) converted from multifractal binomial measures, fractional Gaussian noises, and heartbeat rates. The profiles of tetradic motifs contain the spatial information (visibility) and temporal information (relative magnitude) among the data points in the corresponding time series. For multifractal binomial measures, the occurrence frequencies of the tetradic motifs are determined, which converge to a constant vector $(2/3,0,8/99,8/33,1/99,0)$. For fractional Gaussian noises, the motif occurrence frequencies are found to depend nonlinearly on the Hurst exponent and the length of time series. These findings suggest that tetradic motif profiles have the potential to distinguish different types of time series. Finally, we apply the tetradic motif analysis to heartbeat rates of healthy subjects, congestive heart failure (CHF) subjects, and atrial fibrillation (AF) subjects. Different subjects can be distinguished from the occurrence frequencies of tetradic motifs.
Submitted 9 November, 2018;
originally announced November 2018.
-
Fine gradings and their Weyl groups for twisted Heisenberg Lie superalgebras
Authors:
Wenjuan Xie,
Wende Liu
Abstract:
In this paper we define the so-called twisted Heisenberg superalgebras over the complex number field by adding derivations to Heisenberg superalgebras. We classify the fine gradings up to equivalence on twisted Heisenberg superalgebras and determine the Weyl groups of those gradings.
Submitted 5 September, 2018;
originally announced September 2018.