-
Modeling the Curbside Congestion Effects of Ride-hailing Services for Morning Commute using Bi-modal Two-Tandem Bottlenecks
Authors:
Yao Deng,
Zhi-Chun Li,
Sean Qian,
Wei Ma
Abstract:
With the proliferation of ride-hailing services, curb space in urban areas has become highly congested due to the massive passenger pick-ups and drop-offs. Particularly during peak hours, the massive ride-hailing vehicles waiting to drop off obstruct curb spaces and even disrupt the flow of mainline traffic. However, there is a lack of an analytical model that formulates and mitigates the congesti…
▽ More
With the proliferation of ride-hailing services, curb space in urban areas has become highly congested due to the massive passenger pick-ups and drop-offs. Particularly during peak hours, the massive ride-hailing vehicles waiting to drop off obstruct curb spaces and even disrupt the flow of mainline traffic. However, there is a lack of an analytical model that formulates and mitigates the congestion effects of ride-hailing drop-offs in curb spaces. To address this issue, this paper proposes a novel bi-modal two-tandem bottleneck model to depict the commuting behaviors of private vehicles (PVs) and ride-hailing vehicles (RVs) during the morning peak in a linear city. In the model, the upstream bottleneck models the congestion on highways, and the downstream curbside bottlenecks depict the congestion caused by RV drop-offs in curb spaces, PV queue on main roads, and the spillover effects between them in the urban area. The proposed model can be solved in a closed form under eight different scenarios. A time-varying optimal congestion pricing scheme, combined curbside pricing and parking pricing, is proposed to achieve the social optimum. It is found that potential waste of road capacity could occur when there is a mismatch between the highway and curbside bottlenecks, and hence the optimal pricing should be determined in a coordinated manner. A real-world case from Hong Kong shows that the limited curb space and main road in the urban area could be the major congestion bottleneck. Expanding the capacity of the curb space or the main road in the urban area, rather than the highway bottleneck, can effectively reduce social costs. This paper highlights the critical role of curbside management and provides policy implications for the coordinated management of highways and curb spaces.
△ Less
Submitted 14 June, 2025; v1 submitted 11 June, 2025;
originally announced June 2025.
-
A Bayesian Composite Risk Approach for Stochastic Optimal Control and Markov Decision Processes
Authors:
Wentao Ma,
Zhiping Chen,
Huifu Xu
Abstract:
Inspired by \cite{shapiro2023episodic}, we consider a stochastic optimal control (SOC) and Markov decision process (MDP) where the risks arising from epistemic and aleatoric uncertainties are assessed using Bayesian composite risk (BCR) measures (\cite{qian2019composite}). The time dependence of the risk measures allows us to capture the decision maker's (DM) dynamic risk preferences opportunely a…
▽ More
Inspired by \cite{shapiro2023episodic}, we consider a stochastic optimal control (SOC) and Markov decision process (MDP) where the risks arising from epistemic and aleatoric uncertainties are assessed using Bayesian composite risk (BCR) measures (\cite{qian2019composite}). The time dependence of the risk measures allows us to capture the decision maker's (DM) dynamic risk preferences opportunely as increasing information about both uncertainties is obtained. This makes the new BCR-SOC/MDP model more flexible than conventional risk-averse SOC/MDP models. Unlike \cite{shapiro2023episodic} where the control/action at each episode is based on the current state alone, the new model allows the control to depend on the probability distribution of the epistemic uncertainty, which reflects the fact that in many practical instances the cumulative information about epistemic uncertainty often affects the DM's belief about the future aleatoric uncertainty and hence the DM's action (\cite{strens2000bayesian}). The new modeling paradigm incorporates several existing SOC/MDP models including distributionally robust SOC/MDP models and Bayes-adaptive MDP models and generates so-called preference robust SOC/MDP models. Moreover, we derive conditions under which the BCR-SOC/MDP model is well-defined, demonstrate that BCR-SOC/MDP models can be solved using dynamic programming techniques. By using Bellman equations, we show that under some standard conditions, asymptotic convergence of the optimal values and optimal actions as the episode goes to infinity is achieved. Finally, we carry out numerical tests on a finite horizon spread betting problem and an inventory control problem to show the effectiveness of the proposed model and numerical schemes.
△ Less
Submitted 21 December, 2024;
originally announced December 2024.
-
Upper semi-continuity of metric entropy for diffeomorphisms with dominated splitting
Authors:
Chiyi Luo,
Wenhui Ma,
Yun Zhao
Abstract:
For a $C^{r}$ $(r>1)$ diffeomorphism on a compact manifold that admits a dominated splitting, this paper establishes the upper semi-continuity of the entropy map. More precisely, this paper establishes the upper semi-continuity of the entropy map in the following two cases:
(1) if a sequence of invariant measures has only positive Lyapunov exponents along a sub-bundle and non-positive Lyapunov e…
▽ More
For a $C^{r}$ $(r>1)$ diffeomorphism on a compact manifold that admits a dominated splitting, this paper establishes the upper semi-continuity of the entropy map. More precisely, this paper establishes the upper semi-continuity of the entropy map in the following two cases:
(1) if a sequence of invariant measures has only positive Lyapunov exponents along a sub-bundle and non-positive Lyapunov exponents along another sub-bundle, then the upper limit of their metric entropies is less than or equal to the entropy of the limiting measure; (2) if an invariant measure has positive Lyapunov exponents along a sub-bundle and non-positive Lyapunov exponents along another sub-bundle, then the entropy map is upper semi-continuous at this measure.
△ Less
Submitted 24 December, 2024; v1 submitted 6 December, 2024;
originally announced December 2024.
-
An End-to-End Smart Predict-then-Optimize Framework for Vehicle Relocation Problems in Large-Scale Vehicle Crowd Sensing
Authors:
Xinyu Wang,
Yiyang Peng,
Wei Ma
Abstract:
Ubiquitous mobile devices have catalyzed the development of vehicle crowd sensing (VCS). In particular, vehicle sensing systems show great potential in the flexible acquisition of spatio-temporal urban data through built-in sensors under diverse sensing scenarios. However, vehicle systems often exhibit biased coverage due to the heterogeneous nature of trip requests and routes. To achieve a high s…
▽ More
Ubiquitous mobile devices have catalyzed the development of vehicle crowd sensing (VCS). In particular, vehicle sensing systems show great potential in the flexible acquisition of spatio-temporal urban data through built-in sensors under diverse sensing scenarios. However, vehicle systems often exhibit biased coverage due to the heterogeneous nature of trip requests and routes. To achieve a high sensing coverage, a critical challenge lies in optimally relocating vehicles to minimize the divergence between vehicle distributions and target sensing distributions. Conventional approaches typically employ a two-stage predict-then-optimize (PTO) process: first predicting real-time vehicle distributions and subsequently generating an optimal relocation strategy based on the predictions. However, this approach can lead to suboptimal decision-making due to the propagation of errors from upstream prediction. To this end, we develop an end-to-end Smart Predict-then-Optimize (SPO) framework by integrating optimization into prediction within the deep learning architecture, and the entire framework is trained by minimizing the task-specific matching divergence rather than the upstream prediction error. Methodologically, we formulate the vehicle relocation problem by quadratic programming (QP) and incorporate a novel unrolling approach based on the Alternating Direction Method of Multipliers (ADMM) within the SPO framework to compute gradients of the QP layer, facilitating backpropagation and gradient-based optimization for end-to-end learning. The effectiveness of the proposed framework is validated by real-world taxi datasets in Hong Kong. Utilizing the alternating differentiation method, the general SPO framework presents a novel concept of addressing decision-making problems with uncertainty, demonstrating significant potential for advancing applications in intelligent transportation systems.
△ Less
Submitted 27 November, 2024;
originally announced November 2024.
-
On the achievability of efficiency bounds for covariate-adjusted response-adaptive randomization
Authors:
Jiahui Xin,
Wei Ma
Abstract:
In the context of precision medicine, covariate-adjusted response-adaptive randomization (CARA) has garnered much attention from both academia and industry due to its benefits in providing ethical and tailored treatment assignments based on patients' profiles while still preserving favorable statistical properties. Recent years have seen substantial progress in understanding the inference for vari…
▽ More
In the context of precision medicine, covariate-adjusted response-adaptive randomization (CARA) has garnered much attention from both academia and industry due to its benefits in providing ethical and tailored treatment assignments based on patients' profiles while still preserving favorable statistical properties. Recent years have seen substantial progress in understanding the inference for various adaptive experimental designs. In particular, research has focused on two important perspectives: how to obtain robust inference in the presence of model misspecification, and what the smallest variance, i.e., the efficiency bound, an estimator can achieve. Notably, Armstrong (2022) derived the asymptotic efficiency bound for any randomization procedure that assigns treatments depending on covariates and accrued responses, thus including CARA, among others. However, to the best of our knowledge, no existing literature has addressed whether and how the asymptotic efficiency bound can be achieved under CARA. In this paper, by connecting two strands of literature on adaptive randomization, namely robust inference and efficiency bound, we provide a definitive answer to this question for an important practical scenario where only discrete covariates are observed and used to form stratification. We consider a specific type of CARA, i.e., a stratified version of doubly-adaptive biased coin design, and prove that the stratified difference-in-means estimator achieves Armstrong (2022)'s efficiency bound, with possible ethical constraints on treatment assignments. Our work provides new insights and demonstrates the potential for more research regarding the design and analysis of CARA that maximizes efficiency while adhering to ethical considerations. Future studies could explore how to achieve the asymptotic efficiency bound for general CARA with continuous covariates, which remains an open question.
△ Less
Submitted 25 November, 2024;
originally announced November 2024.
-
Statistical Inference in Tensor Completion: Optimal Uncertainty Quantification and Statistical-to-Computational Gaps
Authors:
Wanteng Ma,
Dong Xia
Abstract:
This paper presents a simple yet efficient method for statistical inference of tensor linear forms using incomplete and noisy observations. Under the Tucker low-rank tensor model and the missing-at-random assumption, we utilize an appropriate initial estimate along with a debiasing technique followed by a one-step power iteration to construct an asymptotically normal test statistic. This method is…
▽ More
This paper presents a simple yet efficient method for statistical inference of tensor linear forms using incomplete and noisy observations. Under the Tucker low-rank tensor model and the missing-at-random assumption, we utilize an appropriate initial estimate along with a debiasing technique followed by a one-step power iteration to construct an asymptotically normal test statistic. This method is suitable for various statistical inference tasks, including constructing confidence intervals, inference under heteroskedastic and sub-exponential noise, and simultaneous testing. We demonstrate that the estimator achieves the Cramér-Rao lower bound on Riemannian manifolds, indicating its optimality in uncertainty quantification. We comprehensively examine the statistical-to-computational gaps and investigate the impact of initialization on the minimal conditions regarding sample size and signal-to-noise ratio required for accurate inference. Our findings show that with independent initialization, statistically optimal sample sizes and signal-to-noise ratios are sufficient for accurate inference. Conversely, if only dependent initialization is available, computationally optimal sample sizes and signal-to-noise ratio conditions still guarantee asymptotic normality without the need for data-splitting. We present the phase transition between computational and statistical limits. Numerical simulation results align with the theoretical findings.
△ Less
Submitted 1 November, 2024; v1 submitted 14 October, 2024;
originally announced October 2024.
-
Harmonic metrics of $\mathrm{SO}_{0}(n,n)$-Higgs bundles in the Hitchin section on non-compact hyperbolic surfaces
Authors:
Weihan Ma
Abstract:
Let $X$ be a Riemann surface. Hitchin constructed the $G$-Higgs bundles in the Hitchin section for a split real form $G$ of a complex simple Lie group,using the canonical line bundle $K$ and some holomorphic differentials $\boldsymbol{q}$. We study the case of ${\mathrm{SO}_0(n,n)}$. In our work, we establish the existence of harmonic metrics for these Higgs bundles, which are compatible with the…
▽ More
Let $X$ be a Riemann surface. Hitchin constructed the $G$-Higgs bundles in the Hitchin section for a split real form $G$ of a complex simple Lie group,using the canonical line bundle $K$ and some holomorphic differentials $\boldsymbol{q}$. We study the case of ${\mathrm{SO}_0(n,n)}$. In our work, we establish the existence of harmonic metrics for these Higgs bundles, which are compatible with the ${\mathrm{SO}_0(n,n)}$-structure on any non-compact hyperbolic Riemann surface. Furthermore, these harmonic metrics weakly dominate $h_X$, the natural diagonal harmonic metric induced by the unique complete Kähler hyperbolic metric $g_X$ on $X$. Assuming these holomorphic differentials are all bounded with respect to $g_X$, we prove the uniqueness of such a harmonic metric.
△ Less
Submitted 19 June, 2025; v1 submitted 16 August, 2024;
originally announced August 2024.
-
Robo-GS: A Physics Consistent Spatial-Temporal Model for Robotic Arm with Hybrid Representation
Authors:
Haozhe Lou,
Yurong Liu,
Yike Pan,
Yiran Geng,
Jianteng Chen,
Wenlong Ma,
Chenglong Li,
Lin Wang,
Hengzhen Feng,
Lu Shi,
Liyi Luo,
Yongliang Shi
Abstract:
Real2Sim2Real plays a critical role in robotic arm control and reinforcement learning, yet bridging this gap remains a significant challenge due to the complex physical properties of robots and the objects they manipulate. Existing methods lack a comprehensive solution to accurately reconstruct real-world objects with spatial representations and their associated physics attributes.
We propose a…
▽ More
Real2Sim2Real plays a critical role in robotic arm control and reinforcement learning, yet bridging this gap remains a significant challenge due to the complex physical properties of robots and the objects they manipulate. Existing methods lack a comprehensive solution to accurately reconstruct real-world objects with spatial representations and their associated physics attributes.
We propose a Real2Sim pipeline with a hybrid representation model that integrates mesh geometry, 3D Gaussian kernels, and physics attributes to enhance the digital asset representation of robotic arms.
This hybrid representation is implemented through a Gaussian-Mesh-Pixel binding technique, which establishes an isomorphic mapping between mesh vertices and Gaussian models. This enables a fully differentiable rendering pipeline that can be optimized through numerical solvers, achieves high-fidelity rendering via Gaussian Splatting, and facilitates physically plausible simulation of the robotic arm's interaction with its environment using mesh-based methods.
The code,full presentation and datasets will be made publicly available at our website https://robostudioapp.com
△ Less
Submitted 17 September, 2024; v1 submitted 27 August, 2024;
originally announced August 2024.
-
Online Matching and Contention Resolution for Edge Arrivals with Vanishing Probabilities
Authors:
Will Ma,
Calum MacRury,
Pranav Nuti
Abstract:
We study the performance of sequential contention resolution and matching algorithms on random graphs with vanishing edge probabilities. When the edges of the graph are processed in an adversarially-chosen order, we derive a new OCRS that is $0.382$-selectable, attaining the "independence benchmark" from the literature under the vanishing edge probabilities assumption. Complementary to this positi…
▽ More
We study the performance of sequential contention resolution and matching algorithms on random graphs with vanishing edge probabilities. When the edges of the graph are processed in an adversarially-chosen order, we derive a new OCRS that is $0.382$-selectable, attaining the "independence benchmark" from the literature under the vanishing edge probabilities assumption. Complementary to this positive result, we show that no OCRS can be more than $0.390$-selectable, significantly improving upon the upper bound of $0.428$ from the literature. We also derive negative results that are specialized to bipartite graphs or subfamilies of OCRS's. Meanwhile, when the edges of the graph are processed in a uniformly random order, we show that the simple greedy contention resolution scheme which accepts all active and feasible edges is $1/2$-selectable. This result is tight due to a known upper bound. Finally, when the algorithm can choose the processing order, we show that a slight tweak to the random order -- give each vertex a random priority and process edges in lexicographic order -- results in a strictly better contention resolution scheme that is $1-\ln(2-1/e)\approx0.510$-selectable. Our positive results also apply to online matching on $1$-uniform random graphs with vanishing (non-identical) edge probabilities, extending and unifying some results from the random graphs literature.
△ Less
Submitted 8 October, 2024; v1 submitted 20 June, 2024;
originally announced June 2024.
-
Inference under covariate-adaptive randomization with many strata
Authors:
Jiahui Xin,
Hanzhong Liu,
Wei Ma
Abstract:
Covariate-adaptive randomization is widely employed to balance baseline covariates in interventional studies such as clinical trials and experiments in development economics. Recent years have witnessed substantial progress in inference under covariate-adaptive randomization with a fixed number of strata. However, concerns have been raised about the impact of a large number of strata on its design…
▽ More
Covariate-adaptive randomization is widely employed to balance baseline covariates in interventional studies such as clinical trials and experiments in development economics. Recent years have witnessed substantial progress in inference under covariate-adaptive randomization with a fixed number of strata. However, concerns have been raised about the impact of a large number of strata on its design and analysis, which is a common scenario in practice, such as in multicenter randomized clinical trials. In this paper, we propose a general framework for inference under covariate-adaptive randomization, which extends the seminal works of Bugni et al. (2018, 2019) by allowing for a diverging number of strata. Furthermore, we introduce a novel weighted regression adjustment that ensures efficiency improvement. On top of establishing the asymptotic theory, practical algorithms for handling situations involving an extremely large number of strata are also developed. Moreover, by linking design balance and inference robustness, we highlight the advantages of stratified block randomization, which enforces better covariate balance within strata compared to simple randomization. This paper offers a comprehensive landscape of inference under covariate-adaptive randomization, spanning from fixed to diverging to extremely large numbers of strata.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Extreme Point Pursuit -- Part II: Further Error Bound Analysis and Applications
Authors:
Junbin Liu,
Ya Liu,
Wing-Kin Ma,
Mingjie Shao,
Anthony Man-Cho So
Abstract:
In the first part of this study, a convex-constrained penalized formulation was studied for a class of constant modulus (CM) problems. In particular, the error bound techniques were shown to play a vital role in providing exact penalization results. In this second part of the study, we continue our error bound analysis for the cases of partial permutation matrices, size-constrained assignment matr…
▽ More
In the first part of this study, a convex-constrained penalized formulation was studied for a class of constant modulus (CM) problems. In particular, the error bound techniques were shown to play a vital role in providing exact penalization results. In this second part of the study, we continue our error bound analysis for the cases of partial permutation matrices, size-constrained assignment matrices and non-negative semi-orthogonal matrices. We develop new error bounds and penalized formulations for these three cases, and the new formulations possess good structures for building computationally efficient algorithms. Moreover, we provide numerical results to demonstrate our framework in a variety of applications such as the densest k-subgraph problem, graph matching, size-constrained clustering, non-negative orthogonal matrix factorization and sparse fair principal component analysis.
△ Less
Submitted 11 November, 2024; v1 submitted 11 March, 2024;
originally announced March 2024.
-
Extreme Point Pursuit -- Part I: A Framework for Constant Modulus Optimization
Authors:
Junbin Liu,
Ya Liu,
Wing-Kin Ma,
Mingjie Shao,
Anthony Man-Cho So
Abstract:
This study develops a framework for a class of constant modulus (CM) optimization problems, which covers binary constraints, discrete phase constraints, semi-orthogonal matrix constraints, non-negative semi-orthogonal matrix constraints, and several types of binary assignment constraints. Capitalizing on the basic principles of concave minimization and error bounds, we study a convex-constrained p…
▽ More
This study develops a framework for a class of constant modulus (CM) optimization problems, which covers binary constraints, discrete phase constraints, semi-orthogonal matrix constraints, non-negative semi-orthogonal matrix constraints, and several types of binary assignment constraints. Capitalizing on the basic principles of concave minimization and error bounds, we study a convex-constrained penalized formulation for general CM problems. The advantage of such formulation is that it allows us to leverage non-convex optimization techniques, such as the simple projected gradient method, to build algorithms. As the first part of this study, we explore the theory of this framework. We study conditions under which the formulation provides exact penalization results. We also examine computational aspects relating to the use of the projected gradient method for each type of CM constraint. Our study suggests that the proposed framework has a broad scope of applicability.
△ Less
Submitted 11 November, 2024; v1 submitted 11 March, 2024;
originally announced March 2024.
-
Online Contention Resolution Schemes for Network Revenue Management and Combinatorial Auctions
Authors:
Will Ma,
Calum MacRury,
Jingwei Zhang
Abstract:
In the Network Revenue Management (NRM) problem, products composed of up to L resources are sold to stochastically arriving customers. We take a randomized rounding approach to NRM, motivated by the modern tool of Online Contention Resolution Schemes (OCRS). The goal is to take a fractional solution to NRM that satisfies the resource constraints in expectation, and implement it in an online policy…
▽ More
In the Network Revenue Management (NRM) problem, products composed of up to L resources are sold to stochastically arriving customers. We take a randomized rounding approach to NRM, motivated by the modern tool of Online Contention Resolution Schemes (OCRS). The goal is to take a fractional solution to NRM that satisfies the resource constraints in expectation, and implement it in an online policy that satisfies the resource constraints with probability 1, while (approximately) preserving all of the sales that were prescribed by the fractional solution.
In NRM problems, customer substitution induces a negative correlation between products being demanded, making it difficult to apply the standard definition of OCRS. We start by deriving a more powerful notion of "random-element" OCRS that achieves a guarantee of 1/(1+L) for NRM with customer substitution, matching a common benchmark in the literature. We show this benchmark is unbeatable for all integers L that are the power of a prime number. We then show how to beat this benchmark under three widely applied assumptions. Finally, we show that under several assumptions, it is possible to do better than offline CRS when L>= 5.
Our results have corresponding implications for Online Combinatorial Auctions, in which buyers bid for bundles of up to L items, and buyers being single-minded is akin to having no substitution. Our result under the assumption that products comprise one item from each of up to L groups implies that 1/(1+L) can be beaten for Prophet Inequality on the intersection of L partition matroids, a problem of interest. In sum, our paper shows how to apply OCRS to all of these problems and establishes a surprising separation in the achievable guarantees when substitution is involved, under general resource constraints parametrized by L.
△ Less
Submitted 7 December, 2024; v1 submitted 8 March, 2024;
originally announced March 2024.
-
A unified framework for covariate adjustment under stratified randomization
Authors:
Fuyi Tu,
Wei Ma,
Hanzhong Liu
Abstract:
Randomization, as a key technique in clinical trials, can eliminate sources of bias and produce comparable treatment groups. In randomized experiments, the treatment effect is a parameter of general interest. Researchers have explored the validity of using linear models to estimate the treatment effect and perform covariate adjustment and thus improve the estimation efficiency. However, the relati…
▽ More
Randomization, as a key technique in clinical trials, can eliminate sources of bias and produce comparable treatment groups. In randomized experiments, the treatment effect is a parameter of general interest. Researchers have explored the validity of using linear models to estimate the treatment effect and perform covariate adjustment and thus improve the estimation efficiency. However, the relationship between covariates and outcomes is not necessarily linear, and is often intricate. Advances in statistical theory and related computer technology allow us to use nonparametric and machine learning methods to better estimate the relationship between covariates and outcomes and thus obtain further efficiency gains. However, theoretical studies on how to draw valid inferences when using nonparametric and machine learning methods under stratified randomization are yet to be conducted. In this paper, we discuss a unified framework for covariate adjustment and corresponding statistical inference under stratified randomization and present a detailed proof of the validity of using local linear kernel-weighted least squares regression for covariate adjustment in treatment effect estimators as a special case. In the case of high-dimensional data, we additionally propose an algorithm for statistical inference using machine learning methods under stratified randomization, which makes use of sample splitting to alleviate the requirements on the asymptotic properties of machine learning methods. Finally, we compare the performances of treatment effect estimators using different machine learning methods by considering various data generation scenarios, to guide practical research.
△ Less
Submitted 2 December, 2023;
originally announced December 2023.
-
Multiple Testing of Linear Forms for Noisy Matrix Completion
Authors:
Wanteng Ma,
Lilun Du,
Dong Xia,
Ming Yuan
Abstract:
Many important tasks of large-scale recommender systems can be naturally cast as testing multiple linear forms for noisy matrix completion. These problems, however, present unique challenges because of the subtle bias-and-variance tradeoff of and an intricate dependence among the estimated entries induced by the low-rank structure. In this paper, we develop a general approach to overcome these dif…
▽ More
Many important tasks of large-scale recommender systems can be naturally cast as testing multiple linear forms for noisy matrix completion. These problems, however, present unique challenges because of the subtle bias-and-variance tradeoff of and an intricate dependence among the estimated entries induced by the low-rank structure. In this paper, we develop a general approach to overcome these difficulties by introducing new statistics for individual tests with sharp asymptotics both marginally and jointly, and utilizing them to control the false discovery rate (FDR) via a data splitting and symmetric aggregation scheme. We show that valid FDR control can be achieved with guaranteed power under nearly optimal sample size requirements using the proposed methodology. Extensive numerical simulations and real data examples are also presented to further illustrate its practical merits.
△ Less
Submitted 10 March, 2025; v1 submitted 30 November, 2023;
originally announced December 2023.
-
Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning Algorithms
Authors:
Xiangyuan Zhang,
Weichao Mao,
Saviz Mowlavi,
Mouhacine Benosman,
Tamer Başar
Abstract:
We introduce controlgym, a library of thirty-six industrial control settings, and ten infinite-dimensional partial differential equation (PDE)-based control problems. Integrated within the OpenAI Gym/Gymnasium (Gym) framework, controlgym allows direct applications of standard reinforcement learning (RL) algorithms like stable-baselines3. Our control environments complement those in Gym with contin…
▽ More
We introduce controlgym, a library of thirty-six industrial control settings, and ten infinite-dimensional partial differential equation (PDE)-based control problems. Integrated within the OpenAI Gym/Gymnasium (Gym) framework, controlgym allows direct applications of standard reinforcement learning (RL) algorithms like stable-baselines3. Our control environments complement those in Gym with continuous, unbounded action and observation spaces, motivated by real-world control applications. Moreover, the PDE control environments uniquely allow the users to extend the state dimensionality of the system to infinity while preserving the intrinsic dynamics. This feature is crucial for evaluating the scalability of RL algorithms for control. This project serves the learning for dynamics & control (L4DC) community, aiming to explore key questions: the convergence of RL algorithms in learning control policies; the stability and robustness issues of learning-based controllers; and the scalability of RL algorithms to high- and potentially infinite-dimensional systems. We open-source the controlgym project at https://github.com/xiangyuan-zhang/controlgym.
△ Less
Submitted 23 April, 2024; v1 submitted 30 November, 2023;
originally announced November 2023.
-
Interaction tests with covariate-adaptive randomization
Authors:
Likun Zhang,
Wei Ma
Abstract:
Treatment-covariate interaction tests are commonly applied by researchers to examine whether the treatment effect varies across patient subgroups defined by baseline characteristics. The objective of this study is to explore treatment-covariate interaction tests involving covariate-adaptive randomization. Without assuming a parametric data generating model, we investigate usual interaction tests a…
▽ More
Treatment-covariate interaction tests are commonly applied by researchers to examine whether the treatment effect varies across patient subgroups defined by baseline characteristics. The objective of this study is to explore treatment-covariate interaction tests involving covariate-adaptive randomization. Without assuming a parametric data generating model, we investigate usual interaction tests and observe that they tend to be conservative: specifically, their limiting rejection probabilities under the null hypothesis do not exceed the nominal level and are typically strictly lower than it. To address this problem, we propose modifications to the usual tests to obtain corresponding valid tests. Moreover, we introduce a novel class of stratified-adjusted interaction tests that are simple, more powerful than the usual and modified tests, and broadly applicable to most covariate-adaptive randomization methods. The results are general to encompass two types of interaction tests: one involving stratification covariates and the other involving additional covariates that are not used for randomization. Our study clarifies the application of interaction tests in clinical trials and offers valuable tools for revealing treatment heterogeneity, crucial for advancing personalized medicine.
△ Less
Submitted 10 March, 2024; v1 submitted 29 November, 2023;
originally announced November 2023.
-
Random-order Contention Resolution via Continuous Induction: Tightness for Bipartite Matching under Vertex Arrivals
Authors:
Calum MacRury,
Will Ma
Abstract:
We introduce a new approach for designing Random-order Contention Resolution Schemes (RCRS) via exact solution in continuous time. Given a function $c(y):[0,1] \rightarrow [0,1]$, we show how to select each element which arrives at time $y \in [0,1]$ with probability exactly $c(y)$. We provide a rigorous algorithmic framework for achieving this, which discretizes the time interval and also needs t…
▽ More
We introduce a new approach for designing Random-order Contention Resolution Schemes (RCRS) via exact solution in continuous time. Given a function $c(y):[0,1] \rightarrow [0,1]$, we show how to select each element which arrives at time $y \in [0,1]$ with probability exactly $c(y)$. We provide a rigorous algorithmic framework for achieving this, which discretizes the time interval and also needs to sample its past execution to ensure these exact selection probabilities. We showcase our framework in the context of online contention resolution schemes for matching with random-order vertex arrivals. For bipartite graphs with two-sided arrivals, we design a $(1+e^{-2})/2 \approx 0.567$-selectable RCRS, which we also show to be tight. Next, we show that the presence of short odd-length cycles is the only barrier to attaining a (tight) $(1+e^{-2})/2$-selectable RCRS on general graphs. By generalizing our bipartite RCRS, we design an RCRS for graphs with odd-length girth $g$ which is $(1+e^{-2})/2$-selectable as $g \rightarrow \infty$. This convergence happens very rapidly: for triangle-free graphs (i.e., $g \ge 5$), we attain a $121/240 + 7/16 e^2 \approx 0.563$-selectable RCRS. Finally, for general graphs we improve on the $8/15 \approx 0.533$-selectable RCRS of Fu et al. (ICALP, 2021) and design an RCRS which is at least $0.535$-selectable. Due to the reduction of Ezra et al. (EC, 2020), our bounds yield a $0.535$-competitive (respectively, $(1+e^{-2})/2$-competitive) algorithm for prophet secretary matching on general (respectively, bipartite) graphs under vertex arrivals.
△ Less
Submitted 12 December, 2023; v1 submitted 16 October, 2023;
originally announced October 2023.
-
Multi-Agent Search for a Moving and Camouflaging Target
Authors:
Miguel Lejeune,
Johannes O. Royset,
Wenbo Ma
Abstract:
In multi-agent search planning for a randomly moving and camouflaging target, we examine heterogeneous searchers that differ in terms of their endurance level, travel speed, and detection ability. This leads to a convex mixed-integer nonlinear program, which we reformulate using three linearization techniques. We develop preprocessing steps, outer approximations via lazy constraints, and bundle-ba…
▽ More
In multi-agent search planning for a randomly moving and camouflaging target, we examine heterogeneous searchers that differ in terms of their endurance level, travel speed, and detection ability. This leads to a convex mixed-integer nonlinear program, which we reformulate using three linearization techniques. We develop preprocessing steps, outer approximations via lazy constraints, and bundle-based cutting plane methods to address large-scale instances. Further specializations emerge when the target moves according to a Markov chain. We carry out an extensive numerical study to show the computational efficiency of our methods and to derive insights regarding which approach should be favored for which type of problem instance.
△ Less
Submitted 1 November, 2023; v1 submitted 5 September, 2023;
originally announced September 2023.
-
Dynamic Pricing for Reusable Resources: The Power of Two Prices
Authors:
Santiago R. Balseiro,
Will Ma,
Wenxin Zhang
Abstract:
Motivated by real-world applications such as rental and cloud computing services, we investigate pricing for reusable resources. We consider a system where a single resource with a fixed number of identical copies serves customers with heterogeneous willingness-to-pay (WTP), and the usage duration distribution is general. Optimal dynamic policies are computationally intractable when usage duration…
▽ More
Motivated by real-world applications such as rental and cloud computing services, we investigate pricing for reusable resources. We consider a system where a single resource with a fixed number of identical copies serves customers with heterogeneous willingness-to-pay (WTP), and the usage duration distribution is general. Optimal dynamic policies are computationally intractable when usage durations are not memoryless, so existing literature has focused on static pricing, which incurs a steady-state performance loss of ${O}(\sqrt{c})$ compared to optimality when supply and demand scale with $c$. We propose a class of dynamic "stock-dependent" policies that 1) are computationally tractable and 2) can attain a steady-state performance loss of $o(\sqrt{c})$. We give parametric bounds based on the local shape of the reward function at the optimal fluid admission probability and show that the performance loss of stock-dependent policies can be as low as ${O}((\log{c})^2)$. We characterize the tight performance loss for stock-dependent policies and show that they can in fact be achieved by a simple two-price policy that sets a higher price when the stock is below some threshold and a lower price otherwise. We extend our results to settings with multiple resources and multiple customer classes. Finally, we demonstrate this "minimally dynamic" class of two-price policies performs well numerically, even in non-asymptotic settings, suggesting that a little dynamicity can go a long way.
△ Less
Submitted 23 June, 2025; v1 submitted 26 August, 2023;
originally announced August 2023.
-
Data-driven Approximation of Distributionally Robust Chance Constraints using Bayesian Credible Intervals
Authors:
Zhiping Chen,
Wentao Ma,
Bingbing Ji
Abstract:
The non-convexity and intractability of distributionally robust chance constraints make them challenging to cope with. From a data-driven perspective, we propose formulating it as a robust optimization problem to ensure that the distributionally robust chance constraint is satisfied with high probability. To incorporate available data and prior distribution knowledge, we construct ambiguity sets f…
▽ More
The non-convexity and intractability of distributionally robust chance constraints make them challenging to cope with. From a data-driven perspective, we propose formulating it as a robust optimization problem to ensure that the distributionally robust chance constraint is satisfied with high probability. To incorporate available data and prior distribution knowledge, we construct ambiguity sets for the distributionally robust chance constraint using Bayesian credible intervals. We establish the congruent relationship between the ambiguity set in Bayesian distributionally robust chance constraints and the uncertainty set in a specific robust optimization. In contrast to most existent uncertainty set construction methods which are only applicable for particular settings, our approach provides a unified framework for constructing uncertainty sets under different marginal distribution assumptions, thus making it more flexible and widely applicable. Additionally, under the concavity assumption, our method provides strong finite sample probability guarantees for optimal solutions. The practicality and effectiveness of our approach are illustrated with numerical experiments on portfolio management and queuing system problems. Overall, our approach offers a promising solution to distributionally robust chance constrained problems and has potential applications in other fields.
△ Less
Submitted 22 June, 2023;
originally announced June 2023.
-
Decay of geometry for a class of cubic polynomials
Authors:
Haoyang Ji,
Wenxiu Ma
Abstract:
In this paper we study a class of bimodal cubic polynomials for which its critical points have the same $ω$-limit set which is an invariant Cantor set. These maps have generalized Fibonacci combinatorics in terms of generalized renormalization on the twin principal nest. It is proved that such maps possess `decay of geometry' in the sense that the scaling factor of the twin principal nest decrease…
▽ More
In this paper we study a class of bimodal cubic polynomials for which its critical points have the same $ω$-limit set which is an invariant Cantor set. These maps have generalized Fibonacci combinatorics in terms of generalized renormalization on the twin principal nest. It is proved that such maps possess `decay of geometry' in the sense that the scaling factor of the twin principal nest decreases at least exponentially fast. As an application, we prove that they have no Cantor attractor.
△ Less
Submitted 7 December, 2024; v1 submitted 20 April, 2023;
originally announced April 2023.
-
Efficient Solution of Bimaterial Riemann Problems for Compressible Multi-Material Flow Simulations
Authors:
Wentao Ma,
Xuning Zhao,
Shafquat Islam,
Aditya Narkhede,
Kevin Wang
Abstract:
When solving compressible multi-material flow problems, an unresolved challenge is the computation of advective fluxes across material interfaces that separate drastically different thermodynamic states and relations. A popular idea in this regard is to locally construct bimaterial Riemann problems, and to apply their exact solutions in flux computation. For general equations of state, however, fi…
▽ More
When solving compressible multi-material flow problems, an unresolved challenge is the computation of advective fluxes across material interfaces that separate drastically different thermodynamic states and relations. A popular idea in this regard is to locally construct bimaterial Riemann problems, and to apply their exact solutions in flux computation. For general equations of state, however, finding the exact solution of a Riemann problem is expensive as it requires nested loops. Multiplied by the large number of Riemann problems constructed during a simulation, the computational cost often becomes prohibitive. The work presented in this paper aims to accelerate the solution of bimaterial Riemann problems without introducing approximations or offline precomputation tasks. The basic idea is to exploit some special properties of the Riemann problem equations, and to recycle previous solutions as much as possible. Following this idea, four acceleration methods are developed, including (1) a change of integration variable through rarefaction fans, (2) storing and reusing integration trajectory data, (3) step size adaptation, and (4) constructing an R-tree on the fly to generate initial guesses. The performance of these acceleration methods are assessed using four example problems in underwater explosion, laser-induced cavitation, and hypervelocity impact. These problems exhibit strong shock waves, large interface deformation, contact of multiple (>2) interfaces, and interaction between gases and condensed matters. In these challenging cases, the solution of bimaterial Riemann problems is accelerated by 37 to 87 times. As a result, the total cost of advective flux computation, which includes the exact Riemann problem solution at material interfaces and the numerical flux calculation over the entire computational domain, is accelerated by 18 to 81 times.
△ Less
Submitted 22 August, 2023; v1 submitted 15 March, 2023;
originally announced March 2023.
-
Noda Iteration for Computing Generalized Tensor Eigenpairs
Authors:
Wanli Ma,
Weiyang Ding,
Yimin Wei
Abstract:
In this paper, we propose the tensor Noda iteration (NI) and its inexact version for solving the eigenvalue problem of a particular class of tensor pairs called generalized $\mathcal{M}$-tensor pairs. A generalized $\mathcal{M}$-tensor pair consists of a weakly irreducible nonnegative tensor and a nonsingular $\mathcal{M}$-tensor within a linear combination. It is shown that any generalized…
▽ More
In this paper, we propose the tensor Noda iteration (NI) and its inexact version for solving the eigenvalue problem of a particular class of tensor pairs called generalized $\mathcal{M}$-tensor pairs. A generalized $\mathcal{M}$-tensor pair consists of a weakly irreducible nonnegative tensor and a nonsingular $\mathcal{M}$-tensor within a linear combination. It is shown that any generalized $\mathcal{M}$-tensor pair admits a unique positive generalized eigenvalue with a positive eigenvector. A modified tensor Noda iteration(MTNI) is developed for extending the Noda iteration for nonnegative matrix eigenproblems. In addition, the inexact generalized tensor Noda iteration method (IGTNI) and the generalized Newton-Noda iteration method (GNNI) are also introduced for more efficient implementations and faster convergence. Under a mild assumption on the initial values, the convergence of these algorithms is guaranteed. The efficiency of these algorithms is illustrated by numerical experiments.
△ Less
Submitted 2 March, 2023;
originally announced March 2023.
-
From Contextual Data to Newsvendor Decisions: On the Actual Performance of Data-Driven Algorithms
Authors:
Omar Besbes,
Will Ma,
Omar Mouchtaki
Abstract:
In this work, we explore a framework for contextual decision-making to study how the relevance and quantity of past data affects the performance of a data-driven policy. We analyze a contextual Newsvendor problem in which a decision-maker needs to trade-off between an underage and an overage cost in the face of uncertain demand. We consider a setting in which past demands observed under ``close by…
▽ More
In this work, we explore a framework for contextual decision-making to study how the relevance and quantity of past data affects the performance of a data-driven policy. We analyze a contextual Newsvendor problem in which a decision-maker needs to trade-off between an underage and an overage cost in the face of uncertain demand. We consider a setting in which past demands observed under ``close by'' contexts come from close by distributions and analyze the performance of data-driven algorithms through a notion of context-dependent worst-case expected regret. We analyze the broad class of Weighted Empirical Risk Minimization (WERM) policies which weigh past data according to their similarity in the contextual space. This class includes classical policies such as ERM, k-Nearest Neighbors and kernel-based policies. Our main methodological contribution is to characterize exactly the worst-case regret of any WERM policy on any given configuration of contexts. To the best of our knowledge, this provides the first understanding of tight performance guarantees in any contextual decision-making problem, with past literature focusing on upper bounds via concentration inequalities. We instead take an optimization approach, and isolate a structure in the Newsvendor loss function that allows to reduce the infinite-dimensional optimization problem over worst-case distributions to a simple line search.
This in turn allows us to unveil fundamental insights that were obfuscated by previous general-purpose bounds. We characterize actual guaranteed performance as a function of the contexts, as well as granular insights on the learning curve of algorithms.
△ Less
Submitted 24 December, 2024; v1 submitted 16 February, 2023;
originally announced February 2023.
-
Theory of generating spaces of convex sets and their applications to solvability of convex programs in Banach spaces
Authors:
Lixin Cheng,
Weihao Mao
Abstract:
When optimization theorists consider optimization problems in infinite dimensional spaces, they need to deal with closed convex subsets(usually cones) which mostly have empty interior. These subsets often prevent optimization theorists from applying powerful techniques to study these optimization problems. In this paper, by nonsupport point, we present generating spaces which are relative to a Ban…
▽ More
When optimization theorists consider optimization problems in infinite dimensional spaces, they need to deal with closed convex subsets(usually cones) which mostly have empty interior. These subsets often prevent optimization theorists from applying powerful techniques to study these optimization problems. In this paper, by nonsupport point, we present generating spaces which are relative to a Banach space and a nonsupport point of its convex closed subset. Then for optimization problems in infinite dimensional spaces, in some general cases, we replace original spaces by generating spaces while containing solutions. Thus this method enable us to apply powerful classical techniques to optimization problems in very general class of infinite dimensional spaces. Based on functional analysis, from classical Banach spaces to separable Banach spaces, from Banach lattice to latticization, we give characterizations of generating spaces and conclude that they are actually linearly isometric to $L_\infty$($\ell _\infty$) or their closed subspaces. Thus continuous linear functional involved in these techniques could be chosen from $L_\infty^*$($\ell_\infty^*$). After that, applications in Penalty principle, Lagrange duality and scalarization function are further studied by this method.
△ Less
Submitted 18 October, 2022;
originally announced October 2022.
-
Degeneracy is OK: Logarithmic Regret for Network Revenue Management with Indiscrete Distributions
Authors:
Jiashuo Jiang,
Will Ma,
Jiawei Zhang
Abstract:
We study the classical Network Revenue Management (NRM) problem with accept/reject decisions and $T$ IID arrivals. We consider a distributional form where each arrival must fall under a finite number of possible categories, each with a deterministic resource consumption vector, but a random value distributed continuously over an interval. We develop an online algorithm that achieves $O(\log^2 T)$…
▽ More
We study the classical Network Revenue Management (NRM) problem with accept/reject decisions and $T$ IID arrivals. We consider a distributional form where each arrival must fall under a finite number of possible categories, each with a deterministic resource consumption vector, but a random value distributed continuously over an interval. We develop an online algorithm that achieves $O(\log^2 T)$ regret under this model, with the only (necessary) assumption being that the probability densities are bounded away from 0. We derive a second result that achieves $O(\log T)$ regret under an additional assumption of second-order growth. To our knowledge, these are the first results achieving logarithmic-level regret in an NRM model with continuous values that do not require any kind of "non-degeneracy" assumptions. Our results are achieved via new techniques including a new method of bounding myopic regret, a "semi-fluid" relaxation of the offline allocation, and an improved bound on the "dual convergence".
△ Less
Submitted 2 January, 2025; v1 submitted 14 October, 2022;
originally announced October 2022.
-
On (Random-order) Online Contention Resolution Schemes for the Matching Polytope of (Bipartite) Graphs
Authors:
Calum MacRury,
Will Ma,
Nathaniel Grammel
Abstract:
Online Contention Resolution Schemes (OCRS's) represent a modern tool for selecting a subset of elements, subject to resource constraints, when the elements are presented to the algorithm sequentially. OCRS's have led to some of the best-known competitive ratio guarantees for online resource allocation problems, with the added benefit of treating different online decisions -- accept/reject, probin…
▽ More
Online Contention Resolution Schemes (OCRS's) represent a modern tool for selecting a subset of elements, subject to resource constraints, when the elements are presented to the algorithm sequentially. OCRS's have led to some of the best-known competitive ratio guarantees for online resource allocation problems, with the added benefit of treating different online decisions -- accept/reject, probing, pricing -- in a unified manner. This paper analyzes OCRS's for resource constraints defined by matchings in graphs, a fundamental structure in combinatorial optimization. We consider two dimensions of variants: the elements being presented in adversarial or random order; and the graph being bipartite or general. We improve the state of the art for all combinations of variants, both in terms of algorithmic guarantees and impossibility results. Some of our algorithmic guarantees are best-known even compared to Contention Resolution Schemes that can choose the order of arrival or are offline. All in all, our results for OCRS directly improve the best-known competitive ratios for online accept/reject, probing, and pricing problems on graphs in a unified manner.
△ Less
Submitted 1 April, 2024; v1 submitted 15 September, 2022;
originally announced September 2022.
-
Optimal Regularized Online Allocation by Adaptive Re-Solving
Authors:
Wanteng Ma,
Ying Cao,
Danny H. K. Tsang,
Dong Xia
Abstract:
This paper introduces a dual-based algorithm framework for solving the regularized online resource allocation problems, which have potentially non-concave cumulative rewards, hard resource constraints, and a non-separable regularizer. Under a strategy of adaptively updating the resource constraints, the proposed framework only requests approximate solutions to the empirical dual problems up to a c…
▽ More
This paper introduces a dual-based algorithm framework for solving the regularized online resource allocation problems, which have potentially non-concave cumulative rewards, hard resource constraints, and a non-separable regularizer. Under a strategy of adaptively updating the resource constraints, the proposed framework only requests approximate solutions to the empirical dual problems up to a certain accuracy and yet delivers an optimal logarithmic regret under a locally second-order growth condition. Surprisingly, a delicate analysis of the dual objective function enables us to eliminate the notorious log-log factor in regret bound. The flexible framework renders renowned and computationally fast algorithms immediately applicable, e.g., dual stochastic gradient descent. Additionally, an infrequent re-solving scheme is proposed, which significantly reduces computational demands without compromising the optimal regret performance. A worst-case square-root regret lower bound is established if the resource constraints are not adaptively updated during dual optimization, which underscores the critical role of adaptive dual variable update. Comprehensive numerical experiments demonstrate the merits of the proposed algorithm framework.
△ Less
Submitted 15 July, 2023; v1 submitted 1 September, 2022;
originally announced September 2022.
-
Drone-Delivery Network for Opioid Overdose -- Nonlinear Integer Queueing-Optimization Models and Methods
Authors:
Miguel Lejeune,
Wenbo Ma
Abstract:
We propose a new stochastic emergency network design model that uses a fleet of drones to quickly deliver naxolone in response to opioid overdoses. The network is represented as a collection of M/G/K queuing systems in which the capacity K of each system is a decision variable and the service time is modelled as a decision-dependent random variable. The model is an optimization-based queuing probl…
▽ More
We propose a new stochastic emergency network design model that uses a fleet of drones to quickly deliver naxolone in response to opioid overdoses. The network is represented as a collection of M/G/K queuing systems in which the capacity K of each system is a decision variable and the service time is modelled as a decision-dependent random variable. The model is an optimization-based queuing problem which locates fixed (drone bases) and mobile (drones) servers and determines the drone dispatching decisions, and takes the form of a nonlinear integer problem, which is intractable in its original form. We develop an efficient reformulation and algorithmic framework. Our approach reformulates the multiple nonlinearities (fractional, polynomial, exponential, factorial terms) to give a mixed-integer linear programming (MILP) formulation. We demonstrate its generalizablity and show that the problem of minimizing the average response time of a network of M/G/K queuing systems with unknown capacity K is always MILP-representable. We design two algorithms and demonstrate that the outer approximation branch-and-cut method is the most efficient and scales well. The analysis based on real-life overdose data reveals that drones can in Virginia Beach: 1) decrease the response time by 78%, 2) increase the survival chance by 432%, 3) save up to 34 additional lives per year, and 4) provide annually up to 287 additional quality-adjusted life years.
△ Less
Submitted 25 January, 2024; v1 submitted 28 June, 2022;
originally announced June 2022.
-
Online Bipartite Matching with Advice: Tight Robustness-Consistency Tradeoffs for the Two-Stage Model
Authors:
Billy Jin,
Will Ma
Abstract:
Two-stage bipartite matching is a fundamental problem of optimization under uncertainty introduced by Feng, Niazadeh, and Saberi (2021), who study it under the stochastic and adversarial paradigms of uncertainty. We propose a method to interpolate between these paradigms, using the Algorithms with Predictions (ALPS) framework. To elaborate, given some form of information (e.g. a distributional pre…
▽ More
Two-stage bipartite matching is a fundamental problem of optimization under uncertainty introduced by Feng, Niazadeh, and Saberi (2021), who study it under the stochastic and adversarial paradigms of uncertainty. We propose a method to interpolate between these paradigms, using the Algorithms with Predictions (ALPS) framework. To elaborate, given some form of information (e.g. a distributional prediction) about the uncertainty, we consider the optimal decision assuming that information is correct to be some "advice", whose accuracy is unknown. In the ALPS framework, we define Consistency to be an algorithm's performance relative to the advice, and Robustness to be an algorithm's performance relative to the hindsight-optimal decision. We characterize the tight tradeoff between Consistency and Robustness for four settings of two-stage matching: unweighted, vertex-weighted, edge-weighted, and fractional budgeted allocation. Additionally, we show our algorithm achieves state-of-the-art performance in both synthetic and real-data simulations.
△ Less
Submitted 4 November, 2024; v1 submitted 22 June, 2022;
originally announced June 2022.
-
Beyond IID: data-driven decision-making in heterogeneous environments
Authors:
Omar Besbes,
Will Ma,
Omar Mouchtaki
Abstract:
How should one leverage historical data when past observations are not perfectly indicative of the future, e.g., due to the presence of unobserved confounders which one cannot "correct" for? Motivated by this question, we study a data-driven decision-making framework in which historical samples are generated from unknown and different distributions assumed to lie in a heterogeneity ball with known…
▽ More
How should one leverage historical data when past observations are not perfectly indicative of the future, e.g., due to the presence of unobserved confounders which one cannot "correct" for? Motivated by this question, we study a data-driven decision-making framework in which historical samples are generated from unknown and different distributions assumed to lie in a heterogeneity ball with known radius and centered around the (also) unknown future (out-of-sample) distribution on which the performance of a decision will be evaluated. This work aims at analyzing the performance of central data-driven policies but also near-optimal ones in these heterogeneous environments and understanding key drivers of performance. We establish a first result which allows to upper bound the asymptotic worst-case regret of a broad class of policies. Leveraging this result, for any integral probability metric, we provide a general analysis of the performance achieved by Sample Average Approximation (SAA) as a function of the radius of the heterogeneity ball. This analysis is centered around the approximation parameter, a notion of complexity we introduce to capture how the interplay between the heterogeneity and the problem structure impacts the performance of SAA. In turn, we illustrate through several widely-studied problems -- e.g., newsvendor, pricing -- how this methodology can be applied and find that the performance of SAA varies considerably depending on the combinations of problem classes and heterogeneity. The failure of SAA for certain instances motivates the design of alternative policies to achieve rate-optimality. We derive problem-dependent policies achieving strong guarantees for the illustrative problems described above and provide initial results towards a principled approach for the design and analysis of general rate-optimal algorithms.
△ Less
Submitted 1 January, 2025; v1 submitted 20 June, 2022;
originally announced June 2022.
-
Estimating probabilistic dynamic origin-destination demands using multi-day traffic data on computational graphs
Authors:
Wei Ma,
Sean Qian
Abstract:
System-level decision making in transportation needs to understand day-to-day variation of network flows, which calls for accurate modeling and estimation of probabilistic dynamic travel demand on networks. Most existing studies estimate deterministic dynamic origin-destination (OD) demand, while the day-to-day variation of demand and flow is overlooked. Estimating probabilistic distributions of d…
▽ More
System-level decision making in transportation needs to understand day-to-day variation of network flows, which calls for accurate modeling and estimation of probabilistic dynamic travel demand on networks. Most existing studies estimate deterministic dynamic origin-destination (OD) demand, while the day-to-day variation of demand and flow is overlooked. Estimating probabilistic distributions of dynamic OD demand is challenging due to the complexity of the spatio-temporal networks and the computational intensity of the high-dimensional problems. With the availability of massive traffic data and the emergence of advanced computational methods, this paper develops a data-driven framework that solves the probabilistic dynamic origin-destination demand estimation (PDODE) problem using multi-day data. Different statistical distances (e.g., lp-norm, Wasserstein distance, KL divergence, Bhattacharyya distance) are used and compared to measure the gap between the estimated and the observed traffic conditions, and it is found that 2-Wasserstein distance achieves a balanced accuracy in estimating both mean and standard deviation. The proposed framework is cast into the computational graph and a reparametrization trick is developed to estimate the mean and standard deviation of the probabilistic dynamic OD demand simultaneously. We demonstrate the effectiveness and efficiency of the proposed PDODE framework on both small and real-world networks. In particular, it is demonstrated that the proposed PDODE framework can mitigate the overfitting issues by considering the demand variation. Overall, the developed PDODE framework provides a practical tool for public agencies to understand the sources of demand stochasticity, evaluate day-to-day variation of network flow, and make reliable decisions for intelligent transportation systems.
△ Less
Submitted 20 April, 2022;
originally announced April 2022.
-
Optimization of bus scheduling and bus-berth matching at curbside stops under connected vehicle environment
Authors:
Wanjing Ma,
Shiqi Ou,
Chunhui Yu
Abstract:
It is commonly seen that buses are blocked by the ones in front serving passengers and have to queue outside a curbside bus stop although there are vacant berths at the stop. The resultant bus delays degrade the service level of urban public transportation. A potential solution is to reschedule the arrivals of the buses at the stop for full utilization of the berths with the aid of connected vehic…
▽ More
It is commonly seen that buses are blocked by the ones in front serving passengers and have to queue outside a curbside bus stop although there are vacant berths at the stop. The resultant bus delays degrade the service level of urban public transportation. A potential solution is to reschedule the arrivals of the buses at the stop for full utilization of the berths with the aid of connected vehicle technologies. This study proposes a mixed-integer linear programming model to optimize the scheduling of bus arrivals and the bus-berth matching at a curbside stop under connected vehicle environment. The objective is the minimization of the bus delays weighted by the number of passengers on the buses. Bus arrival times at the stop and the assignment of berths are optimized together with bus departure times from the stop. Bus punctuality is also taken into consideration. The proposed model could be applied dynamically to cater to time-varying traffic conditions. Numerical studies validate the advantages of the proposed model over the first-come-first-service strategy and the relaxed model without bus punctuality in terms of weighted bus delays and bus punctuality. Sensitivity analyses show that: 1) the proposed model is robust to the fluctuation of bus service time; and 2) a smaller number of berths may be preferred on condition that the bus demand does not exceed the stop capacity.
△ Less
Submitted 29 November, 2021; v1 submitted 22 June, 2021;
originally announced June 2021.
-
Optimizing for Strategy Diversity in the Design of Video Games
Authors:
Oussama Hanguir,
Will Ma,
Christopher Thomas Ryan,
Jiangze Han
Abstract:
We consider the problem of designing a linear program that has diverse solutions as the right-hand side varies. This problem arises in video game settings where designers aim to have players use different "weapons" or "tactics" as they progress. We model this design question as a choice over the constraint matrix $A$ and cost vector $c$ to maximize the number of possible \emph{supports} of unique…
▽ More
We consider the problem of designing a linear program that has diverse solutions as the right-hand side varies. This problem arises in video game settings where designers aim to have players use different "weapons" or "tactics" as they progress. We model this design question as a choice over the constraint matrix $A$ and cost vector $c$ to maximize the number of possible \emph{supports} of unique optimal solutions (what we call "loadouts") of Linear Programs $\max\{c^\top x \mid Ax \le b, x \ge 0\}$ with nonnegative data considered over all resource vectors $b$. We provide an upper bound on the optimal number of loadouts and provide a family of constructions that have an asymptotically optimal number of loadouts. The upper bound is based on a connection between our problem and the study of triangulations of point sets arising from polyhedral combinatorics, and specifically the combinatorics of the cyclic polytope. Our asymptotically optimal construction also draws inspiration from the properties of the cyclic polytope.
△ Less
Submitted 30 June, 2024; v1 submitted 22 June, 2021;
originally announced June 2021.
-
The convergence rate of of multivariate operators on simplex in Orlicz space
Authors:
Wan Ma,
Lihong Chang,
Yongxia Qiang
Abstract:
The approximation of functions in Orlicz space by multivariate operators on simplex is considered. The convergence rate is given by using modulus of smoothness.
The approximation of functions in Orlicz space by multivariate operators on simplex is considered. The convergence rate is given by using modulus of smoothness.
△ Less
Submitted 21 January, 2021;
originally announced January 2021.
-
A $\dbar$-steepest descent method for oscillatory Riemann-Hilbert problems
Authors:
Fudong Wang,
Wen-Xiu Ma
Abstract:
We study the asymptotic behavior of Riemann-Hilbert problems (RHP) arising in the AKNS hierarchy of integrable equations. Our analysis is based on the $\dbar$-steepest descent method. We consider RHPs arising from the inverse scattering transform of the AKNS hierarchy with $H^{1,1}(\R)$ initial data. The analysis will be divided into three regions: fast decay region, oscillating region and self-si…
▽ More
We study the asymptotic behavior of Riemann-Hilbert problems (RHP) arising in the AKNS hierarchy of integrable equations. Our analysis is based on the $\dbar$-steepest descent method. We consider RHPs arising from the inverse scattering transform of the AKNS hierarchy with $H^{1,1}(\R)$ initial data. The analysis will be divided into three regions: fast decay region, oscillating region and self-similarity region (the Painlevé region). The resulting formulas can be directly applied to study the long-time asymptotic of the solutions of integrable equations such as NLS, mKdV and their higher-order generalizations.
△ Less
Submitted 11 June, 2021; v1 submitted 28 November, 2020;
originally announced November 2020.
-
A general theory of regression adjustment for covariate-adaptive randomization: OLS, Lasso, and beyond
Authors:
Hanzhong Liu,
Fuyi Tu,
Wei Ma
Abstract:
We consider the problem of estimating and inferring treatment effects in randomized experiments. In practice, stratified randomization, or more generally, covariate-adaptive randomization, is routinely used in the design stage to balance the treatment allocations with respect to a few variables that are most relevant to the outcomes. Then, regression is performed in the analysis stage to adjust th…
▽ More
We consider the problem of estimating and inferring treatment effects in randomized experiments. In practice, stratified randomization, or more generally, covariate-adaptive randomization, is routinely used in the design stage to balance the treatment allocations with respect to a few variables that are most relevant to the outcomes. Then, regression is performed in the analysis stage to adjust the remaining imbalances to yield more efficient treatment effect estimators. Building upon and unifying the recent results obtained for ordinary least squares adjusted estimators under covariate-adaptive randomization, this paper presents a general theory of regression adjustment that allows for arbitrary model misspecification and the presence of a large number of baseline covariates. We exemplify the theory on two Lasso-adjusted treatment effect estimators, both of which are optimal in their respective classes. In addition, nonparametric consistent variance estimators are proposed to facilitate valid inferences, which work irrespective of the specific randomization methods used. The robustness and improved efficiency of the proposed estimators are demonstrated through a simulation study and a clinical trial example. This study sheds light on improving treatment effect estimation efficiency by implementing machine learning methods in covariate-adaptive randomized experiments.
△ Less
Submitted 19 November, 2020;
originally announced November 2020.
-
Batalin--Vilkovisky algebra structures on the Hochschild cohomology of generalized Weyl algebras
Authors:
Liyu Liu,
Wen Ma
Abstract:
This paper is devoted to the calculation of Batalin-Vilkovisky algebra structures on the Hochschild cohomology of skew Calabi-Yau generalized Weyl algebras. We firstly establish a Van den Bergh duality at the level of complex. Then based on the results of Solotar et al., we apply Kowalzig and Krähmer's method to the Hochschild homology of generalized Weyl algebras, and translate the homological in…
▽ More
This paper is devoted to the calculation of Batalin-Vilkovisky algebra structures on the Hochschild cohomology of skew Calabi-Yau generalized Weyl algebras. We firstly establish a Van den Bergh duality at the level of complex. Then based on the results of Solotar et al., we apply Kowalzig and Krähmer's method to the Hochschild homology of generalized Weyl algebras, and translate the homological information into cohomological one by virtue of the Van den Bergh duality, obtaining the desired Batalin-Vilkovisky algebra structures. Finally, we apply our results to quantum weighted projective lines and Podleś quantum spheres, and the Batalin-Vilkovisky algebra structures for them are described completely.
△ Less
Submitted 12 October, 2021; v1 submitted 12 September, 2020;
originally announced September 2020.
-
Testing for Treatment Effect in Covariate-Adaptive Randomized Clinical Trials with Generalized Linear Models and Omitted Covariates
Authors:
Li Yang,
Wei Ma,
Yichen Qin,
Feifang Hu
Abstract:
Concerns have been expressed over the validity of statistical inference under covariate-adaptive randomization despite the extensive use in clinical trials. In the literature, the inferential properties under covariate-adaptive randomization have been mainly studied for continuous responses; in particular, it is well known that the usual two sample t-test for treatment effect is typically conserva…
▽ More
Concerns have been expressed over the validity of statistical inference under covariate-adaptive randomization despite the extensive use in clinical trials. In the literature, the inferential properties under covariate-adaptive randomization have been mainly studied for continuous responses; in particular, it is well known that the usual two sample t-test for treatment effect is typically conservative, in the sense that the actual test size is smaller than the nominal level. This phenomenon of invalid tests has also been found for generalized linear models without adjusting for the covariates and are sometimes more worrisome due to inflated Type I error. The purpose of this study is to examine the unadjusted test for treatment effect under generalized linear models and covariate-adaptive randomization. For a large class of covariate-adaptive randomization methods, we obtain the asymptotic distribution of the test statistic under the null hypothesis and derive the conditions under which the test is conservative, valid, or anti-conservative. Several commonly used generalized linear models, such as logistic regression and Poisson regression, are discussed in detail. An adjustment method is also proposed to achieve a valid size based on the asymptotic results. Numerical studies confirm the theoretical findings and demonstrate the effectiveness of the proposed adjustment method.
△ Less
Submitted 2 May, 2021; v1 submitted 9 September, 2020;
originally announced September 2020.
-
Managing connected and automated vehicles with flexible routing at "lane-allocation-free'' intersections
Authors:
Wanjing Ma,
Ruochen Hao,
Chunhui Yu,
Tuo Sun,
Bart van Arem
Abstract:
Trajectory planning and coordination for connected and automated vehicles (CAVs) have been studied at isolated ``signal-free'' intersections and in ``signal-free'' corridors under the fully CAV environment in the literature. Most of the existing studies are based on the definition of approaching and exit lanes. The route a vehicle takes to pass through an intersection is determined from its moveme…
▽ More
Trajectory planning and coordination for connected and automated vehicles (CAVs) have been studied at isolated ``signal-free'' intersections and in ``signal-free'' corridors under the fully CAV environment in the literature. Most of the existing studies are based on the definition of approaching and exit lanes. The route a vehicle takes to pass through an intersection is determined from its movement. That is, only the origin and destination arms are included. This study proposes a mixed-integer linear programming (MILP) model to optimize vehicle trajectories at an isolated ``signal-free'' intersection without lane allocation, which is denoted as ``lane-allocation-free'' (LAF) control. Each lane can be used as both approaching and exit lanes for all vehicle movements including left-turn, through, and right-turn. A vehicle can take a flexible route by way of multiple arms to pass through the intersection. In this way, the spatial-temporal resources are expected to be fully utilized. The interactions between vehicle trajectories are modeled explicitly at the microscopic level. Vehicle routes and trajectories (i.e., car-following and lane-changing behaviors) at the intersection are optimized in one unified framework for system optimality in terms of total vehicle delay. Considering varying traffic conditions, the planning horizon is adaptively adjusted in the implementation procedure of the proposed model to make a balance between solution feasibility and computational burden. Numerical studies validate the advantages of the proposed LAF control in terms of both vehicle delay and throughput with different demand structures and temporal safety gaps.
△ Less
Submitted 24 August, 2020;
originally announced August 2020.
-
Understanding Notions of Stationarity in Non-Smooth Optimization
Authors:
Jiajin Li,
Anthony Man-Cho So,
Wing-Kin Ma
Abstract:
Many contemporary applications in signal processing and machine learning give rise to structured non-convex non-smooth optimization problems that can often be tackled by simple iterative methods quite effectively. One of the keys to understanding such a phenomenon---and, in fact, one of the very difficult conundrums even for experts---lie in the study of "stationary points" of the problem in quest…
▽ More
Many contemporary applications in signal processing and machine learning give rise to structured non-convex non-smooth optimization problems that can often be tackled by simple iterative methods quite effectively. One of the keys to understanding such a phenomenon---and, in fact, one of the very difficult conundrums even for experts---lie in the study of "stationary points" of the problem in question. Unlike smooth optimization, for which the definition of a stationary point is rather standard, there is a myriad of definitions of stationarity in non-smooth optimization. In this article, we give an introduction to different stationarity concepts for several important classes of non-convex non-smooth functions and discuss the geometric interpretations and further clarify the relationship among these different concepts. We then demonstrate the relevance of these constructions in some representative applications and how they could affect the performance of iterative methods for tackling these applications.
△ Less
Submitted 26 June, 2020;
originally announced June 2020.
-
Coupled Control Systems: Periodic Orbit Generation with Application to Quadrupedal Locomotion
Authors:
Wen-Loong Ma,
Noel Csomay-Shanklin,
Aaron D. Ames
Abstract:
A robotic system can be viewed as a collection of lower-dimensional systems that are coupled via reaction forces (Lagrange multipliers) enforcing holonomic constraints. Inspired by this viewpoint, this paper presents a novel formulation for nonlinear control systems that are subject to coupling constraints via virtual "coupling" inputs that abstractly play the role of Lagrange multipliers. The mai…
▽ More
A robotic system can be viewed as a collection of lower-dimensional systems that are coupled via reaction forces (Lagrange multipliers) enforcing holonomic constraints. Inspired by this viewpoint, this paper presents a novel formulation for nonlinear control systems that are subject to coupling constraints via virtual "coupling" inputs that abstractly play the role of Lagrange multipliers. The main contribution of this paper is a process---mirroring solving for Lagrange multipliers in robotic systems---wherein we isolate subsystems free of coupling constraints that provably encode the full-order dynamics of the coupled control system from which it was derived. This dimension reduction is leveraged in the formulation of a nonlinear optimization problem for the isolated subsystem that yields periodic orbits for the full-order coupled system. We consider the application of these ideas to robotic systems, which can be decomposed into subsystems. Specifically, we view a quadruped as a coupled control system consisting of two bipedal robots, wherein applying the framework developed allows for gaits (periodic orbits) to be generated for the individual biped yielding a gait for the full-order quadruped. This is demonstrated through walking experiments of a quadrupedal robot in simulation and on rough terrains.
△ Less
Submitted 18 March, 2020;
originally announced March 2020.
-
A Distributionally Robust Area Under Curve Maximization Model
Authors:
Wenbo Ma,
Miguel A. Lejeune
Abstract:
Area under ROC curve (AUC) is a widely used performance measure for classification models. We propose two new distributionally robust AUC maximization models (DR-AUC) that rely on the Kantorovich metric and approximate the AUC with the hinge loss function. We consider the two cases with respectively fixed and variable support for the worst-case distribution. We use duality theory to reformulate th…
▽ More
Area under ROC curve (AUC) is a widely used performance measure for classification models. We propose two new distributionally robust AUC maximization models (DR-AUC) that rely on the Kantorovich metric and approximate the AUC with the hinge loss function. We consider the two cases with respectively fixed and variable support for the worst-case distribution. We use duality theory to reformulate the DR-AUC models and derive tractable convex optimization problems. The numerical experiments show that the proposed DR-AUC models -- benchmarked with the standard deterministic AUC and the support vector machine models - perform better in general and in particular improve the worst-case out-of-sample performance over the majority of the considered datasets, thereby showing their robustness. The results are particularly encouraging since our numerical experiments are conducted with training sets of small size which have been known to be conducive to low out-of-sample performance.
△ Less
Submitted 7 May, 2020; v1 submitted 17 February, 2020;
originally announced February 2020.
-
Hybrid Inexact BCD for Coupled Structured Matrix Factorization in Hyperspectral Super-Resolution
Authors:
Ruiyuan Wu,
Hoi-To Wai,
Wing-Kin Ma
Abstract:
This paper develops a first-order optimization method for coupled structured matrix factorization (CoSMF) problems that arise in the context of hyperspectral super-resolution (HSR) in remote sensing. To best leverage the problem structures for computational efficiency, we introduce a hybrid inexact block coordinate descent (HiBCD) scheme wherein one coordinate is updated via the fast proximal grad…
▽ More
This paper develops a first-order optimization method for coupled structured matrix factorization (CoSMF) problems that arise in the context of hyperspectral super-resolution (HSR) in remote sensing. To best leverage the problem structures for computational efficiency, we introduce a hybrid inexact block coordinate descent (HiBCD) scheme wherein one coordinate is updated via the fast proximal gradient (FPG) method, while another via the Frank-Wolfe (FW) method. The FPG-type methods are known to take less number of iterations to converge, by numerical experience, while the FW-type methods can offer lower per-iteration complexity in certain cases; and we wish to take the best of both. We show that the limit points of this HiBCD scheme are stationary. Our proof treats HiBCD as an optimization framework for a class of multi-block structured optimization problems, and our stationarity claim is applicable not only to CoSMF but also to many other problems. Previous optimization research showed the same stationarity result for inexact block coordinate descent with either FPG or FW updates only. Numerical results indicate that the proposed HiBCD scheme is computationally much more efficient than the state-of-the-art CoSMF schemes in HSR.
△ Less
Submitted 20 February, 2020; v1 submitted 19 September, 2019;
originally announced September 2019.
-
Multi-stage and Multi-customer Assortment Optimization with Inventory Constraints
Authors:
Elaheh Fata,
Will Ma,
David Simchi-Levi
Abstract:
We consider an assortment optimization problem where a customer chooses a single item from a sequence of sets shown to her, while limited inventories constrain the items offered to customers over time. In the special case where all of the assortments have size one, our problem captures the online stochastic matching with timeouts problem. For this problem, we derive a polynomial-time approximation…
▽ More
We consider an assortment optimization problem where a customer chooses a single item from a sequence of sets shown to her, while limited inventories constrain the items offered to customers over time. In the special case where all of the assortments have size one, our problem captures the online stochastic matching with timeouts problem. For this problem, we derive a polynomial-time approximation algorithm which earns at least 1-ln(2-1/e), or 0.51, of the optimum. This improves upon the previous-best approximation ratio of 0.46, and furthermore, we show that it is tight. For the general assortment problem, we establish the first constant-factor approximation ratio of 0.09 for the case that different types of customers value items differently, and an approximation ratio of 0.15 for the case that different customers value each item the same. Our algorithms are based on rounding an LP relaxation for multi-stage assortment optimization, and improve upon previous randomized rounding schemes to derive the tight ratio of 1-ln(2-1/e).
△ Less
Submitted 26 July, 2020; v1 submitted 26 August, 2019;
originally announced August 2019.
-
Long-time asymptotic behaviour for the fifth order modified Korteweg-de Vries equation
Authors:
Fudong Wang,
Wen-Xiu Ma
Abstract:
Following Deift-Zhou's nonlinear steepest descent method, the long-time asymptotic behavior for the Cauchy problem of the 5th order modified Korteweg-de Vries equation is analyzed. Based on the inverse scattering transform, the 5th order MKdV is transformed to a 2 by 2 oscillatory Riemann-Hilbert problem, then by manipulating the Cauchy operator and reducing the degree of the phase function, the l…
▽ More
Following Deift-Zhou's nonlinear steepest descent method, the long-time asymptotic behavior for the Cauchy problem of the 5th order modified Korteweg-de Vries equation is analyzed. Based on the inverse scattering transform, the 5th order MKdV is transformed to a 2 by 2 oscillatory Riemann-Hilbert problem, then by manipulating the Cauchy operator and reducing the degree of the phase function, the long-time asymptotics of the solution is given in terms of solutions of the parabolic cylinder equation.
△ Less
Submitted 30 July, 2019;
originally announced July 2019.
-
Online Matching Frameworks under Stochastic Rewards, Product Ranking, and Unknown Patience
Authors:
Brian Brubach,
Nathaniel Grammel,
Will Ma,
Aravind Srinivasan
Abstract:
We study generalizations of online bipartite matching in which each arriving vertex (customer) views a ranked list of offline vertices (products) and matches to (purchases) the first one they deem acceptable. The number of products that the customer has patience to view can be stochastic and dependent on the products seen. We develop a framework that views the interaction with each customer as an…
▽ More
We study generalizations of online bipartite matching in which each arriving vertex (customer) views a ranked list of offline vertices (products) and matches to (purchases) the first one they deem acceptable. The number of products that the customer has patience to view can be stochastic and dependent on the products seen. We develop a framework that views the interaction with each customer as an abstract resource consumption process, and derive new results for these online matching problems under the adversarial, non-stationary, and IID arrival models, assuming we can (approximately) solve the product ranking problem for each single customer. To that end, we show new results for product ranking under two cascade-click models: an optimal algorithm when each item has its own hazard rate for making the customer depart, and a 1/2-approximate algorithm when the customer has a general item-independent patience distribution. We also present a constant-factor 0.027-approximate algorithm in a new model where items are not initially available and arrive over time. We complement these positive results by presenting three additional negative results relating to these problems.
△ Less
Submitted 26 June, 2023; v1 submitted 8 July, 2019;
originally announced July 2019.
-
A Fine-Grained Variant of the Hierarchy of Lasserre
Authors:
Wann-Jiun Ma,
Jakub Marecek,
Martin Mevissen
Abstract:
There has been much recent interest in hierarchies of progressively stronger convexifications of polynomial optimisation problems (POP). These often converge to the global optimum of the POP, asymptotically, but prove challenging to solve beyond the first level in the hierarchy for modest instances. We present a finer-grained variant of the Lasserre hierarchy, together with first-order methods for…
▽ More
There has been much recent interest in hierarchies of progressively stronger convexifications of polynomial optimisation problems (POP). These often converge to the global optimum of the POP, asymptotically, but prove challenging to solve beyond the first level in the hierarchy for modest instances. We present a finer-grained variant of the Lasserre hierarchy, together with first-order methods for solving the convexifications, which allow for efficient warm-starting with solutions from lower levels in the hierarchy.
△ Less
Submitted 23 June, 2019;
originally announced June 2019.
-
Hierarchical and Safe Motion Control for Cooperative Locomotion of Robotic Guide Dogs and Humans: A Hybrid Systems Approach
Authors:
Kaveh Akbari Hamed,
Vinay R. Kamidi,
Wen-Loong Ma,
Alexander Leonessa,
Aaron D. Ames
Abstract:
This paper presents a hierarchical control strategy based on hybrid systems theory, nonlinear control, and safety-critical systems to enable cooperative locomotion of robotic guide dogs and visually impaired people. We address high-dimensional and complex hybrid dynamical models that represent collaborative locomotion. At the high level of the control scheme, local and nonlinear baseline controlle…
▽ More
This paper presents a hierarchical control strategy based on hybrid systems theory, nonlinear control, and safety-critical systems to enable cooperative locomotion of robotic guide dogs and visually impaired people. We address high-dimensional and complex hybrid dynamical models that represent collaborative locomotion. At the high level of the control scheme, local and nonlinear baseline controllers, based on the virtual constraints approach, are designed to induce exponentially stable dynamic gaits. The baseline controller for the leash is assumed to be a nonlinear controller that keeps the human in a safe distance from the dog while following it. At the lower level, a real-time quadratic programming (QP) is solved for modifying the baseline controllers of the robot as well as the leash to avoid obstacles. In particular, the QP framework is set up based on control barrier functions (CBFs) to compute optimal control inputs that guarantee safety while being close to the baseline controllers. The stability of the complex periodic gaits is investigated through the Poincare return map. To demonstrate the power of the analytical foundation, the control algorithms are transferred into an extensive numerical simulation of a complex model that represents cooperative locomotion of a quadrupedal robot, referred to as Vision 60, and a human model. The complex model has 16 continuous-time domains with 60 state variables and 20 control inputs.
△ Less
Submitted 5 April, 2019;
originally announced April 2019.