-
The anomalous magnetic moment of the muon in the Standard Model: an update
Authors:
R. Aliberti,
T. Aoyama,
E. Balzani,
A. Bashir,
G. Benton,
J. Bijnens,
V. Biloshytskyi,
T. Blum,
D. Boito,
M. Bruno,
E. Budassi,
S. Burri,
L. Cappiello,
C. M. Carloni Calame,
M. Cè,
V. Cirigliano,
D. A. Clarke,
G. Colangelo,
L. Cotrozzi,
M. Cottini,
I. Danilkin,
M. Davier,
M. Della Morte,
A. Denig,
C. DeTar
, et al. (206 additional authors not shown)
Abstract:
We present the current Standard Model (SM) prediction for the muon anomalous magnetic moment, $a_μ$, updating the first White Paper (WP20) [1]. The pure QED and electroweak contributions have been further consolidated, while hadronic contributions continue to be responsible for the bulk of the uncertainty of the SM prediction. Significant progress has been achieved in the hadronic light-by-light s…
▽ More
We present the current Standard Model (SM) prediction for the muon anomalous magnetic moment, $a_μ$, updating the first White Paper (WP20) [1]. The pure QED and electroweak contributions have been further consolidated, while hadronic contributions continue to be responsible for the bulk of the uncertainty of the SM prediction. Significant progress has been achieved in the hadronic light-by-light scattering contribution using both the data-driven dispersive approach as well as lattice-QCD calculations, leading to a reduction of the uncertainty by almost a factor of two. The most important development since WP20 is the change in the estimate of the leading-order hadronic-vacuum-polarization (LO HVP) contribution. A new measurement of the $e^+e^-\toπ^+π^-$ cross section by CMD-3 has increased the tensions among data-driven dispersive evaluations of the LO HVP contribution to a level that makes it impossible to combine the results in a meaningful way. At the same time, the attainable precision of lattice-QCD calculations has increased substantially and allows for a consolidated lattice-QCD average of the LO HVP contribution with a precision of about 0.9\%. Adopting the latter in this update has resulted in a major upward shift of the total SM prediction, which now reads $a_μ^\text{SM} = 116\,592\,033(62)\times 10^{-11}$ (530 ppb). When compared against the current experimental average based on the E821 experiment and runs 1-3 of E989 at Fermilab, one finds $a_μ^\text{exp} - a_μ^\text{SM} =26(66)\times 10^{-11}$, which implies that there is no tension between the SM and experiment at the current level of precision. The final precision of E989 is expected to be around 140 ppb, which is the target of future efforts by the Theory Initiative. The resolution of the tensions among data-driven dispersive evaluations of the LO HVP contribution will be a key element in this endeavor.
△ Less
Submitted 27 May, 2025;
originally announced May 2025.
-
Improved evaluation of the electroweak contribution to muon $g-2$
Authors:
Martin Hoferichter,
Jan Lüdtke,
Luca Naterop,
Massimiliano Procura,
Peter Stoffer
Abstract:
A precise evaluation of the electroweak contribution to the anomalous magnetic moment of the muon requires control over all aspects of the Standard Model, ranging from Higgs physics, over multi-loop computations for bosonic and (heavy-)fermion diagrams, to non-perturbative effects in the presence of light quarks. Currently, the dominant uncertainties arise from such hadronic effects in the vector-…
▽ More
A precise evaluation of the electroweak contribution to the anomalous magnetic moment of the muon requires control over all aspects of the Standard Model, ranging from Higgs physics, over multi-loop computations for bosonic and (heavy-)fermion diagrams, to non-perturbative effects in the presence of light quarks. Currently, the dominant uncertainties arise from such hadronic effects in the vector-vector-axial-vector three-point function, an improved understanding of which has recently emerged in the context of hadronic light-by-light scattering. Profiting from these developments as well as new perturbative and non-perturbative input for the charm contribution, we obtain $a_μ^\text{EW}=154.4(4)\times 10^{-11}$.
△ Less
Submitted 20 May, 2025; v1 submitted 6 March, 2025;
originally announced March 2025.
-
Accelerating Benders decomposition for solving a sequence of sample average approximation replications
Authors:
Harshit Kothari,
James R. Luedtke
Abstract:
Sample average approximation (SAA) is a technique for obtaining approximate solutions to stochastic programs that uses the average from a random sample to approximate the expected value that is being optimized. Since the outcome from solving an SAA is random, statistical estimates on the optimal value of the true problem can be obtained by solving multiple SAA replications with independent samples…
▽ More
Sample average approximation (SAA) is a technique for obtaining approximate solutions to stochastic programs that uses the average from a random sample to approximate the expected value that is being optimized. Since the outcome from solving an SAA is random, statistical estimates on the optimal value of the true problem can be obtained by solving multiple SAA replications with independent samples. We study techniques to accelerate the solution of this set of SAA replications, when solving them sequentially via Benders decomposition. We investigate how to exploit similarities in the problem structure, as the replications just differ in the realizations of the random samples. Our extensive computational experiments provide empirical evidence that our techniques for using information from solving previous replications can significantly reduce the solution time of later replications.
△ Less
Submitted 13 November, 2024;
originally announced November 2024.
-
Dispersion relations for the hadronic VVA correlator
Authors:
Jan Lüdtke,
Massimiliano Procura,
Peter Stoffer
Abstract:
We derive two types of dispersion relations for the hadronic vector-vector-axial-vector (VVA) correlator: one in generic three-point kinematics with fixed photon virtualities, the second in the kinematic limit of one soft photon. The VVA correlator enters in the electroweak contribution to the muon anomalous magnetic moment and it also emerges as the leading term in the operator-product expansion…
▽ More
We derive two types of dispersion relations for the hadronic vector-vector-axial-vector (VVA) correlator: one in generic three-point kinematics with fixed photon virtualities, the second in the kinematic limit of one soft photon. The VVA correlator enters in the electroweak contribution to the muon anomalous magnetic moment and it also emerges as the leading term in the operator-product expansion for hadronic light-by-light scattering in the limit of two large photon momenta. Previously, this correlator was only described in terms of hadronic models. Our new dispersive treatments are in analogy to the established and the newly proposed dispersive approaches to hadronic light-by-light. The VVA correlator allows us to investigate the relation between the two types of dispersion relations in a simpler example and to elucidate the reshuffling of different hadronic contributions in this comparison. As a byproduct, we reduce the theoretical uncertainties on the first-family VVA contribution to the muon anomalous magnetic moment by combining the dispersive representation with both asymptotic and low-energy constraints.
△ Less
Submitted 22 April, 2025; v1 submitted 15 October, 2024;
originally announced October 2024.
-
Probing-Enhanced Stochastic Programming
Authors:
Zhichao Ma,
Youngdae Kim,
Jeff Linderoth,
James R. Luedtke,
Logan R. Matthews
Abstract:
We consider a two-stage stochastic decision problem where the decision-maker has the opportunity to obtain information about the distribution of the random variables $ξ$ that appear in the problem through a set of discrete actions that we refer to as \emph{probing}. Probing components of a random vector $η$ that is jointly-distributed with $ξ$ allows the decision-maker to learn about the condition…
▽ More
We consider a two-stage stochastic decision problem where the decision-maker has the opportunity to obtain information about the distribution of the random variables $ξ$ that appear in the problem through a set of discrete actions that we refer to as \emph{probing}. Probing components of a random vector $η$ that is jointly-distributed with $ξ$ allows the decision-maker to learn about the conditional distribution of $ξ$ given the observed components of $η$. We propose a three-stage optimization model for this problem, where in the first stage some components of $η$ are chosen to be observed, and decisions in subsequent stages must be consistent with the obtained information. In the case that $η$ and $ξ$ have finite support, Goel and Grossmann gave a mixed-integer programming (MIP) formulation of this problem whose size is proportional to the square of cardinality of the sample space of the random variables. We propose to solve the model using bounds obtained from an information-based relaxation, combined with a branching scheme that enforces the consistency of decisions with observed information. The branch-and-bound approach can naturally be combined with sampling in order to estimate both lower and upper bounds on the optimal solution value and does not require $η$ or $ξ$ to have finite support. We conduct a computational study of our method on instances of a stochastic facility location and sizing problem with the option to probe customers to learn about their demands before building facilities. We find that on instances with finite support, our approach scales significantly better than the MIP formulation and also demonstrate that our method can compute statistical bounds on instances with continuous distributions that improve upon the perfect information bounds.
△ Less
Submitted 15 July, 2024;
originally announced July 2024.
-
A Framework for Balancing Power Grid Efficiency and Risk with Bi-objective Stochastic Integer Optimization
Authors:
Ramsey Rossmann,
Mihai Anitescu,
Julie Bessac,
Michael Ferris,
Mitchell Krock,
James Luedtke,
Line Roald
Abstract:
Power grid expansion planning requires making large investment decisions in the present that will impact the future cost and reliability of a system exposed to wide-ranging uncertainties. Extreme temperatures can pose significant challenges to providing power by increasing demand and decreasing supply and have contributed to recent major power outages. We propose to address a modeling challenge of…
▽ More
Power grid expansion planning requires making large investment decisions in the present that will impact the future cost and reliability of a system exposed to wide-ranging uncertainties. Extreme temperatures can pose significant challenges to providing power by increasing demand and decreasing supply and have contributed to recent major power outages. We propose to address a modeling challenge of such high-impact, low-frequency events with a bi-objective stochastic integer optimization model that finds solutions with different trade-offs between efficiency in normal conditions and risk to extreme events. We propose a conditional sampling approach paired with a risk measure to address the inherent challenge in approximating the risk of low-frequency events within a sampling based approach. We present a model for spatially correlated, county-specific temperatures and a method to generate both unconditional and conditionally extreme temperature samples from this model efficiently. These models are investigated within an extensive case study with realistic data that demonstrates the effectiveness of the bi-objective approach and the conditional sampling technique. We find that spatial correlations in the temperature samples are essential to finding good solutions and that modeling generator temperature dependence is an important consideration for finding efficient, low-risk solutions.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Community recommendations on cryoEM data archiving and validation
Authors:
Gerard J. Kleywegt,
Paul D. Adams,
Sarah J. Butcher,
Cathy Lawson,
Alexis Rohou,
Peter B. Rosenthal,
Sriram Subramaniam,
Maya Topf,
Sanja Abbott,
Philip R. Baldwin,
John M. Berrisford,
Gérard Bricogne,
Preeti Choudhary,
Tristan I. Croll,
Radostin Danev,
Sai J. Ganesan,
Timothy Grant,
Aleksandras Gutmanas,
Richard Henderson,
J. Bernard Heymann,
Juha T. Huiskonen,
Andrei Istrate,
Takayuki Kato,
Gabriel C. Lander,
Shee-Mei Lok
, et al. (22 additional authors not shown)
Abstract:
In January 2020, a workshop was held at EMBL-EBI (Hinxton, UK) to discuss data requirements for deposition and validation of cryoEM structures, with a focus on single-particle analysis. The meeting was attended by 45 experts in data processing, model building and refinement, validation, and archiving of such structures. This report describes the workshop's motivation and history, the topics discus…
▽ More
In January 2020, a workshop was held at EMBL-EBI (Hinxton, UK) to discuss data requirements for deposition and validation of cryoEM structures, with a focus on single-particle analysis. The meeting was attended by 45 experts in data processing, model building and refinement, validation, and archiving of such structures. This report describes the workshop's motivation and history, the topics discussed, and consensus recommendations resulting from the workshop. Some challenges for future methods-development efforts in this area are also highlighted, as is the implementation to date of some of the recommendations.
△ Less
Submitted 2 February, 2024; v1 submitted 29 November, 2023;
originally announced November 2023.
-
An Integer Programming Approach To Subspace Clustering With Missing Data
Authors:
Akhilesh Soni,
Jeff Linderoth,
Jim Luedtke,
Daniel Pimentel-Alarcon
Abstract:
In the Subspace Clustering with Missing Data (SCMD) problem, we are given a collection of n partially observed d-dimensional vectors. The data points are assumed to be concentrated near a union of low-dimensional subspaces. The goal of SCMD is to cluster the vectors according to their subspace membership and recover the underlying basis, which can then be used to infer their missing entries. State…
▽ More
In the Subspace Clustering with Missing Data (SCMD) problem, we are given a collection of n partially observed d-dimensional vectors. The data points are assumed to be concentrated near a union of low-dimensional subspaces. The goal of SCMD is to cluster the vectors according to their subspace membership and recover the underlying basis, which can then be used to infer their missing entries. State-of-the-art algorithms for SCMD can fail on instances with a high proportion of missing data, full-rank data, or if the underlying subspaces are similar to each other. We propose a novel integer programming approach for SCMD. The approach is based on dynamically determining a set of candidate subspaces and optimally assigning points to selected subspaces. The problem structure is identical to the classical facility-location problem, with subspaces playing the role of facilities and data points that of customers. We propose a column-generation approach for identifying candidate subspaces combined with a Benders decomposition approach for solving the linear programming relaxation of the formulation. An empirical study demonstrates that the proposed approach can achieve better clustering accuracy than state-of-the-art methods when the data is high-rank, the percentage of missing data is high, or the subspaces are similar.
△ Less
Submitted 26 September, 2023;
originally announced September 2023.
-
Dispersion relations for hadronic light-by-light scattering in triangle kinematics
Authors:
Jan Lüdtke,
Massimiliano Procura,
Peter Stoffer
Abstract:
We present a new strategy for the dispersive evaluation of the hadronic light-by-light contribution to the anomalous magnetic moment of the muon $a_μ$. The new approach directly applies in the kinematic limit relevant for $a_μ$: one of the photons is treated as an external electromagnetic field with vanishing momentum, so that the kinematics corresponds to a triangle. We derive expressions for the…
▽ More
We present a new strategy for the dispersive evaluation of the hadronic light-by-light contribution to the anomalous magnetic moment of the muon $a_μ$. The new approach directly applies in the kinematic limit relevant for $a_μ$: one of the photons is treated as an external electromagnetic field with vanishing momentum, so that the kinematics corresponds to a triangle. We derive expressions for the relevant single-particle intermediate states, as well as the tensor decompositions of the two-pion sub-processes that appear in addition to those needed in the established dispersive approach. The existing approach is based on a set of dispersion relations for the hadronic light-by-light tensor in four-point kinematics. At present it is not known how to consistently include in this framework resonant intermediate states of spin 2 or larger, due to the appearance of kinematic singularities that can be traced back to the redundancy of the tensor decomposition. We show that our new approach circumvents this problem and enables dispersion relations in the limit of triangle kinematics that are manifestly free from kinematic singularities, paving the way towards a data-driven evaluation of all relevant exclusive hadronic intermediate states.
△ Less
Submitted 27 April, 2023; v1 submitted 23 February, 2023;
originally announced February 2023.
-
Stochastic Dynamic Lot-sizing with Supplier-Driven Substitution and Service Level Constraints
Authors:
Narges Sereshti,
Merve Bodur,
James R. Luedtke
Abstract:
We consider a multi-stage stochastic lot-sizing problem with service level constraints and supplier-driven product substitution. A firm has multiple products and it has the option to meet demand from substitutable products at a cost. Considering the uncertainty in future demands, the firm wishes to make ordering decisions in every period such that the probability that all demands can be met in the…
▽ More
We consider a multi-stage stochastic lot-sizing problem with service level constraints and supplier-driven product substitution. A firm has multiple products and it has the option to meet demand from substitutable products at a cost. Considering the uncertainty in future demands, the firm wishes to make ordering decisions in every period such that the probability that all demands can be met in the next period meets or exceeds a minimum service level. We propose a rolling-horizon policy in which a two-stage joint chance-constrained stochastic program is solved to make decisions in each time period. We demonstrate how to effectively solve this formulation. In addition, we propose two policies based on deterministic approximations. We demonstrate that the proposed chance-constraint policy can achieve the service levels more reliably and at a lower cost. We also explore the value of product substitution in this model, demonstrating that the substitution option allows achieving service levels while reducing costs by 7% to 25% in our experiments, and that the majority of the benefit can be obtained with limited levels of substitution allowed.
△ Less
Submitted 31 December, 2022;
originally announced January 2023.
-
Data-Driven Sample Average Approximation with Covariate Information
Authors:
Rohit Kannan,
Güzin Bayraksan,
James R. Luedtke
Abstract:
We study optimization for data-driven decision-making when we have observations of the uncertain parameters within the optimization model together with concurrent observations of covariates. Given a new covariate observation, the goal is to choose a decision that minimizes the expected cost conditioned on this observation. We investigate three data-driven frameworks that integrate a machine learni…
▽ More
We study optimization for data-driven decision-making when we have observations of the uncertain parameters within the optimization model together with concurrent observations of covariates. Given a new covariate observation, the goal is to choose a decision that minimizes the expected cost conditioned on this observation. We investigate three data-driven frameworks that integrate a machine learning prediction model within a stochastic programming sample average approximation (SAA) for approximating the solution to this problem. Two of the SAA frameworks are new and use out-of-sample residuals of leave-one-out prediction models for scenario generation. The frameworks we investigate are flexible and accommodate parametric, nonparametric, and semiparametric regression techniques. We derive conditions on the data generation process, the prediction model, and the stochastic program under which solutions of these data-driven SAAs are consistent and asymptotically optimal, and also derive convergence rates and finite sample guarantees. Computational experiments validate our theoretical results, demonstrate the potential advantages of our data-driven formulations over existing approaches (even when the prediction model is misspecified), and illustrate the benefits of our new data-driven formulations in the limited data regime.
△ Less
Submitted 27 July, 2022;
originally announced July 2022.
-
Prospects for precise predictions of $a_μ$ in the Standard Model
Authors:
G. Colangelo,
M. Davier,
A. X. El-Khadra,
M. Hoferichter,
C. Lehner,
L. Lellouch,
T. Mibe,
B. L. Roberts,
T. Teubner,
H. Wittig,
B. Ananthanarayan,
A. Bashir,
J. Bijnens,
T. Blum,
P. Boyle,
N. Bray-Ali,
I. Caprini,
C. M. Carloni Calame,
O. Catà,
M. Cè,
J. Charles,
N. H. Christ,
F. Curciarello,
I. Danilkin,
D. Das
, et al. (57 additional authors not shown)
Abstract:
We discuss the prospects for improving the precision on the hadronic corrections to the anomalous magnetic moment of the muon, and the plans of the Muon $g-2$ Theory Initiative to update the Standard Model prediction.
We discuss the prospects for improving the precision on the hadronic corrections to the anomalous magnetic moment of the muon, and the plans of the Muon $g-2$ Theory Initiative to update the Standard Model prediction.
△ Less
Submitted 29 March, 2022;
originally announced March 2022.
-
Sparse multi-term disjunctive cuts for the epigraph of a function of binary variables
Authors:
Rui Chen,
James Luedtke
Abstract:
We propose a new method for separating valid inequalities for the epigraph of a function of binary variables. The proposed inequalities are disjunctive cuts defined by disjunctive terms obtained by enumerating a subset $I$ of the binary variables. We show that by restricting the support of the cut to the same set of variables $I$, a cut can be obtained by solving a linear program with $2^{|I|}$ co…
▽ More
We propose a new method for separating valid inequalities for the epigraph of a function of binary variables. The proposed inequalities are disjunctive cuts defined by disjunctive terms obtained by enumerating a subset $I$ of the binary variables. We show that by restricting the support of the cut to the same set of variables $I$, a cut can be obtained by solving a linear program with $2^{|I|}$ constraints. While this limits the size of the set $I$ used to define the multi-term disjunction, the procedure enables generation of multi-term disjunctive cuts using far more terms than existing approaches. We present two approaches for choosing the subset of variables. Experience on three MILP problems with block diagonal structure using $|I|$ up to size 10 indicates the sparse cuts can often close nearly as much gap as the multi-term disjunctive cuts without this restriction and in a fraction of the time. We also find that including these cuts within a cut-and-branch solution method for these MILP problems leads to significant reductions in solution time or ending optimality gap for instances that were not solved within the time limit. Finally, we describe how the proposed approach can be adapted to optimally "tilt" a given valid inequality by modifying the coefficients of a sparse subset of the variables.
△ Less
Submitted 9 July, 2022; v1 submitted 15 November, 2021;
originally announced November 2021.
-
On Generating Lagrangian Cuts for Two-Stage Stochastic Integer Programs
Authors:
Rui Chen,
James Luedtke
Abstract:
We investigate new methods for generating Lagrangian cuts to solve two-stage stochastic integer programs. Lagrangian cuts can be added to a Benders reformulation, and are derived from solving single scenario integer programming subproblems identical to those used in the nonanticipative Lagrangian dual of a stochastic integer program. While Lagrangian cuts have the potential to significantly streng…
▽ More
We investigate new methods for generating Lagrangian cuts to solve two-stage stochastic integer programs. Lagrangian cuts can be added to a Benders reformulation, and are derived from solving single scenario integer programming subproblems identical to those used in the nonanticipative Lagrangian dual of a stochastic integer program. While Lagrangian cuts have the potential to significantly strengthen the Benders relaxation, generating Lagrangian cuts can be computationally demanding. We investigate new techniques for generating Lagrangian cuts with the goal of obtaining methods that provide significant improvements to the Benders relaxation quickly. Computational results demonstrate that our proposed method improves the Benders relaxation significantly faster than previous methods for generating Lagrangian cuts and, when used within a branch-and-cut algorithm, significantly reduces the size of the search tree for three classes of test problems.
△ Less
Submitted 20 January, 2022; v1 submitted 7 June, 2021;
originally announced June 2021.
-
Improved Standard-Model prediction for $π^0\to e^+e^-$
Authors:
Martin Hoferichter,
Bai-Long Hoid,
Bastian Kubis,
Jan Lüdtke
Abstract:
We present an improved Standard-Model (SM) prediction for the dilepton decay of the neutral pion. The loop amplitude is determined by the pion transition form factor for $π^0\toγ^*γ^*$, for which we employ a dispersive representation that incorporates both space-like and time-like data as well as short-distance constraints. The resulting SM branching fraction,…
▽ More
We present an improved Standard-Model (SM) prediction for the dilepton decay of the neutral pion. The loop amplitude is determined by the pion transition form factor for $π^0\toγ^*γ^*$, for which we employ a dispersive representation that incorporates both space-like and time-like data as well as short-distance constraints. The resulting SM branching fraction, $\text{BR}[π^0\to e^+e^-]=6.25(3)\times 10^{-8}$ , sharpens constraints on physics beyond the SM, including pseudoscalar and axial-vector mediators.
△ Less
Submitted 29 April, 2022; v1 submitted 10 May, 2021;
originally announced May 2021.
-
Heteroscedasticity-aware residuals-based contextual stochastic optimization
Authors:
Rohit Kannan,
Güzin Bayraksan,
James Luedtke
Abstract:
We explore generalizations of some integrated learning and optimization frameworks for data-driven contextual stochastic optimization that can adapt to heteroscedasticity. We identify conditions on the stochastic program, data generation process, and the prediction setup under which these generalizations possess asymptotic and finite sample guarantees for a class of stochastic programs, including…
▽ More
We explore generalizations of some integrated learning and optimization frameworks for data-driven contextual stochastic optimization that can adapt to heteroscedasticity. We identify conditions on the stochastic program, data generation process, and the prediction setup under which these generalizations possess asymptotic and finite sample guarantees for a class of stochastic programs, including two-stage stochastic mixed-integer programs with continuous recourse. We verify that our assumptions hold for popular parametric and nonparametric regression methods.
△ Less
Submitted 8 January, 2021;
originally announced January 2021.
-
Residuals-based distributionally robust optimization with covariate information
Authors:
Rohit Kannan,
Güzin Bayraksan,
James R. Luedtke
Abstract:
We consider data-driven approaches that integrate a machine learning prediction model within distributionally robust optimization (DRO) given limited joint observations of uncertain parameters and covariates. Our framework is flexible in the sense that it can accommodate a variety of regression setups and DRO ambiguity sets. We investigate asymptotic and finite sample properties of solutions obtai…
▽ More
We consider data-driven approaches that integrate a machine learning prediction model within distributionally robust optimization (DRO) given limited joint observations of uncertain parameters and covariates. Our framework is flexible in the sense that it can accommodate a variety of regression setups and DRO ambiguity sets. We investigate asymptotic and finite sample properties of solutions obtained using Wasserstein, sample robust optimization, and phi-divergence-based ambiguity sets within our DRO formulations, and explore cross-validation approaches for sizing these ambiguity sets. Through numerical experiments, we validate our theoretical results, study the effectiveness of our approaches for sizing ambiguity sets, and illustrate the benefits of our DRO formulations in the limited data regime even when the prediction model is misspecified.
△ Less
Submitted 25 May, 2022; v1 submitted 2 December, 2020;
originally announced December 2020.
-
Effects of Longitudinal Short-Distance Constraints on the Hadronic Light-by-Light Contribution to the Muon $g-2$
Authors:
Jan Lüdtke,
Massimiliano Procura
Abstract:
We present a model-independent method to estimate the effects of short-distance constraints (SDCs) on the hadronic light-by-light contribution to the muon anomalous magnetic moment $a_μ^\text{HLbL}$. The relevant loop integral is evaluated using multi-parameter families of interpolation functions, which satisfy by construction all constraints derived from general principles and smoothly connect th…
▽ More
We present a model-independent method to estimate the effects of short-distance constraints (SDCs) on the hadronic light-by-light contribution to the muon anomalous magnetic moment $a_μ^\text{HLbL}$. The relevant loop integral is evaluated using multi-parameter families of interpolation functions, which satisfy by construction all constraints derived from general principles and smoothly connect the low-energy region with those where either two or all three independent photon virtualities become large. In agreement with other recent model-based analyses, we find that the SDCs and thus the infinite towers of heavy intermediate states that are responsible for saturating them have a rather small effect on $a_μ^\text{HLbL}$. Taking as input the known ground-state pseudoscalar pole contributions, we obtain that the longitudinal SDCs increase $a_μ^\text{HLbL}$ by $(9.1\pm 5.0) \times 10^{-11}$, where the isovector channel is responsible for $(2.6\pm 1.5) \times 10^{-11}$. More precise estimates can be obtained with our method as soon as further accurate, model-independent information about important low-energy contributions from hadronic states with masses up to 1-2 GeV become available.
△ Less
Submitted 23 December, 2020; v1 submitted 29 May, 2020;
originally announced June 2020.
-
Lagrangian Dual Decision Rules for Multistage Stochastic Mixed Integer Programming
Authors:
Maryam Daryalal,
Merve Bodur,
James R. Luedtke
Abstract:
Multistage stochastic programs can be approximated by restricting policies to follow decision rules. Directly applying this idea to problems with integer decisions is difficult because of the need for decision rules that lead to integral decisions. In this work, we introduce Lagrangian dual decision rules (LDDRs) for multistage stochastic mixed-integer programming (MSMIP) which overcome this diffi…
▽ More
Multistage stochastic programs can be approximated by restricting policies to follow decision rules. Directly applying this idea to problems with integer decisions is difficult because of the need for decision rules that lead to integral decisions. In this work, we introduce Lagrangian dual decision rules (LDDRs) for multistage stochastic mixed-integer programming (MSMIP) which overcome this difficulty by applying decision rules in a Lagrangian dual of the MSMIP. We propose two new bounding techniques based on stagewise (SW) and nonanticipative (NA) Lagrangian duals where the Lagrangian multiplier policies are restricted by LDDRs. We demonstrate how the solutions from these duals can be used to drive primal policies. Our proposal requires fewer assumptions than most existing MSMIP methods. We compare the theoretical strength of the restricted duals and show that the restricted NA dual can provide relaxation bounds at least as good as the ones obtained by the restricted SW dual. In our numerical study on two problem classes, one traditional and one novel, we observe that the proposed LDDR approaches yield significant optimality gap reductions compared to existing general-purpose bounding methods for MSMIP problems.
△ Less
Submitted 15 August, 2022; v1 submitted 3 January, 2020;
originally announced January 2020.
-
On sample average approximation for two-stage stochastic programs without relatively complete recourse
Authors:
Rui Chen,
James Luedtke
Abstract:
We investigate sample average approximation (SAA) for two-stage stochastic programs without relatively complete recourse, i.e., for problems in which there are first-stage feasible solutions that are not guaranteed to have a feasible recourse action. As a feasibility measure of the SAA solution, we consider the "recourse likelihood", which is the probability that the solution has a feasible recour…
▽ More
We investigate sample average approximation (SAA) for two-stage stochastic programs without relatively complete recourse, i.e., for problems in which there are first-stage feasible solutions that are not guaranteed to have a feasible recourse action. As a feasibility measure of the SAA solution, we consider the "recourse likelihood", which is the probability that the solution has a feasible recourse action. For $ε\in (0,1)$, we demonstrate that the probability that a SAA solution has recourse likelihood below $1-ε$ converges to zero exponentially fast with the sample size. Next, we analyze the rate of convergence of optimal solutions of the SAA to optimal solutions of the true problem for problems with a finite feasible region, such as bounded integer programming problems. For problems with non-finite feasible region, we propose modified "padded" SAA problems and demonstrate in two cases that such problems can yield, with high confidence, solutions that are certain to have a feasible recourse decision. Finally, we conduct a numerical study on a two-stage resource planning problem that illustrates the results, and also suggests there may be room for improvement in some of the theoretical analysis.
△ Less
Submitted 30 November, 2021; v1 submitted 30 December, 2019;
originally announced December 2019.
-
Stochastic DC Optimal Power Flow With Reserve Saturation
Authors:
Rohit Kannan,
James R. Luedtke,
Line A. Roald
Abstract:
We propose an optimization framework for stochastic optimal power flow with uncertain loads and renewable generator capacity. Our model follows previous work in assuming that generator outputs respond to load imbalances according to an affine control policy, but introduces a model of saturation of generator reserves by assuming that when a generator's target level hits its limit, it abandons the a…
▽ More
We propose an optimization framework for stochastic optimal power flow with uncertain loads and renewable generator capacity. Our model follows previous work in assuming that generator outputs respond to load imbalances according to an affine control policy, but introduces a model of saturation of generator reserves by assuming that when a generator's target level hits its limit, it abandons the affine policy and produces at that limit. This is a particularly interesting feature in models where wind power plants, which have uncertain upper generation limits, are scheduled to provide reserves to balance load fluctuations. The resulting model is a nonsmooth nonconvex two-stage stochastic program, and we use a stochastic approximation method to find stationary solutions to a smooth approximation. Computational results on 6-bus and 118-bus test instances demonstrates that by considering the effects of saturation, our model can yield solutions with lower expected generation costs (at the same target line violation probability level) than those obtained from a model that enforces the affine policy to stay within generator limits with high probability.
△ Less
Submitted 10 October, 2019;
originally announced October 2019.
-
Solving Chance-Constrained Problems via a Smooth Sample-Based Nonlinear Approximation
Authors:
Alejandra Peña-Ordieres,
James R. Luedtke,
Andreas Wächter
Abstract:
We introduce a new method for solving nonlinear continuous optimization problems with chance constraints. Our method is based on a reformulation of the probabilistic constraint as a quantile function. The quantile function is approximated via a differentiable sample average approximation. We provide theoretical statistical guarantees of the approximation, and illustrate empirically that the reform…
▽ More
We introduce a new method for solving nonlinear continuous optimization problems with chance constraints. Our method is based on a reformulation of the probabilistic constraint as a quantile function. The quantile function is approximated via a differentiable sample average approximation. We provide theoretical statistical guarantees of the approximation, and illustrate empirically that the reformulation can be directly used by standard nonlinear optimization solvers in the case of single chance constraints. Furthermore, we propose an S$\ell_1$QP-type trust-region method to solve instances with joint chance constraints. We demonstrate the performance of the method on several problems, and show that it scales well with the sample size and that the smoothing can be used to counteract the bias in the chance constraint approximation induced by the sample approximation.
△ Less
Submitted 14 March, 2020; v1 submitted 17 May, 2019;
originally announced May 2019.
-
A complete data processing workflow for CryoET and subtomogram averaging
Authors:
Muyuan Chen,
James M. Bell,
Xiaodong Shi,
Stella Y. Sun,
Zhao Wang,
Steven J. Ludtke
Abstract:
Electron cryotomography (CryoET) is currently the only method capable of visualizing cells in 3D at nanometer resolutions. While modern instruments produce massive amounts of tomography data containing extremely rich structural information, the data processing is very labor intensive and results are often limited by the skills of the personnel rather than the data. We present an integrated workflo…
▽ More
Electron cryotomography (CryoET) is currently the only method capable of visualizing cells in 3D at nanometer resolutions. While modern instruments produce massive amounts of tomography data containing extremely rich structural information, the data processing is very labor intensive and results are often limited by the skills of the personnel rather than the data. We present an integrated workflow that covers the entire tomography data processing pipeline, from automated tilt series alignment to subnanometer resolution subtomogram averaging. This workflow greatly reduces human effort and increases throughput, and is capable of determining protein structures at state-of-the-art resolutions for both purified macromolecules and cells.
△ Less
Submitted 11 February, 2019;
originally announced February 2019.
-
Intersection disjunctions for reverse convex sets
Authors:
Eli Towle,
James Luedtke
Abstract:
We present a framework to obtain valid inequalities for a reverse convex set: the set of points in a polyhedron that lie outside a given open convex set. Reverse convex sets arise in many models, including bilevel optimization and polynomial optimization. An intersection cut is a well-known valid inequality for a reverse convex set that is generated from a basic solution that lies within the conve…
▽ More
We present a framework to obtain valid inequalities for a reverse convex set: the set of points in a polyhedron that lie outside a given open convex set. Reverse convex sets arise in many models, including bilevel optimization and polynomial optimization. An intersection cut is a well-known valid inequality for a reverse convex set that is generated from a basic solution that lies within the convex set. We introduce a framework for deriving valid inequalities for the reverse convex set from basic solutions that lie outside the convex set. We first propose an extension to intersection cuts that defines a two-term disjunction for a reverse convex set, which we refer to as an intersection disjunction. Next, we generalize this analysis to a multi-term disjunction by considering the convex set's recession directions. These disjunctions can be used in a cut-generating linear program to obtain valid inequalities for the reverse convex set.
△ Less
Submitted 1 December, 2020; v1 submitted 7 January, 2019;
originally announced January 2019.
-
A stochastic approximation method for approximating the efficient frontier of chance-constrained nonlinear programs
Authors:
Rohit Kannan,
James Luedtke
Abstract:
We propose a stochastic approximation method for approximating the efficient frontier of chance-constrained nonlinear programs. Our approach is based on a bi-objective viewpoint of chance-constrained programs that seeks solutions on the efficient frontier of optimal objective value versus risk of constraint violation. To this end, we construct a reformulated problem whose objective is to minimize…
▽ More
We propose a stochastic approximation method for approximating the efficient frontier of chance-constrained nonlinear programs. Our approach is based on a bi-objective viewpoint of chance-constrained programs that seeks solutions on the efficient frontier of optimal objective value versus risk of constraint violation. To this end, we construct a reformulated problem whose objective is to minimize the probability of constraints violation subject to deterministic convex constraints (which includes a bound on the objective function value). We adapt existing smoothing-based approaches for chance-constrained problems to derive a convergent sequence of smooth approximations of our reformulated problem, and apply a projected stochastic subgradient algorithm to solve it. In contrast with exterior sampling-based approaches (such as sample average approximation) that approximate the original chance-constrained program with one having finite support, our proposal converges to stationary solutions of a smooth approximation of the original problem, thereby avoiding poor local solutions that may be an artefact of a fixed sample. Our proposal also includes a tailored implementation of the smoothing-based approach that chooses key algorithmic parameters based on problem data. Computational results on four test problems from the literature indicate that our proposed approach can efficiently determine good approximations of the efficient frontier.
△ Less
Submitted 28 May, 2020; v1 submitted 17 December, 2018;
originally announced December 2018.
-
Strong Convex Nonlinear Relaxations of the Pooling Problem
Authors:
James Luedtke,
Claudia D'Ambrosio,
Jeff Linderoth,
Jonas Schweiger
Abstract:
We investigate new convex relaxations for the pooling problem, a classic nonconvex production planning problem in which input materials are mixed in intermediate pools, with the outputs of these pools further mixed to make output products meeting given attribute percentage requirements. Our relaxations are derived by considering a set which arises from the formulation by considering a single produ…
▽ More
We investigate new convex relaxations for the pooling problem, a classic nonconvex production planning problem in which input materials are mixed in intermediate pools, with the outputs of these pools further mixed to make output products meeting given attribute percentage requirements. Our relaxations are derived by considering a set which arises from the formulation by considering a single product, a single attibute, and a single pool. The convex hull of the resulting nonconvex set is not polyhedral. We derive valid linear and convex nonlinear inequalities for the convex hull, and demonstrate that different subsets of these inequalities define the convex hull of the nonconvex set in three cases determined by the parameters of the set. Computational results on literature instances and newly created larger test instances demonstrate that the inequalities can significantly strengthen the convex relaxation of the pq-formulation of the pooling problem, which is the relaxation known to have the strongest bound.
△ Less
Submitted 7 March, 2018;
originally announced March 2018.
-
A Dual Approximate Dynamic Programming Approach to Multi-stage Stochastic Unit Commitment
Authors:
Jagdish Ramakrishnan,
James Luedtke
Abstract:
We study the multi-stage stochastic unit commitment problem in which commitment and generation decisions can be made and adjusted in each time period. We formulate this problem as a Markov decision process, which is "weakly-coupled" in the sense that if the demand constraint is relaxed, the problem decomposes into a separate, low-dimensional, Markov decision process for each generator. We demonstr…
▽ More
We study the multi-stage stochastic unit commitment problem in which commitment and generation decisions can be made and adjusted in each time period. We formulate this problem as a Markov decision process, which is "weakly-coupled" in the sense that if the demand constraint is relaxed, the problem decomposes into a separate, low-dimensional, Markov decision process for each generator. We demonstrate how the dual approximate dynamic programming method of Barty, Carpentier, and Girardeau (RAIRO Operations Research, 44:167-183, 2010) can be adapted to obtain bounds and a policy for this problem. Previous approaches have let the Lagrange multipliers depend only on time; this can result in weak lower bounds. Other approaches have let the multipliers depend on the entire history of past random observations; although this provides a strong lower bound, its ability to handle a large number of sample paths or scenarios is limited. We demonstrate how to bridge these approaches for the stochastic unit commitment problem by letting the multipliers depend on the current observed demand. This allows a good tradeoff between strong lower bounds and good scalability with the number of scenarios. We illustrate this approach numerically on a 168-stage stochastic unit commitment problem, including minimum uptime, downtime, and ramping constraints.
△ Less
Submitted 22 June, 2018; v1 submitted 7 January, 2018;
originally announced January 2018.
-
New solution approaches for the maximum-reliability stochastic network interdiction problem
Authors:
Eli Towle,
James Luedtke
Abstract:
We investigate methods to solve the maximum-reliability stochastic network interdiction problem (SNIP). In this problem, a defender interdicts arcs on a directed graph to minimize an attacker's probability of undetected traversal through the network. The attacker's origin and destination are unknown to the defender and assumed to be random. SNIP can be formulated as a stochastic mixed-integer prog…
▽ More
We investigate methods to solve the maximum-reliability stochastic network interdiction problem (SNIP). In this problem, a defender interdicts arcs on a directed graph to minimize an attacker's probability of undetected traversal through the network. The attacker's origin and destination are unknown to the defender and assumed to be random. SNIP can be formulated as a stochastic mixed-integer program via a deterministic equivalent formulation (DEF). As the size of this DEF makes it impractical for solving large instances, current approaches to solving SNIP rely on modifications of Benders decomposition. We present two new approaches to solve SNIP. First, we introduce a new DEF that is significantly more compact than the standard DEF. Second, we propose a new path-based formulation of SNIP. The number of constraints required to define this formulation grows exponentially with the size of the network, but the model can be solved via delayed constraint generation. We present valid inequalities for this path-based formulation which are dependent on the structure of the interdicted arc probabilities. We propose a branch-and-cut (BC) algorithm to solve this new SNIP formulation. Computational results demonstrate that directly solving the more compact SNIP formulation and this BC algorithm both provide an improvement over a state-of-the-art implementation of Benders decomposition for this problem.
△ Less
Submitted 4 June, 2018; v1 submitted 14 August, 2017;
originally announced August 2017.
-
Combining Progressive Hedging with a Frank-Wolfe Method to Compute Lagrangian Dual Bounds in Stochastic Mixed-Integer Programming
Authors:
Natashia Boland,
Jeffrey Christiansen,
Brian Dandurand,
Andrew Eberhard,
Jeff Linderoth,
James Luedtke
Abstract:
We present a new primal-dual algorithm for computing the value of the Lagrangian dual of a stochastic mixed-integer program (SMIP) formed by relaxing its nonanticipativity constraints. This dual is widely used in decomposition methods for the solution of SMIPs. The algorithm relies on the well-known progressive hedging method, but unlike previous progressive hedging approaches for SMIP, our algori…
▽ More
We present a new primal-dual algorithm for computing the value of the Lagrangian dual of a stochastic mixed-integer program (SMIP) formed by relaxing its nonanticipativity constraints. This dual is widely used in decomposition methods for the solution of SMIPs. The algorithm relies on the well-known progressive hedging method, but unlike previous progressive hedging approaches for SMIP, our algorithm can be shown to converge to the optimal Lagrangian dual value. The key improvement in the new algorithm is an inner loop of optimized linearization steps, similar to those taken in the classical Frank-Wolfe method. Numerical results demonstrate that our new algorithm empirically outperforms the standard implementation of progressive hedging for obtaining bounds in SMIP.
△ Less
Submitted 2 February, 2017;
originally announced February 2017.
-
Convolutional Neural Networks for Automated Annotation of Cellular Cryo-Electron Tomograms
Authors:
Muyuan Chen,
Wei Dai,
Ying Sun,
Darius Jonasch,
Cynthia Y He,
Michael F. Schmid,
Wah Chiu,
Steven J Ludtke
Abstract:
Cellular Electron Cryotomography (CryoET) offers the ability to look inside cells and observe macromolecules frozen in action. A primary challenge for this technique is identifying and extracting the molecular components within the crowded cellular environment. We introduce a method using neural networks to dramatically reduce the time and human effort required for subcellular annotation and featu…
▽ More
Cellular Electron Cryotomography (CryoET) offers the ability to look inside cells and observe macromolecules frozen in action. A primary challenge for this technique is identifying and extracting the molecular components within the crowded cellular environment. We introduce a method using neural networks to dramatically reduce the time and human effort required for subcellular annotation and feature extraction. Subsequent subtomogram classification and averaging yields in-situ structures of molecular components of interest.
△ Less
Submitted 11 June, 2017; v1 submitted 19 January, 2017;
originally announced January 2017.
-
Two-stage Linear Decision Rules for Multi-stage Stochastic Programming
Authors:
Merve Bodur,
James Luedtke
Abstract:
Multi-stage stochastic linear programs (MSLPs) are notoriously hard to solve in general. Linear decision rules (LDRs) yield an approximation of an MSLP by restricting the decisions at each stage to be an affine function of the observed uncertain parameters. Finding an optimal LDR is a static optimization problem that provides an upper bound on the optimal value of the MSLP, and, under certain assu…
▽ More
Multi-stage stochastic linear programs (MSLPs) are notoriously hard to solve in general. Linear decision rules (LDRs) yield an approximation of an MSLP by restricting the decisions at each stage to be an affine function of the observed uncertain parameters. Finding an optimal LDR is a static optimization problem that provides an upper bound on the optimal value of the MSLP, and, under certain assumptions, can be formulated as an explicit linear program. Similarly, as proposed by Kuhn, Wiesemann, and Georghiou (Math. Program., 130, 177-209, 2011) a lower bound for an MSLP can be obtained by restricting decisions in the dual of the MSLP to follow an LDR. We propose a new approximation approach for MSLPs, two-stage LDRs. The idea is to require only the state variables in an MSLP to follow an LDR, which is sufficient to obtain an approximation of an MSLP that is a two-stage stochastic linear program (2SLP). We similarly propose to apply LDR only to a subset of the variables in the dual of the MSLP, which yields a 2SLP approximation of the dual that provides a lower bound on the optimal value of the MSLP. Although solving the corresponding 2SLP approximations exactly is intractable in general, we investigate how approximate solution approaches that have been developed for solving 2SLP can be applied to solve these approximation problems, and derive statistical upper and lower bounds on the optimal value of the MSLP. In addition to potentially yielding better policies and bounds, this approach requires many fewer assumptions than are required to obtain an explicit reformulation when using the standard static LDR approach. A computational study on two example problems demonstrates that using a two-stage LDR can yield significantly better primal policies and modestly better dual policies than using policies based on a static LDR.
△ Less
Submitted 18 March, 2018; v1 submitted 15 January, 2017;
originally announced January 2017.
-
A Cycle-Based Formulation and Valid Inequalities for DC Power Transmission Problems with Switching
Authors:
Burak Kocuk,
Hyemin Jeon,
Santanu S. Dey,
Jeff Linderoth,
James Luedtke,
Andy Sun
Abstract:
It is well-known that optimizing network topology by switching on and off transmission lines improves the efficiency of power delivery in electrical networks. In fact, the USA Energy Policy Act of 2005 (Section 1223) states that the U.S. should "encourage, as appropriate, the deployment of advanced transmission technologies" including "optimized transmission line configurations". As such, many aut…
▽ More
It is well-known that optimizing network topology by switching on and off transmission lines improves the efficiency of power delivery in electrical networks. In fact, the USA Energy Policy Act of 2005 (Section 1223) states that the U.S. should "encourage, as appropriate, the deployment of advanced transmission technologies" including "optimized transmission line configurations". As such, many authors have studied the problem of determining an optimal set of transmission lines to switch off to minimize the cost of meeting a given power demand under the direct current (DC) model of power flow. This problem is known in the literature as the Direct-Current Optimal Transmission Switching Problem (DC-OTS). Most research on DC-OTS has focused on heuristic algorithms for generating quality solutions or on the application of DC-OTS to crucial operational and strategic problems such as contingency correction, real-time dispatch, and transmission expansion. The mathematical theory of the DC-OTS problem is less well-developed. In this work, we formally establish that DC-OTS is NP-Hard, even if the power network is a series-parallel graph with at most one load/demand pair. Inspired by Kirchoff's Voltage Law, we give a cycle-based formulation for DC-OTS, and we use the new formulation to build a cycle-induced relaxation. We characterize the convex hull of the cycle-induced relaxation, and the characterization provides strong valid inequalities that can be used in a cutting-plane approach to solve the DC-OTS. We give details of a practical implementation, and we show promising computational results on standard benchmark instances.
△ Less
Submitted 16 October, 2015; v1 submitted 19 December, 2014;
originally announced December 2014.