-
On characterizing optimal learning trajectories in a class of learning problems
Authors:
Getachew K. Befekadu
Abstract:
In this brief paper, we provide a mathematical framework that exploits the relationship between the maximum principle and dynamic programming for characterizing optimal learning trajectories in a class of learning problems, which is related to point estimations for modeling of high-dimensional nonlinear functions. Here, such a characterization of the optimal learning trajectories is associated with the solution of an optimal control problem for a weakly-controlled gradient system with small parameters, whose time-evolution is guided by a model training dataset and its perturbed version, while the optimization problem consists of a cost functional that gauges the quality/performance of the estimated model parameters at a certain fixed final time w.r.t. a model validating dataset. Moreover, using a successive Galerkin approximation method, we provide an algorithmic recipe for constructing the corresponding optimal learning trajectories leading to the optimal estimated model parameters for such a class of learning problems.
Submitted 6 February, 2025; v1 submitted 27 January, 2025;
originally announced January 2025.
-
On improving generalization in a class of learning problems with the method of small parameters for weakly-controlled optimal gradient systems
Authors:
Getachew K. Befekadu
Abstract:
In this paper, we provide a mathematical framework for improving generalization in a class of learning problems which is related to point estimations for modeling of high-dimensional nonlinear functions. In particular, we consider a variational problem for a weakly-controlled gradient system, whose control input enters into the system dynamics as a coefficient to a nonlinear term which is scaled by a small parameter. Here, the optimization problem consists of a cost functional, which is associated with how to gauge the quality of the estimated model parameters at a certain fixed final time w.r.t. the model validating dataset, while the time-evolution of the weakly-controlled gradient system is guided by the model training dataset and its perturbed version with small random noise. Using perturbation theory, we provide results that allow us to solve a sequence of optimization problems, i.e., a set of decomposed optimization problems, so as to aggregate the corresponding approximate optimal solutions that are reasonably sufficient for improving generalization in such a class of learning problems. Moreover, we also provide an estimate for the rate of convergence for such approximate optimal solutions. Finally, we present some numerical results for a typical nonlinear regression problem.
Submitted 11 December, 2024;
originally announced December 2024.
-
Further extensions on the successive approximation method for hierarchical optimal control problems and its application to learning
Authors:
Getachew K. Befekadu
Abstract:
In this paper, further extensions of the result of the paper "A successive approximation method in functional spaces for hierarchical optimal control problems and its application to learning, arXiv:2410.20617 [math.OC], 2024" concerning a class of learning problems of point estimations for modeling of high-dimensional nonlinear functions are given. In particular, we present two viable extensions within the nested algorithm of the successive approximation method for the hierarchical optimal control problem that provide better convergence properties and computational efficiency, ultimately leading to an optimal parameter estimate. The first extension is mainly concerned with the convergence property of the steps involving how the two agents, i.e., the "leader" and the "follower," update their admissible control strategies, where we introduce augmented Hamiltonians for both agents and further reformulate the admissible control updating steps as sub-problems within the nested algorithm of the hierarchical optimal control problem that essentially provide better convergence properties. The second extension is concerned with the computational efficiency of the steps involving how the agents update their admissible control strategies, where we introduce an intermediate state variable for each agent and further embed the intermediate states within the optimal control problems of the "leader" and the "follower," respectively, which allows the admissible control updating steps to be fully and efficiently time-parallelized within the nested algorithm of the hierarchical optimal control problem.
Submitted 24 November, 2024;
originally announced November 2024.
-
A successive approximation method in functional spaces for hierarchical optimal control problems and its application to learning
Authors:
Getachew K. Befekadu
Abstract:
We consider a class of learning problems of point estimation for modeling high-dimensional nonlinear functions, whose learning dynamics is guided by a model training dataset, while the estimated parameter in due course provides an acceptable prediction accuracy on a different model validation dataset. Here, we establish an evidential connection between such a learning problem and a hierarchical optimal control problem that provides a framework for accounting appropriately for both generalization and regularization at the optimization stage. In particular, we consider the following two objectives: (i) The first one is a controllability-type problem, i.e., generalization, which consists of guaranteeing that the estimated parameter reaches a certain target set at some fixed final time, where such a target set is associated with the model validation dataset. (ii) The second one is a regularization-type problem ensuring that the estimated parameter trajectory satisfies some regularization property over a certain finite time interval. First, we partition the control into two control strategies that are compatible with two abstract agents, namely, a leader, which is responsible for the controllability-type problem, and a follower, which is associated with the regularization-type problem. Using the notion of Stackelberg's optimization, we provide conditions on the existence of admissible optimal controls for such a hierarchical optimal control problem under which the follower is required to respond optimally to the strategy of the leader, so as to achieve the overall objectives that ultimately lead to an optimal parameter estimate. Moreover, we provide a nested algorithm, arranged in a hierarchical structure based on successive approximation methods, for solving the corresponding optimal control problem. Finally, we present some numerical results for a typical nonlinear regression problem.
Submitted 27 October, 2024;
originally announced October 2024.
-
A naive aggregation algorithm for improving generalization in a class of learning problems
Authors:
Getachew K. Befekadu
Abstract:
In this brief paper, we present a naive aggregation algorithm for a typical learning problem with expert advice setting, in which the task of improving generalization, i.e., model validation, is embedded in the learning process as a sequential decision-making problem. In particular, we consider a class of learning problems of point estimations for modeling high-dimensional nonlinear functions, where a group of experts update their parameter estimates using the discrete-time version of gradient systems, with a small additive noise term, guided by the corresponding subsample datasets obtained from the original dataset. Here, our main objective is to provide conditions under which such an algorithm will sequentially determine a set of mixing distribution strategies used for aggregating the experts' estimates that ultimately lead to an optimal parameter estimate, i.e., a consensus solution for all experts, which is better than any individual expert's estimate in terms of improved generalization or learning performance. Finally, as part of this work, we present some numerical results for a typical nonlinear regression problem.
Submitted 6 September, 2024;
originally announced September 2024.
-
A new perspective on the learning dynamics for a class of learning problems via averaged gradient systems coupled with diffusion-transmutation processes
Authors:
Getachew K. Befekadu
Abstract:
In the first part of this paper, we consider a family of continuous-time dynamical systems coupled with diffusion-transmutation processes. Under certain conditions, such randomly perturbed dynamical systems can be interpreted as an averaged dynamical system, whose weighting coefficients, that depend on the state trajectory of the underlying averaged system, are assumed to be strictly positive with sum unity. Here, we provide a large deviation result for the corresponding family of processes, i.e., a variational problem formulation modeling the most likely sample path leading to certain noise-induced rare-events. This remarkably allows us to provide a computational algorithm for solving the corresponding variational problem. In the second part of the paper, we use some of the insights from the first part and provide a new perspective on the learning dynamics for a class of learning problems, whose averaged gradient dynamical systems, from continuous-time perspective, are guided by a set of subsampled datasets that are obtained from the original dataset via bootstrapping or other related resampling-based techniques. Finally, we present some numerical results for a typical nonlinear regression problem, where the corresponding averaged gradient system is interpreted as random walks on a graph, whose outgoing edges are uniformly chosen at random.
Submitted 20 August, 2024;
originally announced August 2024.
-
Embedding generalization within the learning dynamics: An approach based-on sample path large deviation theory
Authors:
Getachew K. Befekadu
Abstract:
We consider a typical learning problem of point estimations for modeling of nonlinear functions or dynamical systems in which generalization, i.e., verifying a given learned model, can be embedded as an integral part of the learning process or dynamics. In particular, we consider an empirical risk minimization based learning problem that exploits gradient methods from a continuous-time perspective with small random perturbations, which is guided by the training dataset loss. Here, we provide an asymptotic probability estimate in the small noise limit, based on the Freidlin-Wentzell theory of large deviations, for the event that the sample path of the random process corresponding to the randomly perturbed gradient dynamical system hits a certain target set, i.e., a rare event, where the latter is specified by the testing dataset loss landscape. Interestingly, the proposed framework can be viewed as one way of improving generalization and robustness in learning problems that provides new insights leading to optimal point estimates which are guided by the training data loss, while, at the same time, the learning dynamics has access to the testing dataset loss landscape in some form of future achievable or anticipated target goal. Moreover, as a by-product, we establish a connection with an optimal control problem, where the target set, i.e., the rare event, is considered as the desired outcome or achievable target goal for a certain optimal control problem, for which we also provide a verification result reinforcing the rationale behind the proposed framework. Finally, we present a computational algorithm that solves the corresponding variational problem leading to optimal point estimates and, as part of this work, we also present some numerical results for a typical nonlinear regression problem.
Submitted 4 August, 2024;
originally announced August 2024.
-
On the rare-event simulations of diffusion processes pertaining to a chain of distributed systems with small random perturbations
Authors:
Getachew K. Befekadu
Abstract:
In this paper, we consider an importance sampling problem for certain rare-event simulations involving the behavior of a diffusion process pertaining to a chain of distributed systems with random perturbations. We also assume that the distributed system, formed by $n$ subsystems -- in which a small random perturbation enters in the first subsystem and is then subsequently transmitted to the other subsystems -- satisfies an appropriate Hörmander condition. Here we provide an efficient importance sampling estimator, with an exponential variance decay rate, for the asymptotics of the probabilities of the rare events involving such a diffusion process that also ensures a minimum relative estimation error in the small noise limit. The framework for such an analysis basically relies on the connection between the probability theory of large deviations and the value functions for a family of stochastic control problems associated with the underlying distributed system, where such a connection provides a computational paradigm -- based on an exponentially-tilted biasing distribution -- for constructing efficient importance sampling estimators for the rare-event simulation. Moreover, as a by-product, the framework also allows us to derive a family of Hamilton-Jacobi-Bellman equations, for which we also provide a solvability condition for the corresponding optimal control problem.
Submitted 24 August, 2020;
originally announced August 2020.
-
On Goal-Oriented Multiobjective Embedded Optimization for System Performance Assessment
Authors:
Getachew K. Befekadu
Abstract:
In this short note, we discuss a goal-oriented multiobjective optimization problem for system performance assessment. The objective function for such an optimization problem, which is usually a composite of different performance indices corresponding to different operating conditions or scenarios in the system, is then posed as a goal-oriented multiobjective optimization problem. The (sub)-optimal solution(s) for such a nonlinear optimization problem can be found using a Sequential Quadratic Programming algorithm.
Submitted 10 June, 2020;
originally announced June 2020.
-
Optimal residence time control for stochastically perturbed prescription opioid epidemic models
Authors:
Getachew K. Befekadu
Abstract:
In this paper, we consider an optimal control problem for a prescription opioid epidemic model that describes the interaction between the regular prescription or addictive use of opioid drugs, and the process of rehabilitation and that of relapsing into opioid drug use. In particular, our interest is in the situation where the control appearing linearly in the opioid epidemics is interpreted as the rate at which the susceptible individuals are effectively removed from the population due to an opioid-related intervention policy or when the dynamics of the addicted is strategically influenced due to an accessible addiction treatment facility, while a small perturbing noise enters through the dynamics of the susceptible group in the population compartmental model. To this end, we introduce a mathematical apparatus that minimizes the asymptotic exit-rate with which the solution for such stochastically perturbed prescription opioid epidemics exits from a given bounded open domain. Moreover, under certain assumptions, we also provide an admissible optimal Markov control for the corresponding optimal control problem that optimally effects the removal of the susceptible or recovered individuals from the population dynamics.
Submitted 7 December, 2018; v1 submitted 12 September, 2018;
originally announced September 2018.
-
Optimal control of diffusion processes pertaining to an opioid epidemic dynamical model with random perturbations
Authors:
Getachew K. Befekadu, Quanyan Zhu
Abstract:
In this paper, we consider the problem of controlling a diffusion process pertaining to an opioid epidemic dynamical model with random perturbation so as to prevent it from leaving a given bounded open domain. Here, we assume that the random perturbation enters only through the dynamics of the susceptible group in the compartmental model of the opioid epidemic dynamics and, as a result of this, the corresponding diffusion is degenerate, for which we further assume that the associated diffusion operator is hypoelliptic. In particular, we minimize the asymptotic exit rate of such a controlled-diffusion process from the given bounded open domain and we derive the Hamilton-Jacobi-Bellman equation for the corresponding optimal control problem, which is closely related to a nonlinear eigenvalue problem. Finally, we also prove a verification theorem that provides a sufficient condition for optimal control.
Submitted 25 June, 2018;
originally announced June 2018.
-
A further study on the opioid epidemic dynamical model with random perturbation
Authors:
Getachew K. Befekadu, Quanyan Zhu
Abstract:
In this paper, we consider an opioid epidemic dynamical model with random perturbation that typically describes the interplay between regular prescription use, addictive use, and the process of rehabilitation from addiction and vice-versa. In particular, we provide two-sided bounds on the solution of the transition density function for the Fokker-Planck equation that corresponds to the opioid epidemic dynamical model, when a random perturbation enters only through the dynamics of the susceptible group in the compartmental model. Here, the proof for such bounds basically relies on the interpretation of the solution for the transition density function as the value function of a certain optimal stochastic control problem. Finally, as a possible interesting development in this direction, we also provide an estimate for the attainable exit probability with which the solution for the randomly perturbed opioid epidemic dynamical model exits from a given bounded open domain during a certain time interval. Note that such qualitative information on the first exit-time as well as two-sided bounds on the transition density function are useful for developing effective and fact-informed intervention strategies that primarily aim at curbing opioid epidemics or assisting in interpreting outcome results from opioid-related policies.
Submitted 17 June, 2018; v1 submitted 31 May, 2018;
originally announced May 2018.
-
On the asymptotic of exit problems for controlled Markov diffusion processes with random jumps and vanishing diffusion terms
Authors:
Getachew K. Befekadu
Abstract:
In this paper, we study the asymptotic of exit problem for controlled Markov diffusion processes with random jumps and vanishing diffusion terms, where the random jumps are introduced in order to modify the evolution of the controlled diffusions by switching from one mode of dynamics to another. That is, depending on the state-position and state-transition information, the dynamics of the controlled diffusions randomly switches between the different drift and diffusion terms. Here, we specifically investigate the asymptotic exit problem concerning such controlled Markov diffusion processes in two steps: (i) First, for each controlled diffusion model, we look for an admissible Markov control process that minimizes the principal eigenvalue for the corresponding infinitesimal generator with zero Dirichlet boundary conditions -- where such an admissible control process also forces the controlled diffusion process to remain in a given bounded open domain for a longer duration. (ii) Then, using large deviations theory, we determine the exit place and the type of distribution at the exit time for the controlled Markov diffusion processes coupled with random jumps and vanishing diffusion terms. Moreover, the asymptotic results at the exit time also allow us to determine the limiting behavior of the Dirichlet problem for the corresponding system of elliptic partial differential equations containing a small vanishing parameter.
Submitted 6 February, 2018;
originally announced February 2018.
-
Large deviation principle for dynamical systems coupled with diffusion-transmutation processes
Authors:
Getachew K. Befekadu
Abstract:
In this paper, we introduce a mathematical apparatus that is relevant for understanding a dynamical system with small random perturbations and coupled with the so-called transmutation process -- where the latter jumps from one mode to another, thus modifying the dynamics of the system. In particular, we study the exit problem, i.e., an asymptotic estimate for the exit probabilities with which the corresponding processes exit from a given bounded open domain, and then formally prove a large deviation principle for the exit position joint with the type occupation times as the random perturbation vanishes. Moreover, under certain conditions, the exit place and the type of distribution at the exit time are determined and, as a consequence of this, such information also gives the limit of the Dirichlet problems for the associated partial differential equation systems with a vanishing small parameter.
Submitted 14 September, 2017;
originally announced September 2017.
-
On the stochastic decision problems with backward stochastic viability property
Authors:
Getachew K. Befekadu
Abstract:
In this paper, we consider a stochastic decision problem for a system governed by a stochastic differential equation, in which an optimal decision is made in such a way to minimize a vector-valued accumulated cost over a finite-time horizon that is associated with the solution of a certain multi-dimensional backward stochastic differential equation (BSDE). Here, we also assume that the solution for such a multi-dimensional BSDE almost surely satisfies a backward stochastic viability property w.r.t. a given closed convex set. Moreover, under suitable conditions, we establish the existence of an optimal solution, in the sense of viscosity solutions, to the associated system of semilinear parabolic PDEs. Finally, we briefly comment on the implication of our results.
Submitted 5 January, 2018; v1 submitted 12 September, 2017;
originally announced September 2017.
-
Acceptable risks and related decision problems with multiple risk-averse agents
Authors:
Getachew K. Befekadu, Eduardo L. Pasiliao
Abstract:
In this paper, we consider a risk-averse decision problem for controlled-diffusion processes, with dynamic risk measures, in which multiple risk-averse agents choose their decisions in such a way to minimize their individual accumulated risk-costs over a finite-time horizon. In particular, we introduce multi-structure dynamic risk measures induced from conditional $g$-expectations, where the latter are associated with the generator functionals of certain BSDEs that implicitly take into account the risk-cost functionals of the risk-averse agents. Here, we also require such solutions of the BSDEs to satisfy a stochastic viability property with respect to a given closed convex set. Moreover, using a result similar to that of the Arrow-Barankin-Blackwell theorem, we establish the existence of consistent optimal decisions for the risk-averse agents, when the set of all Pareto optimal solutions, in the sense of viscosity, for the associated dynamic programming equations is dense in the given closed convex set. Finally, we briefly comment on the characteristics of acceptable risks vis-à-vis some uncertain future costs or outcomes, in which results from the dynamic risk analysis constitute part of the information used in the risk-averse decision criteria.
Submitted 13 November, 2016; v1 submitted 10 November, 2016;
originally announced November 2016.
-
On the dynamic consistency of hierarchical risk-averse decision problems
Authors:
Getachew K. Befekadu, Eduardo L. Pasiliao
Abstract:
In this paper, we consider a risk-averse decision problem for controlled-diffusion processes, with dynamic risk measures, in which there are two risk-averse decision makers (i.e., {\it leader} and {\it follower}) with different risk-averse related responsibilities and information. Moreover, we assume that there are two objectives that these decision makers are expected to achieve. That is, the fir…
▽ More
In this paper, we consider a risk-averse decision problem for controlled-diffusion processes, with dynamic risk measures, in which there are two risk-averse decision makers (i.e., {\it leader} and {\it follower}) with different risk-averse related responsibilities and information. Moreover, we assume that there are two objectives that these decision makers are expected to achieve. That is, the first objective being of {\it stochastic controllability} type that describes an acceptable risk-exposure set vis-á-vis some uncertain future payoff, and while the {\it second one} is making sure the solution of a certain risk-related system equation has to stay always above a given continuous stochastic process, namely {\it obstacle}. In particular, we introduce multi-structure, time-consistent, dynamic risk measures induced from conditional $g$-expectations, where the latter are associated with the generator functionals of two backward-SDEs that implicitly take into account the above two objectives along with the given continuous obstacle process. Moreover, under certain conditions, we establish the existence of optimal hierarchical risk-averse solutions, in the sense of viscosity solutions, to the associated risk-averse dynamic programming equations that formalize the way in which both the {\it leader} and {\it follower} consistently choose their respective risk-averse decisions. Finally, we remark on the implication of our result in assessing the influence of the {\it leader'}s decisions on the risk-averseness of the {\it follower} in relation to the direction of {\it leader-follower} information flow.
Submitted 23 October, 2016;
originally announced October 2016.
-
On the hierarchical risk-averse control problems for diffusion processes
Authors:
Getachew K. Befekadu,
Alexander Veremyev,
Eduardo L. Pasiliao
Abstract:
In this paper, we consider a risk-averse control problem for diffusion processes, in which there is a partition of the admissible control strategy into two decision-making groups (namely, the {\it leader} and {\it follower}) with different cost functionals and risk-averse satisfactions. Our approach, based on a hierarchical optimization framework, requires that a certain level of risk-averse satisfaction be achieved for the {\it leader} as a priority over that of the {\it follower's} risk-averseness. In particular, we formulate such a risk-averse control problem involving a family of time-consistent dynamic convex risk measures induced by conditional $g$-expectations (i.e., filtration-consistent nonlinear expectations associated with the generators of certain backward stochastic differential equations). Moreover, under suitable conditions, we establish the existence of optimal risk-averse solutions, in the sense of viscosity solutions, for the corresponding risk-averse dynamic programming equations. Finally, we briefly comment on the implication of our results.
Submitted 2 January, 2018; v1 submitted 10 March, 2016;
originally announced March 2016.
-
Remarks on the hierarchical control problems with model uncertainty
Authors:
Getachew K. Befekadu,
Eduardo L. Pasiliao
Abstract:
In this paper, we consider a hierarchical control problem with model uncertainty. Specifically, we consider the following objectives that we would like to accomplish. The first is of controllability type and consists of guaranteeing that the terminal state reaches a target set starting from an initial condition, while the second is to keep the state trajectory of the system close to a given reference trajectory over a finite time interval. We introduce the following framework. First, we partition the control subdomain into two disjoint open subdomains, with smooth boundaries, that are compatible with the strategy subspaces of the {\it leader} (which is responsible for the controllability-type criterion) and that of the {\it follower} (which is associated with the second criterion), respectively. Moreover, we account for model uncertainty at the optimization stage by allowing the {\it leader} to choose its control strategy based on a class of alternative models of the system, whereas the {\it follower} makes use of an approximate model of the system. Using the notion of Stackelberg's optimization, we provide conditions for the existence of optimal control strategies for such a hierarchical control problem, under which the {\it follower} is required to respond optimally to the strategy of the {\it leader} so as to achieve the overall objectives. Apart from the issue of modeling and uncertainty, this paper is a companion to our previous work.
Submitted 13 October, 2015;
originally announced October 2015.
-
On the attainable distributions of controlled-diffusion processes pertaining to a chain of distributed systems
Authors:
Getachew K. Befekadu,
Eduardo L. Pasiliao
Abstract:
We consider a controlled-diffusion process pertaining to a chain of distributed systems with random perturbations that satisfies a weak Hörmander type condition. In particular, we consider a stochastic control problem with the following objectives that we would like to achieve. The first is of reachability type and consists of determining a set of attainable distributions at a given time starting from an initial distribution, while the second involves minimizing the relative entropy subject to the initial and desired final attainable distributions. Using the logarithmic transformations approach from Fleming, we provide a sufficient condition for the existence of an optimal admissible control for such a stochastic control problem, which amounts to changing the drift by a certain perturbation suggested by Jamison in the context of reciprocal processes. Moreover, such a perturbation coincides with a minimum energy control among all admissible controls forcing the controlled-diffusion process to the desired final attainable distribution starting from the initial distribution. Finally, we briefly remark on the invariance property of the path-space measure for such a controlled-diffusion process pertaining to the chain of distributed systems.
Submitted 25 September, 2015; v1 submitted 21 September, 2015;
originally announced September 2015.
-
On the hierarchical optimal control of a chain of distributed systems
Authors:
Getachew K. Befekadu,
Eduardo L. Pasiliao
Abstract:
In this paper, we consider a chain of distributed systems governed by a degenerate parabolic equation, which satisfies a weak Hörmander type condition, with a control distributed over an open subdomain. In particular, we consider two objectives that we would like to accomplish. The first is of controllability type and consists of guaranteeing that the terminal state reaches a target set starting from an initial condition, while the second is to keep the state trajectory of the overall system close to a given reference trajectory over a finite, compact time interval. We introduce the following framework. First, we partition the control subdomain into two disjoint open subdomains that are compatible with the strategy subspaces of the {\it leader} and that of the {\it follower}, respectively. Then, using the notion of Stackelberg's optimization (which is a hierarchical optimization framework), we provide a new result on the existence of optimal strategies for such an optimization problem -- where the {\it follower} (which corresponds to the second criterion) is required to respond optimally, in the sense of {\it best-response correspondence}, to the strategy of the {\it leader} (which is associated with the controllability-type criterion) so as to achieve the overall objectives. Finally, we remark on the implication of our result in assessing the influence of the reachable target set on the optimal strategy of the {\it follower} in relation to the direction of {\it leader-follower} and {\it follower-leader} information flows.
Submitted 12 August, 2015; v1 submitted 10 August, 2015;
originally announced August 2015.
-
On quantification of systemic redundancy in reliable systems
Authors:
Getachew K. Befekadu,
Panos J. Antsaklis
Abstract:
In this paper, we consider the problem of quantifying systemic redundancy in reliable systems having multiple controllers with overlapping functionality. In particular, we consider a multi-channel system with multi-controller configurations -- where controllers are required to respond optimally, in the sense of best-response correspondence, a {\it reliable-by-design} requirement, to non-faulty controllers so as to ensure or maintain some system properties. Here we introduce a mathematical framework, based on the notion of relative entropy of probability measures associated with steady-state solutions of Fokker-Planck equations for a family of stochastically perturbed multi-channel systems, that provides useful information towards a systemic assessment of redundancy in the system.
Submitted 14 February, 2015;
originally announced February 2015.
-
On the controlled eigenvalue problem for stochastically perturbed multi-channel systems
Authors:
Getachew K. Befekadu
Abstract:
In this brief paper, we consider the problem of minimizing the asymptotic exit rate of diffusion processes from an open connected bounded set pertaining to a multi-channel system with small random perturbations. Specifically, we establish a connection between: (i) the existence of an invariant set for the unperturbed multi-channel system w.r.t. a certain class of state-feedback controllers; and (ii) the asymptotic behavior of the principal eigenvalues and the solutions of the Hamilton-Jacobi-Bellman (HJB) equations corresponding to a family of singularly perturbed elliptic operators. Finally, we provide a sufficient condition for the existence of a Pareto equilibrium (i.e., a set of optimal exit rates w.r.t. each of the input channels) for the HJB equations -- where the latter correspond to a family of nonlinear controlled eigenvalue problems.
Submitted 4 October, 2016; v1 submitted 6 January, 2015;
originally announced January 2015.
-
On the risk-sensitive escape control for diffusion processes pertaining to an expanding construction of distributed control systems
Authors:
Getachew K. Befekadu,
Panos J. Antsaklis
Abstract:
In this paper, we consider an expanding construction of a distributed control system, which is obtained by adding a new subsystem one after the other, until all $n$ subsystems, where $n \ge 2$, are included in the distributed control system. It is assumed that a small random perturbation enters only into the first subsystem and is then subsequently transmitted to the other subsystems. Moreover, for any $\ell \in \{2, ..., n\}$, the distributed control system, compatible with the expanding construction, which is obtained from the first $\ell$ subsystems, satisfies an appropriate Hörmander condition. As a result of this, the diffusion process is degenerate, i.e., the backward operator associated with it is a degenerate parabolic equation. Our main interest here is to prevent the diffusion process (that corresponds to a particular subsystem) from leaving a given bounded open domain. In particular, we consider a risk-sensitive version of the mean escape time criterion with respect to each of the subsystems. Using a variational representation, we characterize the risk-sensitive escape control for the diffusion process as the lower and upper values of an associated stochastic differential game. Finally, we comment on the implication of our results, where one is also interested in evaluating the performance of the risk-sensitive escape control, when there is some modeling error in the distributed control system.
Submitted 29 September, 2014; v1 submitted 23 September, 2014;
originally announced September 2014.
-
On the minimum exit rate for a diffusion process pertaining to a chain of distributed control systems with random perturbations
Authors:
Getachew K. Befekadu,
Panos J. Antsaklis
Abstract:
In this paper, we consider the problem of minimizing the exit rate with which a diffusion process pertaining to a chain of distributed control systems, with random perturbations, exits from a given bounded open domain. In particular, we consider a chain of distributed control systems that is formed by $n$ subsystems (with $n \ge 2$), where the random perturbation enters only in the first subsystem and is then subsequently transmitted to the other subsystems. Furthermore, we assume that, for any $\ell \in \{2, \ldots, n\}$, the distributed control system, which is formed by the first $\ell$ subsystems, satisfies an appropriate Hörmander condition. As a result of this, the diffusion process is degenerate, in the sense that the infinitesimal generator associated with it is a degenerate parabolic equation. Our interest is to establish a connection between the minimum exit rate with which the diffusion process exits from the given domain and the principal eigenvalue for the infinitesimal generator with zero boundary conditions. Such a connection allows us to derive a family of Hamilton-Jacobi-Bellman equations for which we provide a verification theorem that shows the validity of the corresponding optimal control problems. Finally, we provide an estimate on the attainable exit probability of the diffusion process with respect to a set of admissible (optimal) Markov controls for the optimal control problems.
Submitted 10 September, 2014; v1 submitted 9 September, 2014;
originally announced September 2014.
-
Exit Probabilities for a Chain of Distributed Control Systems with Small Random Perturbations
Authors:
Getachew K. Befekadu,
Panos J. Antsaklis
Abstract:
In this paper, we consider a diffusion process pertaining to a chain of distributed control systems with small random perturbation. The distributed control system is formed by $n$ subsystems that satisfy an appropriate Hörmander condition, i.e., the second subsystem assumes the random perturbation entered into the first subsystem, the third subsystem assumes the random perturbation that entered into the first subsystem and was then transmitted to the second subsystem, and so on, such that the random perturbation propagates through the entire distributed control system. Note that the random perturbation enters only in one of the subsystems and, hence, the diffusion process is degenerate, in the sense that the backward operator associated with it is a degenerate parabolic equation. Our interest is to estimate the exit probability with which a diffusion process (corresponding to a particular subsystem) exits from a given bounded open domain during a certain time interval. The method for such an estimate basically relies on the interpretation of the exit probability function as a value function for a family of stochastic control problems that are associated with the underlying chain of distributed control systems.
Submitted 2 September, 2014; v1 submitted 26 August, 2014;
originally announced August 2014.
-
On noncooperative $n$-player principal eigenvalue games
Authors:
Getachew K. Befekadu,
Panos J. Antsaklis
Abstract:
We consider a noncooperative $n$-player principal eigenvalue game which is associated with an infinitesimal generator of a stochastically perturbed multi-channel dynamical system -- where, in the course of such a game, each player attempts to minimize the asymptotic rate with which the controlled state trajectory of the system exits from a given bounded open domain. In particular, we show the existence of a Nash-equilibrium point (i.e., an $n$-tuple of equilibrium linear feedback operators) that is distinctly related to a unique maximum closed invariant set of the corresponding deterministic multi-channel dynamical system, when the latter is composed with this $n$-tuple of equilibrium linear feedback operators.
Submitted 12 August, 2014; v1 submitted 21 May, 2014;
originally announced May 2014.
-
On the Problem of Minimum Asymptotic Exit Rate for Stochastically Perturbed Multi-Channel Dynamical Systems
Authors:
Getachew K. Befekadu,
Panos J. Antsaklis
Abstract:
We consider the problem of minimizing the asymptotic exit rate with which the controlled-diffusion process of a stochastically perturbed multi-channel dynamical system exits from a given bounded open domain. In particular, for a class of admissible bounded linear feedback operators, we establish a connection between the asymptotic exit rate with which such a controlled-diffusion process exits from the given domain and the asymptotic behavior (i.e., a probabilistic characterization) of the principal eigenvalue of the infinitesimal generator, which corresponds to the stochastically perturbed dynamical system, with zero boundary conditions on the given domain. Finally, we briefly remark on the implication of our result for evaluating the performance of the associated deterministic multi-channel dynamical system, when such a dynamical system is composed with a set of (sub)-optimal admissible linear feedback operators.
Submitted 20 August, 2014; v1 submitted 6 May, 2014;
originally announced May 2014.
-
Large Deviations for the Reliability Assessment of Redundant Multi-Channel Systems
Authors:
Getachew K. Befekadu,
Panos J. Antsaklis
Abstract:
In this paper, we are concerned with the reliability assessment of redundant multi-channel systems having multiple controllers with overlapping functionality -- where all controllers are required to respond optimally to the non-faulty controllers so as to ensure or maintain some system properties. In particular, for such redundant systems with small random perturbation, we study the relationships between the exit probabilities with which the state-trajectories exit from a given bounded open domain and the value functions corresponding to a family of stochastic exit-time control problems on the boundary of the given domain. Moreover, as the random perturbation vanishes, such relationships provide useful information concerning the reliability of the redundant multi-channel systems arising from the large deviations problem in connection with the asymptotic estimates of exit probabilities with respect to some portions of the boundary of the given domain. Finally, we briefly comment on the implication of our results on a co-design technique using multi-objective optimization frameworks for evaluating the performance of the redundant multi-channel systems.
Submitted 26 March, 2015; v1 submitted 17 April, 2014;
originally announced April 2014.
-
On the entropy of equilibrium measures and game-theoretic equilibrium feedback operators in multi-channel dynamical systems
Authors:
Getachew K. Befekadu,
Panos J. Antsaklis
Abstract:
We investigate the connection between the entropy of equilibrium measures and game-theoretic equilibrium feedback operators in a multi-channel dynamical system. Specifically, we show that the existence of an equilibrium measure, which maximizes the free energy (i.e., the sum of the entropy and the integral over a potential), is related to an equilibrium or "maximum entropy" state for the multi-channel dynamical system that is composed with a set of feedback operators. Further, we observe that such a connection makes sense when this set of feedback operators strategically interacts over an infinite-horizon, in a game-theoretic sense, using the current state-information of the system. Finally, we briefly comment on the implication of our result to the resilient behavior of the equilibrium feedback operators, when there is a random perturbation in the system.
Submitted 29 January, 2014; v1 submitted 30 December, 2013;
originally announced December 2013.
-
Relating maximum entropy, resilient behavior and game-theoretic equilibrium feedback operators in multi-channel systems
Authors:
Getachew K. Befekadu,
Panos J. Antsaklis
Abstract:
In this paper, we first draw a connection between the existence of a stationary density function (which corresponds to an equilibrium state in the sense of statistical mechanics) and a set of feedback operators in a multi-channel system that strategically interacts in a game-theoretic framework. In particular, we show that there exists a set of (game-theoretic) equilibrium feedback operators such that the composition of the multi-channel system with this set of equilibrium feedback operators, when described by density functions, will evolve towards an equilibrium state in such a way that the entropy of the whole system is maximized. As a result of this, we are led to study, by means of a stationary density function (i.e., a common fixed-point) for a family of Frobenius-Perron operators, how the dynamics of the system together with the equilibrium feedback operators determine the evolution of the density functions, and how this information translates into the maximum entropy behavior of the system. Later, we use such results to examine the resilient behavior of this set of equilibrium feedback operators, when there is a small random perturbation in the system.
Submitted 14 January, 2014; v1 submitted 18 December, 2013;
originally announced December 2013.
-
A brief remark on the topological entropy for linear switched systems
Authors:
Getachew K. Befekadu
Abstract:
In this brief note, we investigate the topological entropy for linear switched systems. Specifically, we use the Levi-Malcev decomposition of Lie algebras to establish a connection between the basic properties of the topological entropy and the stability of switched linear systems. For such systems, we show that the topological entropy for the evolution operator corresponding to a semi-simple subalgebra is always bounded from above by the negative of the largest real part of the eigenvalue that corresponds to the evolution operator of a maximal solvable ideal part.
Submitted 20 October, 2013;
originally announced October 2013.
-
On a connection between the reliability of multi-channel systems and the notion of controlled-invariance entropy
Authors:
Getachew K Befekadu
Abstract:
The purpose of this note is to establish a connection between the problem of reliability (when there is an intermittent control-input channel failure that may occur between actuators, controllers and/or sensors in the system) and the notion of controlled-invariance entropy of a multi-channel system (with respect to a subset of control-input channels and/or a class of control functions). We remark that such a connection could be used for assessing the reliability (or the vulnerability) of the system, when some of these control-input channels are compromised by an external "malicious" agent that may try to prevent the system from achieving more of its goals (such as attaining invariance of a given compact state and/or output subspace).
Submitted 9 March, 2013; v1 submitted 10 February, 2013;
originally announced February 2013.