-
Boundary Estimates for the Monge-Ampère Equation in the Polygons with Guillemin Boundary Conditions
Authors:
Masoud Bayrami-Aminlouee,
Reza Seyyedali,
Mohammad Talebi
Abstract:
We establish a Schauder-type boundary regularity result for a two-dimensional singular Monge-Ampére equation on convex polytopes with Guillemin boundary conditions. This extends the previous work of Rubin and Huang to the case where the right-hand side is less regular; specifically, Hölder continuous functions. Our method relies heavily on the sophisticated techniques developed by Donaldson in his…
▽ More
We establish a Schauder-type boundary regularity result for a two-dimensional singular Monge-Ampére equation on convex polytopes with Guillemin boundary conditions. This extends the previous work of Rubin and Huang to the case where the right-hand side is less regular; specifically, Hölder continuous functions. Our method relies heavily on the sophisticated techniques developed by Donaldson in his series of papers on the Abreu equation.
△ Less
Submitted 27 June, 2025;
originally announced June 2025.
-
Entropic Risk Optimization in Discounted MDPs: Sample Complexity Bounds with a Generative Model
Authors:
Oliver Mortensen,
Mohammad Sadegh Talebi
Abstract:
In this paper we analyze the sample complexities of learning the optimal state-action value function $Q^*$ and an optimal policy $π^*$ in a discounted Markov decision process (MDP) where the agent has recursive entropic risk-preferences with risk-parameter $β\neq 0$ and where a generative model of the MDP is available. We provide and analyze a simple model based approach which we call model-based…
▽ More
In this paper we analyze the sample complexities of learning the optimal state-action value function $Q^*$ and an optimal policy $π^*$ in a discounted Markov decision process (MDP) where the agent has recursive entropic risk-preferences with risk-parameter $β\neq 0$ and where a generative model of the MDP is available. We provide and analyze a simple model based approach which we call model-based risk-sensitive $Q$-value-iteration (MB-RS-QVI) which leads to $(ε,δ)$-PAC-bounds on $\|Q^*-Q^k\|$, and $\|V^*-V^{π_k}\|$ where $Q_k$ is the output of MB-RS-QVI after k iterations and $π_k$ is the greedy policy with respect to $Q_k$. Both PAC-bounds have exponential dependence on the effective horizon $\frac{1}{1-γ}$ and the strength of this dependence grows with the learners risk-sensitivity $|β|$. We also provide two lower bounds which shows that exponential dependence on $|β|\frac{1}{1-γ}$ is unavoidable in both cases. The lower bounds reveal that the PAC-bounds are both tight in $\varepsilon$ and $δ$ and that the PAC-bound on $Q$-learning is tight in the number of actions $A$, and that the PAC-bound on policy-learning is nearly tight in $A$.
△ Less
Submitted 30 May, 2025;
originally announced June 2025.
-
Scaling Power Management in Cloud Data Centers: A Multi-Level Continuous-Time MDP Approach
Authors:
Behzad Chitsaz,
Ahmad Khonsari,
Masoumeh Moradian,
Aresh Dadlani,
Mohammad Sadegh Talebi
Abstract:
Power management in multi-server data centers~especially at scale is a vital issue of increasing importance in cloud computing paradigm. Existing studies mostly consider thresholds on the number of idle servers to switch the servers on or off and suffer from scalability issues. As a natural approach in view~of~the Markovian assumption, we present a multi-level continuous-time Markov decision proce…
▽ More
Power management in multi-server data centers~especially at scale is a vital issue of increasing importance in cloud computing paradigm. Existing studies mostly consider thresholds on the number of idle servers to switch the servers on or off and suffer from scalability issues. As a natural approach in view~of~the Markovian assumption, we present a multi-level continuous-time Markov decision process (CTMDP) model based on state aggregation of multi-server data centers with setup times that interestingly overcomes the inherent intractability of traditional MDP approaches due to their colossal state-action space. The beauty of the presented model is that, while it keeps loyalty to the Markovian behavior, it approximates the calculation of the transition probabilities in a way that keeps the accuracy of the results at a desirable level. Moreover, near-optimal performance is attained at the expense of the increased state-space dimensionality by tuning the number of levels in the multi-level approach. The simulation results were promising and confirm that in many scenarios of interest, the proposed approach attains noticeable improvements, namely a near 50% reduction in the size of CTMDP while yielding better rewards as compared to existing fixed threshold-based policies and aggregation methods.
△ Less
Submitted 19 July, 2023; v1 submitted 3 August, 2021;
originally announced August 2021.
-
Analytic torsion on manifolds with fibred boundary metrics
Authors:
Mohammad Talebi
Abstract:
In this paper, we construct the renormalized analytic torsion in the setup of manifold endowed with fibred boundary metrics. The method of construction is to determine the asymptotic of heat kernel, both in short time regime and long time regime and apply these asymptotics together with renormalization to determine the renormalized zeta function and the determinant of Hodge Laplacian.
In this paper, we construct the renormalized analytic torsion in the setup of manifold endowed with fibred boundary metrics. The method of construction is to determine the asymptotic of heat kernel, both in short time regime and long time regime and apply these asymptotics together with renormalization to determine the renormalized zeta function and the determinant of Hodge Laplacian.
△ Less
Submitted 2 February, 2021;
originally announced February 2021.
-
Spectral geometry on manifolds with fibred boundary metrics II: heat kernel asymptotics
Authors:
Mohammad Talebi,
Boris Vertman
Abstract:
In this paper we continue the analysis of spectral problems in the setting of complete manifolds with fibred boundary metrics, also referred to as $φ$-metrics, as initiated in our previous work. We consider the Hodge Laplacian for a $φ$-metric and construct the corresponding heat kernel as a polyhomogeneous conormal distribution on an appropriate manifold with corners. Our discussion is a generali…
▽ More
In this paper we continue the analysis of spectral problems in the setting of complete manifolds with fibred boundary metrics, also referred to as $φ$-metrics, as initiated in our previous work. We consider the Hodge Laplacian for a $φ$-metric and construct the corresponding heat kernel as a polyhomogeneous conormal distribution on an appropriate manifold with corners. Our discussion is a generalization of an earlier work by Albin and Sher, and provides a fundamental first step towards analysis of Ray-Singer torsion, eta-invariants and index theorems in the setting.
△ Less
Submitted 4 November, 2021; v1 submitted 21 January, 2021;
originally announced January 2021.
-
Spectral geometry on manifolds with fibred boundary metrics I: Low energy resolvent
Authors:
Daniel Grieser,
Mohammad Talebi,
Boris Vertman
Abstract:
We study the low energy resolvent of the Hodge Laplacian on a manifold equipped with a fibred boundary metric. We determine the precise asymptotic behavior of the resolvent as a fibred boundary (aka $φ$-) pseudodifferential operator when the resolvent parameter tends to zero. This generalizes previous work by Guillarmou and Sher who considered asymptotically conic metrics, which correspond to the…
▽ More
We study the low energy resolvent of the Hodge Laplacian on a manifold equipped with a fibred boundary metric. We determine the precise asymptotic behavior of the resolvent as a fibred boundary (aka $φ$-) pseudodifferential operator when the resolvent parameter tends to zero. This generalizes previous work by Guillarmou and Sher who considered asymptotically conic metrics, which correspond to the special case when the fibres are points. The new feature in the case of non-trivial fibres is that the resolvent has different asymptotic behavior on the subspace of forms that are fibrewise harmonic and on its orthogonal complement. To deal with this, we introduce an appropriate 'split' pseudodifferential calculus, building on and extending work by Grieser and Hunsicker. Our work sets the basis for the discussion of spectral invariants on $φ$-manifolds.
△ Less
Submitted 19 June, 2024; v1 submitted 21 September, 2020;
originally announced September 2020.
-
The Mean Drift: Tailoring the Mean Field Theory of Markov Processes for Real-World Applications
Authors:
Mahmoud Talebi,
Jan Friso Groote,
Jean-Paul Linnartz
Abstract:
The statement of the mean field approximation theorem in the mean field theory of Markov processes particularly targets the behaviour of population processes with an unbounded number of agents. However, in most real-world engineering applications one faces the problem of analysing middle-sized systems in which the number of agents is bounded. In this paper we build on previous work in this area an…
▽ More
The statement of the mean field approximation theorem in the mean field theory of Markov processes particularly targets the behaviour of population processes with an unbounded number of agents. However, in most real-world engineering applications one faces the problem of analysing middle-sized systems in which the number of agents is bounded. In this paper we build on previous work in this area and introduce the mean drift. We present the concept of population processes and the conditions under which the approximation theorems apply, and then show how the mean drift is derived through a systematic application of the propagation of chaos. We then use the mean drift to construct a new set of ordinary differential equations which address the analysis of population processes with an arbitrary size.
△ Less
Submitted 10 May, 2017; v1 submitted 13 March, 2017;
originally announced March 2017.
-
Combinatorial Bandits Revisited
Authors:
Richard Combes,
M. Sadegh Talebi,
Alexandre Proutiere,
Marc Lelarge
Abstract:
This paper investigates stochastic and adversarial combinatorial multi-armed bandit problems. In the stochastic setting under semi-bandit feedback, we derive a problem-specific regret lower bound, and discuss its scaling with the dimension of the decision space. We propose ESCB, an algorithm that efficiently exploits the structure of the problem and provide a finite-time analysis of its regret. ES…
▽ More
This paper investigates stochastic and adversarial combinatorial multi-armed bandit problems. In the stochastic setting under semi-bandit feedback, we derive a problem-specific regret lower bound, and discuss its scaling with the dimension of the decision space. We propose ESCB, an algorithm that efficiently exploits the structure of the problem and provide a finite-time analysis of its regret. ESCB has better performance guarantees than existing algorithms, and significantly outperforms these algorithms in practice. In the adversarial setting under bandit feedback, we propose \textsc{CombEXP}, an algorithm with the same regret scaling as state-of-the-art algorithms, but with lower computational complexity for some combinatorial problems.
△ Less
Submitted 5 November, 2015; v1 submitted 11 February, 2015;
originally announced February 2015.
-
Stochastic Online Shortest Path Routing: The Value of Feedback
Authors:
M. Sadegh Talebi,
Zhenhua Zou,
Richard Combes,
Alexandre Proutiere,
Mikael Johansson
Abstract:
This paper studies online shortest path routing over multi-hop networks. Link costs or delays are time-varying and modeled by independent and identically distributed random processes, whose parameters are initially unknown. The parameters, and hence the optimal path, can only be estimated by routing packets through the network and observing the realized delays. Our aim is to find a routing policy…
▽ More
This paper studies online shortest path routing over multi-hop networks. Link costs or delays are time-varying and modeled by independent and identically distributed random processes, whose parameters are initially unknown. The parameters, and hence the optimal path, can only be estimated by routing packets through the network and observing the realized delays. Our aim is to find a routing policy that minimizes the regret (the cumulative difference of expected delay) between the path chosen by the policy and the unknown optimal path. We formulate the problem as a combinatorial bandit optimization problem and consider several scenarios that differ in where routing decisions are made and in the information available when making the decisions. For each scenario, we derive a tight asymptotic lower bound on the regret that has to be satisfied by any online routing policy. These bounds help us to understand the performance improvements we can expect when (i) taking routing decisions at each hop rather than at the source only, and (ii) observing per-link delays rather than end-to-end path delays. In particular, we show that (i) is of no use while (ii) can have a spectacular impact. Three algorithms, with a trade-off between computational complexity and performance, are proposed. The regret upper bounds of these algorithms improve over those of the existing algorithms, and they significantly outperform state-of-the-art algorithms in numerical experiments.
△ Less
Submitted 18 January, 2017; v1 submitted 27 September, 2013;
originally announced September 2013.
-
Spectrum Bandit Optimization
Authors:
Marc Lelarge,
Alexandre Proutiere,
M. Sadegh Talebi
Abstract:
We consider the problem of allocating radio channels to links in a wireless network. Links interact through interference, modelled as a conflict graph (i.e., two interfering links cannot be simultaneously active on the same channel). We aim at identifying the channel allocation maximizing the total network throughput over a finite time horizon. Should we know the average radio conditions on each c…
▽ More
We consider the problem of allocating radio channels to links in a wireless network. Links interact through interference, modelled as a conflict graph (i.e., two interfering links cannot be simultaneously active on the same channel). We aim at identifying the channel allocation maximizing the total network throughput over a finite time horizon. Should we know the average radio conditions on each channel and on each link, an optimal allocation would be obtained by solving an Integer Linear Program (ILP). When radio conditions are unknown a priori, we look for a sequential channel allocation policy that converges to the optimal allocation while minimizing on the way the throughput loss or {\it regret} due to the need for exploring sub-optimal allocations. We formulate this problem as a generic linear bandit problem, and analyze it first in a stochastic setting where radio conditions are driven by a stationary stochastic process, and then in an adversarial setting where radio conditions can evolve arbitrarily. We provide new algorithms in both settings and derive upper bounds on their regrets.
△ Less
Submitted 17 February, 2015; v1 submitted 27 February, 2013;
originally announced February 2013.
-
NUM-Based Rate Allocation for Streaming Traffic via Sequential Convex Programming
Authors:
Ali Sehati,
Mohammad Sadegh Talebi,
Ahmad Khonsari
Abstract:
In recent years, there has been an increasing demand for ubiquitous streaming like applications in data networks. In this paper, we concentrate on NUM-based rate allocation for streaming applications with the so-called S-curve utility functions. Due to non-concavity of such utility functions, the underlying NUM problem would be non-convex for which dual methods might become quite useless. To tackl…
▽ More
In recent years, there has been an increasing demand for ubiquitous streaming like applications in data networks. In this paper, we concentrate on NUM-based rate allocation for streaming applications with the so-called S-curve utility functions. Due to non-concavity of such utility functions, the underlying NUM problem would be non-convex for which dual methods might become quite useless. To tackle the non-convex problem, using elementary techniques we make the utility of the network concave, however this results in reverse-convex constraints which make the problem non-convex. To deal with such a transformed NUM, we leverage Sequential Convex Programming (SCP) approach to approximate the non-convex problem by a series of convex ones. Based on this approach, we propose a distributed rate allocation algorithm and demonstrate that under mild conditions, it converges to a locally optimal solution of the original NUM. Numerical results validate the effectiveness, in terms of tractable convergence of the proposed rate allocation algorithm.
△ Less
Submitted 30 September, 2011;
originally announced September 2011.