Skip to main content

Showing 1–25 of 25 results for author: Gast, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.06307  [pdf, ps, other

    math.OC cs.LG math.PR stat.ML

    Model Predictive Control is Almost Optimal for Restless Bandit

    Authors: Nicolas Gast, Dheeraj Narasimha

    Abstract: We consider the discrete time infinite horizon average reward restless markovian bandit (RMAB) problem. We propose a \emph{model predictive control} based non-stationary policy with a rolling computational horizon $τ$. At each time-slot, this policy solves a $τ$ horizon linear program whose first control value is kept as a control for the RMAB. Our solution requires minimal assumptions and quantif… ▽ More

    Submitted 5 June, 2025; v1 submitted 8 October, 2024; originally announced October 2024.

    Comments: Reviewed and accepted to COLT 2025

  2. arXiv:2408.07616  [pdf, other

    cs.DS cs.GT math.OC

    Prophet Inequalities: Competing with the Top $\ell$ Items is Easy

    Authors: Mathieu Molina, Nicolas Gast, Patrick Loiseau, Vianney Perchet

    Abstract: We explore a prophet inequality problem, where the values of a sequence of items are drawn i.i.d. from some distribution, and an online decision maker must select one item irrevocably. We establish that $\mathrm{CR}_{\ell}$ the worst-case competitive ratio between the expected optimal performance of an online decision maker compared to that of a prophet who uses the average of the top $\ell$ items… ▽ More

    Submitted 10 January, 2025; v1 submitted 14 August, 2024; originally announced August 2024.

  3. arXiv:2405.14285  [pdf, other

    stat.ML cs.LG math.OC

    Computing the Bias of Constant-step Stochastic Approximation with Markovian Noise

    Authors: Sebastian Allmeier, Nicolas Gast

    Abstract: We study stochastic approximation algorithms with Markovian noise and constant step-size $α$. We develop a method based on infinitesimal generator comparisons to study the bias of the algorithm, which is the expected difference between $θ_n$ -- the value at iteration $n$ -- and $θ^*$ -- the unique equilibrium of the corresponding ODE. We show that, under some smoothness conditions, this bias is of… ▽ More

    Submitted 25 October, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 24 pages. Accepted at NeurIPS 2024

  4. Approximations to Study the Impact of the Service Discipline in Systems with Redundancy

    Authors: Nicolas Gast, Benny van Houdt

    Abstract: As job redundancy has been recognized as an effective means to improve performance of large-scale computer systems, queueing systems with redundancy have been studied by various authors. Existing results include methods to compute the queue length distribution and response time but only when the service discipline is First-Come-First-Served (FCFS). For other service disciplines, such as Processor… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Journal ref: Proceedings of the ACM on Measurement and Analysis of Computing Systems , 2024, 8 (1)

  5. arXiv:2306.13440  [pdf, other

    cs.LG

    Trading-off price for data quality to achieve fair online allocation

    Authors: Mathieu Molina, Nicolas Gast, Patrick Loiseau, Vianney Perchet

    Abstract: We consider the problem of online allocation subject to a long-term fairness penalty. Contrary to existing works, however, we do not assume that the decision-maker observes the protected attributes -- which is often unrealistic in practice. Instead they can purchase data that help estimate them from sources of different quality; and hence reduce the fairness penalty at some cost. We model this pro… ▽ More

    Submitted 4 December, 2023; v1 submitted 23 June, 2023; originally announced June 2023.

  6. arXiv:2301.05630  [pdf, ps, other

    cs.LG cs.AI cs.GT cs.MA

    Decentralized model-free reinforcement learning in stochastic games with average-reward objective

    Authors: Romain Cravic, Nicolas Gast, Bruno Gaujal

    Abstract: We propose the first model-free algorithm that achieves low regret performance for decentralized learning in two-player zero-sum tabular stochastic games with infinite-horizon average-reward objective. In decentralized learning, the learning agent controls only one player and tries to achieve low regret performances against an arbitrary opponent. This contrasts with centralized learning where the… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: Accepted to the 22th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2023). This version is the full version with proofs in appendix

  7. arXiv:2211.11382  [pdf, other

    math.PR cs.PF

    Bias and Refinement of Multiscale Mean Field Models

    Authors: Sebastian Allmeier, Nicolas Gast

    Abstract: Mean field approximation is a powerful technique which has been used in many settings to study large-scale stochastic systems. In the case of two-timescale systems, the approximation is obtained by a combination of scaling arguments and the use of the averaging principle. This paper analyzes the approximation error of this `average' mean field model for a two-timescale model… ▽ More

    Submitted 23 January, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: 28 pages; Accepted at ACM Sigmetrics 2023

  8. Fairness in Selection Problems with Strategic Candidates

    Authors: Vitalii Emelianov, Nicolas Gast, Patrick Loiseau

    Abstract: To better understand discriminations and the effect of affirmative actions in selection problems (e.g., college admission or hiring), a recent line of research proposed a model based on differential variance. This model assumes that the decision-maker has a noisy estimate of each candidate's quality and puts forward the difference in the noise variances between different demographic groups as a ke… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: Accepted for publication in the proceedings of the Twenty-Third ACM Conference on Economics and Computation (EC'22)

  9. Testing Indexability and Computing Whittle and Gittins Index in Subcubic Time

    Authors: Nicolas Gast, Bruno Gaujal, Kimang Khun

    Abstract: Whittle index is a generalization of Gittins index that provides very efficient allocation rules for restless multi-armed bandits. In this work, we develop an algorithm to test the indexability and compute the Whittle indices of any finite-state restless bandit arm. This algorithm works in the discounted and non-discounted cases, and can compute Gittins index. Our algorithm builds on three tools:… ▽ More

    Submitted 22 June, 2023; v1 submitted 10 March, 2022; originally announced March 2022.

    Comments: Mathematical Methods of Operations Research, 2023

  10. On Fair Selection in the Presence of Implicit and Differential Variance

    Authors: Vitalii Emelianov, Nicolas Gast, Krishna P. Gummadi, Patrick Loiseau

    Abstract: Discrimination in selection problems such as hiring or college admission is often explained by implicit bias from the decision maker against disadvantaged demographic groups. In this paper, we consider a model where the decision maker receives a noisy estimate of each candidate's quality, whose variance depends on the candidate's group -- we argue that such differential variance is a key feature o… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: Accepted for publication in the Artificial Intelligence Journal. This paper is an extended version of our paper arXiv:2006.13699: we added a Bayesian-optimal baseline (in addition to the group-oblivious baseline) and generalized the model by assuming a group-dependent distribution of quality and an implicit bias; but we removed the part on two-stage selection

  11. arXiv:2111.01594  [pdf, other

    cs.PF math.PR

    Mean Field and Refined Mean Field Approximations for Heterogeneous Systems: It Works!

    Authors: Sebastian Allmeier, Nicolas Gast

    Abstract: Mean field approximation is a powerful technique to study the performance of large stochastic systems represented as $n$ interacting objects. Applications include load balancing models, epidemic spreading, cache replacement policies, or large-scale data centers. Mean field approximation is asymptotically exact for systems composed of $n$ homogeneous objects under mild conditions. In this paper, we… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: 42 pages

  12. arXiv:2106.14636  [pdf, other

    cs.GT

    Asymptotic Degradation of Linear Regression Estimates With Strategic Data Sources

    Authors: Benjamin Roussillon, Nicolas Gast, Patrick Loiseau, Panayotis Mertikopoulos

    Abstract: We consider the problem of linear regression from strategic data sources with a public good component, i.e., when data is provided by strategic agents who seek to minimize an individual provision cost for increasing their data's precision while benefiting from the model's overall precision. In contrast to previous works, our model tackles the case where there is uncertainty on the attributes chara… ▽ More

    Submitted 11 March, 2022; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: 31 pages, 7 figures

  13. arXiv:2106.08771  [pdf, other

    cs.LG cs.AI

    Reinforcement Learning for Markovian Bandits: Is Posterior Sampling more Scalable than Optimism?

    Authors: Nicolas Gast, Bruno Gaujal, Kimang Khun

    Abstract: We study learning algorithms for the classical Markovian bandit problem with discount. We explain how to adapt PSRL [24] and UCRL2 [2] to exploit the problem structure. These variants are called MB-PSRL and MB-UCRL2. While the regret bound and runtime of vanilla implementations of PSRL and UCRL2 are exponential in the number of bandits, we show that the episodic regret of MB-PSRL and MB-UCRL2 is… ▽ More

    Submitted 3 May, 2022; v1 submitted 16 June, 2021; originally announced June 2021.

  14. arXiv:2012.09064  [pdf, other

    cs.PF math.OC math.PR

    Exponential Convergence Rate for the Asymptotic Optimality of Whittle Index Policy

    Authors: Nicolas Gast, Bruno Gaujal, Chen Yan

    Abstract: We evaluate the performance of Whittle index policy for restless Markovian bandits, when the number of bandits grows. It is proven in [30] that this performance is asymptotically optimal if the bandits are indexable and the associated deterministic system has a global attractor fixed point. In this paper we show that, under the same conditions, the convergence rate is exponential in the number of… ▽ More

    Submitted 16 December, 2020; originally announced December 2020.

  15. arXiv:2006.13699  [pdf, other

    cs.CY cs.LG stat.ML

    On Fair Selection in the Presence of Implicit Variance

    Authors: Vitalii Emelianov, Nicolas Gast, Krishna P. Gummadi, Patrick Loiseau

    Abstract: Quota-based fairness mechanisms like the so-called Rooney rule or four-fifths rule are used in selection problems such as hiring or college admission to reduce inequalities based on sensitive demographic attributes. These mechanisms are often viewed as introducing a trade-off between selection fairness and utility. In recent work, however, Kleinberg and Raghavan showed that, in the presence of imp… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

    Comments: 27 pages, 10 figures, Economics and Computation (EC'20)

  16. arXiv:2004.07519  [pdf, other

    cs.PF

    Refined Mean Field Analysis of the Gossip Shuffle Protocol -- extended version --

    Authors: Nicolas Gast, Diego Latella, Mieke Massink

    Abstract: Gossip protocols form the basis of many smart collective adaptive systems. They are a class of fully decentralised, simple but robust protocols for the distribution of information throughout large scale networks with hundreds or thousands of nodes. Mean field analysis methods have made it possible to approximate and analyse performance aspects of such large scale protocols in an efficient way. Tak… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: This paper is an extended version of a short paper accepted for the LNCS proceedings of COORDINATION 2020

  17. The Price of Local Fairness in Multistage Selection

    Authors: Vitalii Emelianov, George Arvanitakis, Nicolas Gast, Krishna Gummadi, Patrick Loiseau

    Abstract: The rise of algorithmic decision making led to active researches on how to define and guarantee fairness, mostly focusing on one-shot decision making. In several important applications such as hiring, however, decisions are made in multiple stage with additional information at each stage. In such cases, fairness issues remain poorly understood. In this paper we study fairness in $k$-stage select… ▽ More

    Submitted 15 June, 2019; originally announced June 2019.

    Comments: 13 pages, 16 figures

  18. arXiv:1807.08585  [pdf, other

    cs.PF eess.SY math.DS

    A refined mean field approximation of synchronous discrete-time population models

    Authors: Nicolas Gast, Diego Latella, Mieke Massink

    Abstract: Mean field approximation is a popular method to study the behaviour of stochastic models composed of a large number of interacting objects. When the objects are asynchronous, the mean field approximation of a population model can be expressed as an ordinary differential equation. When the objects are (clock-) synchronous the mean field approximation is a discrete time dynamical system. We focus on… ▽ More

    Submitted 20 July, 2018; originally announced July 2018.

    Journal ref: Performance Evaluation, Elsevier, 2018

  19. arXiv:1805.00857  [pdf, other

    cs.DC

    A new analysis of Work Stealing with latency

    Authors: Nicolas Gast, Mohammed Khatiri, Denis Trystram, Frederic Wagner

    Abstract: We study in this paper the impact of communication latency on the classical Work Stealing load balancing algorithm. Our paper extends the reference model in which we introduce a latency parameter. By using a theoretical analysis and simulation, we study the overall impact of this latency on the Makespan (maximum completion time). We derive a new expression of the expected running time of a bag of… ▽ More

    Submitted 2 May, 2018; originally announced May 2018.

  20. arXiv:1309.7824  [pdf, other

    cs.GT cs.LG math.ST

    Linear Regression from Strategic Data Sources

    Authors: Nicolas Gast, Stratis Ioannidis, Patrick Loiseau, Benjamin Roussillon

    Abstract: Linear regression is a fundamental building block of statistical data analysis. It amounts to estimating the parameters of a linear model that maps input features to corresponding outputs. In the classical setting where the precision of each data point is fixed, the famous Aitken/Gauss-Markov theorem in statistics states that generalized least squares (GLS) is a so-called "Best Linear Unbiased Est… ▽ More

    Submitted 12 December, 2019; v1 submitted 30 September, 2013; originally announced September 2013.

    Comments: This version (v3) extends the results on the sub-optimality of GLS (Section 6) and improves writing in multiple places compared to v2. Compared to the initial version v1, it also fixes an error in Theorem 6 (now Theorem 5), and extended many of the results

  21. arXiv:1107.3734  [pdf, other

    cs.DC

    Decentralized List Scheduling

    Authors: Marc Tchiboukdjian, Nicolas Gast, Denis Trystram

    Abstract: Classical list scheduling is a very popular and efficient technique for scheduling jobs in parallel and distributed platforms. It is inherently centralized. However, with the increasing number of processors, the cost for managing a single centralized list becomes too prohibitive. A suitable approach to reduce the contention is to distribute the list among the computational units: each processor ha… ▽ More

    Submitted 19 July, 2011; originally announced July 2011.

  22. arXiv:1107.3385  [pdf, other

    math.PR cs.DM math.CO

    Computing hitting times via fluid approximation: application to the coupon collector problem

    Authors: Nicolas Gast

    Abstract: In this paper, we show how to use stochastic approximation to compute hitting time of a stochastic process, based on the study of the time for a fluid approximation of this process to be at distance 1/N of its fixed point. This approach is developed to study a generalized version of the coupon collector problem. The system is composed by N independent identical Markov chains. At each time step,… ▽ More

    Submitted 18 July, 2011; originally announced July 2011.

  23. arXiv:1004.2342  [pdf, other

    cs.AI cs.PF eess.SY math.OC math.PR

    Mean field for Markov Decision Processes: from Discrete to Continuous Optimization

    Authors: Nicolas Gast, Bruno Gaujal, Jean-Yves Le Boudec

    Abstract: We study the convergence of Markov Decision Processes made of a large number of objects to optimization problems on ordinary differential equations (ODE). We show that the optimal reward of such a Markov Decision Process, satisfying a Bellman equation, converges to the solution of a continuous Hamilton-Jacobi-Bellman (HJB) equation based on the mean field approximation of the Markov Decision Proce… ▽ More

    Submitted 19 May, 2011; v1 submitted 14 April, 2010; originally announced April 2010.

    Report number: RR-7239, RR-7239

  24. arXiv:0903.2352  [pdf, ps, other

    math.PR cs.NI cs.PF

    A Mean Field Approach for Optimization in Particles Systems and Applications

    Authors: Nicolas Gast, Bruno Gaujal

    Abstract: This paper investigates the limit behavior of Markov Decision Processes (MDPs) made of independent particles evolving in a common environment, when the number of particles goes to infinity. In the finite horizon case or with a discounted cost and an infinite horizon, we show that when the number of particles becomes large, the optimal cost of the system converges almost surely to the optimal cos… ▽ More

    Submitted 10 June, 2009; v1 submitted 13 March, 2009; originally announced March 2009.

    Report number: RR-6877

  25. arXiv:0809.1989  [pdf, ps, other

    cs.DM

    Distributing Labels on Infinite Trees

    Authors: Nicolas Gast, Bruno Gaujal

    Abstract: Sturmian words are infinite binary words with many equivalent definitions: They have a minimal factor complexity among all aperiodic sequences; they are balanced sequences (the labels 0 and 1 are as evenly distributed as possible) and they can be constructed using a mechanical definition. All this properties make them good candidates for being extremal points in scheduling problems over two proc… ▽ More

    Submitted 11 September, 2008; originally announced September 2008.

    Comments: 30 pages, use pgf/tikz

    ACM Class: G.2.2