Skip to main content

Showing 1–11 of 11 results for author: Russo, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2503.07824  [pdf, other

    stat.ML cs.LG

    Pure Exploration with Feedback Graphs

    Authors: Alessio Russo, Yichen Song, Aldo Pacchiano

    Abstract: We study the sample complexity of pure exploration in an online learning problem with a feedback graph. This graph dictates the feedback available to the learner, covering scenarios between full-information, pure bandit feedback, and settings with no feedback on the chosen action. While variants of this problem have been investigated for regret minimization, no prior work has addressed the pure ex… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

  2. arXiv:2502.02516  [pdf, other

    cs.LG cs.AI stat.ML

    Adaptive Exploration for Multi-Reward Multi-Policy Evaluation

    Authors: Alessio Russo, Aldo Pacchiano

    Abstract: We study the policy evaluation problem in an online multi-reward multi-policy discounted setting, where multiple reward functions must be evaluated simultaneously for different policies. We adopt an $(ε,δ)$-PAC perspective to achieve $ε$-accurate estimates with high confidence across finite or convex sets of rewards, a setting that has not been investigated in the literature. Building on prior wor… ▽ More

    Submitted 28 May, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: Accepted at the International Conference on Machine Learning, 2025

  3. arXiv:2501.18790  [pdf, other

    cs.LG stat.ML

    Achieving $\widetilde{\mathcal{O}}(\sqrt{T})$ Regret in Average-Reward POMDPs with Known Observation Models

    Authors: Alessio Russo, Alberto Maria Metelli, Marcello Restelli

    Abstract: We tackle average-reward infinite-horizon POMDPs with an unknown transition model but a known observation model, a setting that has been previously addressed in two limiting ways: (i) frequentist methods relying on suboptimal stochastic policies having a minimum probability of choosing each action, and (ii) Bayesian approaches employing the optimal policy class but requiring strong assumptions abo… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

  4. arXiv:2410.01331  [pdf, other

    cs.LG stat.ML

    Efficient Learning of POMDPs with Known Observation Model in Average-Reward Setting

    Authors: Alessio Russo, Alberto Maria Metelli, Marcello Restelli

    Abstract: Dealing with Partially Observable Markov Decision Processes is notably a challenging task. We face an average-reward infinite-horizon POMDP setting with an unknown transition model, where we assume the knowledge of the observation model. Under this assumption, we propose the Observation-Aware Spectral (OAS) estimation technique, which enables the POMDP parameters to be learned from samples collect… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

  5. arXiv:2211.15129  [pdf, other

    stat.ML cs.AI cs.LG

    On the Sample Complexity of Representation Learning in Multi-task Bandits with Global and Local structure

    Authors: Alessio Russo, Alexandre Proutiere

    Abstract: We investigate the sample complexity of learning the optimal arm for multi-task bandit problems. Arms consist of two components: one that is shared across tasks (that we call representation) and one that is task-specific (that we call predictor). The objective is to learn the optimal (representation, predictor)-pair for each task, under the assumption that the optimal representation is common to a… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: Accepted at the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI23)

  6. arXiv:2106.00810  [pdf

    cs.LG stat.ML

    Some Ethical Issues in the Review Process of Machine Learning Conferences

    Authors: Alessio Russo

    Abstract: Recent successes in the Machine Learning community have led to a steep increase in the number of papers submitted to conferences. This increase made more prominent some of the issues that affect the current review process used by these conferences. The review process has several issues that may undermine the nature of scientific research, which is of being fully objective, apolitical, unbiased and… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

  7. arXiv:2103.01658  [pdf, other

    eess.SY stat.OT

    Minimizing Information Leakage of Abrupt Changes in Stochastic Systems

    Authors: Alessio Russo, Alexandre Proutiere

    Abstract: This work investigates the problem of analyzing privacy of abrupt changes for general Markov processes. These processes may be affected by changes, or exogenous signals, that need to remain private. Privacy refers to the disclosure of information of these changes through observations of the underlying Markov chain. In contrast to previous work on privacy, we study the problem for an online sequenc… ▽ More

    Submitted 30 September, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

  8. arXiv:2006.07119  [pdf, other

    cs.LG stat.ML

    Learning Diverse Representations for Fast Adaptation to Distribution Shift

    Authors: Daniel Pace, Alessandra Russo, Murray Shanahan

    Abstract: The i.i.d. assumption is a useful idealization that underpins many successful approaches to supervised machine learning. However, its violation can lead to models that learn to exploit spurious correlations in the training data, rendering them vulnerable to adversarial interventions, undermining their reliability, and limiting their practical application. To mitigate this problem, we present a met… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

  9. arXiv:1911.13152  [pdf, other

    cs.LG cs.AI cs.LO stat.ML

    Induction of Subgoal Automata for Reinforcement Learning

    Authors: Daniel Furelos-Blanco, Mark Law, Alessandra Russo, Krysia Broda, Anders Jonsson

    Abstract: In this work we present ISA, a novel approach for learning and exploiting subgoals in reinforcement learning (RL). Our method relies on inducing an automaton whose transitions are subgoals expressed as propositional formulas over a set of observable events. A state-of-the-art inductive logic programming system is used to learn the automaton from observation traces perceived by the RL agent. The re… ▽ More

    Submitted 29 November, 2019; originally announced November 2019.

    Comments: Preprint accepted for publication to the 34th AAAI Conference on Artificial Intelligence (AAAI-20)

  10. arXiv:1909.07328  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Invariants through Soft Unification

    Authors: Nuri Cingillioglu, Alessandra Russo

    Abstract: Human reasoning involves recognising common underlying principles across many examples. The by-products of such reasoning are invariants that capture patterns such as "if someone went somewhere then they are there", expressed using variables "someone" and "somewhere" instead of mentioning specific people or places. Humans learn what variables are and how to use them at a young age. This paper expl… ▽ More

    Submitted 24 October, 2020; v1 submitted 16 September, 2019; originally announced September 2019.

    Comments: 23 pages

  11. arXiv:1907.13548  [pdf, other

    cs.LG cs.CR stat.ML

    Optimal Attacks on Reinforcement Learning Policies

    Authors: Alessio Russo, Alexandre Proutiere

    Abstract: Control policies, trained using the Deep Reinforcement Learning, have been recently shown to be vulnerable to adversarial attacks introducing even very small perturbations to the policy input. The attacks proposed so far have been designed using heuristics, and build on existing adversarial example crafting techniques used to dupe classifiers in supervised learning. In contrast, this paper investi… ▽ More

    Submitted 31 July, 2019; originally announced July 2019.