Showing 1–4 of 4 results for author: Tajdini, A

  1. arXiv:2506.04775  [pdf, ps, other]

    cs.LG cs.IT stat.ML

    Improved Regret Bounds for Linear Bandits with Heavy-Tailed Rewards

    Authors: Artin Tajdini, Jonathan Scarlett, Kevin Jamieson

    Abstract: We study stochastic linear bandits with heavy-tailed rewards, where the rewards have a finite $(1+\epsilon)$-absolute central moment bounded by $\upsilon$ for some $\epsilon \in (0,1]$. We improve both upper and lower bounds on the minimax regret compared to prior work. When $\upsilon = \mathcal{O}(1)$, the best prior known regret upper bound is $\tilde{\mathcal{O}}(d T^{\frac{1}{1+\epsilon}})$. While a lower with the…

    Submitted 5 June, 2025; originally announced June 2025.

  2. arXiv:2410.07533  [pdf, other]

    cs.LG stat.ML

    Corruption-Robust Linear Bandits: Minimax Optimality and Gap-Dependent Misspecification

    Authors: Haolin Liu, Artin Tajdini, Andrew Wagenmaker, Chen-Yu Wei

    Abstract: In linear bandits, how can a learner effectively learn when facing corrupted rewards? While significant work has explored this question, a holistic understanding across different adversarial models and corruption measures is lacking, as is a full characterization of the minimax regret bounds. In this work, we compare two types of corruptions commonly considered: strong corruption, where the corrup…

    Submitted 17 October, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: NeurIPS 2024

  3. arXiv:2310.18465  [pdf, other]

    cs.LG stat.ML

    Nearly Minimax Optimal Submodular Maximization with Bandit Feedback

    Authors: Artin Tajdini, Lalit Jain, Kevin Jamieson

    Abstract: We consider maximizing an unknown monotonic, submodular set function $f: 2^{[n]} \rightarrow [0,1]$ with cardinality constraint under stochastic bandit feedback. At each time $t=1,\dots,T$ the learner chooses a set $S_t \subset [n]$ with $|S_t| \leq k$ and receives reward $f(S_t) + \eta_t$ where $\eta_t$ is mean-zero sub-Gaussian noise. The objective is to minimize the learner's regret with respect to a…

    Submitted 12 December, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

  4. arXiv:2308.03296  [pdf, other]

    cs.LG cs.CL stat.ML

    Studying Large Language Model Generalization with Influence Functions

    Authors: Roger Grosse, Juhan Bae, Cem Anil, Nelson Elhage, Alex Tamkin, Amirhossein Tajdini, Benoit Steiner, Dustin Li, Esin Durmus, Ethan Perez, Evan Hubinger, Kamilė Lukošiūtė, Karina Nguyen, Nicholas Joseph, Sam McCandlish, Jared Kaplan, Samuel R. Bowman

    Abstract: When trying to gain better visibility into a machine learning model in order to understand and mitigate the associated risks, a potentially valuable source of evidence is: which training examples most contribute to a given behavior? Influence functions aim to answer a counterfactual: how would the model's parameters (and hence its outputs) change if a given sequence were added to the training set?…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 119 pages, 47 figures, 22 tables