Skip to main content

Showing 1–50 of 65 results for author: Tan, V

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.15141  [pdf, ps, other

    cs.LG cs.AI stat.ML

    BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms

    Authors: Yunlong Hou, Fengzhuo Zhang, Cunxiao Du, Xuan Zhang, Jiachun Pan, Tianyu Pang, Chao Du, Vincent Y. F. Tan, Zhuoran Yang

    Abstract: Speculative decoding has emerged as a popular method to accelerate the inference of Large Language Models (LLMs) while retaining their superior text generation performance. Previous methods either adopt a fixed speculative decoding configuration regardless of the prefix tokens, or train draft models in an offline or online manner to align them with the context. This paper proposes a training-free… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 35 pages, 4 figures

  2. arXiv:2501.13607  [pdf, other

    cs.LG cs.AI cs.IT stat.ML

    Optimal Multi-Objective Best Arm Identification with Fixed Confidence

    Authors: Zhirui Chen, P. N. Karthik, Yeow Meng Chee, Vincent Y. F. Tan

    Abstract: We consider a multi-armed bandit setting with finitely many arms, in which each arm yields an $M$-dimensional vector reward upon selection. We assume that the reward of each dimension (a.k.a. {\em objective}) is generated independently of the others. The best arm of any given objective is the arm with the largest component of mean corresponding to the objective. The end goal is to identify the bes… ▽ More

    Submitted 23 January, 2025; originally announced January 2025.

    Comments: Accepted to AISTATS 2025

  3. arXiv:2410.07638  [pdf, other

    cs.LG cs.AI cs.IT stat.ML

    Almost Minimax Optimal Best Arm Identification in Piecewise Stationary Linear Bandits

    Authors: Yunlong Hou, Vincent Y. F. Tan, Zixin Zhong

    Abstract: We propose a {\em novel} piecewise stationary linear bandit (PSLB) model, where the environment randomly samples a context from an unknown probability distribution at each changepoint, and the quality of an arm is measured by its return averaged over all contexts. The contexts and their distribution, as well as the changepoints are unknown to the agent. We design {\em Piecewise-Stationary… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: 69 pages. Accepted to NeurIPS 2024

  4. arXiv:2410.05856  [pdf, other

    stat.ML cs.LG

    Stochastic Bandits for Egalitarian Assignment

    Authors: Eugene Lim, Vincent Y. F. Tan, Harold Soh

    Abstract: We study EgalMAB, an egalitarian assignment problem in the context of stochastic multi-armed bandits. In EgalMAB, an agent is tasked with assigning a set of users to arms. At each time step, the agent must assign exactly one arm to each user such that no two users are assigned to the same arm. Subsequently, each user obtains a reward drawn from the unknown reward distribution associated with its a… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

  5. arXiv:2409.18909  [pdf, ps, other

    cs.LG cs.IT stat.ML

    Best Arm Identification with Minimal Regret

    Authors: Junwen Yang, Vincent Y. F. Tan, Tianyuan Jin

    Abstract: Motivated by real-world applications that necessitate responsible experimentation, we introduce the problem of best arm identification (BAI) with minimal regret. This innovative variant of the multi-armed bandit problem elegantly amalgamates two of its most ubiquitous objectives: regret minimization and BAI. More precisely, the agent's goal is to identify the best arm with a prescribed confidence… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

    Comments: Preprint

  6. arXiv:2409.05072  [pdf, other

    cs.LG cs.IT stat.ML

    A General Framework for Clustering and Distribution Matching with Bandit Feedback

    Authors: Recep Can Yavas, Yuqi Huang, Vincent Y. F. Tan, Jonathan Scarlett

    Abstract: We develop a general framework for clustering and distribution matching problems with bandit feedback. We consider a $K$-armed bandit model where some subset of $K$ arms is partitioned into $M$ groups. Within each group, the random variable associated to each arm follows the same distribution on a finite alphabet. At each time step, the decision maker pulls an arm and observes its outcome from the… ▽ More

    Submitted 9 January, 2025; v1 submitted 8 September, 2024; originally announced September 2024.

    Comments: 24 pages

    MSC Class: 68T05 ACM Class: I.2.6

  7. arXiv:2406.12205  [pdf, other

    cs.LG cs.AI cs.IT math.ST stat.ML

    Order-Optimal Instance-Dependent Bounds for Offline Reinforcement Learning with Preference Feedback

    Authors: Zhirui Chen, Vincent Y. F. Tan

    Abstract: We consider offline reinforcement learning (RL) with preference feedback in which the implicit reward is a linear function of an unknown parameter. Given an offline dataset, our objective consists in ascertaining the optimal action for each state, with the ultimate goal of minimizing the {\em simple regret}. We propose an algorithm, \underline{RL} with \underline{L}ocally \underline{O}ptimal \unde… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted to Models of Human Feedback for AI Alignment Workshop, ICML 2024

  8. Adversarial Combinatorial Bandits with Switching Costs

    Authors: Yanyan Dong, Vincent Y. F. Tan

    Abstract: We study the problem of adversarial combinatorial bandit with a switching cost $λ$ for a switch of each selected arm in each round, considering both the bandit feedback and semi-bandit feedback settings. In the oblivious adversarial case with $K$ base arms and time horizon $T$, we derive lower bounds for the minimax regret and design algorithms to approach them. To prove these lower bounds, we des… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: The work has been accepted in IEEE Transactions on Information Theory. https://ieeexplore.ieee.org/document/10487974

  9. arXiv:2402.15127  [pdf, other

    cs.LG cs.IT stat.ML

    Multi-Armed Bandits with Abstention

    Authors: Junwen Yang, Tianyuan Jin, Vincent Y. F. Tan

    Abstract: We introduce a novel extension of the canonical multi-armed bandit problem that incorporates an additional strategic element: abstention. In this enhanced framework, the agent is not only tasked with selecting an arm at each time step, but also has the option to abstain from accepting the stochastic instantaneous reward before observing it. When opting for abstention, the agent either suffers a fi… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: Preprint

  10. arXiv:2401.09073  [pdf, other

    cs.LG cs.AI cs.IT math.ST stat.ML

    Fixed-Budget Differentially Private Best Arm Identification

    Authors: Zhirui Chen, P. N. Karthik, Yeow Meng Chee, Vincent Y. F. Tan

    Abstract: We study best arm identification (BAI) in linear bandits in the fixed-budget regime under differential privacy constraints, when the arm rewards are supported on the unit interval. Given a finite budget $T$ and a privacy parameter $\varepsilon>0$, the goal is to minimise the error probability in finding the arm with the largest mean after $T$ sampling rounds, subject to the constraint that the pol… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted to ICLR 2024

  11. arXiv:2311.00481  [pdf, ps, other

    cs.LG stat.ML

    Fixed-Budget Best-Arm Identification in Sparse Linear Bandits

    Authors: Recep Can Yavas, Vincent Y. F. Tan

    Abstract: We study the best-arm identification problem in sparse linear bandits under the fixed-budget setting. In sparse linear bandits, the unknown feature vector $θ^*$ may be of large dimension $d$, but only a few, say $s \ll d$ of these features have non-zero values. We design a two-phase algorithm, Lasso and Optimal-Design- (Lasso-OD) based linear best-arm identification. The first phase of Lasso-OD le… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 28 pages, Submitted to TMLR

    ACM Class: I.2.6

  12. arXiv:2310.17531  [pdf, ps, other

    cs.GT cs.LG stat.ML

    Learning Regularized Graphon Mean-Field Games with Unknown Graphons

    Authors: Fengzhuo Zhang, Vincent Y. F. Tan, Zhaoran Wang, Zhuoran Yang

    Abstract: We design and analyze reinforcement learning algorithms for Graphon Mean-Field Games (GMFGs). In contrast to previous works that require the precise values of the graphons, we aim to learn the Nash Equilibrium (NE) of the regularized GMFGs when the graphons are unknown. Our contributions are threefold. First, we propose the Proximal Policy Optimization for GMFG (GMFG-PPO) algorithm and show that i… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

  13. arXiv:2310.13550  [pdf, other

    cs.LG stat.ML

    Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes

    Authors: Ruiquan Huang, Yuan Cheng, Jing Yang, Vincent Tan, Yingbin Liang

    Abstract: In multi-task reinforcement learning (RL) under Markov decision processes (MDPs), the presence of shared latent structures among multiple MDPs has been shown to yield significant benefits to the sample efficiency compared to single-task RL. In this paper, we investigate whether such a benefit can extend to more general sequential decision making problems, such as partially observable MDPs (POMDPs)… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  14. arXiv:2310.13393  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Optimal Best Arm Identification with Fixed Confidence in Restless Bandits

    Authors: P. N. Karthik, Vincent Y. F. Tan, Arpan Mukherjee, Ali Tajer

    Abstract: We study best arm identification in a restless multi-armed bandit setting with finitely many arms. The discrete-time data generated by each arm forms a homogeneous Markov chain taking values in a common, finite state space. The state transitions in each arm are captured by an ergodic transition probability matrix (TPM) that is a member of a single-parameter exponential family of TPMs. The real-val… ▽ More

    Submitted 23 June, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted to the IEEE Transactions on Information Theory

  15. arXiv:2310.08089  [pdf, other

    cs.GT eess.SY stat.ML

    Learning Regularized Monotone Graphon Mean-Field Games

    Authors: Fengzhuo Zhang, Vincent Y. F. Tan, Zhaoran Wang, Zhuoran Yang

    Abstract: This paper studies two fundamental problems in regularized Graphon Mean-Field Games (GMFGs). First, we establish the existence of a Nash Equilibrium (NE) of any $λ$-regularized GMFG (for $λ\geq 0$). This result relies on weaker conditions than those in previous works for analyzing both unregularized GMFGs ($λ=0$) and $λ$-regularized MFGs, which are special cases of GMFGs. Second, we propose provab… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  16. arXiv:2304.12680  [pdf, ps, other

    cs.LG cs.IT stat.ML

    Communication-Constrained Bandits under Additive Gaussian Noise

    Authors: Prathamesh Mayekar, Jonathan Scarlett, Vincent Y. F. Tan

    Abstract: We study a distributed stochastic multi-armed bandit where a client supplies the learner with communication-constrained feedback based on the rewards for the corresponding arm pulls. In our setup, the client must encode the rewards such that the second moment of the encoded rewards is no more than $P$, and this encoded reward is further corrupted by additive Gaussian noise of variance $σ^2$; the l… ▽ More

    Submitted 6 June, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

  17. arXiv:2301.13393  [pdf, other

    cs.LG cs.AI cs.IT stat.ML

    Probably Anytime-Safe Stochastic Combinatorial Semi-Bandits

    Authors: Yunlong Hou, Vincent Y. F. Tan, Zixin Zhong

    Abstract: Motivated by concerns about making online decisions that incur undue amount of risk at each time step, in this paper, we formulate the probably anytime-safe stochastic combinatorial semi-bandits problem. In this problem, the agent is given the option to select a subset of size at most $K$ from a set of $L$ ground items. Each item is associated to a certain mean reward as well as a variance that re… ▽ More

    Submitted 2 June, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: To be presented at ICML 2023. 57 pages, 6 figures

  18. arXiv:2209.09845  [pdf, other

    cs.LG cs.MA stat.ML

    Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL

    Authors: Fengzhuo Zhang, Boyi Liu, Kaixin Wang, Vincent Y. F. Tan, Zhuoran Yang, Zhaoran Wang

    Abstract: The cooperative Multi-A gent R einforcement Learning (MARL) with permutation invariant agents framework has achieved tremendous empirical successes in real-world applications. Unfortunately, the theoretical understanding of this MARL problem is lacking due to the curse of many agents and the limited exploration of the relational reasoning in existing works. In this paper, we verify that the transf… ▽ More

    Submitted 16 October, 2022; v1 submitted 20 September, 2022; originally announced September 2022.

  19. arXiv:2208.09215  [pdf, other

    cs.LG cs.IT math.ST stat.ML

    Almost Cost-Free Communication in Federated Best Arm Identification

    Authors: Kota Srinivas Reddy, P. N. Karthik, Vincent Y. F. Tan

    Abstract: We study the problem of best arm identification in a federated learning multi-armed bandit setup with a central server and multiple clients. Each client is associated with a multi-armed bandit in which each arm yields {\em i.i.d.}\ rewards following a Gaussian distribution with an unknown mean and known variance. The set of arms is assumed to be the same at all the clients. We define two notions o… ▽ More

    Submitted 19 December, 2022; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: Accepted to AAAI 2023

  20. arXiv:2205.05843  [pdf, ps, other

    stat.ML cs.IT cs.LG

    A Survey of Risk-Aware Multi-Armed Bandits

    Authors: Vincent Y. F. Tan, Prashanth L. A., Krishna Jagannathan

    Abstract: In several applications such as clinical trials and financial portfolio optimization, the expected value (or the average reward) does not satisfactorily capture the merits of a drug or a portfolio. In such applications, risk plays a crucial role, and a risk-aware performance measure is preferable, so as to capture losses in the case of adverse events. This survey aims to consolidate and summarise… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: 11 pages; Unabridged version of a a survey paper of the same title accepted to IJCAI-ECAI, 2022

  21. arXiv:2203.15236  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Best Arm Identification in Restless Markov Multi-Armed Bandits

    Authors: P. N. Karthik, Kota Srinivas Reddy, Vincent Y. F. Tan

    Abstract: We study the problem of identifying the best arm in a multi-armed bandit environment when each arm is a time-homogeneous and ergodic discrete-time Markov process on a common, finite state space. The state evolution on each arm is governed by the arm's transition probability matrix (TPM). A decision entity that knows the set of arm TPMs but not the exact mapping of the TPMs to the arms, wishes to f… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: 41 pages

  22. arXiv:2202.04294  [pdf, other

    cs.LG cs.IT stat.ML

    Optimal Clustering with Bandit Feedback

    Authors: Junwen Yang, Zixin Zhong, Vincent Y. F. Tan

    Abstract: This paper considers the problem of online clustering with bandit feedback. A set of arms (or items) can be partitioned into various groups that are unknown. Within each group, the observations associated to each of the arms follow the same distribution with the same mean vector. At each time step, the agent queries or pulls an arm and obtains an independent observation from the distribution it is… ▽ More

    Submitted 15 May, 2024; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: 54 pages, 4 figures

  23. arXiv:2201.10142  [pdf, other

    cs.LG cs.AI cs.IT stat.ML

    Almost Optimal Variance-Constrained Best Arm Identification

    Authors: Yunlong Hou, Vincent Y. F. Tan, Zixin Zhong

    Abstract: We design and analyze VA-LUCB, a parameter-free algorithm, for identifying the best arm under the fixed-confidence setup and under a stringent constraint that the variance of the chosen arm is strictly smaller than a given threshold. An upper bound on VA-LUCB's sample complexity is shown to be characterized by a fundamental variance-aware hardness quantity $H_{VA}$. By proving a lower bound, we sh… ▽ More

    Submitted 14 November, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: 32 pages, 15 figures

  24. arXiv:2110.14341  [pdf, ps, other

    cs.LG stat.ML

    Active-LATHE: An Active Learning Algorithm for Boosting the Error Exponent for Learning Homogeneous Ising Trees

    Authors: Fengzhuo Zhang, Anshoo Tandon, Vincent Y. F. Tan

    Abstract: The Chow-Liu algorithm (IEEE Trans.~Inform.~Theory, 1968) has been a mainstay for the learning of tree-structured graphical models from i.i.d.\ sampled data vectors. Its theoretical properties have been well-studied and are well-understood. In this paper, we focus on the class of trees that are arguably even more fundamental, namely {\em homogeneous} trees in which each pair of nodes that forms an… ▽ More

    Submitted 28 October, 2021; v1 submitted 27 October, 2021; originally announced October 2021.

  25. arXiv:2110.08627  [pdf, other

    cs.LG cs.AI cs.IT stat.ML

    Achieving the Pareto Frontier of Regret Minimization and Best Arm Identification in Multi-Armed Bandits

    Authors: Zixin Zhong, Wang Chi Cheung, Vincent Y. F. Tan

    Abstract: We study the Pareto frontier of two archetypal objectives in multi-armed bandits, namely, regret minimization (RM) and best arm identification (BAI) with a fixed horizon. It is folklore that the balance between exploitation and exploration is crucial for both RM and BAI, but exploration is more critical in achieving the optimal performance for the latter objective. To this end, we design and analy… ▽ More

    Submitted 9 June, 2023; v1 submitted 16 October, 2021; originally announced October 2021.

    Comments: 43 pages, 10 figures

  26. arXiv:2108.11345  [pdf, ps, other

    cs.LG cs.IT stat.ML

    A Unifying Theory of Thompson Sampling for Continuous Risk-Averse Bandits

    Authors: Joel Q. L. Chang, Vincent Y. F. Tan

    Abstract: This paper unifies the design and the analysis of risk-averse Thompson sampling algorithms for the multi-armed bandit problem for a class of risk functionals $ρ$ that are continuous and dominant. We prove generalised concentration bounds for these continuous and dominant risk functionals and show that a wide class of popular risk functionals belong to this class. Using our newly developed analytic… ▽ More

    Submitted 17 April, 2022; v1 submitted 25 August, 2021; originally announced August 2021.

    Comments: Accepted to the Association for the Advancement of Artificial Intelligence (AAAI) 2022

  27. arXiv:2106.00885  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Robustifying Algorithms of Learning Latent Trees with Vector Variables

    Authors: Fengzhuo Zhang, Vincent Y. F. Tan

    Abstract: We consider learning the structures of Gaussian latent tree models with vector observations when a subset of them are arbitrarily corrupted. First, we present the sample complexities of Recursive Grouping (RG) and Chow-Liu Recursive Grouping (CLRG) without the assumption that the effective depth is bounded in the number of observed nodes, significantly generalizing the results in Choi et al. (2011… ▽ More

    Submitted 25 October, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

  28. arXiv:2105.04770  [pdf, other

    cs.IT cs.LG eess.SP math.ST stat.ML

    Exact Recovery in the General Hypergraph Stochastic Block Model

    Authors: Qiaosheng Zhang, Vincent Y. F. Tan

    Abstract: This paper investigates fundamental limits of exact recovery in the general d-uniform hypergraph stochastic block model (d-HSBM), wherein n nodes are partitioned into k disjoint communities with relative sizes (p1,..., pk). Each subset of nodes with cardinality d is generated independently as an order-d hyperedge with a certain probability that depends on the ground-truth communities that the d no… ▽ More

    Submitted 9 September, 2022; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: Accepted by IEEE Transactions on Information Theory

  29. arXiv:2101.08917  [pdf, other

    stat.ML cs.IT cs.LG

    SGA: A Robust Algorithm for Partial Recovery of Tree-Structured Graphical Models with Noisy Samples

    Authors: Anshoo Tandon, Aldric H. J. Yuan, Vincent Y. F. Tan

    Abstract: We consider learning Ising tree models when the observations from the nodes are corrupted by independent but non-identically distributed noise with unknown statistics. Katiyar et al. (2020) showed that although the exact tree structure cannot be recovered, one can recover a partial tree structure; that is, a structure belonging to the equivalence class containing the true tree. This paper presents… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

    Comments: 23 pages, 14 figures

  30. arXiv:2101.07426  [pdf, other

    stat.AP

    An Interpretable Intensive Care Unit Mortality Risk Calculator

    Authors: Eugene T. Y. Ang, Milashini Nambiar, Yong Sheng Soh, Vincent Y. F. Tan

    Abstract: Mortality risk is a major concern to patients have just been discharged from the intensive care unit (ICU). Many studies have been directed to construct machine learning models to predict such risk. Although these models are highly accurate, they are less amenable to interpretation and clinicians are typically unable to gain further insights into the patients' health conditions and the underlying… ▽ More

    Submitted 18 January, 2021; originally announced January 2021.

    Comments: 7 pages, 5 figures

  31. arXiv:2012.05757  [pdf, other

    stat.ML cs.LG q-fin.RM

    Estimation of Large Financial Covariances: A Cross-Validation Approach

    Authors: Vincent Tan, Stefan Zohren

    Abstract: We introduce a novel covariance estimator for portfolio selection that adapts to the non-stationary or persistent heteroskedastic environments of financial time series by employing exponentially weighted averages and nonlinearly shrinking the sample eigenvalues through cross-validation. Our estimator is structure agnostic, transparent, and computationally feasible in large dimensions. By correctin… ▽ More

    Submitted 20 January, 2023; v1 submitted 10 December, 2020; originally announced December 2020.

  32. arXiv:2011.08046  [pdf, other

    cs.LG stat.ML

    Risk-Constrained Thompson Sampling for CVaR Bandits

    Authors: Joel Q. L. Chang, Qiuyu Zhu, Vincent Y. F. Tan

    Abstract: The multi-armed bandit (MAB) problem is a ubiquitous decision-making problem that exemplifies the exploration-exploitation tradeoff. Standard formulations exclude risk in decision making. Risk notably complicates the basic reward-maximising objective, in part because there is no universally agreed definition of it. In this paper, we consider a popular risk measure in quantitative finance known as… ▽ More

    Submitted 4 February, 2021; v1 submitted 16 November, 2020; originally announced November 2020.

    Comments: 7 pages main paper with 11 pages supplementary material

  33. arXiv:2007.12364  [pdf, other

    eess.SP cs.LG stat.CO stat.ML

    Positive Semidefinite Matrix Factorization: A Connection with Phase Retrieval and Affine Rank Minimization

    Authors: Dana Lahat, Yanbin Lang, Vincent Y. F. Tan, Cédric Févotte

    Abstract: Positive semidefinite matrix factorization (PSDMF) expresses each entry of a nonnegative matrix as the inner product of two positive semidefinite (psd) matrices. When all these psd matrices are constrained to be diagonal, this model is equivalent to nonnegative matrix factorization. Applications include combinatorial optimization, quantum-based statistical models, and recommender systems, among ot… ▽ More

    Submitted 2 April, 2021; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: 18 pages (16 paper + 2 supplementary material), 9 figures, accepted for publication in the IEEE Transactions on Signal Processing. This is a revised version: there is a new additional PSDMF algorithm based on CGIHT, more numerical experiments, and some background material moved to Supplementary Material (pages 17 and 18 in this document). Supplementary Material also contains some extra figures

  34. arXiv:2005.04354  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Exact Asymptotics for Learning Tree-Structured Graphical Models with Side Information: Noiseless and Noisy Samples

    Authors: Anshoo Tandon, Vincent Y. F. Tan, Shiyao Zhu

    Abstract: Given side information that an Ising tree-structured graphical model is homogeneous and has no external field, we derive the exact asymptotics of learning its structure from independently drawn samples. Our results, which leverage the use of probabilistic tools from the theory of strong large deviations, refine the large deviation (error exponents) results of Tan, Anandkumar, Tong, and Willsky [IE… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

  35. arXiv:2003.06511  [pdf, other

    cs.IT cs.LG math.ST stat.ML

    Optimal Change-Point Detection with Training Sequences in the Large and Moderate Deviations Regimes

    Authors: Haiyun He, Qiaosheng Zhang, Vincent Y. F. Tan

    Abstract: This paper investigates a novel offline change-point detection problem from an information-theoretic perspective. In contrast to most related works, we assume that the knowledge of the underlying pre- and post-change distributions are not known and can only be learned from the training sequences which are available. We further require the probability of the \emph{estimation error} to decay either… ▽ More

    Submitted 3 October, 2021; v1 submitted 13 March, 2020; originally announced March 2020.

    Comments: 27 pages, 11 figures

  36. arXiv:2003.00029  [pdf, other

    stat.AP

    Estimating the impact of treatment compliance over time on smoking cessation using data from ecological momentary assessments (EMA)

    Authors: Yaoyuan Vincent Tan, Donna Coffman, Megan Piper, Jason Roy

    Abstract: The Wisconsin Smoker's Health Study (WSHS2) was a longitudinal trial conducted to compare the effectiveness of two commonly used smoking cessation treatments, varenicline and combination nicotine replacement therapy (cNRT) with the less intense standard of care, nicotine patch. The main outcome of the WSHS2 study was that all three treatments had equivalent treatment effects. However, in-depth ana… ▽ More

    Submitted 28 February, 2020; originally announced March 2020.

    Comments: 26 pages, 6 figures

  37. arXiv:2002.00232  [pdf, other

    cs.LG cs.IT stat.ML

    Thompson Sampling Algorithms for Mean-Variance Bandits

    Authors: Qiuyu Zhu, Vincent Y. F. Tan

    Abstract: The multi-armed bandit (MAB) problem is a classical learning task that exemplifies the exploration-exploitation tradeoff. However, standard formulations do not take into account {\em risk}. In online decision making systems, risk is a primary concern. In this regard, the mean-variance risk measure is one of the most common objective functions. Existing algorithms for mean-variance optimization in… ▽ More

    Submitted 3 August, 2020; v1 submitted 1 February, 2020; originally announced February 2020.

    Comments: 26 pages, 10 figures, ICML 2020

  38. arXiv:2001.09327  [pdf, other

    cs.LG cs.IT math.OC stat.ML

    Tight Regret Bounds for Noisy Optimization of a Brownian Motion

    Authors: Zexin Wang, Vincent Y. F. Tan, Jonathan Scarlett

    Abstract: We consider the problem of Bayesian optimization of a one-dimensional Brownian motion in which the $T$ adaptively chosen observations are corrupted by Gaussian noise. We show that as the smallest possible expected cumulative regret and the smallest possible expected simple regret scale as $Ω(σ\sqrt{T / \log (T)}) \cap \mathcal{O}(σ\sqrt{T} \cdot \log T)$ and… ▽ More

    Submitted 15 January, 2022; v1 submitted 25 January, 2020; originally announced January 2020.

  39. arXiv:2001.08655  [pdf, other

    cs.LG cs.IT stat.ML

    Best Arm Identification for Cascading Bandits in the Fixed Confidence Setting

    Authors: Zixin Zhong, Wang Chi Cheung, Vincent Y. F. Tan

    Abstract: We design and analyze CascadeBAI, an algorithm for finding the best set of $K$ items, also called an arm, within the framework of cascading bandits. An upper bound on the time complexity of CascadeBAI is derived by overcoming a crucial analytical challenge, namely, that of probabilistically estimating the amount of available feedback at each step. To do so, we define a new class of random variable… ▽ More

    Submitted 15 June, 2020; v1 submitted 23 January, 2020; originally announced January 2020.

    Comments: 39 pages, 25 figures. Proceedings of the 37th International Conference on Machine Learning (ICML), Vienna, Austria, PMLR 108, 2020

  40. arXiv:1912.01170  [pdf, other

    stat.ML cs.IT cs.LG

    Sequential Classification with Empirically Observed Statistics

    Authors: Mahdi Haghifam, Vincent Y. F. Tan, Ashish Khisti

    Abstract: Motivated by real-world machine learning applications, we consider a statistical classification task in a sequential setting where test samples arrive sequentially. In addition, the generating distributions are unknown and only a set of empirically sampled sequences are available to a decision maker. The decision maker is tasked to classify a test sequence which is known to be generated according… ▽ More

    Submitted 9 February, 2021; v1 submitted 2 December, 2019; originally announced December 2019.

    Comments: 17 Pages, 5 Figures. To appear in the IEEE Transactions on Information Theory

  41. arXiv:1911.09879  [pdf, other

    cs.LG eess.SP stat.AP stat.ME stat.ML

    Economy Statistical Recurrent Units For Inferring Nonlinear Granger Causality

    Authors: Saurabh Khanna, Vincent Y. F. Tan

    Abstract: Granger causality is a widely-used criterion for analyzing interactions in large-scale networks. As most physical interactions are inherently nonlinear, we consider the problem of inferring the existence of pairwise Granger causality between nonlinearly interacting stochastic processes from their time series measurements. Our proposed approach relies on modeling the embedded nonlinearities in the… ▽ More

    Submitted 13 January, 2020; v1 submitted 22 November, 2019; originally announced November 2019.

    Comments: A new RNN architecture for inferring nonlinear Granger causality from time series data with emphasis on learning time-localized predictive features

  42. arXiv:1910.05513  [pdf, other

    cs.LG stat.ML

    On Robustness of Neural Ordinary Differential Equations

    Authors: Hanshu Yan, Jiawei Du, Vincent Y. F. Tan, Jiashi Feng

    Abstract: Neural ordinary differential equations (ODEs) have been attracting increasing attention in various research domains recently. There have been some works studying optimization issues and approximation capabilities of neural ODEs, but their robustness is still yet unclear. In this work, we fill this important gap by exploring robustness properties of neural ODEs both empirically and theoretically. W… ▽ More

    Submitted 3 March, 2022; v1 submitted 12 October, 2019; originally announced October 2019.

  43. arXiv:1903.06500  [pdf, other

    cs.LG eess.SP stat.ML

    A Ranking Model Motivated by Nonnegative Matrix Factorization with Applications to Tennis Tournaments

    Authors: Rui Xia, Vincent Y. F. Tan, Louis Filstroff, Cédric Févotte

    Abstract: We propose a novel ranking model that combines the Bradley-Terry-Luce probability model with a nonnegative matrix factorization framework to model and uncover the presence of latent variables that influence the performance of top tennis players. We derive an efficient, provably convergent, and numerically stable majorization-minimization-based algorithm to maximize the likelihood of datasets under… ▽ More

    Submitted 12 June, 2019; v1 submitted 15 March, 2019; originally announced March 2019.

    Comments: 16 pages, 2 figures, 9 tables. Accepted and to be presented at the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD) 2019. Supplementary material, code and datasets can be found in this URL https://github.com/XiaRui1996/btl-nmf

  44. arXiv:1901.10757  [pdf, other

    cs.LG math.OC stat.ML

    Distributionally Robust and Multi-Objective Nonnegative Matrix Factorization

    Authors: Nicolas Gillis, Le Thi Khanh Hien, Valentin Leplat, Vincent Y. F. Tan

    Abstract: Nonnegative matrix factorization (NMF) is a linear dimensionality reduction technique for analyzing nonnegative data. A key aspect of NMF is the choice of the objective function that depends on the noise model (or statistics of the noise) assumed on the data. In many applications, the noise model is unknown and difficult to estimate. In this paper, we define a multi-objective NMF (MO-NMF) problem,… ▽ More

    Submitted 9 February, 2021; v1 submitted 30 January, 2019; originally announced January 2019.

    Comments: Accepted in IEEE Trans. on Pattern Analysis and Machine Intelligence

  45. arXiv:1901.07504  [pdf, other

    stat.AP

    Bayesian additive regression trees and the General BART model

    Authors: Yaoyuan Vincent Tan, Jason Roy

    Abstract: Bayesian additive regression trees (BART) is a flexible prediction model/machine learning approach that has gained widespread popularity in recent years. As BART becomes more mainstream, there is an increased need for a paper that walks readers through the details of BART, from what it is to why it works. This tutorial is aimed at providing such a resource. In addition to explaining the different… ▽ More

    Submitted 22 January, 2019; originally announced January 2019.

  46. arXiv:1812.08855  [pdf, ps, other

    stat.AP

    Accounting for selection bias due to death in estimating the effect of wealth shock on cognition for the Health and Retirement Study

    Authors: Yaoyuan Vincent Tan, Carol A. C. Flannagan, Lindsay R. Pool, Michael R. Elliott

    Abstract: The Health and Retirement Study is a longitudinal study of US adults enrolled at age 50 and older. We were interested in investigating the effect of a sudden large decline in wealth on the cognitive score of subjects. Our analysis was complicated by the lack of randomization, confounding by indication, and a substantial fraction of the sample and population will die during follow-up leading to som… ▽ More

    Submitted 20 December, 2018; originally announced December 2018.

    Comments: 43 pages, 8 Tables

  47. arXiv:1811.03679  [pdf, other

    stat.ML cs.LG

    Practical Bayesian Learning of Neural Networks via Adaptive Optimisation Methods

    Authors: Samuel Kessler, Arnold Salas, Vincent W. C. Tan, Stefan Zohren, Stephen Roberts

    Abstract: We introduce a novel framework for the estimation of the posterior distribution over the weights of a neural network, based on a new probabilistic interpretation of adaptive optimisation algorithms such as AdaGrad and Adam. We demonstrate the effectiveness of our Bayesian Adam method, Badam, by experimentally showing that the learnt uncertainties correctly relate to the weights' predictive capabil… ▽ More

    Submitted 20 July, 2020; v1 submitted 8 November, 2018; originally announced November 2018.

    Comments: Presented at the ICML 2020 Workshop on Uncertainty and Robustness in Deep Learning

  48. arXiv:1810.01187  [pdf, other

    cs.LG stat.ML

    Thompson Sampling Algorithms for Cascading Bandits

    Authors: Zixin Zhong, Wang Chi Cheung, Vincent Y. F. Tan

    Abstract: Motivated by the pressing need for efficient optimization in online recommender systems, we revisit the cascading bandit model proposed by Kveton et al. (2015). While Thompson sampling (TS) algorithms have been shown to be empirically superior to Upper Confidence Bound (UCB) algorithms for cascading bandits, theoretical guarantees are only known for the latter. In this paper, we first provide a pr… ▽ More

    Submitted 15 May, 2021; v1 submitted 2 October, 2018; originally announced October 2018.

    Comments: 62 pages, 6 figures

  49. arXiv:1801.03147  [pdf, ps, other

    stat.AP

    "Robust-squared" Imputation Models Using BART

    Authors: Yaoyuan V. Tan, Carol A. C. Flannagan, Michael R. Elliott

    Abstract: Examples of "doubly robust" estimator for missing data include augmented inverse probability weighting (AIPWT) models (Robins et al., 1994) and penalized splines of propensity prediction (PSPP) models (Zhang and Little, 2009). Doubly-robust estimators have the property that, if either the response propensity or the mean is modeled correctly, a consistent estimator of the population mean is obtaine… ▽ More

    Submitted 9 January, 2018; originally announced January 2018.

  50. arXiv:1704.00116  [pdf, other

    math.OC cs.IT stat.ML

    Stochastic L-BFGS: Improved Convergence Rates and Practical Acceleration Strategies

    Authors: Renbo Zhao, William B. Haskell, Vincent Y. F. Tan

    Abstract: We revisit the stochastic limited-memory BFGS (L-BFGS) algorithm. By proposing a new framework for the convergence analysis, we prove improved convergence rates and computational complexities of the stochastic L-BFGS algorithms compared to previous works. In addition, we propose several practical acceleration strategies to speed up the empirical performance of such algorithms. We also provide theo… ▽ More

    Submitted 24 October, 2017; v1 submitted 31 March, 2017; originally announced April 2017.