Skip to main content

Showing 1–27 of 27 results for author: Shekhar, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2501.13187  [pdf, ps, other

    stat.AP math.ST

    Sequential One-Sided Hypothesis Testing of Markov Chains

    Authors: Greg Fields, Tara Javidi, Shubhanshu Shekhar

    Abstract: We study the problem of sequentially testing whether a given stochastic process is generated by a known Markov chain. Formally, given access to a stream of random variables, we want to quickly determine whether this sequence is a trajectory of a Markov chain with a known transition matrix $P$ (null hypothesis) or not (composite alternative hypothesis). This problem naturally arises in many enginee… ▽ More

    Submitted 22 January, 2025; originally announced January 2025.

  2. arXiv:2407.02536  [pdf, other

    cs.LG cs.IR econ.GN stat.AP

    Reducing False Discoveries in Statistically-Significant Regional-Colocation Mining: A Summary of Results

    Authors: Subhankar Ghosh, Jayant Gupta, Arun Sharma, Shuai An, Shashi Shekhar

    Abstract: Given a set \emph{S} of spatial feature types, its feature instances, a study area, and a neighbor relationship, the goal is to find pairs $<$a region ($r_{g}$), a subset \emph{C} of \emph{S}$>$ such that \emph{C} is a statistically significant regional-colocation pattern in $r_{g}$. This problem is important for applications in various domains including ecology, economics, and sociology. The prob… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    ACM Class: E.m; F.2; E.1; H.3; I.5; J.0

  3. arXiv:2407.00317  [pdf, other

    cs.IR stat.AP

    Towards Statistically Significant Taxonomy Aware Co-location Pattern Detection

    Authors: Subhankar Ghosh, Arun Sharma, Jayant Gupta, Shashi Shekhar

    Abstract: Given a collection of Boolean spatial feature types, their instances, a neighborhood relation (e.g., proximity), and a hierarchical taxonomy of the feature types, the goal is to find the subsets of feature types or their parents whose spatial interaction is statistically significant. This problem is for taxonomy-reliant applications such as ecology (e.g., finding new symbiotic relationships across… ▽ More

    Submitted 4 July, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

    Comments: Accepted in The 16th Conference on Spatial Information Theory (COSIT) 2024

    ACM Class: E.m; H.3.3; I.5; J.4; J.4

  4. arXiv:2311.12825  [pdf, ps, other

    cs.AI cs.LG stat.ME

    A PSO Based Method to Generate Actionable Counterfactuals for High Dimensional Data

    Authors: Shashank Shekhar, Asif Salim, Adesh Bansode, Vivaswan Jinturkar, Anirudha Nayak

    Abstract: Counterfactual explanations (CFE) are methods that explain a machine learning model by giving an alternate class prediction of a data point with some minimal changes in its features. It helps the users to identify their data attributes that caused an undesirable prediction like a loan or credit card rejection. We describe an efficient and an actionable counterfactual (CF) generation method based o… ▽ More

    Submitted 30 November, 2023; v1 submitted 30 September, 2023; originally announced November 2023.

    Comments: Accepted in IEEE CSDE 2023

  5. arXiv:2310.19384  [pdf, other

    stat.ML cs.LG

    Deep anytime-valid hypothesis testing

    Authors: Teodora Pandeva, Patrick Forré, Aaditya Ramdas, Shubhanshu Shekhar

    Abstract: We propose a general framework for constructing powerful, sequential hypothesis tests for a large class of nonparametric testing problems. The null hypothesis for these problems is defined in an abstract form using the action of two known operators on the data distribution. This abstraction allows for a unified treatment of several classical tasks, such as two-sample testing, independence testing,… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  6. arXiv:2310.15179  [pdf, other

    physics.ao-ph cs.AI cs.LG math.DS stat.OT

    Reducing Uncertainty in Sea-level Rise Prediction: A Spatial-variability-aware Approach

    Authors: Subhankar Ghosh, Shuai An, Arun Sharma, Jayant Gupta, Shashi Shekhar, Aneesh Subramanian

    Abstract: Given multi-model ensemble climate projections, the goal is to accurately and reliably predict future sea-level rise while lowering the uncertainty. This problem is important because sea-level rise affects millions of people in coastal communities and beyond due to climate change's impacts on polar ice sheets and the ocean. This problem is challenging due to spatial variability and unknowns such a… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 6 pages, 5 figures, I-GUIDE 2023 conference

    ACM Class: J.2; I.2.m; I.2.6; I.2.1; I.2

  7. arXiv:2310.01547  [pdf, other

    math.ST cs.IT cs.LG stat.AP stat.ML

    On the near-optimality of betting confidence sets for bounded means

    Authors: Shubhanshu Shekhar, Aaditya Ramdas

    Abstract: Constructing nonasymptotic confidence intervals (CIs) for the mean of a univariate distribution from independent and identically distributed (i.i.d.) observations is a fundamental task in statistics. For bounded observations, a classical nonparametric approach proceeds by inverting standard concentration bounds, such as Hoeffding's or Bernstein's inequalities. Recently, an alternative betting-base… ▽ More

    Submitted 24 November, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 53 pages, 2 figures

  8. arXiv:2309.09111  [pdf, ps, other

    math.ST cs.LG stat.ME stat.ML

    Reducing sequential change detection to sequential estimation

    Authors: Shubhanshu Shekhar, Aaditya Ramdas

    Abstract: We consider the problem of sequential change detection, where the goal is to design a scheme for detecting any changes in a parameter or functional $θ$ of the data stream distribution that has small detection delay, but guarantees control on the frequency of false alarms in the absence of changes. In this paper, we describe a simple reduction from sequential change detection to sequential estimati… ▽ More

    Submitted 24 November, 2023; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: 11 pages

  9. arXiv:2305.06884  [pdf, ps, other

    stat.ME cs.AI cs.LG math.ST stat.AP stat.ML

    Risk-limiting Financial Audits via Weighted Sampling without Replacement

    Authors: Shubhanshu Shekhar, Ziyu Xu, Zachary C. Lipton, Pierre J. Liang, Aaditya Ramdas

    Abstract: We introduce the notion of a risk-limiting financial auditing (RLFA): given $N$ transactions, the goal is to estimate the total misstated monetary fraction~($m^*$) to a given accuracy $ε$, with confidence $1-δ$. We do this by constructing new confidence sequences (CSs) for the weighted average of $N$ unknown values, based on samples drawn without replacement according to a (randomized) weighted sa… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 23 pages, 8 figures, to appear in the Proceedings of Uncertainty in Artificial Intelligence (UAI) 2023

  10. arXiv:2302.02544  [pdf, other

    math.ST cs.IT cs.LG stat.ME stat.ML

    Sequential change detection via backward confidence sequences

    Authors: Shubhanshu Shekhar, Aaditya Ramdas

    Abstract: We present a simple reduction from sequential estimation to sequential changepoint detection (SCD). In short, suppose we are interested in detecting changepoints in some parameter or functional $θ$ of the underlying distribution. We demonstrate that if we can construct a confidence sequence (CS) for $θ$, then we can also successfully perform SCD for $θ$. This is accomplished by checking if two CSs… ▽ More

    Submitted 5 February, 2023; originally announced February 2023.

    Comments: 24 pages, 10 figures

  11. arXiv:2212.09108  [pdf, ps, other

    stat.ME cs.LG math.ST stat.ML

    A Permutation-Free Kernel Independence Test

    Authors: Shubhanshu Shekhar, Ilmun Kim, Aaditya Ramdas

    Abstract: In nonparametric independence testing, we observe i.i.d.\ data $\{(X_i,Y_i)\}_{i=1}^n$, where $X \in \mathcal{X}, Y \in \mathcal{Y}$ lie in any general spaces, and we wish to test the null that $X$ is independent of $Y$. Modern test statistics such as the kernel Hilbert-Schmidt Independence Criterion (HSIC) and Distance Covariance (dCov) have intractable null distributions due to the degeneracy of… ▽ More

    Submitted 18 December, 2022; originally announced December 2022.

    Comments: 52 pages, 4 figures

  12. arXiv:2211.14908  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    A Permutation-free Kernel Two-Sample Test

    Authors: Shubhanshu Shekhar, Ilmun Kim, Aaditya Ramdas

    Abstract: The kernel Maximum Mean Discrepancy~(MMD) is a popular multivariate distance metric between distributions that has found utility in two-sample testing. The usual kernel-MMD test statistic is a degenerate U-statistic under the null, and thus it has an intractable limiting distribution. Hence, to design a level-$α$ test, one usually selects the rejection threshold as the $(1-α)$-quantile of the perm… ▽ More

    Submitted 4 February, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: Published at the Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS), with an oral presentation

  13. arXiv:2206.14486  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Beyond neural scaling laws: beating power law scaling via data pruning

    Authors: Ben Sorscher, Robert Geirhos, Shashank Shekhar, Surya Ganguli, Ari S. Morcos

    Abstract: Widely observed neural scaling laws, in which error falls off as a power of the training set size, model size, or both, have driven substantial performance improvements in deep learning. However, these improvements through scaling alone require considerable costs in compute and energy. Here we focus on the scaling of error with dataset size and show how in theory we can break beyond power law scal… ▽ More

    Submitted 21 April, 2023; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: Outstanding Paper Award @ NeurIPS 2022. Added github link to metric scores

  14. arXiv:2203.06297  [pdf, other

    cs.LG stat.ML

    Instance-Dependent Regret Analysis of Kernelized Bandits

    Authors: Shubhanshu Shekhar, Tara Javidi

    Abstract: We study the kernelized bandit problem, that involves designing an adaptive strategy for querying a noisy zeroth-order-oracle to efficiently learn about the optimizer of an unknown function $f$ with a norm bounded by $M<\infty$ in a Reproducing Kernel Hilbert Space~(RKHS) associated with a positive definite kernel $K$. Prior results, working in a \emph{minimax framework}, have characterized the wo… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: 26 pages, 1 figure

  15. arXiv:2112.09162  [pdf, ps, other

    math.ST cs.IT stat.ME

    Nonparametric Two-Sample Testing by Betting

    Authors: Shubhanshu Shekhar, Aaditya Ramdas

    Abstract: We study the problem of designing consistent sequential two-sample tests in a nonparametric setting. Guided by the principle of testing by betting, we reframe this task into that of selecting a sequence of payoff functions that maximize the wealth of a fictitious bettor, betting against the null in a repeated game. In this setting, the relative increase in the bettor's wealth has a precise interpr… ▽ More

    Submitted 18 May, 2023; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: 55 pages, 4 figures. Updated statement of Theorem 1 with an improved upper bound + new matching lower bounds in Propositions 2 and 4

  16. arXiv:2105.09240  [pdf, other

    cs.LG stat.ML

    Boosting Variational Inference With Locally Adaptive Step-Sizes

    Authors: Gideon Dresdner, Saurav Shekhar, Fabian Pedregosa, Francesco Locatello, Gunnar Rätsch

    Abstract: Variational Inference makes a trade-off between the capacity of the variational family and the tractability of finding an approximate posterior distribution. Instead, Boosting Variational Inference allows practitioners to obtain increasingly good posterior approximations by spending more compute. The main obstacle to widespread adoption of Boosting Variational Inference is the amount of resources… ▽ More

    Submitted 19 May, 2021; originally announced May 2021.

  17. arXiv:2103.12019  [pdf, ps, other

    stat.ML cs.LG stat.CO

    Statistically-Robust Clustering Techniques for Mapping Spatial Hotspots: A Survey

    Authors: Yiqun Xie, Shashi Shekhar, Yan Li

    Abstract: Mapping of spatial hotspots, i.e., regions with significantly higher rates of generating cases of certain events (e.g., disease or crime cases), is an important task in diverse societal domains, including public health, public safety, transportation, agriculture, environmental science, etc. Clustering techniques required by these domains differ from traditional clustering methods due to the high e… ▽ More

    Submitted 9 October, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: 39 pages, 5 figures, accepted by ACM Computing Surveys (CSUR)

  18. arXiv:2006.00817  [pdf

    cs.LG stat.ML

    Adversarial Attacks on Reinforcement Learning based Energy Management Systems of Extended Range Electric Delivery Vehicles

    Authors: Pengyue Wang, Yan Li, Shashi Shekhar, William F. Northrop

    Abstract: Adversarial examples are firstly investigated in the area of computer vision: by adding some carefully designed ''noise'' to the original input image, the perturbed image that cannot be distinguished from the original one by human, can fool a well-trained classifier easily. In recent years, researchers also demonstrated that adversarial examples can mislead deep reinforcement learning (DRL) agents… ▽ More

    Submitted 1 June, 2020; originally announced June 2020.

    Comments: 6 pages and 14 figures

  19. arXiv:2005.04832  [pdf, other

    cs.LG math.OC stat.ML

    Multi-Scale Zero-Order Optimization of Smooth Functions in an RKHS

    Authors: Shubhanshu Shekhar, Tara Javidi

    Abstract: We aim to optimize a black-box function $f:\mathcal{X} \mapsto \mathbb{R}$ under the assumption that $f$ is Hölder smooth and has bounded norm in the RKHS associated with a given kernel $K$. This problem is known to have an agnostic Gaussian Process (GP) bandit interpretation in which an appropriately constructed GP surrogate model with kernel $K$ is used to obtain an upper confidence bound (UCB)… ▽ More

    Submitted 10 May, 2020; originally announced May 2020.

    Comments: 20 pages, 2 figures. Preliminary version -- feedback welcome

  20. arXiv:2003.03297  [pdf, other

    stat.ML cs.LG

    Active Model Estimation in Markov Decision Processes

    Authors: Jean Tarbouriech, Shubhanshu Shekhar, Matteo Pirotta, Mohammad Ghavamzadeh, Alessandro Lazaric

    Abstract: We study the problem of efficient exploration in order to learn an accurate model of an environment, modeled as a Markov decision process (MDP). Efficient exploration in this problem requires the agent to identify the regions in which estimating the model is more difficult and then exploit this knowledge to collect more samples there. In this paper, we formalize this problem, introduce the first a… ▽ More

    Submitted 22 June, 2020; v1 submitted 6 March, 2020; originally announced March 2020.

  21. arXiv:1910.12406  [pdf, other

    stat.ML cs.LG

    Adaptive Sampling for Estimating Multiple Probability Distributions

    Authors: Shubhanshu Shekhar, Tara Javidi, Mohammad Ghavamzadeh

    Abstract: We consider the problem of allocating samples to a finite set of discrete distributions in order to learn them uniformly well in terms of four common distance measures: $\ell_2^2$, $\ell_1$, $f$-divergence, and separation distance. To present a unified treatment of these distances, we first propose a general optimistic tracking algorithm and analyze its sample allocation performance w.r.t.~an orac… ▽ More

    Submitted 6 December, 2019; v1 submitted 27 October, 2019; originally announced October 2019.

    Comments: 40 pages, 3 figures

  22. arXiv:1906.00303  [pdf, other

    cs.LG stat.ML

    Active Learning for Binary Classification with Abstention

    Authors: Shubhanshu Shekhar, Mohammad Ghavamzadeh, Tara Javidi

    Abstract: We construct and analyze active learning algorithms for the problem of binary classification with abstention. We consider three abstention settings: \emph{fixed-cost} and two variants of \emph{bounded-rate} abstention, and for each of them propose an active learning algorithm. All the proposed algorithms can work in the most commonly used active learning models, i.e., \emph{membership-query}, \emp… ▽ More

    Submitted 1 June, 2019; originally announced June 2019.

    Comments: 42 pages, 1 figure

  23. arXiv:1905.09561  [pdf, other

    cs.LG stat.ML

    Binary Classification with Bounded Abstention Rate

    Authors: Shubhanshu Shekhar, Mohammad Ghavamzadeh, Tara Javidi

    Abstract: We consider the problem of binary classification with abstention in the relatively less studied \emph{bounded-rate} setting. We begin by obtaining a characterization of the Bayes optimal classifier for an arbitrary input-label distribution $P_{XY}$. Our result generalizes and provides an alternative proof for the result first obtained by \cite{chow1957optimum}, and then re-derived by \citet{denis2… ▽ More

    Submitted 23 May, 2019; originally announced May 2019.

    Comments: 35 pages, 4 figures

  24. arXiv:1902.09682  [pdf, ps, other

    stat.ML cs.LG

    Multiscale Gaussian Process Level Set Estimation

    Authors: Shubhanshu Shekhar, Tara Javidi

    Abstract: In this paper, the problem of estimating the level set of a black-box function from noisy and expensive evaluation queries is considered. A new algorithm for this problem in the Bayesian framework with a Gaussian Process (GP) prior is proposed. The proposed algorithm employs a hierarchical sequence of partitions to explore different regions of the search space at varying levels of detail depending… ▽ More

    Submitted 25 February, 2019; originally announced February 2019.

    Comments: 15 pages

    Journal ref: AISTATS 2019

  25. arXiv:1805.02269  [pdf, other

    cs.LG stat.ML

    Incorporating Privileged Information to Unsupervised Anomaly Detection

    Authors: Shubhranshu Shekhar, Leman Akoglu

    Abstract: We introduce a new unsupervised anomaly detection ensemble called SPI which can harness privileged information - data available only for training examples but not for (future) test examples. Our ideas build on the Learning Using Privileged Information (LUPI) paradigm pioneered by Vapnik et al. [19,17], which we extend to unsupervised learning and in particular to anomaly detection. SPI (for Spotti… ▽ More

    Submitted 24 May, 2018; v1 submitted 6 May, 2018; originally announced May 2018.

  26. arXiv:1712.01447  [pdf, other

    stat.ML cs.LG

    Gaussian Process bandits with adaptive discretization

    Authors: Shubhanshu Shekhar, Tara Javidi

    Abstract: In this paper, the problem of maximizing a black-box function $f:\mathcal{X} \to \mathbb{R}$ is studied in the Bayesian framework with a Gaussian Process (GP) prior. In particular, a new algorithm for this problem is proposed, and high probability bounds on its simple and cumulative regret are established. The query point selection rule in most existing methods involves an exhaustive search over a… ▽ More

    Submitted 5 January, 2018; v1 submitted 4 December, 2017; originally announced December 2017.

    Comments: 34 pages, 2 figures

  27. arXiv:1612.08544  [pdf, other

    cs.LG cs.AI stat.ML

    Theory-guided Data Science: A New Paradigm for Scientific Discovery from Data

    Authors: Anuj Karpatne, Gowtham Atluri, James Faghmous, Michael Steinbach, Arindam Banerjee, Auroop Ganguly, Shashi Shekhar, Nagiza Samatova, Vipin Kumar

    Abstract: Data science models, although successful in a number of commercial domains, have had limited applicability in scientific problems involving complex physical phenomena. Theory-guided data science (TGDS) is an emerging paradigm that aims to leverage the wealth of scientific knowledge for improving the effectiveness of data science models in enabling scientific discovery. The overarching vision of TG… ▽ More

    Submitted 13 November, 2017; v1 submitted 27 December, 2016; originally announced December 2016.

    Journal ref: IEEE Transactions on Knowledge and Data Engineering, 29(10), pp.2318-2331. 2017