Showing 1–50 of 171 results for author: Karthik

Searching in archive stat.
  1. arXiv:2506.00316  [pdf, ps, other]

    cs.LG math.ST stat.ML

    Active Learning via Regression Beyond Realizability

    Authors: Atul Ganju, Shashaank Aiyer, Ved Sriraman, Karthik Sridharan

    Abstract: We present a new active learning framework for multiclass classification based on surrogate risk minimization that operates beyond the standard realizability assumption. Existing surrogate-based active learning algorithms crucially rely on realizability (the assumption that the optimal surrogate predictor lies within the model class), limiting their applicability in pr…

    Submitted 30 May, 2025; originally announced June 2025.

  2. arXiv:2505.23546  [pdf, other]

    math.OC stat.ML

    Going from a Representative Agent to Counterfactuals in Combinatorial Choice

    Authors: Yanqiu Ruan, Karthyek Murthy, Karthik Natarajan

    Abstract: We study decision-making problems where data comprises points from a collection of binary polytopes, capturing aggregate information stemming from various combinatorial selection environments. We propose a nonparametric approach for counterfactual inference in this setting based on a representative agent model, where the available data is viewed as arising from maximizing separable concave utility…

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: 22 pages, 3 figures

  3. arXiv:2504.20172  [pdf, other]

    cs.LG cs.AI stat.ME stat.ML

    Causal Identification in Time Series Models

    Authors: Erik Jahn, Karthik Karnik, Leonard J. Schulman

    Abstract: In this paper, we analyze the applicability of the Causal Identification algorithm to causal time series graphs with latent confounders. Since these graphs extend over infinitely many time steps, deciding whether causal effects across arbitrary time intervals are identifiable appears to require computation on graph segments of unbounded size. Even for deciding the identifiability of intervention e…

    Submitted 28 April, 2025; originally announced April 2025.

  4. arXiv:2504.03190  [pdf, other]

    math.OC cs.LG eess.SY stat.ML

    The Ground Cost for Optimal Transport of Angular Velocity

    Authors: Karthik Elamvazhuthi, Abhishek Halder

    Abstract: We revisit the optimal transport problem over angular velocity dynamics given by the controlled Euler equation. The solution of this problem enables stochastic guidance of spin states of a rigid body (e.g., spacecraft) under a hard deadline constraint by transferring given initial state statistics to desired terminal state statistics. This is an instance of generalized optimal transport over a no…

    Submitted 4 April, 2025; originally announced April 2025.

  5. arXiv:2503.21980  [pdf, other]

    math.ST stat.ME stat.ML

    Rolled Gaussian process models for curves on manifolds

    Authors: Simon Preston, Karthik Bharath, Pablo Lopez-Custodio, Alfred Kume

    Abstract: Given a planar curve, imagine rolling a sphere along that curve without slipping or twisting, and by this means tracing out a curve on the sphere. It is well known that such a rolling operation induces a local isometry between the sphere and the plane so that the two curves uniquely determine each other, and moreover, the operation extends to a general class of manifolds in any dimension. We use r…

    Submitted 27 March, 2025; originally announced March 2025.

  6. arXiv:2503.08004  [pdf, ps, other]

    cs.LG stat.ML

    Multiplayer Information Asymmetric Bandits in Metric Spaces

    Authors: William Chang, Aditi Karthik

    Abstract: In this paper we study the Lipschitz bandit problem applied to the multiplayer information asymmetric problem studied in \cite{chang2022online, chang2023optimal}. More specifically, we consider information asymmetry in rewards, actions, or both. We adopt the CAB algorithm given in \cite{kleinberg2004nearly} which uses a fixed discretiza…

    Submitted 10 March, 2025; originally announced March 2025.

  7. arXiv:2501.13607  [pdf, other]

    cs.LG cs.AI cs.IT stat.ML

    Optimal Multi-Objective Best Arm Identification with Fixed Confidence

    Authors: Zhirui Chen, P. N. Karthik, Yeow Meng Chee, Vincent Y. F. Tan

    Abstract: We consider a multi-armed bandit setting with finitely many arms, in which each arm yields an $M$-dimensional vector reward upon selection. We assume that the reward of each dimension (a.k.a. objective) is generated independently of the others. The best arm of any given objective is the arm with the largest component of mean corresponding to the objective. The end goal is to identify the bes…

    Submitted 23 January, 2025; originally announced January 2025.

    Comments: Accepted to AISTATS 2025

  8. arXiv:2412.18818  [pdf, other]

    math.ST stat.CO stat.ME

    Empirical likelihood for Fréchet means on open books

    Authors: Karthik Bharath, Huiling Le, Andrew T A Wood, Xi Yan

    Abstract: Empirical Likelihood (EL) is a type of nonparametric likelihood that is useful in many statistical inference problems, including confidence region construction and $k$-sample problems. It enjoys some remarkable theoretical properties, notably Bartlett correctability. One area where EL has potential but is under-developed is in non-Euclidean statistics where the Fréchet mean is the population chara…

    Submitted 25 December, 2024; originally announced December 2024.

  9. arXiv:2411.18747  [pdf, other]

    stat.CO

    Deterministic and Probabilistic Rounding Error Analysis for Mixed-Precision Arithmetic on Modern Computing Units

    Authors: Sahil Bhola, Karthik Duraisamy

    Abstract: Modern computer architectures support low-precision arithmetic, which presents opportunities for the adoption of mixed-precision algorithms to achieve high computational throughput and reduce energy consumption. As a growing number of scientific computations leverage specialized hardware accelerators, the risk of rounding errors increases, potentially compromising the reliability of models. This sh…

    Submitted 27 November, 2024; originally announced November 2024.

  10. arXiv:2411.18416  [pdf, other]

    stat.ME stat.CO stat.ML

    Probabilistic size-and-shape functional mixed models

    Authors: Fangyi Wang, Karthik Bharath, Oksana Chkrebtii, Sebastian Kurtek

    Abstract: The reliable recovery and uncertainty quantification of a fixed effect function $μ$ in a functional mixed model, for modelling population- and object-level variability in noisily observed functional data, is a notoriously challenging task: variations along the $x$ and $y$ axes are confounded with additive measurement error, and cannot in general be disentangled. The question then as to what proper…

    Submitted 27 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024

  11. arXiv:2410.16295  [pdf, other]

    physics.comp-ph cond-mat.dis-nn cond-mat.stat-mech cs.LG stat.ML

    Universal Approximation of Mean-Field Models via Transformers

    Authors: Shiba Biswal, Karthik Elamvazhuthi, Rishi Sonthalia

    Abstract: This paper investigates the use of transformers to approximate the mean-field dynamics of interacting particle systems exhibiting collective behavior. Such systems are fundamental in modeling phenomena across physics, biology, and engineering, including opinion formation, biological networks, and swarm robotics. The key characteristic of these systems is that the particles are indistinguishable, l…

    Submitted 27 May, 2025; v1 submitted 6 October, 2024; originally announced October 2024.

  12. arXiv:2410.02979  [pdf, ps, other]

    stat.ML cs.LG math.ST

    Optimization, Isoperimetric Inequalities, and Sampling via Lyapunov Potentials

    Authors: August Y. Chen, Karthik Sridharan

    Abstract: In this paper, we prove that optimizability of any $F$ using Gradient Flow from all initializations implies a Poincaré Inequality for Gibbs measures $\mu_\beta = e^{-\beta F}/Z$ at low temperature. In particular, under mild regularity assumptions on the convergence rate of Gradient Flow, we establish that $\mu_\beta$ satisfies a Poincaré Inequality with constant $O(C' + 1/\beta)$ for $\beta \ge \Omega(d)$, where $C'$ is…

    Submitted 4 March, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: 52 pages. New version with new results on Weak Poincaré Inequalities and improved presentation

  13. arXiv:2409.19901  [pdf, other]

    cs.LG stat.ML

    SurvCORN: Survival Analysis with Conditional Ordinal Ranking Neural Network

    Authors: Muhammad Ridzuan, Numan Saeed, Fadillah Adamsyah Maani, Karthik Nandakumar, Mohammad Yaqub

    Abstract: Survival analysis plays a crucial role in estimating the likelihood of future events for patients by modeling time-to-event data, particularly in healthcare settings where predictions about outcomes such as death and disease recurrence are essential. However, this analysis poses challenges due to the presence of censored data, where time-to-event information is missing for certain data points. Yet…

    Submitted 29 September, 2024; originally announced September 2024.

  14. arXiv:2404.13056  [pdf, other]

    cs.LG cs.CE stat.CO stat.ME stat.ML

    Variational Bayesian Optimal Experimental Design with Normalizing Flows

    Authors: Jiayuan Dong, Christian Jacobsen, Mehdi Khalloufi, Maryam Akram, Wanjiao Liu, Karthik Duraisamy, Xun Huan

    Abstract: Bayesian optimal experimental design (OED) seeks experiments that maximize the expected information gain (EIG) in model parameters. Directly estimating the EIG using nested Monte Carlo is computationally expensive and requires an explicit likelihood. Variational OED (vOED), in contrast, estimates a lower bound of the EIG without likelihood evaluations by approximating the posterior distributions w…

    Submitted 27 April, 2025; v1 submitted 8 April, 2024; originally announced April 2024.

    MSC Class: 62K05; 94A17; 62C10; 62F15

    Journal ref: Computer Methods in Applied Mechanics and Engineering 433 (2025) 117457

  15. arXiv:2404.12556  [pdf, other]

    stat.CO

    Exploiting Higher-Order Statistics for Robust Probabilistic Rounding Error Analysis

    Authors: Sahil Bhola, Karthik Duraisamy

    Abstract: Modern computer hardware supports low- and mixed-precision arithmetic for enhanced computational efficiency. In practical predictive modeling, however, it becomes vital to quantify the uncertainty due to rounding along with other sources of uncertainty (such as measurement, sampling, and numerical discretization) to ensure efficiency gains do not compromise accuracy. Higham and Mary [1] showed tha…

    Submitted 16 January, 2025; v1 submitted 18 April, 2024; originally announced April 2024.

  16. arXiv:2403.18124  [pdf, other]

    math.OC stat.CO

    Stochastic Finite Volume Method for Uncertainty Management in Gas Pipeline Network Flows

    Authors: Saif R. Kazi, Sidhant Misra, Svetlana Tokareva, Kaarthik Sundar, Anatoly Zlotnik

    Abstract: Natural gas consumption by users of pipeline networks is subject to increasing uncertainty that originates from the intermittent nature of electric power loads serviced by gas-fired generators. To enable computationally efficient optimization of gas network flows subject to uncertainty, we develop a finite volume representation of stochastic solutions of hyperbolic partial differential equation (P…

    Submitted 26 March, 2024; originally announced March 2024.

    Report number: LA-UR-24-22647

    MSC Class: 65Kxx; 90C35; 90C15

  17. arXiv:2403.04033  [pdf, ps, other]

    cs.LG cs.AI math.ST stat.ML

    Online Learning with Unknown Constraints

    Authors: Karthik Sridharan, Seung Won Wilson Yoo

    Abstract: We consider the problem of online learning where the sequence of actions played by the learner must adhere to an unknown safety constraint at every round. The goal is to minimize regret with respect to the best safe action in hindsight while simultaneously satisfying the safety constraint with high probability on each round. We provide a general meta-algorithm that leverages an online regression o…

    Submitted 6 March, 2024; originally announced March 2024.

  18. arXiv:2401.11515  [pdf, other]

    stat.ME

    Geometry-driven Bayesian Inference for Ultrametric Covariance Matrices

    Authors: Tsung-Hung Yao, Zhenke Wu, Karthik Bharath, Veerabhadran Baladandayuthapani

    Abstract: Ultrametric matrices are a class of covariance matrices that arise in latent tree models. As a parameter space in a statistical model, the set of ultrametric matrices is neither convex nor a smooth manifold. Focus in the literature has hitherto been restricted to estimation through projections and relaxation-based techniques, and inferential methods are lacking. Motivated by this, we establish a b…

    Submitted 24 April, 2025; v1 submitted 21 January, 2024; originally announced January 2024.

  19. arXiv:2401.09073  [pdf, other]

    cs.LG cs.AI cs.IT math.ST stat.ML

    Fixed-Budget Differentially Private Best Arm Identification

    Authors: Zhirui Chen, P. N. Karthik, Yeow Meng Chee, Vincent Y. F. Tan

    Abstract: We study best arm identification (BAI) in linear bandits in the fixed-budget regime under differential privacy constraints, when the arm rewards are supported on the unit interval. Given a finite budget $T$ and a privacy parameter $\varepsilon>0$, the goal is to minimise the error probability in finding the arm with the largest mean after $T$ sampling rounds, subject to the constraint that the pol…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted to ICLR 2024

  20. arXiv:2401.01231  [pdf, other]

    stat.AP

    Movement of insurgent gangs: A Bayesian kernel density model for incomplete temporal data

    Authors: Karthik Sriram, Dhruv Gupta, Rajiv Parikh

    Abstract: We develop a Bayesian modeling framework to address a pressing real-life problem faced by the police in tackling insurgent gangs. Unlike criminals associated with common crimes such as robbery, theft or street crime, insurgent gangs are trained in sophisticated arms and strategise against the government to weaken its resolve. They are constantly on the move, operating over large areas causing dama…

    Submitted 2 January, 2024; originally announced January 2024.

  21. arXiv:2312.14882  [pdf, ps, other]

    math.ST math.NA math.PR stat.CO stat.ML

    Sampling and estimation on manifolds using the Langevin diffusion

    Authors: Karthik Bharath, Alexander Lewis, Akash Sharma, Michael V Tretyakov

    Abstract: Error bounds are derived for sampling and estimation using a discretization of an intrinsically defined Langevin diffusion with invariant measure $\mathrm{d}μ_φ \propto e^{-φ}\,\mathrm{dvol}_g$ on a compact Riemannian manifold. Two estimators of linear functionals of $μ_φ$ based on the discretized Markov process are considered: a time-averaging estimator based on a single trajectory and an ensemble-a…

    Submitted 26 April, 2025; v1 submitted 22 December, 2023; originally announced December 2023.

  22. arXiv:2312.12361  [pdf, other]

    stat.ME

    Improved multifidelity Monte Carlo estimators based on normalizing flows and dimensionality reduction techniques

    Authors: Andrea Zanoni, Gianluca Geraci, Matteo Salvador, Karthik Menon, Alison L. Marsden, Daniele E. Schiavazzi

    Abstract: We study the problem of multifidelity uncertainty propagation for computationally expensive models. In particular, we consider the general setting where the high-fidelity and low-fidelity models have a dissimilar parameterization both in terms of number of random inputs and their probability distributions, which can be either known in closed form or provided through samples. We derive novel multif…

    Submitted 14 June, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

  23. arXiv:2312.11230  [pdf, other]

    stat.ML cs.LG

    Dirichlet-based Uncertainty Quantification for Personalized Federated Learning with Improved Posterior Networks

    Authors: Nikita Kotelevskii, Samuel Horváth, Karthik Nandakumar, Martin Takáč, Maxim Panov

    Abstract: In modern federated learning, one of the main challenges is to account for inherent heterogeneity and the diverse nature of data distributions for different clients. This problem is often addressed by introducing personalization of the models towards the data distribution of the particular client. However, a personalized model might be unreliable when applied to the data that is not typical for th…

    Submitted 18 December, 2023; originally announced December 2023.

  24. arXiv:2312.00854  [pdf, other]

    physics.med-ph cs.AI cs.LG math.NA stat.CO

    A Probabilistic Neural Twin for Treatment Planning in Peripheral Pulmonary Artery Stenosis

    Authors: John D. Lee, Jakob Richter, Martin R. Pfaller, Jason M. Szafron, Karthik Menon, Andrea Zanoni, Michael R. Ma, Jeffrey A. Feinstein, Jacqueline Kreutzer, Alison L. Marsden, Daniele E. Schiavazzi

    Abstract: The substantial computational cost of high-fidelity models in numerical hemodynamics has, so far, relegated their use mainly to offline treatment planning. New breakthroughs in data-driven architectures and optimization techniques for fast surrogate modeling provide an exciting opportunity to overcome these limitations, enabling the use of such technology for time-critical decisions. We discuss an…

    Submitted 1 December, 2023; originally announced December 2023.

  25. arXiv:2310.13393  [pdf, ps, other]

    stat.ML cs.IT cs.LG

    Optimal Best Arm Identification with Fixed Confidence in Restless Bandits

    Authors: P. N. Karthik, Vincent Y. F. Tan, Arpan Mukherjee, Ali Tajer

    Abstract: We study best arm identification in a restless multi-armed bandit setting with finitely many arms. The discrete-time data generated by each arm forms a homogeneous Markov chain taking values in a common, finite state space. The state transitions in each arm are captured by an ergodic transition probability matrix (TPM) that is a member of a single-parameter exponential family of TPMs. The real-val…

    Submitted 23 June, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted to the IEEE Transactions on Information Theory

  26. arXiv:2309.11512  [pdf, other]

    stat.AP cs.LG

    Multidimensional well-being of US households at a fine spatial scale using fused household surveys: fusionACS

    Authors: Kevin Ummel, Miguel Poblete-Cazenave, Karthik Akkiraju, Nick Graetz, Hero Ashman, Cora Kingdon, Steven Herrera Tenorio, Aaryaman "Sunny" Singhal, Daniel Aldana Cohen, Narasimha D. Rao

    Abstract: Social science often relies on surveys of households and individuals. Dozens of such surveys are regularly administered by the U.S. government. However, they field independent, unconnected samples with specialized questions, limiting research questions to those that can be answered by a single survey. The fusionACS project seeks to integrate data from multiple U.S. household surveys by statistical…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 35 pages, 6 figures

  27. arXiv:2307.09423  [pdf, other]

    cs.LG cs.AI stat.ML

    Scaling Laws for Imitation Learning in Single-Agent Games

    Authors: Jens Tuyls, Dhruv Madeka, Kari Torkkola, Dean Foster, Karthik Narasimhan, Sham Kakade

    Abstract: Imitation Learning (IL) is one of the most widely used methods in machine learning. Yet, many works find it is often unable to fully recover the underlying expert behavior, even in constrained environments like single-agent games. However, none of these works deeply investigate the role of scaling up the model and data size. Inspired by recent work in Natural Language Processing (NLP) where "scali…

    Submitted 19 December, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: Accepted at TMLR 2024

  28. arXiv:2307.04998  [pdf, other]

    cs.LG cs.AI math.ST stat.ML

    Selective Sampling and Imitation Learning via Online Regression

    Authors: Ayush Sekhari, Karthik Sridharan, Wen Sun, Runzhe Wu

    Abstract: We consider the problem of Imitation Learning (IL) by actively querying a noisy expert for feedback. While imitation learning has been empirically successful, much of prior work assumes access to noiseless expert feedback which is not practical in many applications. In fact, when one only has access to noisy expert feedback, algorithms that rely on purely offline data (non-interactive IL) can be sho…

    Submitted 10 July, 2023; originally announced July 2023.

  29. arXiv:2305.06082  [pdf, ps, other]

    cs.LG cs.AI cs.IT math.ST stat.ML

    Best Arm Identification in Bandits with Limited Precision Sampling

    Authors: Kota Srinivas Reddy, P. N. Karthik, Nikhil Karamchandani, Jayakrishnan Nair

    Abstract: We study best arm identification in a variant of the multi-armed bandit problem where the learner has limited precision in arm selection. The learner can only sample arms via certain exploration bundles, which we refer to as boxes. In particular, at each sampling epoch, the learner selects a box, which in turn causes an arm to get pulled as per a box-specific probability distribution. The pulled a…

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: ISIT 2023

  30. arXiv:2304.08740  [pdf, other]

    stat.ML cs.LG eess.SP

    Estimating Joint Probability Distribution With Low-Rank Tensor Decomposition, Radon Transforms and Dictionaries

    Authors: Pranava Singhal, Waqar Mirza, Ajit Rajwade, Karthik S. Gurumoorthy

    Abstract: In this paper, we describe a method for estimating the joint probability density from data samples by assuming that the underlying distribution can be decomposed as a mixture of product densities with few mixture components. Prior works have used such a decomposition to estimate the joint density from lower-dimensional marginals, which can be estimated more reliably with the same number of samples…

    Submitted 18 April, 2023; originally announced April 2023.

    MSC Class: 62G07

  31. Estimating Global Identifiability Using Conditional Mutual Information in a Bayesian Framework

    Authors: Sahil Bhola, Karthik Duraisamy

    Abstract: A novel information-theoretic approach is proposed to assess the global practical identifiability of Bayesian statistical models. Based on the concept of conditional mutual information, an estimate of information gained for each model parameter is used to quantify the identifiability with practical considerations. No assumptions are made about the structure of the statistical model or the prior di…

    Submitted 21 September, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

  32. arXiv:2303.04390  [pdf, other]

    stat.CO q-bio.PE

    Many-core algorithms for high-dimensional gradients on phylogenetic trees

    Authors: Karthik Gangavarapu, Xiang Ji, Guy Baele, Mathieu Fourment, Philippe Lemey, Frederick A. Matsen IV, Marc A. Suchard

    Abstract: The rapid growth in genomic pathogen data spurs the need for efficient inference techniques, such as Hamiltonian Monte Carlo (HMC) in a Bayesian framework, to estimate parameters of these phylogenetic models where the dimensions of the parameters increase with the number of sequences $N$. HMC requires repeated calculation of the gradient of the data log-likelihood with respect to (wrt) all branch-…

    Submitted 8 March, 2023; originally announced March 2023.

  33. arXiv:2211.07484  [pdf, ps, other]

    cs.LG stat.ML

    Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via Regression

    Authors: Aleksandrs Slivkins, Xingyu Zhou, Karthik Abinav Sankararaman, Dylan J. Foster

    Abstract: We consider contextual bandits with linear constraints (CBwLC), a variant of contextual bandits in which the algorithm consumes multiple resources subject to linear constraints on total consumption. This problem generalizes contextual bandits with knapsacks (CBwK), allowing for packing and covering constraints, as well as positive and negative resource consumption. We provide the first algorithm f…

    Submitted 26 November, 2024; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: A preliminary version of this paper, authored by A. Slivkins, K.A. Sankararaman and D.J. Foster, has been published at COLT 2023. The present version (since Jun'24) features an important improvement, due to Xingyu Zhou. The Oct'24 version fixes an inaccuracy in Section 6 when the analysis from Section 4 is invoked

  34. arXiv:2210.14843  [pdf, other]

    stat.ML cs.AI cs.LG

    TuneUp: A Simple Improved Training Strategy for Graph Neural Networks

    Authors: Weihua Hu, Kaidi Cao, Kexin Huang, Edward W Huang, Karthik Subbian, Kenji Kawaguchi, Jure Leskovec

    Abstract: Despite recent advances in Graph Neural Networks (GNNs), their training strategies remain largely under-explored. The conventional training strategy learns over all nodes in the original graph(s) equally, which can be sub-optimal as certain nodes are often more difficult to learn than others. Here we present TuneUp, a simple curriculum-based training strategy for improving the predictive performan…

    Submitted 26 August, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

  35. arXiv:2209.12667  [pdf, other]

    stat.ML cs.LG math.DG math.ST

    Shape And Structure Preserving Differential Privacy

    Authors: Carlos Soto, Karthik Bharath, Matthew Reimherr, Aleksandra Slavkovic

    Abstract: It is common for data structures such as images and shapes of 2D objects to be represented as points on a manifold. The utility of a mechanism to produce sanitized differentially private estimates from such data is intimately linked to how compatible it is with the underlying structure and geometry of the space. In particular, as recently shown, utility of the Laplace mechanism on a positively cur…

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: 15 pages (including supplementary material and references), 3 figures (including supplementary material), to be published in NeurIPS 2022

  36. arXiv:2208.09215  [pdf, other]

    cs.LG cs.IT math.ST stat.ML

    Almost Cost-Free Communication in Federated Best Arm Identification

    Authors: Kota Srinivas Reddy, P. N. Karthik, Vincent Y. F. Tan

    Abstract: We study the problem of best arm identification in a federated learning multi-armed bandit setup with a central server and multiple clients. Each client is associated with a multi-armed bandit in which each arm yields i.i.d. rewards following a Gaussian distribution with an unknown mean and known variance. The set of arms is assumed to be the same at all the clients. We define two notions o…

    Submitted 19 December, 2022; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: Accepted to AAAI 2023

  37. arXiv:2208.06115  [pdf, other]

    stat.ML econ.EM math.OC

    A Nonparametric Approach with Marginals for Modeling Consumer Choice

    Authors: Yanqiu Ruan, Xiaobo Li, Karthyek Murthy, Karthik Natarajan

    Abstract: Given data on the choices made by consumers for different offer sets, a key challenge is to develop parsimonious models that describe and predict consumer choice behavior while being amenable to prescriptive tasks such as pricing and assortment optimization. The marginal distribution model (MDM) is one such model, which requires only the specification of marginal distributions of the random utilit…

    Submitted 14 April, 2025; v1 submitted 12 August, 2022; originally announced August 2022.

  38. arXiv:2207.13797  [pdf, other]

    stat.ME econ.EM

    Identification and Inference with Min-over-max Estimators for the Measurement of Labor Market Fairness

    Authors: Karthik Rajkumar

    Abstract: These notes show how to do inference on the Demographic Parity (DP) metric. Although the metric is a complex statistic involving min and max computations, we propose a smooth approximation of those functions and derive its asymptotic distribution. The limit of these approximations and their gradients converge to those of the true max and min functions, wherever they exist. More importantly, when…

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: 12 pages, 3 figures

  39. arXiv:2207.10914  [pdf, other]

    stat.ME stat.CO

    Spatially Penalised Registration of Multivariate Functional Data

    Authors: Xiaohan Guo, Sebastian Kurtek, Karthik Bharath

    Abstract: Registration of multivariate functional data involves handling of both cross-component and cross-observation phase variations. Allowing for the two phase variations to be modelled as general diffeomorphic time warpings, in this work we focus on the hitherto unconsidered setting where phase variations of the component functions are spatially correlated. We propose an algorithm to optimize a metric-b…

    Submitted 22 July, 2022; originally announced July 2022.

  40. arXiv:2206.13063  [pdf, other]

    cs.LG math.OC math.ST stat.ML

    On the Complexity of Adversarial Decision Making

    Authors: Dylan J. Foster, Alexander Rakhlin, Ayush Sekhari, Karthik Sridharan

    Abstract: A central problem in online learning and decision making -- from bandits to reinforcement learning -- is to understand what modeling assumptions lead to sample-efficient learning guarantees. We consider a general adversarial decision making framework that encompasses (structured) bandit problems with adversarial rewards and reinforcement learning problems with adversarial dynamics. Our main result…

    Submitted 27 June, 2022; originally announced June 2022.

  41. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  42. arXiv:2206.03040  [pdf, other]

    stat.ML cs.IR cs.LG

    Learning Backward Compatible Embeddings

    Authors: Weihua Hu, Rajas Bansal, Kaidi Cao, Nikhil Rao, Karthik Subbian, Jure Leskovec

    Abstract: Embeddings, low-dimensional vector representations of objects, are fundamental in building modern machine learning systems. In industrial settings, there is usually an embedding team that trains an embedding model to solve intended tasks (e.g., product recommendation). The produced embeddings are then widely consumed by consumer teams to solve their unintended tasks (e.g., fraud detection). However…

    Submitted 7 June, 2022; originally announced June 2022.

    Comments: KDD 2022, Applied Data Science Track

  43. arXiv:2203.15236  [pdf, ps, other]

    stat.ML cs.IT cs.LG

    Best Arm Identification in Restless Markov Multi-Armed Bandits

    Authors: P. N. Karthik, Kota Srinivas Reddy, Vincent Y. F. Tan

    Abstract: We study the problem of identifying the best arm in a multi-armed bandit environment when each arm is a time-homogeneous and ergodic discrete-time Markov process on a common, finite state space. The state evolution on each arm is governed by the arm's transition probability matrix (TPM). A decision entity that knows the set of arm TPMs but not the exact mapping of the TPMs to the arms, wishes to f…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: 41 pages

  44. arXiv:2203.01667  [pdf, other]

    cs.LG stat.ML

    Joint Probability Estimation Using Tensor Decomposition and Dictionaries

    Authors: Shaan ul Haque, Ajit Rajwade, Karthik S. Gurumoorthy

    Abstract: In this work, we study non-parametric estimation of joint probabilities of a given set of discrete and continuous random variables from their (empirically estimated) 2D marginals, under the assumption that the joint probability could be decomposed and approximated by a mixture of product densities/mass functions. The problem of estimating the joint probability density function (PDF) using semi-par…

    Submitted 3 March, 2022; originally announced March 2022.

  45. arXiv:2203.01161  [pdf, ps, other]

    math.OC cs.CC cs.LG stat.ML

    Discrete Optimal Transport with Independent Marginals is #P-Hard

    Authors: Bahar Taşkesen, Soroosh Shafieezadeh-Abadeh, Daniel Kuhn, Karthik Natarajan

    Abstract: We study the computational complexity of the optimal transport problem that evaluates the Wasserstein distance between the distributions of two K-dimensional discrete random vectors. The best known algorithms for this problem run in polynomial time in the maximum of the number of atoms of the two distributions. However, if the components of either random vector are independent, then this number ca…

    Submitted 14 October, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  46. Probabilistic Learning of Treatment Trees in Cancer

    Authors: Tsung-Hung Yao, Zhenke Wu, Karthik Bharath, Jinju Li, Veerabhadran Baladandayuthapan

    Abstract: Accurate identification of synergistic treatment combinations and their underlying biological mechanisms is critical across many disease domains, especially cancer. In translational oncology research, preclinical systems such as patient-derived xenografts (PDX) have emerged as a unique study design evaluating multiple treatments administered to samples from the same human tumor implanted into gene…

    Submitted 23 January, 2022; originally announced January 2022.

  47. arXiv:2111.02516  [pdf, other]

    math.ST math.DG stat.ML

    Differential Privacy Over Riemannian Manifolds

    Authors: Matthew Reimherr, Karthik Bharath, Carlos Soto

    Abstract: In this work we consider the problem of releasing a differentially private statistical summary that resides on a Riemannian manifold. We present an extension of the Laplace or K-norm mechanism that utilizes intrinsic distances and volumes on the manifold. We also consider in detail the specific case where the summary is the Fréchet mean of data residing on a manifold. We demonstrate that our mecha…

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: 15 pages (including supplementary material and references), 2 figures (including supplementary material), published in NeurIPS

  48. arXiv:2109.01965  [pdf, other]

    stat.ML cs.LG

    Scalable Feature Selection for (Multitask) Gradient Boosted Trees

    Authors: Cuize Han, Nikhil Rao, Daria Sorokina, Karthik Subbian

    Abstract: Gradient Boosted Decision Trees (GBDTs) are widely used for building ranking and relevance models in search and recommendation. Considerations such as latency and interpretability dictate the use of as few features as possible to train these models. Feature selection in GBDT models typically involves heuristically ranking the features by importance and selecting the top few, or by performing a ful…

    Submitted 4 September, 2021; originally announced September 2021.

    Comments: Correct a mistake in the proof of Lemma B1 in http://proceedings.mlr.press/v108/han20a.html

    Journal ref: Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, PMLR 108:885-894, 2020

  49. arXiv:2106.15436  [pdf, other]

    stat.ME

    Topo-Geometric Analysis of Variability in Point Clouds using Persistence Landscapes

    Authors: James Matuk, Sebastian Kurtek, Karthik Bharath

    Abstract: Topological data analysis provides a set of tools to uncover low-dimensional structure in noisy point clouds. Prominent amongst the tools is persistence homology, which summarizes birth-death times of homological features using data objects known as persistence diagrams. To better aid statistical analysis, a functional representation of the diagrams, known as persistence landscapes, enable use of…

    Submitted 1 February, 2024; v1 submitted 29 June, 2021; originally announced June 2021.

  50. arXiv:2106.11880  [pdf, other]

    cs.LG stat.ML

    Dynamic Customer Embeddings for Financial Service Applications

    Authors: Nima Chitsazan, Samuel Sharpe, Dwipam Katariya, Qianyu Cheng, Karthik Rajasethupathy

    Abstract: As financial services (FS) companies have experienced drastic technology driven changes, the availability of new data streams provides the opportunity for more comprehensive customer understanding. We propose Dynamic Customer Embeddings (DCE), a framework that leverages customers' digital activity and a wide range of financial context to learn dense representations of customers in the FS industry.…

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: ICML Workshop on Representation Learning for Finance and E-Commerce Applications