Skip to main content

Showing 1–19 of 19 results for author: Agarwal, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.07647  [pdf, ps, other

    math.PR stat.ML

    Langevin Diffusion Approximation to Same Marginal Schrödinger Bridge

    Authors: Medha Agarwal, Zaid Harchaoui, Garrett Mulcahy, Soumik Pal

    Abstract: We introduce a novel approximation to the same marginal Schrödinger bridge using the Langevin diffusion. As $\varepsilon \downarrow 0$, it is known that the barycentric projection (also known as the entropic Brenier map) of the Schrödinger bridge converges to the Brenier map, which is the identity. Our diffusion approximation is leveraged to show that, under suitable assumptions, the difference be… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: These results have been extracted from the first half of arXiv:2406.10823, where they first appeared. The rest of arXiv:2406.10823 will be modified to reflect this change

    MSC Class: 49N99; 49Q22; 60J60

  2. arXiv:2504.05364  [pdf, other

    cs.SD cs.AI cs.LG stat.ML

    Of All StrIPEs: Investigating Structure-informed Positional Encoding for Efficient Music Generation

    Authors: Manvi Agarwal, Changhong Wang, Gael Richard

    Abstract: While music remains a challenging domain for generative models like Transformers, a two-pronged approach has recently proved successful: inserting musically-relevant structural information into the positional encoding (PE) module and using kernel approximation techniques based on Random Fourier Features (RFF) to lower the computational cost from quadratic to linear. Yet, it is not clear how such R… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

  3. arXiv:2406.10823  [pdf, other

    math.PR stat.ML

    Iterated Schrödinger bridge approximation to Wasserstein Gradient Flows

    Authors: Medha Agarwal, Zaid Harchaoui, Garrett Mulcahy, Soumik Pal

    Abstract: We introduce a novel discretization scheme for Wasserstein gradient flows that involves successively computing Schrödinger bridges with the same marginals. This is different from both the forward/geodesic approximation and the backward/Jordan-Kinderlehrer-Otto (JKO) approximations. The proposed scheme has two advantages: one, it avoids the use of the score function, and, two, it is amenable to par… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 36 pages, 1 figure

    MSC Class: 49N99; 49Q22; 60J60

  4. arXiv:2310.01583  [pdf, other

    stat.ML cs.LG

    An Investigation of Representation and Allocation Harms in Contrastive Learning

    Authors: Subha Maity, Mayank Agarwal, Mikhail Yurochkin, Yuekai Sun

    Abstract: The effect of underrepresentation on the performance of minority groups is known to be a serious problem in supervised learning settings; however, it has been underexplored so far in the context of self-supervised learning (SSL). In this paper, we demonstrate that contrastive learning (CL), a popular variant of SSL, tends to collapse representations of minority groups with certain majority groups.… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  5. arXiv:2205.08234  [pdf, other

    cs.LG cs.AI stat.ML

    Delaytron: Efficient Learning of Multiclass Classifiers with Delayed Bandit Feedbacks

    Authors: Naresh Manwani, Mudit Agarwal

    Abstract: In this paper, we present online algorithm called {\it Delaytron} for learning multi class classifiers using delayed bandit feedbacks. The sequence of feedback delays $\{d_t\}_{t=1}^T$ is unknown to the algorithm. At the $t$-th round, the algorithm observes an example $\mathbf{x}_t$ and predicts a label $\tilde{y}_t$ and receives the bandit feedback $\mathbb{I}[\tilde{y}_t=y_t]$ only $d_t$ rounds… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

  6. arXiv:2201.05935  [pdf, other

    math.OC stat.CO stat.ME

    Quasi-Newton acceleration of EM and MM algorithms via Broyden$'$s method

    Authors: Medha Agarwal, Jason Xu

    Abstract: The principle of majorization-minimization (MM) provides a general framework for eliciting effective algorithms to solve optimization problems. However, they often suffer from slow convergence, especially in large-scale and high-dimensional data settings. This has drawn attention to acceleration schemes designed exclusively for MM algorithms, but many existing designs are either problem-specific o… ▽ More

    Submitted 15 January, 2022; originally announced January 2022.

    Comments: 41 pages, 7 pages appendix, 4 figures, 7 tables; associated code for examples can be found at https://github.com/medhaaga/Quasi-Newton-accelerated-MM; submitted to Journal of Computational and Graphical Statistics; for R package (GitHub dev version) implementing the method, see https://github.com/medhaaga/quasiNewtonMM

  7. arXiv:2108.13289  [pdf, other

    stat.CO stat.ME

    A principled stopping rule for importance sampling

    Authors: Medha Agarwal, Dootika Vats, Víctor Elvira

    Abstract: Importance sampling (IS) is a Monte Carlo technique that relies on weighted samples, simulated from a proposal distribution, to estimate intractable integrals. The quality of the estimators improves with the number of samples. However, for achieving a desired quality of estimation, the required number of samples is unknown and depends on the quantity of interest, the estimator, and the chosen prop… ▽ More

    Submitted 14 July, 2022; v1 submitted 30 August, 2021; originally announced August 2021.

  8. arXiv:2011.07687  [pdf, ps, other

    cs.LG stat.ML

    DART: aDaptive Accept RejecT for non-linear top-K subset identification

    Authors: Mridul Agarwal, Vaneet Aggarwal, Christopher J. Quinn, Abhishek Umrawal

    Abstract: We consider the bandit problem of selecting $K$ out of $N$ arms at each time step. The reward can be a non-linear function of the rewards of the selected individual arms. The direct use of a multi-armed bandit algorithm requires choosing among $\binom{N}{K}$ options, making the action space large. To simplify the problem, existing works on combinatorial bandits {typically} assume feedback as a lin… ▽ More

    Submitted 15 November, 2020; originally announced November 2020.

    Journal ref: AAAI 2021

  9. arXiv:2010.12574  [pdf, other

    cs.LG stat.ML

    Online Semi-Supervised Learning with Bandit Feedback

    Authors: Sohini Upadhyay, Mikhail Yurochkin, Mayank Agarwal, Yasaman Khazaeni, DjallelBouneffouf

    Abstract: We formulate a new problem at the intersectionof semi-supervised learning and contextual bandits,motivated by several applications including clini-cal trials and ad recommendations. We demonstratehow Graph Convolutional Network (GCN), a semi-supervised learning approach, can be adjusted tothe new problem formulation. We also propose avariant of the linear contextual bandit with semi-supervised mis… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

  10. arXiv:2009.01799  [pdf, other

    stat.CO stat.ME

    Globally-centered autocovariances in MCMC

    Authors: Medha Agarwal, Dootika Vats

    Abstract: Autocovariances are a fundamental quantity of interest in Markov chain Monte Carlo (MCMC) simulations with autocorrelation function (ACF) plots being an integral visualization tool for performance assessment. Unfortunately, for slow-mixing Markov chains, the empirical autocovariance can highly underestimate the truth. For multiple-chain MCMC sampling, we propose a globally-centered estimator of th… ▽ More

    Submitted 18 October, 2021; v1 submitted 3 September, 2020; originally announced September 2020.

  11. Learning Multiclass Classifier Under Noisy Bandit Feedback

    Authors: Mudit Agarwal, Naresh Manwani

    Abstract: This paper addresses the problem of multiclass classification with corrupted or noisy bandit feedback. In this setting, the learner may not receive true feedback. Instead, it receives feedback that has been flipped with some non-zero probability. We propose a novel approach to deal with noisy bandit feedback based on the unbiased estimator technique. We further offer a method that can efficiently… ▽ More

    Submitted 3 March, 2021; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: 17 pages, 6 figures 1 Table

    Journal ref: Pacific-Asia Conference on Knowledge Discovery and Data Mining,2021

  12. arXiv:1911.00218  [pdf, other

    stat.ML cs.LG

    Statistical Model Aggregation via Parameter Matching

    Authors: Mikhail Yurochkin, Mayank Agarwal, Soumya Ghosh, Kristjan Greenewald, Trong Nghia Hoang

    Abstract: We consider the problem of aggregating models learned from sequestered, possibly heterogeneous datasets. Exploiting tools from Bayesian nonparametrics, we develop a general meta-modeling framework that learns shared global latent structures by identifying correspondences among local model parameterizations. Our proposed framework is model-independent and is applicable to a wide range of model type… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

    Comments: NeurIPS 2019

  13. arXiv:1910.01277  [pdf, ps, other

    math.OC cs.LG stat.ML

    Escaping Saddle Points for Zeroth-order Nonconvex Optimization using Estimated Gradient Descent

    Authors: Qinbo Bai, Mridul Agarwal, Vaneet Aggarwal

    Abstract: Gradient descent and its variants are widely used in machine learning. However, oracle access of gradient may not be available in many applications, limiting the direct use of gradient descent. This paper proposes a method of estimating gradient to perform gradient descent, that converges to a stationary point for general non-convex optimization problems. Beyond the first-order stationary properti… ▽ More

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: arXiv admin note: text overlap with arXiv:1703.00887 by other authors

  14. arXiv:1909.02940  [pdf, ps, other

    cs.LG cs.AI cs.GT cs.IT cs.MA stat.ML

    Reinforcement Learning for Joint Optimization of Multiple Rewards

    Authors: Mridul Agarwal, Vaneet Aggarwal

    Abstract: Finding optimal policies which maximize long term rewards of Markov Decision Processes requires the use of dynamic programming and backward induction to solve the Bellman optimality equation. However, many real-world problems require optimization of an objective that is non-linear in cumulative rewards for which dynamic programming cannot be applied directly. For example, in a resource allocation… ▽ More

    Submitted 9 January, 2023; v1 submitted 6 September, 2019; originally announced September 2019.

    Comments: Accepted JMLR, Jul 2022

    Journal ref: JMLR, 2022

  15. arXiv:1905.13357  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Reinforcement Learning for Mean Field Game

    Authors: Mridul Agarwal, Vaneet Aggarwal, Arnob Ghosh, Nilay Tiwari

    Abstract: Stochastic games provide a framework for interactions among multiple agents and enable a myriad of applications. In these games, agents decide on actions simultaneously, the state of every agent moves to the next state, and each agent receives a reward. However, finding an equilibrium (if exists) in this game is often difficult when the number of agents becomes large. This paper focuses on finding… ▽ More

    Submitted 8 October, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

  16. arXiv:1905.12022  [pdf, other

    stat.ML cs.LG

    Bayesian Nonparametric Federated Learning of Neural Networks

    Authors: Mikhail Yurochkin, Mayank Agarwal, Soumya Ghosh, Kristjan Greenewald, Trong Nghia Hoang, Yasaman Khazaeni

    Abstract: In federated learning problems, data is scattered across different servers and exchanging or pooling it is often impractical or prohibited. We develop a Bayesian nonparametric framework for federated learning with neural networks. Each data server is assumed to provide local neural network weights, which are modeled through our framework. We then develop an inference approach that allows us to syn… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

    Comments: ICML 2019

  17. arXiv:1811.11925  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Stochastic Top-$K$ Subset Bandits with Linear Space and Non-Linear Feedback

    Authors: Mridul Agarwal, Vaneet Aggarwal, Christopher J. Quinn, Abhishek K. Umrawal

    Abstract: Many real-world problems like Social Influence Maximization face the dilemma of choosing the best $K$ out of $N$ options at a given time instant. This setup can be modeled as a combinatorial bandit which chooses $K$ out of $N$ arms at each time, with an aim to achieve an efficient trade-off between exploration and exploitation. This is the first work for combinatorial bandits where the feedback re… ▽ More

    Submitted 11 October, 2021; v1 submitted 28 November, 2018; originally announced November 2018.

    Comments: 38 pages, 4 figures, 32nd International Conference on Algorithmic Learning Theory

  18. arXiv:1811.02629  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko , et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem… ▽ More

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge

  19. Curriculum Guidelines for Undergraduate Programs in Data Science

    Authors: Richard De Veaux, Mahesh Agarwal, Maia Averett, Benjamin Baumer, Andrew Bray, Thomas Bressoud, Lance Bryant, Lei Cheng, Amanda Francis, Robert Gould, Albert Y. Kim, Matt Kretchmar, Qin Lu, Ann Moskol, Deborah Nolan, Roberto Pelayo, Sean Raleigh, Ricky J. Sethi, Mutiara Sondjaja, Neelesh Tiruviluamala, Paul Uhlig, Talitha Washington, Curtis Wesley, David White, Ping Ye

    Abstract: The Park City Math Institute (PCMI) 2016 Summer Undergraduate Faculty Program met for the purpose of composing guidelines for undergraduate programs in Data Science. The group consisted of 25 undergraduate faculty from a variety of institutions in the U.S., primarily from the disciplines of mathematics, statistics and computer science. These guidelines are meant to provide some structure for insti… ▽ More

    Submitted 21 January, 2018; originally announced January 2018.

    Journal ref: Annual Review of Statistics, Volume 4 (2017), 15-30