Skip to main content

Showing 1–9 of 9 results for author: Guha, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2302.02092  [pdf, other

    cs.LG stat.ML

    Interpolation for Robust Learning: Data Augmentation on Wasserstein Geodesics

    Authors: Jiacheng Zhu, Jielin Qiu, Aritra Guha, Zhuolin Yang, Xuanlong Nguyen, Bo Li, Ding Zhao

    Abstract: We propose to study and promote the robustness of a model as per its performance through the interpolation of training data distributions. Specifically, (1) we augment the data by finding the worst-case Wasserstein barycenter on the geodesic connecting subpopulation distributions of different categories. (2) We regularize the model for smoother performance on the continuous geodesic path connectin… ▽ More

    Submitted 28 August, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: 34 pages, 3 figures, 18 tables

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:43129-43157, 2023

  2. arXiv:2211.04568  [pdf, ps, other

    stat.AP cs.CY cs.LG

    Towards Algorithmic Fairness in Space-Time: Filling in Black Holes

    Authors: Cheryl Flynn, Aritra Guha, Subhabrata Majumdar, Divesh Srivastava, Zhengyi Zhou

    Abstract: New technologies and the availability of geospatial data have drawn attention to spatio-temporal biases present in society. For example: the COVID-19 pandemic highlighted disparities in the availability of broadband service and its role in the digital divide; the environmental justice movement in the United States has raised awareness to health implications for minority populations stemming from h… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

  3. arXiv:2102.07695  [pdf, other

    stat.ML cs.LG stat.ME

    Scalable nonparametric Bayesian learning for heterogeneous and dynamic velocity fields

    Authors: Sunrit Chakraborty, Aritra Guha, Rayleigh Lei, XuanLong Nguyen

    Abstract: Analysis of heterogeneous patterns in complex spatio-temporal data finds usage across various domains in applied science and engineering, including training autonomous vehicles to navigate in complex traffic scenarios. Motivated by applications arising in the transportation domain, in this paper we develop a model for learning heterogeneous and dynamic patterns of velocity field data. We draw from… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

    Comments: 5 tables, 8 figures

  4. arXiv:2102.03895  [pdf, other

    stat.ML cs.LG stat.AP

    Functional optimal transport: map estimation and domain adaptation for functional data

    Authors: Jiacheng Zhu, Aritra Guha, Dat Do, Mengdi Xu, XuanLong Nguyen, Ding Zhao

    Abstract: We introduce a formulation of optimal transport problem for distributions on function spaces, where the stochastic map between functional domains can be partially represented in terms of an (infinite-dimensional) Hilbert-Schmidt operator mapping a Hilbert space of functions to another. For numerous machine learning tasks, data can be naturally viewed as samples drawn from spaces of functions, such… ▽ More

    Submitted 28 August, 2023; v1 submitted 7 February, 2021; originally announced February 2021.

    Comments: 48 pages, 10 figures, 3 tables

  5. arXiv:2012.07363  [pdf, other

    stat.ME

    Outlier-Robust Optimal Transport

    Authors: Debarghya Mukherjee, Aritra Guha, Justin Solomon, Yuekai Sun, Mikhail Yurochkin

    Abstract: Optimal transport (OT) measures distances between distributions in a way that depends on the geometry of the sample space. In light of recent advances in computational OT, OT distances are widely used as loss functions in machine learning. Despite their prevalence and advantages, OT loss functions can be extremely sensitive to outliers. In fact, a single adversarially-picked outlier can increase t… ▽ More

    Submitted 20 June, 2021; v1 submitted 14 December, 2020; originally announced December 2020.

    Comments: Accepted in Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021

  6. arXiv:2006.10241  [pdf, other

    cs.LG cs.MA cs.RO stat.AP stat.ML

    Robust Unsupervised Learning of Temporal Dynamic Interactions

    Authors: Aritra Guha, Rayleigh Lei, Jiacheng Zhu, XuanLong Nguyen, Ding Zhao

    Abstract: Robust representation learning of temporal dynamic interactions is an important problem in robotic learning in general and automated unsupervised learning in particular. Temporal dynamic interactions can be described by (multiple) geometric trajectories in a suitable space over which unsupervised learning techniques may be applied to extract useful features from raw and high-dimensional data measu… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

  7. arXiv:1905.11009  [pdf, other

    stat.ML cs.LG

    Dirichlet Simplex Nest and Geometric Inference

    Authors: Mikhail Yurochkin, Aritra Guha, Yuekai Sun, XuanLong Nguyen

    Abstract: We propose Dirichlet Simplex Nest, a class of probabilistic models suitable for a variety of data types, and develop fast and provably accurate inference algorithms by accounting for the model's convex geometry and low dimensional simplicial structure. By exploiting the connection to Voronoi tessellation and properties of Dirichlet distribution, the proposed inference algorithm is shown to achieve… ▽ More

    Submitted 27 May, 2019; originally announced May 2019.

    Comments: ICML 2019

  8. arXiv:1809.08738  [pdf, other

    stat.ML cs.CL cs.LG

    Scalable inference of topic evolution via models for latent geometric structures

    Authors: Mikhail Yurochkin, Zhiwei Fan, Aritra Guha, Paraschos Koutris, XuanLong Nguyen

    Abstract: We develop new models and algorithms for learning the temporal dynamics of the topic polytopes and related geometric objects that arise in topic model based inference. Our model is nonparametric Bayesian and the corresponding inference algorithm is able to discover new topics as the time progresses. By exploiting the connection between the modeling of topic polytope evolution, Beta-Bernoulli proce… ▽ More

    Submitted 1 November, 2019; v1 submitted 23 September, 2018; originally announced September 2018.

    Comments: NeurIPS 2019

  9. arXiv:1710.02952  [pdf, other

    stat.ML

    Conic Scan-and-Cover algorithms for nonparametric topic modeling

    Authors: Mikhail Yurochkin, Aritra Guha, XuanLong Nguyen

    Abstract: We propose new algorithms for topic modeling when the number of topics is unknown. Our approach relies on an analysis of the concentration of mass and angular geometry of the topic simplex, a convex polytope constructed by taking the convex hull of vertices representing the latent topics. Our algorithms are shown in practice to have accuracy comparable to a Gibbs sampler in terms of topic estimati… ▽ More

    Submitted 9 October, 2017; originally announced October 2017.