Skip to main content

Showing 1–10 of 10 results for author: George, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.00355  [pdf, other

    cs.LG stat.ML

    Sampling in High-Dimensions using Stochastic Interpolants and Forward-Backward Stochastic Differential Equations

    Authors: Anand Jerry George, Nicolas Macris

    Abstract: We present a class of diffusion-based algorithms to draw samples from high-dimensional probability distributions given their unnormalized densities. Ideally, our methods can transport samples from a Gaussian distribution to a specified target distribution in finite time. Our approach relies on the stochastic interpolants framework to define a time-indexed collection of probability densities that b… ▽ More

    Submitted 1 February, 2025; originally announced February 2025.

    Comments: 8 pages

  2. arXiv:2502.00336  [pdf, other

    cs.LG stat.ML

    Denoising Score Matching with Random Features: Insights on Diffusion Models from Precise Learning Curves

    Authors: Anand Jerry George, Rodrigo Veiga, Nicolas Macris

    Abstract: We derive asymptotically precise expressions for test and train errors of denoising score matching (DSM) in generative diffusion models. The score function is parameterized by random features neural networks, with the target distribution being $d$-dimensional standard Gaussian. We operate in a regime where the dimension $d$, number of data samples $n$, and number of features $p$ tend to infinity w… ▽ More

    Submitted 1 February, 2025; originally announced February 2025.

    Comments: 8 pages

  3. arXiv:2409.07652  [pdf, other

    stat.ML cs.LG math.ST

    Gaussian Process Upper Confidence Bounds in Distributed Point Target Tracking over Wireless Sensor Networks

    Authors: Xingchi Liu, Lyudmila Mihaylova, Jemin George, Tien Pham

    Abstract: Uncertainty quantification plays a key role in the development of autonomous systems, decision-making, and tracking over wireless sensor networks (WSNs). However, there is a need of providing uncertainty confidence bounds, especially for distributed machine learning-based tracking, dealing with different volumes of data collected by sensors. This paper aims to fill in this gap and proposes a distr… ▽ More

    Submitted 11 September, 2024; originally announced September 2024.

  4. arXiv:2205.07488  [pdf, other

    cs.IT cs.LG math.ST stat.ML

    Robust Testing in High-Dimensional Sparse Models

    Authors: Anand Jerry George, Clément L. Canonne

    Abstract: We consider the problem of robustly testing the norm of a high-dimensional sparse signal vector under two different observation models. In the first model, we are given $n$ i.i.d. samples from the distribution $\mathcal{N}\left(θ,I_d\right)$ (with unknown $θ$), of which a small fraction has been arbitrarily corrupted. Under the promise that $\|θ\|_0\le s$, we want to correctly distinguish whether… ▽ More

    Submitted 4 November, 2022; v1 submitted 16 May, 2022; originally announced May 2022.

    Comments: Fixed typos, added a figure and discussion section

  5. arXiv:2101.06453  [pdf, other

    stat.CO cs.IT stat.ML

    An MCMC Method to Sample from Lattice Distributions

    Authors: Anand Jerry George, Navin Kashyap

    Abstract: We introduce a Markov Chain Monte Carlo (MCMC) algorithm to generate samples from probability distributions supported on a $d$-dimensional lattice $Λ= \mathbf{B}\mathbb{Z}^d$, where $\mathbf{B}$ is a full-rank matrix. Specifically, we consider lattice distributions $P_Λ$ in which the probability at a lattice point is proportional to a given probability density function, $f$, evaluated at that poin… ▽ More

    Submitted 26 January, 2021; v1 submitted 16 January, 2021; originally announced January 2021.

    Comments: 11 pages, 7 figures

  6. arXiv:2007.06799  [pdf, other

    stat.ML cs.LG math.ST

    A Decentralized Approach to Bayesian Learning

    Authors: Anjaly Parayil, He Bai, Jemin George, Prudhvi Gurram

    Abstract: Motivated by decentralized approaches to machine learning, we propose a collaborative Bayesian learning algorithm taking the form of decentralized Langevin dynamics in a non-convex setting. Our analysis show that the initial KL-divergence between the Markov Chain and the target posterior distribution is exponentially decreasing while the error contributions to the overall KL-divergence from the ad… ▽ More

    Submitted 9 January, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: 42 pages, 37 figures

  7. arXiv:2005.07041  [pdf, other

    cs.LG cs.DC stat.ML

    SQuARM-SGD: Communication-Efficient Momentum SGD for Decentralized Optimization

    Authors: Navjot Singh, Deepesh Data, Jemin George, Suhas Diggavi

    Abstract: In this paper, we propose and analyze SQuARM-SGD, a communication-efficient algorithm for decentralized training of large-scale machine learning models over a network. In SQuARM-SGD, each node performs a fixed number of local SGD steps using Nesterov's momentum and then sends sparsified and quantized updates to its neighbors regulated by a locally computable triggering criterion. We provide conver… ▽ More

    Submitted 11 October, 2021; v1 submitted 12 May, 2020; originally announced May 2020.

    Comments: 58 pages, 8 figures

  8. arXiv:1910.14280  [pdf, other

    stat.ML cs.DC cs.LG math.OC

    SPARQ-SGD: Event-Triggered and Compressed Communication in Decentralized Stochastic Optimization

    Authors: Navjot Singh, Deepesh Data, Jemin George, Suhas Diggavi

    Abstract: In this paper, we propose and analyze SPARQ-SGD, which is an event-triggered and compressed algorithm for decentralized training of large-scale machine learning models. Each node can locally compute a condition (event) which triggers a communication where quantized and sparsified local model parameters are sent. In SPARQ-SGD each node takes at least a fixed number ($H$) of local gradient steps and… ▽ More

    Submitted 24 February, 2020; v1 submitted 31 October, 2019; originally announced October 2019.

    Comments: 41 pages, 4 figures

  9. arXiv:1906.10163  [pdf

    stat.AP

    Assessing the Validity of a a priori Patient-Trial Generalizability Score using Real-world Data from a Large Clinical Data Research Network: A Colorectal Cancer Clinical Trial Case Study

    Authors: Qian Li, Zhe He, Yi Guo, Hansi Zhang, Thomas J George Jr, William Hogan, Neil Charness, Jiang Bian

    Abstract: Existing trials had not taken enough consideration of their population representativeness, which can lower the effectiveness when the treatment is applied in real-world clinical practice. We analyzed the eligibility criteria of Bevacizumab colorectal cancer treatment trials, assessed their a priori generalizability, and examined how it affects patient outcomes when applied in real-world clinical s… ▽ More

    Submitted 24 June, 2019; originally announced June 2019.

  10. arXiv:1708.01198  [pdf, other

    cs.CV stat.CO stat.ML

    Estimating speech from lip dynamics

    Authors: Jithin Donny George, Ronan Keane, Conor Zellmer

    Abstract: The goal of this project is to develop a limited lip reading algorithm for a subset of the English language. We consider a scenario in which no audio information is available. The raw video is processed and the position of the lips in each frame is extracted. We then prepare the lip data for processing and classify the lips into visemes and phonemes. Hidden Markov Models are used to predict the wo… ▽ More

    Submitted 3 August, 2017; originally announced August 2017.