Showing 1–13 of 13 results for author: Patel, B

Searching in archive stat.
  1. arXiv:2302.03750  [pdf, other]

    cs.CV cs.LG stat.ME

    Linking convolutional kernel size to generalization bias in face analysis CNNs

    Authors: Hao Liang, Josue Ortega Caro, Vikram Maheshri, Ankit B. Patel, Guha Balakrishnan

    Abstract: Training dataset biases are by far the most scrutinized factors when explaining algorithmic biases of neural networks. In contrast, hyperparameters related to the neural network architecture have largely been ignored even though different network parameterizations are known to induce different implicit biases over learned features. For example, convolutional kernel size is known to affect the freq…

    Submitted 3 December, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: WACV 2024

  2. arXiv:2301.12083  [pdf, other]

    cs.LG math.OC stat.ML

    Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic

    Authors: Wesley A. Suttle, Amrit Singh Bedi, Bhrij Patel, Brian M. Sadler, Alec Koppel, Dinesh Manocha

    Abstract: Many existing reinforcement learning (RL) methods employ stochastic gradient iteration on the back end, whose stability hinges upon a hypothesis that the data-generating process mixes exponentially fast with a rate parameter that appears in the step-size selection. Unfortunately, this assumption is violated for large state spaces or settings with sparse rewards, and the mixing time is unknown, mak…

    Submitted 1 February, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  3. arXiv:2210.10964  [pdf, other]

    cs.LG stat.ML

    Uncertainty Disentanglement with Non-stationary Heteroscedastic Gaussian Processes for Active Learning

    Authors: Zeel B Patel, Nipun Batra, Kevin Murphy

    Abstract: Gaussian processes are Bayesian non-parametric models used in many areas. In this work, we propose a Non-stationary Heteroscedastic Gaussian process model which can be learned with gradient-based techniques. We demonstrate the interpretability of the proposed model by separating the overall uncertainty into aleatoric (irreducible) and epistemic (model) uncertainty. We illustrate the usability of d…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted at NeurIPS Workshop on Gaussian Processes, Spatiotemporal Modeling, and Decision-making Systems, 2022

  4. arXiv:2103.06813  [pdf, other]

    stat.AP cs.CE q-bio.PE

    COVID-19: Optimal Allocation of Ventilator Supply under Uncertainty and Risk

    Authors: Xuecheng Yin, I. Esra Buyuktahtakin, Bhumi P. Patel

    Abstract: This study presents a new risk-averse multi-stage stochastic epidemics-ventilator-logistics compartmental model to address the resource allocation challenges of mitigating COVID-19. This epidemiological logistics model involves the uncertainty of untested asymptomatic infections and incorporates short-term human migration. Disease transmission is also forecasted through a new formulation of transm…

    Submitted 9 March, 2021; originally announced March 2021.

    Comments: 35 pages, 6 figures, 10 tables, Under Review for a Journal

  5. arXiv:2102.01147  [pdf]

    cs.LG stat.AP

    Real-time Prediction for Mechanical Ventilation in COVID-19 Patients using A Multi-task Gaussian Process Multi-objective Self-attention Network

    Authors: Kai Zhang, Siddharth Karanth, Bela Patel, Robert Murphy, Xiaoqian Jiang

    Abstract: We propose a robust in-time predictor for in-hospital COVID-19 patient's probability of requiring mechanical ventilation. A challenge in the risk prediction for COVID-19 patients lies in the great variability and irregular sampling of patient's vitals and labs observed in the clinical setting. Existing methods have strong limitations in handling time-dependent features' complex dynamics, either ov…

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: In review

  6. arXiv:2006.07460  [pdf, other]

    cs.LG stat.ML

    An Improved Semi-Supervised VAE for Learning Disentangled Representations

    Authors: Weili Nie, Zichao Wang, Ankit B. Patel, Richard G. Baraniuk

    Abstract: Learning interpretable and disentangled representations is a crucial yet challenging task in representation learning. In this work, we focus on semi-supervised disentanglement learning and extend work by Locatello et al. (2019) by introducing another source of supervision that we denote as label replacement. Specifically, during training, we replace the inferred representation associated with a da…

    Submitted 22 June, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

  7. arXiv:2005.04176  [pdf, other]

    stat.ML cs.LG stat.AP

    In Pursuit of Interpretable, Fair and Accurate Machine Learning for Criminal Recidivism Prediction

    Authors: Caroline Wang, Bin Han, Bhrij Patel, Cynthia Rudin

    Abstract: Objectives: We study interpretable recidivism prediction using machine learning (ML) models and analyze performance in terms of prediction ability, sparsity, and fairness. Unlike previous works, this study trains interpretable models that output probabilities rather than binary predictions, and uses quantitative fairness definitions to assess the models. This study also examines whether models can…

    Submitted 11 March, 2022; v1 submitted 8 May, 2020; originally announced May 2020.

  8. arXiv:2003.07977  [pdf, other]

    eess.IV cs.LG stat.ML

    Assessing Robustness to Noise: Low-Cost Head CT Triage

    Authors: Sarah M. Hooper, Jared A. Dunnmon, Matthew P. Lungren, Sanjiv Sam Gambhir, Christopher Ré, Adam S. Wang, Bhavik N. Patel

    Abstract: Automated medical image classification with convolutional neural networks (CNNs) has great potential to impact healthcare, particularly in resource-constrained healthcare systems where fewer trained radiologists are available. However, little is known about how well a trained CNN can perform on images with the increased noise levels, different acquisition protocols, or additional artifacts that ma…

    Submitted 28 March, 2020; v1 submitted 17 March, 2020; originally announced March 2020.

    Comments: AI for Affordable Healthcare Workshop at ICLR 2020. First two authors have equal contribution; last two authors have equal contribution. Revision made to manuscript header according to workshop guidelines on 3/28/20

  9. arXiv:1812.04118  [pdf]

    cs.IR cs.LG stat.ML

    Montage based 3D Medical Image Retrieval from Traumatic Brain Injury Cohort using Deep Convolutional Neural Network

    Authors: Cailey I. Kerley, Yuankai Huo, Shikha Chaganti, Shunxing Bao, Mayur B. Patel, Bennett A. Landman

    Abstract: Brain imaging analysis on clinically acquired computed tomography (CT) is essential for the diagnosis, risk prediction of progression, and treatment of the structural phenotypes of traumatic brain injury (TBI). However, in real clinical imaging scenarios, entire body CT images (e.g., neck, abdomen, chest, pelvis) are typically captured along with whole brain CT scans. For instance, in a typical sa…

    Submitted 10 December, 2018; originally announced December 2018.

    Comments: Accepted for SPIE: Medical Imaging 2019

  10. arXiv:1612.01942  [pdf, other]

    stat.ML cs.LG cs.NE

    Semi-Supervised Learning with the Deep Rendering Mixture Model

    Authors: Tan Nguyen, Wanjia Liu, Ethan Perez, Richard G. Baraniuk, Ankit B. Patel

    Abstract: Semi-supervised learning algorithms reduce the high cost of acquiring labeled training data by using both labeled and unlabeled data during learning. Deep Convolutional Networks (DCNs) have achieved great success in supervised tasks and as such have been widely employed in the semi-supervised learning. In this paper we leverage the recently developed Deep Rendering Mixture Model (DRMM), a probabil…

    Submitted 6 December, 2016; originally announced December 2016.

  11. arXiv:1612.01936  [pdf, other]

    stat.ML cs.LG cs.NE

    A Probabilistic Framework for Deep Learning

    Authors: Ankit B. Patel, Tan Nguyen, Richard G. Baraniuk

    Abstract: We develop a probabilistic framework for deep learning based on the Deep Rendering Mixture Model (DRMM), a new generative probabilistic model that explicitly captures variations in data due to latent task nuisance variables. We demonstrate that max-sum inference in the DRMM yields an algorithm that exactly reproduces the operations in deep convolutional neural networks (DCNs), providing a first pri…

    Submitted 6 December, 2016; originally announced December 2016.

    Comments: arXiv admin note: substantial text overlap with arXiv:1504.00641

  12. A Deep Learning Approach to Structured Signal Recovery

    Authors: Ali Mousavi, Ankit B. Patel, Richard G. Baraniuk

    Abstract: In this paper, we develop a new framework for sensing and recovering structured signals. In contrast to compressive sensing (CS) systems that employ linear measurements, sparse representations, and computationally complex convex/greedy algorithms, we introduce a deep learning framework that supports both linear and mildly nonlinear measurements, that learns a structured representation from trainin…

    Submitted 17 August, 2015; originally announced August 2015.

    Journal ref: In Proceedings of the 2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton)

  13. arXiv:1504.00641  [pdf, other]

    stat.ML cs.CV cs.LG cs.NE

    A Probabilistic Theory of Deep Learning

    Authors: Ankit B. Patel, Tan Nguyen, Richard G. Baraniuk

    Abstract: A grand challenge in machine learning is the development of computational algorithms that match or outperform humans in perceptual inference tasks that are complicated by nuisance variation. For instance, visual object recognition involves the unknown object position, orientation, and scale in object recognition while speech recognition involves the unknown voice pronunciation, pitch, and speed. R…

    Submitted 2 April, 2015; originally announced April 2015.

    Comments: 56 pages, 6 figures, 2 tables

    Report number: Rice University Electrical and Computer Engineering Dept. Technical Report No 2015-1