Skip to main content

Showing 1–36 of 36 results for author: Gupta, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2411.18234  [pdf

    cs.LG cs.AI cs.PF stat.CO

    Randomized-Grid Search for Hyperparameter Tuning in Decision Tree Model to Improve Performance of Cardiovascular Disease Classification

    Authors: Abhay Kumar Pathak, Mrityunjay Chaubey, Manjari Gupta

    Abstract: Cardiovascular disease refers to any critical condition that impacts the heart. Because heart diseases can be life-threatening. Researchers are focusing on designing smart systems to accurately diagnose them based on electronic health data, with the aid of machine learning algorithms. Heart disease classification using machine learning (ML) algorithms such as Support Vector Machine(SVM), Naïve Bay… ▽ More

    Submitted 27 November, 2024; originally announced November 2024.

  2. arXiv:2311.11185  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Dueling Optimization with a Monotone Adversary

    Authors: Avrim Blum, Meghal Gupta, Gene Li, Naren Sarayu Manoj, Aadirupa Saha, Yuanyuan Yang

    Abstract: We introduce and study the problem of dueling optimization with a monotone adversary, which is a generalization of (noiseless) dueling convex optimization. The goal is to design an online algorithm to find a minimizer $\mathbf{x}^{*}$ for a function $f\colon X \to \mathbb{R}$, where $X \subseteq \mathbb{R}^d$. In each round, the algorithm submits a pair of guesses, i.e., $\mathbf{x}^{(1)}$ and… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: 21 pages. comments welcome

  3. arXiv:2302.14531  [pdf, other

    stat.ME

    Finite sample inference for empirical Bayesian methods

    Authors: Hien D Nguyen, Mayetri Gupta

    Abstract: In recent years, empirical Bayesian (EB) inference has become an attractive approach for estimation in parametric models arising in a variety of real-life problems, especially in complex and high-dimensional scientific applications. However, compared to the relative abundance of available general methods for computing point estimators in the EB framework, the construction of confidence sets and hy… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

  4. arXiv:2207.12007  [pdf, other

    cs.AI stat.ML

    LETS-GZSL: A Latent Embedding Model for Time Series Generalized Zero Shot Learning

    Authors: Sathvik Bhaskarpandit, Priyanka Gupta, Manik Gupta

    Abstract: One of the recent developments in deep learning is generalized zero-shot learning (GZSL), which aims to recognize objects from both seen and unseen classes, when only the labeled examples from seen classes are provided. Over the past couple of years, GZSL has picked up traction and several models have been proposed to solve this problem. Whereas an extensive amount of research on GZSL has been car… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: 9 pages, 5 figures, 6 tables. Accepted at the IJCAI 2022 workshop on Artificial Intelligence for Time Series (AI4TS)

  5. arXiv:2202.01277  [pdf, other

    stat.ML cs.LG

    Global Optimization Networks

    Authors: Sen Zhao, Erez Louidor, Olexander Mangylov, Maya Gupta

    Abstract: We consider the problem of estimating a good maximizer of a black-box function given noisy examples. To solve such problems, we propose to fit a new type of function which we call a global optimization network (GON), defined as any composition of an invertible function and a unimodal function, whose unique global maximizer can be inferred in $\mathcal{O}(D)$ time. In this paper, we show how to con… ▽ More

    Submitted 2 February, 2022; originally announced February 2022.

  6. arXiv:2102.05135  [pdf, other

    stat.ML cs.LG stat.ME

    Regularization Strategies for Quantile Regression

    Authors: Taman Narayan, Serena Wang, Kevin Canini, Maya Gupta

    Abstract: We investigate different methods for regularizing quantile regression when predicting either a subset of quantiles or the full inverse CDF. We show that minimizing an expected pinball loss over a continuous distribution of quantiles is a good regularizer even when only predicting a specific quantile. For predicting multiple quantiles, we propose achieving the classic goal of non-crossing quantiles… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

  7. arXiv:2009.08900  [pdf

    cs.LG stat.ML

    Time-series Imputation and Prediction with Bi-Directional Generative Adversarial Networks

    Authors: Mehak Gupta, Rahmatollah Beheshti

    Abstract: Multivariate time-series data are used in many classification and regression predictive tasks, and recurrent models have been widely used for such tasks. Most common recurrent models assume that time-series data elements are of equal length and the ordered observations are recorded at regular intervals. However, real-world time-series data have neither a similar length nor a same number of observa… ▽ More

    Submitted 18 September, 2020; originally announced September 2020.

  8. arXiv:2004.12289  [pdf, other

    cs.LG stat.ML

    Deep k-NN for Noisy Labels

    Authors: Dara Bahri, Heinrich Jiang, Maya Gupta

    Abstract: Modern machine learning models are often trained on examples with noisy labels that hurt performance and are hard to identify. In this paper, we provide an empirical study showing that a simple $k$-nearest neighbor-based filtering approach on the logit layer of a preliminary model can remove mislabeled training data and produce more accurate models than many recently proposed methods. We also prov… ▽ More

    Submitted 26 April, 2020; originally announced April 2020.

    Comments: Full paper (including supplemental) can be found at https://github.com/dbahri/deepknn

  9. arXiv:2002.09343  [pdf, ps, other

    cs.LG stat.ML

    Robust Optimization for Fairness with Noisy Protected Groups

    Authors: Serena Wang, Wenshuo Guo, Harikrishna Narasimhan, Andrew Cotter, Maya Gupta, Michael I. Jordan

    Abstract: Many existing fairness criteria for machine learning involve equalizing some metric across protected groups such as race or gender. However, practitioners trying to audit or enforce such group-based criteria can easily face the problem of noisy or biased protected group information. First, we study the consequences of naively relying on noisy protected group labels: we provide an upper bound on th… ▽ More

    Submitted 10 November, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: To appear at 34th Conference on Neural Information Processing Systems (NeurIPS 2020); first two authors contributed equally to this work

  10. arXiv:2002.08605  [pdf, other

    cs.LG cs.AI stat.ML

    Optimizing Black-box Metrics with Adaptive Surrogates

    Authors: Qijia Jiang, Olaoluwa Adigun, Harikrishna Narasimhan, Mahdi Milani Fard, Maya Gupta

    Abstract: We address the problem of training models with black-box and hard-to-optimize metrics by expressing the metric as a monotonic function of a small number of easy-to-optimize surrogates. We pose the training problem as an optimization over a relaxed surrogate space, which we solve by estimating local gradients for the metric and performing inexact convex projections. We analyze gradient estimates ba… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

  11. arXiv:2002.06383  [pdf, other

    cs.CR cs.LG stat.ML

    Analyzing CNN Based Behavioural Malware Detection Techniques on Cloud IaaS

    Authors: Andrew McDole, Mahmoud Abdelsalam, Maanak Gupta, Sudip Mittal

    Abstract: Cloud Infrastructure as a Service (IaaS) is vulnerable to malware due to its exposure to external adversaries, making it a lucrative attack vector for malicious actors. A datacenter infected with malware can cause data loss and/or major disruptions to service for its users. This paper analyzes and compares various Convolutional Neural Networks (CNNs) for online detection of malware in cloud IaaS.… ▽ More

    Submitted 15 February, 2020; originally announced February 2020.

  12. arXiv:2001.11990  [pdf, other

    cs.LG cs.AI stat.ML

    Deontological Ethics By Monotonicity Shape Constraints

    Authors: Serena Wang, Maya Gupta

    Abstract: We demonstrate how easy it is for modern machine-learned systems to violate common deontological ethical principles and social norms such as "favor the less fortunate," and "do not penalize good attributes." We propose that in some cases such ethical principles can be incorporated into a machine-learned model by adding shape constraints that constrain the model to respond only positively to releva… ▽ More

    Submitted 12 March, 2020; v1 submitted 31 January, 2020; originally announced January 2020.

    Comments: AISTATS 2020

  13. arXiv:1912.05198  [pdf

    cs.LG cs.NE eess.SP stat.ML

    Recurrent Transform Learning

    Authors: Megha Gupta, Angshul Majumdar

    Abstract: The objective of this work is to improve the accuracy of building demand forecasting. This is a more challenging task than grid level forecasting. For the said purpose, we develop a new technique called recurrent transform learning (RTL). Two versions are proposed. The first one (RTL) is unsupervised; this is used as a feature extraction tool that is further fed into a regression model. The second… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: A slightly different version has been accepted at Neural Networks

  14. arXiv:1912.02655  [pdf

    stat.AP cs.LG q-bio.QM

    Obesity Prediction with EHR Data: A deep learning approach with interpretable elements

    Authors: Mehak Gupta, Thao-Ly T. Phan, Timothy Bunnell, Rahmatollah Beheshti

    Abstract: Childhood obesity is a major public health challenge. Early prediction and identification of the children at a high risk of developing childhood obesity may help in engaging earlier and more effective interventions to prevent and manage obesity. Most existing predictive tools for childhood obesity primarily rely on traditional regression-type methods using only a few hand-picked features and witho… ▽ More

    Submitted 22 October, 2021; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: 19 pages, 4 Tables, 7 figures

    Report number: 32

    Journal ref: ACM Transactions on Computing for Healthcare, 2022

  15. arXiv:1910.05018  [pdf, other

    cs.LG cs.FL cs.NE stat.ML

    Verification of Neural Networks: Specifying Global Robustness using Generative Models

    Authors: Nathanaël Fijalkow, Mohit Kumar Gupta

    Abstract: The success of neural networks across most machine learning tasks and the persistence of adversarial examples have made the verification of such models an important quest. Several techniques have been successfully developed to verify robustness, and are now able to evaluate neural networks with thousands of nodes. The main weakness of this approach is in the specification: robustness is asserted o… ▽ More

    Submitted 11 October, 2019; originally announced October 2019.

    Comments: A preliminary version was presented at the VNN Symposium (Verification of Neural Networks) in Stanford, 2019

  16. arXiv:1909.02939  [pdf, other

    cs.LG cs.GT stat.ML

    Optimizing Generalized Rate Metrics through Game Equilibrium

    Authors: Harikrishna Narasimhan, Andrew Cotter, Maya Gupta

    Abstract: We present a general framework for solving a large class of learning problems with non-linear functions of classification rates. This includes problems where one wishes to optimize a non-decomposable performance metric such as the F-measure or G-mean, and constrained training problems where the classifier needs to satisfy non-linear rate constraints such as predictive parity fairness, distribution… ▽ More

    Submitted 6 September, 2019; originally announced September 2019.

  17. arXiv:1906.07573  [pdf, other

    cs.CY cs.LG stat.ML

    Agriculture Commodity Arrival Prediction using Remote Sensing Data: Insights and Beyond

    Authors: Gautam Prasad, Upendra Reddy Vuyyuru, Mithun Das Gupta

    Abstract: In developing countries like India agriculture plays an extremely important role in the lives of the population. In India, around 80\% of the population depend on agriculture or its by-products as the primary means for employment. Given large population dependency on agriculture, it becomes extremely important for the government to estimate market factors in advance and prepare for any deviation f… ▽ More

    Submitted 14 June, 2019; originally announced June 2019.

    Comments: KDD'18 Fragile Earth Workshop (FEED)

  18. arXiv:1906.05330  [pdf, other

    cs.LG stat.ML

    Pairwise Fairness for Ranking and Regression

    Authors: Harikrishna Narasimhan, Andrew Cotter, Maya Gupta, Serena Wang

    Abstract: We present pairwise fairness metrics for ranking models and regression models that form analogues of statistical fairness notions such as equal opportunity, equal accuracy, and statistical parity. Our pairwise formulation supports both discrete protected groups, and continuous protected attributes. We show that the resulting training problems can be efficiently and effectively solved using existin… ▽ More

    Submitted 7 January, 2020; v1 submitted 12 June, 2019; originally announced June 2019.

  19. arXiv:1906.00025  [pdf, other

    cs.LG cs.AI stat.ML

    Minimum-Margin Active Learning

    Authors: Heinrich Jiang, Maya Gupta

    Abstract: We present a new active sampling method we call min-margin which trains multiple learners on bootstrap samples and then chooses the examples to label based on the candidates' minimum margin amongst the bootstrapped models. This extends standard margin sampling in a way that increases its diversity in a supervised manner as it arises from the model uncertainty. We focus on the one-shot batch active… ▽ More

    Submitted 31 May, 2019; originally announced June 2019.

  20. arXiv:1809.04262  [pdf, ps, other

    cs.LG cs.IR stat.ML

    Extracting Fairness Policies from Legal Documents

    Authors: Rashmi Nagpal, Chetna Wadhwa, Mallika Gupta, Samiulla Shaikh, Sameep Mehta, Vikram Goyal

    Abstract: Machine Learning community is recently exploring the implications of bias and fairness with respect to the AI applications. The definition of fairness for such applications varies based on their domain of application. The policies governing the use of such machine learning system in a given context are defined by the constitutional laws of nations and regulatory policies enforced by the organizati… ▽ More

    Submitted 12 September, 2018; originally announced September 2018.

  21. arXiv:1809.04198  [pdf, other

    cs.LG cs.AI cs.GT math.OC stat.ML

    Optimization with Non-Differentiable Constraints with Applications to Fairness, Recall, Churn, and Other Goals

    Authors: Andrew Cotter, Heinrich Jiang, Serena Wang, Taman Narayan, Maya Gupta, Seungil You, Karthik Sridharan

    Abstract: We show that many machine learning goals, such as improved fairness metrics, can be expressed as constraints on the model's predictions, which we call rate constraints. We study the problem of training non-convex models subject to these rate constraints (or any non-convex and non-differentiable constraints). In the non-convex setting, the standard approach of Lagrange multipliers may fail. Further… ▽ More

    Submitted 11 September, 2018; originally announced September 2018.

  22. arXiv:1808.09935  [pdf, other

    cs.LG stat.ML

    Attention-based Neural Text Segmentation

    Authors: Pinkesh Badjatiya, Litton J Kurisinkel, Manish Gupta, Vasudeva Varma

    Abstract: Text segmentation plays an important role in various Natural Language Processing (NLP) tasks like summarization, context understanding, document indexing and document noise removal. Previous methods for this task require manual feature engineering, huge memory requirements and large execution times. To the best of our knowledge, this paper is the first one to present a novel supervised neural appr… ▽ More

    Submitted 29 August, 2018; originally announced August 2018.

  23. arXiv:1807.00028  [pdf, other

    cs.LG stat.ML

    Training Well-Generalizing Classifiers for Fairness Metrics and Other Data-Dependent Constraints

    Authors: Andrew Cotter, Maya Gupta, Heinrich Jiang, Nathan Srebro, Karthik Sridharan, Serena Wang, Blake Woodworth, Seungil You

    Abstract: Classifiers can be trained with data-dependent constraints to satisfy fairness goals, reduce churn, achieve a targeted false positive rate, or other policy goals. We study the generalization performance for such constrained optimization problems, in terms of how well the constraints are satisfied at evaluation time, given that they are satisfied at training time. To improve generalization performa… ▽ More

    Submitted 28 September, 2018; v1 submitted 29 June, 2018; originally announced July 2018.

  24. arXiv:1806.11212  [pdf, other

    cs.LG stat.ML

    Proxy Fairness

    Authors: Maya Gupta, Andrew Cotter, Mahdi Milani Fard, Serena Wang

    Abstract: We consider the problem of improving fairness when one lacks access to a dataset labeled with protected groups, making it difficult to take advantage of strategies that can improve fairness but require protected group labels, either at training or runtime. To address this, we investigate improving fairness metrics for proxy groups, and test whether doing so results in improved fairness for the tru… ▽ More

    Submitted 28 June, 2018; originally announced June 2018.

  25. arXiv:1806.11202  [pdf, ps, other

    cs.LG stat.ML

    Quit When You Can: Efficient Evaluation of Ensembles with Ordering Optimization

    Authors: Serena Wang, Maya Gupta, Seungil You

    Abstract: Given a classifier ensemble and a set of examples to be classified, many examples may be confidently and accurately classified after only a subset of the base models in the ensemble are evaluated. This can reduce both mean latency and CPU while maintaining the high accuracy of the original ensemble. To achieve such gains, we propose jointly optimizing a fixed evaluation order of the base models an… ▽ More

    Submitted 28 June, 2018; originally announced June 2018.

  26. arXiv:1806.00050  [pdf, other

    cs.LG cs.AI stat.ML

    Interpretable Set Functions

    Authors: Andrew Cotter, Maya Gupta, Heinrich Jiang, James Muller, Taman Narayan, Serena Wang, Tao Zhu

    Abstract: We propose learning flexible but interpretable functions that aggregate a variable-length set of permutation-invariant feature vectors to predict a label. We use a deep lattice network model so we can architect the model structure to enhance interpretability, and add monotonicity constraints between inputs-and-outputs. We then use the proposed set function to automate the engineering of dense, int… ▽ More

    Submitted 31 May, 2018; originally announced June 2018.

  27. arXiv:1805.11783  [pdf, other

    stat.ML cs.LG

    To Trust Or Not To Trust A Classifier

    Authors: Heinrich Jiang, Been Kim, Melody Y. Guan, Maya Gupta

    Abstract: Knowing when a classifier's prediction can be trusted is useful in many applications and critical for safely using AI. While the bulk of the effort in machine learning research has been towards improving classifier performance, understanding when a classifier's predictions should and should not be trusted has received far less attention. The standard approach is to use the classifier's discriminan… ▽ More

    Submitted 26 October, 2018; v1 submitted 29 May, 2018; originally announced May 2018.

    Comments: NIPS 2018

  28. arXiv:1805.10582  [pdf, other

    stat.ML cs.AI cs.LG

    Metric-Optimized Example Weights

    Authors: Sen Zhao, Mahdi Milani Fard, Harikrishna Narasimhan, Maya Gupta

    Abstract: Real-world machine learning applications often have complex test metrics, and may have training and test data that are not identically distributed. Motivated by known connections between complex test metrics and cost-weighted learning, we propose addressing these issues by using a weighted loss function with a standard loss, where the weights on the training examples are learned to optimize the te… ▽ More

    Submitted 15 June, 2019; v1 submitted 27 May, 2018; originally announced May 2018.

    Comments: Proceedings of the 36th International Conference on Machine Learning (ICML'19)

  29. arXiv:1711.04150  [pdf, other

    cs.SI cs.LG stat.ML

    STWalk: Learning Trajectory Representations in Temporal Graphs

    Authors: Supriya Pandhre, Himangi Mittal, Manish Gupta, Vineeth N Balasubramanian

    Abstract: Analyzing the temporal behavior of nodes in time-varying graphs is useful for many applications such as targeted advertising, community evolution and outlier detection. In this paper, we present a novel approach, STWalk, for learning trajectory representations of nodes in temporal graphs. The proposed framework makes use of structural properties of graphs at current and previous time-steps to lear… ▽ More

    Submitted 11 November, 2017; originally announced November 2017.

    Comments: 10 pages, 5 figures, 2 tables

  30. arXiv:1709.06680  [pdf, other

    stat.ML cs.LG

    Deep Lattice Networks and Partial Monotonic Functions

    Authors: Seungil You, David Ding, Kevin Canini, Jan Pfeifer, Maya Gupta

    Abstract: We propose learning deep models that are monotonic with respect to a user-specified set of inputs by alternating layers of linear embeddings, ensembles of lattices, and calibrators (piecewise linear functions), with appropriate constraints for monotonicity, and jointly training the resulting network. We implement the layers and projections with new computational graph nodes in TensorFlow and use t… ▽ More

    Submitted 19 September, 2017; originally announced September 2017.

    Comments: 9 pages, NIPS 2017

  31. arXiv:1605.04657  [pdf, other

    cs.IT cs.LG stat.ML

    Solve-Select-Scale: A Three Step Process For Sparse Signal Estimation

    Authors: Mithun Das Gupta

    Abstract: In the theory of compressed sensing (CS), the sparsity $\|x\|_0$ of the unknown signal $\mathbf{x} \in \mathcal{R}^n$ is of prime importance and the focus of reconstruction algorithms has mainly been either $\|x\|_0$ or its convex relaxation (via $\|x\|_1$). However, it is typically unknown in practice and has remained a challenge when nothing about the size of the support is known. As pointed rec… ▽ More

    Submitted 16 May, 2016; originally announced May 2016.

  32. arXiv:1206.4653  [pdf

    cs.LG cs.CV stat.ML

    Dimensionality Reduction by Local Discriminative Gaussians

    Authors: Nathan Parrish, Maya Gupta

    Abstract: We present local discriminative Gaussian (LDG) dimensionality reduction, a supervised dimensionality reduction technique for classification. The LDG objective function is an approximation to the leave-one-out training error of a local quadratic discriminant analysis classifier, and thus acts locally to each training point in order to find a mapping where similar data can be discriminated from diss… ▽ More

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML2012

  33. arXiv:1203.3483  [pdf

    cs.LG stat.ML

    Regularized Maximum Likelihood for Intrinsic Dimension Estimation

    Authors: Mithun Das Gupta, Thomas S. Huang

    Abstract: We propose a new method for estimating the intrinsic dimension of a dataset by applying the principle of regularized maximum likelihood to the distances between close neighbors. We propose a regularization scheme which is motivated by divergence minimization principles. We derive the estimator by a Poisson process approximation, argue about its convergence properties and apply it to a number of si… ▽ More

    Submitted 15 March, 2012; originally announced March 2012.

    Comments: Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

    Report number: UAI-P-2010-PG-220-227

  34. arXiv:1107.4390  [pdf, other

    stat.ML stat.ME

    Multi-Task Averaging

    Authors: Sergey Feldman, Bela A. Frigyik, Maya R. Gupta

    Abstract: We present a multi-task learning approach to jointly estimate the means of multiple independent data sets. The proposed multi-task averaging (MTA) algorithm results in a convex combination of the single-task maximum likelihood estimates. We derive the optimal minimum risk estimator and the minimax estimator, and show that these estimators can be efficiently estimated. Simulations and real data exp… ▽ More

    Submitted 24 August, 2012; v1 submitted 21 July, 2011; originally announced July 2011.

    Comments: totally redone paper

  35. arXiv:1105.2952  [pdf, other

    stat.ML cs.IT

    Bounds on the Bayes Error Given Moments

    Authors: Bela A. Frigyik, Maya R. Gupta

    Abstract: We show how to compute lower bounds for the supremum Bayes error if the class-conditional distributions must satisfy moment constraints, where the supremum is with respect to the unknown class-conditional distributions. Our approach makes use of Curto and Fialkow's solutions for the truncated moment problem. The lower bound shows that the popular Gaussian assumption is not robust in this regard. W… ▽ More

    Submitted 30 January, 2012; v1 submitted 15 May, 2011; originally announced May 2011.

    Comments: 10 pages, 2 figures, to appear in IEEE Transactions on Information Theory

  36. Model selection and sensitivity analysis for sequence pattern models

    Authors: Mayetri Gupta

    Abstract: In this article we propose a maximal a posteriori (MAP) criterion for model selection in the motif discovery problem and investigate conditions under which the MAP asymptotically gives a correct prediction of model size. We also investigate robustness of the MAP to prior specification and provide guidelines for choosing prior hyper-parameters for motif models based on sensitivity considerations.

    Submitted 16 May, 2008; originally announced May 2008.

    Comments: Published in at http://dx.doi.org/10.1214/193940307000000301 the IMS Collections (http://www.imstat.org/publications/imscollections.htm) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-COLL1-IMSCOLL130 MSC Class: 62F15; 62P10 (Primary) 62F12 (Secondary)

    Journal ref: IMS Collections 2008, Vol. 1, 390-407