Skip to main content

Showing 1–8 of 8 results for author: Galhotra, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2212.10839  [pdf, other

    cs.LG cs.AI cs.DB stat.ML

    Consistent Range Approximation for Fair Predictive Modeling

    Authors: Jiongli Zhu, Sainyam Galhotra, Nazanin Sabri, Babak Salimi

    Abstract: This paper proposes a novel framework for certifying the fairness of predictive models trained on biased data. It draws from query answering for incomplete and inconsistent databases to formulate the problem of consistent range approximation (CRA) of fairness queries for a predictive model on a target population. The framework employs background knowledge of the data collection process and biased… ▽ More

    Submitted 28 July, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

  2. arXiv:2105.05782  [pdf, other

    cs.DS cs.DB stat.ML

    How to Design Robust Algorithms using Noisy Comparison Oracle

    Authors: Raghavendra Addanki, Sainyam Galhotra, Barna Saha

    Abstract: Metric based comparison operations such as finding maximum, nearest and farthest neighbor are fundamental to studying various clustering techniques such as $k$-center clustering and agglomerative hierarchical clustering. These techniques crucially rely on accurate estimation of pairwise distance between records. However, computing exact features of the records, and their pairwise distances is ofte… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: PVLDB 2021

  3. arXiv:2102.03977  [pdf, other

    stat.ML cs.AI cs.CY cs.DS cs.LG

    Learning to Generate Fair Clusters from Demonstrations

    Authors: Sainyam Galhotra, Sandhya Saisubramanian, Shlomo Zilberstein

    Abstract: Fair clustering is the process of grouping similar entities together, while satisfying a mathematically well-defined fairness metric as a constraint. Due to the practical challenges in precise model specification, the prescribed fairness constraints are often incomplete and act as proxies to the intended fairness requirement, leading to biased outcomes when the system is deployed. We examine how t… ▽ More

    Submitted 7 February, 2021; originally announced February 2021.

  4. arXiv:2006.06053  [pdf, other

    cs.LG cs.CY cs.DB stat.ML

    Causal Feature Selection for Algorithmic Fairness

    Authors: Sainyam Galhotra, Karthikeyan Shanmugam, Prasanna Sattigeri, Kush R. Varshney

    Abstract: The use of machine learning (ML) in high-stakes societal decisions has encouraged the consideration of fairness throughout the ML lifecycle. Although data integration is one of the primary steps to generate high quality training data, most of the fairness literature ignores this stage. In this work, we consider fairness in the integration component of data management, aiming to identify features t… ▽ More

    Submitted 31 March, 2022; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: Full version of the paper at SIGMOD 2022

  5. arXiv:2002.03508  [pdf, other

    cs.DS cs.AI cs.LG stat.ML

    Fair Correlation Clustering

    Authors: Saba Ahmadi, Sainyam Galhotra, Barna Saha, Roy Schwartz

    Abstract: In this paper we study the problem of correlation clustering under fairness constraints. In the classic correlation clustering problem, we are given a complete graph where each edge is labeled positive or negative. The goal is to obtain a clustering of the vertices that minimizes disagreements -- the number of negative edges trapped inside a cluster plus positive edges between different clusters.… ▽ More

    Submitted 9 February, 2020; originally announced February 2020.

  6. arXiv:1912.07820  [pdf, other

    stat.ML cs.DS cs.LG

    Balancing the Tradeoff Between Clustering Value and Interpretability

    Authors: Sandhya Saisubramanian, Sainyam Galhotra, Shlomo Zilberstein

    Abstract: Graph clustering groups entities -- the vertices of a graph -- based on their similarity, typically using a complex distance function over a large number of features. Successful integration of clustering approaches in automated decision-support systems hinges on the interpretability of the resulting clusters. This paper addresses the problem of generating interpretable clusters, given features of… ▽ More

    Submitted 30 January, 2020; v1 submitted 17 December, 2019; originally announced December 2019.

    Comments: Accepted at AIES 2020

  7. arXiv:1903.00750  [pdf, other

    stat.ML cs.AI cs.DS cs.LG

    Lexicographically Ordered Multi-Objective Clustering

    Authors: Sainyam Galhotra, Sandhya Saisubramanian, Shlomo Zilberstein

    Abstract: We introduce a rich model for multi-objective clustering with lexicographic ordering over objectives and a slack. The slack denotes the allowed multiplicative deviation from the optimal objective value of the higher priority objective to facilitate improvement in lower-priority objectives. We then propose an algorithm called Zeus to solve this class of problems, which is characterized by a makeshi… ▽ More

    Submitted 2 March, 2019; originally announced March 2019.

  8. arXiv:1709.05510  [pdf, other

    cs.SI cs.DS stat.ML

    The Geometric Block Model

    Authors: Sainyam Galhotra, Arya Mazumdar, Soumyabrata Pal, Barna Saha

    Abstract: To capture the inherent geometric features of many community detection problems, we propose to use a new random graph model of communities that we call a Geometric Block Model. The geometric block model generalizes the random geometric graphs in the same way that the well-studied stochastic block model generalizes the Erdos-Renyi random graphs. It is also a natural extension of random community mo… ▽ More

    Submitted 24 January, 2018; v1 submitted 16 September, 2017; originally announced September 2017.

    Comments: A shorter version of this paper has appeared in 32nd AAAI Conference on Artificial Intelligence. The AAAI proceedings version as well as the previous version in arxiv contained some errors that have been corrected in this version

    ACM Class: E.1