-
OpportunityFinder: A Framework for Automated Causal Inference
Authors:
Huy Nguyen,
Prince Grover,
Devashish Khatwani
Abstract:
We introduce OpportunityFinder, a code-less framework for performing a variety of causal inference studies with panel data for non-expert users. In its current state, OpportunityFinder only requires users to provide raw observational data and a configuration file. A pipeline is then triggered that inspects and processes the data and chooses suitable algorithm(s) to execute the causal study. It returns the causal impact of the treatment on the configured outcome, together with sensitivity and robustness results. Causal inference is widely studied and used to estimate the downstream impact of individuals' interactions with products and features. These causal studies are commonly performed periodically by scientists and/or economists. Business stakeholders are often bottlenecked on scientist or economist bandwidth to conduct causal studies. We offer OpportunityFinder as a solution for commonly performed causal studies with four key features: (1) ease of use for both Business Analysts and Scientists, (2) abstraction of multiple algorithms under a single I/O interface, (3) support for causal impact analysis under binary treatment with panel data, and (4) dynamic selection of algorithm based on the scale of the data.
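As a rough illustration of what a causal impact estimate under binary treatment with panel data can look like, the sketch below computes a two-period difference-in-differences estimate. The abstract does not say which algorithms OpportunityFinder runs, so the choice of difference-in-differences, the toy `panel` rows, and the function names are all assumptions made for this example.

```python
from statistics import mean

# Hypothetical panel rows: (unit, period, treated_group, outcome).
# The values are made up purely for illustration.
panel = [
    ("a", "pre", 1, 10.0), ("a", "post", 1, 15.0),
    ("b", "pre", 1, 12.0), ("b", "post", 1, 17.0),
    ("c", "pre", 0, 9.0), ("c", "post", 0, 11.0),
    ("d", "pre", 0, 11.0), ("d", "post", 0, 13.0),
]


def group_mean(rows, treated, period):
    """Average outcome for one treatment group in one period."""
    return mean(y for _, p, t, y in rows if t == treated and p == period)


def diff_in_diff(rows):
    """Change in the treated group minus change in the control group."""
    treated_change = group_mean(rows, 1, "post") - group_mean(rows, 1, "pre")
    control_change = group_mean(rows, 0, "post") - group_mean(rows, 0, "pre")
    return treated_change - control_change


effect = diff_in_diff(panel)
print(effect)
```

The estimate is only meaningful under the usual parallel-trends assumption, which is the kind of condition the sensitivity and robustness results mentioned in the abstract would be expected to probe.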
Submitted 22 September, 2023;
originally announced September 2023.
-
PATHFINDER: Designing Stimuli for Neuromodulation through data-driven inverse estimation of non-linear functions
Authors:
Chaitanya Goswami,
Pulkit Grover
Abstract:
There has been tremendous interest in designing stimuli (e.g., electrical currents) that produce desired neural responses, e.g., for inducing therapeutic effects for treatments. Traditionally, the design of such stimuli has been model-driven. Due to challenges inherent in modeling neural responses accurately, data-driven approaches offer an attractive alternative. The problem of data-driven stimulus design can be thought of as estimating an inverse of a non-linear "forward" mapping, which takes the stimulus parameters as inputs and outputs the corresponding neural responses. In most cases of interest, the forward mapping is many-to-one, and hence difficult to invert using traditional methods. Existing methods estimate the inverse by using conditional density estimation methods or numerically inverting an estimated forward mapping, but both approaches tend to perform poorly at small sample sizes. In this work, we develop a new optimization framework called PATHFINDER, which allows us to use regression methods for estimating an inverse mapping. We use toy examples to illustrate key aspects of PATHFINDER, and show, on computational models of biological neurons, that PATHFINDER can outperform existing methods at small sample sizes. The data-efficiency of PATHFINDER is especially valuable in stimulus design as collecting data is expensive in this domain.
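The difficulty of inverting a many-to-one forward mapping with plain regression can be seen in a small toy case. The sketch below is not PATHFINDER; it assumes a hypothetical forward mapping `forward(x) = x * x` and shows that the least-squares "inverse" (the conditional mean of the stimulus given the response) averages the two valid stimuli and lands on one that does not reproduce the response. Restricting to a single branch, done here only for contrast, makes the mapping invertible.

```python
from collections import defaultdict


def forward(x):
    """Toy many-to-one forward mapping: +x and -x give the same response."""
    return x * x


# Symmetric grid of stimulus parameters in [-1, 1].
xs = [i / 10 for i in range(-10, 11)]

# Group stimuli by the response they produce.
by_response = defaultdict(list)
for x in xs:
    by_response[round(forward(x), 10)].append(x)

# The least-squares predictor of x given y is the conditional mean of x,
# which here averages +x and -x.
naive_inverse = {y: sum(g) / len(g) for y, g in by_response.items()}

# Keeping only the non-negative branch makes the mapping one-to-one.
branch_inverse = {y: max(g) for y, g in by_response.items()}


def worst_error(inverse):
    """Largest gap between the desired response and the one actually produced."""
    return max(abs(forward(x_hat) - y) for y, x_hat in inverse.items())


naive_err = worst_error(naive_inverse)
branch_err = worst_error(branch_inverse)
print(naive_err, branch_err)
```

The naive inverse predicts a stimulus of zero for every target response, so its error grows with the target, while the single-branch inverse reproduces each response up to rounding.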
Submitted 30 May, 2023; v1 submitted 19 November, 2022;
originally announced November 2022.
-
Fraud Dataset Benchmark and Applications
Authors:
Prince Grover,
Julia Xu,
Justin Tittelfitz,
Anqi Cheng,
Zheng Li,
Jakub Zablocki,
Jianbo Liu,
Hao Zhou
Abstract:
Standardized datasets and benchmarks have spurred innovations in computer vision, natural language processing, multi-modal and tabular settings. We note that, as compared to other well-researched fields, fraud detection has unique challenges: high class imbalance, diverse feature types, frequently changing fraud patterns, and the adversarial nature of the problem. Due to these, the modeling approaches evaluated on datasets from other research fields may not work well for fraud detection. In this paper, we introduce Fraud Dataset Benchmark (FDB), a compilation of publicly available datasets catered to fraud detection. FDB comprises a variety of fraud-related tasks, ranging from identifying fraudulent card-not-present transactions, detecting bot attacks, classifying malicious URLs, and estimating risk of loan default to content moderation. The Python-based library for FDB provides a consistent API for data loading with standardized training and testing splits. We demonstrate several applications of FDB that are of broad interest for fraud detection, including feature engineering, comparison of supervised learning algorithms, label noise removal, class-imbalance treatment and semi-supervised learning. We hope that FDB provides a common playground for researchers and practitioners in the fraud detection domain to develop robust and customized machine learning techniques targeting various fraud use cases.
Submitted 22 September, 2023; v1 submitted 30 August, 2022;
originally announced August 2022.
-
Quantifying Feature Contributions to Overall Disparity Using Information Theory
Authors:
Sanghamitra Dutta,
Praveen Venkatesh,
Pulkit Grover
Abstract:
When a machine-learning algorithm makes biased decisions, it can be helpful to understand the sources of disparity to explain why the bias exists. Towards this, we examine the problem of quantifying the contribution of each individual feature to the observed disparity. If we have access to the decision-making model, one potential approach (inspired from intervention-based approaches in explainability literature) is to vary each individual feature (while keeping the others fixed) and use the resulting change in disparity to quantify its contribution. However, we may not have access to the model or be able to test/audit its outputs for individually varying features. Furthermore, the decision may not always be a deterministic function of the input features (e.g., with human-in-the-loop). For these situations, we might need to explain contributions using purely distributional (i.e., observational) techniques, rather than interventional. We ask the question: what is the "potential" contribution of each individual feature to the observed disparity in the decisions when the exact decision-making mechanism is not accessible? We first provide canonical examples (thought experiments) that help illustrate the difference between distributional and interventional approaches to explaining contributions, and when either is better suited. When unable to intervene on the inputs, we quantify the "redundant" statistical dependency about the protected attribute that is present in both the final decision and an individual feature, by leveraging a body of work in information theory called Partial Information Decomposition. We also perform a simple case study to show how this technique could be applied to quantify contributions.
Submitted 16 June, 2022;
originally announced June 2022.
-
Can Information Flows Suggest Targets for Interventions in Neural Circuits?
Authors:
Praveen Venkatesh,
Sanghamitra Dutta,
Neil Mehta,
Pulkit Grover
Abstract:
Motivated by neuroscientific and clinical applications, we empirically examine whether observational measures of information flow can suggest interventions. We do so by performing experiments on artificial neural networks in the context of fairness in machine learning, where the goal is to induce fairness in the system through interventions. Using our recently developed $M$-information flow framework, we measure the flow of information about the true label (responsible for accuracy, and hence desirable), and separately, the flow of information about a protected attribute (responsible for bias, and hence undesirable) on the edges of a trained neural network. We then compare the flow magnitudes against the effect of intervening on those edges by pruning. We show that pruning edges that carry larger information flows about the protected attribute reduces bias at the output to a greater extent. This demonstrates that $M$-information flow can meaningfully suggest targets for interventions, answering the title's question in the affirmative. We also evaluate bias-accuracy tradeoffs for different intervention strategies, to analyze how one might use estimates of desirable and undesirable information flows (here, accuracy and bias flows) to inform interventions that preserve the former while reducing the latter.
Submitted 9 November, 2021;
originally announced November 2021.
-
Fairness Under Feature Exemptions: Counterfactual and Observational Measures
Authors:
Sanghamitra Dutta,
Praveen Venkatesh,
Piotr Mardziel,
Anupam Datta,
Pulkit Grover
Abstract:
With the growing use of ML in highly consequential domains, quantifying disparity with respect to protected attributes, e.g., gender, race, etc., is important. While quantifying disparity is essential, sometimes the needs of an occupation may require the use of certain features that are critical in a way that any disparity that can be explained by them might need to be exempted. E.g., in hiring a software engineer for a safety-critical application, coding-skills may be weighed strongly, whereas name, zip code, or reference letters may be used only to the extent that they do not add disparity. In this work, we propose an information-theoretic decomposition of the total disparity (a quantification inspired from counterfactual fairness) into two components: a non-exempt component which quantifies the part that cannot be accounted for by the critical features, and an exempt component that quantifies the remaining disparity. This decomposition allows one to check if the disparity arose purely due to the critical features (inspired from the business necessity defense of disparate impact law) and also enables selective removal of the non-exempt component if desired. We arrive at this decomposition through canonical examples that lead to a set of desirable properties (axioms) that a measure of non-exempt disparity should satisfy. Our proposed measure satisfies all of them. Our quantification bridges ideas of causality, Simpson's paradox, and a body of work from information theory called Partial Information Decomposition. We also obtain an impossibility result showing that no observational measure can satisfy all the desirable properties, leading us to relax our goals and examine observational measures that satisfy only some of them. We perform case studies to show how one can audit/train models while reducing non-exempt disparity.
Submitted 6 August, 2021; v1 submitted 14 June, 2020;
originally announced June 2020.
-
Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control
Authors:
Yangchen Pan,
Amir-massoud Farahmand,
Martha White,
Saleh Nabi,
Piyush Grover,
Daniel Nikovski
Abstract:
Recent work has shown that reinforcement learning (RL) is a promising approach to control dynamical systems described by partial differential equations (PDE). This paper shows how to use RL to tackle more general PDE control problems that have continuous high-dimensional action spaces with spatial relationship among action dimensions. In particular, we propose the concept of action descriptors, which encode regularities among spatially-extended action dimensions and enable the agent to control high-dimensional action PDEs. We provide theoretical evidence suggesting that this approach can be more sample efficient compared to a conventional approach that treats each action dimension separately and does not explicitly exploit the spatial regularity of the action space. The action descriptor approach is then used within the deep deterministic policy gradient algorithm. Experiments on two PDE control problems, with up to 256-dimensional continuous actions, show the advantage of the proposed approach over the conventional one.
Submitted 12 June, 2018;
originally announced June 2018.