Showing 1–16 of 16 results for author: Agarwal, A

  1. arXiv:2504.01702  [pdf, ps, other]

    econ.EM cs.LG stat.ME

    A Causal Inference Framework for Data Rich Environments

    Authors: Alberto Abadie, Anish Agarwal, Devavrat Shah

    Abstract: We propose a formal model for counterfactual estimation with unobserved confounding in "data-rich" settings, i.e., where there are a large number of units and a large number of measurements per unit. Our model provides a bridge between the structural causal model view of causal inference common in the graphical models literature with that of the latent factor model view common in the potential out…

    Submitted 2 April, 2025; originally announced April 2025.

  2. arXiv:2502.05340  [pdf, other]

    q-fin.MF econ.GN

    Robust valuation and optimal harvesting of forestry resources in the presence of catastrophe risk and parameter uncertainty

    Authors: Ankush Agarwal, Christian Ewald, Yihan Zou

    Abstract: We determine forest lease value and optimal harvesting strategies under model parameter uncertainty within stochastic bio-economic models that account for catastrophe risk. Catastrophic events are modeled as a Poisson point process, with a two-factor stochastic convenience yield model capturing the lumber spot price dynamics. Using lumber futures and US wildfire data, we estimate model parameters…

    Submitted 7 February, 2025; originally announced February 2025.

  3. arXiv:2410.02091  [pdf]

    cs.SE cs.AI cs.HC econ.GN

    The Impact of Generative AI on Collaborative Open-Source Software Development: Evidence from GitHub Copilot

    Authors: Fangchen Song, Ashish Agarwal, Wen Wen

    Abstract: Generative artificial intelligence (AI) enables automated content production, including coding in software development, which can significantly influence developer participation and performance. To explore its impact on collaborative open-source software (OSS) development, we investigate the role of GitHub Copilot, a generative AI pair programmer, in OSS development where multiple distributed deve…

    Submitted 8 July, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

  4. arXiv:2402.11652  [pdf, other]

    econ.EM cs.LG stat.ME stat.ML

    Doubly Robust Inference in Causal Latent Factor Models

    Authors: Alberto Abadie, Anish Agarwal, Raaz Dwivedi, Abhin Shah

    Abstract: This article introduces a new estimator of average treatment effects under unobserved confounding in modern data-rich environments featuring large numbers of units and outcomes. The proposed estimator is doubly robust, combining outcome imputation, inverse probability weighting, and a novel cross-fitting procedure for matrix completion. We derive finite-sample and asymptotic guarantees, and show t…

    Submitted 29 October, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  5. arXiv:2312.16307  [pdf, other]

    econ.EM cs.GT cs.LG stat.ME

    Incentive-Aware Synthetic Control: Accurate Counterfactual Estimation via Incentivized Exploration

    Authors: Daniel Ngo, Keegan Harris, Anish Agarwal, Vasilis Syrgkanis, Zhiwei Steven Wu

    Abstract: We consider the setting of synthetic control methods (SCMs), a canonical approach used to estimate the treatment effect on the treated in a panel data setting. We shed light on a frequently overlooked but ubiquitous assumption made in SCMs of "overlap": a treated unit can be written as some combination -- typically, convex or linear combination -- of the units that remain under control. We show th…

    Submitted 13 February, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

  6. arXiv:2307.01357  [pdf, other]

    cs.LG econ.EM stat.ME stat.ML

    Adaptive Principal Component Regression with Applications to Panel Data

    Authors: Anish Agarwal, Keegan Harris, Justin Whitehouse, Zhiwei Steven Wu

    Abstract: Principal component regression (PCR) is a popular technique for fixed-design error-in-variables regression, a generalization of the linear regression setting in which the observed covariates are corrupted with random noise. We provide the first time-uniform finite sample guarantees for (regularized) PCR whenever data is collected adaptively. Since the proof techniques for analyzing PCR in the fixe…

    Submitted 4 August, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

  7. arXiv:2306.13681  [pdf, other]

    stat.ME cs.LG econ.EM stat.ML

    Estimating the Value of Evidence-Based Decision Making

    Authors: Alberto Abadie, Anish Agarwal, Guido Imbens, Siwei Jia, James McQueen, Serguei Stepaniants

    Abstract: Business/policy decisions are often based on evidence from randomized experiments and observational studies. In this article we propose an empirical framework to estimate the value of evidence-based decision making (EBDM) and the return on the investment in statistical precision.

    Submitted 9 September, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

  8. arXiv:2303.14226  [pdf, other]

    stat.ME cs.LG econ.EM stat.ML

    Synthetic Combinations: A Causal Inference Framework for Combinatorial Interventions

    Authors: Abhineet Agarwal, Anish Agarwal, Suhas Vijaykumar

    Abstract: Consider a setting where there are $N$ heterogeneous units and $p$ interventions. Our goal is to learn unit-specific potential outcomes for any combination of these $p$ interventions, i.e., $N \times 2^p$ causal parameters. Choosing a combination of interventions is a problem that naturally arises in a variety of applications such as factorial design experiments, recommendation engines, combinatio…

    Submitted 15 January, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

  9. arXiv:2211.14236  [pdf, other]

    econ.EM cs.GT cs.LG

    Strategyproof Decision-Making in Panel Data Settings and Beyond

    Authors: Keegan Harris, Anish Agarwal, Chara Podimata, Zhiwei Steven Wu

    Abstract: We consider the problem of decision-making using panel data, in which a decision-maker gets noisy, repeated measurements of multiple units (or agents). We consider a setup where there is a pre-intervention period, when the principal observes the outcomes of each unit, after which the principal uses these observations to assign a treatment to each unit. Unlike this classical setting, we permit the…

    Submitted 21 December, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: In the fiftieth ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS 2024)

  10. arXiv:2210.11355  [pdf, other]

    econ.EM cs.LG stat.ME

    Network Synthetic Interventions: A Causal Framework for Panel Data Under Network Interference

    Authors: Anish Agarwal, Sarah H. Cen, Devavrat Shah, Christina Lee Yu

    Abstract: We propose a generalization of the synthetic controls and synthetic interventions methodology to incorporate network interference. We consider the estimation of unit-specific potential outcomes from panel data in the presence of spillover across units and unobserved confounding. Key to our approach is a novel latent factor model that takes into account network interference and generalizes the fact…

    Submitted 11 October, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: 49 pages, 6 figures

  11. arXiv:2210.11003  [pdf, other]

    econ.EM cs.LG stat.ME

    Synthetic Blip Effects: Generalizing Synthetic Controls for the Dynamic Treatment Regime

    Authors: Anish Agarwal, Vasilis Syrgkanis

    Abstract: We propose a generalization of the synthetic control and synthetic interventions methodology to the dynamic treatment regime. We consider the estimation of unit-specific treatment effects from panel data collected via a dynamic treatment regime and in the presence of unobserved confounding. That is, each unit receives multiple treatments sequentially, based on an adaptive policy, which depends on…

    Submitted 20 October, 2022; originally announced October 2022.

  12. arXiv:2109.15154  [pdf, other]

    econ.EM cs.LG math.ST stat.ML

    Causal Matrix Completion

    Authors: Anish Agarwal, Munther Dahleh, Devavrat Shah, Dennis Shen

    Abstract: Matrix completion is the study of recovering an underlying matrix from a sparse subset of noisy observations. Traditionally, it is assumed that the entries of the matrix are "missing completely at random" (MCAR), i.e., each entry is revealed at random, independent of everything else, with uniform probability. This is likely unrealistic due to the presence of "latent confounders", i.e., unobserved…

    Submitted 30 September, 2021; originally announced September 2021.

  13. arXiv:2107.02780  [pdf, other]

    econ.EM cs.LG math.ST stat.ML

    Causal Inference with Corrupted Data: Measurement Error, Missing Values, Discretization, and Differential Privacy

    Authors: Anish Agarwal, Rahul Singh

    Abstract: The US Census Bureau will deliberately corrupt data sets derived from the 2020 US Census, enhancing the privacy of respondents while potentially reducing the precision of economic analysis. To investigate whether this trade-off is inevitable, we formulate a semiparametric model of causal inference with high dimensional corrupted data. We propose a procedure for data cleaning, estimation, and infer…

    Submitted 12 February, 2024; v1 submitted 6 July, 2021; originally announced July 2021.

    ACM Class: G.3; J.4

  14. arXiv:2006.07691  [pdf, other]

    econ.EM cs.LG stat.ML

    Synthetic Interventions

    Authors: Anish Agarwal, Devavrat Shah, Dennis Shen

    Abstract: The synthetic controls (SC) methodology is a prominent tool for policy evaluation in panel data applications. Researchers commonly justify the SC framework with a low-rank matrix factor model that assumes the potential outcomes are described by low-dimensional unit and time specific latent factors. In the recent work of [Abadie '20], one of the pioneering authors of the SC method posed the questio…

    Submitted 23 August, 2024; v1 submitted 13 June, 2020; originally announced June 2020.

  15. arXiv:2005.00072  [pdf, other]

    econ.EM cs.LG stat.AP

    Two Burning Questions on COVID-19: Did shutting down the economy help? Can we (partially) reopen the economy without risking the second wave?

    Authors: Anish Agarwal, Abdullah Alomar, Arnab Sarker, Devavrat Shah, Dennis Shen, Cindy Yang

    Abstract: As we reach the apex of the COVID-19 pandemic, the most pressing question facing us is: can we even partially reopen the economy without risking a second wave? We first need to understand if shutting down the economy helped. And if it did, is it possible to achieve similar gains in the war against the pandemic while partially opening up the economy? To do so, it is critical to understand the effec…

    Submitted 10 May, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

  16. arXiv:1902.05622  [pdf, other]

    cs.GT econ.TH

    The Shapley Taylor Interaction Index

    Authors: Kedar Dhamdhere, Ashish Agarwal, Mukund Sundararajan

    Abstract: The attribution problem, that is the problem of attributing a model's prediction to its base features, is well-studied. We extend the notion of attribution to also apply to feature interactions. The Shapley value is a commonly used method to attribute a model's prediction to its base features. We propose a generalization of the Shapley value called Shapley-Taylor index that attributes the model'…

    Submitted 7 February, 2020; v1 submitted 14 February, 2019; originally announced February 2019.