Showing 1–13 of 13 results for author: Arbour, D

Searching in archive stat.
  1. arXiv:2402.00168  [pdf, other]

    stat.ML cs.LG stat.ME

    Continuous Treatment Effects with Surrogate Outcomes

    Authors: Zhenghao Zeng, David Arbour, Avi Feller, Raghavendra Addanki, Ryan Rossi, Ritwik Sinha, Edward H. Kennedy

    Abstract: In many real-world causal inference applications, the primary outcomes (labels) are often partially missing, especially if they are expensive or difficult to collect. If the missingness depends on covariates (i.e., missingness is not completely at random), analyses based on fully observed samples alone may be biased. Incorporating surrogates, which are fully observed post-treatment variables relat…

    Submitted 21 May, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

    Comments: 30 pages, 7 figures

  2. Anytime-Valid Confidence Sequences in an Enterprise A/B Testing Platform

    Authors: Akash V. Maharaj, Ritwik Sinha, David Arbour, Ian Waudby-Smith, Simon Z. Liu, Moumita Sinha, Raghavendra Addanki, Aaditya Ramdas, Manas Garg, Viswanathan Swaminathan

    Abstract: A/B tests are the gold standard for evaluating digital experiences on the web. However, traditional "fixed-horizon" statistical methods are often incompatible with the needs of modern industry practitioners as they do not permit continuous monitoring of experiments. Frequent evaluation of fixed-horizon tests ("peeking") leads to inflated type-I error and can result in erroneous conclusions. We hav…

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: 15 pages, 12 figures. Expanded version of ACM Web Conference Proceedings paper

    ACM Class: G.3

    Journal ref: Companion Proceedings of the ACM Web Conference 2023 (WWW '23 Companion)

  3. arXiv:2210.06594  [pdf, other]

    cs.LG cs.AI cs.DS econ.EM stat.ME

    Sample Constrained Treatment Effect Estimation

    Authors: Raghavendra Addanki, David Arbour, Tung Mai, Cameron Musco, Anup Rao

    Abstract: Treatment effect estimation is a fundamental problem in causal inference. We focus on designing efficient randomized controlled trials, to accurately estimate the effect of some treatment on a population of $n$ individuals. In particular, we study sample-constrained treatment effect estimation, where we must select a subset of $s \ll n$ individuals from the population to experiment on. This subset…

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Conference on Neural Information Processing Systems (NeurIPS) 2022

  4. arXiv:2207.00163  [pdf, ps, other]

    stat.ML cs.AI cs.LG

    Non-Parametric Inference of Relational Dependence

    Authors: Ragib Ahsan, Zahra Fatemi, David Arbour, Elena Zheleva

    Abstract: Independence testing plays a central role in statistical and causal inference from observational data. Standard independence tests assume that the data samples are independent and identically distributed (i.i.d.) but that assumption is violated in many real-world datasets and applications centered on relational systems. This work examines the problem of estimating independence in data drawn from r…

    Submitted 29 June, 2022; originally announced July 2022.

    Comments: To appear in UAI 2022

  5. arXiv:2205.15965  [pdf, other]

    stat.AP

    Bayesian Modeling of Marketing Attribution

    Authors: Ritwik Sinha, David Arbour, Aahlad Manas Puli

    Abstract: In a multi-channel marketing world, the purchase decision journey encounters many interactions (e.g., email, mobile notifications, display advertising, social media, and so on). These impressions have direct (main effects), as well as interactive influence on the final decision of the customer. To maximize conversions, a marketer needs to understand how each of these marketing efforts individually…

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: Published at JSM Proceedings 2021 (Accessible at https://ww2.amstat.org/meetings/proceedings/2021/data/assets/pdf/1913796.pdf)

  6. arXiv:2203.02025  [pdf, other]

    stat.ME

    Online Balanced Experimental Design

    Authors: David Arbour, Drew Dimmery, Tung Mai, Anup Rao

    Abstract: We consider the experimental design problem in an online environment, an important practical task for reducing the variance of estimates in randomized experiments which allows for greater precision, and in turn, improved decision making. In this work, we present algorithms that build on recent advances in online discrepancy minimization which accommodate both arbitrary treatment probabilities and m…

    Submitted 3 March, 2022; originally announced March 2022.

  7. arXiv:2110.07006  [pdf, other]

    stat.ME stat.AP

    Estimating the effects of a California gun control program with Multitask Gaussian Processes

    Authors: Eli Ben-Michael, David Arbour, Avi Feller, Alex Franks, Steven Raphael

    Abstract: Gun violence is a critical public safety concern in the United States. In 2006 California implemented a unique firearm monitoring program, the Armed and Prohibited Persons System (APPS), to address gun violence in the state. The APPS program first identifies those firearm owners who become prohibited from owning one due to federal or state law, then confiscates their firearms. Our goal is to asses…

    Submitted 8 June, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

  8. arXiv:2103.06476  [pdf, other]

    math.ST stat.ME stat.ML

    Time-uniform central limit theory and asymptotic confidence sequences

    Authors: Ian Waudby-Smith, David Arbour, Ritwik Sinha, Edward H. Kennedy, Aaditya Ramdas

    Abstract: Confidence intervals based on the central limit theorem (CLT) are a cornerstone of classical statistics. Despite being only asymptotically valid, they are ubiquitous because they permit statistical inference under weak assumptions and can often be applied to problems even when nonasymptotic inference is impossible. This paper introduces time-uniform analogues of such asymptotic confidence interval…

    Submitted 13 March, 2024; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: 69 pages, 10 figures

  9. arXiv:2010.11332  [pdf, other]

    stat.ME stat.ML

    Efficient Balanced Treatment Assignments for Experimentation

    Authors: David Arbour, Drew Dimmery, Anup Rao

    Abstract: In this work, we reframe the problem of balanced treatment assignment as optimization of a two-sample test between test and control units. Using this lens we provide an assignment algorithm that is optimal with respect to the minimum spanning tree test of Friedman and Rafsky (1979). This assignment to treatment groups may be performed exactly in polynomial time. We provide a probabilistic interpre…

    Submitted 21 October, 2020; originally announced October 2020.

  10. arXiv:2009.03860  [pdf, other]

    stat.ME

    Designing Transportable Experiments

    Authors: My Phan, David Arbour, Drew Dimmery, Anup B. Rao

    Abstract: We consider the problem of designing a randomized experiment on a source population to estimate the Average Treatment Effect (ATE) on a target population. We propose a novel approach which explicitly considers the target when designing the experiment on the source. Under the covariate shift assumption, we design an unbiased importance-weighted estimator for the target population's ATE. To reduce t…

    Submitted 4 September, 2021; v1 submitted 8 September, 2020; originally announced September 2020.

  11. arXiv:2004.01218  [pdf, other]

    stat.ME cs.AI cs.LG

    General Identification of Dynamic Treatment Regimes Under Interference

    Authors: Eli Sherman, David Arbour, Ilya Shpitser

    Abstract: In many applied fields, researchers are often interested in tailoring treatments to unit-level characteristics in order to optimize an outcome of interest. Methods for identifying and estimating treatment policies are the subject of the dynamic treatment regime literature. Separately, in many settings the assumption that data are independent and identically distributed does not hold due to inter-s…

    Submitted 2 April, 2020; originally announced April 2020.

    Comments: 2020 Conference on Artificial Intelligence and Statistics (AIStats)

  12. arXiv:1906.03694  [pdf, other]

    cs.LG stat.ME stat.ML

    Balanced off-policy evaluation in general action spaces

    Authors: Arjun Sondhi, David Arbour, Drew Dimmery

    Abstract: Estimation of importance sampling weights for off-policy evaluation of contextual bandits often results in imbalance - a mismatch between the desired and the actual distribution of state-action pairs after weighting. In this work we present balanced off-policy evaluation (B-OPE), a generic method for estimating weights which minimize this imbalance. Estimation of these weights reduces to a binary…

    Submitted 4 March, 2020; v1 submitted 9 June, 2019; originally announced June 2019.

    Comments: Accepted to AISTATS 2020

  13. arXiv:1901.01230  [pdf, other]

    stat.ME

    Permutation Weighting

    Authors: David Arbour, Drew Dimmery, Arjun Sondhi

    Abstract: In observational causal inference, in order to emulate a randomized experiment, weights are used to render treatments independent of observed covariates. This property is known as balance; in its absence, estimated causal effects may be arbitrarily biased. In this work we introduce permutation weighting, a method for estimating balancing weights using a standard binary classifier (regardless of ca…

    Submitted 14 July, 2020; v1 submitted 4 January, 2019; originally announced January 2019.