Estimation of Treatment Effects Under Nonstationarity via the Truncated Policy Gradient Estimator

Johari, Ramesh; Peng, Tianyi; Xing, Wenqian

Statistics > Methodology

arXiv:2506.05308 (stat)

[Submitted on 5 Jun 2025 (v1), last revised 7 Oct 2025 (this version, v2)]

Abstract: Randomized experiments (or A/B tests) are widely used to evaluate interventions in dynamic systems such as recommendation platforms, marketplaces, and digital health. In these settings, interventions affect both current and future system states, so estimating the global average treatment effect (GATE) requires accounting for temporal dynamics, which is especially challenging in the presence of nonstationarity; existing approaches suffer from high bias, high variance, or both. In this paper, we address this challenge via the novel Truncated Policy Gradient (TPG) estimator, which replaces instantaneous outcomes with short-horizon outcome trajectories. The estimator admits a policy-gradient interpretation: it is a truncation of the first-order approximation to the GATE, yielding provable reductions in bias and variance in nonstationary Markovian settings. We further establish a central limit theorem for the TPG estimator and develop a consistent variance estimator that remains valid under nonstationarity with single-trajectory data. We validate our theory with two real-world case studies. The results show that a well-calibrated TPG estimator attains low bias and variance in practical nonstationary settings.
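
The abstract does not give the estimator's formula, so the following is only a minimal sketch of one plausible reading of "replacing instantaneous outcomes with short-horizon outcome trajectories". It assumes a single trajectory in which each time step is independently assigned to treatment with known probability `p`, and it weights the sum of the next `k + 1` outcomes by an inverse-propensity term. The function name `truncated_trajectory_estimate`, the parameter names, and the weighting scheme are illustrative assumptions, not the paper's definition.

```python
from typing import Sequence


def truncated_trajectory_estimate(
    actions: Sequence[int],
    outcomes: Sequence[float],
    p: float,
    k: int,
) -> float:
    """Hypothetical truncated estimator (an assumption, not the paper's formula).

    actions[t]  -- 1 if step t was treated, 0 if control
    outcomes[t] -- outcome observed at step t
    p           -- known treatment probability at every step, 0 < p < 1
    k           -- truncation horizon; k = 0 uses only the instantaneous outcome
    """
    if len(actions) != len(outcomes):
        raise ValueError("actions and outcomes must have the same length")
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    if k < 0:
        raise ValueError("k must be non-negative")

    n = len(actions)
    if n == 0:
        raise ValueError("need at least one time step")

    total = 0.0
    for t in range(n):
        # Inverse-propensity weight: +1/p for treatment, -1/(1-p) for control.
        weight = 1.0 / p if actions[t] == 1 else -1.0 / (1.0 - p)
        # Short-horizon trajectory: outcomes from t through t + k,
        # cut off at the end of the data.
        window = sum(outcomes[t : min(t + k + 1, n)])
        total += weight * window
    return total / n


# Tiny worked example with alternating assignment and p = 0.5.
actions = [1, 0, 1, 0]
outcomes = [1.0, 2.0, 3.0, 4.0]
naive = truncated_trajectory_estimate(actions, outcomes, p=0.5, k=0)
truncated = truncated_trajectory_estimate(actions, outcomes, p=0.5, k=1)
print(naive, truncated)  # -1.0 0.5
```

With `k = 0` the sketch reduces to a standard inverse-propensity-weighted comparison of instantaneous outcomes; a larger `k` lets each assignment take credit for outcomes a few steps ahead, which is the intuition the abstract describes. How to choose the horizon is the calibration question the abstract alludes to and is not addressed here.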

Subjects:	Methodology (stat.ME)
Cite as:	arXiv:2506.05308 [stat.ME]
	(or arXiv:2506.05308v2 [stat.ME] for this version)
	https://doi.org/10.48550/arXiv.2506.05308

Submission history

From: Wenqian Xing
[v1] Thu, 5 Jun 2025 17:53:26 UTC (301 KB)
[v2] Tue, 7 Oct 2025 17:14:32 UTC (326 KB)
