
Showing 1–10 of 10 results for author: Szpiro, A A

  1. arXiv:2410.07135  [pdf]

    stat.AP cs.LG stat.ML

    Causal Inference with Double/Debiased Machine Learning for Evaluating the Health Effects of Multiple Mismeasured Pollutants

    Authors: Gang Xu, Xin Zhou, Molin Wang, Boya Zhang, Wenhao Jiang, Francine Laden, Helen H. Suh, Adam A. Szpiro, Donna Spiegelman, Zuoheng Wang

    Abstract: One way to quantify exposure to air pollution and its constituents in epidemiologic studies is to use an individual's nearest monitor. This strategy results in potential inaccuracy in the actual personal exposure, introducing bias in estimating the health effects of air pollution and its constituents, especially when evaluating the causal effects of correlated multi-pollutant constituents measured…

    Submitted 21 September, 2024; originally announced October 2024.

  2. arXiv:2006.00150  [pdf, other]

    stat.ME stat.AP

    Random Spatial Forests

    Authors: Travis Hee Wai, Michael T. Young, Adam A. Szpiro

    Abstract: We introduce random spatial forests, a method of bagging regression trees allowing for spatial correlation. Our main contribution is the development of a computationally efficient tree building algorithm which selects each split of the tree adjusting for spatial correlation. We evaluate two different approaches for estimation of random spatial forests, a pseudo-likelihood approach combining random…

    Submitted 22 July, 2020; v1 submitted 29 May, 2020; originally announced June 2020.

  3. Spatial Matrix Completion for Spatially-Misaligned and High-Dimensional Air Pollution Data

    Authors: Phuong T. Vu, Adam A. Szpiro, Noah Simon

    Abstract: In health-pollution cohort studies, accurate predictions of pollutant concentrations at new locations are needed, since the locations of fixed monitoring sites and study participants are often spatially misaligned. For multi-pollution data, principal component analysis (PCA) is often incorporated to obtain low-rank (LR) structure of the data prior to spatial prediction. Recently developed predicti…

    Submitted 21 January, 2022; v1 submitted 11 April, 2020; originally announced April 2020.

    Comments: 26 pages, 5 figures, 5 tables, 1 supplemental file (available upon request). This v2 is a pre peer-reviewed version that was submitted to Environmetrics. A final version with minor revisions was accepted for publication by Environmetrics on Dec 13, 2021, and will be linked to this version once published

  4. arXiv:1909.11161  [pdf, other]

    stat.AP stat.ME

    Selecting a Scale for Spatial Confounding Adjustment

    Authors: Joshua P. Keller, Adam A. Szpiro

    Abstract: Unmeasured, spatially-structured factors can confound associations between spatial environmental exposures and health outcomes. Adding flexible splines to a regression model is a simple approach for spatial confounding adjustment, but the spline degrees of freedom do not provide an easily interpretable spatial scale. We describe a method for quantifying the extent of spatial confounding adjustment…

    Submitted 24 September, 2019; originally announced September 2019.

    Comments: 22 pages, 6 figures

    Journal ref: Journal of the Royal Statistical Society: Series A (2020) 183, Part 3, 1121-1143

  5. Probabilistic Predictive Principal Component Analysis for Spatially-Misaligned and High-Dimensional Air Pollution Data with Missing Observations

    Authors: Phuong T. Vu, Timothy V. Larson, Adam A. Szpiro

    Abstract: Accurate predictions of pollutant concentrations at new locations are often of interest in air pollution studies on fine particulate matter (PM$_{2.5}$), in which data is usually not measured at all study locations. PM$_{2.5}$ is also a mixture of many different chemical components. Principal component analysis (PCA) can be incorporated to obtain lower-dimensional representative scores of such mu…

    Submitted 8 December, 2019; v1 submitted 1 May, 2019; originally announced May 2019.

    Comments: 36 pages, 8 figures, 5 tables. v2 is a pre peer-reviewed version that was submitted to Environmetrics. A final version with minor revisions was accepted for publication by Environmetrics on Oct 30, 2019, and will be linked to this version once published

    Journal ref: Environmetrics 2020, Vol. 31, No. 4, e2614

  6. arXiv:1808.09126  [pdf]

    stat.AP

    National PM2.5 and NO2 Exposure Models for China Based on Land Use Regression, Satellite Measurements, and Universal Kriging

    Authors: Hao Xu, Matthew J. Bechle, Meng Wang, Adam A. Szpiro, Sverre Vedal, Yuqi Bai, Julian D. Marshall

    Abstract: Outdoor air pollution is a major killer worldwide and the fourth largest contributor to the burden of disease in China. China is the most populous country in the world and also has the largest number of air pollution deaths per year, yet the spatial resolution of existing national air pollution estimates for China is generally relatively low. We address this knowledge gap by developing and evaluat…

    Submitted 28 August, 2018; originally announced August 2018.

  7. arXiv:1509.01171  [pdf, other]

    stat.ME

    A novel principal component analysis for spatially-misaligned multivariate air pollution data

    Authors: Roman A. Jandarov, Lianne A. Sheppard, Paul D. Sampson, Adam A. Szpiro

    Abstract: We propose novel methods for predictive (sparse) PCA with spatially misaligned data. These methods identify principal component loading vectors that explain as much variability in the observed data as possible, while also ensuring the corresponding principal component scores can be predicted accurately by means of spatial statistics at locations where air pollution measurements are not available.…

    Submitted 3 September, 2015; originally announced September 2015.

    Comments: 43 pages, 5 figures, and 6 tables

  8. Reduced-rank spatio-temporal modeling of air pollution concentrations in the Multi-Ethnic Study of Atherosclerosis and Air Pollution

    Authors: Casey Olives, Lianne Sheppard, Johan Lindström, Paul D. Sampson, Joel D. Kaufman, Adam A. Szpiro

    Abstract: There is growing evidence in the epidemiologic literature of the relationship between air pollution and adverse health outcomes. Prediction of individual air pollution exposure in the Environmental Protection Agency (EPA) funded Multi-Ethnic Study of Atherosclerosis and Air Pollution (MESA Air) study relies on a flexible spatio-temporal prediction model that integrates land-use regression with kr…

    Submitted 3 February, 2015; originally announced February 2015.

    Comments: Published at http://dx.doi.org/10.1214/14-AOAS786 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS786

    Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 4, 2509-2537

  9. Measurement error in two-stage analyses, with application to air pollution epidemiology

    Authors: Adam A. Szpiro, Christopher J. Paciorek

    Abstract: Public health researchers often estimate health effects of exposures (e.g., pollution, diet, lifestyle) that cannot be directly measured for study subjects. A common strategy in environmental epidemiology is to use a first-stage (exposure) model to estimate the exposure based on covariates and/or spatio-temporal proximity and to use predictions from the exposure model as the covariate of interest…

    Submitted 30 June, 2013; v1 submitted 27 October, 2012; originally announced October 2012.

    Comments: 35 pages, 4 figures, 2 tables

    Journal ref: Environmetrics (2013) 24: 501-517

  10. Model-robust regression and a Bayesian ``sandwich'' estimator

    Authors: Adam A. Szpiro, Kenneth M. Rice, Thomas Lumley

    Abstract: We present a new Bayesian approach to model-robust linear regression that leads to uncertainty estimates with the same robustness properties as the Huber--White sandwich estimator. The sandwich estimator is known to provide asymptotically correct frequentist inference, even when standard modeling assumptions such as linearity and homoscedasticity in the data-generating mechanism are violated. Our…

    Submitted 7 January, 2011; originally announced January 2011.

    Comments: Published at http://dx.doi.org/10.1214/10-AOAS362 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS362

    Journal ref: Annals of Applied Statistics 2010, Vol. 4, No. 4, 2099-2113