-
Prenatal phthalate exposures and adiposity outcomes trajectories: a multivariate Bayesian factor regression approach
Authors:
Phuc H. Nguyen,
Stephanie M. Engel,
Amy H. Herring
Abstract:
We aim to assess the longitudinal effects of prenatal exposure to phthalates on the risk of childhood obesity in children aged 4 to 7, with potential time-varying and sex-specific effects. Multiple body-composition-related outcomes, such as BMI z-score, fat mass percentage, and waist circumference, are available in the data. Existing chemical mixture analyses often look at these outcomes individua…
▽ More
We aim to assess the longitudinal effects of prenatal exposure to phthalates on the risk of childhood obesity in children aged 4 to 7, with potential time-varying and sex-specific effects. Multiple body-composition-related outcomes, such as BMI z-score, fat mass percentage, and waist circumference, are available in the data. Existing chemical mixture analyses often look at these outcomes individually due to the limited availability of multivariate models for mixture exposures. We propose a multivariate Bayesian factor regression that handles multicollinearity in chemical exposures and borrows information across highly correlated outcomes to improve estimation efficiency. We demonstrate the proposed method's utility through simulation studies and an analysis of data from the Mount Sinai Children's Environmental Health Study. We find the associations between prenatal phthalate exposures and adiposity outcomes in male children to be negative at early ages but to become positive as the children get older.
△ Less
Submitted 3 June, 2025;
originally announced June 2025.
-
Inferring Synergistic and Antagonistic Interactions in Mixtures of Exposures
Authors:
Shounak Chattopadhyay,
Stephanie M. Engel,
David Dunson
Abstract:
There is abundant interest in assessing the joint effects of multiple exposures on human health. This is often referred to as the mixtures problem in environmental epidemiology and toxicology. Classically, studies have examined the adverse health effects of different chemicals one at a time, but there is concern that certain chemicals may act together to amplify each other's effects. Such amplific…
▽ More
There is abundant interest in assessing the joint effects of multiple exposures on human health. This is often referred to as the mixtures problem in environmental epidemiology and toxicology. Classically, studies have examined the adverse health effects of different chemicals one at a time, but there is concern that certain chemicals may act together to amplify each other's effects. Such amplification is referred to as synergistic interaction, while chemicals that inhibit each other's effects have antagonistic interactions. Current approaches for assessing the health effects of chemical mixtures do not explicitly consider synergy or antagonism in the modeling, instead focusing on either parametric or unconstrained nonparametric dose response surface modeling. The parametric case can be too inflexible, while nonparametric methods face a curse of dimensionality that leads to overly wiggly and uninterpretable surface estimates. We propose a Bayesian approach that decomposes the response surface into additive main effects and pairwise interaction effects, and then detects synergistic and antagonistic interactions. Variable selection decisions for each interaction component are also provided. This Synergistic Antagonistic Interaction Detection (SAID) framework is evaluated relative to existing approaches using simulation experiments and an application to data from NHANES.
△ Less
Submitted 29 May, 2024; v1 submitted 17 October, 2022;
originally announced October 2022.
-
mpower: An R Package for Power Analysis of Exposure Mixture Studies via Monte Carlo Simulations
Authors:
Phuc H. Nguyen,
Stephanie M. Engel,
Amy H. Herring
Abstract:
Estimating sample size and statistical power is an essential part of a good study design. This R package allows users to conduct power analysis based on Monte Carlo simulations in settings in which consideration of the correlations between predictors is important. It runs power analyses given a data generative model and an inference model. It can set up a data generative model that preserves depen…
▽ More
Estimating sample size and statistical power is an essential part of a good study design. This R package allows users to conduct power analysis based on Monte Carlo simulations in settings in which consideration of the correlations between predictors is important. It runs power analyses given a data generative model and an inference model. It can set up a data generative model that preserves dependence structures among variables given existing data (continuous, binary, or ordinal) or high-level descriptions of the associations. Users can generate power curves to assess the trade-offs between sample size, effect size, and power of a design. This paper presents tutorials and examples focusing on applications for environmental mixture studies when predictors tend to be moderately to highly correlated. It easily interfaces with several existing and newly developed analysis strategies for assessing associations between exposures and health outcomes. However, the package is sufficiently general to facilitate power simulations in a wide variety of settings.
△ Less
Submitted 14 April, 2024; v1 submitted 16 September, 2022;
originally announced September 2022.
-
Bayesian Matrix Completion for Hypothesis Testing
Authors:
Bora Jin,
David B. Dunson,
Julia E. Rager,
David Reif,
Stephanie M. Engel,
Amy H. Herring
Abstract:
We aim to infer bioactivity of each chemical by assay endpoint combination, addressing sparsity of toxicology data. We propose a Bayesian hierarchical framework which borrows information across different chemicals and assay endpoints, facilitates out-of-sample prediction of activity for chemicals not yet assayed, quantifies uncertainty of predicted activity, and adjusts for multiplicity in hypothe…
▽ More
We aim to infer bioactivity of each chemical by assay endpoint combination, addressing sparsity of toxicology data. We propose a Bayesian hierarchical framework which borrows information across different chemicals and assay endpoints, facilitates out-of-sample prediction of activity for chemicals not yet assayed, quantifies uncertainty of predicted activity, and adjusts for multiplicity in hypothesis testing. Furthermore, this paper makes a novel attempt in toxicology to simultaneously model heteroscedastic errors and a nonparametric mean function, leading to a broader definition of activity whose need has been suggested by toxicologists. Real application identifies chemicals most likely active for neurodevelopmental disorders and obesity.
△ Less
Submitted 6 November, 2022; v1 submitted 17 September, 2020;
originally announced September 2020.
-
A Bayesian approach to the g-formula
Authors:
Alexander P. Keil,
Eric J. Daza,
Stephanie M. Engel,
Jessie P. Buckley,
Jessie K. Edwards
Abstract:
Epidemiologists often wish to estimate quantities that are easy to communicate and correspond to the results of realistic public health scenarios. Methods from causal inference can answer these questions. We adopt the language of potential outcomes under Rubin's original Bayesian framework and show that the parametric g-formula is easily amenable to a Bayesian approach. We show that the frequentis…
▽ More
Epidemiologists often wish to estimate quantities that are easy to communicate and correspond to the results of realistic public health scenarios. Methods from causal inference can answer these questions. We adopt the language of potential outcomes under Rubin's original Bayesian framework and show that the parametric g-formula is easily amenable to a Bayesian approach. We show that the frequentist properties of the Bayesian g-formula suggest it improves the accuracy of estimates of causal effects in small samples or when data may be sparse. We demonstrate our approach to estimate the effect of environmental tobacco smoke on body mass index z-scores among children aged 4-9 years who were enrolled in a longitudinal birth cohort in New York, USA. We give a general algorithm and supply SAS and Stan code that can be adopted to implement our computational approach in both time-fixed and longitudinal data.
△ Less
Submitted 15 December, 2015;
originally announced December 2015.