-
Unified calibration and spatial mapping of fine particulate matter data from multiple low-cost air pollution sensor networks in Baltimore, Maryland
Authors:
Claire Heffernan,
Kirsten Koehler,
Drew R. Gentner,
Roger D. Peng,
Abhirup Datta
Abstract:
Low-cost air pollution sensor networks are increasingly being deployed globally, supplementing sparse regulatory monitoring with localized air quality data. In some areas, like Baltimore, Maryland, there are only a few regulatory (reference) devices but multiple low-cost networks. While there are many available methods to calibrate data from each network individually, separate calibration of each network leads to conflicting air quality predictions. We develop a general Bayesian spatial filtering model combining data from multiple networks and reference devices, providing dynamic calibrations (informed by the latest reference data) and unified predictions (combining information from all available sensors) for the entire region. This method accounts for network-specific bias and noise (observation models), as different networks can use different types of sensors, and uses a Gaussian process (state-space model) to capture spatial correlations. We apply the method to calibrate PM$_{2.5}$ data from Baltimore in June and July 2023 -- a period including days of hazardous concentrations due to wildfire smoke. Our method helps mitigate the effects of preferential sampling of one network in Baltimore, and results in better predictions and narrower confidence intervals. Our approach can be used to calibrate low-cost air pollution sensor data in Baltimore and any other areas with multiple low-cost networks.
Submitted 17 December, 2024;
originally announced December 2024.
-
Infinite Hidden Markov Models for Multiple Multivariate Time Series with Missing Data
Authors:
Lauren Hoskovec,
Matthew D. Koslovsky,
Kirsten Koehler,
Nicholas Good,
Jennifer L. Peel,
John Volckens,
Ander Wilson
Abstract:
Exposure to air pollution is associated with increased morbidity and mortality. Recent technological advancements permit the collection of time-resolved personal exposure data. Such data are often incomplete with missing observations and exposures below the limit of detection, which limit their use in health effects studies. In this paper we develop an infinite hidden Markov model for multiple asynchronous multivariate time series with missing data. Our model is designed to include covariates that can inform transitions among hidden states. We implement beam sampling, a combination of slice sampling and dynamic programming, to sample the hidden states, and a Bayesian multiple imputation algorithm to impute missing data. In simulation studies, our model excels in estimating hidden states and state-specific means and imputing observations that are missing at random or below the limit of detection. We validate our imputation approach on data from the Fort Collins Commuter Study. We show that the estimated hidden states improve imputations for data that are missing at random compared to existing approaches. In a case study of the Fort Collins Commuter Study, we describe the inferential gains obtained from our model including improved imputation of missing data and the ability to identify shared patterns in activity and exposure among repeated sampling days for individuals and among distinct individuals.
Submitted 13 April, 2022;
originally announced April 2022.
-
A dynamic spatial filtering approach to mitigate underestimation bias in field calibrated low-cost sensor air-pollution data
Authors:
Claire Heffernan,
Roger Peng,
Drew R. Gentner,
Kirsten Koehler,
Abhirup Datta
Abstract:
Low-cost air pollution sensors, offering hyper-local characterization of pollutant concentrations, are becoming increasingly prevalent in environmental and public health research. However, low-cost air pollution data can be noisy, biased by environmental conditions, and usually need to be field-calibrated by collocating low-cost sensors with reference-grade instruments. We show, theoretically and empirically, that the common procedure of regression-based calibration using collocated data systematically underestimates high air pollution concentrations, which are critical to diagnose from a health perspective. Current calibration practices also often fail to utilize the spatial correlation in pollutant concentrations. We propose a novel spatial filtering approach to collocation-based calibration of low-cost networks that mitigates the underestimation issue by using an inverse regression. The inverse regression also allows for incorporating spatial correlations through a second-stage model for the true pollutant concentrations using a conditional Gaussian process. Our approach works with one or more collocated sites in the network and is dynamic, leveraging spatial correlation with the latest available reference data. Through extensive simulations, we demonstrate how the spatial filtering substantially improves estimation of pollutant concentrations and measures peak concentrations with greater accuracy. We apply the methodology to the calibration of a low-cost PM2.5 network in Baltimore, Maryland, and diagnose air pollution peaks that are missed by regression calibration.
Submitted 20 February, 2023; v1 submitted 28 March, 2022;
originally announced March 2022.
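Illustrative note (not part of the listing): the underestimation described in the abstract above can be reproduced with a small simulation. The sketch below is not the authors' spatial filtering method; it only contrasts forward regression calibration (reference regressed on sensor) with a plain inverse regression (sensor regressed on reference, then inverted). The gamma-distributed concentrations, the sensor bias of 3.0 + 1.5·y, and the noise level are all assumptions chosen for illustration.

```python
import random
import statistics


def fit_line(u, v):
    """Ordinary least squares fit of v = intercept + slope * u."""
    mean_u, mean_v = statistics.fmean(u), statistics.fmean(v)
    sxx = sum((a - mean_u) ** 2 for a in u)
    sxy = sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v))
    slope = sxy / sxx
    return mean_v - slope * mean_u, slope


rng = random.Random(0)

# Assumed "true" concentrations (right-skewed) and a biased, noisy sensor.
true_pm = [rng.gammavariate(2.0, 6.0) for _ in range(5000)]
sensor = [3.0 + 1.5 * y + rng.gauss(0.0, 6.0) for y in true_pm]

# Forward calibration: regress reference on sensor, then predict directly.
c, d = fit_line(sensor, true_pm)
forward = [c + d * x for x in sensor]

# Inverse regression: regress sensor on reference, then invert the fit.
a, b = fit_line(true_pm, sensor)
inverse = [(x - a) / b for x in sensor]

# Mean prediction error among the highest 10% of true concentrations.
cutoff = sorted(true_pm)[int(0.9 * len(true_pm))]
high = [i for i, y in enumerate(true_pm) if y >= cutoff]
bias_forward = statistics.fmean(forward[i] - true_pm[i] for i in high)
bias_inverse = statistics.fmean(inverse[i] - true_pm[i] for i in high)

print(f"forward bias at high concentrations: {bias_forward:.2f}")
print(f"inverse bias at high concentrations: {bias_inverse:.2f}")
```

In this toy setting the forward fit shrinks predictions toward the overall mean, so the highest concentrations are underestimated, while the inverted fit is roughly unbiased at those levels but noisier point by point.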
-
Characterising complex healthcare systems using network science: The small world of emergency surgery
Authors:
Katharina Kohler,
Ari Ercole
Abstract:
Hospitals are complex systems and optimising their function is critical to the provision of high-quality, cost-effective healthcare. Nevertheless, metrics of performance have to date focused on the performance of individual elements rather than the system as a whole. Manipulation of individual elements of a complex system without an integrative understanding of its function is undesirable and may lead to counter-intuitive outcomes; a holistic metric of hospital function might help design more efficient services. We aimed to characterise the system of peri-operative care for emergency surgical admissions in our tertiary care hospital using network analysis. We used retrospective electronic health record data to construct a weighted directional network of the system. For this we selected all unplanned admissions during a 3.5-year period involving a surgical intervention during the inpatient stay and obtained a set of 16,500 individual inpatient episodes. We then constructed and analysed the structure of this network using established methods from network science such as degree distribution, betweenness centrality and small-world characteristics. The analysis showed the service to be a complex system with scale-free, small-world network properties. This finding has implications for the structure and resilience of the service as such networks, whilst being robust in general, may be vulnerable to outages at specific key nodes. We also identified such potential hubs and bottlenecks in the system based on a variety of network measures. It is hoped that such a holistic, system-wide description of a hospital service may provide better metrics for hospital strain and serve to help planners engineer systems that are as robust as possible to external shocks.
Submitted 5 August, 2019;
originally announced August 2019.
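Illustrative note (not part of the listing): a weighted directional network of the kind described in the abstract above can be built by counting transfers between consecutive locations in each inpatient episode. The sketch below is only a minimal illustration under assumed data; the location names and the three episodes are hypothetical, and it computes weighted in- and out-degree only, not betweenness centrality or small-world measures.

```python
from collections import Counter

# Hypothetical inpatient episodes: each is an ordered list of locations.
episodes = [
    ["ED", "Theatre", "Recovery", "Ward"],
    ["ED", "Ward", "Theatre", "Recovery", "ICU", "Ward"],
    ["ED", "Theatre", "ICU", "Ward"],
]


def build_edges(paths):
    """Count transfers between consecutive locations as directed edge weights."""
    edges = Counter()
    for path in paths:
        for src, dst in zip(path, path[1:]):
            edges[(src, dst)] += 1
    return edges


def weighted_degrees(edges):
    """Return (in_strength, out_strength): summed edge weights per node."""
    in_strength, out_strength = Counter(), Counter()
    for (src, dst), weight in edges.items():
        out_strength[src] += weight
        in_strength[dst] += weight
    return in_strength, out_strength


edges = build_edges(episodes)
in_strength, out_strength = weighted_degrees(edges)

# Nodes ranked by total weighted degree are candidate hubs in this toy network.
nodes = set(in_strength) | set(out_strength)
for node in sorted(nodes, key=lambda n: -(in_strength[n] + out_strength[n])):
    print(node, in_strength[node], out_strength[node])
```

Nodes with high total strength are natural candidates for the hubs and bottlenecks the abstract mentions; a fuller analysis would add path-based measures such as betweenness.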