-
Model-based calibration of gear-specific fish abundance survey data as a change-of-support problem
Authors:
Grace S. Chiu,
Anton H. Westveld,
Mark A. Albins,
Kevin M. Boswell,
John M. Hoenig,
Sean P. Powers,
S. Lynne Stokes,
Allison L. White
Abstract:
In a continental-scale fish abundance study, a major challenge in deriving an absolute abundance estimate lies in the fact that regional surveys deploy different gear types, each with its unique field of view, producing gear-specific relative abundance data. Thus, data from regional surveys in the study must be converted from the gear-specific relative scale to an absolute scale before being combined to estimate a continental scale absolute abundance. In this paper, we develop a tool that takes gear-based data as input, and produces as output the required conversion, with associated uncertainty. Methodologically, this tool is operationalized from a Bayesian hierarchical model which we develop in an inferential context that is akin to the change-of-support problem often encountered in spatial studies; the actual context here is to reconcile abundance data at various gear-specific scales, some being relative, and others, absolute. We consider data from a small-scale calibration experiment in which 2 to 4 underwater video camera types, as well as an acoustic echosounder, were simultaneously deployed on each of 21 boat trips. While acoustic fish signals are recorded along transects on the absolute scale, they are subject to confounding from acoustically similar species, thus requiring an externally derived correction factor. Conversely, a camera allows visual distinction between species but records data on a gear-specific relative scale. Our statistical modeling framework reflects the relationship among all 5 gear types across the 21 trips, and the resulting model is used to derive calibration formulae to translate relative abundance data to the corrected absolute abundance scale whenever a camera is deployed alone. Cross-validation is conducted using mark-recapture abundance estimates. We also briefly discuss the case when one camera type is deployed alongside the echosounder.
Submitted 9 May, 2025;
originally announced May 2025.
-
A New Perspective to Fish Trajectory Imputation: A Methodology for Spatiotemporal Modeling of Acoustically Tagged Fish Data
Authors:
Mahshid Ahmadian,
Edward L. Boone,
Grace S. Chiu
Abstract:
The focus of this paper is a key component of a methodology for understanding, interpolating, and predicting fish movement patterns based on spatiotemporal data recorded by spatially static acoustic receivers. Unlike GPS trackers which emit satellite signals from the animal's location, acoustic receivers are akin to stationary motion sensors that record movements within their detection range. Thus, for periods of time, fish may be far from the receivers, resulting in the absence of observations. The lack of information on the fish's location for extended time periods poses challenges to the understanding of fish movement patterns, and hence, the identification of proper statistical inference frameworks for modeling the trajectories. As the initial step in our methodology, in this paper, we devise and implement a simulation-based imputation strategy that relies on both Markov chain and random-walk principles to enhance our dataset over time. This methodology will be generalizable and applicable to all fish species with similar migration patterns or data with similar structures due to the use of static acoustic receivers.
Submitted 29 January, 2025; v1 submitted 23 August, 2024;
originally announced August 2024.
-
Statistics did not prove that the Huanan Seafood Wholesale Market was the early epicenter of the COVID-19 pandemic
Authors:
Dietrich Stoyan,
Sung Nok Chiu
Abstract:
In a recent prominent study, Worobey et al. (2022, Science, 377, pp. 951–9) purported to demonstrate statistically that the Huanan Seafood Wholesale Market was the epicenter of the early COVID-19 epidemic. We show that this statistical conclusion is invalid on two grounds: (1) The assumption that a centroid of early case locations or another simply constructed point is the origin of an epidemic is unproved. (2) A Monte Carlo test used to conclude that no other location than the seafood market can be the origin is flawed. Hence, the question of the origin of the pandemic has not been answered by their statistical analysis.
Submitted 22 November, 2023; v1 submitted 22 August, 2022;
originally announced August 2022.
-
Spatiotemporal Modeling of Nursery Habitat Using Bayesian Inference: Environmental Drivers of Juvenile Blue Crab Abundance
Authors:
A. Challen Hyman,
Grace S. Chiu,
Mary C. Fabrizio,
Romuald N. Lipcius
Abstract:
Nursery grounds are favorable for growth and survival of juvenile fish and crustaceans through abundant food resources and refugia, and enhance secondary production of populations. While small-scale studies remain important tools to assess nursery value of habitats, targeted applications that unify survey data over large spatiotemporal scales are vital to generalize inference of nursery function, identify highly productive regions, and inform management strategies. Using 21 years of GIS and spatiotemporally indexed field survey data on potential nursery habitats, we constructed five Bayesian models with varying spatiotemporal dependence structures to infer nursery habitat value for juveniles of the blue crab C. sapidus within three tributaries in lower Chesapeake Bay. Out-of-sample predictions of juvenile counts from a fully nonseparable spatiotemporal model outperformed predictions from simpler models. Salt marsh surface area, turbidity, and their interaction showed the strongest associations with abundance, all of them positive. Relative seagrass area, previously emphasized as the most valuable nursery in small spatial-scale studies, was not associated with abundance. Hence, we argue that salt marshes should be considered a key nursery habitat for blue crabs, even amidst extensive seagrass beds. Moreover, identification of nurseries should be based on investigations at broad spatiotemporal scales incorporating multiple potential nursery habitats, and on rigorously addressing spatiotemporal dependence.
Submitted 17 December, 2021;
originally announced January 2022.
-
Global Trends and Predictors of Face Mask Usage During the COVID-19 Pandemic
Authors:
Elena Badillo-Goicoechea,
Ting-Hsuan Chang,
Esther Kim,
Sarah LaRocca,
Katherine Morris,
Xiaoyi Deng,
Samantha Chiu,
Adrianne Bradford,
Andres Garcia,
Christoph Kern,
Curtiss Cobb,
Frauke Kreuter,
Elizabeth A. Stuart
Abstract:
Background: Guidelines and recommendations from public health authorities related to face masks have been essential in containing the COVID-19 pandemic. We assessed the prevalence and correlates of mask usage during the pandemic.
Methods: We examined a total of 13,723,810 responses to a daily cross-sectional representative online survey in 38 countries, from respondents who completed the survey between April 23, 2020 and October 31, 2020 and reported having been in public at least once during the last seven days. The outcome was individual face mask usage in public settings, and the predictors were country fixed effects, country-level mask policy stringency, calendar time, individual sociodemographic factors, and health prevention behaviors. Associations were modelled using survey-weighted multivariable logistic regression.
Findings: Mask-wearing varied over time and across the 38 countries. While some countries consistently showed high prevalence throughout, in other countries mask usage increased gradually, and a few other countries remained at low prevalence. Controlling for time and country fixed effects, sociodemographic factors (older age, female gender, education, urbanicity) and stricter mask-related policies were significantly associated with higher mask usage in public settings, while social behaviors considered risky in the context of the pandemic (going out to large events, restaurants, shopping centers, and socializing outside of the household) were associated with lower mask use.
Interpretation: The decision to wear a face mask in public settings is significantly associated with sociodemographic factors, risky social behaviors, and mask policies. This has important implications for health prevention policies and messaging, including the potential need for more targeted policy and messaging design.
Submitted 8 January, 2021; v1 submitted 21 December, 2020;
originally announced December 2020.
-
Latent Causal Socioeconomic Health Index
Authors:
Swen Kuh,
Grace S. Chiu,
Anton H. Westveld
Abstract:
This research develops a model-based LAtent Causal Socioeconomic Health (LACSH) index at the national level. Motivated by the need for a holistic national well-being index, we build upon the latent health factor index (LHFI) approach that has been used to assess the unobservable ecological/ecosystem health. LHFI integratively models the relationship between metrics, latent health, and covariates that drive the notion of health. In this paper, the LHFI structure is integrated with spatial modeling and statistical causal modeling. Our efforts are focused on developing the integrated framework to facilitate the understanding of how an observational continuous variable might have causally affected a latent trait that exhibits spatial correlation. A novel visualization technique to evaluate covariate balance is also introduced for the case of a continuous policy (treatment) variable. Our resulting LACSH framework and visualization tool are illustrated through two global case studies on national socioeconomic health (latent trait), each with various metrics and covariates pertaining to different aspects of societal health, and the treatment variable being mandatory maternity leave days and government expenditure on healthcare, respectively. We validate our model by two simulation studies. All approaches are structured in a Bayesian hierarchical framework and results are obtained by Markov chain Monte Carlo techniques.
Submitted 10 October, 2023; v1 submitted 24 September, 2020;
originally announced September 2020.
-
Low Complexity Sequential Search with Size-Dependent Measurement Noise
Authors:
Sung-En Chiu,
Tara Javidi
Abstract:
This paper considers a target localization problem where at any given time an agent can choose a region to query for the presence of the target in that region. The measurement noise is assumed to be increasing with the size of the query region the agent chooses. Motivated by practical applications such as initial beam alignment in array processing, heavy hitter detection in networking, and visual search in robotics, we consider practically important complexity constraints/metrics: \textit{time complexity}, \textit{computational and memory complexity}, and the complexity of possible query sets in terms of geometry and cardinality.
Two novel search strategies, $dyaPM$ and $hiePM$, are proposed. Pertinent to the practicality of our solutions, $dyaPM$ and $hiePM$ use a connected query geometry (i.e., the query set is always a connected set) and are implemented with low computational and memory complexity. Additionally, $hiePM$ has a hierarchical structure and, hence, a further reduction in the cardinality of possible query sets, making $hiePM$ practically suitable for applications such as beamforming in array processing, where memory limitations favor a smaller codebook size.
Through a unified analysis with Extrinsic Jensen Shannon (EJS) Divergence, $dyaPM$ is shown to be asymptotically optimal in search time complexity (asymptotic in both resolution (rate) and error (reliability)). On the other hand, $hiePM$ is shown to be near-optimal in rate. In addition, both $hiePM$ and $dyaPM$ are shown to outperform prior work in the non-asymptotic regime.
Submitted 1 September, 2020; v1 submitted 15 May, 2020;
originally announced May 2020.
-
Sequential Learning of CSI for MmWave Initial Alignment
Authors:
Nancy Ronquillo,
Sung-En Chiu,
Tara Javidi
Abstract:
MmWave communications aim to meet the demand for higher data rates by using highly directional beams with access to larger bandwidth. An inherent challenge is acquiring the channel state information (CSI) necessary for mmWave transmission. We consider the problem of adaptive and sequential learning of the CSI during the mmWave initial alignment phase of communication. We focus on the single-user scenario with a single dominant path, where the problem is equivalent to acquiring an optimal beamforming vector such that, ideally, the resulting beams point in the direction of the angle of arrival with the desired resolution. We extend our prior work by proposing two algorithms that adaptively and sequentially select beamforming vectors for learning the CSI, and that formulate a Bayesian update to account for the time-varying fading model. Numerically, we analyze the outage probability and expected spectral efficiency of our proposed algorithms and demonstrate improvements over strategies that utilize a practical hierarchical codebook.
Submitted 29 December, 2019;
originally announced December 2019.
-
Modeling National Latent Socioeconomic Health and Examination of Policy Effects via Causal Inference
Authors:
F. Swen Kuh,
Grace S. Chiu,
Anton H. Westveld
Abstract:
This research develops a socioeconomic health index for nations through a model-based approach which incorporates spatial dependence and examines the impact of a policy through a causal modeling framework. As the gross domestic product (GDP) has been regarded as a dated measure and tool for benchmarking a nation's economic performance, there has been a growing consensus for an alternative measure, such as a composite 'wellbeing' index, to holistically capture a country's socioeconomic health performance. Many conventional ways of constructing wellbeing/health indices involve combining different observable metrics, such as life expectancy and education level, to form an index. However, health is inherently latent with metrics actually being observable indicators of health. In contrast to the GDP or other conventional health indices, our approach provides a holistic quantification of the overall 'health' of a nation. We build upon the latent health factor index (LHFI) approach that has been used to assess the unobservable ecological/ecosystem health. This framework integratively models the relationship between metrics, the latent health, and the covariates that drive the notion of health. In this paper, the LHFI structure is integrated with spatial modeling and statistical causal modeling, so as to evaluate the impact of a policy variable (mandatory maternity leave days) on a nation's socioeconomic health, while formally accounting for spatial dependency among the nations. We apply our model to countries around the world using data on various metrics and potential covariates pertaining to different aspects of societal health. The approach is structured in a Bayesian hierarchical framework and results are obtained by Markov chain Monte Carlo techniques.
Submitted 1 November, 2019;
originally announced November 2019.
-
Active Learning and CSI Acquisition for mmWave Initial Alignment
Authors:
Sung-En Chiu,
Nancy Ronquillo,
Tara Javidi
Abstract:
Millimeter wave (mmWave) communication with large antenna arrays is a promising technique to enable extremely high data rates due to the large available bandwidth in mmWave frequency bands. In addition, given the knowledge of an optimal directional beamforming vector, large antenna arrays have been shown to overcome both the severe signal attenuation in mmWave as well as the interference problem. However, fundamental limits on achievable learning rate of an optimal beamforming vector remain.
This paper considers the problem of adaptive and sequential optimization of the beamforming vectors during the initial access phase of communication. With a single-path channel model, the problem is reduced to actively learning the Angle-of-Arrival (AoA) of the signal sent from the user to the Base Station (BS). Drawing on the recent results in the design of a hierarchical beamforming codebook [1], sequential measurement dependent noisy search strategies [2], and active learning from an imperfect labeler [3], an adaptive and sequential alignment algorithm is proposed.
An upper bound on the expected search time of the proposed algorithm is derived via Extrinsic Jensen-Shannon Divergence, which demonstrates that the search time of the proposed algorithm asymptotically matches the performance of the noiseless bisection search up to a constant factor. Furthermore, the upper bound shows that the acquired AoA error probability decays exponentially fast with the search time, with an exponent that is a decreasing function of the acquisition rate.
Numerically, the proposed algorithm is compared with prior work, and a significant improvement in the system communication rate is observed. Most notably, in the relevant regime of low (-10dB to 5dB) raw SNR, this establishes the first practically viable solution for initial access and, hence, the first demonstration of stand-alone mmWave communication.
Submitted 3 September, 2019; v1 submitted 18 December, 2018;
originally announced December 2018.
-
Therapeutic hypothermia: quantification of the transition of core body temperature using the flexible mixture bent-cable model for longitudinal data
Authors:
Shahedul A Khan,
Grace S Chiu,
Joel A Dubin
Abstract:
By reducing core body temperature, T_c, induced hypothermia is a therapeutic tool to prevent brain damage resulting from physical trauma. However, all physiological systems begin to slow down due to hypothermia, which in turn can result in increased risk of mortality. Therefore, quantification of the transition of T_c to early hypothermia is of great clinical interest. Conceptually, T_c may exhibit either a gradual or an abrupt transition. Bent-cable regression is an appealing statistical tool to model such data due to the model's flexibility and highly interpretable regression coefficients. It handles more flexibly the kinds of data that have traditionally been handled by low-order polynomial models (for gradual transition) or piecewise linear changepoint models (for abrupt change). We consider a rat model for humans to quantify the temporal trend of T_c, primarily to address the question: What is the critical time point associated with a breakdown in the compensatory mechanisms following the start of hypothermia therapy? To this end, we develop a Bayesian modelling framework for bent-cable regression of longitudinal data to simultaneously account for gradual and abrupt transitions. Our analysis reveals that: (a) about 39% of rats exhibit a gradual transition in T_c; (b) the critical time point is approximately the same regardless of transition type; (c) both transition types show a significant increase of T_c followed by a significant decrease.
Submitted 14 April, 2013; v1 submitted 10 October, 2012;
originally announced October 2012.
-
Assessing the Health of Richibucto Estuary with the Latent Health Factor Index
Authors:
Margaret Wu,
Grace S. Chiu,
Lin Lu
Abstract:
The ability to quantitatively assess the health of an ecosystem is often of great interest to those tasked with monitoring and conserving ecosystems. For decades, research in this area has relied upon multimetric indices of various forms. Although indices may be numbers, many are constructed based on procedures that are highly qualitative in nature, thus limiting the quantitative rigour of the practical interpretations made from these indices. The statistical modelling approach to construct the latent health factor index (LHFI) was recently developed to express ecological data, collected to construct conventional multimetric health indices, in a rigorous quantitative model that integrates qualitative features of ecosystem health and preconceived ecological relationships among such features. This hierarchical modelling approach allows (a) statistical inference of health for observed sites and (b) prediction of health for unobserved sites, all accompanied by formal uncertainty statements. Thus far, the LHFI approach has been demonstrated and validated on freshwater ecosystems. The goal of this paper is to adapt this approach to modelling estuarine ecosystem health, particularly that of the previously unassessed system in Richibucto in New Brunswick, Canada. Field data correspond to biotic health metrics that constitute the AZTI marine biotic index (AMBI) and abiotic predictors preconceived to influence biota. We also briefly discuss related LHFI research involving additional metrics that form the infaunal trophic index (ITI). Our paper is the first to construct a scientifically sensible model to rigorously identify the collective explanatory capacity of salinity, distance downstream, channel depth, and silt-clay content (all regarded a priori as qualitatively important abiotic drivers) towards site health in the Richibucto ecosystem.
Submitted 24 June, 2013; v1 submitted 27 August, 2012;
originally announced August 2012.
-
Understanding thermoregulatory transitions during haemorrhage by piecewise regression
Authors:
Penny S Reynolds,
Grace S Chiu
Abstract:
Transition points are common in physiological processes. However, the transition between normothermia and hypothermia during haemorrhagic shock has rarely been systematically quantified from intensive time series data. We estimated the critical transition point (CTP) and provided confidence intervals for core body temperature response to acute severe haemorrhage in a conscious rat model. Estimates were obtained by traditional piecewise linear regression (broken stick model) and compared to those from the more novel bent cable regression. Bent cable regression relaxes the assumption of an abrupt point transition, and thus allows the capture of a potentially gradual transition phase; the broken stick is a special case of the bent cable model. We calculated two types of confidence intervals, assuming either independent or autoregressive structure for the residuals. In spite of the severity of the haemorrhage, median temperature change was minor (0.8 °C; IQR 0.57–1.31 °C) and only four of 38 rats were clinically hypothermic (core temperature < 35 °C). However, a transition could be estimated for 23 rats. Bent cable fits were superior when the transition appeared to be gradual rather than abrupt. In all cases, assuming independence gave incorrect uncertainty estimates of CTP. For 15 animals, neither model could be fitted because of irregular temperature profiles that did not conform to the assumption of a single transition. Arbitrary imposition of broken stick fits on a gradual transition profile and assuming independent rather than autocorrelated error may result in misleading estimates of CTP. Identification of the onset of irreversible shock will require further quantification of appropriate time-dependent physiological variables and their behaviour during haemorrhage.
Submitted 26 June, 2010;
originally announced June 2010.
-
A Statistical Social Network Model for Consumption Data in Food Webs
Authors:
Grace S. Chiu,
Anton H. Westveld
Abstract:
We adapt existing statistical modeling techniques for social networks to study consumption data observed in trophic food webs. These data describe the feeding volume (non-negative) among organisms grouped into nodes, called trophic species, that form the food web. Model complexity arises due to the large number of zeros in the data, as each node in the web is predator/prey to only a small number of other trophic species. Many of the zeros are regarded as structural (non-random) in the context of feeding behavior. The presence of basal prey and top predator nodes (those that never consume and those that are never consumed, with probability 1) adds further complexity to the statistical modeling. We develop a special statistical social network model to account for such network features. The model is applied to two empirical food webs; the focus is on the web for which the population size of seals is of concern to various commercial fisheries.
△ Less
Submitted 6 September, 2013; v1 submitted 23 June, 2010;
originally announced June 2010.