-
Validity of the total quasi-steady-state approximation in stochastic biochemical reaction networks
Authors:
Yun Min Song,
Kangmin Lee,
Jae Kyoung Kim
Abstract:
Stochastic models for biochemical reaction networks are widely used to explore their complex dynamics but face significant challenges, including difficulties in determining rate constants and high computational costs. To address these issues, model reduction approaches based on deterministic quasi-steady-state approximations (QSSA) have been employed, resulting in propensity functions in the form of deterministic non-elementary reaction functions, such as the Michaelis-Menten equation. In particular, the total QSSA (tQSSA), known for its accuracy in deterministic frameworks, has been perceived as universally valid for stochastic model reduction. However, recent studies have challenged this perception. In this review, we demonstrate that applying tQSSA in stochastic model reduction can distort dynamics, even in cases where the deterministic tQSSA is rigorously valid. This highlights the need for caution when using deterministic QSSA in stochastic model reduction to avoid erroneous conclusions from model simulations.
Submitted 2 March, 2025;
originally announced March 2025.
-
Computational translation framework identifies biochemical reaction networks with special topologies and their long-term dynamics
Authors:
Hyukpyo Hong,
Bryan S. Hernandez,
Jinsu Kim,
Jae Kyoung Kim
Abstract:
Long-term behaviors of biochemical systems are described by steady states in deterministic models and stationary distributions in stochastic models. Obtaining their analytic solutions can be done for limited cases, such as linear or finite-state systems, as it generally requires solving many coupled equations. Interestingly, analytic solutions can be easily obtained when underlying networks have special topologies, called weak reversibility (WR) and zero deficiency (ZD), and the kinetic law follows a generalized form of mass-action kinetics. However, such desired topological conditions do not hold for the majority of cases. Thus, translating networks to have WR and ZD while preserving the original dynamics was proposed. Yet, this approach is limited because manually obtaining the desired network translation among the large number of candidates is challenging. Here, we prove necessary conditions for having WR and ZD after translation, and based on these conditions, we develop a user-friendly computational package, TOWARDZ, that automatically and efficiently identifies translated networks with WR and ZD. This allows us to quantitatively examine how likely it is to obtain WR and ZD after translation depending on the number of species and reactions. Importantly, we also describe how our package can be used to analytically derive steady states of deterministic models and stationary distributions of stochastic models. TOWARDZ provides an effective tool to analyze biochemical systems.
Submitted 2 December, 2022;
originally announced December 2022.
-
An empirical likelihood approach to reduce selection bias in voluntary samples
Authors:
Jae Kwang Kim,
Kosuke Morikawa
Abstract:
We address the weighting problem in voluntary samples under a nonignorable sample selection model. Under the assumption that the sample selection model is correctly specified, we can compute a consistent estimator of the model parameter and construct the propensity score estimator of the population mean. We use the empirical likelihood method to construct the final weights for voluntary samples by incorporating the bias calibration constraints and the benchmarking constraints. Linearization variance estimation of the proposed method is developed. A limited simulation study is also performed to check the performance of the proposed methods.
Submitted 11 May, 2023; v1 submitted 5 November, 2022;
originally announced November 2022.
-
Semiparametric adaptive estimation under informative sampling
Authors:
Kosuke Morikawa,
Yoshikazu Terada,
Jae Kwang Kim
Abstract:
In survey sampling, survey data do not necessarily represent the target population, and the samples are often biased. However, information on the survey weights aids in the elimination of selection bias. The Horvitz-Thompson estimator is a well-known unbiased, consistent, and asymptotically normal estimator; however, it is not efficient. Thus, this study derives the semiparametric efficiency bound for various target parameters by considering the survey weight as a random variable and consequently proposes a semiparametric optimal estimator with certain working models on the survey weights. The proposed estimator is consistent, asymptotically normal, and efficient in a class of the regular and asymptotically linear estimators. Further, a limited simulation study is conducted to investigate the finite sample performance of the proposed method. The proposed method is applied to the 1999 Canadian Workplace and Employee Survey data.
Submitted 3 April, 2024; v1 submitted 11 August, 2022;
originally announced August 2022.
-
Maximum sampled conditional likelihood for informative subsampling
Authors:
HaiYing Wang,
Jae Kwang Kim
Abstract:
Subsampling is a computationally effective approach to extract information from massive data sets when computing resources are limited. After a subsample is taken from the full data, most available methods use an inverse probability weighted (IPW) objective function to estimate the model parameters. The IPW estimator does not fully utilize the information in the selected subsample. In this paper, we propose to use the maximum sampled conditional likelihood estimator (MSCLE) based on the sampled data. We establish the asymptotic normality of the MSCLE and prove that its asymptotic variance-covariance matrix is the smallest among a class of asymptotically unbiased estimators, including the IPW estimator. We further discuss the asymptotic results with the L-optimal subsampling probabilities and illustrate the estimation procedure with generalized linear models. Numerical experiments are provided to evaluate the practical performance of the proposed method.
Submitted 9 October, 2022; v1 submitted 11 November, 2020;
originally announced November 2020.
-
Bootstrap inference for the finite population total under complex sampling designs
Authors:
Zhonglei Wang,
Jae Kwang Kim,
Liuhua Peng
Abstract:
Bootstrap is a useful tool for making statistical inference, but it may provide erroneous results under complex survey sampling. Most studies about bootstrap-based inference are developed under simple random sampling and stratified random sampling. In this paper, we propose a unified bootstrap method applicable to some complex sampling designs, including Poisson sampling and probability-proportional-to-size sampling. Two main features of the proposed bootstrap method are that studentization is used to make inference, and the finite population is bootstrapped based on a multinomial distribution by incorporating the sampling information. We show that the proposed bootstrap method is second-order accurate using the Edgeworth expansion. Two simulation studies are conducted to compare the proposed bootstrap method with the Wald-type method, which is widely used in survey sampling. Results show that the proposed bootstrap method is better in terms of coverage rate, especially when the sample size is limited.
Submitted 6 January, 2019;
originally announced January 2019.
-
Reduction for stochastic biochemical reaction networks with multiscale conservations
Authors:
Jae Kyoung Kim,
Grzegorz A. Rempala,
Hye-Won Kang
Abstract:
Biochemical reaction networks frequently consist of species evolving on multiple timescales. Stochastic simulations of such networks are often computationally challenging, and therefore various methods have been developed to obtain sensible stochastic approximations on the timescale of interest. One of the rigorous and popular approaches is the multiscale approximation method for continuous time Markov processes. In this approach, by scaling species abundances and reaction rates, a family of processes parameterized by a scaling parameter is defined. The limiting process of this family is then used to approximate the original process. However, we find that such approximations become inaccurate when combinations of species with disparate abundances either constitute conservation laws or form virtual slow auxiliary species. To obtain a more accurate approximation in such cases, we propose here an appropriate modification of the original method.
Submitted 19 April, 2017;
originally announced April 2017.
-
Fixed point theory for composite maps on almost dominating extension spaces
Authors:
Ravi P Agarwal,
Jong Kyu Kim,
Donal O'Regan
Abstract:
New fixed point results are presented for $\mathcal{U}_c^{\kappa}(X,X)$ maps in extension type spaces.
Submitted 28 July, 2006;
originally announced July 2006.
-
Finite sample properties of multiple imputation estimators
Authors:
Jae Kwang Kim
Abstract:
Finite sample properties of multiple imputation estimators under the linear regression model are studied. The exact bias of the multiple imputation variance estimator is presented. A method of reducing the bias is presented and simulation is used to make comparisons. We also show that the suggested method can be used for a general class of linear estimators.
Submitted 23 June, 2004;
originally announced June 2004.
-
Norm Estimates for the Difference Between Bochner's Integral and the Convex Combination of Function's Values
Authors:
P. Cerone,
Y. J. Cho,
S. S. Dragomir,
J. K. Kim,
S. S. Kim
Abstract:
Norm estimates are developed between the Bochner integral of a vector-valued function in Banach spaces having the Radon-Nikodym property and the convex combination of function values taken on a division of the interval [a,b].
Submitted 4 September, 2003;
originally announced September 2003.