-
Posterior contraction rates in a sparse non-linear mixed-effects model
Authors:
Marion Naveau,
Maud Delattre,
Laure Sansonnet
Abstract:
Recent works have shown an interest in investigating the frequentist asymptotic properties of Bayesian procedures for high-dimensional linear models under sparsity constraints. However, there exists a gap in the literature regarding analogous theoretical findings for non-linear models within the high-dimensional setting. The current study provides a novel contribution, focusing specifically on a n…
▽ More
Recent works have shown an interest in investigating the frequentist asymptotic properties of Bayesian procedures for high-dimensional linear models under sparsity constraints. However, there exists a gap in the literature regarding analogous theoretical findings for non-linear models within the high-dimensional setting. The current study provides a novel contribution, focusing specifically on a non-linear mixed-effects model. In this model, the residual variance is assumed to be known, while the covariance matrix of the random effects and the regression vector are unknown and must be estimated. The prior distribution for the sparse regression coefficients consists of a mixture of a point mass at zero and a Laplace distribution, while an Inverse-Wishart prior is employed for the covariance parameter of the random effects. First, the effective dimension of this model is bounded with high posterior probabilities. Subsequently, we derive posterior contraction rates for both the covariance parameter and the prediction term of the response vector. Finally, under additional assumptions, the posterior distribution is shown to contract for recovery of the unknown sparse regression vector at the same rate as observed in the linear case.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Efficient preconditioned stochastic gradient descent for estimation in latent variable models
Authors:
Charlotte Baey,
Maud Delattre,
Estelle Kuhn,
Jean-Benoist Leger,
Sarah Lemler
Abstract:
Latent variable models are powerful tools for modeling complex phenomena involving in particular partially observed data, unobserved variables or underlying complex unknown structures. Inference is often difficult due to the latent structure of the model. To deal with parameter estimation in the presence of latent variables, well-known efficient methods exist, such as gradient-based and EM-type al…
▽ More
Latent variable models are powerful tools for modeling complex phenomena involving in particular partially observed data, unobserved variables or underlying complex unknown structures. Inference is often difficult due to the latent structure of the model. To deal with parameter estimation in the presence of latent variables, well-known efficient methods exist, such as gradient-based and EM-type algorithms, but with practical and theoretical limitations. In this paper, we propose as an alternative for parameter estimation an efficient preconditioned stochastic gradient algorithm. Our method includes a preconditioning step based on a positive definite Fisher information matrix estimate. We prove convergence results for the proposed algorithm under mild assumptions for very general latent variables models. We illustrate through relevant simulations the performance of the proposed methodology in a nonlinear mixed effects model and in a stochastic block model.
△ Less
Submitted 22 June, 2023;
originally announced June 2023.
-
Bayesian high-dimensional covariate selection in non-linear mixed-effects models using the SAEM algorithm
Authors:
Marion Naveau,
Guillaume Kon Kam King,
Renaud Rincent,
Laure Sansonnet,
Maud Delattre
Abstract:
High-dimensional variable selection, with many more covariates than observations, is widely documented in standard regression models, but there are still few tools to address it in non-linear mixed-effects models where data are collected repeatedly on several individuals. In this work, variable selection is approached from a Bayesian perspective and a selection procedure is proposed, combining the…
▽ More
High-dimensional variable selection, with many more covariates than observations, is widely documented in standard regression models, but there are still few tools to address it in non-linear mixed-effects models where data are collected repeatedly on several individuals. In this work, variable selection is approached from a Bayesian perspective and a selection procedure is proposed, combining the use of a spike-and-slab prior and the Stochastic Approximation version of the Expectation Maximisation (SAEM) algorithm. Similarly to Lasso regression, the set of relevant covariates is selected by exploring a grid of values for the penalisation parameter. The SAEM approach is much faster than a classical MCMC (Markov chain Monte Carlo) algorithm and our method shows very good selection performances on simulated data. Its flexibility is demonstrated by implementing it for a variety of nonlinear mixed effects models. The usefulness of the proposed method is illustrated on a problem of genetic markers identification, relevant for genomic-assisted selection in plant breeding.
△ Less
Submitted 5 April, 2024; v1 submitted 2 June, 2022;
originally announced June 2022.
-
A review on asymptotic inference in stochastic differential equations with mixed-effects
Authors:
Maud Delattre
Abstract:
This paper is a survey of recent contributions on estimation in stochastic differential equations with mixed-effects. These models involve N stochastic differential equations with common drift and diffusion functions but random parameters that allow for differences between processes. The main objective is to estimate the distribution of the random effects and possibly other fixed parameters that a…
▽ More
This paper is a survey of recent contributions on estimation in stochastic differential equations with mixed-effects. These models involve N stochastic differential equations with common drift and diffusion functions but random parameters that allow for differences between processes. The main objective is to estimate the distribution of the random effects and possibly other fixed parameters that are common to the N processes. While many algorithms have been proposed, the theoretical aspects related to estimation have been little studied. This review article focuses only on theoretical inference for stochastic differential equations with mixed-effects. It has so far only been considered in some very specific classes of mixed-effect diffusion models, observed without measurement error, where explicit estimators can be defined. Within this framework, the asymptotic properties of several estimators, either parametric or nonparametric, are discussed. Different schemes of observations are considered according to the approach, associating a large number of individuals with, in most cases, high-frequency observations of the trajectories.
△ Less
Submitted 16 September, 2020;
originally announced September 2020.
-
Estimating the number of change-points in a two-dimensional segmentation model without penalization
Authors:
V. Brault,
M. Delattre,
E. Lebarbier,
T. Mary-Huard,
C. Lévy-Leduc
Abstract:
In computational biology, numerous recent studies have been dedicated to the analysis of the chromatin structure within the cell by two-dimensional segmentation methods. Motivated by this application, we consider the problem of retrieving the diagonal blocks in a matrix of observations. The theoretical properties of the least-squares estimators of both the boundaries and the number of blocks propo…
▽ More
In computational biology, numerous recent studies have been dedicated to the analysis of the chromatin structure within the cell by two-dimensional segmentation methods. Motivated by this application, we consider the problem of retrieving the diagonal blocks in a matrix of observations. The theoretical properties of the least-squares estimators of both the boundaries and the number of blocks proposed by Lévy-Leduc et al. [2014] are investigated. More precisely, the contribution of the paper is to establish the consistency of these estimators. A surprising consequence of our results is that, contrary to the onedimensional case, a penalty is not needed for retrieving the true number of diagonal blocks. Finally, the results are illustrated on synthetic data.
△ Less
Submitted 11 March, 2016; v1 submitted 10 June, 2015;
originally announced June 2015.