Search | arXiv e-print repository

Posterior contraction rates in a sparse non-linear mixed-effects model

Authors: Marion Naveau, Maud Delattre, Laure Sansonnet

Abstract: Recent works have shown an interest in investigating the frequentist asymptotic properties of Bayesian procedures for high-dimensional linear models under sparsity constraints. However, there exists a gap in the literature regarding analogous theoretical findings for non-linear models within the high-dimensional setting. The current study provides a novel contribution, focusing specifically on a n… ▽ More Recent works have shown an interest in investigating the frequentist asymptotic properties of Bayesian procedures for high-dimensional linear models under sparsity constraints. However, there exists a gap in the literature regarding analogous theoretical findings for non-linear models within the high-dimensional setting. The current study provides a novel contribution, focusing specifically on a non-linear mixed-effects model. In this model, the residual variance is assumed to be known, while the covariance matrix of the random effects and the regression vector are unknown and must be estimated. The prior distribution for the sparse regression coefficients consists of a mixture of a point mass at zero and a Laplace distribution, while an Inverse-Wishart prior is employed for the covariance parameter of the random effects. First, the effective dimension of this model is bounded with high posterior probabilities. Subsequently, we derive posterior contraction rates for both the covariance parameter and the prediction term of the response vector. Finally, under additional assumptions, the posterior distribution is shown to contract for recovery of the unknown sparse regression vector at the same rate as observed in the linear case. △ Less

Submitted 2 May, 2024; originally announced May 2024.

arXiv:2306.12841 [pdf, other]

Efficient preconditioned stochastic gradient descent for estimation in latent variable models

Authors: Charlotte Baey, Maud Delattre, Estelle Kuhn, Jean-Benoist Leger, Sarah Lemler

Abstract: Latent variable models are powerful tools for modeling complex phenomena involving in particular partially observed data, unobserved variables or underlying complex unknown structures. Inference is often difficult due to the latent structure of the model. To deal with parameter estimation in the presence of latent variables, well-known efficient methods exist, such as gradient-based and EM-type al… ▽ More Latent variable models are powerful tools for modeling complex phenomena involving in particular partially observed data, unobserved variables or underlying complex unknown structures. Inference is often difficult due to the latent structure of the model. To deal with parameter estimation in the presence of latent variables, well-known efficient methods exist, such as gradient-based and EM-type algorithms, but with practical and theoretical limitations. In this paper, we propose as an alternative for parameter estimation an efficient preconditioned stochastic gradient algorithm. Our method includes a preconditioning step based on a positive definite Fisher information matrix estimate. We prove convergence results for the proposed algorithm under mild assumptions for very general latent variables models. We illustrate through relevant simulations the performance of the proposed methodology in a nonlinear mixed effects model and in a stochastic block model. △ Less

Submitted 22 June, 2023; originally announced June 2023.

arXiv:2206.01012 [pdf, ps, other]

doi 10.1007/s11222-023-10367-4

Bayesian high-dimensional covariate selection in non-linear mixed-effects models using the SAEM algorithm

Authors: Marion Naveau, Guillaume Kon Kam King, Renaud Rincent, Laure Sansonnet, Maud Delattre

Abstract: High-dimensional variable selection, with many more covariates than observations, is widely documented in standard regression models, but there are still few tools to address it in non-linear mixed-effects models where data are collected repeatedly on several individuals. In this work, variable selection is approached from a Bayesian perspective and a selection procedure is proposed, combining the… ▽ More High-dimensional variable selection, with many more covariates than observations, is widely documented in standard regression models, but there are still few tools to address it in non-linear mixed-effects models where data are collected repeatedly on several individuals. In this work, variable selection is approached from a Bayesian perspective and a selection procedure is proposed, combining the use of a spike-and-slab prior and the Stochastic Approximation version of the Expectation Maximisation (SAEM) algorithm. Similarly to Lasso regression, the set of relevant covariates is selected by exploring a grid of values for the penalisation parameter. The SAEM approach is much faster than a classical MCMC (Markov chain Monte Carlo) algorithm and our method shows very good selection performances on simulated data. Its flexibility is demonstrated by implementing it for a variety of nonlinear mixed effects models. The usefulness of the proposed method is illustrated on a problem of genetic markers identification, relevant for genomic-assisted selection in plant breeding. △ Less

Submitted 5 April, 2024; v1 submitted 2 June, 2022; originally announced June 2022.

Journal ref: Statistics and Computing, 2023, 34 (1), pp.53

arXiv:2009.07516 [pdf, ps, other]

A review on asymptotic inference in stochastic differential equations with mixed-effects

Authors: Maud Delattre

Abstract: This paper is a survey of recent contributions on estimation in stochastic differential equations with mixed-effects. These models involve N stochastic differential equations with common drift and diffusion functions but random parameters that allow for differences between processes. The main objective is to estimate the distribution of the random effects and possibly other fixed parameters that a… ▽ More This paper is a survey of recent contributions on estimation in stochastic differential equations with mixed-effects. These models involve N stochastic differential equations with common drift and diffusion functions but random parameters that allow for differences between processes. The main objective is to estimate the distribution of the random effects and possibly other fixed parameters that are common to the N processes. While many algorithms have been proposed, the theoretical aspects related to estimation have been little studied. This review article focuses only on theoretical inference for stochastic differential equations with mixed-effects. It has so far only been considered in some very specific classes of mixed-effect diffusion models, observed without measurement error, where explicit estimators can be defined. Within this framework, the asymptotic properties of several estimators, either parametric or nonparametric, are discussed. Different schemes of observations are considered according to the approach, associating a large number of individuals with, in most cases, high-frequency observations of the trajectories. △ Less

Submitted 16 September, 2020; originally announced September 2020.

arXiv:1506.03198 [pdf, other]

Estimating the number of change-points in a two-dimensional segmentation model without penalization

Authors: V. Brault, M. Delattre, E. Lebarbier, T. Mary-Huard, C. Lévy-Leduc

Abstract: In computational biology, numerous recent studies have been dedicated to the analysis of the chromatin structure within the cell by two-dimensional segmentation methods. Motivated by this application, we consider the problem of retrieving the diagonal blocks in a matrix of observations. The theoretical properties of the least-squares estimators of both the boundaries and the number of blocks propo… ▽ More In computational biology, numerous recent studies have been dedicated to the analysis of the chromatin structure within the cell by two-dimensional segmentation methods. Motivated by this application, we consider the problem of retrieving the diagonal blocks in a matrix of observations. The theoretical properties of the least-squares estimators of both the boundaries and the number of blocks proposed by Lévy-Leduc et al. [2014] are investigated. More precisely, the contribution of the paper is to establish the consistency of these estimators. A surprising consequence of our results is that, contrary to the onedimensional case, a penalty is not needed for retrieving the true number of diagonal blocks. Finally, the results are illustrated on synthetic data. △ Less

Submitted 11 March, 2016; v1 submitted 10 June, 2015; originally announced June 2015.

Comments: 30 pages, 8 figures

Showing 1–5 of 5 results for author: Delattre, M