Skip to main content

Showing 1–43 of 43 results for author: Alquier, P

Searching in archive math. Search in all archives.
.
  1. arXiv:2412.18539  [pdf, ps, other

    stat.ML cs.LG math.ST

    Convergence of Statistical Estimators via Mutual Information Bounds

    Authors: El Mahdi Khribch, Pierre Alquier

    Abstract: Recent advances in statistical learning theory have revealed profound connections between mutual information (MI) bounds, PAC-Bayesian theory, and Bayesian nonparametrics. This work introduces a novel mutual information bound for statistical models. The derived bound has wide-ranging applications in statistical inference. It yields improved contraction rates for fractional posteriors in Bayesian n… ▽ More

    Submitted 24 December, 2024; originally announced December 2024.

  2. arXiv:2402.10506  [pdf, other

    math.ST math.PR

    Optimistic Estimation of Convergence in Markov Chains with the Average-Mixing Time

    Authors: Geoffrey Wolfer, Pierre Alquier

    Abstract: The convergence rate of a Markov chain to its stationary distribution is typically assessed using the concept of total variation mixing time. However, this worst-case measure often yields pessimistic estimates and is challenging to infer from observations. In this paper, we advocate for the use of the average-mixing time as a more optimistic and demonstrably easier-to-estimate alternative. We furt… ▽ More

    Submitted 23 July, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  3. arXiv:2210.09756  [pdf, ps, other

    math.ST

    Dimension-free Bounds for Sum of Dependent Matrices and Operators with Heavy-Tailed Distribution

    Authors: Shogo Nakakita, Pierre Alquier, Masaaki Imaizumi

    Abstract: We study the deviation inequality for a sum of high-dimensional random matrices and operators with dependence and arbitrary heavy tails. There is an increase in the importance of the problem of estimating high-dimensional matrices, and dependence and heavy-tail properties of data are among the most critical topics currently. In this paper, we derive a dimension-free upper bound on the deviation, t… ▽ More

    Submitted 21 October, 2022; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: 33 pages

  4. arXiv:2210.06672  [pdf, other

    math.ST cs.LG stat.ML

    Variance-Aware Estimation of Kernel Mean Embedding

    Authors: Geoffrey Wolfer, Pierre Alquier

    Abstract: An important feature of kernel mean embeddings (KME) is that the rate of convergence of the empirical KME to the true distribution KME can be bounded independently of the dimension of the space, properties of the distribution and smoothness features of the kernel. We show how to speed-up convergence by leveraging variance information in the reproducing kernel Hilbert space. Furthermore, we show th… ▽ More

    Submitted 16 April, 2025; v1 submitted 12 October, 2022; originally announced October 2022.

  5. arXiv:2206.08619  [pdf, ps, other

    stat.ME math.ST stat.CO

    Optimal quasi-Bayesian reduced rank regression with incomplete response

    Authors: The Tien Mai, Pierre Alquier

    Abstract: The aim of reduced rank regression is to connect multiple response variables to multiple predictors. This model is very popular, especially in biostatistics where multiple measurements on individuals can be re-used to predict multiple outputs. Unfortunately, there are often missing data in such datasets, making it difficult to use standard estimation tools. In this paper, we study the problem of r… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

  6. arXiv:2206.06991  [pdf, other

    stat.ME math.ST stat.CO

    Concentration of discrepancy-based approximate Bayesian computation via Rademacher complexity

    Authors: Sirio Legramanti, Daniele Durante, Pierre Alquier

    Abstract: There has been increasing interest on summary-free solutions for approximate Bayesian computation (ABC) which replace distances among summaries with discrepancies between the empirical distributions of the observed data and the synthetic samples generated under the proposed parameter values. The success of these strategies has motivated theoretical studies on the limiting properties of the induced… ▽ More

    Submitted 24 January, 2025; v1 submitted 14 June, 2022; originally announced June 2022.

  7. arXiv:2110.11216  [pdf, other

    stat.ML cs.LG math.ST

    User-friendly introduction to PAC-Bayes bounds

    Authors: Pierre Alquier

    Abstract: Aggregated predictors are obtained by making a set of basic predictors vote according to some weights, that is, to some probability distribution. Randomized predictors are obtained by sampling in a set of basic predictors, according to some prescribed probability distribution. Thus, aggregated and randomized predictors have in common that they are not defined by a minimization problem, but by… ▽ More

    Submitted 28 February, 2025; v1 submitted 21 October, 2021; originally announced October 2021.

    Journal ref: Foundations and Trends in Machine Learning, 2024, vol. 17, no. 2, pp. 174-303

  8. arXiv:2102.08685  [pdf, ps, other

    math.PR math.OC stat.ML

    Deviation inequalities for stochastic approximation by averaging

    Authors: Xiequan Fan, Pierre Alquier, Paul Doukhan

    Abstract: We introduce a class of Markov chains, that contains the model of stochastic approximation by averaging and non-averaging. Using martingale approximation method, we establish various deviation inequalities for separately Lipschitz functions of such a chain, with different moment conditions on some dominating random variables of martingale differences.Finally, we apply these inequalities to the sto… ▽ More

    Submitted 18 February, 2022; v1 submitted 17 February, 2021; originally announced February 2021.

    Comments: 35 pages

    MSC Class: 60G42; 60J05; 60F10; 60E15

    Journal ref: Stochastic Processes and their Applications 152 (2022)

  9. Tight Risk Bound for High Dimensional Time Series Completion

    Authors: Pierre Alquier, Nicolas Marie, Amélie Rosier

    Abstract: Initially designed for independent datas, low-rank matrix completion was successfully applied in many domains to the reconstruction of partially observed high-dimensional time series. However, there is a lack of theory to support the application of these methods to dependent datas. In this paper, we propose a general model for multivariate, partially observed time series. We show that the least-sq… ▽ More

    Submitted 11 March, 2022; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: 25 pages, 4 figures

    Journal ref: Electronic Journal of Statistics 16, 1, 3001-3035, 2022

  10. arXiv:2102.02504  [pdf, other

    stat.ML cs.LG math.ST stat.CO

    Meta-strategy for Learning Tuning Parameters with Guarantees

    Authors: Dimitri Meunier, Pierre Alquier

    Abstract: Online learning methods, like the online gradient algorithm (OGA) and exponentially weighted aggregation (EWA), often depend on tuning parameters that are difficult to set in practice. We consider an online meta-learning scenario, and we propose a meta-strategy to learn these parameters from past tasks. Our strategy is based on the minimization of a regret bound. It allows to learn the initializat… ▽ More

    Submitted 6 August, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

    Journal ref: Entropy, 2021, vol. 23, no. 10, 1257

  11. arXiv:2010.00408  [pdf, other

    stat.ME math.ST stat.CO

    Estimation of copulas via Maximum Mean Discrepancy

    Authors: Pierre Alquier, Badr-Eddine Chérief-Abdellatif, Alexis Derumigny, Jean-David Fermanian

    Abstract: This paper deals with robust inference for parametric copula models. Estimation using Canonical Maximum Likelihood might be unstable, especially in the presence of outliers. We propose to use a procedure based on the Maximum Mean Discrepancy (MMD) principle. We derive non-asymptotic oracle inequalities, consistency and asymptotic normality of this new estimator. In particular, the oracle inequalit… ▽ More

    Submitted 14 January, 2022; v1 submitted 1 October, 2020; originally announced October 2020.

  12. arXiv:2006.00840  [pdf, ps, other

    math.ST

    Universal Robust Regression via Maximum Mean Discrepancy

    Authors: Pierre Alquier, Mathieu Gerber

    Abstract: Many modern datasets are collected automatically and are thus easily contaminated by outliers. This led to a regain of interest in robust estimation, including new notions of robustness such as robustness to adversarial contamination of the data. However, most robust estimation methods are designed for a specific model. Notably, many methods were proposed recently to obtain robust estimators in li… ▽ More

    Submitted 4 May, 2023; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: 51 pages, 5 tables (final version)

  13. arXiv:1912.05737  [pdf, other

    math.ST cs.LG stat.CO stat.ME

    Finite sample properties of parametric MMD estimation: robustness to misspecification and dependence

    Authors: Badr-Eddine Chérief-Abdellatif, Pierre Alquier

    Abstract: Many works in statistics aim at designing a universal estimation procedure, that is, an estimator that would converge to the best approximation of the (unknown) data generating distribution in a model, without any assumption on this distribution. This question is of major interest, in particular because the universality property leads to the robustness of the estimator. In this paper, we tackle th… ▽ More

    Submitted 13 February, 2025; v1 submitted 11 December, 2019; originally announced December 2019.

    Journal ref: Bernoulli, 2022, vol. 28(1), no. 1, pp. 181-213

  14. arXiv:1909.13339  [pdf, other

    math.ST cs.LG stat.ML

    MMD-Bayes: Robust Bayesian Estimation via Maximum Mean Discrepancy

    Authors: Badr-Eddine Chérief-Abdellatif, Pierre Alquier

    Abstract: In some misspecified settings, the posterior distribution in Bayesian statistics may lead to inconsistent estimates. To fix this issue, it has been suggested to replace the likelihood by a pseudo-likelihood, that is the exponential of a loss function enjoying suitable robustness properties. In this paper, we build a pseudo-likelihood based on the Maximum Mean Discrepancy, defined via an embedding… ▽ More

    Submitted 11 December, 2019; v1 submitted 29 September, 2019; originally announced September 2019.

  15. arXiv:1905.00959  [pdf, other

    math.ST stat.ME stat.ML

    High dimensional VAR with low rank transition

    Authors: Pierre Alquier, Karine Bertin, Paul Doukhan, Rémy Garnier

    Abstract: We propose a vector auto-regressive (VAR) model with a low-rank constraint on the transition matrix. This new model is well suited to predict high-dimensional series that are highly correlated, or that are driven by a small number of hidden factors. We study estimation, prediction, and rank selection for this model in a very general setting. Our method shows excellent performances on a wide variet… ▽ More

    Submitted 10 February, 2020; v1 submitted 2 May, 2019; originally announced May 2019.

    Journal ref: Statistics and Computing, 2020, vol. 30, pp. 1139-1153

  16. arXiv:1904.03920  [pdf, other

    stat.ML cs.LG math.ST stat.CO

    A Generalization Bound for Online Variational Inference

    Authors: Badr-Eddine Chérief-Abdellatif, Pierre Alquier, Mohammad Emtiyaz Khan

    Abstract: Bayesian inference provides an attractive online-learning framework to analyze sequential data, and offers generalization guarantees which hold even with model mismatch and adversaries. Unfortunately, exact Bayesian inference is rarely feasible in practice and approximation methods are usually employed, but do such methods preserve the generalization properties of Bayesian inference ? In this pape… ▽ More

    Submitted 10 December, 2019; v1 submitted 8 April, 2019; originally announced April 2019.

    Comments: Published in the proceedings of ACML 2019

    Journal ref: Proceedings in Machine Learning Research, 2019, vol. 101, pp. 662-677

  17. Matrix factorization for multivariate time series analysis

    Authors: Pierre Alquier, Nicolas Marie

    Abstract: Matrix factorization is a powerful data analysis tool. It has been used in multivariate time series analysis, leading to the decomposition of the series in a small set of latent factors. However, little is known on the statistical performances of matrix factorization for time series. In this paper, we extend the results known for matrix estimation in the i.i.d setting to time series. Moreover, we… ▽ More

    Submitted 12 October, 2019; v1 submitted 13 March, 2019; originally announced March 2019.

    Comments: 16 pages

    Journal ref: Electronic Journal of Statistics 13, 2, 4346-4366, 2019

  18. arXiv:1805.05054  [pdf, ps, other

    math.ST stat.CO stat.ME

    Consistency of Variational Bayes Inference for Estimation and Model Selection in Mixtures

    Authors: Badr-Eddine Chérief-Abdellatif, Pierre Alquier

    Abstract: Mixture models are widely used in Bayesian statistics and machine learning, in particular in computational biology, natural language processing and many other fields. Variational inference, a technique for approximating intractable posteriors thanks to optimization algorithms, is extremely popular in practice when dealing with complex models such as mixtures. The contribution of this paper is two-… ▽ More

    Submitted 12 August, 2018; v1 submitted 14 May, 2018; originally announced May 2018.

    Journal ref: Electronic Journal of Statistics, 2018, vol. 12, no. 2, pp. 2995-3035

  19. arXiv:1706.09293  [pdf, ps, other

    math.ST cs.LG

    Concentration of tempered posteriors and of their variational approximations

    Authors: Pierre Alquier, James Ridgway

    Abstract: While Bayesian methods are extremely popular in statistics and machine learning, their application to massive datasets is often challenging, when possible at all. Indeed, the classical MCMC algorithms are prohibitively slow when both the model dimension and the sample size are large. Variational Bayesian methods aim at approximating the posterior by a distribution in a tractable family. Thus, MCMC… ▽ More

    Submitted 22 April, 2019; v1 submitted 28 June, 2017; originally announced June 2017.

  20. arXiv:1702.01402  [pdf, other

    math.ST

    Estimation bounds and sharp oracle inequalities of regularized procedures with Lipschitz loss functions

    Authors: Pierre Alquier, Vincent Cottet, Guillaume Lecué

    Abstract: We obtain estimation error rates and sharp oracle inequalities for regularization procedures of the form \begin{equation*} \hat f \in argmin_{f\in F}\left(\frac{1}{N}\sum_{i=1}^N\ell(f(X_i), Y_i)+λ\|f\|\right) \end{equation*} when $\|\cdot\|$ is any norm, $F$ is a convex class of functions and $\ell$ is a Lipschitz loss function satisfying a Bernstein condition over $F$. We explore both the bo… ▽ More

    Submitted 7 February, 2017; v1 submitted 5 February, 2017; originally announced February 2017.

  21. Simpler PAC-Bayesian Bounds for Hostile Data

    Authors: Pierre Alquier, Benjamin Guedj

    Abstract: PAC-Bayesian learning bounds are of the utmost interest to the learning community. Their role is to connect the generalization ability of an aggregation distribution $ρ$ to its empirical risk and to its Kullback-Leibler divergence with respect to some prior distribution $π$. Unfortunately, most of the available bounds typically rely on heavy assumptions such as boundedness and independence of the… ▽ More

    Submitted 23 May, 2019; v1 submitted 23 October, 2016; originally announced October 2016.

    Comments: 18 pages

    Journal ref: Machine Learning (2018), vol. 107 (5), 887--902

  22. arXiv:1605.05933  [pdf, other

    math.ST math-ph quant-ph

    Pseudo-Bayesian Quantum Tomography with Rank-adaptation

    Authors: The Tien Mai, Pierre Alquier

    Abstract: Quantum state tomography, an important task in quantum information processing, aims at reconstructing a state from prepared measurement data. Bayesian methods are recognized to be one of the good and reliable choice in estimating quantum states~\cite{blume2010optimal}. Several numerical works showed that Bayesian estimations are comparable to, and even better than other methods in the problem of… ▽ More

    Submitted 10 October, 2016; v1 submitted 19 May, 2016; originally announced May 2016.

  23. An Oracle Inequality for Quasi-Bayesian Non-Negative Matrix Factorization

    Authors: Pierre Alquier, Benjamin Guedj

    Abstract: The aim of this paper is to provide some theoretical understanding of quasi-Bayesian aggregation methods non-negative matrix factorization. We derive an oracle inequality for an aggregated estimator. This result holds for a very general class of prior distributions and shows how the prior affects the rate of convergence.

    Submitted 26 June, 2018; v1 submitted 6 January, 2016; originally announced January 2016.

    Comments: This is the corrected version of the published paper P. Alquier, B. Guedj, An Oracle Inequality for Quasi-Bayesian Non-negative Matrix Factorization, Mathematical Methods of Statistics, 2017, vol. 26, no. 1, pp. 55-67. Since then Arnak Dalalyan (ENSAE) found a mistake in the proofs. We fixed the mistake at the price of a slightly different logarithmic term in the bound

    Journal ref: Mathematical Methods of Statistics (MMS), 26(1): 55-67, 2017

  24. arXiv:1506.04091  [pdf, other

    stat.ML math.ST

    On the properties of variational approximations of Gibbs posteriors

    Authors: Pierre Alquier, James Ridgway, Nicolas Chopin

    Abstract: The PAC-Bayesian approach is a powerful set of techniques to derive non- asymptotic risk bounds for random estimators. The corresponding optimal distribution of estimators, usually called the Gibbs posterior, is unfortunately intractable. One may sample from it using Markov chain Monte Carlo, but this is often too slow for big datasets. We consider instead variational approximations of the Gibbs p… ▽ More

    Submitted 15 June, 2015; v1 submitted 12 June, 2015; originally announced June 2015.

  25. A Bayesian Approach for Noisy Matrix Completion: Optimal Rate under General Sampling Distribution

    Authors: The Tien Mai, Pierre Alquier

    Abstract: Bayesian methods for low-rank matrix completion with noise have been shown to be very efficient computationally. While the behaviour of penalized minimization methods is well understood both from the theoretical and computational points of view in this problem, the theoretical optimality of Bayesian estimators have not been explored yet. In this paper, we propose a Bayesian estimator for matrix co… ▽ More

    Submitted 21 January, 2015; v1 submitted 25 August, 2014; originally announced August 2014.

    Journal ref: Electronic Journal of Statistics 9, pp. 823-841, 2015

  26. arXiv:1406.1440  [pdf, other

    stat.ML math.ST stat.CO

    Bayesian matrix completion: prior specification

    Authors: Pierre Alquier, Vincent Cottet, Nicolas Chopin, Judith Rousseau

    Abstract: Low-rank matrix estimation from incomplete measurements recently received increased attention due to the emergence of several challenging applications, such as recommender systems; see in particular the famous Netflix challenge. While the behaviour of algorithms based on nuclear norm minimization is now well understood, an as yet unexplored avenue of research is the behaviour of Bayesian algorithm… ▽ More

    Submitted 22 October, 2014; v1 submitted 5 June, 2014; originally announced June 2014.

  27. Adaptive estimation of the density matrix in quantum homodyne tomography with noisy data

    Authors: P Alquier, K Meziani, G Peyré

    Abstract: In the framework of noisy quantum homodyne tomography with efficiency parameter $1/2 < η\leq 1$, we propose a novel estimator of a quantum state whose density matrix elements $ρ_{m,n}$ decrease like $Ce^{-B(m+n)^{r/ 2}}$, for fixed $C\geq 1$, $B>0$ and $0<r\leq 2$. On the contrary to previous works, we focus on the case where $r$, $C$ and $B$ are unknown. The procedure estimates the matrix coeffic… ▽ More

    Submitted 21 March, 2013; v1 submitted 31 January, 2013; originally announced January 2013.

    MSC Class: 62G05; 62G20; 62G86; 62P35; 81V80

    Journal ref: Inverse Problems, vol. 29(7), pp. 075017, 2013

  28. arXiv:1211.1847  [pdf, ps, other

    math.ST

    Prediction of time series by statistical learning: general losses and fast rates

    Authors: Pierre Alquier, Xiaoyin Li, Olivier Wintenberger

    Abstract: We establish rates of convergences in time series forecasting using the statistical learning approach based on oracle inequalities. A series of papers extends the oracle inequalities obtained for iid observations to time series under weak dependence conditions. Given a family of predictors and $n$ observations, oracle inequalities state that a predictor forecasts the series as well as the best pre… ▽ More

    Submitted 8 November, 2012; originally announced November 2012.

  29. arXiv:1208.1211  [pdf, ps, other

    stat.ME math.ST

    PAC-Bayesian Estimation and Prediction in Sparse Additive Models

    Authors: Benjamin Guedj, Pierre Alquier

    Abstract: The present paper is about estimation and prediction in high-dimensional additive models under a sparsity assumption ($p\gg n$ paradigm). A PAC-Bayesian strategy is investigated, delivering oracle inequalities in probability. The implementation is performed through recent outcomes in high-dimensional MCMC algorithms, and the performance of our method is assessed on simulated data.

    Submitted 1 February, 2013; v1 submitted 6 August, 2012; originally announced August 2012.

    Comments: 28 pages

    MSC Class: 62G08; 62J02; 65C40

    Journal ref: Electronic Journal of Statistics, volume 7, 2013, 264--291

  30. Rank penalized estimation of a quantum system

    Authors: Pierre Alquier, Cristina Butucea, Mohamed Hebiri, Katia Meziani, Morimae Tomoyuki

    Abstract: We introduce a new method to reconstruct the density matrix $ρ$ of a system of $n$-qubits and estimate its rank $d$ from data obtained by quantum state tomography measurements repeated $m$ times. The procedure consists in minimizing the risk of a linear estimator $\hatρ$ of $ρ$ penalized by given rank (from 1 to $2^n$), where $\hatρ$ is previously obtained by the moment method. We obtain simultane… ▽ More

    Submitted 26 September, 2013; v1 submitted 8 June, 2012; originally announced June 2012.

  31. arXiv:1202.4294  [pdf, other

    math.ST

    Prediction of quantiles by statistical learning and application to GDP forecasting

    Authors: Pierre Alquier, Xiaoyin Li

    Abstract: In this paper, we tackle the problem of prediction and confidence intervals for time series using a statistical learning approach and quantile loss functions. In a first time, we show that the Gibbs estimator (also known as Exponentially Weighted aggregate) is able to predict as well as the best predictor in a given family for a wide set of loss functions. In particular, using the quantile loss fu… ▽ More

    Submitted 8 August, 2012; v1 submitted 20 February, 2012; originally announced February 2012.

  32. arXiv:1202.4283  [pdf, ps, other

    math.ST

    Fast rates in learning with dependent observations

    Authors: Pierre Alquier, Olivier Wintenberger

    Abstract: In this paper we tackle the problem of fast rates in time series forecasting from a statistical learning perspective. In a serie of papers (e.g. Meir 2000, Modha and Masry 1998, Alquier and Wintenberger 2012) it is shown that the main tools used in learning theory with iid observations can be extended to the prediction of time series. The main message of these papers is that, given a family of pre… ▽ More

    Submitted 20 February, 2012; originally announced February 2012.

  33. Sparsity considerations for dependent observations

    Authors: Pierre Alquier, Paul Doukhan

    Abstract: The aim of this paper is to provide a comprehensive introduction for the study of L1-penalized estimators in the context of dependent observations. We define a general $\ell_{1}$-penalized estimator for solving problems of stochastic optimization. This estimator turns out to be the LASSO in the regression estimation setting. Powerful theoretical guarantees on the statistical performances of the LA… ▽ More

    Submitted 7 August, 2011; v1 submitted 8 February, 2011; originally announced February 2011.

    Journal ref: Electronic Journal of Statistics 5 (2011) pp 750-774

  34. arXiv:1101.3229  [pdf, other

    math.ST

    Sparse single-index model

    Authors: Pierre Alquier, Gérard Biau

    Abstract: Let $(\bX, Y)$ be a random pair taking values in $\mathbb R^p \times \mathbb R$. In the so-called single-index model, one has $Y=f^{\star}(θ^{\star T}\bX)+\bW$, where $f^{\star}$ is an unknown univariate measurable function, $θ^{\star}$ is an unknown vector in $\mathbb R^d$, and $W$ denotes a random noise satisfying $\mathbb E[\bW|\bX]=0$. The single-index model is known to offer a flexible way to… ▽ More

    Submitted 6 October, 2011; v1 submitted 17 January, 2011; originally announced January 2011.

    Journal ref: Journal of Machine Learning Research 14 (2013) 243-280

  35. arXiv:1009.2707  [pdf, ps, other

    math.ST stat.CO

    Pac-bayesian bounds for sparse regression estimation with exponential weights

    Authors: Pierre Alquier, Karim Lounici

    Abstract: We consider the sparse regression model where the number of parameters $p$ is larger than the sample size $n$. The difficulty when considering high-dimensional problems is to propose estimators achieving a good compromise between statistical and computational performances. The BIC estimator for instance performs well from the statistical point of view \cite{BTW07} but can only be computed for valu… ▽ More

    Submitted 14 March, 2011; v1 submitted 14 September, 2010; originally announced September 2010.

    Comments: 19 pages

    MSC Class: Primary: 62J07; Secondary: 62J05; 62G08; 62F15; 62B10; 68T05

    Journal ref: Electronic Journal of Statistics, Vol 5(2011), 127-145

  36. arXiv:1005.0829  [pdf, ps, other

    math.ST

    Transductive versions of the LASSO and the Dantzig Selector

    Authors: Pierre Alquier, Mohamed Hebiri

    Abstract: Transductive methods are useful in prediction problems when the training dataset is composed of a large number of unlabeled observations and a smaller number of labeled observations. In this paper, we propose an approach for developing transductive prediction procedures that are able to take advantage of the sparsity in the high dimensional linear regression. More precisely, we define transductive… ▽ More

    Submitted 5 May, 2010; originally announced May 2010.

  37. arXiv:0906.0652  [pdf, ps, other

    math.ST

    Transductive versions of the LASSO and the Dantzig Selector

    Authors: Pierre Alquier, Mohamed Hebiri

    Abstract: We consider the linear regression problem, where the number $p$ of covariates is possibly larger than the number $n$ of observations $(x_{i},y_{i})_{i\leq i \leq n}$, under sparsity assumptions. On the one hand, several methods have been successfully proposed to perform this task, for example the LASSO or the Dantzig Selector. On the other hand, consider new values $(x_{i})_{n+1\leq i \leq m}$.… ▽ More

    Submitted 6 June, 2009; v1 submitted 3 June, 2009; originally announced June 2009.

    MSC Class: 62J05; 62J07 (Primary); 62F25 (Secondary)

  38. arXiv:0902.2924  [pdf, ps, other

    stat.ME math.ST

    Model selection for weakly dependent time series forecasting

    Authors: Pierre Alquier, Olivier Wintenberger

    Abstract: Observing a stationary time series, we propose a two-step procedure for the prediction of the next value of the time series. The first step follows machine learning theory paradigm and consists in determining a set of possible predictors as randomized estimators in (possibly numerous) different predictive models. The second step follows the model selection paradigm and consists in choosing one pre… ▽ More

    Submitted 3 July, 2012; v1 submitted 17 February, 2009; originally announced February 2009.

  39. arXiv:0811.0072  [pdf, ps, other

    math.ST

    Generalization of l1 constraints for high dimensional regression problems

    Authors: Pierre Alquier, Mohamed Hebiri

    Abstract: We focus on the high dimensional linear regression $Y\sim\mathcal{N}(Xβ^{*},σ^{2}I_{n})$, where $β^{*}\in\mathds{R}^{p}$ is the parameter of interest. In this setting, several estimators such as the LASSO and the Dantzig Selector are known to satisfy interesting properties whenever the vector $β^{*}$ is sparse. Interestingly both of the LASSO and the Dantzig Selector can be seen as orthogonal proj… ▽ More

    Submitted 4 July, 2011; v1 submitted 1 November, 2008; originally announced November 2008.

  40. PAC-Bayesian Bounds for Randomized Empirical Risk Minimizers

    Authors: Pierre Alquier

    Abstract: The aim of this paper is to generalize the PAC-Bayesian theorems proved by Catoni in the classification setting to more general problems of statistical inference. We show how to control the deviations of the risk of randomized estimators. A particular attention is paid to randomized estimators drawn in a small neighborhood of classical estimators, whose study leads to control the risk of the lat… ▽ More

    Submitted 9 January, 2009; v1 submitted 11 December, 2007; originally announced December 2007.

    Journal ref: Mathematical Methods of Statistics 17, 4 (2008) 279-304

  41. LASSO, Iterative Feature Selection and the Correlation Selector: Oracle Inequalities and Numerical Performances

    Authors: Pierre Alquier

    Abstract: We propose a general family of algorithms for regression estimation with quadratic loss. Our algorithms are able to select relevant functions into a large dictionary. We prove that a lot of algorithms that have already been studied for this task (LASSO and Group LASSO, Dantzig selector, Iterative Feature Selection, among others) belong to our family, and exhibit another particular member of this… ▽ More

    Submitted 25 November, 2008; v1 submitted 24 October, 2007; originally announced October 2007.

    MSC Class: 62G08 (Primary); Secondary 62J07 (Secondary); 62G15; 68T05

    Journal ref: Electronic Journal of Statistics 2 (2008) pp. 1129-1152

  42. arXiv:math/0603349  [pdf, ps, other

    math.ST

    Density estimation with quadratic loss: a confidence intervals method

    Authors: Pierre Alquier

    Abstract: In a previous article, a least square regression estimation procedure was proposed: first, we condiser a family of functions and study the properties of an estimator in every unidimensionnal model defined by one of these functions; we then show how to aggregate these estimators. The purpose of this paper is to extend this method to the case of density estimation. We first give a general overview… ▽ More

    Submitted 14 March, 2006; originally announced March 2006.

    MSC Class: 62G07; 62G15; 68T05

  43. Iterative Feature Selection In Least Square Regression Estimation

    Authors: Pierre Alquier

    Abstract: In this paper, we focus on regression estimation in both the inductive and the transductive case. We assume that we are given a set of features (which can be a base of functions, but not necessarily). We begin by giving a deviation inequality on the risk of an estimator in every model defined by using a single feature. These models are too simple to be useful by themselves, but we then show how… ▽ More

    Submitted 10 April, 2008; v1 submitted 11 November, 2005; originally announced November 2005.

    MSC Class: 62G08 (Primary); 62G15; 68T05 (Secondary)

    Journal ref: Annales de l'Institut Henri Poincare (B) Probability and Statistics' 48, 1 (2008) p47-88