Search | arXiv e-print repository

arXiv:2011.00835 [pdf, other]

Adversarial training for predictive tasks: theoretical analysis and limitations in the deterministic case

Authors: Thibault Lesieur, Jérémie Messud, Issa Hammoud, Hanyuan Peng, Céline Lacombe, Paulien Jeunesse

Abstract: To train a deep neural network to mimic the outcomes of processing sequences, a version of Conditional Generalized Adversarial Network (CGAN) can be used. It has been observed by others that CGAN can help to improve the results even for deterministic sequences, where only one output is associated with the processing of a given input. Surprisingly, our CGAN-based tests on deterministic geophysical… ▽ More To train a deep neural network to mimic the outcomes of processing sequences, a version of Conditional Generalized Adversarial Network (CGAN) can be used. It has been observed by others that CGAN can help to improve the results even for deterministic sequences, where only one output is associated with the processing of a given input. Surprisingly, our CGAN-based tests on deterministic geophysical processing sequences did not produce a real improvement compared to the use of an $L_p$ loss; we here propose a first theoretical explanation why. Our analysis goes from the non-deterministic case to the deterministic one. It led us to develop an adversarial way to train a content loss that gave better results on our data. △ Less

Submitted 30 November, 2020; v1 submitted 2 November, 2020; originally announced November 2020.

Comments: NeurIPS 2020, ICBINB Workshop

arXiv:1610.02918 [pdf, other]

doi 10.1109/ALLERTON.2016.7852287

Phase transitions and optimal algorithms in high-dimensional Gaussian mixture clustering

Authors: Thibault Lesieur, Caterina De Bacco, Jess Banks, Florent Krzakala, Cris Moore, Lenka Zdeborová

Abstract: We consider the problem of Gaussian mixture clustering in the high-dimensional limit where the data consists of $m$ points in $n$ dimensions, $n,m \rightarrow \infty$ and $α= m/n$ stays finite. Using exact but non-rigorous methods from statistical physics, we determine the critical value of $α$ and the distance between the clusters at which it becomes information-theoretically possible to reconstr… ▽ More We consider the problem of Gaussian mixture clustering in the high-dimensional limit where the data consists of $m$ points in $n$ dimensions, $n,m \rightarrow \infty$ and $α= m/n$ stays finite. Using exact but non-rigorous methods from statistical physics, we determine the critical value of $α$ and the distance between the clusters at which it becomes information-theoretically possible to reconstruct the membership into clusters better than chance. We also determine the accuracy achievable by the Bayes-optimal estimation algorithm. In particular, we find that when the number of clusters is sufficiently large, $r > 4 + 2 \sqrtα$, there is a gap between the threshold for information-theoretically optimal performance and the threshold at which known algorithms succeed. △ Less

Submitted 10 October, 2016; originally announced October 2016.

Comments: 8 pages, 3 figures, conference

Journal ref: 2016 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton), Pages: 601 - 608

arXiv:1507.03857 [pdf, other]

doi 10.1109/ALLERTON.2015.7447070

MMSE of probabilistic low-rank matrix estimation: Universality with respect to the output channel

Authors: Thibault Lesieur, Florent Krzakala, Lenka Zdeborová

Abstract: This paper considers probabilistic estimation of a low-rank matrix from non-linear element-wise measurements of its elements. We derive the corresponding approximate message passing (AMP) algorithm and its state evolution. Relying on non-rigorous but standard assumptions motivated by statistical physics, we characterize the minimum mean squared error (MMSE) achievable information theoretically and… ▽ More This paper considers probabilistic estimation of a low-rank matrix from non-linear element-wise measurements of its elements. We derive the corresponding approximate message passing (AMP) algorithm and its state evolution. Relying on non-rigorous but standard assumptions motivated by statistical physics, we characterize the minimum mean squared error (MMSE) achievable information theoretically and with the AMP algorithm. Unlike in related problems of linear estimation, in the present setting the MMSE depends on the output channel only trough a single parameter - its Fisher information. We illustrate this striking finding by analysis of submatrix localization, and of detection of communities hidden in a dense stochastic block model. For this example we locate the computational and statistical boundaries that are not equal for rank larger than four. △ Less

Submitted 5 January, 2016; v1 submitted 14 July, 2015; originally announced July 2015.

Comments: 10 pages, Allerton Conference on Communication, Control, and Computing 2015

Journal ref: 2015 53rd Annual Allerton Conference on Communication, Control, and Computing, page 680 - 687, IEEE

arXiv:1503.00338 [pdf, other]

doi 10.1109/ISIT.2015.7282733

Phase Transitions in Sparse PCA

Authors: Thibault Lesieur, Florent Krzakala, Lenka Zdeborova

Abstract: We study optimal estimation for sparse principal component analysis when the number of non-zero elements is small but on the same order as the dimension of the data. We employ approximate message passing (AMP) algorithm and its state evolution to analyze what is the information theoretically minimal mean-squared error and the one achieved by AMP in the limit of large sizes. For a special case of r… ▽ More We study optimal estimation for sparse principal component analysis when the number of non-zero elements is small but on the same order as the dimension of the data. We employ approximate message passing (AMP) algorithm and its state evolution to analyze what is the information theoretically minimal mean-squared error and the one achieved by AMP in the limit of large sizes. For a special case of rank one and large enough density of non-zeros Deshpande and Montanari [1] proved that AMP is asymptotically optimal. We show that both for low density and for large rank the problem undergoes a series of phase transitions suggesting existence of a region of parameters where estimation is information theoretically possible, but AMP (and presumably every other polynomial algorithm) fails. The analysis of the large rank limit is particularly instructive. △ Less

Submitted 1 March, 2015; originally announced March 2015.

Comments: 6 pages, 3 figures

Journal ref: IEEE International Symposium on Information Theory (ISIT), pp.1635-1639 (2015)

Showing 1–4 of 4 results for author: Lesieur, T