Inference in High-Dimensional Linear Regression via Lattice Basis Reduction and Integer Relation Detection

Gamarnik, David; Kızıldağ, Eren C.; Zadik, Ilias

doi:10.1109/TIT.2021.3113921

Mathematics > Statistics Theory

arXiv:1910.10890 (math)

[Submitted on 24 Oct 2019]

Title:Inference in High-Dimensional Linear Regression via Lattice Basis Reduction and Integer Relation Detection

Authors:David Gamarnik, Eren C. Kızıldağ, Ilias Zadik

View PDF

Abstract:We focus on the high-dimensional linear regression problem, where the algorithmic goal is to efficiently infer an unknown feature vector $\beta^*\in\mathbb{R}^p$ from its linear measurements, using a small number $n$ of samples. Unlike most of the literature, we make no sparsity assumption on $\beta^*$, but instead adopt a different regularization: In the noiseless setting, we assume $\beta^*$ consists of entries, which are either rational numbers with a common denominator $Q\in\mathbb{Z}^+$ (referred to as $Q$-rationality); or irrational numbers supported on a rationally independent set of bounded cardinality, known to learner; collectively called as the mixed-support assumption. Using a novel combination of the PSLQ integer relation detection, and LLL lattice basis reduction algorithms, we propose a polynomial-time algorithm which provably recovers a $\beta^*\in\mathbb{R}^p$ enjoying the mixed-support assumption, from its linear measurements $Y=X\beta^*\in\mathbb{R}^n$ for a large class of distributions for the random entries of $X$, even with one measurement $(n=1)$. In the noisy setting, we propose a polynomial-time, lattice-based algorithm, which recovers a $\beta^*\in\mathbb{R}^p$ enjoying $Q$-rationality, from its noisy measurements $Y=X\beta^*+W\in\mathbb{R}^n$, even with a single sample $(n=1)$. We further establish for large $Q$, and normal noise, this algorithm tolerates information-theoretically optimal level of noise. We then apply these ideas to develop a polynomial-time, single-sample algorithm for the phase retrieval problem. Our methods address the single-sample $(n=1)$ regime, where the sparsity-based methods such as LASSO and Basis Pursuit are known to fail. Furthermore, our results also reveal an algorithmic connection between the high-dimensional linear regression problem, and the integer relation detection, randomized subset-sum, and shortest vector problems.

Comments:	56 pages. Parts of the material of this manuscript were presented at NeurIPS 2018, and ISIT 2019. This submission subsumes the content of arXiv:1803.06716
Subjects:	Statistics Theory (math.ST); Probability (math.PR); Machine Learning (stat.ML)
Cite as:	arXiv:1910.10890 [math.ST]
	(or arXiv:1910.10890v1 [math.ST] for this version)
	https://doi.org/10.48550/arXiv.1910.10890
Journal reference:	IEEE Transactions on Information Theory (Volume: 67, Issue: 12, December 2021)
Related DOI:	https://doi.org/10.1109/TIT.2021.3113921

Submission history

From: Eren Can Kızıldağ [view email]
[v1] Thu, 24 Oct 2019 02:41:39 UTC (171 KB)

Mathematics > Statistics Theory

Title:Inference in High-Dimensional Linear Regression via Lattice Basis Reduction and Integer Relation Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Statistics Theory

Title:Inference in High-Dimensional Linear Regression via Lattice Basis Reduction and Integer Relation Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators