Showing 51–100 of 161 results for author: Diakonikolas, I

  1. arXiv:2207.14266  [pdf, other]

    cs.LG cs.CC cs.DS

    Cryptographic Hardness of Learning Halfspaces with Massart Noise

    Authors: Ilias Diakonikolas, Daniel M. Kane, Pasin Manurangsi, Lisheng Ren

    Abstract: We study the complexity of PAC learning halfspaces in the presence of Massart noise. In this problem, we are given i.i.d. labeled examples $(\mathbf{x}, y) \in \mathbb{R}^N \times \{ \pm 1\}$, where the distribution of $\mathbf{x}$ is arbitrary and the label $y$ is a Massart corruption of $f(\mathbf{x})$, for an unknown halfspace $f: \mathbb{R}^N \to \{ \pm 1\}$, with flipping probability…

    Submitted 28 July, 2022; originally announced July 2022.

  2. arXiv:2207.06596  [pdf, other]

    cs.DS cs.LG math.ST

    Near-Optimal Bounds for Testing Histogram Distributions

    Authors: Clément L. Canonne, Ilias Diakonikolas, Daniel M. Kane, Sihan Liu

    Abstract: We investigate the problem of testing whether a discrete probability distribution over an ordered domain is a histogram on a specified number of bins. One of the most common tools for the succinct approximation of data, $k$-histograms over $[n]$, are probability distributions that are piecewise constant over a set of $k$ intervals. The histogram testing problem is the following: Given samples from…

    Submitted 13 July, 2022; originally announced July 2022.

  3. arXiv:2206.08918  [pdf, other]

    cs.LG cs.DS math.ST stat.ML

    Learning a Single Neuron with Adversarial Label Noise via Gradient Descent

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the fundamental problem of learning a single neuron, i.e., a function of the form $\mathbf{x}\mapsto σ(\mathbf{w}\cdot\mathbf{x})$ for monotone activations $σ:\mathbb{R}\mapsto\mathbb{R}$, with respect to the $L_2^2$-loss in the presence of adversarial label noise. Specifically, we are given labeled examples from a distribution $D$ on $(\mathbf{x}, y)\in\mathbb{R}^d \times \mathbb{R}$ such…

    Submitted 17 June, 2022; originally announced June 2022.

  4. arXiv:2206.05245  [pdf, other]

    cs.DS cs.LG math.ST stat.ML

    List-Decodable Sparse Mean Estimation via Difference-of-Pairs Filtering

    Authors: Ilias Diakonikolas, Daniel M. Kane, Sushrut Karmalkar, Ankit Pensia, Thanasis Pittas

    Abstract: We study the problem of list-decodable sparse mean estimation. Specifically, for a parameter $α\in (0, 1/2)$, we are given $m$ points in $\mathbb{R}^n$, $\lfloor αm \rfloor$ of which are i.i.d. samples from a distribution $D$ with unknown $k$-sparse mean $μ$. No assumptions are made on the remaining points, which form the majority of the dataset. The goal is to return a small list of candidates co…

    Submitted 5 July, 2024; v1 submitted 10 June, 2022; originally announced June 2022.

    Comments: Added fact about taking roots in SoS proofs (Fact 2.9)

  5. arXiv:2206.04589  [pdf, ps, other]

    cs.DS cs.LG math.ST stat.ML

    Optimal SQ Lower Bounds for Robustly Learning Discrete Product Distributions and Ising Models

    Authors: Ilias Diakonikolas, Daniel M. Kane, Yuxin Sun

    Abstract: We establish optimal Statistical Query (SQ) lower bounds for robustly learning certain families of discrete high-dimensional distributions. In particular, we show that no efficient SQ algorithm with access to an $ε$-corrupted binary product distribution can learn its mean within $\ell_2$-error $o(ε\sqrt{\log(1/ε)})$. Similarly, we show that no efficient SQ algorithm with access to an $ε$-corrupted…

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: To appear in COLT 2022

  6. arXiv:2206.03441  [pdf, other]

    cs.DS cs.LG math.ST stat.ML

    Robust Sparse Mean Estimation via Sum of Squares

    Authors: Ilias Diakonikolas, Daniel M. Kane, Sushrut Karmalkar, Ankit Pensia, Thanasis Pittas

    Abstract: We study the problem of high-dimensional sparse mean estimation in the presence of an $ε$-fraction of adversarial outliers. Prior work obtained sample and computationally efficient algorithms for this task for identity-covariance subgaussian distributions. In this work, we develop the first efficient algorithms for robust sparse mean estimation without a priori knowledge of the covariance. For dis…

    Submitted 5 July, 2024; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: Fixed minor oversight in runtime calculation

  7. arXiv:2204.12399  [pdf, other]

    cs.DS cs.LG math.ST stat.ML

    Streaming Algorithms for High-Dimensional Robust Statistics

    Authors: Ilias Diakonikolas, Daniel M. Kane, Ankit Pensia, Thanasis Pittas

    Abstract: We study high-dimensional robust statistics tasks in the streaming model. A recent line of work obtained computationally efficient algorithms for a range of high-dimensional robust estimation tasks. Unfortunately, all previous algorithms require storing the entire dataset, incurring memory at least quadratic in the dimension. In this work, we develop the first efficient streaming algorithms for hi…

    Submitted 3 May, 2023; v1 submitted 26 April, 2022; originally announced April 2022.

  8. arXiv:2112.09104  [pdf, ps, other]

    cs.DS cs.LG math.ST stat.ML

    Non-Gaussian Component Analysis via Lattice Basis Reduction

    Authors: Ilias Diakonikolas, Daniel M. Kane

    Abstract: Non-Gaussian Component Analysis (NGCA) is the following distribution learning problem: Given i.i.d. samples from a distribution on $\mathbb{R}^d$ that is non-gaussian in a hidden direction $v$ and an independent standard Gaussian in the orthogonal directions, the goal is to approximate the hidden direction $v$. Prior work \cite{DKS17-sq} provided formal evidence for the existence of an information…

    Submitted 16 December, 2021; originally announced December 2021.

  9. arXiv:2109.11515  [pdf, other]

    cs.LG cs.DS math.OC math.ST stat.ML

    Outlier-Robust Sparse Estimation via Non-Convex Optimization

    Authors: Yu Cheng, Ilias Diakonikolas, Rong Ge, Shivam Gupta, Daniel M. Kane, Mahdi Soltanolkotabi

    Abstract: We explore the connection between outlier-robust high-dimensional statistics and non-convex optimization in the presence of sparsity constraints, with a focus on the fundamental tasks of robust sparse mean estimation and robust sparse PCA. We develop novel and simple optimization formulations for these problems such that any approximate stationary point of the associated optimization problem yield…

    Submitted 13 November, 2022; v1 submitted 23 September, 2021; originally announced September 2021.

    Comments: Accepted to Conference on Neural Information Processing Systems (NeurIPS) 2022. (Updated to the NeurIPS'22 version in v2.)

  10. arXiv:2109.04623  [pdf, other]

    cs.LG cs.DS stat.ML

    ReLU Regression with Massart Noise

    Authors: Ilias Diakonikolas, Jongho Park, Christos Tzamos

    Abstract: We study the fundamental problem of ReLU regression, where the goal is to fit Rectified Linear Units (ReLUs) to data. This supervised learning task is efficiently solvable in the realizable setting, but is known to be computationally hard with adversarial label noise. In this work, we focus on ReLU regression in the Massart noise model, a natural and well-studied semi-random noise model. In this m…

    Submitted 25 January, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

  11. arXiv:2108.08767  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Learning General Halfspaces with General Massart Noise under the Gaussian Distribution

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of PAC learning halfspaces on $\mathbb{R}^d$ with Massart noise under the Gaussian distribution. In the Massart model, an adversary is allowed to flip the label of each point $\mathbf{x}$ with unknown probability $η(\mathbf{x}) \leq η$, for some parameter $η\in [0,1/2]$. The goal is to find a hypothesis with misclassification error of $\mathrm{OPT} + ε$, where $\mathrm{OPT}$ i…

    Submitted 8 November, 2021; v1 submitted 19 August, 2021; originally announced August 2021.

    Comments: Revised presentation

  12. arXiv:2107.05582  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Forster Decomposition and Learning Halfspaces with Noise

    Authors: Ilias Diakonikolas, Daniel M. Kane, Christos Tzamos

    Abstract: A Forster transform is an operation that turns a distribution into one with good anti-concentration properties. While a Forster transform does not always exist, we show that any distribution can be efficiently decomposed as a disjoint mixture of few distributions for which a Forster transform exists and can be computed efficiently. As the main application of this result, we obtain the first polyno…

    Submitted 12 July, 2021; originally announced July 2021.

  13. arXiv:2106.09689  [pdf, ps, other]

    cs.DS cs.LG math.ST stat.ML

    Statistical Query Lower Bounds for List-Decodable Linear Regression

    Authors: Ilias Diakonikolas, Daniel M. Kane, Ankit Pensia, Thanasis Pittas, Alistair Stewart

    Abstract: We study the problem of list-decodable linear regression, where an adversary can corrupt a majority of the examples. Specifically, we are given a set $T$ of labeled examples $(x, y) \in \mathbb{R}^d \times \mathbb{R}$ and a parameter $0< α<1/2$ such that an $α$-fraction of the points in $T$ are i.i.d. samples from a linear regression model with Gaussian covariates, and the remaining $(1-α)$-fracti…

    Submitted 17 June, 2021; originally announced June 2021.

  14. arXiv:2106.08537  [pdf, ps, other]

    cs.DS cs.LG stat.ML

    Clustering Mixture Models in Almost-Linear Time via List-Decodable Mean Estimation

    Authors: Ilias Diakonikolas, Daniel M. Kane, Daniel Kongsgaard, Jerry Li, Kevin Tian

    Abstract: We study the problem of list-decodable mean estimation, where an adversary can corrupt a majority of the dataset. Specifically, we are given a set $T$ of $n$ points in $\mathbb{R}^d$ and a parameter $0< α<\frac 1 2$ such that an $α$-fraction of the points in $T$ are i.i.d. samples from a well-behaved distribution $\mathcal{D}$ and the remaining $(1-α)$-fraction are arbitrary. The goal is to output…

    Submitted 12 November, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: 64 pages, 1 figure. v2 improves results on bounded-covariance clustering, polishes exposition

  15. arXiv:2106.07779  [pdf, ps, other]

    cs.LG stat.ML

    Boosting in the Presence of Massart Noise

    Authors: Ilias Diakonikolas, Russell Impagliazzo, Daniel Kane, Rex Lei, Jessica Sorrell, Christos Tzamos

    Abstract: We study the problem of boosting the accuracy of a weak learner in the (distribution-independent) PAC model with Massart noise. In the Massart noise model, the label of each example $x$ is independently misclassified with probability $η(x) \leq η$, where $η<1/2$. The Massart model lies between the random classification noise model and the agnostic model. Our main positive result is the first compu…

    Submitted 14 June, 2021; originally announced June 2021.

  16. arXiv:2102.05629  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Agnostic Proper Learning of Halfspaces under Gaussian Marginals

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of agnostically learning halfspaces under the Gaussian distribution. Our main result is the {\em first proper} learning algorithm for this problem whose sample complexity and computational complexity qualitatively match those of the best known improper agnostic learner. Building on this result, we also obtain the first proper polynomial-time approximation scheme (PTAS) for agn…

    Submitted 10 February, 2021; originally announced February 2021.

  17. arXiv:2102.04401  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    The Optimality of Polynomial Regression for Agnostic Learning under Gaussian Marginals

    Authors: Ilias Diakonikolas, Daniel M. Kane, Thanasis Pittas, Nikos Zarifis

    Abstract: We study the problem of agnostic learning under the Gaussian distribution. We develop a method for finding hard families of examples for a wide class of problems by using LP duality. For Boolean-valued concept classes, we show that the $L^1$-regression algorithm is essentially best possible, and therefore that the computational difficulty of agnostically learning a concept class is closely related…

    Submitted 8 February, 2021; originally announced February 2021.

  18. arXiv:2102.02171  [pdf, ps, other]

    cs.LG cs.DS math.PR math.ST stat.ML

    Outlier-Robust Learning of Ising Models Under Dobrushin's Condition

    Authors: Ilias Diakonikolas, Daniel M. Kane, Alistair Stewart, Yuxin Sun

    Abstract: We study the problem of learning Ising models satisfying Dobrushin's condition in the outlier-robust setting where a constant fraction of the samples are adversarially corrupted. Our main result is to provide the first computationally efficient robust learning algorithm for this problem with near-optimal error guarantees. Our algorithm can be seen as a special case of an algorithm for robustly lea…

    Submitted 3 February, 2021; originally announced February 2021.

  19. arXiv:2012.15802  [pdf, ps, other]

    cs.LG math.ST stat.ML

    The Sample Complexity of Robust Covariance Testing

    Authors: Ilias Diakonikolas, Daniel M. Kane

    Abstract: We study the problem of testing the covariance matrix of a high-dimensional Gaussian in a robust setting, where the input distribution has been corrupted in Huber's contamination model. Specifically, we are given i.i.d. samples from a distribution of the form $Z = (1-ε) X + εB$, where $X$ is a zero-mean and unknown covariance Gaussian $\mathcal{N}(0, Σ)$, $B$ is a fixed but unknown noise distribut…

    Submitted 31 December, 2020; originally announced December 2020.

  20. arXiv:2012.09720  [pdf, ps, other]

    cs.LG cs.CC math.ST stat.ML

    Near-Optimal Statistical Query Hardness of Learning Halfspaces with Massart Noise

    Authors: Ilias Diakonikolas, Daniel M. Kane

    Abstract: We study the problem of PAC learning halfspaces with Massart noise. Given labeled samples $(x, y)$ from a distribution $D$ on $\mathbb{R}^{d} \times \{ \pm 1\}$ such that the marginal $D_x$ on the examples is arbitrary and the label $y$ of example $x$ is generated from the target halfspace corrupted by a Massart adversary with flipping probability $η(x) \leq η\leq 1/2$, the goal is to compute a hy…

    Submitted 8 November, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: This version improves on the previous version. It obtains a near-optimal hardness result essentially matching known algorithms

  21. arXiv:2012.07774  [pdf, ps, other]

    cs.LG cs.CC cs.DS math.AG math.ST

    Small Covers for Near-Zero Sets of Polynomials and Learning Latent Variable Models

    Authors: Ilias Diakonikolas, Daniel M. Kane

    Abstract: Let $V$ be any vector space of multivariate degree-$d$ homogeneous polynomials with co-dimension at most $k$, and $S$ be the set of points where all polynomials in $V$ {\em nearly} vanish. We establish a qualitatively optimal upper bound on the size of $ε$-covers for $S$, in the $\ell_2$-norm. Roughly speaking, we show that there exists an $ε$-cover for $S$ of cardinality…

    Submitted 14 December, 2020; originally announced December 2020.

    Comments: Full version of FOCS'20 paper

  22. arXiv:2012.02119  [pdf, other]

    cs.DS cs.LG math.ST stat.ML

    Robustly Learning Mixtures of $k$ Arbitrary Gaussians

    Authors: Ainesh Bakshi, Ilias Diakonikolas, He Jia, Daniel M. Kane, Pravesh K. Kothari, Santosh S. Vempala

    Abstract: We give a polynomial-time algorithm for the problem of robustly estimating a mixture of $k$ arbitrary Gaussians in $\mathbb{R}^d$, for any fixed $k$, in the presence of a constant fraction of arbitrary corruptions. This resolves the main open problem in several previous works on algorithmic robust statistics, which addressed the special cases of robustly estimating (a) a single Gaussian, (b) a mix…

    Submitted 7 June, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: This version extends the previous one to yield 1) robust proper learning algorithm with poly(eps) error and 2) an information theoretic argument proving that the same algorithms in fact also yield parameter recovery guarantees. The updates are included in Sections 7,8, and 9 and the main result from the previous version (Thm 1.4) is presented and proved in Section 6

  23. arXiv:2011.09973  [pdf, ps, other]

    cs.DS cs.LG math.OC stat.ML

    List-Decodable Mean Estimation in Nearly-PCA Time

    Authors: Ilias Diakonikolas, Daniel M. Kane, Daniel Kongsgaard, Jerry Li, Kevin Tian

    Abstract: Traditionally, robust statistics has focused on designing estimators tolerant to a minority of contaminated data. Robust list-decodable learning focuses on the more challenging regime where only a minority $\frac 1 k$ fraction of the dataset is drawn from the distribution of interest, and no assumptions are made on the remaining data. We study the fundamental task of list-decodable mean estimation…

    Submitted 19 November, 2020; originally announced November 2020.

    Comments: 57 pages

  24. arXiv:2010.01705  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    A Polynomial Time Algorithm for Learning Halfspaces with Tsybakov Noise

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of PAC learning homogeneous halfspaces in the presence of Tsybakov noise. In the Tsybakov noise model, the label of every sample is independently flipped with an adversarially controlled probability that can be arbitrarily close to $1/2$ for a fraction of the samples. {\em We give the first polynomial-time algorithm for this fundamental learning problem.} Our algorithm learns…

    Submitted 4 October, 2020; originally announced October 2020.

  25. arXiv:2009.06540  [pdf, ps, other]

    cs.DS cs.LG math.ST stat.ML

    Optimal Testing of Discrete Distributions with High Probability

    Authors: Ilias Diakonikolas, Themis Gouleakis, Daniel M. Kane, John Peebles, Eric Price

    Abstract: We study the problem of testing discrete distributions with a focus on the high probability regime. Specifically, given samples from one or more discrete distributions, a property $\mathcal{P}$, and parameters $0< ε, δ<1$, we want to distinguish {\em with probability at least $1-δ$} whether these distributions satisfy $\mathcal{P}$ or are $ε$-far from $\mathcal{P}$ in total variation distance. Mos…

    Submitted 14 September, 2020; originally announced September 2020.

  26. arXiv:2008.03891  [pdf, other]

    cs.DB

    Rapid Approximate Aggregation with Distribution-Sensitive Interval Guarantees

    Authors: Stephen Macke, Maryam Aliakbarpour, Ilias Diakonikolas, Aditya Parameswaran, Ronitt Rubinfeld

    Abstract: Aggregating data is fundamental to data analytics, data exploration, and OLAP. Approximate query processing (AQP) techniques are often used to accelerate computation of aggregates using samples, for which confidence intervals (CIs) are widely used to quantify the associated error. CIs used in practice fall into two categories: techniques that are tight but not correct, i.e., they yield tight inter…

    Submitted 10 August, 2020; originally announced August 2020.

  27. arXiv:2007.15618  [pdf, ps, other]

    math.ST cs.DS cs.LG stat.ML

    Outlier Robust Mean Estimation with Subgaussian Rates via Stability

    Authors: Ilias Diakonikolas, Daniel M. Kane, Ankit Pensia

    Abstract: We study the problem of outlier robust high-dimensional mean estimation under a finite covariance assumption, and more broadly under finite low-degree moment assumptions. We consider a standard stability condition from the recent robust statistics literature and prove that, except with exponentially small failure probability, there exists a large fraction of the inliers satisfying this condition.…

    Submitted 16 March, 2021; v1 submitted 30 July, 2020; originally announced July 2020.

  28. arXiv:2007.15220  [pdf, ps, other]

    cs.LG cs.CC cs.DS stat.ML

    The Complexity of Adversarially Robust Proper Learning of Halfspaces with Agnostic Noise

    Authors: Ilias Diakonikolas, Daniel M. Kane, Pasin Manurangsi

    Abstract: We study the computational complexity of adversarially robust proper learning of halfspaces in the distribution-independent agnostic PAC model, with a focus on $L_p$ perturbations. We give a computationally efficient learning algorithm and a nearly matching computational hardness result for this problem. An interesting implication of our findings is that the $L_{\infty}$ perturbations case is prov…

    Submitted 30 July, 2020; originally announced July 2020.

  29. arXiv:2006.16200  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Near-Optimal SQ Lower Bounds for Agnostically Learning Halfspaces and ReLUs under Gaussian Marginals

    Authors: Ilias Diakonikolas, Daniel M. Kane, Nikos Zarifis

    Abstract: We study the fundamental problems of agnostically learning halfspaces and ReLUs under Gaussian marginals. In the former problem, given labeled examples $(\mathbf{x}, y)$ from an unknown distribution on $\mathbb{R}^d \times \{ \pm 1\}$, whose marginal distribution on $\mathbf{x}$ is the standard Gaussian and the labels $y$ can be arbitrary, the goal is to output a hypothesis with 0-1 loss…

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: 19 pages

  30. arXiv:2006.12476  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Algorithms and SQ Lower Bounds for PAC Learning One-Hidden-Layer ReLU Networks

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Nikos Zarifis

    Abstract: We study the problem of PAC learning one-hidden-layer ReLU networks with $k$ hidden units on $\mathbb{R}^d$ under Gaussian marginals in the presence of additive label noise. For the case of positive coefficients, we give the first polynomial-time algorithm for this learning problem for $k$ up to $\tilde{O}(\sqrt{\log d})$. Previously, no polynomial time algorithm was known, even for $k=3$. This an…

    Submitted 22 June, 2020; originally announced June 2020.

  31. arXiv:2006.10715  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    List-Decodable Mean Estimation via Iterative Multi-Filtering

    Authors: Ilias Diakonikolas, Daniel M. Kane, Daniel Kongsgaard

    Abstract: We study the problem of {\em list-decodable mean estimation} for bounded covariance distributions. Specifically, we are given a set $T$ of points in $\mathbb{R}^d$ with the promise that an unknown $α$-fraction of points in $T$, where $0< α< 1/2$, are drawn from an unknown mean and bounded covariance distribution $D$, and no assumptions are made on the remaining points. The goal is to output a smal…

    Submitted 20 June, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: Fixed typo in title

  32. arXiv:2006.06742  [pdf, ps, other]

    cs.LG stat.ML

    Non-Convex SGD Learns Halfspaces with Adversarial Label Noise

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of agnostically learning homogeneous halfspaces in the distribution-specific PAC model. For a broad family of structured distributions, including log-concave distributions, we show that non-convex SGD efficiently converges to a solution with misclassification error $O(\mathrm{OPT})+ε$, where $\mathrm{OPT}$ is the misclassification error of the best-fitting halfspace. In sharp contrast, we…

    Submitted 11 June, 2020; originally announced June 2020.

  33. arXiv:2006.06467  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Learning Halfspaces with Tsybakov Noise

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the efficient PAC learnability of halfspaces in the presence of Tsybakov noise. In the Tsybakov noise model, each label is independently flipped with some probability which is controlled by an adversary. This noise model significantly generalizes the Massart noise model, by allowing the flipping probabilities to be arbitrarily close to $1/2$ for a fraction of the samples. Our main result…

    Submitted 11 June, 2020; originally announced June 2020.

  34. arXiv:2005.12844  [pdf, other]

    cs.LG cs.DS stat.ML

    Approximation Schemes for ReLU Regression

    Authors: Ilias Diakonikolas, Surbhi Goel, Sushrut Karmalkar, Adam R. Klivans, Mahdi Soltanolkotabi

    Abstract: We consider the fundamental problem of ReLU regression, where the goal is to output the best fitting ReLU with respect to square loss given access to draws from some unknown distribution. We give the first efficient, constant-factor approximation algorithm for this problem assuming the underlying distribution satisfies some weak concentration and anti-concentration conditions (and includes, for ex…

    Submitted 28 September, 2020; v1 submitted 26 May, 2020; originally announced May 2020.

  35. arXiv:2005.07652  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Efficiently Learning Adversarially Robust Halfspaces with Noise

    Authors: Omar Montasser, Surbhi Goel, Ilias Diakonikolas, Nathan Srebro

    Abstract: We study the problem of learning adversarially robust halfspaces in the distribution-independent setting. In the realizable setting, we provide necessary and sufficient conditions on the adversarial perturbation sets under which halfspaces are efficiently robustly learnable. In the presence of random label noise, we give a simple computationally efficient algorithm for this problem with respect to…

    Submitted 15 May, 2020; originally announced May 2020.

  36. arXiv:2005.06417  [pdf, other]

    cs.DS cs.LG math.ST stat.ML

    Robustly Learning any Clusterable Mixture of Gaussians

    Authors: Ilias Diakonikolas, Samuel B. Hopkins, Daniel Kane, Sushrut Karmalkar

    Abstract: We study the efficient learnability of high-dimensional Gaussian mixtures in the outlier-robust setting, where a small constant fraction of the data is adversarially corrupted. We resolve the polynomial learnability of this problem when the components are pairwise separated in total variation distance. Specifically, we provide an algorithm that, for any constant number of components $k$, runs in p…

    Submitted 13 May, 2020; originally announced May 2020.

  37. arXiv:2005.01378  [pdf, ps, other]

    cs.LG cs.DS math.OC math.ST stat.ML

    High-Dimensional Robust Mean Estimation via Gradient Descent

    Authors: Yu Cheng, Ilias Diakonikolas, Rong Ge, Mahdi Soltanolkotabi

    Abstract: We study the problem of high-dimensional robust mean estimation in the presence of a constant fraction of adversarial outliers. A recent line of work has provided sophisticated polynomial-time algorithms for this problem with dimension-independent error guarantees for a range of natural distribution families. In this work, we show that a natural non-convex formulation of the problem can be solve…

    Submitted 4 May, 2020; originally announced May 2020.

    Comments: Under submission to ICML'20

  38. arXiv:2003.11086  [pdf, other]

    cs.DS cs.LG math.ST stat.ML

    Efficient Algorithms for Multidimensional Segmented Regression

    Authors: Ilias Diakonikolas, Jerry Li, Anastasia Voloshinov

    Abstract: We study the fundamental problem of fixed design {\em multidimensional segmented regression}: Given noisy samples from a function $f$, promised to be piecewise linear on an unknown set of $k$ rectangles, we want to recover $f$ up to a desired accuracy in mean-squared error. We provide the first sample and computationally efficient algorithm for this problem in any fixed dimension. Our algorithm re…

    Submitted 24 March, 2020; originally announced March 2020.

  39. arXiv:2002.05632  [pdf, other]

    cs.LG cs.DS math.ST stat.ML

    Learning Halfspaces with Massart Noise Under Structured Distributions

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of learning halfspaces with Massart noise in the distribution-specific PAC model. We give the first computationally efficient algorithm for this problem with respect to a broad family of distributions, including log-concave distributions. This resolves an open question posed in a number of prior works. Our approach is extremely simple: We identify a smooth {\em non-convex} sur…

    Submitted 13 February, 2020; originally announced February 2020.

  40. arXiv:1911.08085  [pdf, other]

    cs.DS cs.LG stat.ML

    Outlier-Robust High-Dimensional Sparse Estimation via Iterative Filtering

    Authors: Ilias Diakonikolas, Sushrut Karmalkar, Daniel Kane, Eric Price, Alistair Stewart

    Abstract: We study high-dimensional sparse estimation tasks in a robust setting where a constant fraction of the dataset is adversarially corrupted. Specifically, we focus on the fundamental problems of robust sparse mean estimation and robust sparse PCA. We give the first practically viable robust estimators for these problems. In more detail, our algorithms are sample and computationally efficient and ach…

    Submitted 18 November, 2019; originally announced November 2019.

  41. arXiv:1911.05911  [pdf, ps, other]

    cs.DS cs.CC math.ST stat.ML

    Recent Advances in Algorithmic High-Dimensional Robust Statistics

    Authors: Ilias Diakonikolas, Daniel M. Kane

    Abstract: Learning in the presence of outliers is a fundamental problem in statistics. Until recently, all known efficient unsupervised learning algorithms were very sensitive to outliers in high dimensions. In particular, even for the task of robust mean estimation under natural distributional assumptions, no efficient algorithm was known. Recent work in theoretical computer science gave the first efficien…

    Submitted 13 November, 2019; originally announced November 2019.

  42. arXiv:1908.11335  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Nearly Tight Bounds for Robust Proper Learning of Halfspaces with a Margin

    Authors: Ilias Diakonikolas, Daniel M. Kane, Pasin Manurangsi

    Abstract: We study the problem of {\em properly} learning large margin halfspaces in the agnostic PAC model. In more detail, we study the complexity of properly learning $d$-dimensional halfspaces on the unit ball within misclassification error $α\cdot \mathrm{OPT}_γ + ε$, where $\mathrm{OPT}_γ$ is the optimal $γ$-margin error rate and $α\geq 1$ is the approximation ratio. We give learning algorithms and co…

    Submitted 29 August, 2019; originally announced August 2019.

  43. arXiv:1907.08306  [pdf, other]

    cs.DS stat.CO

    A Polynomial Time Algorithm for Log-Concave Maximum Likelihood via Locally Exponential Families

    Authors: Brian Axelrod, Ilias Diakonikolas, Anastasios Sidiropoulos, Alistair Stewart, Gregory Valiant

    Abstract: We consider the problem of computing the maximum likelihood multivariate log-concave distribution for a set of points. Specifically, we present an algorithm which, given $n$ points in $\mathbb{R}^d$ and an accuracy parameter $ε>0$, runs in time $\mathrm{poly}(n,d,1/ε)$, and returns a log-concave distribution which, with high probability, has the property that the likelihood of the $n$ points under the retu…

    Submitted 18 July, 2019; originally announced July 2019.

    Comments: The present paper is a merger of two independent works arXiv:1811.03204 and arXiv:1812.05524, proposing essentially the same algorithm to compute the log-concave MLE

  44. arXiv:1906.10075  [pdf, other]

    cs.LG cs.DS math.ST stat.ML

    Distribution-Independent PAC Learning of Halfspaces with Massart Noise

    Authors: Ilias Diakonikolas, Themis Gouleakis, Christos Tzamos

    Abstract: We study the problem of {\em distribution-independent} PAC learning of halfspaces in the presence of Massart noise. Specifically, we are given a set of labeled examples $(\mathbf{x}, y)$ drawn from a distribution $\mathcal{D}$ on $\mathbb{R}^{d+1}$ such that the marginal distribution on the unlabeled points $\mathbf{x}$ is arbitrary and the labels $y$ are generated by an unknown halfspace corrupte…

    Submitted 10 December, 2019; v1 submitted 24 June, 2019; originally announced June 2019.

  45. arXiv:1906.04709  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Communication and Memory Efficient Testing of Discrete Distributions

    Authors: Ilias Diakonikolas, Themis Gouleakis, Daniel M. Kane, Sankeerth Rao

    Abstract: We study distribution testing with communication and memory constraints in the following computational models: (1) The {\em one-pass streaming model} where the goal is to minimize the sample complexity of the protocol subject to a memory constraint, and (2) A {\em distributed model} where the data samples reside at multiple machines and the goal is to minimize the communication cost of the protoco…

    Submitted 11 June, 2019; originally announced June 2019.

    Comments: Full version of COLT 2019 paper

  46. arXiv:1906.04661  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Faster Algorithms for High-Dimensional Robust Covariance Estimation

    Authors: Yu Cheng, Ilias Diakonikolas, Rong Ge, David Woodruff

    Abstract: We study the problem of estimating the covariance matrix of a high-dimensional distribution when a small constant fraction of the samples can be arbitrarily corrupted. Recent work gave the first polynomial time algorithms for this problem with near-optimal error guarantees for several natural structured distributions. Our main contribution is to develop faster algorithms for this problem whose run…

    Submitted 11 June, 2019; originally announced June 2019.

  47. arXiv:1905.12950  [pdf, ps, other]

    cs.LG stat.ML

    Equipping Experts/Bandits with Long-term Memory

    Authors: Kai Zheng, Haipeng Luo, Ilias Diakonikolas, Liwei Wang

    Abstract: We propose the first reduction-based approach to obtaining long-term memory guarantees for online learning in the sense of Bousquet and Warmuth, 2002, by reducing the problem to achieving typical switching regret. Specifically, for the classical expert problem with $K$ actions and $T$ rounds, using our framework we develop various algorithms with a regret bound of order…

    Submitted 27 October, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: 25 pages, accepted by NeurIPS 2019

  48. arXiv:1812.11712  [pdf, ps, other]

    cs.GT cs.CC cs.LG

    On the Complexity of the Inverse Semivalue Problem for Weighted Voting Games

    Authors: Ilias Diakonikolas, Chrystalla Pavlou

    Abstract: Weighted voting games are a family of cooperative games, typically used to model voting situations where a number of agents (players) vote against or for a proposal. In such games, a proposal is accepted if an appropriately weighted sum of the votes exceeds a prespecified threshold. As the influence of a player over the voting outcome is not in general proportional to her assigned weight, various…

    Submitted 31 December, 2018; originally announced December 2018.

    Comments: To appear in AAAI 2019

  49. arXiv:1812.05524  [pdf, ps, other]

    cs.DS

    A Polynomial Time Algorithm for Maximum Likelihood Estimation of Multivariate Log-concave Densities

    Authors: Ilias Diakonikolas, Anastasios Sidiropoulos, Alistair Stewart

    Abstract: We study the problem of computing the maximum likelihood estimator (MLE) of multivariate log-concave densities. Our main result is the first computationally efficient algorithm for this problem. In more detail, we give an algorithm that, on input a set of $n$ points in $\mathbb{R}^d$ and an accuracy parameter $ε>0$, it runs in time $\text{poly}(n, d, 1/ε)$, and outputs a log-concave density that w…

    Submitted 13 December, 2018; originally announced December 2018.

  50. arXiv:1811.09380  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    High-Dimensional Robust Mean Estimation in Nearly-Linear Time

    Authors: Yu Cheng, Ilias Diakonikolas, Rong Ge

    Abstract: We study the fundamental problem of high-dimensional mean estimation in a robust model where a constant fraction of the samples are adversarially corrupted. Recent work gave the first polynomial time algorithms for this problem with dimension-independent error guarantees for several families of structured distributions. In this work, we give the first nearly-linear time algorithms for high-dimen…

    Submitted 23 November, 2018; originally announced November 2018.