Skip to main content

Showing 1–50 of 176 results for author: Kane, D

.
  1. arXiv:2505.21475  [pdf, ps, other

    cs.LG cs.DS

    Algorithms and SQ Lower Bounds for Robustly Learning Real-valued Multi-index Models

    Authors: Ilias Diakonikolas, Giannis Iakovidis, Daniel M. Kane, Lisheng Ren

    Abstract: We study the complexity of learning real-valued Multi-Index Models (MIMs) under the Gaussian distribution. A $K$-MIM is a function $f:\mathbb{R}^d\to \mathbb{R}$ that depends only on the projection of its input onto a $K$-dimensional subspace. We give a general algorithm for PAC learning a broad class of MIMs with respect to the square loss, even in the presence of adversarial label noise. Moreove… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  2. arXiv:2504.15251  [pdf, other

    cs.LG cs.DS math.ST stat.ML

    On Learning Parallel Pancakes with Mostly Uniform Weights

    Authors: Ilias Diakonikolas, Daniel M. Kane, Sushrut Karmalkar, Jasper C. H. Lee, Thanasis Pittas

    Abstract: We study the complexity of learning $k$-mixtures of Gaussians ($k$-GMMs) on $\mathbb{R}^d$. This task is known to have complexity $d^{Ω(k)}$ in full generality. To circumvent this exponential lower bound on the number of components, research has focused on learning families of GMMs satisfying additional structural properties. A natural assumption posits that the component weights are not exponenti… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

  3. arXiv:2504.15244  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Faster Algorithms for Agnostically Learning Disjunctions and their Implications

    Authors: Ilias Diakonikolas, Daniel M. Kane, Lisheng Ren

    Abstract: We study the algorithmic task of learning Boolean disjunctions in the distribution-free agnostic PAC model. The best known agnostic learner for the class of disjunctions over $\{0, 1\}^n$ is the $L_1$-polynomial regression algorithm, achieving complexity $2^{\tilde{O}(n^{1/2})}$. This complexity bound is known to be nearly best possible within the class of Correlational Statistical Query (CSQ) alg… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

  4. arXiv:2503.09802  [pdf, other

    cs.LG cs.DS math.ST stat.ML

    Batch List-Decodable Linear Regression via Higher Moments

    Authors: Ilias Diakonikolas, Daniel M. Kane, Sushrut Karmalkar, Sihan Liu, Thanasis Pittas

    Abstract: We study the task of list-decodable linear regression using batches. A batch is called clean if it consists of i.i.d. samples from an unknown linear regression distribution. For a parameter $α\in (0, 1/2)$, an unknown $α$-fraction of the batches are clean and no assumptions are made on the remaining ones. The goal is to output a small list of vectors at least one of which is close to the true regr… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  5. arXiv:2502.14772  [pdf, other

    cs.DS cs.LG math.ST stat.ML

    Efficient Multivariate Robust Mean Estimation Under Mean-Shift Contamination

    Authors: Ilias Diakonikolas, Giannis Iakovidis, Daniel M. Kane, Thanasis Pittas

    Abstract: We study the algorithmic problem of robust mean estimation of an identity covariance Gaussian in the presence of mean-shift contamination. In this contamination model, we are given a set of points in $\mathbb{R}^d$ generated i.i.d. via the following process. For a parameter $α<1/2$, the $i$-th sample $x_i$ is obtained as follows: with probability $1-α$, $x_i$ is drawn from $\mathcal{N}(μ, I)$, whe… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

  6. arXiv:2502.09525  [pdf, other

    cs.LG cs.DS math.ST stat.ML

    Robust Learning of Multi-index Models via Iterative Subspace Approximation

    Authors: Ilias Diakonikolas, Giannis Iakovidis, Daniel M. Kane, Nikos Zarifis

    Abstract: We study the task of learning Multi-Index Models (MIMs) with label noise under the Gaussian distribution. A $K$-MIM is any function $f$ that only depends on a $K$-dimensional subspace. We focus on well-behaved MIMs with finite ranges that satisfy certain regularity properties. Our main contribution is a general robust learner that is qualitatively optimal in the Statistical Query (SQ) model. Our a… ▽ More

    Submitted 14 April, 2025; v1 submitted 13 February, 2025; originally announced February 2025.

  7. arXiv:2501.05425  [pdf, other

    cs.DS cs.LG math.ST stat.ML

    Entangled Mean Estimation in High-Dimensions

    Authors: Ilias Diakonikolas, Daniel M. Kane, Sihan Liu, Thanasis Pittas

    Abstract: We study the task of high-dimensional entangled mean estimation in the subset-of-signals model. Specifically, given $N$ independent random points $x_1,\ldots,x_N$ in $\mathbb{R}^D$ and a parameter $α\in (0, 1)$ such that each $x_i$ is drawn from a Gaussian with mean $μ$ and unknown covariance, and an unknown $α$-fraction of the points have identity-bounded covariances, the goal is to estimate the… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

  8. arXiv:2501.00508  [pdf, ps, other

    cs.LG

    Active Learning of General Halfspaces: Label Queries vs Membership Queries

    Authors: Ilias Diakonikolas, Daniel M. Kane, Mingchen Ma

    Abstract: We study the problem of learning general (i.e., not necessarily homogeneous) halfspaces under the Gaussian distribution on $R^d$ in the presence of some form of query access. In the classical pool-based active learning model, where the algorithm is allowed to make adaptive label queries to previously sampled points, we establish a strong information-theoretic lower bound ruling out non-trivial imp… ▽ More

    Submitted 31 December, 2024; originally announced January 2025.

    Comments: Accepted by NeurIPS 2024

  9. arXiv:2411.15669  [pdf, other

    cs.DS cs.LG math.ST stat.ML

    Implicit High-Order Moment Tensor Estimation and Learning Latent Variable Models

    Authors: Ilias Diakonikolas, Daniel M. Kane

    Abstract: We study the task of learning latent-variable models. A common algorithmic technique for this task is the method of moments. Unfortunately, moment-based approaches are hampered by the fact that the moment tensors of super-constant degree cannot even be written down in polynomial time. Motivated by such learning applications, we develop a general efficient algorithm for {\em implicit moment tensor… ▽ More

    Submitted 12 April, 2025; v1 submitted 23 November, 2024; originally announced November 2024.

    Comments: Abstract shortened due to arxiv requirements

  10. arXiv:2411.08183  [pdf, ps, other

    cs.CC

    Locally Sampleable Uniform Symmetric Distributions

    Authors: Daniel M. Kane, Anthony Ostuni, Kewen Wu

    Abstract: We characterize the power of constant-depth Boolean circuits in generating uniform symmetric distributions. Let $f\colon\{0,1\}^m\to\{0,1\}^n$ be a Boolean function where each output bit of $f$ depends only on $O(1)$ input bits. Assume the output distribution of $f$ on uniform input bits is close to a uniform distribution $D$ with a symmetric support. We show that $D$ is essentially one of the fol… ▽ More

    Submitted 25 February, 2025; v1 submitted 12 November, 2024; originally announced November 2024.

    Comments: This version improves the main result by removing dependence on d from the final distance bound

  11. arXiv:2408.17165  [pdf, other

    cs.LG cs.DS stat.ML

    Efficient Testable Learning of General Halfspaces with Adversarial Label Noise

    Authors: Ilias Diakonikolas, Daniel M. Kane, Sihan Liu, Nikos Zarifis

    Abstract: We study the task of testable learning of general -- not necessarily homogeneous -- halfspaces with adversarial label noise with respect to the Gaussian distribution. In the testable learning framework, the goal is to develop a tester-learner such that if the data passes the tester, then one can trust the output of the robust learner on the data.Our main result is the first polynomial time tester-… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: Presented to COLT'24

    Journal ref: "Testable Learning of General Halfspaces with Adversarial Label Noise." In The Thirty Seventh Annual Conference on Learning Theory, pp. 1308-1335. PMLR, 2024

  12. arXiv:2406.02628  [pdf, ps, other

    stat.ML cs.CC cs.DS cs.LG

    Replicability in High Dimensional Statistics

    Authors: Max Hopkins, Russell Impagliazzo, Daniel Kane, Sihan Liu, Christopher Ye

    Abstract: The replicability crisis is a major issue across nearly all areas of empirical science, calling for the formal study of replicability in statistics. Motivated in this context, [Impagliazzo, Lei, Pitassi, and Sorrell STOC 2022] introduced the notion of replicable learning algorithms, and gave basic procedures for $1$-dimensional tasks including statistical queries. In this work, we study the comput… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 119 pages

    ACM Class: F.2.0

  13. arXiv:2404.00529  [pdf, other

    cs.DS cs.LG

    Super Non-singular Decompositions of Polynomials and their Application to Robustly Learning Low-degree PTFs

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Sihan Liu, Nikos Zarifis

    Abstract: We study the efficient learnability of low-degree polynomial threshold functions (PTFs) in the presence of a constant fraction of adversarial corruptions. Our main algorithmic result is a polynomial-time PAC learning algorithm for this concept class in the strong contamination model under the Gaussian distribution with error guarantee $O_{d, c}(\text{opt}^{1-c})$, for any desired constant $c>0$, w… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: To appear in STOC2024

  14. arXiv:2403.10416  [pdf, other

    cs.LG cs.DS math.ST stat.ML

    Robust Sparse Estimation for Gaussians with Optimal Error under Huber Contamination

    Authors: Ilias Diakonikolas, Daniel M. Kane, Sushrut Karmalkar, Ankit Pensia, Thanasis Pittas

    Abstract: We study Gaussian sparse estimation tasks in Huber's contamination model with a focus on mean estimation, PCA, and linear regression. For each of these tasks, we give the first sample and computationally efficient robust estimators with optimal error guarantees, within constant factors. All prior efficient algorithms for these tasks incur quantitatively suboptimal error. Concretely, for Gaussian r… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  15. arXiv:2403.04744  [pdf, ps, other

    cs.LG cs.DS math.ST stat.ML

    SQ Lower Bounds for Non-Gaussian Component Analysis with Weaker Assumptions

    Authors: Ilias Diakonikolas, Daniel Kane, Lisheng Ren, Yuxin Sun

    Abstract: We study the complexity of Non-Gaussian Component Analysis (NGCA) in the Statistical Query (SQ) model. Prior work developed a general methodology to prove SQ lower bounds for this task that have been applicable to a wide range of contexts. In particular, it was known that for any univariate distribution $A$ satisfying certain conditions, distinguishing between a standard multivariate Gaussian and… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Conference version published in NeurIPS 2023

  16. arXiv:2403.02300  [pdf, other

    cs.DS cs.LG math.ST stat.ML

    Statistical Query Lower Bounds for Learning Truncated Gaussians

    Authors: Ilias Diakonikolas, Daniel M. Kane, Thanasis Pittas, Nikos Zarifis

    Abstract: We study the problem of estimating the mean of an identity covariance Gaussian in the truncated setting, in the regime when the truncation set comes from a low-complexity family $\mathcal{C}$ of sets. Specifically, for a fixed but unknown truncation set $S \subseteq \mathbb{R}^d$, we are given access to samples from the distribution $\mathcal{N}(\boldsymbol{ μ}, \mathbf{ I})$ truncated to the set… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  17. arXiv:2402.14278  [pdf, other

    cs.CC cs.DS quant-ph

    Locality Bounds for Sampling Hamming Slices

    Authors: Daniel M. Kane, Anthony Ostuni, Kewen Wu

    Abstract: Spurred by the influential work of Viola (Journal of Computing 2012), the past decade has witnessed an active line of research into the complexity of (approximately) sampling distributions, in contrast to the traditional focus on the complexity of computing functions. We build upon and make explicit earlier implicit results of Viola to provide superconstant lower bounds on the locality of Boolea… ▽ More

    Submitted 26 February, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Minor updates to better reflect past literature. No technical material has been changed

  18. arXiv:2312.16616  [pdf, ps, other

    cs.LG cs.DS math.ST stat.ML

    Agnostically Learning Multi-index Models with Queries

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the power of query access for the task of agnostic learning under the Gaussian distribution. In the agnostic model, no assumptions are made on the labels and the goal is to compute a hypothesis that is competitive with the {\em best-fit} function in a known class, i.e., it achieves error $\mathrm{opt}+ε$, where $\mathrm{opt}$ is the error of the best function in the class. We focus on a g… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: abstract shortened due to arxiv requirements

  19. arXiv:2312.14353  [pdf

    physics.optics

    Chaos spectrum -- semiconductor laser with delayed optical feedback

    Authors: D M Kane, M Radziunas

    Abstract: Maximizing the rf bandwidth associated with the chaotic output from tailored operation of nonlinear semiconductor laser systems is an ongoing research effort. The early pioneering research was done in semiconductor laser with delayed optical feedback systems, which continue to be researched. We report numerical simulations of this system, using a travelling wave model. The results provide new insi… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  20. arXiv:2312.11769  [pdf, other

    cs.LG cs.DS cs.IT math.ST stat.ML

    Clustering Mixtures of Bounded Covariance Distributions Under Optimal Separation

    Authors: Ilias Diakonikolas, Daniel M. Kane, Jasper C. H. Lee, Thanasis Pittas

    Abstract: We study the clustering problem for mixtures of bounded covariance distributions, under a fine-grained separation assumption. Specifically, given samples from a $k$-component mixture distribution $D = \sum_{i =1}^k w_i P_i$, where each $w_i \ge α$ for some known parameter $α$, and each $P_i$ has unknown covariance $Σ_i \preceq σ^2_i \cdot I_d$ for some unknown $σ_i$, the goal is to cluster the sam… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  21. arXiv:2312.01547  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Near-Optimal Algorithms for Gaussians with Huber Contamination: Mean Estimation and Linear Regression

    Authors: Ilias Diakonikolas, Daniel M. Kane, Ankit Pensia, Thanasis Pittas

    Abstract: We study the fundamental problems of Gaussian mean estimation and linear regression with Gaussian covariates in the presence of Huber contamination. Our main contribution is the design of the first sample near-optimal and almost linear-time algorithms with optimal error guarantees for both of these problems. Specifically, for Gaussian robust mean estimation on $\mathbb{R}^d$ with contamination par… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: To appear in NeurIPS 2023

  22. arXiv:2311.13154  [pdf, other

    cs.DS cs.IT cs.LG math.ST stat.ML

    Testing Closeness of Multivariate Distributions via Ramsey Theory

    Authors: Ilias Diakonikolas, Daniel M. Kane, Sihan Liu

    Abstract: We investigate the statistical task of closeness (or equivalence) testing for multidimensional distributions. Specifically, given sample access to two unknown distributions $\mathbf p, \mathbf q$ on $\mathbb R^d$, we want to distinguish between the case that $\mathbf p=\mathbf q$ versus $\|\mathbf p-\mathbf q\|_{A_k} > ε$, where $\|\mathbf p-\mathbf q\|_{A_k}$ denotes the generalized ${A}_k$ dista… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  23. arXiv:2310.15932  [pdf, other

    cs.LG cs.DS math.ST stat.ML

    Online Robust Mean Estimation

    Authors: Daniel M. Kane, Ilias Diakonikolas, Hanshen Xiao, Sihan Liu

    Abstract: We study the problem of high-dimensional robust mean estimation in an online setting. Specifically, we consider a scenario where $n$ sensors are measuring some common, ongoing phenomenon. At each time step $t=1,2,\ldots,T$, the $i^{th}$ sensor reports its readings $x^{(i)}_t$ for that time step. The algorithm must then commit to its estimate $μ_t$ for the true mean value of the process at time… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: To appear in SODA2024

  24. arXiv:2310.11876  [pdf, ps, other

    cs.LG cs.DS math.ST stat.ML

    SQ Lower Bounds for Learning Mixtures of Linear Classifiers

    Authors: Ilias Diakonikolas, Daniel M. Kane, Yuxin Sun

    Abstract: We study the problem of learning mixtures of linear classifiers under Gaussian covariates. Given sample access to a mixture of $r$ distributions on $\mathbb{R}^n$ of the form $(\mathbf{x},y_{\ell})$, $\ell\in [r]$, where $\mathbf{x}\sim\mathcal{N}(0,\mathbf{I}_n)$ and $y_\ell=\mathrm{sign}(\langle\mathbf{v}_\ell,\mathbf{x}\rangle)$ for an unknown unit vector $\mathbf{v}_\ell$, the goal is to learn… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: To appear in NeurIPS 2023

  25. arXiv:2310.00211  [pdf, ps, other

    math.ST

    Theoretical Foundations of Ordinal Multidimensional Scaling, Including Internal and External Unfolding

    Authors: Ery Arias-Castro, Clément Berenfeld, Daniel Kane

    Abstract: We provide a comprehensive theory of multiple variants of ordinal multidimensional scaling,including internal unfolding and external unfolding. We first follow Shepard (1966) and work in a continuum model to gain insight. We then follow Kleindessner and von Luxburg (2014) and work in an asymptotic discrete setting.

    Submitted 17 June, 2025; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: final version, focusing on the point model and with discrete-to-continuous convergence, to be published in SIMODS

  26. arXiv:2309.06983  [pdf, ps, other

    stat.OT

    Creating Community in a Data Science Classroom

    Authors: David Kane

    Abstract: A community is a collection of people who know and care about each other. The vast majority of college courses are not communities. This is especially true of statistics and data science courses, both because our classes are larger and because we are more likely to lecture. However, it is possible to create a community in your classroom. This article offers an idiosyncratic set of practices for cr… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

  27. arXiv:2308.00089  [pdf, ps, other

    cs.LG math.ST

    New Lower Bounds for Testing Monotonicity and Log Concavity of Distributions

    Authors: Yuqian Cheng, Daniel M. Kane, Zhicheng Zheng

    Abstract: We develop a new technique for proving distribution testing lower bounds for properties defined by inequalities involving the bin probabilities of the distribution in question. Using this technique we obtain new lower bounds for monotonicity testing over discrete cubes and tight lower bounds for log-concavity testing. Our basic technique involves constructing a pair of moment-matching families o… ▽ More

    Submitted 31 July, 2023; originally announced August 2023.

    MSC Class: 62G10 ACM Class: G.3; F.2.1

  28. arXiv:2307.12840  [pdf, other

    cs.LG cs.DS math.ST stat.ML

    Efficiently Learning One-Hidden-Layer ReLU Networks via Schur Polynomials

    Authors: Ilias Diakonikolas, Daniel M. Kane

    Abstract: We study the problem of PAC learning a linear combination of $k$ ReLU activations under the standard Gaussian distribution on $\mathbb{R}^d$ with respect to the square loss. Our main result is an efficient algorithm for this learning task with sample and computational complexity $(dk/ε)^{O(k)}$, where $ε>0$ is the target accuracy. Prior work had given an algorithm for this problem with complexity… ▽ More

    Submitted 25 July, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

  29. arXiv:2307.08438  [pdf, ps, other

    cs.LG cs.DS math.ST stat.ML

    Near-Optimal Bounds for Learning Gaussian Halfspaces with Random Classification Noise

    Authors: Ilias Diakonikolas, Jelena Diakonikolas, Daniel M. Kane, Puqian Wang, Nikos Zarifis

    Abstract: We study the problem of learning general (i.e., not necessarily homogeneous) halfspaces with Random Classification Noise under the Gaussian distribution. We establish nearly-matching algorithmic and Statistical Query (SQ) lower bound results revealing a surprising information-computation gap for this basic problem. Specifically, the sample complexity of this learning problem is $\widetildeΘ(d/ε)$,… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  30. arXiv:2306.16352  [pdf, ps, other

    cs.LG cs.DS math.ST stat.ML

    Information-Computation Tradeoffs for Learning Margin Halfspaces with Random Classification Noise

    Authors: Ilias Diakonikolas, Jelena Diakonikolas, Daniel M. Kane, Puqian Wang, Nikos Zarifis

    Abstract: We study the problem of PAC learning $γ$-margin halfspaces with Random Classification Noise. We establish an information-computation tradeoff suggesting an inherent gap between the sample complexity of the problem and the sample complexity of computationally efficient algorithms. Concretely, the sample complexity of the problem is $\widetildeΘ(1/(γ^2 ε))$. We start by giving a simple efficient alg… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  31. arXiv:2306.13057  [pdf, other

    cs.LG cs.DS math.ST stat.ML

    SQ Lower Bounds for Learning Bounded Covariance GMMs

    Authors: Ilias Diakonikolas, Daniel M. Kane, Thanasis Pittas, Nikos Zarifis

    Abstract: We study the complexity of learning mixtures of separated Gaussians with common unknown bounded covariance matrix. Specifically, we focus on learning Gaussian mixture models (GMMs) on $\mathbb{R}^d$ of the form $P= \sum_{i=1}^k w_i \mathcal{N}(\boldsymbol μ_i,\mathbf Σ_i)$, where $\mathbf Σ_i = \mathbf Σ\preceq \mathbf I$ and $\min_{i \neq j} \| \boldsymbol μ_i - \boldsymbol μ_j\|_2 \geq k^ε$ for… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

  32. arXiv:2305.02544  [pdf, other

    cs.LG cs.DS math.ST stat.ML

    Nearly-Linear Time and Streaming Algorithms for Outlier-Robust PCA

    Authors: Ilias Diakonikolas, Daniel M. Kane, Ankit Pensia, Thanasis Pittas

    Abstract: We study principal component analysis (PCA), where given a dataset in $\mathbb{R}^d$ from a distribution, the task is to find a unit vector $v$ that approximately maximizes the variance of the distribution after being projected along $v$. Despite being a classical task, standard estimators fail drastically if the data contains even a small fraction of outliers, motivating the problem of robust PCA… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: To appear in ICML 2023

  33. arXiv:2305.00966  [pdf, other

    cs.DS cs.LG math.ST stat.ML

    A Spectral Algorithm for List-Decodable Covariance Estimation in Relative Frobenius Norm

    Authors: Ilias Diakonikolas, Daniel M. Kane, Jasper C. H. Lee, Ankit Pensia, Thanasis Pittas

    Abstract: We study the problem of list-decodable Gaussian covariance estimation. Given a multiset $T$ of $n$ points in $\mathbb R^d$ such that an unknown $α<1/2$ fraction of points in $T$ are i.i.d. samples from an unknown Gaussian $\mathcal{N}(μ, Σ)$, the goal is to output a list of $O(1/α)$ hypotheses at least one of which is close to $Σ$ in relative Frobenius norm. Our main result is a… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

  34. A review of ptychographic techniques for ultrashort pulse measurement

    Authors: Daniel J Kane, Andrei B. Vakhtin

    Abstract: The measurement of optical ultrafast laser pulses is done indirectly because the required bandwidth to measure these pulses exceeds the bandwidth of current electronics. As a result, this measurement problem is often posed as a 1-D phase retrieval problem, which is fraught with ambiguities. The phase retrieval method known as ptychography solves this problem by making it possible to measure ultraf… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: 96 pages; 26 figures

    Journal ref: Progress in Quantum Electronics Volume 81, January 2022, 100364

  35. arXiv:2303.05485  [pdf, ps, other

    cs.LG stat.ML

    Efficient Testable Learning of Halfspaces with Adversarial Label Noise

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Sihan Liu, Nikos Zarifis

    Abstract: We give the first polynomial-time algorithm for the testable learning of halfspaces in the presence of adversarial label noise under the Gaussian distribution. In the recently introduced testable learning model, one is required to produce a tester-learner such that if the data passes the tester, then one can trust the output of the robust learner on the data. Our tester-learner runs in time… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

  36. arXiv:2302.12940  [pdf, ps, other

    cs.LG cs.AI cs.CC

    Exponential Hardness of Reinforcement Learning with Linear Function Approximation

    Authors: Daniel Kane, Sihan Liu, Shachar Lovett, Gaurav Mahajan, Csaba Szepesvári, Gellért Weisz

    Abstract: A fundamental question in reinforcement learning theory is: suppose the optimal value functions are linear in given features, can we learn them efficiently? This problem's counterpart in supervised learning, linear regression, can be solved both statistically and computationally efficiently. Therefore, it was quite surprising when a recent work \cite{kane2022computational} showed a computational-s… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

  37. arXiv:2302.06512  [pdf, ps, other

    cs.LG cs.CC cs.DS

    Near-Optimal Cryptographic Hardness of Agnostically Learning Halfspaces and ReLU Regression under Gaussian Marginals

    Authors: Ilias Diakonikolas, Daniel M. Kane, Lisheng Ren

    Abstract: We study the task of agnostically learning halfspaces under the Gaussian distribution. Specifically, given labeled examples $(\mathbf{x},y)$ from an unknown distribution on $\mathbb{R}^n \times \{ \pm 1\}$, whose marginal distribution on $\mathbf{x}$ is the standard Gaussian and the labels $y$ can be arbitrary, the goal is to output a hypothesis with 0-1 loss $\mathrm{OPT}+ε$, where… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

  38. arXiv:2302.06285  [pdf, ps, other

    cs.LG stat.ML

    Do PAC-Learners Learn the Marginal Distribution?

    Authors: Max Hopkins, Daniel M. Kane, Shachar Lovett, Gaurav Mahajan

    Abstract: The Fundamental Theorem of PAC Learning asserts that learnability of a concept class $H$ is equivalent to the $\textit{uniform convergence}$ of empirical error in $H$ to its mean, or equivalently, to the problem of $\textit{density estimation}$, learnability of the underlying marginal distribution with respect to events in $H$. This seminal equivalence relies strongly on PAC learning's `distributi… ▽ More

    Submitted 3 March, 2025; v1 submitted 13 February, 2023; originally announced February 2023.

    MSC Class: 68Q32

  39. arXiv:2212.11221  [pdf, ps, other

    math.PR cs.DS cs.LG math.ST stat.ML

    A Nearly Tight Bound for Fitting an Ellipsoid to Gaussian Random Points

    Authors: Daniel M. Kane, Ilias Diakonikolas

    Abstract: We prove that for $c>0$ a sufficiently small universal constant that a random set of $c d^2/\log^4(d)$ independent Gaussian random points in $\mathbb{R}^d$ lie on a common ellipsoid with high probability. This nearly establishes a conjecture of~\cite{SaundersonCPW12}, within logarithmic factors. The latter conjecture has attracted significant attention over the past decade, due to its connections… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

  40. arXiv:2212.03008  [pdf, ps, other

    cs.DS cs.CC cs.LG stat.ML

    A Strongly Polynomial Algorithm for Approximate Forster Transforms and its Application to Halfspace Learning

    Authors: Ilias Diakonikolas, Christos Tzamos, Daniel M. Kane

    Abstract: The Forster transform is a method of regularizing a dataset by placing it in {\em radial isotropic position} while maintaining some of its essential properties. Forster transforms have played a key role in a diverse range of settings spanning computer science and functional analysis. Prior work had given {\em weakly} polynomial time algorithms for computing Forster transforms, when they exist. Our… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

  41. arXiv:2211.16333  [pdf, ps, other

    cs.DS cs.LG math.ST stat.ML

    Outlier-Robust Sparse Mean Estimation for Heavy-Tailed Distributions

    Authors: Ilias Diakonikolas, Daniel M. Kane, Jasper C. H. Lee, Ankit Pensia

    Abstract: We study the fundamental task of outlier-robust mean estimation for heavy-tailed distributions in the presence of sparsity. Specifically, given a small number of corrupted samples from a high-dimensional heavy-tailed distribution whose mean $μ$ is guaranteed to be sparse, the goal is to efficiently compute a hypothesis that accurately approximates $μ$ with high probability. Prior work had obtained… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: To appear in NeurIPS 2022

  42. arXiv:2211.13751  [pdf, other

    physics.flu-dyn

    Asymptotic Nusselt numbers for internal flow in the Cassie state

    Authors: Daniel Kane, Marc Hodes, Martin Z. Bazant, Toby L. Kirk

    Abstract: We consider laminar, fully-developed, Poiseuille flows of liquid in the Cassie state through diabatic, parallel-plate microchannels symmetrically textured with isoflux ridges. Through the use of matched asymptotic expansions we analytically develop expressions for (apparent hydrodynamic) slip lengths and variously-defined Nusselt numbers. Our small parameter ($ε$) is the pitch of the ridges divide… ▽ More

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: 41 pages, submitted to Journal of Fluid Mechanics

  43. arXiv:2210.13706  [pdf, ps, other

    math.ST cs.DS cs.LG stat.ML

    Gaussian Mean Testing Made Simple

    Authors: Ilias Diakonikolas, Daniel M. Kane, Ankit Pensia

    Abstract: We study the following fundamental hypothesis testing problem, which we term Gaussian mean testing. Given i.i.d. samples from a distribution $p$ on $\mathbb{R}^d$, the task is to distinguish, with high probability, between the following cases: (i) $p$ is the standard Gaussian distribution, $\mathcal{N}(0,I_d)$, and (ii) $p$ is a Gaussian $\mathcal{N}(μ,Σ)$ for some unknown covariance $Σ$ and mean… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: To appear in SIAM Symposium on Simplicity in Algorithms (SOSA) 2023

  44. arXiv:2210.09949  [pdf, ps, other

    cs.LG cs.DS math.ST stat.ML

    SQ Lower Bounds for Learning Single Neurons with Massart Noise

    Authors: Ilias Diakonikolas, Daniel M. Kane, Lisheng Ren, Yuxin Sun

    Abstract: We study the problem of PAC learning a single neuron in the presence of Massart noise. Specifically, for a known activation function $f: \mathbb{R} \to \mathbb{R}$, the learner is given access to labeled examples $(\mathbf{x}, y) \in \mathbb{R}^d \times \mathbb{R}$, where the marginal distribution of $\mathbf{x}$ is arbitrary and the corresponding label $y$ is a Massart corruption of… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: To appear in NeurIPS 2022

  45. arXiv:2207.14266  [pdf, other

    cs.LG cs.CC cs.DS

    Cryptographic Hardness of Learning Halfspaces with Massart Noise

    Authors: Ilias Diakonikolas, Daniel M. Kane, Pasin Manurangsi, Lisheng Ren

    Abstract: We study the complexity of PAC learning halfspaces in the presence of Massart noise. In this problem, we are given i.i.d. labeled examples $(\mathbf{x}, y) \in \mathbb{R}^N \times \{ \pm 1\}$, where the distribution of $\mathbf{x}$ is arbitrary and the label $y$ is a Massart corruption of $f(\mathbf{x})$, for an unknown halfspace $f: \mathbb{R}^N \to \{ \pm 1\}$, with flipping probability… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

  46. arXiv:2207.06596  [pdf, other

    cs.DS cs.LG math.ST

    Near-Optimal Bounds for Testing Histogram Distributions

    Authors: Clément L. Canonne, Ilias Diakonikolas, Daniel M. Kane, Sihan Liu

    Abstract: We investigate the problem of testing whether a discrete probability distribution over an ordered domain is a histogram on a specified number of bins. One of the most common tools for the succinct approximation of data, $k$-histograms over $[n]$, are probability distributions that are piecewise constant over a set of $k$ intervals. The histogram testing problem is the following: Given samples from… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

  47. arXiv:2206.05245  [pdf, other

    cs.DS cs.LG math.ST stat.ML

    List-Decodable Sparse Mean Estimation via Difference-of-Pairs Filtering

    Authors: Ilias Diakonikolas, Daniel M. Kane, Sushrut Karmalkar, Ankit Pensia, Thanasis Pittas

    Abstract: We study the problem of list-decodable sparse mean estimation. Specifically, for a parameter $α\in (0, 1/2)$, we are given $m$ points in $\mathbb{R}^n$, $\lfloor αm \rfloor$ of which are i.i.d. samples from a distribution $D$ with unknown $k$-sparse mean $μ$. No assumptions are made on the remaining points, which form the majority of the dataset. The goal is to return a small list of candidates co… ▽ More

    Submitted 5 July, 2024; v1 submitted 10 June, 2022; originally announced June 2022.

    Comments: Added fact about taking roots in SoS proofs (Fact 2.9)

  48. arXiv:2206.04589  [pdf, ps, other

    cs.DS cs.LG math.ST stat.ML

    Optimal SQ Lower Bounds for Robustly Learning Discrete Product Distributions and Ising Models

    Authors: Ilias Diakonikolas, Daniel M. Kane, Yuxin Sun

    Abstract: We establish optimal Statistical Query (SQ) lower bounds for robustly learning certain families of discrete high-dimensional distributions. In particular, we show that no efficient SQ algorithm with access to an $ε$-corrupted binary product distribution can learn its mean within $\ell_2$-error $o(ε\sqrt{\log(1/ε)})$. Similarly, we show that no efficient SQ algorithm with access to an $ε$-corrupted… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: To appear in COLT 2022

  49. arXiv:2206.03441  [pdf, other

    cs.DS cs.LG math.ST stat.ML

    Robust Sparse Mean Estimation via Sum of Squares

    Authors: Ilias Diakonikolas, Daniel M. Kane, Sushrut Karmalkar, Ankit Pensia, Thanasis Pittas

    Abstract: We study the problem of high-dimensional sparse mean estimation in the presence of an $ε$-fraction of adversarial outliers. Prior work obtained sample and computationally efficient algorithms for this task for identity-covariance subgaussian distributions. In this work, we develop the first efficient algorithms for robust sparse mean estimation without a priori knowledge of the covariance. For dis… ▽ More

    Submitted 5 July, 2024; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: Fixed minor oversight in runtime calculation

  50. arXiv:2204.12399  [pdf, other

    cs.DS cs.LG math.ST stat.ML

    Streaming Algorithms for High-Dimensional Robust Statistics

    Authors: Ilias Diakonikolas, Daniel M. Kane, Ankit Pensia, Thanasis Pittas

    Abstract: We study high-dimensional robust statistics tasks in the streaming model. A recent line of work obtained computationally efficient algorithms for a range of high-dimensional robust estimation tasks. Unfortunately, all previous algorithms require storing the entire dataset, incurring memory at least quadratic in the dimension. In this work, we develop the first efficient streaming algorithms for hi… ▽ More

    Submitted 3 May, 2023; v1 submitted 26 April, 2022; originally announced April 2022.