Showing 1–24 of 24 results for author: Błasiok, J

Searching in archive cs.
  1. arXiv:2502.15372  [pdf, ps, other]

    cs.LG cs.DS

    Efficient and Provable Algorithms for Covariate Shift

    Authors: Deeksha Adil, Jarosław Błasiok

    Abstract: Covariate shift, a widely used assumption in tackling {\it distributional shift} (when training and test distributions differ), focuses on scenarios where the distribution of the labels conditioned on the feature vector is the same, but the distribution of features in the training and test data are different. Despite the significance and extensive work on covariate shift, theoretical guarantees fo…

    Submitted 21 February, 2025; originally announced February 2025.

  2. arXiv:2501.09545  [pdf, ps, other]

    cs.CC

    Hardness of clique approximation for monotone circuits

    Authors: Jarosław Błasiok, Linus Meierhöfer

    Abstract: We consider a problem of approximating the size of the largest clique in a graph, with a monotone circuit. Concretely, we focus on distinguishing a random Erdős-Rényi graph $\mathcal{G}_{n,p}$, with $p=n^{-\frac{2}{α-1}}$ chosen such that with high probability it does not even have an $α$-clique, from a random clique on $β$ vertices (where $α\leq β$). Using the approximation method of Razborov, Alon and…

    Submitted 16 January, 2025; originally announced January 2025.

  3. arXiv:2404.14159  [pdf, ps, other]

    cs.DS

    Semirandom Planted Clique and the Restricted Isometry Property

    Authors: Jarosław Błasiok, Rares-Darius Buhai, Pravesh K. Kothari, David Steurer

    Abstract: We give a simple, greedy $O(n^{ω+0.5})=O(n^{2.872})$-time algorithm to list-decode planted cliques in a semirandom model introduced in [CSV17] (following [FK01]) that succeeds whenever the size of the planted clique is $k\geq O(\sqrt{n} \log^2 n)$. In the model, the edges touching the vertices in the planted $k$-clique are drawn independently with probability $p=1/2$ while the edges not touching t…

    Submitted 9 October, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 22 pages, to appear in FOCS 2024

  4. arXiv:2309.12236  [pdf, other]

    cs.LG

    Smooth ECE: Principled Reliability Diagrams via Kernel Smoothing

    Authors: Jarosław Błasiok, Preetum Nakkiran

    Abstract: Calibration measures and reliability diagrams are two fundamental tools for measuring and interpreting the calibration of probabilistic predictors. Calibration measures quantify the degree of miscalibration, and reliability diagrams visualize the structure of this miscalibration. However, the most common constructions of reliability diagrams and calibration measures -- binning and ECE -- both suff…

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: Code at: https://github.com/apple/ml-calibration

  5. arXiv:2305.18764  [pdf, other]

    cs.LG math.ST stat.ML

    When Does Optimizing a Proper Loss Yield Calibration?

    Authors: Jarosław Błasiok, Parikshit Gopalan, Lunjia Hu, Preetum Nakkiran

    Abstract: Optimizing proper loss functions is popularly believed to yield predictors with good calibration properties; the intuition being that for such losses, the global optimum is to predict the ground-truth probabilities, which is indeed calibrated. However, typical machine learning models are trained to approximately minimize loss over restricted families of predictors, that are unlikely to contain the…

    Submitted 8 December, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: In NeurIPS 2023. Selected for spotlight presentation

  6. arXiv:2304.09424  [pdf, other]

    cs.LG cs.AI stat.ML

    Loss Minimization Yields Multicalibration for Large Neural Networks

    Authors: Jarosław Błasiok, Parikshit Gopalan, Lunjia Hu, Adam Tauman Kalai, Preetum Nakkiran

    Abstract: Multicalibration is a notion of fairness for predictors that requires them to provide calibrated predictions across a large set of protected groups. Multicalibration is known to be a distinct goal from loss minimization, even for simple predictors such as linear functions. In this work, we consider the setting where the protected groups can be represented by neural networks of size $k$, and the…

    Submitted 7 December, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: In ITCS 2024

  7. arXiv:2302.11476  [pdf, ps, other]

    cs.CC cs.DM

    Matrix Multiplication and Number On the Forehead Communication

    Authors: Josh Alman, Jarosław Błasiok

    Abstract: Three-player Number On the Forehead communication may be thought of as a three-player Number In the Hand promise model, in which each player is given the inputs that are supposedly on the other two players' heads, and promised that they are consistent with the inputs of the other players. The set of all allowed inputs under this promise may be thought of as an order-3 tensor. We surprisingly ob…

    Submitted 22 February, 2023; originally announced February 2023.

  8. arXiv:2211.16886  [pdf, other]

    cs.LG

    A Unifying Theory of Distance from Calibration

    Authors: Jarosław Błasiok, Parikshit Gopalan, Lunjia Hu, Preetum Nakkiran

    Abstract: We study the fundamental question of how to define and measure the distance from calibration for probabilistic predictors. While the notion of perfect calibration is well-understood, there is no consensus on how to quantify the distance from perfect calibration. Numerous calibration measures have been proposed in the literature, but it is unclear how they compare to each other, and many popular me…

    Submitted 31 March, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: In STOC 2023

  9. arXiv:2211.13473  [pdf, ps, other]

    cs.CC

    Communication Complexity of Inner Product in Symmetric Normed Spaces

    Authors: Alexandr Andoni, Jarosław Błasiok, Arnold Filtser

    Abstract: We introduce and study the communication complexity of computing the inner product of two vectors, where the input is restricted w.r.t. a norm $N$ on the space $\mathbb{R}^n$. Here, Alice and Bob hold two vectors $v,u$ such that $\|v\|_N\le 1$ and $\|u\|_{N^*}\le 1$, where $N^*$ is the dual norm. They want to compute their inner product $\langle v,u \rangle$ up to an $\varepsilon$ additive term. T…

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: Accepted to ITCS 2023

  10. arXiv:2204.03230  [pdf, other]

    cs.LG cs.AI cs.CR cs.CV stat.ML

    What You See is What You Get: Principled Deep Learning via Distributional Generalization

    Authors: Bogdan Kulynych, Yao-Yuan Yang, Yaodong Yu, Jarosław Błasiok, Preetum Nakkiran

    Abstract: Having similar behavior at training time and test time $-$ what we call a "What You See Is What You Get" (WYSIWYG) property $-$ is desirable in machine learning. Models trained with standard stochastic gradient descent (SGD), however, do not necessarily have this property, as their complex behaviors such as robustness or subgroup performance can differ drastically between training and test time. I…

    Submitted 17 October, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: First two authors contributed equally. To appear in NeurIPS 2022

  11. arXiv:2107.10797  [pdf, other]

    cs.CC

    Fourier growth of structured $\mathbb{F}_2$-polynomials and applications

    Authors: Jarosław Błasiok, Peter Ivanov, Yaonan Jin, Chin Ho Lee, Rocco A. Servedio, Emanuele Viola

    Abstract: We analyze the Fourier growth, i.e. the $L_1$ Fourier weight at level $k$ (denoted $L_{1,k}$), of various well-studied classes of "structured" $\mathbb{F}_2$-polynomials. This study is motivated by applications in pseudorandomness, in particular recent results and conjectures due to [CHHL19,CHLT19,CGLSS20] which show that upper bounds on Fourier growth (even at level $k=2$) give unconditional pseu…

    Submitted 11 October, 2024; v1 submitted 22 July, 2021; originally announced July 2021.

    Comments: Corrected a mistake in Lemma 27 in the previous version of the paper

  12. arXiv:1903.12135  [pdf, other]

    cs.IT cs.DM cs.DS math.PR

    An Improved Lower Bound for Sparse Reconstruction from Subsampled Walsh Matrices

    Authors: Jarosław Błasiok, Patrick Lopatto, Kyle Luh, Jake Marcinek, Shravas Rao

    Abstract: We give a short argument that yields a new lower bound on the number of subsampled rows from a bounded, orthonormal matrix necessary to form a matrix with the restricted isometry property. We show that a matrix formed by uniformly subsampling rows of an $N \times N$ Walsh matrix contains a $K$-sparse vector in the kernel, unless the number of subsampled rows is $Ω(K \log K \log (N/K))$ -- our lowe…

    Submitted 9 May, 2023; v1 submitted 28 March, 2019; originally announced March 2019.

    Comments: Revised version. Published in Discrete Analysis

  13. arXiv:1811.03763  [pdf, ps, other]

    cs.DS

    Towards Instance-Optimal Private Query Release

    Authors: Jaroslaw Blasiok, Mark Bun, Aleksandar Nikolov, Thomas Steinke

    Abstract: We study efficient mechanisms for the query release problem in differential privacy: given a workload of $m$ statistical queries, output approximate answers to the queries while satisfying the constraints of differential privacy. In particular, we are interested in mechanisms that optimally adapt to the given workload. Building on the projection mechanism of Nikolov, Talwar, and Zhang, and using t…

    Submitted 8 November, 2018; originally announced November 2018.

    Comments: To appear in SODA 2019

  14. arXiv:1810.04298  [pdf, ps, other]

    cs.IT

    Polar Codes with exponentially small error at finite block length

    Authors: Jarosław Błasiok, Venkatesan Guruswami, Madhu Sudan

    Abstract: We show that the entire class of polar codes (up to a natural necessary condition) converge to capacity at block lengths polynomial in the gap to capacity, while simultaneously achieving failure probabilities that are exponentially small in the block length (i.e., decoding fails with probability $\exp(-N^{Ω(1)})$ for codes of length $N$). Previously this combination was known only for one specific…

    Submitted 9 October, 2018; originally announced October 2018.

    Comments: 17 pages, Appeared in RANDOM'18. arXiv admin note: substantial text overlap with arXiv:1802.02718

  15. arXiv:1809.05596  [pdf, ps, other]

    stat.ME cs.LG math.ST

    The Generic Holdout: Preventing False-Discoveries in Adaptive Data Science

    Authors: Preetum Nakkiran, Jarosław Błasiok

    Abstract: Adaptive data analysis has posed a challenge to science due to its ability to generate false hypotheses on moderately large data sets. In general, with non-adaptive data analyses (where queries to the data are generated without being influenced by answers to previous queries) a data set containing $n$ samples may support exponentially many queries in $n$. This number reduces to linearly many under…

    Submitted 14 September, 2018; originally announced September 2018.

  16. arXiv:1804.01642  [pdf, ps, other]

    cs.DS

    Optimal streaming and tracking distinct elements with high probability

    Authors: Jarosław Błasiok

    Abstract: The distinct elements problem is one of the fundamental problems in streaming algorithms --- given a stream of integers in the range $\{1,\ldots,n\}$, we wish to provide a $(1+\varepsilon)$ approximation to the number of distinct elements in the input. After a long line of research an optimal solution for this problem with constant probability of success, using…

    Submitted 4 January, 2019; v1 submitted 4 April, 2018; originally announced April 2018.

    Comments: Preliminary version of this paper appeared in SODA 2018

  17. General Strong Polarization

    Authors: Jarosław Błasiok, Venkatesan Guruswami, Preetum Nakkiran, Atri Rudra, Madhu Sudan

    Abstract: Arikan's exciting discovery of polar codes has provided an altogether new way to efficiently achieve Shannon capacity. Given a (constant-sized) invertible matrix $M$, a family of polar codes can be associated with this matrix and its ability to approach capacity follows from the {\em polarization} of an associated $[0,1]$-bounded martingale, namely its convergence in the limit to either $0$ or…

    Submitted 8 May, 2022; v1 submitted 8 February, 2018; originally announced February 2018.

    Comments: 73 pages, 2 figures. The final version appeared in JACM. This paper combines results presented in preliminary form at STOC 2018 and RANDOM 2018

    Journal ref: Jarosław Błasiok, Venkatesan Guruswami, Preetum Nakkiran, Atri Rudra, and Madhu Sudan. 2022. General Strong Polarization. J. ACM 69, 2, Article 11 (April 2022), 67 pages

  18. arXiv:1709.07308  [pdf, other]

    cs.DS cs.DM cs.LG cs.SI math.CO

    Predicting Positive and Negative Links with Noisy Queries: Theory & Practice

    Authors: Charalampos E. Tsourakakis, Michael Mitzenmacher, Kasper Green Larsen, Jarosław Błasiok, Ben Lawson, Preetum Nakkiran, Vasileios Nakos

    Abstract: Social networks involve both positive and negative relationships, which can be captured in signed graphs. The {\em edge sign prediction problem} aims to predict whether an interaction between a pair of nodes will be positive or negative. We provide theoretical results for this problem that motivate natural improvements to recent heuristics. The edge sign prediction problem is related to correlat…

    Submitted 6 December, 2020; v1 submitted 19 September, 2017; originally announced September 2017.

    Comments: arXiv admin note: text overlap with arXiv:1609.00750

  19. arXiv:1704.06710  [pdf, ps, other]

    cs.DS

    Continuous monitoring of $\ell_p$ norms in data streams

    Authors: Jarosław Błasiok, Jian Ding, Jelani Nelson

    Abstract: In insertion-only streaming, one sees a sequence of indices $a_1, a_2, \ldots, a_m\in [n]$. The stream defines a sequence of $m$ frequency vectors $x^{(1)},\ldots,x^{(m)}\in\mathbb{R}^n$ with $(x^{(t)})_i = |\{j : j\in[t], a_j = i\}|$. That is, $x^{(t)}$ is the frequency vector after seeing the first $t$ items in the stream. Much work in the streaming literature focuses on estimating some function…

    Submitted 8 November, 2017; v1 submitted 21 April, 2017; originally announced April 2017.

    Comments: v2: Lemma 10 proof now correctly bounds q <= (1/eps)^{O(1/p)} instead of the previously erroneous 1/eps^4. All stated results still hold for p in (0,2] bounded away from zero

  20. arXiv:1609.05388  [pdf, other]

    stat.ML cs.LG

    ADAGIO: Fast Data-aware Near-Isometric Linear Embeddings

    Authors: Jarosław Błasiok, Charalampos E. Tsourakakis

    Abstract: Many important applications, including signal reconstruction, parameter estimation, and signal processing in a compressed domain, rely on a low-dimensional representation of the dataset that preserves {\em all} pairwise distances between the data points and leverages the inherent geometric structure that is typically present. Recently Hedge, Sankaranarayanan, Yin and Baraniuk \cite{hedge2015} prop…

    Submitted 17 September, 2016; originally announced September 2016.

    Comments: ICDM 2016

  21. arXiv:1602.05719  [pdf, ps, other]

    cs.LG cs.DS cs.IT math.PR

    An improved analysis of the ER-SpUD dictionary learning algorithm

    Authors: Jarosław Błasiok, Jelani Nelson

    Abstract: In "dictionary learning" we observe $Y = AX + E$ for some $Y\in\mathbb{R}^{n\times p}$, $A \in\mathbb{R}^{m\times n}$, and $X\in\mathbb{R}^{m\times p}$. The matrix $Y$ is observed, and $A, X, E$ are unknown. Here $E$ is "noise" of small norm, and $X$ is column-wise sparse. The matrix $A$ is referred to as a {\em dictionary}, and its columns as {\em atoms}. Then, given some small number $p$ of samp…

    Submitted 18 February, 2016; originally announced February 2016.

    ACM Class: I.2.6; F.2.0

  22. arXiv:1511.01111  [pdf, other]

    cs.DS

    Streaming Symmetric Norms via Measure Concentration

    Authors: Jaroslaw Blasiok, Vladimir Braverman, Stephen R. Chestnut, Robert Krauthgamer, Lin F. Yang

    Abstract: We characterize the streaming space complexity of every symmetric norm $l$ (a norm on $\mathbb{R}^n$ invariant under sign-flips and coordinate-permutations), by relating this space complexity to the measure-concentration characteristics of $l$. Specifically, we provide nearly matching upper and lower bounds on the space complexity of calculating a $(1\pmε)$-approximation to the norm of the stream,…

    Submitted 26 June, 2017; v1 submitted 3 November, 2015; originally announced November 2015.

    Comments: published in STOC 2017

  23. arXiv:1510.07135  [pdf, ps, other]

    math.CO cs.DM

    Induced minors and well-quasi-ordering

    Authors: Jarosław Błasiok, Marcin Kamiński, Jean-Florent Raymond, Théophile Trunck

    Abstract: A graph $H$ is an induced minor of a graph $G$ if it can be obtained from an induced subgraph of $G$ by contracting edges. Otherwise, $G$ is said to be $H$-induced minor-free. Robin Thomas showed that $K_4$-induced minor-free graphs are well-quasi-ordered by induced minors [Graphs without $K_4$ and well-quasi-ordering, Journal of Combinatorial Theory, Series B, 38(3):240 -- 247, 1985]. We provid…

    Submitted 22 January, 2018; v1 submitted 24 October, 2015; originally announced October 2015.

    MSC Class: 05C; 06A07 ACM Class: G.2.2

  24. arXiv:1304.5849  [pdf, other]

    cs.DS cs.DM

    Chain minors are FPT

    Authors: Jaroslaw Blasiok, Marcin Kaminski

    Abstract: Given two finite posets P and Q, P is a chain minor of Q if there exists a partial function f from the elements of Q to the elements of P such that for every chain in P there is a chain C_Q in Q with the property that f restricted to C_Q is an isomorphism of chains. We give an algorithm to decide whether a poset P is a chain minor of a poset Q that runs in time O(|Q| log |Q|) for every fixed poset…

    Submitted 22 April, 2013; originally announced April 2013.