Showing 1–5 of 5 results for author: Mazooji, K

Searching in archive cs.
  1. arXiv:2501.13093  [pdf, other]

    cs.IT cs.AI cs.DS cs.LG math.ST

    Guaranteed Recovery of Unambiguous Clusters

    Authors: Kayvon Mazooji, Ilan Shomorony

    Abstract: Clustering is often a challenging problem because of the inherent ambiguity in what the "correct" clustering should be. Even when the number of clusters $K$ is known, this ambiguity often still exists, particularly when there is variation in density among different clusters, and clusters have multiple relatively separated regions of high density. In this paper we propose an information-theoretic c…

    Submitted 7 May, 2025; v1 submitted 22 January, 2025; originally announced January 2025.

    Comments: 12 pages, includes minor changes and some new content compared to previous version

  2. arXiv:2401.14277  [pdf, ps, other]

    cs.IT cs.DS math.PR math.ST

    An Instance-Based Approach to the Trace Reconstruction Problem

    Authors: Kayvon Mazooji, Ilan Shomorony

    Abstract: In the trace reconstruction problem, one observes the output of passing a binary string $s \in \{0,1\}^n$ through a deletion channel $T$ times and wishes to recover $s$ from the resulting $T$ "traces." Most of the literature has focused on characterizing the hardness of this problem in terms of the number of traces $T$ needed for perfect reconstruction either in the worst case or in the average ca…

    Submitted 3 November, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 7 pages, part of this paper was presented at the 58th Annual Conference on Information Sciences and Systems (CISS 2024), funding information added in updated document, an error in the presentation of the main results in the CISS 2024 version of the paper is fixed in the updated document

  3. arXiv:2210.10917  [pdf, other]

    cs.IT cs.DS math.PR math.ST

    Substring Density Estimation from Traces

    Authors: Kayvon Mazooji, Ilan Shomorony

    Abstract: In the trace reconstruction problem, one seeks to reconstruct a binary string $s$ from a collection of traces, each of which is obtained by passing $s$ through a deletion channel. It is known that $\exp(\tilde O(n^{1/5}))$ traces suffice to reconstruct any length-$n$ string with high probability. We consider a variant of the trace reconstruction problem where the goal is to recover a "density map"…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 22 pages, 3 figures

  4. arXiv:2101.12124  [pdf, other]

    cs.IT cs.CR

    Private DNA Sequencing: Hiding Information in Discrete Noise

    Authors: Kayvon Mazooji, Roy Dong, Ilan Shomorony

    Abstract: When an individual's DNA is sequenced, sensitive medical information becomes available to the sequencing laboratory. A recently proposed way to hide an individual's genetic information is to mix in DNA samples of other individuals. We assume that the genetic content of these samples is known to the individual but unknown to the sequencing laboratory. Thus, these DNA samples act as "noise" to the s…

    Submitted 3 November, 2024; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: 22 pages, 7 figures, shorter version appeared in proceedings of the 2020 IEEE Information Theory Workshop (ITW), new results and explanations added in updated arXiv document

  5. arXiv:1611.09073  [pdf, other]

    cs.IT

    On Unique Decoding from Insertions and Deletions

    Authors: Kayvon Mazooji

    Abstract: In this paper, we study how often unique decoding from $t$ insertions or $t$ deletions occurs for error correcting codes. Insertions and deletions frequently occur in synchronization problems and DNA, a medium which is beginning to be used for long term data storage. We define natural probabilistic channels that make $t$ insertions or $t$ deletions, and study the probability of unique decoding.…

    Submitted 28 September, 2017; v1 submitted 28 November, 2016; originally announced November 2016.

    Comments: 12 pages, 4 figures, study of deletion channel added (upper bounds, asymptotics, etc.), tight upper bound on probability of unique decoding for uniform t-insertion channel added (conjecture from previous version proved true), improved upper bounds for VT codes and linear codes added, improved asymptotic analysis