Skip to main content

Showing 1–11 of 11 results for author: Kolpakov, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2208.04380  [pdf, ps, other

    cs.DS cs.FL

    Almost optimal searching of maximal subrepetitions in a word

    Authors: Roman Kolpakov

    Abstract: For $0<δ<1$ a $δ$-subrepetition in a word is a factor which exponent is less than~2 but is not less than $1+δ$ (the exponent of the factor is the ratio of the factor length to its minimal period). The $δ$-subrepetition is maximal if it cannot be extended to the left or to the right by at least one letter with preserving its minimal period. In the paper we propose an algorithm for searching all max… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

  2. arXiv:1701.01190  [pdf, ps, other

    cs.FL

    On the number of gapped repeats with arbitrary gap

    Authors: Roman Kolpakov

    Abstract: For any functions $f(x)$, $g(x)$ from $\mathbb {N}$ to $\mathbb {R}$ we call repeats $uvu$ such that $g(|u|)\le |v|\le f(|u|)$ as {\it $f,g$-gapped repeats}. We study the possible number of $f,g$-gapped repeats in words of fixed length~$n$. For quite weak conditions on $f(x)$, $g(x)$ we obtain an upper bound on this number which is linear to~$n$.

    Submitted 4 January, 2017; originally announced January 2017.

    Comments: 17 pages, 2 figures. arXiv admin note: text overlap with arXiv:1309.4055, arXiv:1509.01221

  3. Indexing and querying color sets of images

    Authors: Djamal Belazzougui, Roman Kolpakov, Mathieu Raffinot

    Abstract: We aim to study the set of color sets of continuous regions of an image given as a matrix of $m$ rows over $n\geq m$ columns where each element in the matrix is an integer from $[1,σ]$ named a {\em color}. The set of distinct colors in a region is called fingerprint. We aim to compute, index and query the fingerprints of all rectangular regions named rectangles. The set of all such fingerprints… ▽ More

    Submitted 28 August, 2016; originally announced August 2016.

    Comments: 20 pages, 5 figures

  4. arXiv:1509.01221  [pdf, ps, other

    cs.FL

    Optimal searching of gapped repeats in a word

    Authors: Maxime Crochemore, Roman Kolpakov, Gregory Kucherov

    Abstract: Following (Kolpakov et al., 2013; Gawrychowski and Manea, 2015), we continue the study of {\em $α$-gapped repeats} in strings, defined as factors $uvu$ with $|uv|\leq α|u|$. Our main result is the $O(αn)$ bound on the number of {\em maximal} $α$-gapped repeats in a string of length $n$, previously proved to be $O(α^2 n)$ in (Kolpakov et al., 2013). For a closely related notion of maximal $δ$-subre… ▽ More

    Submitted 2 October, 2015; v1 submitted 3 September, 2015; originally announced September 2015.

    Comments: 27 pages. arXiv admin note: text overlap with arXiv:1309.4055

  5. arXiv:1506.06284  [pdf, ps, other

    cs.DS cs.CC

    Upper bound on the number of steps for solving the subset sum problem by the Branch-and-Bound method

    Authors: Roman Kolpakov, Mikhail Posypkin

    Abstract: We study the computational complexity of one of the particular cases of the knapsack problem: the subset sum problem. For solving this problem we consider one of the basic variants of the Branch-and-Bound method in which any sub-problem is decomposed along the free variable with the maximal weight. By the complexity of solving a problem by the Branch-and-Bound method we mean the number of steps re… ▽ More

    Submitted 20 June, 2015; originally announced June 2015.

  6. arXiv:1309.4055  [pdf, ps, other

    cs.FL

    Searching of gapped repeats and subrepetitions in a word

    Authors: Roman Kolpakov, Mikhail Podolskiy, Mikhail Posypkin, Nickolay Khrapov

    Abstract: A gapped repeat is a factor of the form $uvu$ where $u$ and $v$ are nonempty words. The period of the gapped repeat is defined as $|u|+|v|$. The gapped repeat is maximal if it cannot be extended to the left or to the right by at least one letter with preserving its period. The gapped repeat is called $α$-gapped if its period is not greater than $α|v|$. A $δ$-subrepetition is a factor which exponen… ▽ More

    Submitted 29 September, 2013; v1 submitted 16 September, 2013; originally announced September 2013.

  7. arXiv:1301.3488  [pdf, other

    cs.DS cs.DM cs.IR

    Various improvements to text fingerprinting

    Authors: Djamal Belazzougui, Roman Kolpakov, Mathieu Raffinot

    Abstract: Let s = s_1 .. s_n be a text (or sequence) on a finite alphabet Σof size σ. A fingerprint in s is the set of distinct characters appearing in one of its substrings. The problem considered here is to compute the set {\cal F} of all fingerprints of all substrings of s in order to answer efficiently certain questions on this set. A substring s_i .. s_j is a maximal location for a fingerprint f in F (… ▽ More

    Submitted 15 January, 2013; originally announced January 2013.

  8. arXiv:1105.3116  [pdf, ps, other

    cs.FL

    On the number of Dejean words over alphabets of 5, 6, 7, 8, 9 and 10 letters

    Authors: Roman Kolpakov, Michael Rao

    Abstract: We give lower bounds on the growth rate of Dejean words, i.e. minimally repetitive words, over a k-letter alphabet, for k=5, 6, 7, 8, 9, 10. Put together with the known upper bounds, we estimate these growth rates with the precision of 0,005. As an consequence, we establish the exponential growth of the number of Dejean words over a k-letter alphabet, for k=5, 6, 7, 8, 9, 10.

    Submitted 16 May, 2011; originally announced May 2011.

    Comments: 13 pages

  9. arXiv:1103.5230  [pdf, ps, other

    cs.FL

    On primary and secondary repetitions in words

    Authors: Roman Kolpakov

    Abstract: Combinatorial properties of maximal repetitions (runs) in formal words are studied. We classify all maximal repetitions in a word as primary and secondary where the set of all primary repetitions determines all the other repetitons in the word. Essential combinatorial properties of primary repetitions are established.

    Submitted 27 March, 2011; originally announced March 2011.

    Comments: 14 pages

  10. Linear pattern matching on sparse suffix trees

    Authors: Roman Kolpakov, Gregory Kucherov, Tatiana Starikovskaya

    Abstract: Packing several characters into one computer word is a simple and natural way to compress the representation of a string and to speed up its processing. Exploiting this idea, we propose an index for a packed string, based on a {\em sparse suffix tree} \cite{KU-96} with appropriately defined suffix links. Assuming, under the standard unit-cost RAM model, that a word can store up to $\log_σn$ charac… ▽ More

    Submitted 14 March, 2011; originally announced March 2011.

  11. arXiv:0906.4750  [pdf, ps, other

    cs.DM

    On maximal repetitions of arbitrary exponent

    Authors: Roman Kolpakov, Gregory Kucherov, Pascal Ochem

    Abstract: The first two authors have shown [KK99,KK00] that the sum the exponent (and thus the number) of maximal repetitions of exponent at least 2 (also called runs) is linear in the length of the word. The exponent 2 in the definition of a run may seem arbitrary. In this paper, we consider maximal repetitions of exponent strictly greater than 1.

    Submitted 25 June, 2009; originally announced June 2009.

    Comments: 8 pages, 1 figure

    ACM Class: G.2.1