-
Word-Representable Graphs and Locality of Words
Authors:
Philipp Böll,
Pamela Fleischmann,
Annika Huch,
Jana Kreiß,
Tim Löck,
Kajus Park,
Max Wiedenhöft
Abstract:
In this work, we investigate the relationship between $k$-repre\-sentable graphs and graphs representable by $k$-local words. In particular, we show that every graph representable by a $k$-local word is $(k+1)$-representable. A previous result about graphs represented by $1$-local words is revisited with new insights. Moreover, we investigate both classes of graphs w.r.t. hereditary and in particu…
▽ More
In this work, we investigate the relationship between $k$-repre\-sentable graphs and graphs representable by $k$-local words. In particular, we show that every graph representable by a $k$-local word is $(k+1)$-representable. A previous result about graphs represented by $1$-local words is revisited with new insights. Moreover, we investigate both classes of graphs w.r.t. hereditary and in particular the speed as a measure. We prove that the latter ones belong to the factorial layer and that the graphs in this classes have bounded clique-width.
△ Less
Submitted 24 June, 2025;
originally announced June 2025.
-
Jumbled Scattered Factors
Authors:
Pamela Fleischmann,
Annika Huch,
Melf Kammholz,
Tore Koß
Abstract:
In this work, we combine the research on (absent) scattered factors with the one of jumbled words. For instance, $\mathtt{wolf}$ is an absent scattered factor of $\mathtt{cauliflower}$ but since $\mathtt{lfow}$, a jumbled (or abelian) version of $\mathtt{wolf}$, is a scattered factor, $\mathtt{wolf}$ occurs as a jumbled scattered factor in $\mathtt{cauliflower}$. A \emph{jumbled scattered factor}…
▽ More
In this work, we combine the research on (absent) scattered factors with the one of jumbled words. For instance, $\mathtt{wolf}$ is an absent scattered factor of $\mathtt{cauliflower}$ but since $\mathtt{lfow}$, a jumbled (or abelian) version of $\mathtt{wolf}$, is a scattered factor, $\mathtt{wolf}$ occurs as a jumbled scattered factor in $\mathtt{cauliflower}$. A \emph{jumbled scattered factor} $u$ of a word $w$ is constructed by letters of $w$ with the only rule that the number of occurrences per letter in $u$ is smaller than or equal to the one in $w$. We proceed to partition and characterise the set of jumbled scattered factors by the number of jumbled letters and use the latter as a measure. For this new class of words, we relate the folklore longest common subsequence (scattered factor) to the number of required jumbles. Further, we investigate the smallest possible number of jumbles alongside the jumbled scattered factor relation as well as Simon's congruence from the point of view of jumbled scattered factors and jumbled universality.
△ Less
Submitted 4 June, 2025;
originally announced June 2025.
-
$k$-local Graphs
Authors:
Christian Beth,
Pamela Fleischmann,
Annika Huch,
Daniyal Kazempour,
Peer Kröger,
Andrea Kulow,
Matthias Renz
Abstract:
In 2017 Day et al. introduced the notion of locality as a structural complexity-measure for patterns in the field of pattern matching established by Angluin in 1980. In 2019 Casel et al. showed that determining the locality of an arbitrary pattern is NP-complete. Inspired by hierarchical clustering, we extend the notion to coloured graphs, i.e., given a coloured graph determine an enumeration of t…
▽ More
In 2017 Day et al. introduced the notion of locality as a structural complexity-measure for patterns in the field of pattern matching established by Angluin in 1980. In 2019 Casel et al. showed that determining the locality of an arbitrary pattern is NP-complete. Inspired by hierarchical clustering, we extend the notion to coloured graphs, i.e., given a coloured graph determine an enumeration of the colours such that colouring the graph stepwise according to the enumeration leads to as few clusters as possible. Next to first theoretical results on graph classes, we propose a priority search algorithm to compute the $k$-locality of a graph. The algorithm is optimal in the number of marking prefix expansions, and is faster by orders of magnitude than an exhaustive search. Finally, we perform a case study on a DBLP subgraph to demonstrate the potential of $k$-locality for knowledge discovery.
△ Less
Submitted 8 May, 2025; v1 submitted 1 October, 2024;
originally announced October 2024.
-
Rollercoasters with Plateaus
Authors:
Duncan Adamson,
Pamela Fleischmann,
Annika Huch
Abstract:
In this paper we investigate the problem of detecting, counting, and enumerating (generating) all maximum length plateau-$k$-rollercoasters appearing as a subsequence of some given word (sequence, string), while allowing for plateaus. We define a plateau-$k$-rollercoaster as a word consisting of an alternating sequence of (weakly) increasing and decreasing \emph{runs}, with each run containing at…
▽ More
In this paper we investigate the problem of detecting, counting, and enumerating (generating) all maximum length plateau-$k$-rollercoasters appearing as a subsequence of some given word (sequence, string), while allowing for plateaus. We define a plateau-$k$-rollercoaster as a word consisting of an alternating sequence of (weakly) increasing and decreasing \emph{runs}, with each run containing at least $k$ \emph{distinct} elements, allowing the run to contain multiple copies of the same symbol consecutively. This differs from previous work, where runs within rollercoasters have been defined only as sequences of distinct values. Here, we are concerned with rollercoasters of \emph{maximum} length embedded in a given word $w$, that is, the longest rollercoasters that are a subsequence of $w$.
We present algorithms allowing us to determine the longest plateau-$k$-roller\-coasters appearing as a subsequence in any given word $w$ of length $n$ over an alphabet of size $σ$ in $O(n σk)$ time, to count the number of plateau-$k$-rollercoasters in $w$ of maximum length in $O(n σk)$ time, and to output all of them with $O(n)$ delay after $O(n σk)$ preprocessing. Furthermore, we present an algorithm to determine the longest common plateau-$k$-rollercoaster within a set of words in $O(N k σ)$ where $N$ is the product of all word lengths within the set.
△ Less
Submitted 26 July, 2024;
originally announced July 2024.
-
Tight Bounds for the Number of Absent Scattered Factors
Authors:
Duncan Adamson,
Pamela Fleischmann,
Annika Huch,
Max Wiedenhöft
Abstract:
A scattered factor of a word $w$ is a word $u$ that can be obtained by deleting arbitary letters from $w$ and keep the order of the remaining. Barker et al. introduced the notion of $k$-universality, calling a word $k$-universal, if it contains all possible words of length $k$ over a given alphabet $Σ$ as a scattered factor. Kosche et al. introduced the notion of absent scattered factors to catego…
▽ More
A scattered factor of a word $w$ is a word $u$ that can be obtained by deleting arbitary letters from $w$ and keep the order of the remaining. Barker et al. introduced the notion of $k$-universality, calling a word $k$-universal, if it contains all possible words of length $k$ over a given alphabet $Σ$ as a scattered factor. Kosche et al. introduced the notion of absent scattered factors to categorise the words not being scattered factors of a given word.
In this paper, we investigate tight bounds on the possible number of absent scattered factors of a given length $k$ (also strictly longer than the shortest absent scattered factors) among all words with the same universality extending the results of Kosche et al. Specifically, given a length $k$ and universality index $ι$, we characterize $ι$-universal words with both the maximal and minimal number of absent scattered factors of length $k$. For the lower bound, we provide the exact number in a closed form. For the upper bound, we offer efficient algorithms to compute the number based on the constructed words. Moreover, by combining old results, we present an enumeration with constant delay of the set of scattered factors of a fixed length in time $O(|Σ||w|)$.
△ Less
Submitted 26 July, 2024;
originally announced July 2024.
-
$α$-$β$-Factorization and the Binary Case of Simon's Congruence
Authors:
Pamela Fleischmann,
Jonas Höfer,
Annika Huch,
Dirk Nowotka
Abstract:
In 1991 Hébrard introduced a factorization of words that turned out to be a powerful tool for the investigation of a word's scattered factors (also known as (scattered) subwords or subsequences). Based on this, first Karandikar and Schnoebelen introduced the notion of $k$-richness and later on Barker et al. the notion of $k$-universality. In 2022 Fleischmann et al. presented a generalization of th…
▽ More
In 1991 Hébrard introduced a factorization of words that turned out to be a powerful tool for the investigation of a word's scattered factors (also known as (scattered) subwords or subsequences). Based on this, first Karandikar and Schnoebelen introduced the notion of $k$-richness and later on Barker et al. the notion of $k$-universality. In 2022 Fleischmann et al. presented a generalization of the arch factorization by intersecting the arch factorization of a word and its reverse. While the authors merely used this factorization for the investigation of shortest absent scattered factors, in this work we investigate this new $α$-$β$-factorization as such. We characterize the famous Simon congruence of $k$-universal words in terms of $1$-universal words. Moreover, we apply these results to binary words. In this special case, we obtain a full characterization of the classes and calculate the index of the congruence. Lastly, we start investigating the ternary case, present a full list of possibilities for $αβα$-factors, and characterize their congruence.
△ Less
Submitted 11 September, 2023; v1 submitted 25 June, 2023;
originally announced June 2023.
-
m-Nearly k-Universal Words -- Investigating Simon Congruence
Authors:
Pamela Fleischmann,
Lukas Haschke,
Annika Huch,
Annika Mayrock,
Dirk Nowotka
Abstract:
Determining the index of the Simon congruence is a long outstanding open problem. Two words $u$ and $v$ are called Simon congruent if they have the same set of scattered factors, which are parts of the word in the correct order but not necessarily consecutive, e.g., $\mathtt{oath}$ is a scattered factor of $\mathtt{logarithm}$. Following the idea of scattered factor $k$-universality, we investigat…
▽ More
Determining the index of the Simon congruence is a long outstanding open problem. Two words $u$ and $v$ are called Simon congruent if they have the same set of scattered factors, which are parts of the word in the correct order but not necessarily consecutive, e.g., $\mathtt{oath}$ is a scattered factor of $\mathtt{logarithm}$. Following the idea of scattered factor $k$-universality, we investigate $m$-nearly $k$-universality, i.e., words where $m$ scattered factors of length $k$ are absent, w.r.t. Simon congruence. We present a full characterisation as well as the index of the congruence for $m=1$. For $m\neq 1$, we show some results if in addition $w$ is $(k-1)$-universal as well as some further insights for different $m$.
△ Less
Submitted 16 February, 2022;
originally announced February 2022.