-
Stability properties for subgroups generated by return words
Authors:
France Gheeraert,
Herman Goulet-Ouellet,
Julien Leroy,
Pierre Stas
Abstract:
Return words are a classical tool for studying shift spaces with low factor complexity. In recent years, their projection inside groups have attracted some attention, for instance in the context of dendric shift spaces, of generation of pseudorandom numbers (through the welldoc property), and of profinite invariants of shift spaces. Aiming at unifying disparate works, we introduce a notion of stab…
▽ More
Return words are a classical tool for studying shift spaces with low factor complexity. In recent years, their projection inside groups have attracted some attention, for instance in the context of dendric shift spaces, of generation of pseudorandom numbers (through the welldoc property), and of profinite invariants of shift spaces. Aiming at unifying disparate works, we introduce a notion of stability for subgroups generated by return words. Within this framework, we revisit several existing results and generalize some of them. We also study general aspects of stability, such as decidability or closure under certain operations.
△ Less
Submitted 16 October, 2024;
originally announced October 2024.
-
Algebraic characterization of dendricity
Authors:
France Gheeraert,
Herman Goulet-Ouellet,
Julien Leroy,
Pierre Stas
Abstract:
Dendric shift spaces simultaneously generalize codings of regular interval exchanges and episturmian shift spaces, themselves both generalizations of Sturmian words. One of the key properties enforced by dendricity is the Return Theorem. In this paper, we prove its converse, providing the following natural algebraic perspective on dendricity: A minimal shift space is dendric if and only if every s…
▽ More
Dendric shift spaces simultaneously generalize codings of regular interval exchanges and episturmian shift spaces, themselves both generalizations of Sturmian words. One of the key properties enforced by dendricity is the Return Theorem. In this paper, we prove its converse, providing the following natural algebraic perspective on dendricity: A minimal shift space is dendric if and only if every set of return words is a basis of the free group over the alphabet.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
$\mathcal{S}$-adic characterization of minimal dendric shifts
Authors:
France Gheeraert,
Julien Leroy
Abstract:
Dendric shifts are defined by combinatorial restrictions of the extensions of the words in their languages. This family generalizes well-known families of shifts such as Sturmian shifts, Arnoux-Rauzy shifts and codings of interval exchange transformations. It is known that any minimal dendric shift has a primitive $\mathcal{S}$-adic representation where the morphisms in $\mathcal{S}$ are positive…
▽ More
Dendric shifts are defined by combinatorial restrictions of the extensions of the words in their languages. This family generalizes well-known families of shifts such as Sturmian shifts, Arnoux-Rauzy shifts and codings of interval exchange transformations. It is known that any minimal dendric shift has a primitive $\mathcal{S}$-adic representation where the morphisms in $\mathcal{S}$ are positive tame automorphisms of the free group generated by the alphabet. In this paper we give an $\mathcal{S}$-adic characterization of this family by means of two finite graphs. As an application, we are able to decide whether a shift space generated by a uniformly recurrent morphic word is (eventually) dendric.
△ Less
Submitted 13 February, 2025; v1 submitted 1 June, 2022;
originally announced June 2022.
-
$\mathcal{S}$-adic characterization of minimal ternary dendric shifts
Authors:
France Gheeraert,
Marie Lejeune,
Julien Leroy
Abstract:
Dendric shifts are defined by combinatorial restrictions of the extensions of the words in their languages. This family generalizes well-known families of shifts such as Sturmian shifts, Arnoux-Rauzy shifts and codings of interval exchange transformations. It is known that any minimal dendric shift has a primitive $\mathcal{S}$-adic representation where the morphisms in $\mathcal{S}$ are positive…
▽ More
Dendric shifts are defined by combinatorial restrictions of the extensions of the words in their languages. This family generalizes well-known families of shifts such as Sturmian shifts, Arnoux-Rauzy shifts and codings of interval exchange transformations. It is known that any minimal dendric shift has a primitive $\mathcal{S}$-adic representation where the morphisms in $\mathcal{S}$ are positive tame automorphisms of the free group generated by the alphabet. In this paper we investigate those $\mathcal{S}$-adic representations, heading towards an $\mathcal{S}$-adic characterization of this family. We obtain such a characterization in the ternary case, involving a directed graph with 2 vertices.
△ Less
Submitted 22 June, 2021; v1 submitted 19 February, 2021;
originally announced February 2021.
-
Doc2Vec on the PubMed corpus: study of a new approach to generate related articles
Authors:
Emeric Dynomant,
Stéfan J. Darmoni,
Émeline Lejeune,
Gaëtan Kerdelhué,
Jean-Philippe Leroy,
Vincent Lequertier,
Stéphane Canu,
Julien Grosjean
Abstract:
PubMed is the biggest and most used bibliographic database worldwide, hosting more than 26M biomedical publications. One of its useful features is the "similar articles" section, allowing the end-user to find scientific articles linked to the consulted document in term of context. The aim of this study is to analyze whether it is possible to replace the statistic model PubMed Related Articles (pmr…
▽ More
PubMed is the biggest and most used bibliographic database worldwide, hosting more than 26M biomedical publications. One of its useful features is the "similar articles" section, allowing the end-user to find scientific articles linked to the consulted document in term of context. The aim of this study is to analyze whether it is possible to replace the statistic model PubMed Related Articles (pmra) with a document embedding method. Doc2Vec algorithm was used to train models allowing to vectorize documents. Six of its parameters were optimised by following a grid-search strategy to train more than 1,900 models. Parameters combination leading to the best accuracy was used to train models on abstracts from the PubMed database. Four evaluations tasks were defined to determine what does or does not influence the proximity between documents for both Doc2Vec and pmra. The two different Doc2Vec architectures have different abilities to link documents about a common context. The terminological indexing, words and stems contents of linked documents are highly similar between pmra and Doc2Vec PV-DBOW architecture. These algorithms are also more likely to bring closer documents having a similar size. In contrary, the manual evaluation shows much better results for the pmra algorithm. While the pmra algorithm links documents by explicitly using terminological indexing in its formula, Doc2Vec does not need a prior indexing. It can infer relations between documents sharing a similar indexing, without any knowledge about them, particularly regarding the PV-DBOW architecture. In contrary, the human evaluation, without any clear agreement between evaluators, implies future studies to better understand this difference between PV-DBOW and pmra algorithm.
△ Less
Submitted 26 November, 2019;
originally announced November 2019.
-
Computing the $k$-binomial complexity of the Thue--Morse word
Authors:
Marie Lejeune,
Julien Leroy,
Michel Rigo
Abstract:
Two words are $k$-binomially equivalent whenever they share the same subwords, i.e., subsequences, of length at most $k$ with the same multiplicities. This is a refinement of both abelian equivalence and the Simon congruence. The $k$-binomial complexity of an infinite word $\mathbf{x}$ maps the integer $n$ to the number of classes in the quotient, by this $k$-binomial equivalence relation, of the…
▽ More
Two words are $k$-binomially equivalent whenever they share the same subwords, i.e., subsequences, of length at most $k$ with the same multiplicities. This is a refinement of both abelian equivalence and the Simon congruence. The $k$-binomial complexity of an infinite word $\mathbf{x}$ maps the integer $n$ to the number of classes in the quotient, by this $k$-binomial equivalence relation, of the set of factors of length $n$ occurring in $\mathbf{x}$. This complexity measure has not been investigated very much. In this paper, we characterize the $k$-binomial complexity of the Thue--Morse word. The result is striking, compared to more familiar complexity functions. Although the Thue--Morse word is aperiodic, its $k$-binomial complexity eventually takes only two values. In this paper, we first obtain general results about the number of occurrences of subwords appearing in iterates of the form $Ψ^\ell(w)$ for an arbitrary morphism $Ψ$. We also thoroughly describe the factors of the Thue--Morse word by introducing a relevant new equivalence relation.
△ Less
Submitted 18 December, 2018;
originally announced December 2018.
-
Decidability of the isomorphism and the factorization between minimal substitution subshifts
Authors:
Fabien Durand,
Julien Leroy
Abstract:
Classification is a central problem for dynamical systems, in particular for families that arise in a wide range of topics, like substitution subshifts. It is important to be able to distinguish whether two such subshifts are isomorphic, but the existing invariants are not sufficient for this purpose. We first show that given two minimal substitution subshifts, there exists a computable constant…
▽ More
Classification is a central problem for dynamical systems, in particular for families that arise in a wide range of topics, like substitution subshifts. It is important to be able to distinguish whether two such subshifts are isomorphic, but the existing invariants are not sufficient for this purpose. We first show that given two minimal substitution subshifts, there exists a computable constant $R$ such that any factor map between these subshifts (if any) is the composition of a factor map with a radius smaller than $R$ and some power of the shift map. Then we prove that it is decidable to check whether a given sliding block code is a factor map between two prescribed minimal substitution subshifts. As a consequence of these two results, we provide an algorithm that, given two minimal substitution subshifts, decides whether one is a factor of the other and, as a straightforward corollary, whether they are isomorphic.
△ Less
Submitted 23 August, 2022; v1 submitted 13 June, 2018;
originally announced June 2018.
-
Proceedings of eNTERFACE 2015 Workshop on Intelligent Interfaces
Authors:
Matei Mancas,
Christian Frisson,
Joëlle Tilmanne,
Nicolas d'Alessandro,
Petr Barborka,
Furkan Bayansar,
Francisco Bernard,
Rebecca Fiebrink,
Alexis Heloir,
Edgar Hemery,
Sohaib Laraba,
Alexis Moinet,
Fabrizio Nunnari,
Thierry Ravet,
Loïc Reboursière,
Alvaro Sarasua,
Mickaël Tits,
Noé Tits,
François Zajéga,
Paolo Alborno,
Ksenia Kolykhalova,
Emma Frid,
Damiano Malafronte,
Lisanne Huis in't Veld,
Hüseyin Cakmak
, et al. (49 additional authors not shown)
Abstract:
The 11th Summer Workshop on Multimodal Interfaces eNTERFACE 2015 was hosted by the Numediart Institute of Creative Technologies of the University of Mons from August 10th to September 2015. During the four weeks, students and researchers from all over the world came together in the Numediart Institute of the University of Mons to work on eight selected projects structured around intelligent interf…
▽ More
The 11th Summer Workshop on Multimodal Interfaces eNTERFACE 2015 was hosted by the Numediart Institute of Creative Technologies of the University of Mons from August 10th to September 2015. During the four weeks, students and researchers from all over the world came together in the Numediart Institute of the University of Mons to work on eight selected projects structured around intelligent interfaces. Eight projects were selected and their reports are shown here.
△ Less
Submitted 19 January, 2018;
originally announced January 2018.
-
Counting Subwords Occurrences in Base-b Expansions
Authors:
Julien Leroy,
Michel Rigo,
Manon Stipulanti
Abstract:
We count the number of distinct (scattered) subwords occurring in the base-b expansion of the non-negative integers. More precisely, we consider the sequence $(S_b(n))_{n\ge 0}$ counting the number of positive entries on each row of a generalization of the Pascal triangle to binomial coefficients of base-$b$ expansions. By using a convenient tree structure, we provide recurrence relations for…
▽ More
We count the number of distinct (scattered) subwords occurring in the base-b expansion of the non-negative integers. More precisely, we consider the sequence $(S_b(n))_{n\ge 0}$ counting the number of positive entries on each row of a generalization of the Pascal triangle to binomial coefficients of base-$b$ expansions. By using a convenient tree structure, we provide recurrence relations for $(S_b(n))_{n\ge 0}$ leading to the $b$-regularity of the latter sequence. Then we deduce the asymptotics of the summatory function of the sequence $(S_b(n))_{n\ge 0}$.
△ Less
Submitted 29 May, 2017;
originally announced May 2017.
-
Counting the number of non-zero coefficients in rows of generalized Pascal triangles
Authors:
Julien Leroy,
Michel Rigo,
Manon Stipulanti
Abstract:
This paper is about counting the number of distinct (scattered) subwords occurring in a given word. More precisely, we consider the generalization of the Pascal triangle to binomial coefficients of words and the sequence $(S(n))_{n\ge 0}$ counting the number of positive entries on each row. By introducing a convenient tree structure, we provide a recurrence relation for $(S(n))_{n\ge 0}$. This lea…
▽ More
This paper is about counting the number of distinct (scattered) subwords occurring in a given word. More precisely, we consider the generalization of the Pascal triangle to binomial coefficients of words and the sequence $(S(n))_{n\ge 0}$ counting the number of positive entries on each row. By introducing a convenient tree structure, we provide a recurrence relation for $(S(n))_{n\ge 0}$. This leads to a connection with the $2$-regular Stern-Brocot sequence and the sequence of denominators occurring in the Farey tree. Then we extend our construction to the Zeckendorf numeration system based on the Fibonacci sequence. Again our tree structure permits us to obtain recurrence relations for and the F-regularity of the corresponding sequence.
△ Less
Submitted 23 May, 2017;
originally announced May 2017.
-
Behavior of digital sequences through exotic numeration systems
Authors:
Julien Leroy,
Michel Rigo,
Manon Stipulanti
Abstract:
Many digital functions studied in the literature, e.g., the summatory function of the base-$k$ sum-of-digits function, have a behavior showing some periodic fluctuation. Such functions are usually studied using techniques from analytic number theory or linear algebra. In this paper we develop a method based on exotic numeration systems and we apply it on two examples motivated by the study of gene…
▽ More
Many digital functions studied in the literature, e.g., the summatory function of the base-$k$ sum-of-digits function, have a behavior showing some periodic fluctuation. Such functions are usually studied using techniques from analytic number theory or linear algebra. In this paper we develop a method based on exotic numeration systems and we apply it on two examples motivated by the study of generalized Pascal triangles and binomial coefficients of words.
△ Less
Submitted 23 May, 2017;
originally announced May 2017.
-
Generalized Pascal triangle for binomial coefficients of words
Authors:
Julien Leroy,
Michel Rigo,
Manon Stipulanti
Abstract:
We introduce a generalization of Pascal triangle based on binomial coefficients of finite words. These coefficients count the number of times a word appears as a subsequence of another finite word. Similarly to the Sierpiński gasket that can be built as the limit set, for the Hausdorff distance, of a convergent sequence of normalized compact blocks extracted from Pascal triangle modulo $2$, we des…
▽ More
We introduce a generalization of Pascal triangle based on binomial coefficients of finite words. These coefficients count the number of times a word appears as a subsequence of another finite word. Similarly to the Sierpiński gasket that can be built as the limit set, for the Hausdorff distance, of a convergent sequence of normalized compact blocks extracted from Pascal triangle modulo $2$, we describe and study the first properties of the subset of $[0, 1] \times [0, 1]$ associated with this extended Pascal triangle modulo a prime $p$.
△ Less
Submitted 23 May, 2017;
originally announced May 2017.
-
The constant of recognizability is computable for primitive morphisms
Authors:
Fabien Durand,
Julien Leroy
Abstract:
Mossé proved that primitive morphisms are recognizable. In this paper we give a computable upper bound for the constant of recognizability of such a morphism. This bound can be expressed only using the cardinality of the alphabet and the length of the longest image under the morphism of a letter.
Mossé proved that primitive morphisms are recognizable. In this paper we give a computable upper bound for the constant of recognizability of such a morphism. This bound can be expressed only using the cardinality of the alphabet and the length of the longest image under the morphism of a letter.
△ Less
Submitted 18 October, 2016;
originally announced October 2016.
-
Asymptotic properties of free monoid morphisms
Authors:
Emilie Charlier,
Julien Leroy,
Michel Rigo
Abstract:
Motivated by applications in the theory of numeration systems and recognizable sets of integers, this paper deals with morphic words when erasing morphisms are taken into account. Cobham showed that if an infinite word $w =g(f^ω(a))$ is the image of a fixed point of a morphism $f$ under another morphism $g$, then there exist a non-erasing morphism $σ$ and a coding $τ$ such that $w =τ(σ^ω(b))$.
B…
▽ More
Motivated by applications in the theory of numeration systems and recognizable sets of integers, this paper deals with morphic words when erasing morphisms are taken into account. Cobham showed that if an infinite word $w =g(f^ω(a))$ is the image of a fixed point of a morphism $f$ under another morphism $g$, then there exist a non-erasing morphism $σ$ and a coding $τ$ such that $w =τ(σ^ω(b))$.
Based on the Perron theorem about asymptotic properties of powers of non-negative matrices, our main contribution is an in-depth study of the growth type of iterated morphisms when one replaces erasing morphisms with non-erasing ones. We also explicitly provide an algorithm computing $σ$ and $τ$ from $f$ and $g$.
△ Less
Submitted 15 April, 2016; v1 submitted 1 July, 2015;
originally announced July 2015.
-
Specular sets
Authors:
Valérie Berthé,
Clelia De Felice,
Vincent Delecroix,
Francesco Dolce,
Julien Leroy,
Dominique Perrin,
Christophe reutenauer,
Giuseppina Rindone
Abstract:
We introduce the notion of specular sets which are subsets of groups called here specular and which form a natural generalization of free groups. These sets are an abstract generalization of the natural codings of linear involutions. We prove several results concerning the subgroups generated by return words and by maximal bifix codes in these sets.
We introduce the notion of specular sets which are subsets of groups called here specular and which form a natural generalization of free groups. These sets are an abstract generalization of the natural codings of linear involutions. We prove several results concerning the subgroups generated by return words and by maximal bifix codes in these sets.
△ Less
Submitted 30 May, 2016; v1 submitted 4 May, 2015;
originally announced May 2015.
-
An analogue of Cobham's theorem for graph directed iterated function systems
Authors:
Emilie Charlier,
Julien Leroy,
Michel Rigo
Abstract:
Feng and Wang showed that two homogeneous iterated function systems in $\mathbb{R}$ with multiplicatively independent contraction ratios necessarily have different attractors. In this paper, we extend this result to graph directed iterated function systems in $\mathbb{R}^n$ with contraction ratios that are of the form $\frac{1}β$, for integers $β$. By using a result of Boigelot et al., this allows…
▽ More
Feng and Wang showed that two homogeneous iterated function systems in $\mathbb{R}$ with multiplicatively independent contraction ratios necessarily have different attractors. In this paper, we extend this result to graph directed iterated function systems in $\mathbb{R}^n$ with contraction ratios that are of the form $\frac{1}β$, for integers $β$. By using a result of Boigelot et al., this allows us to give a proof of a conjecture of Adamczewski and Bell. In doing so, we link the graph directed iterated function systems to Büchi automata. In particular, this link extends to real numbers $β$. We introduce a logical formalism that permits to characterize sets of $\mathbb{R}^n$ whose representations in base $β$ are recognized by some Büchi automata. This result depends on the algebraic properties of the base: $β$ being a Pisot or a Parry number. The main motivation of this work is to draw a general picture representing the different frameworks where an analogue of Cobham's theorem is known.
△ Less
Submitted 25 November, 2013; v1 submitted 1 October, 2013;
originally announced October 2013.
-
Acyclic, connected and tree sets
Authors:
Valerie Berthé,
Clelia De Felice,
Francesco Dolce,
Julien Leroy,
Dominique Perrin,
Christophe Reutenauer,
Giuseppina Rindone
Abstract:
Given a set $F$ of words, one associates to each word $w$ in $F$ an undirected graph, called its extension graph, and which describes the possible extensions of $w$ on the left and on the right. We investigate the family of sets of words defined by the property of the extension graph of each word in the set to be acyclic or connected or a tree. We prove that in a uniformly recurrent tree set, the…
▽ More
Given a set $F$ of words, one associates to each word $w$ in $F$ an undirected graph, called its extension graph, and which describes the possible extensions of $w$ on the left and on the right. We investigate the family of sets of words defined by the property of the extension graph of each word in the set to be acyclic or connected or a tree. We prove that in a uniformly recurrent tree set, the sets of first return words are bases of the free group on the alphabet. Concerning acyclic sets, we prove as a main result that a set $F$ is acyclic if and only if any bifix code included in $F$ is a basis of the subgroup that it generates.
△ Less
Submitted 21 February, 2015; v1 submitted 20 August, 2013;
originally announced August 2013.
-
An $S$-adic characterization of minimal subshifts with first difference of complexity $1 \leq p(n+1) - p(n) \leq 2$
Authors:
Julien Leroy
Abstract:
In [Ergodic Theory Dynam. System, 16 (1996) 663--682], S. Ferenczi proved that any minimal subshift with first difference of complexity bounded by 2 is $S$-adic with $\card S \leq 3^{27}$. In this paper, we improve this result by giving an $S$-adic charaterization of these subshifts with a set $S$ of 5 morphisms, solving by this way the $S$-adic conjecture for this particular case.
In [Ergodic Theory Dynam. System, 16 (1996) 663--682], S. Ferenczi proved that any minimal subshift with first difference of complexity bounded by 2 is $S$-adic with $\card S \leq 3^{27}$. In this paper, we improve this result by giving an $S$-adic charaterization of these subshifts with a set $S$ of 5 morphisms, solving by this way the $S$-adic conjecture for this particular case.
△ Less
Submitted 2 May, 2013;
originally announced May 2013.
-
The finite index basis property
Authors:
Valérie Berthé,
Clelia De Felice,
Francesco Dolce,
Julien Leroy,
Dominique Perrin,
Christophe Reutenauer,
Giuseppina Rindone
Abstract:
We describe in this paper a connection between bifix codes, symbolic dynamical systems and free groups. This is in the spirit of the connection established previously for the symbolic systems corresponding to Sturmian words. We introduce a class of sets of factors of an infinite word with linear factor complexity containing Sturmian sets and regular interval exchange sets, namemly the class of tre…
▽ More
We describe in this paper a connection between bifix codes, symbolic dynamical systems and free groups. This is in the spirit of the connection established previously for the symbolic systems corresponding to Sturmian words. We introduce a class of sets of factors of an infinite word with linear factor complexity containing Sturmian sets and regular interval exchange sets, namemly the class of tree sets. We prove as a main result that for a uniformly recurrent tree set $F$, a finite bifix code $X$ on the alphabet $A$ is $F$-maximal of $F$-degree $d$ if and only if it is the basis of a subgroup of index $d$ of the free group on $A$.
△ Less
Submitted 20 February, 2015; v1 submitted 1 May, 2013;
originally announced May 2013.
-
Towards a statement of the S-adic conjecture through examples
Authors:
Fabien Durand,
Julien Leroy,
Gwénaël Richomme
Abstract:
The $S$-adic conjecture claims that there exists a condition $C$ such that a sequence has a sub-linear complexity if and only if it is an $S$-adic sequence satisfying Condition $C$ for some finite set $S$ of morphisms. We present an overview of the factor complexity of $S$-adic sequences and we give some examples that either illustrate some interesting properties or that are counter-examples to wh…
▽ More
The $S$-adic conjecture claims that there exists a condition $C$ such that a sequence has a sub-linear complexity if and only if it is an $S$-adic sequence satisfying Condition $C$ for some finite set $S$ of morphisms. We present an overview of the factor complexity of $S$-adic sequences and we give some examples that either illustrate some interesting properties or that are counter-examples to what could be believed to be "a good Condition $C$".
△ Less
Submitted 31 August, 2012;
originally announced August 2012.