-
A random recursive tree model with doubling events
Authors:
Jakob E. Björnberg,
Cécile Mailler
Abstract:
We introduce a new model of random tree that grows like a random recursive tree, except at some exceptional "doubling events" when the tree is replaced by two copies of itself attached to a new root. We prove asymptotic results for the size of this tree at large times, its degree distribution, and its height profile. We also prove a lower bound for its height. Because of the doubling events that affect the tree globally, the proofs are all much more intricate than in the case of the random recursive tree in which the growing operation is always local.
Submitted 30 January, 2025;
originally announced January 2025.
-
A localisation phase transition for the catalytic branching random walk
Authors:
Cécile Mailler,
Bruno Schapira
Abstract:
We show the existence of a phase transition between a localisation and a non-localisation regime for a branching random walk with a catalyst at the origin. More precisely, we consider a continuous-time branching random walk that jumps at rate one, with simple random walk jumps on $\mathbb Z^d$, and that branches (with binary branching) at rate $λ>0$ everywhere, except at the origin, where it branches at rate $λ_0>λ$. We show that, if $λ_0$ is large enough, then the occupation measure of the branching random walk localises (i.e. converges almost surely without spatial renormalisation), whereas, if $λ_0$ is close enough to $λ$, then localisation cannot occur, at least not in a strong sense. The case $λ= 0$ (when branching only occurs at the origin) has been extensively studied in the literature and a transition between localisation and non-localisation was also exhibited in this case. Strikingly, the transition that we observe, conjecture, and partially prove in this paper occurs at the same threshold as in the case $λ=0$.
Submitted 16 December, 2024;
originally announced December 2024.
-
Central limit theorems for the monkey walk with steep memory kernel
Authors:
Erion-Stelios Boci,
Cécile Mailler
Abstract:
The monkey walk is a stochastic process defined as the trajectory of a walker that moves on $\mathbb R^d$ according to a Markovian generator, except at some random "relocation" times at which it jumps back to its position at a time sampled randomly in its past, according to some "memory kernel". The relocations make the process non-Markovian and introduce a reinforcement effect (the walker is more likely to relocate in a Borel set in which it has spent a lot of time in the past). In this paper, we focus on "steep" memory kernels: in these cases, the time sampled in the past at each relocation time is likely to be quite recent. One can see this as a way to model the case when the walker quickly "forgets" its past. We prove limit theorems for the position of the walker at large times, which confirm and generalise the estimates available in the physics literature.
Submitted 4 September, 2024;
originally announced September 2024.
-
A two-table theorem for a disordered Chinese restaurant process
Authors:
Jakob E. Björnberg,
Cécile Mailler,
Peter Mörters,
Daniel Ueltschi
Abstract:
We investigate a disordered variant of Pitman's Chinese restaurant process where tables carry i.i.d. weights. Incoming customers choose to sit at an occupied table with a probability proportional to the product of its occupancy and its weight, or they sit at an unoccupied table with a probability proportional to a parameter $θ>0$. This is a system out of equilibrium where the proportion of customers at any given table converges to zero almost surely. We show that for weight distributions in any of the three extreme value classes, Weibull, Gumbel or Fréchet, the proportion of customers sitting at the largest table converges to one in probability, but not almost surely, and the proportion of customers sitting at either of the largest two tables converges to one almost surely.
Submitted 3 May, 2024; v1 submitted 22 March, 2023;
originally announced March 2023.
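The seating rule described in this abstract can be sketched in a few lines of Python. This is only an illustrative simulation, not the authors' code: the function name `disordered_crp`, the `weight_sampler` argument, and the choice of uniform weights on (0, 1] in the usage note are assumptions made for the example.

```python
import random

def disordered_crp(n_customers, theta, weight_sampler, rng):
    """Seat customers one by one; return (occupancies, weights) per table.

    An occupied table is chosen with probability proportional to
    occupancy * weight; a new table is opened with probability
    proportional to theta. `weight_sampler` is a hypothetical helper
    that draws one i.i.d. table weight.
    """
    occupancy = []
    weights = []
    for _ in range(n_customers):
        scores = [o * w for o, w in zip(occupancy, weights)]
        u = rng.random() * (sum(scores) + theta)
        chosen = None
        acc = 0.0
        for i, s in enumerate(scores):
            acc += s
            if u < acc:
                chosen = i
                break
        if chosen is None:
            # The draw fell in the "theta" part: open a new table.
            occupancy.append(1)
            weights.append(weight_sampler(rng))
        else:
            occupancy[chosen] += 1
    return occupancy, weights
```

For instance, `disordered_crp(500, 1.0, lambda r: 1.0 - r.random(), random.Random(0))` seats 500 customers with weights drawn uniformly from (0, 1]; comparing `max(occupancy)` with the total gives a rough empirical view of the share held by the largest table.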
-
Scaling limit of critical random trees in random environment
Authors:
Guillaume Conchon-Kerjan,
Daniel Kious,
Cécile Mailler
Abstract:
We consider Bienaymé-Galton-Watson trees in random environment, where each generation $k$ is attributed a random offspring distribution $μ_k$, and $(μ_k)_{k\geq 0}$ is a sequence of independent and identically distributed random probability measures. We work in the "strictly critical" regime where, for all $k$, the average of $μ_k$ is assumed to be equal to $1$ almost surely, and the variance of $μ_k$ has finite expectation. We prove that, for almost all realizations of the environment (more precisely, under some deterministic conditions that the random environment satisfies almost surely), the scaling limit of the tree in that environment, conditioned to be large, is the Brownian continuum random tree. The habitual techniques used for standard Bienaymé-Galton-Watson trees, or trees with exchangeable vertices, do not apply to this case. Our proof therefore provides alternative tools.
Submitted 26 January, 2023; v1 submitted 22 September, 2022;
originally announced September 2022.
-
Fluctuations of balanced urns with infinitely many colours
Authors:
Svante Janson,
Cécile Mailler,
Denis Villemonais
Abstract:
In this paper, we prove convergence and fluctuation results for measure-valued Pólya processes (MVPPs, also known as Pólya urns with infinitely-many colours). Our convergence results hold almost surely and in $L^2$, under assumptions that are different from those of other convergence results in the literature. Our fluctuation results are the first second-order results in the literature on MVPPs; they generalise classical fluctuation results from the literature on finitely-many-colour Pólya urns. As in the finitely-many-colour case, the order and shape of the fluctuations depend on whether the "spectral gap is small or large".
To prove these results, we show that MVPPs are stochastic approximations taking values in the set of measures on a measurable space $E$ (the colour space). We then use martingale methods and standard operator theory to prove convergence and fluctuation results for these stochastic approximations.
Submitted 26 November, 2021;
originally announced November 2021.
-
The trace-reinforced ants process does not find shortest paths
Authors:
Daniel Kious,
Cécile Mailler,
Bruno Schapira
Abstract:
In this paper, we study a probabilistic reinforcement-learning model for ants searching for the shortest path(s) between their nest and a source of food. In this model, the nest and the source of food are two distinguished nodes $N$ and $F$ in a finite graph $\mathcal G$. The ants perform a sequence of random walks on this graph, starting from the nest and stopped when first hitting the source of food. At each step of its random walk, the $n$-th ant chooses to cross a neighbouring edge with probability proportional to the number of preceding ants that crossed that edge at least once. We say that {\it the ants find the shortest path} if, almost surely as the number of ants grows to infinity, almost all the ants go from the nest to the source of food through one of the shortest paths, without losing time on other edges of the graph.
Our contribution is three-fold: (1) We prove that, if $\mathcal G$ is a tree rooted at $N$ whose leaves have been merged into node $F$, and with one edge between $N$ and $F$, then the ants indeed find the shortest path. (2) In contrast, we provide three examples of graphs on which the ants do not find the shortest path, suggesting that in this model and in most graphs, ants do not find the shortest path. (3) In all these cases, we show that the sequence of normalised edge-weights converges to a {\it deterministic} limit, despite a linear-reinforcement mechanism, and we conjecture that this is a general fact which is valid on all finite graphs. To prove these results, we use stochastic approximation methods, and in particular the ODE method. One difficulty comes from the fact that this method relies on understanding the behaviour at large times of the solution of a non-linear, multi-dimensional ODE.
Submitted 2 October, 2023; v1 submitted 19 June, 2021;
originally announced June 2021.
-
Parametrised branching processes: a functional version of Kesten & Stigum theorem
Authors:
Cécile Mailler,
Jean-François Marckert
Abstract:
Let $(Z_n,n\geq 0)$ be a supercritical Galton-Watson process whose offspring distribution $μ$ has mean $λ>1$ and is such that $\int x(\log(x))_+ dμ(x)<+\infty$. According to the famous Kesten & Stigum theorem, $(Z_n/λ^n)$ converges almost surely, as $n\to+\infty$. The limiting random variable has mean 1, and its distribution is characterised as the solution of a fixed point equation.
In this paper, we consider a family of Galton-Watson processes $(Z_n(λ), n\geq 0)$ defined for $λ$ ranging in an interval $I\subset (1, \infty)$, and where we interpret $λ$ as the time (when $n$ is the generation). The number of children of an individual at time $λ$ is given by $X(λ)$, where $(X(λ))_{λ\in I}$ is a càdlàg integer-valued process which is assumed to be almost surely non-decreasing and such that $\mathbb E(X(λ))=λ>1$ for all $λ\in I$. This allows us to define $Z_n(λ)$ the number of elements in the $n$th generation at time $λ$.
Set $W_n(λ)= Z_n(λ)/λ^n$ for all $n\geq 0$ and $λ\in I$. We prove that, under some moment conditions on the process $X$, the sequence of processes $(W_n(λ), λ\in I)_{n\geq 0}$ converges in probability as $n$ tends to infinity in the space of càdlàg processes equipped with the Skorokhod topology to a process, which we characterise as the solution of a fixed point equation.
Submitted 2 June, 2021;
originally announced June 2021.
-
Large deviations principle for a stochastic process with random reinforced relocations
Authors:
Erion-Stelios Boci,
Cécile Mailler
Abstract:
Stochastic processes with random reinforced relocations have been introduced in the physics literature to model animal foraging behaviour. Such a process evolves as a Markov process, except at random relocation times, when it chooses a time at random in its whole past according to some "memory kernel", and jumps to its value at that random time.
We prove a quenched large deviations principle for the value of the process at large times. The difficulty in proving this result comes from the fact that the process is not Markov because of the relocations. Furthermore, the random inter-relocation times act as a random environment.
Submitted 11 July, 2023; v1 submitted 6 May, 2021;
originally announced May 2021.
-
Voronoi cells in random split trees
Authors:
Alexander Drewitz,
Markus Heydenreich,
Cécile Mailler
Abstract:
We study the sizes of the Voronoi cells of $k$ uniformly chosen vertices in a random split tree of size $n$. We prove that, for $n$ large, the largest of these $k$ Voronoi cells contains most of the vertices, while the sizes of the remaining ones are essentially all of order $n\exp(-\mathrm{const}\sqrt{\log n})$. This discrepancy persists if we modify the definition of the Voronoi cells by (a) introducing random edge lengths (with suitable moment assumptions), and (b) assigning different "influence" parameters (called "speeds" in the paper) to each of the $k$ vertices. Our findings are in contrast to corresponding results on random uniform trees and on the continuum random tree, where it is known that the vector of the relative sizes of the $k$ Voronoi cells is asymptotically uniformly distributed on the $(k-1)$-dimensional simplex.
Submitted 17 March, 2021;
originally announced March 2021.
-
Finding geodesics on graphs using reinforcement learning
Authors:
Daniel Kious,
Cécile Mailler,
Bruno Schapira
Abstract:
It is well-known in biology that ants are able to find shortest paths between their nest and the food by successive random explorations, without any means of communication other than the pheromones they leave behind them. This striking phenomenon has been observed experimentally and modelled by different mean-field reinforcement-learning models in the biology literature.
In this paper, we introduce the first probabilistic reinforcement-learning model for this phenomenon. In this model, the ants explore a finite graph in which two nodes are distinguished as the nest and the source of food. The ants perform successive random walks on this graph, starting from the nest and stopped when first reaching the food, and the transition probabilities of each random walk depend on the realizations of all previous walks through some dynamic weighting of the graph. We discuss different variants of this model based on different reinforcement rules and show that slight changes in this reinforcement rule can lead to drastically different outcomes.
We prove that, in two variants of this model and when the underlying graph is, respectively, any series-parallel graph and a 5-edge non-series-parallel lozenge graph, the ants indeed eventually find the shortest path(s) between their nest and the food. Both proofs rely on the electrical network method for random walks on weighted graphs and on Rubin's embedding in continuous time. The proof in the series-parallel cases uses the recursive nature of this family of graphs, while the proof in the seemingly-simpler lozenge case turns out to be quite intricate: it relies on a fine analysis of some stochastic approximation, and on various couplings with standard and generalised Pólya urns.
Submitted 9 October, 2020;
originally announced October 2020.
-
Dynamical Models for Random Simplicial Complexes
Authors:
Nikolaos Fountoulakis,
Tejas Iyer,
Cécile Mailler,
Henning Sulzbach
Abstract:
We study a general model of random dynamical simplicial complexes and derive a formula for the asymptotic degree distribution. This asymptotic formula encompasses results for a number of existing models, including random Apollonian networks and the weighted random recursive tree. It also confirms results on the scale-free nature of Complex Quantum Network Manifolds in dimensions $d > 2$, and special types of Network Geometry with Flavour models studied in the physics literature by Bianconi, Rahmede [$\mathit{Sci. Rep.} \; \mathbf{5},\text{ 13979 (2015) and }\mathit{Phys. Rev. E} \; \mathbf{93},\text{ 032315 (2016)}$].
Submitted 21 March, 2022; v1 submitted 28 October, 2019;
originally announced October 2019.
-
Competing growth processes with random growth rates and random birth times
Authors:
Cécile Mailler,
Peter Mörters,
Anna Senkevich
Abstract:
Finding the most powerful node in a dynamic random network, the largest set in a partition-valued stochastic process, or the largest family in an evolving population at a given time, can be a very difficult problem. This is particularly the case when the underlying stochastic process has complex dependencies and the individual strength of an object has an impact that only plays out over time. We propose a novel technique to deal with such problems and show how it can be applied to a broad range of examples where it produces new insight and surprising results. The method relies on two steps: In the first step, which is highly problem dependent, the problem is embedded into continuous time so that the evolution of the sizes of objects after their individual birth times become approximately independent while we only need minimal control over the birth times themselves. Once such an embedding is achieved, the second step is to apply a Poisson limit theorem that allows a comparison of object sizes in a critical window and therefore allows a description of features of extremal objects. In this paper we prove such a versatile limit theorem, based on extreme value theory, and show how the technique can be used to study extremal behaviour in different types of preferential attachment networks with fitness, branching processes with selection and mutation, and random permutations with random cycle weights.
Submitted 7 September, 2020; v1 submitted 17 September, 2019;
originally announced September 2019.
-
Characterising random partitions by random colouring
Authors:
Jakob E. Björnberg,
Cécile Mailler,
Peter Mörters,
Daniel Ueltschi
Abstract:
Let $(X_1,X_2,...)$ be a random partition of the unit interval $[0,1]$, i.e. $X_i\geq0$ and $\sum_{i\geq1} X_i=1$, and let $(\varepsilon_1,\varepsilon_2,...)$ be i.i.d. Bernoulli random variables of parameter $p \in (0,1)$. The Bernoulli convolution of the partition is the random variable $Z =\sum_{i\geq1} \varepsilon_i X_i$. The question addressed in this article is: Knowing the distribution of $Z$ for some fixed $p\in(0,1)$, what can we infer about the random partition? We consider random partitions formed by residual allocation and prove that their distributions are fully characterised by their Bernoulli convolution if and only if the parameter $p$ is not equal to $1/2$.
Submitted 20 November, 2019; v1 submitted 12 July, 2019;
originally announced July 2019.
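To make the definition $Z=\sum_{i\geq1}\varepsilon_i X_i$ concrete, here is a minimal Python sketch that samples a Bernoulli convolution of a partition built by residual allocation (stick-breaking). Several ingredients are assumptions made for illustration only: the break fractions are taken uniform on (0, 1), the partition is truncated to finitely many pieces, and the function names are hypothetical.

```python
import random

def residual_allocation(n_pieces, rng):
    """First n_pieces blocks of a stick-breaking partition of [0, 1].

    Assumption: each break fraction is Uniform(0, 1); the article
    treats general residual allocation models.
    """
    remaining = 1.0
    pieces = []
    for _ in range(n_pieces):
        v = rng.random()
        pieces.append(remaining * v)   # X_i = V_i * prod_{j<i} (1 - V_j)
        remaining *= 1.0 - v
    return pieces

def bernoulli_convolution(pieces, p, rng):
    """Sum the pieces that are independently 'coloured' with probability p."""
    return sum(x for x in pieces if rng.random() < p)
```

Drawing many independent samples of `bernoulli_convolution(residual_allocation(50, rng), p, rng)` gives an empirical approximation of the law of $Z$ for a given $p$, up to the truncation error of the leftover stick.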
-
Random walks with preferential relocations and fading memory: a study through random recursive trees
Authors:
Cécile Mailler,
Gerónimo Uribe Bravo
Abstract:
Consider a stochastic process that behaves as a $d$-dimensional simple and symmetric random walk, except that, with a certain fixed probability, at each step, it chooses instead to jump to a given site with probability proportional to the time it has already spent there. This process has been analyzed in the physics literature under the name "random walk with preferential relocations", where it is argued that the position of the walker after $n$ steps, scaled by $\log n$, converges to a Gaussian random variable; because of the $\log$ spatial scaling, the process is said to undergo a "slow diffusion".
In this paper, we generalize this model by allowing the underlying random walk to be any Markov process and the random run-lengths (time between two relocations) to be i.i.d.-distributed. We also allow the memory of the walker to fade with time, meaning that when a relocation occurs, the walker is more likely to go back to a place it has visited more recently.
We prove rigorously the central limit theorem described above (plus a local limit theorem and the convergence of the weighted occupation measure) by associating to the process a growing family of vertex-weighted random recursive trees and a Markov chain indexed by this tree. The spatial scaling of our relocated random walk is related to the height of a "typical" vertex in the random tree. This typical height can range from doubly-logarithmic to logarithmic or even a power of the number of nodes of the tree, depending on the form of the memory.
Submitted 9 January, 2019; v1 submitted 5 October, 2018;
originally announced October 2018.
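The original "random walk with preferential relocations" described at the start of this abstract is easy to simulate. The sketch below is a hedged illustration in dimension one (the abstract allows any dimension $d$), with no fading memory; the function name `relocating_walk` and the parameter name `q` for the relocation probability are assumptions. Jumping to a site with probability proportional to the time spent there is implemented by picking a uniformly random past time and returning to the position held at that time.

```python
import random

def relocating_walk(n_steps, q, rng):
    """Simulate a 1-d simple symmetric walk with preferential relocations.

    With probability q the walker relocates to its position at a
    uniformly chosen past time; otherwise it takes a +/-1 step.
    Returns the full trajectory, starting at 0.
    """
    path = [0]
    for _ in range(n_steps):
        if rng.random() < q:
            # Uniform past time <=> site chosen proportionally to time spent.
            path.append(rng.choice(path))
        else:
            path.append(path[-1] + rng.choice((-1, 1)))
    return path
```

Running this for increasing `n_steps` and recording `path[-1]` over many independent runs gives a rough empirical picture of the slow spreading of the walker discussed in the abstract.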
-
Stochastic approximation on non-compact measure spaces and application to measure-valued Pólya processes
Authors:
Cécile Mailler,
Denis Villemonais
Abstract:
Our main result is to prove almost-sure convergence of a stochastic-approximation algorithm defined on the space of measures on a non-compact space. Our motivation is to apply this result to measure-valued Pólya processes (MVPPs, also known as infinitely-many Pólya urns). Our main idea is to use Foster-Lyapunov type criteria in a novel way to generalize stochastic-approximation methods to measure-valued Markov processes with a non-compact underlying space, overcoming in a fairly general context one of the major difficulties of existing studies on this subject.
From the MVPPs point of view, our result implies almost-sure convergence of a large class of MVPPs; until now, this convergence had only been obtained for specific examples, with only convergence in probability established for general classes. Furthermore, our approach allows us to extend the definition of MVPPs by adding "weights" to the different colors of the infinitely-many-color urn. We also exhibit a link between non-"balanced" MVPPs and quasi-stationary distributions of Markovian processes, which allows us to treat, for the first time in the literature, the non-balanced case.
Finally, we show how our result can be applied to designing stochastic-approximation algorithms for the approximation of quasi-stationary distributions of discrete- and continuous-time Markov processes on non-compact spaces.
Submitted 20 January, 2020; v1 submitted 5 September, 2018;
originally announced September 2018.
-
Multiple drawing multi-colour urns by stochastic approximation
Authors:
Nabil Lasmar,
Cécile Mailler,
Olfa Selmi
Abstract:
A classical Pólya urn scheme is a Markov process whose evolution is encoded by a replacement matrix $(R_{i,j})_{1\leq i,j\leq d}$. At every discrete time-step, we draw a ball uniformly at random, denote its colour $c$, and replace it in the urn together with $R_{c,j}$ balls of colour $j$ (for all $1\leq j\leq d$).
We are interested in multi-drawing Pólya urns, where the replacement rule depends on the random drawing of a set of $m$ balls from the urn (with or without replacement). This generalisation has already been studied in the literature, in particular by Kuba & Mahmoud (ArXiv:1503.09069 and 1509.09053), where second order asymptotic results are proved for $2$-colour urns under the balanced and the affinity assumptions.
The main idea of this work is to apply stochastic approximation methods to this problem, which enables us to remove the affinity hypothesis of Kuba & Mahmoud and generalise the result to more-than-two-colour urns. We also give some partial results in the two-colour non-balanced case.
Submitted 17 June, 2021; v1 submitted 28 November, 2016;
originally announced November 2016.
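The classical (single-drawing) Pólya urn scheme recalled in the first paragraph of this abstract can be sketched as follows. This is an illustrative simulation only, with a hypothetical function name; it covers the one-ball drawing rule, not the multi-drawing generalisation studied in the paper, and it assumes non-negative entries in the replacement matrix so that ball counts stay valid sampling weights.

```python
import random

def polya_urn(initial, R, n_steps, rng):
    """Run a d-colour Pólya urn with replacement matrix R.

    `initial[j]` is the starting number of balls of colour j. At each
    step a ball is drawn uniformly at random (so colour c is drawn with
    probability proportional to its count), put back, and R[c][j] balls
    of colour j are added for every j.
    """
    urn = list(initial)
    d = len(urn)
    for _ in range(n_steps):
        c = rng.choices(range(d), weights=urn)[0]
        for j in range(d):
            urn[j] += R[c][j]
    return urn
```

With the identity matrix `R = [[1, 0], [0, 1]]` this reduces to the original two-colour Pólya urn, in which the drawn colour is reinforced by one ball at each step.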
-
A bijective study of Basketball walks
Authors:
Jérémie Bettinelli,
Éric Fusy,
Cécile Mailler,
Lucas Randazzo
Abstract:
The Catalan numbers count many classes of combinatorial objects. The most emblematic such objects are probably the Dyck walks and the binary trees, and, whenever another class of combinatorial objects is counted by the Catalan numbers, it is natural to search for an explicit bijection between the latter objects and one of the former objects. In most cases, such a bijection happens to be relatively simple but it might sometimes be more intricate.
In this work, we focus on so-called \emph{basketball walks}, which are integer-valued walks with step-set $\{-2,-1,+1,+2\}$. The presence of $-2$ as an allowed step makes it impossible to use the classical Łukasiewicz encoding of trees by integer-valued walks, and thus a different strategy is needed. We give an explicit bijection that maps, for each $n\ge 2$, $n$-step basketball walks from $0$ to $0$ that visit $1$ and are positive except at their extremities to $n$-leaf binary trees. Moreover, we can partition the steps of a walk into $\pm 1$-steps, odd $+2$-steps or even $-2$-steps, and odd $-2$-steps or even $+2$-steps, and these three types of steps are mapped through our bijection to double leaves, left leaves, and right leaves of the corresponding tree.
We also prove that basketball walks from $0$ to $1$ that are positive except at the origin are in bijection with increasing unary-binary trees with associated permutation avoiding $213$. We furthermore give the refined generating function of these objects with an extra variable accounting for the unary nodes.
Submitted 19 January, 2017; v1 submitted 4 November, 2016;
originally announced November 2016.
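The counting statement behind the first bijection can be checked numerically for small $n$. The sketch below counts, by dynamic programming, the $n$-step walks with steps in $\{-2,-1,+1,+2\}$ that go from $0$ to $0$, stay positive strictly between their endpoints, and visit $1$; it then compares the count with the number of $n$-leaf binary trees, which is the Catalan number $C_{n-1}$. The function names are hypothetical, and the code is a verification aid, not the bijection itself.

```python
from math import comb

def count_basketball_excursions(n):
    """Count n-step basketball walks 0 -> 0, positive in between, visiting 1."""
    # State: (current height, whether height 1 has been visited) -> count.
    states = {(0, False): 1}
    for step in range(1, n + 1):
        nxt = {}
        for (h, seen), ways in states.items():
            for s in (-2, -1, 1, 2):
                nh = h + s
                if step < n and nh <= 0:
                    continue  # must stay positive before the last step
                if step == n and nh != 0:
                    continue  # must end exactly at 0
                key = (nh, seen or nh == 1)
                nxt[key] = nxt.get(key, 0) + ways
        states = nxt
    return states.get((0, True), 0)

def catalan(k):
    """k-th Catalan number; C_{n-1} counts binary trees with n leaves."""
    return comb(2 * k, k) // (k + 1)
```

For $n = 2, 3, 4$ the count is $1, 2, 5$, matching $C_1, C_2, C_3$; for example, the two walks for $n=3$ have steps $(+1,+1,-2)$ and $(+2,-1,-1)$.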
-
Measure-valued Pólya processes
Authors:
Cécile Mailler,
Jean-François Marckert
Abstract:
A Pólya urn process is a Markov chain that models the evolution of an urn containing some coloured balls, the set of possible colours being $\{1,\ldots,d\}$ for $d\in \mathbb{N}$. At each time step, a random ball is chosen uniformly in the urn. It is replaced in the urn and, if its colour is $c$, $R_{c,j}$ balls of colour $j$ are also added (for all $1\leq j\leq d$). We introduce a model of measure-valued processes that generalises this construction. This generalisation includes the case when the space of colours is a (possibly infinite) Polish space $\mathcal P$.
We see the urn composition at any time step $n$ as a measure ${\mathcal M}_n$ -- possibly non-atomic -- on $\mathcal P$. In this generalisation, we choose a random colour $c$ according to the probability distribution proportional to ${\mathcal M}_n$, and add a measure ${\mathcal R}_c$ in the urn, where the quantity ${\mathcal R}_c(B)$ of a Borel set $B$ models the added weight of "balls" with colour in $B$.
We study the asymptotic behaviour of these measure-valued Pólya urn processes, and give some conditions on the replacement measures $({\mathcal R}_c, c\in \mathcal P)$ for the sequence of measures $({\mathcal M}_n, n\geq 0)$ to converge in distribution, possibly after rescaling. For certain models, related to branching random walks, $({\mathcal M}_n, n\geq 0)$ is shown to converge almost surely under some moment hypothesis; a particular case of this last result gives the almost sure convergence of the (renormalised) profile of the random recursive tree to a standard Gaussian.
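The classical $d$-colour urn described at the start of this abstract can be simulated in a few lines. The sketch below covers only the finite-colour case, not the measure-valued generalisation; the function name, the `seed` parameter and the example replacement matrix are assumptions made for illustration.

```python
import random


def simulate_polya_urn(initial, replacement, steps, seed=0):
    """Simulate a d-colour Polya urn.

    initial[j]        -- initial number of balls of colour j
    replacement[c][j] -- balls of colour j added when colour c is drawn (R_{c,j})
    """
    rng = random.Random(seed)
    urn = list(initial)
    for _ in range(steps):
        # Draw a ball uniformly at random: colour c has probability urn[c] / total.
        colour = rng.choices(range(len(urn)), weights=urn)[0]
        # The drawn ball is put back, and R_{c,j} balls of each colour j are added.
        for j, added in enumerate(replacement[colour]):
            urn[j] += added
    return urn


if __name__ == "__main__":
    # Original two-colour Polya urn: add one ball of the drawn colour.
    print(simulate_polya_urn([1, 1], [[1, 0], [0, 1]], steps=1000))
```

In the measure-valued setting, the list `urn` would be replaced by a measure on the colour space and each row of `replacement` by a replacement measure ${\mathcal R}_c$.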
Submitted 10 March, 2017; v1 submitted 27 October, 2016;
originally announced October 2016.
-
Non-extensive condensation in reinforced branching processes
Authors:
Steffen Dereich,
Cécile Mailler,
Peter Mörters
Abstract:
We study a class of branching processes in which a population consists of immortal individuals equipped with a fitness value. Individuals produce offspring with a rate given by their fitness, and offspring may either belong to the same family, sharing the fitness of their parent, or be founders of new families, with a fitness sampled from a fitness distribution. Examples that can be embedded in this class are stochastic house-of-cards models, urn models with reinforcement, and the preferential attachment tree of Bianconi and Barabási. Our focus is on the case when the fitness distribution has bounded support and regularly varying tail at the essential supremum. In this case there exists a condensation phase, in which asymptotically a proportion of mass in the empirical fitness distribution of the overall population condenses in the maximal fitness value. Our main results describe the asymptotic behaviour of the size and fitness of the largest family at a given time. In particular, we show that as time goes to infinity the size of the largest family is always negligible compared to the overall population size. This implies that condensation, when it arises, is non-extensive and emerges as a collective effort of several families none of which can create a condensate on its own. Our result disproves claims made in the physics literature in the context of preferential attachment trees.
Submitted 30 January, 2017; v1 submitted 29 January, 2016;
originally announced January 2016.
-
And/or trees: A local limit point of view
Authors:
Nicolas Broutin,
Cécile Mailler
Abstract:
We present here a new and universal approach for the study of random and/or trees, unifying in one framework many different models, including some novel ones not yet understood in the literature. An and/or tree is a Boolean expression represented in (one of) its tree shapes. Fix an integer $k$, take a sequence of random (rooted) trees of increasing size, say $(t_n)_{n\ge 1}$, and label each of these random trees uniformly at random in order to get a random Boolean expression on $k$ variables.
We prove that, under rather weak local conditions on the sequence of random trees $(t_n)_{n\ge 1}$, the distribution induced on Boolean functions by this procedure converges as $n$ tends to infinity. In particular, we characterise two different behaviours of this limit distribution depending on the shape of the local limit of $(t_n)_{n\ge 1}$: a degenerate case when the local limit has no leaves; and a non-degenerate case, which we are able to describe in more detail under stronger conditions. In this latter case, we provide a relationship between the probability of a given Boolean function and its complexity.
The examples covered by this unified framework include trees that interpolate between models with logarithmic typical distances (such as random binary search trees) and other ones with square root typical distances (such as conditioned Galton--Watson trees).
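The labelling procedure described in this abstract (take a tree shape, label it at random, read off a Boolean function on $k$ variables) can be illustrated as follows. This is a sketch under stated assumptions: the abstract does not specify the labelling rule, so the code assumes that each internal node is labelled "and" or "or" with equal probability and that each leaf receives a uniformly chosen literal among the $k$ variables and their negations. The nested-tuple encoding and all function names are hypothetical.

```python
import itertools
import random


def random_labelling(shape, k, rng):
    """Label a tree shape at random (assumed rule, see lead-in).

    A shape is a nested tuple: () is a leaf, a non-empty tuple is an
    internal node whose entries are its children.
    """
    if not shape:
        # Leaf: uniform literal, i.e. a variable index and a negation flag.
        return ("lit", rng.randrange(k), rng.random() < 0.5)
    children = tuple(random_labelling(child, k, rng) for child in shape)
    return (rng.choice(("and", "or")), children)


def evaluate(expr, assignment):
    """Evaluate a labelled and/or tree on a tuple of k Boolean values."""
    if expr[0] == "lit":
        _, index, negated = expr
        return assignment[index] != negated
    op, children = expr
    values = [evaluate(child, assignment) for child in children]
    return all(values) if op == "and" else any(values)


def truth_table(expr, k):
    """Boolean function computed by expr, as a tuple of 2**k values."""
    return tuple(
        evaluate(expr, assignment)
        for assignment in itertools.product((False, True), repeat=k)
    )


if __name__ == "__main__":
    rng = random.Random(0)
    shape = (((), ()), ())  # a small binary tree shape with three leaves
    expr = random_labelling(shape, k=2, rng=rng)
    print(expr, truth_table(expr, 2))
```

Repeating the labelling many times on the same shape and tallying the resulting truth tables gives an empirical version of the distribution on Boolean functions that the abstract refers to.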
Submitted 8 June, 2017; v1 submitted 22 October, 2015;
originally announced October 2015.
-
Condensation and symmetry-breaking in the zero-range process with weak site disorder
Authors:
Cécile Mailler,
Peter Mörters,
Daniel Ueltschi
Abstract:
Condensation phenomena in particle systems typically occur as one of two distinct types: either as a spontaneous symmetry breaking in a homogeneous system, in which particle interactions enforce condensation in a randomly located site, or as an explicit symmetry breaking in a system with background disorder, in which particles condense in the site of extremal disorder. In this paper we confirm a recent conjecture by Godrèche and Luck by showing, for a zero range process with weak site disorder, that there exists a phase where condensation occurs with an intermediate type of symmetry-breaking, in which particles condense in a site randomly chosen from a range of sites favoured by disorder. We show that this type of condensation is characterised by the occurrence of a Gamma distribution in the law of the disorder at the condensation site. We further investigate fluctuations of the condensate size and confirm a phase diagram, again conjectured by Godrèche and Luck, showing the existence of phases with normal and anomalous fluctuations.
Submitted 25 September, 2015;
originally announced September 2015.
-
Generalised and Quotient Models for Random And/Or Trees and Application to Satisfiability
Authors:
Antoine Genitrini,
Cécile Mailler
Abstract:
This article is motivated by the following satisfiability question: pick uniformly at random an and/or Boolean expression of length n, built on a set of k_n Boolean variables. What is the probability that this expression is satisfiable, asymptotically as n tends to infinity?
The model of random Boolean expressions developed in the present paper is the model of Boolean Catalan trees, already extensively studied in the literature for a constant sequence (k_n)_{n\geq 1}. The fundamental breakthrough of this paper is to generalise the previous results to any (reasonable) sequence of integers (k_n)_{n\geq 1}, which enables us, in particular, to solve the above satisfiability question.
We also analyse the effect of introducing a natural equivalence relation on the set of Boolean expressions. This new "quotient" model happens to exhibit a very interesting threshold (or saturation) phenomenon at k_n = n/ln n.
Submitted 30 July, 2015;
originally announced July 2015.
-
Describing the asymptotic behaviour of multicolour Pólya urns via smoothing systems analysis
Authors:
Cécile Mailler
Abstract:
The present paper aims at describing in detail the asymptotic composition of a class of d-colour Pólya urns: namely balanced, tenable and irreducible urns. We decompose the composition vector of such urns according to the Jordan decomposition of their replacement matrix. The projections of the composition vector onto the so-called small Jordan spaces are known to be asymptotically Gaussian, but the asymptotic behaviour of the projections onto the large Jordan spaces is not yet known in full detail; it is described by a limiting random variable called W, which depends on the parameters of the urn.
Submitted 21 December, 2017; v1 submitted 10 July, 2014;
originally announced July 2014.
-
The relation between tree size complexity and probability for Boolean functions generated by uniform random trees
Authors:
Antoine Genitrini,
Bernhard Gittenberger,
Veronika Kraus,
Cécile Mailler
Abstract:
We consider a probability distribution on the set of Boolean functions in n variables which is induced by random Boolean expressions. Such an expression is a random rooted plane tree where the internal vertices are labelled with connectives And and Or and the leaves are labelled with variables or negated variables. We study the limiting distribution as the tree size tends to infinity and derive a relation between the tree size complexity and the probability of a function. This is done by first expressing trees representing a particular function as expansions of minimal trees representing this function and then computing the probabilities by means of combinatorial counting arguments relying on generating functions and singularity analysis.
Submitted 25 September, 2015; v1 submitted 2 July, 2014;
originally announced July 2014.
-
Associative and commutative tree representations for Boolean functions
Authors:
Antoine Genitrini,
Bernhard Gittenberger,
Veronika Kraus,
Cécile Mailler
Abstract:
Since the 90's, several authors have studied a probability distribution on the set of Boolean functions on $n$ variables induced by some probability distributions on formulas built upon the connectors $And$ and $Or$ and the literals $\{x_{1}, \bar{x}_{1}, \dots, x_{n}, \bar{x}_{n}\}$. These formulas rely on plane binary labelled trees, known as Catalan trees. We extend all the results, in particular the relation between the probability and the complexity of a Boolean function, to other models of formulas: non-binary or non-plane labelled trees (i.e. Pólya trees). This includes the natural tree class where associativity and commutativity of the connectors $And$ and $Or$ are realised.
Submitted 3 May, 2013;
originally announced May 2013.
-
Catalan satisfiability problem
Authors:
Antoine Genitrini,
Cécile Mailler
Abstract:
An and/or tree is usually a binary plane tree, with internal nodes labelled by logical connectives, and with leaves labelled by literals chosen in a fixed set of k variables and their negations. In the present paper, we introduce the first model of such Catalan trees, whose number of variables k_n is a function of n, the size of the expressions. We describe the whole range of the probability distributions depending on the function k_n, as soon as it tends jointly with n to infinity. As a by-product we obtain a study of the satisfiability problem in the context of Catalan trees.
Our study is mainly based on analytic combinatorics and extends Kozik's pattern theory, first developed for the fixed-k Catalan tree model.
Submitted 12 September, 2013; v1 submitted 20 April, 2013;
originally announced April 2013.
-
Smoothing equations for large Pólya urns
Authors:
Brigitte Chauvin,
Cécile Mailler,
Nicolas Pouyanne
Abstract:
Consider a balanced non-triangular two-color Pólya-Eggenberger urn process, assumed to be large, which means that the ratio sigma of the replacement matrix eigenvalues satisfies 1/2<sigma<1. The composition vector of both the discrete-time and continuous-time models admits a drift which is carried by the principal direction of the replacement matrix. In the second principal direction, this random vector also admits almost sure asymptotics, and a real-valued limit random variable arises, named WDT in discrete time and WCT in continuous time. The paper deals with the distributions of both W. Appearing as martingale limits, known to be non-normal, these laws remain up to now rather mysterious.
Exploiting the underlying tree structure of the urn process, we show that WDT and WCT are the unique solutions of two distributional systems in some suitable spaces of integrable probability measures. These systems are natural extensions of distributional equations that already appeared in famous algorithmic problems like Quicksort analysis. Existence and uniqueness of the solutions of the systems are obtained by means of contracting smoothing transforms. Via the equation systems, we find upper bounds for the moments of WDT and WCT and we show that the laws of WDT and WCT are moment-determined. We also prove that WDT is supported by the whole real line and admits a continuous density (WCT was already known to have a density, infinitely differentiable on R\{0} and not bounded at the origin).
Submitted 30 May, 2013; v1 submitted 6 February, 2013;
originally announced February 2013.