-
Fully graphic degree sequences and P-stable degree sequences
Authors:
Péter L. Erdős,
István Miklós,
Lajos Soukup
Abstract:
The notion of $P$-stability of an infinite set of degree sequences plays influential role in approximating the permanents, rapidly sampling the realizations of graphic degree sequences, or even studying and improving network privacy. While there exist several known sufficient conditions for $P$-stability, we don't know any useful necessary condition for it. We also do not have good insight of poss…
▽ More
The notion of $P$-stability of an infinite set of degree sequences plays influential role in approximating the permanents, rapidly sampling the realizations of graphic degree sequences, or even studying and improving network privacy. While there exist several known sufficient conditions for $P$-stability, we don't know any useful necessary condition for it. We also do not have good insight of possible structure of $P$-stable degree sequence families.
At first we will show that every known infinite $P$-stable degree sequence set, described by inequalities of the parameters $n, c_1, c_2, Σ$ (the sequence length, the maximum and minimum degrees and the sum of the degrees) is ,,fully graphic" meaning that every degree sequence from the region with an even degree sum, is graphic. Furthermore, if $Σ$ does not occur in the determining inequality, then the notions of $P$-stability and full graphicality will be proved equivalent. In turns, this equality provides a strengthening of the well-known theorem of Jerrum, McKay and Sinclair about $P$-stability, describing the maximal $P$-stable sequence set by $n, c_1, c_2$. Furthermore we conjecture that similar equivalences occur in cases if $Σ$ also part of the defining inequality.
△ Less
Submitted 21 August, 2024; v1 submitted 20 May, 2024;
originally announced May 2024.
-
Dense, irregular, yet always graphic $3$-uniform hypergraph degree sequences
Authors:
Runze Li,
Istvan Miklos
Abstract:
A $3$-uniform hypergraph is a generalization of simple graphs where each hyperedge is a subset of vertices of size $3$. The degree of a vertex in a hypergraph is the number of hyperedges incident with it. The degree sequence of a hypergraph is the sequence of the degrees of its vertices. The degree sequence problem for $3$-uniform hypergraphs is to decide if a $3$-uniform hypergraph exists with a…
▽ More
A $3$-uniform hypergraph is a generalization of simple graphs where each hyperedge is a subset of vertices of size $3$. The degree of a vertex in a hypergraph is the number of hyperedges incident with it. The degree sequence of a hypergraph is the sequence of the degrees of its vertices. The degree sequence problem for $3$-uniform hypergraphs is to decide if a $3$-uniform hypergraph exists with a prescribed degree sequence. Such a hypergraph is called a realization. Recently, Deza \emph{et al.} proved that the degree sequence problem for $3$-uniform hypergraphs is NP-complete. Some special cases are easy; however, polynomial algorithms have been known so far only for some very restricted degree sequences. The main result of our research is the following. If all degrees are between $\frac{2n^2}{63}+O(n)$ and $\frac{5n^2}{63}-O(n)$ in a degree sequence $D$, further, the number of vertices is at least $45$, and the degree sum can be divided by $3$, then $D$ has a $3$-uniform hypergraph realization. Our proof is constructive and in fact, it constructs a hypergraph realization in polynomial time for any degree sequence satisfying the properties mentioned above. To our knowledge, this is the first polynomial running time algorithm to construct a $3$-uniform hypergraph realization of a highly irregular and dense degree sequence.
△ Less
Submitted 1 December, 2023;
originally announced December 2023.
-
Constructing and sampling partite, $3$-uniform hypergraphs with given degree sequence
Authors:
Andras Hubai,
Tamas Robert Mezei,
Ferenc Beres,
Andras Benczur,
Istvan Miklos
Abstract:
Partite, $3$-uniform hypergraphs are $3$-uniform hypergraphs in which each hyperedge contains exactly one point from each of the $3$ disjoint vertex classes. We consider the degree sequence problem of partite, $3$-uniform hypergraphs, that is, to decide if such a hypergraph with prescribed degree sequences exists. We prove that this decision problem is NP-complete in general, and give a polynomial…
▽ More
Partite, $3$-uniform hypergraphs are $3$-uniform hypergraphs in which each hyperedge contains exactly one point from each of the $3$ disjoint vertex classes. We consider the degree sequence problem of partite, $3$-uniform hypergraphs, that is, to decide if such a hypergraph with prescribed degree sequences exists. We prove that this decision problem is NP-complete in general, and give a polynomial running time algorithm for third almost-regular degree sequences, that is, when each degree in one of the vertex classes is $k$ or $k-1$ for some fixed $k$, and there is no restriction for the other two vertex classes. We also consider the sampling problem, that is, to uniformly sample partite, $3$-uniform hypergraphs with prescribed degree sequences. We propose a Parallel Tempering method, where the hypothetical energy of the hypergraphs measures the deviation from the prescribed degree sequence. The method has been implemented and tested on synthetic and real data. It can also be applied for $χ^2$ testing of contingency tables. We have shown that this hypergraph-based $χ^2$ test is more sensitive than the standard $χ^2$ test. The extra sensitivity is especially advantageous on small data sets, where the proposed Parallel Tempering method shows promising performance.
△ Less
Submitted 25 August, 2023;
originally announced August 2023.
-
Approximate Sampling of Graphs with Near-$P$-stable Degree Intervals
Authors:
Péter L. Erdős,
Tamás Róbert Mezei,
István Miklós
Abstract:
The approximate uniform sampling of graph realizations with a given degree sequence is an everyday task in several social science, computer science, engineering etc. projects. One approach is using Markov chains. The best available current result about the well-studied switch Markov chain is that it is rapidly mixing on P-stable degree sequences (see DOI:10.1016/j.ejc.2021.103421). The switch Mark…
▽ More
The approximate uniform sampling of graph realizations with a given degree sequence is an everyday task in several social science, computer science, engineering etc. projects. One approach is using Markov chains. The best available current result about the well-studied switch Markov chain is that it is rapidly mixing on P-stable degree sequences (see DOI:10.1016/j.ejc.2021.103421). The switch Markov chain does not change any degree sequence. However, there are cases where degree intervals are specified rather than a single degree sequence. (A natural scenario where this problem arises is in hypothesis testing on social networks that are only partially observed.) Rechner, Strowick, and Müller-Hannemann introduced in 2018 the notion of degree interval Markov chain which uses three (separately well-studied) local operations (switch, hinge-flip and toggle), and employing on degree sequence realizations where any two sequences under scrutiny have very small coordinate-wise distance. Recently Amanatidis and Kleer published a beautiful paper (arXiv:2110.09068), showing that the degree interval Markov chain is rapidly mixing if the sequences are coming from a system of very thin intervals which are centered not far from a regular degree sequence. In this paper we extend substantially their result, showing that the degree interval Markov chain is rapidly mixing if the intervals are centred at P-stable degree sequences.
△ Less
Submitted 24 April, 2022; v1 submitted 20 April, 2022;
originally announced April 2022.
-
Constructing bounded degree graphs with prescribed degree and neighbor degree sequences
Authors:
Uroš Čibej,
Aaron Li,
István Miklós,
Sohaib Nasir,
Varun Srikanth
Abstract:
Let $D = d_1, d_2, \ldots, d_n$ and $F = f_1, f_2,\ldots, f_n$ be two sequences of positive integers. We consider the following decision problems: is there a $i)$ multigraph, $ii)$ loopless multigraph, $iii)$ simple graph, $iv)$ connected simple graph, $v)$ tree, $vi)$ caterpillar $G = (V,E)$ such that for all $k$, $d(v_k) = d_k$ and $\sum_{w\in \mathcal{N}(v_k)} d(w) = f_k$ ($d(v)$ is the degree…
▽ More
Let $D = d_1, d_2, \ldots, d_n$ and $F = f_1, f_2,\ldots, f_n$ be two sequences of positive integers. We consider the following decision problems: is there a $i)$ multigraph, $ii)$ loopless multigraph, $iii)$ simple graph, $iv)$ connected simple graph, $v)$ tree, $vi)$ caterpillar $G = (V,E)$ such that for all $k$, $d(v_k) = d_k$ and $\sum_{w\in \mathcal{N}(v_k)} d(w) = f_k$ ($d(v)$ is the degree of $v$ and $\mathcal{N}(v)$ is the set of neighbors of $v$). Here we show that all these decision problems can be solved in polynomial time if $\max_{k} d_k$ is bounded.
The problem is motivated by NMR spectroscopy of hydrocarbons.
△ Less
Submitted 27 September, 2021;
originally announced September 2021.
-
A Markov chain on the solution space of edge-colorings of bipartite graphs
Authors:
Letong Hong,
Istvan Miklos
Abstract:
In this paper, we exhibit an irreducible Markov chain $M$ on the edge $k$-colorings of bipartite graphs based on certain properties of the solution space. We show that diameter of this Markov chain grows linearly with the number of edges in the graph. We also prove a polynomial upper bound on the inverse of acceptance ratio of the Metropolis-Hastings algorithm when the algorithm is applied on $M$…
▽ More
In this paper, we exhibit an irreducible Markov chain $M$ on the edge $k$-colorings of bipartite graphs based on certain properties of the solution space. We show that diameter of this Markov chain grows linearly with the number of edges in the graph. We also prove a polynomial upper bound on the inverse of acceptance ratio of the Metropolis-Hastings algorithm when the algorithm is applied on $M$ with the uniform distribution of all possible edge $k$-colorings of $G$. A special case of our results is the solution space of the possible completions of Latin rectangles.
△ Less
Submitted 22 March, 2021;
originally announced March 2021.
-
#P-hardness proofs of matrix immanants evaluated on restricted matrices
Authors:
Istvan Miklos,
Cordian Riener
Abstract:
\#P-hardness of computing matrix immanants are proved for each member of a broad class of shapes and restricted sets of matrices. We prove \#P-hardness of computing $λ$-immanants of $0$-$1$ matrices when $λ$ has a large domino-tilable part and satisfying some technical conditions. We also give hardness proofs of some $λ$-immanants of weighted adjacency matrices of planarly drawable directed graphs…
▽ More
\#P-hardness of computing matrix immanants are proved for each member of a broad class of shapes and restricted sets of matrices. We prove \#P-hardness of computing $λ$-immanants of $0$-$1$ matrices when $λ$ has a large domino-tilable part and satisfying some technical conditions. We also give hardness proofs of some $λ$-immanants of weighted adjacency matrices of planarly drawable directed graphs, such that the shape $λ= (\mathbf{1}+λ_d)$ has size $n$ such that $|λ_d| = n^{\varepsilon}$ for some $0<\varepsilon<\frac{1}{2}$, and for some $w$, the shape $λ_d/(w)$ is tilable with $1\times 2$ dominos.
△ Less
Submitted 12 April, 2023; v1 submitted 8 March, 2021;
originally announced March 2021.
-
Counting Maximum Matchings in Planar Graphs Is Hard
Authors:
Istvan Miklos,
Miklos Kresz
Abstract:
Here we prove that counting maximum matchings in planar, bipartite graphs is #P-complete. This is somewhat surprising in the light that the number of perfect matchings in planar graphs can be computed in polynomial time. We also prove that counting non-necessarily perfect matchings in planar graphs is already #P-complete if the problem is restricted to bipartite graphs. So far hardness was proved…
▽ More
Here we prove that counting maximum matchings in planar, bipartite graphs is #P-complete. This is somewhat surprising in the light that the number of perfect matchings in planar graphs can be computed in polynomial time. We also prove that counting non-necessarily perfect matchings in planar graphs is already #P-complete if the problem is restricted to bipartite graphs. So far hardness was proved only for general, non-necessarily bipartite graphs.
△ Less
Submitted 8 March, 2021; v1 submitted 6 January, 2020;
originally announced January 2020.
-
Half-graphs, other non-stable degree sequences, and the switch Markov chain
Authors:
Péter L. Erdős,
Ervin Győri,
Tamás Róbert Mezei,
István Miklós,
Dániel Soltész
Abstract:
One of the simplest methods of generating a random graph with a given degree sequence is provided by the Monte Carlo Markov Chain method using switches. The switch Markov chain converges to the uniform distribution, but generally the rate of convergence is not known. After a number of results concerning various degree sequences, rapid mixing was established for so-called $P$-stable degree sequence…
▽ More
One of the simplest methods of generating a random graph with a given degree sequence is provided by the Monte Carlo Markov Chain method using switches. The switch Markov chain converges to the uniform distribution, but generally the rate of convergence is not known. After a number of results concerning various degree sequences, rapid mixing was established for so-called $P$-stable degree sequences (including that of directed graphs), which covers every previously known rapidly mixing region of degree sequences.
In this paper we give a non-trivial family of degree sequences that are not $P$-stable and the switch Markov chain is still rapidly mixing on them. This family has an intimate connection to Tyshkevich-decompositions and strong stability as well.
△ Less
Submitted 16 June, 2020; v1 submitted 5 September, 2019;
originally announced September 2019.
-
The mixing time of the switch Markov chains: a unified approach
Authors:
Péter L. Erdős,
Catherine Greenhill,
Tamás Róbert Mezei,
István Miklós,
Dániel Soltész,
Lajos Soukup
Abstract:
Since 1997 a considerable effort has been spent to study the mixing time of switch Markov chains on the realizations of graphic degree sequences of simple graphs. Several results were proved on rapidly mixing Markov chains on unconstrained, bipartite, and directed sequences, using different mechanisms. The aim of this paper is to unify these approaches. We will illustrate the strength of the unifi…
▽ More
Since 1997 a considerable effort has been spent to study the mixing time of switch Markov chains on the realizations of graphic degree sequences of simple graphs. Several results were proved on rapidly mixing Markov chains on unconstrained, bipartite, and directed sequences, using different mechanisms. The aim of this paper is to unify these approaches. We will illustrate the strength of the unified method by showing that on any $P$-stable family of unconstrained/bipartite/directed degree sequences the switch Markov chain is rapidly mixing. This is a common generalization of every known result that shows the rapid mixing nature of the switch Markov chain on a region of degree sequences. Two applications of this general result will be presented. One is an almost uniform sampler for power-law degree sequences with exponent $γ>1+\sqrt{3}$. The other one shows that the switch Markov chain on the degree sequence of an Erdős-Rényi random graph $G(n,p)$ is asymptotically almost surely rapidly mixing if $p$ is bounded away from 0 and 1 by at least $\frac{5\log n}{n-1}$.
△ Less
Submitted 31 July, 2021; v1 submitted 15 March, 2019;
originally announced March 2019.
-
Not all simple looking degree sequence problems are easy
Authors:
Péter L. Erdős,
István Miklós
Abstract:
Degree sequence (DS) problems are around for at least hundred twenty years, and with the advent of network science, more and more complicated, structured DS problems were invented. Interestingly enough all those problems so far are computationally easy. It is clear, however, that we will find soon computationally hard DS problems. In this paper we want to find such hard DS problems with relatively…
▽ More
Degree sequence (DS) problems are around for at least hundred twenty years, and with the advent of network science, more and more complicated, structured DS problems were invented. Interestingly enough all those problems so far are computationally easy. It is clear, however, that we will find soon computationally hard DS problems. In this paper we want to find such hard DS problems with relatively simple definition.
For a vertex $v$ in the simple graph $G$ denote $d_i(v)$ the number of vertices at distance exactly $i$ from $v$. Then $d_1(v)$ is the usual degree of vertex $v.$ The vector $\mathbf{d}^2(G)=( (d_1(v_1), d_2(v_1)), \ldots,$ $(d_1(v_n), d_2(v_n))$ is the {\bf second order degree sequence} of the graph $G$. In this note we show that the problem to decide whether a sequence of natural numbers $((i_1,j_1),\ldots (i_n,j_n))$ is a second order degree sequence of a simple undirected graph $G$ is strongly NP-complete. Then we will discuss some further NP-complete DS problems.
△ Less
Submitted 3 November, 2016; v1 submitted 2 June, 2016;
originally announced June 2016.
-
New classes of degree sequences with fast mixing swap Markov chain sampling
Authors:
Péter L. Erdős,
István Miklós,
Zoltán Toroczkai
Abstract:
In network modeling of complex systems one is often required to sample random realizations of networks that obey a given set of constraints, usually in form of graph measures. A much studied class of problems targets uniform sampling of simple graphs with given degree sequence or also with given degree correlations expressed in the form of a joint degree matrix. One approach is to use Markov chain…
▽ More
In network modeling of complex systems one is often required to sample random realizations of networks that obey a given set of constraints, usually in form of graph measures. A much studied class of problems targets uniform sampling of simple graphs with given degree sequence or also with given degree correlations expressed in the form of a joint degree matrix. One approach is to use Markov chains based on edge switches (swaps) that preserve the constraints, are irreducible (ergodic) and fast mixing. In 1999, Kannan, Tetali and Vempala (KTV) proposed a simple swap Markov chain for sampling graphs with given degree sequence and conjectured that it mixes rapidly (in poly-time) for arbitrary degree sequences. While the conjecture is still open, it was proven for special degree sequences, in particular, for those of undirected and directed regular simple graphs, of half-regular bipartite graphs, and of graphs with certain bounded maximum degrees. Here we prove the fast mixing KTV conjecture for novel, exponentially large classes of irregular degree sequences. Our method is based on a canonical decomposition of degree sequences into split graph degree sequences, a structural theorem for the space of graph realizations and on a factorization theorem for Markov chains. After introducing bipartite splitted degree sequences, we also generalize the canonical split graph decomposition for bipartite and directed graphs.
△ Less
Submitted 14 December, 2016; v1 submitted 29 January, 2016;
originally announced January 2016.
-
The computational complexity of calculating partition functions of optimal medians with Hamming distance
Authors:
István Miklós,
Heather Smith
Abstract:
In this paper, we show that calculating the partition function of optimal medians of binary strings with Hamming distance is \#P-complete for several weight functions. The case when the weight function is the factorial function has application in bioinformatics. In that case, the partition function counts the most parsimonious evolutionary scenarios on a star tree under several models in bioinform…
▽ More
In this paper, we show that calculating the partition function of optimal medians of binary strings with Hamming distance is \#P-complete for several weight functions. The case when the weight function is the factorial function has application in bioinformatics. In that case, the partition function counts the most parsimonious evolutionary scenarios on a star tree under several models in bioinformatics. The results are extended to binary trees and we show that it is also \#P-complete to calculate the most parsimonious evolutionary scenarios on an arbitrary binary tree under the substitution model of biological sequences and under the Single Cut-or-Join model for genome rearrangements.
△ Less
Submitted 13 January, 2017; v1 submitted 19 June, 2015;
originally announced June 2015.
-
Exact sampling of graphs with prescribed degree correlations
Authors:
Kevin E. Bassler,
Charo I. Del Genio,
Péter L. Erdős,
István Miklós,
Zoltán Toroczkai
Abstract:
Many real-world networks exhibit correlations between the node degrees. For instance, in social networks nodes tend to connect to nodes of similar degree. Conversely, in biological and technological networks, high-degree nodes tend to be linked with low-degree nodes. Degree correlations also affect the dynamics of processes supported by a network structure, such as the spread of opinions or epidem…
▽ More
Many real-world networks exhibit correlations between the node degrees. For instance, in social networks nodes tend to connect to nodes of similar degree. Conversely, in biological and technological networks, high-degree nodes tend to be linked with low-degree nodes. Degree correlations also affect the dynamics of processes supported by a network structure, such as the spread of opinions or epidemics. The proper modelling of these systems, i.e., without uncontrolled biases, requires the sampling of networks with a specified set of constraints. We present a solution to the sampling problem when the constraints imposed are the degree correlations. In particular, we develop an efficient and exact method to construct and sample graphs with a specified joint-degree matrix, which is a matrix providing the number of edges between all the sets of nodes of a given degree, for all degrees, thus completely specifying all pairwise degree correlations, and additionally, the degree sequence itself. Our algorithm always produces independent samples without backtracking. The complexity of the graph construction algorithm is O(NM) where N is the number of nodes and M is the number of edges.
△ Less
Submitted 30 June, 2015; v1 submitted 23 March, 2015;
originally announced March 2015.
-
Approximate statistical alignment by iterative sampling of substitution matrices
Authors:
Joseph L. Herman,
Adrienn Szabó,
Instván Miklós,
Jotun Hein
Abstract:
We outline a procedure for jointly sampling substitution matrices and multiple sequence alignments, according to an approximate posterior distribution, using an MCMC-based algorithm. This procedure provides an efficient and simple method by which to generate alternative alignments according to their expected accuracy, and allows appropriate parameters for substitution matrices to be selected in an…
▽ More
We outline a procedure for jointly sampling substitution matrices and multiple sequence alignments, according to an approximate posterior distribution, using an MCMC-based algorithm. This procedure provides an efficient and simple method by which to generate alternative alignments according to their expected accuracy, and allows appropriate parameters for substitution matrices to be selected in an automated fashion. In the cases considered here, the sampled alignments with the highest likelihood have an accuracy consistently higher than alignments generated using the standard BLOSUM62 matrix.
△ Less
Submitted 19 January, 2015;
originally announced January 2015.
-
A decomposition based proof for fast mixing of a Markov chain over balanced realizations of a joint degree matrix
Authors:
Péter L. Erdős,
István Miklós,
Zoltán Toroczkai
Abstract:
A joint degree matrix (JDM) specifies the number of connections between nodes of given degrees in a graph, for all degree pairs and uniquely determines the degree sequence of the graph. We consider the space of all balanced realizations of an arbitrary JDM, realizations in which the links between any two degree groups are placed as uniformly as possible. We prove that a swap Markov Chain Monte Car…
▽ More
A joint degree matrix (JDM) specifies the number of connections between nodes of given degrees in a graph, for all degree pairs and uniquely determines the degree sequence of the graph. We consider the space of all balanced realizations of an arbitrary JDM, realizations in which the links between any two degree groups are placed as uniformly as possible. We prove that a swap Markov Chain Monte Carlo (MCMC) algorithm in the space of all balanced realizations of an {\em arbitrary} graphical JDM mixes rapidly, i.e., the relaxation time of the chain is bounded from above by a polynomial in the number of nodes $n$. To prove fast mixing, we first prove a general factorization theorem similar to the Martin-Randall method for disjoint decompositions (partitions). This theorem can be used to bound from below the spectral gap with the help of fast mixing subchains within every partition and a bound on an auxiliary Markov chain between the partitions. Our proof of the general factorization theorem is direct and uses conductance based methods (Cheeger inequality).
△ Less
Submitted 16 September, 2014; v1 submitted 19 July, 2013;
originally announced July 2013.
-
On sampling SCJ rearrangement scenarios
Authors:
Istvan Miklos,
Sandor Z. Kiss,
Eric Tannier
Abstract:
The Single Cut or Join (SCJ) operation on genomes, generalizing chromosome evolution by fusions and fissions, is the computationally simplest known model of genome rearrangement. While most genome rearrangement problems are already hard when comparing three genomes, it is possible to compute in polynomial time a most parsimonious SCJ scenario for an arbitrary number of genomes related by a binary…
▽ More
The Single Cut or Join (SCJ) operation on genomes, generalizing chromosome evolution by fusions and fissions, is the computationally simplest known model of genome rearrangement. While most genome rearrangement problems are already hard when comparing three genomes, it is possible to compute in polynomial time a most parsimonious SCJ scenario for an arbitrary number of genomes related by a binary phylogenetic tree.
Here we consider the problems of sampling and counting the most parsimonious SCJ scenarios. We show that both the sampling and counting problems are easy for two genomes, and we relate SCJ scenarios to alternating permutations. However, for an arbitrary number of genomes related by a binary phylogenetic tree, the counting and sampling problems become hard. We prove that if a Fully Polynomial Randomized Approximation Scheme or a Fully Polynomial Almost Uniform Sampler exist for the most parsimonious SCJ scenario, then RP = NP.
The proof has a wider scope than genome rearrangements: the same result holds for parsimonious evolutionary scenarios on any set of discrete characters.
△ Less
Submitted 8 April, 2013;
originally announced April 2013.
-
Proving the Pressing Game Conjecture on Linear Graphs
Authors:
Eliot Bixby,
Toby Flint,
István Miklós
Abstract:
The pressing game on black-and-white graphs is the following: Given a graph $G(V,E)$ with its vertices colored with black and white, any black vertex $v$ can be pressed, which has the following effect: (a) all neighbors of $v$ change color, i.e. white neighbors become black and \emph{vice versa}, (b) all pairs of neighbors of $v$ change connectivity, i.e. connected pairs become unconnected, unconn…
▽ More
The pressing game on black-and-white graphs is the following: Given a graph $G(V,E)$ with its vertices colored with black and white, any black vertex $v$ can be pressed, which has the following effect: (a) all neighbors of $v$ change color, i.e. white neighbors become black and \emph{vice versa}, (b) all pairs of neighbors of $v$ change connectivity, i.e. connected pairs become unconnected, unconnected ones become connected, (c) and finally, $v$ becomes a separated white vertex. The aim of the game is to transform $G$ into an all white, empty graph. It is a known result that the all white empty graph is reachable in the pressing game if each component of $G$ contains at least one black vertex, and for a fixed graph, any successful transformation has the same number of pressed vertices.
The pressing game conjecture is that any successful pressing path can be transformed into any other successful pressing path with small alterations. Here we prove the conjecture for linear graphs. The connection to genome rearrangement and sorting signed permutations with reversals is also discussed.
△ Less
Submitted 27 March, 2013;
originally announced March 2013.
-
On Realizations of a Joint Degree Matrix
Authors:
Éva Czabarka,
Aaron Dutle,
Péter Erdös,
István Miklós
Abstract:
The joint degree matrix of a graph gives the number of edges between vertices of degree i and degree j for every pair (i,j). One can perform restricted swap operations to transform a graph into another with the same joint degree matrix. We prove that the space of all realizations of a given joint degree matrix over a fixed vertex set is connected via these restricted swap operations. This was clai…
▽ More
The joint degree matrix of a graph gives the number of edges between vertices of degree i and degree j for every pair (i,j). One can perform restricted swap operations to transform a graph into another with the same joint degree matrix. We prove that the space of all realizations of a given joint degree matrix over a fixed vertex set is connected via these restricted swap operations. This was claimed before, but there is an error in the previous proof, which we illustrate by example. We also give a simplified proof of the necessary and sufficient conditions for a matrix to be a joint degree matrix. Finally, we address some of the issues concerning the mixing time of the corresponding MCMC method to sample uniformly from these realizations.
△ Less
Submitted 15 February, 2013; v1 submitted 14 February, 2013;
originally announced February 2013.
-
Modulated String Searching
Authors:
Alberto Apostolico,
Péter L. Erdős,
István Miklós,
Johannes Siemons
Abstract:
In his 1987 paper entitled "Generalized String Matching", Abrahamson introduced {\em pattern matching with character classes} and provided the first efficient algorithm to solve it. The best known solution to date is due to Linhart and Shamir (2009).
Another broad yet comparatively less studied class of string matching problems is that of numerical string searching, such as, e.g., the `less-than…
▽ More
In his 1987 paper entitled "Generalized String Matching", Abrahamson introduced {\em pattern matching with character classes} and provided the first efficient algorithm to solve it. The best known solution to date is due to Linhart and Shamir (2009).
Another broad yet comparatively less studied class of string matching problems is that of numerical string searching, such as, e.g., the `less-than' or $L_1$-norm string searching. The best known solutions for problems in this class are based on FFT convolution after some suitable re-encoding.
The present paper introduces {\em modulated string searching} as a unified framework for string matching problems where the numerical conditions can be combined with some Boolean/numerical decision conditions on the character classes. One example problem in this class is the {\em locally bounded $L_1$-norm} matching problem on character classes: here the "match" between a character at some position in the text and a set of characters at some position in the pattern is assessed based on the smallest $L_1$ distance between the text character and one of those pattern characters. The two positions "match" if the (absolute value of the) difference between the two characters does not exceed a predefined constant. The pattern has an occurrence in an alignment with the text if the sum of all such differences does not exceed a second predefined constant value. This problem requires a pointwise evaluation of the quality of each match and has no known solution based on the previously mentioned algorithms.
△ Less
Submitted 25 July, 2013; v1 submitted 14 February, 2013;
originally announced February 2013.
-
On the swap-distances of different realizations of a graphical degree sequence
Authors:
Péter L. Erdős,
Zoltán Király,
István Miklós
Abstract:
One of the first graph theoretical problems which got serious attention (already in the fifties of the last century) was to decide whether a given integer sequence is equal to the degree sequence of a simple graph (or it is {\em graphical} for short). One method to solve this problem is the greedy algorithm of Havel and Hakimi, which is based on the {\em swap} operation. Another, closely related q…
▽ More
One of the first graph theoretical problems which got serious attention (already in the fifties of the last century) was to decide whether a given integer sequence is equal to the degree sequence of a simple graph (or it is {\em graphical} for short). One method to solve this problem is the greedy algorithm of Havel and Hakimi, which is based on the {\em swap} operation. Another, closely related question is to find a sequence of swap operations to transform one graphical realization into another one of the same degree sequence. This latter problem got particular emphases in connection of fast mixing Markov chain approaches to sample uniformly all possible realizations of a given degree sequence. (This becomes a matter of interest in connection of -- among others -- the study of large social networks.) Earlier there were only crude upper bounds on the shortest possible length of such swap sequences between two realizations. In this paper we develop formulae (Gallai-type identities) for these {\em swap-distance}s of any two realizations of simple undirected or directed degree sequences. These identities improves considerably the known upper bounds on the swap-distances.
△ Less
Submitted 20 February, 2013; v1 submitted 13 May, 2012;
originally announced May 2012.
-
A linear memory algorithm for Baum-Welch training
Authors:
Istvan Miklos,
Irmtraud M. Meyer
Abstract:
Background: Baum-Welch training is an expectation-maximisation algorithm for training the emission and transition probabilities of hidden Markov models in a fully automated way.
Methods and results: We introduce a linear space algorithm for Baum-Welch training. For a hidden Markov model with M states, T free transition and E free emission parameters, and an input sequence of length L, our new…
▽ More
Background: Baum-Welch training is an expectation-maximisation algorithm for training the emission and transition probabilities of hidden Markov models in a fully automated way.
Methods and results: We introduce a linear space algorithm for Baum-Welch training. For a hidden Markov model with M states, T free transition and E free emission parameters, and an input sequence of length L, our new algorithm requires O(M) memory and O(L M T_max (T + E)) time for one Baum-Welch iteration, where T_max is the maximum number of states that any state is connected to. The most memory efficient algorithm until now was the checkpointing algorithm with O(log(L) M) memory and O(log(L) L M T_max) time requirement. Our novel algorithm thus renders the memory requirement completely independent of the length of the training sequences. More generally, for an n-hidden Markov model and n input sequences of length L, the memory requirement of O(log(L) L^(n-1) M) is reduced to O(L^(n-1) M) memory while the running time is changed from O(log(L) L^n M T_max + L^n (T + E)) to O(L^n M T_max (T + E)).
Conclusions: For the large class of hidden Markov models used for example in gene prediction, whose number of states does not scale with the length of the input sequence, our novel algorithm can thus be both faster and more memory-efficient than any of the existing algorithms.
△ Less
Submitted 16 August, 2005; v1 submitted 11 May, 2005;
originally announced May 2005.