-
Sampling Decisions
Authors:
Michael Chertkov,
Sungsoo Ahn,
Hamidreza Behjoo
Abstract:
In this manuscript we introduce a novel Decision Flow (DF) framework for sampling from a target distribution while incorporating additional guidance from a prior sampler. DF can be viewed as an AI driven algorithmic reincarnation of the Markov Decision Process (MDP) approach in Stochastic Optimal Control. It extends the continuous space, continuous time path Integral Diffusion sampling technique t…
▽ More
In this manuscript we introduce a novel Decision Flow (DF) framework for sampling from a target distribution while incorporating additional guidance from a prior sampler. DF can be viewed as an AI driven algorithmic reincarnation of the Markov Decision Process (MDP) approach in Stochastic Optimal Control. It extends the continuous space, continuous time path Integral Diffusion sampling technique to discrete time and space, while also generalizing the Generative Flow Network framework. In its most basic form, an explicit, Neural Network (NN) free formulation, DF leverages the linear solvability of the the underlying MDP to adjust the transition probabilities of the prior sampler. The resulting Markov Process is expressed as a convolution of the reverse time Green's function of the prior sampling with the target distribution. We illustrate the DF framework through an example of sampling from the Ising model, discuss potential NN based extensions, and outline how DF can enhance guided sampling across various applications.
△ Less
Submitted 17 March, 2025;
originally announced March 2025.
-
Mixing Artificial and Natural Intelligence: From Statistical Mechanics to AI and Back to Turbulence
Authors:
Michael Chertkov
Abstract:
The paper reflects on the future role of AI in scientific research, with a special focus on turbulence studies, and examines the evolution of AI, particularly through Diffusion Models rooted in non-equilibrium statistical mechanics. It underscores the significant impact of AI on advancing reduced, Lagrangian models of turbulence through innovative use of deep neural networks. Additionally, the pap…
▽ More
The paper reflects on the future role of AI in scientific research, with a special focus on turbulence studies, and examines the evolution of AI, particularly through Diffusion Models rooted in non-equilibrium statistical mechanics. It underscores the significant impact of AI on advancing reduced, Lagrangian models of turbulence through innovative use of deep neural networks. Additionally, the paper reviews various other AI applications in turbulence research and outlines potential challenges and opportunities in the concurrent advancement of AI and statistical hydrodynamics. This discussion sets the stage for a future where AI and turbulence research are intricately intertwined, leading to more profound insights and advancements in both fields.
△ Less
Submitted 12 July, 2024; v1 submitted 26 March, 2024;
originally announced March 2024.
-
Universality and Control of Fat Tails
Authors:
Michael Chertkov
Abstract:
Motivated by applications in hydrodynamics and networks of thermostatically-control loads in buildings we study control of linear dynamical systems driven by additive and also multiplicative noise of a general position. Utilizing mathematical theory of stochastic multiplicative processes we present a universal way to estimate fat, algebraic tails of the state vector probability distributions. This…
▽ More
Motivated by applications in hydrodynamics and networks of thermostatically-control loads in buildings we study control of linear dynamical systems driven by additive and also multiplicative noise of a general position. Utilizing mathematical theory of stochastic multiplicative processes we present a universal way to estimate fat, algebraic tails of the state vector probability distributions. This prompts us to introduce and analyze mean-q-power stability criterion, generalizing the mean-square stability criterion, and then juxtapose it to other tools in control.
△ Less
Submitted 11 December, 2023; v1 submitted 16 March, 2023;
originally announced March 2023.
-
Exact Fractional Inference via Re-Parametrization & Interpolation between Tree-Re-Weighted- and Belief Propagation- Algorithms
Authors:
Hamidreza Behjoo,
Michael Chertkov
Abstract:
Computing the partition function, $Z$, of an Ising model over a graph of $N$ \enquote{spins} is most likely exponential in $N$. Efficient variational methods, such as Belief Propagation (BP) and Tree Re-Weighted (TRW) algorithms, compute $Z$ approximately by minimizing the respective (BP- or TRW-) free energy. We generalize the variational scheme by building a $λ$-fractional interpolation,…
▽ More
Computing the partition function, $Z$, of an Ising model over a graph of $N$ \enquote{spins} is most likely exponential in $N$. Efficient variational methods, such as Belief Propagation (BP) and Tree Re-Weighted (TRW) algorithms, compute $Z$ approximately by minimizing the respective (BP- or TRW-) free energy. We generalize the variational scheme by building a $λ$-fractional interpolation, $Z^{(λ)}$, where $λ=0$ and $λ=1$ correspond to TRW- and BP-approximations, respectively. This fractional scheme -- coined Fractional Belief Propagation (FBP) -- guarantees that in the attractive (ferromagnetic) case $Z^{(TRW)} \geq Z^{(λ)} \geq Z^{(BP)}$, and there exists a unique (\enquote{exact}) $λ_*$ such that $Z=Z^{(λ_*)}$. Generalizing the re-parametrization approach of \citep{wainwright_tree-based_2002} and the loop series approach of \citep{chertkov_loop_2006}, we show how to express $Z$ as a product, $\forall λ:\ Z=Z^{(λ)}{\tilde Z}^{(λ)}$, where the multiplicative correction, ${\tilde Z}^{(λ)}$, is an expectation over a node-independent probability distribution built from node-wise fractional marginals. Our theoretical analysis is complemented by extensive experiments with models from Ising ensembles over planar and random graphs of medium and large sizes. Our empirical study yields a number of interesting observations, such as the ability to estimate ${\tilde Z}^{(λ)}$ with $O(N^{2::4})$ fractional samples and suppression of variation in $λ_*$ estimates with an increase in $N$ for instances from a particular random Ising ensemble, where $[2::4]$ indicates a range from $2$ to $4$. We also discuss the applicability of this approach to the problem of image de-noising.
△ Less
Submitted 13 November, 2024; v1 submitted 24 January, 2023;
originally announced January 2023.
-
A New Family of Tractable Ising Models
Authors:
Valerii Likhosherstov,
Yury Maximov,
Michael Chertkov
Abstract:
We present a new family of zero-field Ising models over N binary variables/spins obtained by consecutive "gluing" of planar and $O(1)$-sized components along with subsets of at most three vertices into a tree. The polynomial time algorithm of the dynamic programming type for solving exact inference (partition function computation) and sampling consists of a sequential application of an efficient (…
▽ More
We present a new family of zero-field Ising models over N binary variables/spins obtained by consecutive "gluing" of planar and $O(1)$-sized components along with subsets of at most three vertices into a tree. The polynomial time algorithm of the dynamic programming type for solving exact inference (partition function computation) and sampling consists of a sequential application of an efficient (for planar) or brute-force (for $O(1)$-sized) inference and sampling to the components as a black box. To illustrate the utility of the new family of tractable graphical models, we first build an $O(N^{3/2})$ algorithm for inference and sampling of the K5-minor-free zero-field Ising models - an extension of the planar zero-field Ising models - which is neither genus- nor treewidth-bounded. Second, we demonstrate empirically an improvement in the approximation quality of the NP-hard problem of the square-grid Ising model (with non-zero field) inference.
△ Less
Submitted 14 June, 2019;
originally announced June 2019.
-
Inference and Sampling of $K_{33}$-free Ising Models
Authors:
Valerii Likhosherstov,
Yury Maximov,
Michael Chertkov
Abstract:
We call an Ising model tractable when it is possible to compute its partition function value (statistical inference) in polynomial time. The tractability also implies an ability to sample configurations of this model in polynomial time. The notion of tractability extends the basic case of planar zero-field Ising models. Our starting point is to describe algorithms for the basic case computing part…
▽ More
We call an Ising model tractable when it is possible to compute its partition function value (statistical inference) in polynomial time. The tractability also implies an ability to sample configurations of this model in polynomial time. The notion of tractability extends the basic case of planar zero-field Ising models. Our starting point is to describe algorithms for the basic case computing partition function and sampling efficiently. To derive the algorithms, we use an equivalent linear transition to perfect matching counting and sampling on an expanded dual graph. Then, we extend our tractable inference and sampling algorithms to models, whose triconnected components are either planar or graphs of $O(1)$ size. In particular, it results in a polynomial-time inference and sampling algorithms for $K_{33}$ (minor) free topologies of zero-field Ising models - a generalization of planar graphs with a potentially unbounded genus.
△ Less
Submitted 21 May, 2019; v1 submitted 22 December, 2018;
originally announced December 2018.
-
Mean Field Control for Efficient Mixing of Energy Loads
Authors:
David Métivier,
Michael Chertkov
Abstract:
We pose an engineering challenge of controlling an Ensemble of Energy Devices via coordinated, implementation-light and randomized on/off switching as a problem in Non-Equilibrium Statistical Mechanics. We show that Mean Field Control} with nonlinear feedback on the cumulative consumption, assumed available to the aggregator via direct physical measurements of the energy flow, allows the ensemble…
▽ More
We pose an engineering challenge of controlling an Ensemble of Energy Devices via coordinated, implementation-light and randomized on/off switching as a problem in Non-Equilibrium Statistical Mechanics. We show that Mean Field Control} with nonlinear feedback on the cumulative consumption, assumed available to the aggregator via direct physical measurements of the energy flow, allows the ensemble to recover from its use in the Demand Response regime, i.e. transition to a statistical steady state, significantly faster than in the case of the fixed feedback. Moreover when the nonlinearity is sufficiently strong, one observes the phenomenon of "super-relaxation" -- where the total instantaneous energy consumption of the ensemble transitions to the steady state much faster than the underlying probability distribution of the devices over their state space, while also leaving almost no devices outside of the comfort zone.
△ Less
Submitted 15 January, 2020; v1 submitted 30 September, 2018;
originally announced October 2018.
-
Power of Ensemble Diversity and Randomization for Energy Aggregation
Authors:
David Métivier,
Ilia Luchnikov,
Michael Chertkov
Abstract:
We study an ensemble of diverse (inhomogeneous) thermostatically controlled loads aggregated to provide the demand response (DR) services in a district-level energy system. Each load in the ensemble is assumed to be equipped with a random number generator switching heating/cooling on or off with a Poisson rate, $r$, when the load leaves the comfort zone. Ensemble diversity is modeled through inhom…
▽ More
We study an ensemble of diverse (inhomogeneous) thermostatically controlled loads aggregated to provide the demand response (DR) services in a district-level energy system. Each load in the ensemble is assumed to be equipped with a random number generator switching heating/cooling on or off with a Poisson rate, $r$, when the load leaves the comfort zone. Ensemble diversity is modeled through inhomogeneity/disorder in the deterministic dynamics of loads. Approached from the standpoint of statistical physics, the ensemble represents a non-equilibrium system driven away from its natural steady state by the DR. The ability of the ensemble to recover by mixing faster to the steady state after its DR's use is advantageous. The trade-off between the level of the aggregator's control, commanding the devices to lower the rate $r$, and the phase-space-oscillatory deterministic dynamics is analyzed. We discover that there exists a critical value, $r_c$, corresponding to both the most efficient mixing and the bifurcation point where the ensemble transitions from the oscillatory relaxation at $r>r_c$ to the pure relaxation at $r<r_c$. Then, we study the effect of the load diversity, investigating four different disorder probability distributions (DPDs) ranging from the case of the Gaussian DPD to the case of the uniform with finite support DPD. Demonstrating resemblance to the similar question of the effectiveness of Landau damping in plasma physics, we show that stronger regularity of the DPD around its maximum results in faster mixing. Our theoretical analysis is supported by extensive numerical validation, which also allows us to access the effect of the ensemble's finite size.
△ Less
Submitted 3 October, 2018; v1 submitted 28 August, 2018;
originally announced August 2018.
-
Optimal structure and parameter learning of Ising models
Authors:
Andrey Y. Lokhov,
Marc Vuffray,
Sidhant Misra,
Michael Chertkov
Abstract:
Reconstruction of structure and parameters of an Ising model from binary samples is a problem of practical importance in a variety of disciplines, ranging from statistical physics and computational biology to image processing and machine learning. The focus of the research community shifted towards developing universal reconstruction algorithms which are both computationally efficient and require…
▽ More
Reconstruction of structure and parameters of an Ising model from binary samples is a problem of practical importance in a variety of disciplines, ranging from statistical physics and computational biology to image processing and machine learning. The focus of the research community shifted towards developing universal reconstruction algorithms which are both computationally efficient and require the minimal amount of expensive data. We introduce a new method, Interaction Screening, which accurately estimates the model parameters using local optimization problems. The algorithm provably achieves perfect graph structure recovery with an information-theoretically optimal number of samples, notably in the low-temperature regime which is known to be the hardest for learning. The efficacy of Interaction Screening is assessed through extensive numerical tests on synthetic Ising models of various topologies with different types of interactions, as well as on a real data produced by a D-Wave quantum computer. This study shows that the Interaction Screening method is an exact, tractable and optimal technique universally solving the inverse Ising problem.
△ Less
Submitted 26 December, 2017; v1 submitted 15 December, 2016;
originally announced December 2016.
-
Interaction Screening: Efficient and Sample-Optimal Learning of Ising Models
Authors:
Marc Vuffray,
Sidhant Misra,
Andrey Y. Lokhov,
Michael Chertkov
Abstract:
We consider the problem of learning the underlying graph of an unknown Ising model on p spins from a collection of i.i.d. samples generated from the model. We suggest a new estimator that is computationally efficient and requires a number of samples that is near-optimal with respect to previously established information-theoretic lower-bound. Our statistical estimator has a physical interpretation…
▽ More
We consider the problem of learning the underlying graph of an unknown Ising model on p spins from a collection of i.i.d. samples generated from the model. We suggest a new estimator that is computationally efficient and requires a number of samples that is near-optimal with respect to previously established information-theoretic lower-bound. Our statistical estimator has a physical interpretation in terms of "interaction screening". The estimator is consistent and is efficiently implemented using convex optimization. We prove that with appropriate regularization, the estimator recovers the underlying graph using a number of samples that is logarithmic in the system size p and exponential in the maximum coupling-intensity and maximum node-degree.
△ Less
Submitted 19 December, 2016; v1 submitted 23 May, 2016;
originally announced May 2016.
-
Extreme value statistics of work done in stretching a polymer in a gradient flow
Authors:
Marija Vucelja,
Konstantin S. Turitsyn,
Michael Chertkov
Abstract:
We analyze the statistics of work generated by a gradient flow to stretch a nonlinear polymer. We obtain the Large Deviation Function (LDF) of the work in the full range of appropriate parameters by combining analytical and numerical tools. The LDF shows two distinct asymptotes: "near tails" are linear in work and dominated by coiled polymer configurations, while "far tails" are quadratic in work…
▽ More
We analyze the statistics of work generated by a gradient flow to stretch a nonlinear polymer. We obtain the Large Deviation Function (LDF) of the work in the full range of appropriate parameters by combining analytical and numerical tools. The LDF shows two distinct asymptotes: "near tails" are linear in work and dominated by coiled polymer configurations, while "far tails" are quadratic in work and correspond to preferentially fully stretched polymers. We find the extreme value statistics of work for several singular elastic potentials, as well as the mean and the dispersion of work near the coil-stretch transition. The dispersion shows a maximum at the transition.
△ Less
Submitted 10 February, 2015; v1 submitted 9 February, 2014;
originally announced February 2014.
-
Stochastic Optimal Control as Non-equilibrium Statistical Mechanics: Calculus of Variations over Density and Current
Authors:
Vladimir Y. Chernyak,
Michael Chertkov,
Joris Bierkens,
Hilbert J. Kappen
Abstract:
In Stochastic Optimal Control (SOC) one minimizes the average cost-to-go, that consists of the cost-of-control (amount of efforts), cost-of-space (where one wants the system to be) and the target cost (where one wants the system to arrive), for a system participating in forced and controlled Langevin dynamics. We extend the SOC problem by introducing an additional cost-of-dynamics, characterized b…
▽ More
In Stochastic Optimal Control (SOC) one minimizes the average cost-to-go, that consists of the cost-of-control (amount of efforts), cost-of-space (where one wants the system to be) and the target cost (where one wants the system to arrive), for a system participating in forced and controlled Langevin dynamics. We extend the SOC problem by introducing an additional cost-of-dynamics, characterized by a vector potential. We propose derivation of the generalized gauge-invariant Hamilton-Jacobi-Bellman equation as a variation over density and current, suggest hydrodynamic interpretation and discuss examples, e.g., ergodic control of a particle-within-a-circle, illustrating non-equilibrium space-time complexity.
△ Less
Submitted 27 June, 2013;
originally announced June 2013.
-
Loop Calculus and Bootstrap-Belief Propagation for Perfect Matchings on Arbitrary Graphs
Authors:
Michael Chertkov,
Andrew Gelfand,
Jinwoo Shin
Abstract:
This manuscript discusses computation of the Partition Function (PF) and the Minimum Weight Perfect Matching (MWPM) on arbitrary, non-bipartite graphs. We present two novel problem formulations - one for computing the PF of a Perfect Matching (PM) and one for finding MWPMs - that build upon the inter-related Bethe Free Energy, Belief Propagation (BP), Loop Calculus (LC), Integer Linear Programming…
▽ More
This manuscript discusses computation of the Partition Function (PF) and the Minimum Weight Perfect Matching (MWPM) on arbitrary, non-bipartite graphs. We present two novel problem formulations - one for computing the PF of a Perfect Matching (PM) and one for finding MWPMs - that build upon the inter-related Bethe Free Energy, Belief Propagation (BP), Loop Calculus (LC), Integer Linear Programming (ILP) and Linear Programming (LP) frameworks. First, we describe an extension of the LC framework to the PM problem. The resulting formulas, coined (fractional) Bootstrap-BP, express the PF of the original model via the BFE of an alternative PM problem. We then study the zero-temperature version of this Bootstrap-BP formula for approximately solving the MWPM problem. We do so by leveraging the Bootstrap-BP formula to construct a sequence of MWPM problems, where each new problem in the sequence is formed by contracting odd-sized cycles (or blossoms) from the previous problem. This Bootstrap-and-Contract procedure converges reliably and generates an empirically tight upper bound for the MWPM. We conclude by discussing the relationship between our iterative procedure and the famous Blossom Algorithm of Edmonds '65 and demonstrate the performance of the Bootstrap-and-Contract approach on a variety of weighted PM problems.
△ Less
Submitted 5 June, 2013;
originally announced June 2013.
-
Tail-Constraining Stochastic Linear-Quadratic Control: Large Deviation and Statistical Physics Approach
Authors:
Michael Chertkov,
Igor Kolokolov,
Vladimir Lebedev
Abstract:
Standard definition of the stochastic Risk-Sensitive Linear-Quadratic (RS-LQ) control depends on the risk parameter, which is normally left to be set exogenously. We reconsider the classical approach and suggest two alternatives resolving the spurious freedom naturally. One approach consists in seeking for the minimum of the tail of the Probability Distribution Function (PDF) of the cost functiona…
▽ More
Standard definition of the stochastic Risk-Sensitive Linear-Quadratic (RS-LQ) control depends on the risk parameter, which is normally left to be set exogenously. We reconsider the classical approach and suggest two alternatives resolving the spurious freedom naturally. One approach consists in seeking for the minimum of the tail of the Probability Distribution Function (PDF) of the cost functional at some large fixed value. Another option suggests to minimize the expectation value of the cost functional under constraint on the value of the PDF tail. Under assumption of the resulting control stability, both problems are reduced to static optimizations over stationary control matrix. The solutions are illustrated on the examples of scalar and 1d chain (string) systems. Large Deviation self-similar asymptotic of the cost functional PDF is analyzed.
△ Less
Submitted 15 July, 2012; v1 submitted 3 April, 2012;
originally announced April 2012.
-
Approximating the Permanent with Fractional Belief Propagation
Authors:
M. Chertkov,
A. B. Yedidia
Abstract:
We discuss schemes for exact and approximate computations of permanents, and compare them with each other. Specifically, we analyze the Belief Propagation (BP) approach and its Fractional Belief Propagation (FBP) generalization for computing the permanent of a non-negative matrix. Known bounds and conjectures are verified in experiments, and some new theoretical relations, bounds and conjectures a…
▽ More
We discuss schemes for exact and approximate computations of permanents, and compare them with each other. Specifically, we analyze the Belief Propagation (BP) approach and its Fractional Belief Propagation (FBP) generalization for computing the permanent of a non-negative matrix. Known bounds and conjectures are verified in experiments, and some new theoretical relations, bounds and conjectures are proposed. The Fractional Free Energy (FFE) functional is parameterized by a scalar parameter $γ\in[-1;1]$, where $γ=-1$ corresponds to the BP limit and $γ=1$ corresponds to the exclusion principle (but ignoring perfect matching constraints) Mean-Field (MF) limit. FFE shows monotonicity and continuity with respect to $γ$. For every non-negative matrix, we define its special value $γ_*\in[-1;0]$ to be the $γ$ for which the minimum of the $γ$-parameterized FFE functional is equal to the permanent of the matrix, where the lower and upper bounds of the $γ$-interval corresponds to respective bounds for the permanent. Our experimental analysis suggests that the distribution of $γ_*$ varies for different ensembles but $γ_*$ always lies within the $[-1;-1/2]$ interval. Moreover, for all ensembles considered the behavior of $γ_*$ is highly distinctive, offering an emprirical practical guidance for estimating permanents of non-negative matrices via the FFE approach.
△ Less
Submitted 5 January, 2013; v1 submitted 30 July, 2011;
originally announced August 2011.
-
Statistical Classification of Cascading Failures in Power Grids
Authors:
René Pfitzner,
Konstantin Turitsyn,
Michael Chertkov
Abstract:
We introduce a new microscopic model of the outages in transmission power grids. This model accounts for the automatic response of the grid to load fluctuations that take place on the scale of minutes, when the optimum power flow adjustments and load shedding controls are unavailable. We describe extreme events, initiated by load fluctuations, which cause cascading failures of loads, generators an…
▽ More
We introduce a new microscopic model of the outages in transmission power grids. This model accounts for the automatic response of the grid to load fluctuations that take place on the scale of minutes, when the optimum power flow adjustments and load shedding controls are unavailable. We describe extreme events, initiated by load fluctuations, which cause cascading failures of loads, generators and lines. Our model is quasi-static in the causal, discrete time and sequential resolution of individual failures. The model, in its simplest realization based on the Directed Current description of the power flow problem, is tested on three standard IEEE systems consisting of 30, 39 and 118 buses. Our statistical analysis suggests a straightforward classification of cascading and islanding phases in terms of the ratios between average number of removed loads, generators and links. The analysis also demonstrates sensitivity to variations in line capacities. Future research challenges in modeling and control of cascading outages over real-world power networks are discussed.
△ Less
Submitted 3 December, 2010;
originally announced December 2010.
-
Geometric Universality of Currents
Authors:
V. Y. Chernyak,
M. Chertkov,
N. A. Sinitsyn
Abstract:
We discuss a non-equilibrium statistical system on a graph or network. Identical particles are injected, interact with each other, traverse, and leave the graph in a stochastic manner described in terms of Poisson rates, possibly dependent on time and instantaneous occupation numbers at the nodes of the graph. We show that under the assumption of constancy of the relative rates, the system demonst…
▽ More
We discuss a non-equilibrium statistical system on a graph or network. Identical particles are injected, interact with each other, traverse, and leave the graph in a stochastic manner described in terms of Poisson rates, possibly dependent on time and instantaneous occupation numbers at the nodes of the graph. We show that under the assumption of constancy of the relative rates, the system demonstrates a profound statistical symmetry, resulting in geometric universality of the statistics of the particle currents. This phenomenon applies broadly to many man-made and natural open stochastic systems, such as queuing of packages over the internet, transport of electrons and quasi-particles in mesoscopic systems, and chains of reactions in bio-chemical networks. We illustrate the utility of our general approach using two enabling examples from the two latter disciplines.
△ Less
Submitted 16 August, 2010;
originally announced August 2010.
-
Non-Equilibrium Statistical Physics of Currents in Queuing Networks
Authors:
Vladimir Y. Chernyak,
Michael Chertkov,
David A. Goldberg,
Konstantin Turitsyn
Abstract:
We consider a stable open queuing network as a steady non-equilibrium system of interacting particles. The network is completely specified by its underlying graphical structure, type of interaction at each node, and the Markovian transition rates between nodes. For such systems, we ask the question ``What is the most likely way for large currents to accumulate over time in a network ?'', where tim…
▽ More
We consider a stable open queuing network as a steady non-equilibrium system of interacting particles. The network is completely specified by its underlying graphical structure, type of interaction at each node, and the Markovian transition rates between nodes. For such systems, we ask the question ``What is the most likely way for large currents to accumulate over time in a network ?'', where time is large compared to the system correlation time scale. We identify two interesting regimes. In the first regime, in which the accumulation of currents over time exceeds the expected value by a small to moderate amount (moderate large deviation), we find that the large-deviation distribution of currents is universal (independent of the interaction details), and there is no long-time and averaged over time accumulation of particles (condensation) at any nodes. In the second regime, in which the accumulation of currents over time exceeds the expected value by a large amount (severe large deviation), we find that the large-deviation current distribution is sensitive to interaction details, and there is a long-time accumulation of particles (condensation) at some nodes. The transition between the two regimes can be described as a dynamical second order phase transition. We illustrate these ideas using the simple, yet non-trivial, example of a single node with feedback.
△ Less
Submitted 19 June, 2010; v1 submitted 29 January, 2010;
originally announced January 2010.
-
Belief Propagation and Loop Calculus for the Permanent of a Non-Negative Matrix
Authors:
Yusuke Watanabe,
Michael Chertkov
Abstract:
We consider computation of permanent of a positive $(N\times N)$ non-negative matrix, $P=(P_i^j|i,j=1,\cdots,N)$, or equivalently the problem of weighted counting of the perfect matchings over the complete bipartite graph $K_{N,N}$. The problem is known to be of likely exponential complexity. Stated as the partition function $Z$ of a graphical model, the problem allows exact Loop Calculus represen…
▽ More
We consider computation of permanent of a positive $(N\times N)$ non-negative matrix, $P=(P_i^j|i,j=1,\cdots,N)$, or equivalently the problem of weighted counting of the perfect matchings over the complete bipartite graph $K_{N,N}$. The problem is known to be of likely exponential complexity. Stated as the partition function $Z$ of a graphical model, the problem allows exact Loop Calculus representation [Chertkov, Chernyak '06] in terms of an interior minimum of the Bethe Free Energy functional over non-integer doubly stochastic matrix of marginal beliefs, $β=(β_i^j|i,j=1,\cdots,N)$, also correspondent to a fixed point of the iterative message-passing algorithm of the Belief Propagation (BP) type. Our main result is an explicit expression of the exact partition function (permanent) in terms of the matrix of BP marginals, $β$, as $Z=\mbox{Perm}(P)=Z_{BP} \mbox{Perm}(β_i^j(1-β_i^j))/\prod_{i,j}(1-β_i^j)$, where $Z_{BP}$ is the BP expression for the permanent stated explicitly in terms if $β$. We give two derivations of the formula, a direct one based on the Bethe Free Energy and an alternative one combining the Ihara graph-$ζ$ function and the Loop Calculus approaches. Assuming that the matrix $β$ of the Belief Propagation marginals is calculated, we provide two lower bounds and one upper-bound to estimate the multiplicative term. Two complementary lower bounds are based on the Gurvits-van der Waerden theorem and on a relation between the modified permanent and determinant respectively.
△ Less
Submitted 2 May, 2010; v1 submitted 7 November, 2009;
originally announced November 2009.
-
Inference in particle tracking experiments by passing messages between images
Authors:
M. Chertkov,
L. Kroc,
F. Krzakala,
M. Vergassola,
L. Zdeborová
Abstract:
Methods to extract information from the tracking of mobile objects/particles have broad interest in biological and physical sciences. Techniques based on simple criteria of proximity in time-consecutive snapshots are useful to identify the trajectories of the particles. However, they become problematic as the motility and/or the density of the particles increases due to uncertainties on the trajec…
▽ More
Methods to extract information from the tracking of mobile objects/particles have broad interest in biological and physical sciences. Techniques based on simple criteria of proximity in time-consecutive snapshots are useful to identify the trajectories of the particles. However, they become problematic as the motility and/or the density of the particles increases due to uncertainties on the trajectories that particles followed during the images' acquisition time. Here, we report an efficient method for learning parameters of the dynamics of the particles from their positions in time-consecutive images. Our algorithm belongs to the class of message-passing algorithms, known in computer science, information theory and statistical physics as Belief Propagation (BP). The algorithm is distributed, thus allowing parallel implementation suitable for computations on multiple machines without significant inter-machine overhead. We test our method on the model example of particle tracking in turbulent flows, which is particularly challenging due to the strong transport that those flows produce. Our numerical experiments show that the BP algorithm compares in quality with exact Markov Chain Monte-Carlo algorithms, yet BP is far superior in speed. We also suggest and analyze a random-distance model that provides theoretical justification for BP accuracy. Methods developed here systematically formulate the problem of particle tracking and provide fast and reliable tools for its extensive range of applications.
△ Less
Submitted 14 May, 2010; v1 submitted 23 September, 2009;
originally announced September 2009.
-
Message Passing for Integrating and Assessing Renewable Generation in a Redundant Power Grid
Authors:
Lenka Zdeborová,
Scott Backhaus,
Michael Chertkov
Abstract:
A simplified model of a redundant power grid is used to study integration of fluctuating renewable generation. The grid consists of large number of generator and consumer nodes. The net power consumption is determined by the difference between the gross consumption and the level of renewable generation. The gross consumption is drawn from a narrow distribution representing the predictability of…
▽ More
A simplified model of a redundant power grid is used to study integration of fluctuating renewable generation. The grid consists of large number of generator and consumer nodes. The net power consumption is determined by the difference between the gross consumption and the level of renewable generation. The gross consumption is drawn from a narrow distribution representing the predictability of aggregated loads, and we consider two different distributions representing wind and solar resources. Each generator is connected to D consumers, and redundancy is built in by connecting R of these consumers to other generators. The lines are switchable so that at any instance each consumer is connected to a single generator. We explore the capacity of the renewable generation by determining the level of "firm" generation capacity that can be displaced for different levels of redundancy R. We also develop message-passing control algorithm for finding switch settings where no generator is overloaded.
△ Less
Submitted 12 September, 2009;
originally announced September 2009.
-
Non-Equilibrium Thermodynamics and Topology of Currents
Authors:
Vladimir Y. Chernyak,
Michael Chertkov,
Sergey V. Malinin,
Razvan Teodorescu
Abstract:
In many experimental situations, a physical system undergoes stochastic evolution which may be described via random maps between two compact spaces. In the current work, we study the applicability of large deviations theory to time-averaged quantities which describe such stochastic maps, in particular time-averaged currents and density functionals. We derive the large deviations principle for th…
▽ More
In many experimental situations, a physical system undergoes stochastic evolution which may be described via random maps between two compact spaces. In the current work, we study the applicability of large deviations theory to time-averaged quantities which describe such stochastic maps, in particular time-averaged currents and density functionals. We derive the large deviations principle for these quantities, as well as for global topological currents, and formulate variational, thermodynamic relations to establish large deviation properties of the topological currents. We illustrate the theory with a nontrivial example of a Heisenberg spin-chain with a topological driving of the Wess-Zumino type. The Cramér functional of the topological current is found explicitly in the instanton gas regime for the spin-chain model in the weak-noise limit. In the context of the Morse theory, we discuss a general reduction of continuous stochastic models with weak noise to effective Markov chains describing transitions between stable fixed points.
△ Less
Submitted 12 September, 2009; v1 submitted 20 July, 2009;
originally announced July 2009.
-
Message Passing for Optimization and Control of Power Grid: Model of Distribution System with Redundancy
Authors:
Lenka Zdeborová,
Aurélien Decelle,
Michael Chertkov
Abstract:
We use a power grid model with $M$ generators and $N$ consumption units to optimize the grid and its control. Each consumer demand is drawn from a predefined finite-size-support distribution, thus simulating the instantaneous load fluctuations. Each generator has a maximum power capability. A generator is not overloaded if the sum of the loads of consumers connected to a generator does not excee…
▽ More
We use a power grid model with $M$ generators and $N$ consumption units to optimize the grid and its control. Each consumer demand is drawn from a predefined finite-size-support distribution, thus simulating the instantaneous load fluctuations. Each generator has a maximum power capability. A generator is not overloaded if the sum of the loads of consumers connected to a generator does not exceed its maximum production. In the standard grid each consumer is connected only to its designated generator, while we consider a more general organization of the grid allowing each consumer to select one generator depending on the load from a pre-defined consumer-dependent and sufficiently small set of generators which can all serve the load. The model grid is interconnected in a graph with loops, drawn from an ensemble of random bipartite graphs, while each allowed configuration of loaded links represent a set of graph covering trees. Losses, the reactive character of the grid and the transmission-level connections between generators (and many other details relevant to realistic power grid) are ignored in this proof-of-principles study. We focus on the asymptotic limit and we show that the interconnects allow significant expansion of the parameter domains for which the probability of a generator overload is asymptotically zero. Our construction explores the formal relation between the problem of grid optimization and the modern theory of sparse graphical models. We also design heuristic algorithms that achieve the asymptotically optimal selection of loaded links. We conclude discussing the ability of this approach to include other effects, such as a more realistic modeling of the power grid and related optimization and control algorithms.
△ Less
Submitted 27 July, 2009; v1 submitted 2 April, 2009;
originally announced April 2009.
-
Planar Graphical Models which are Easy
Authors:
Vladimir Y. Chernyak,
Michael Chertkov
Abstract:
We describe a rich family of binary variables statistical mechanics models on a given planar graph which are equivalent to Gaussian Grassmann Graphical models (free fermions) defined on the same graph. Calculation of the partition function (weighted counting) for such a model is easy (of polynomial complexity) as reducible to evaluation of a Pfaffian of a matrix of size equal to twice the number o…
▽ More
We describe a rich family of binary variables statistical mechanics models on a given planar graph which are equivalent to Gaussian Grassmann Graphical models (free fermions) defined on the same graph. Calculation of the partition function (weighted counting) for such a model is easy (of polynomial complexity) as reducible to evaluation of a Pfaffian of a matrix of size equal to twice the number of edges in the graph. In particular, this approach touches upon Holographic Algorithms of Valiant and utilizes the Gauge Transformations discussed in our previous works.
△ Less
Submitted 29 September, 2010; v1 submitted 2 February, 2009;
originally announced February 2009.
-
Fermions and Loops on Graphs. II. Monomer-Dimer Model as Series of Determinants
Authors:
Vladimir Y. Chernyak,
Michael Chertkov
Abstract:
We continue the discussion of the fermion models on graphs that started in the first paper of the series. Here we introduce a Graphical Gauge Model (GGM) and show that : (a) it can be stated as an average/sum of a determinant defined on the graph over $\mathbb{Z}_{2}$ (binary) gauge field; (b) it is equivalent to the Monomer-Dimer (MD) model on the graph; (c) the partition function of the model…
▽ More
We continue the discussion of the fermion models on graphs that started in the first paper of the series. Here we introduce a Graphical Gauge Model (GGM) and show that : (a) it can be stated as an average/sum of a determinant defined on the graph over $\mathbb{Z}_{2}$ (binary) gauge field; (b) it is equivalent to the Monomer-Dimer (MD) model on the graph; (c) the partition function of the model allows an explicit expression in terms of a series over disjoint directed cycles, where each term is a product of local contributions along the cycle and the determinant of a matrix defined on the remainder of the graph (excluding the cycle). We also establish a relation between the MD model on the graph and the determinant series, discussed in the first paper, however, considered using simple non-Belief-Propagation choice of the gauge. We conclude with a discussion of possible analytic and algorithmic consequences of these results, as well as related questions and challenges.
△ Less
Submitted 20 November, 2008; v1 submitted 19 September, 2008;
originally announced September 2008.
-
Fermions and Loops on Graphs. I. Loop Calculus for Determinant
Authors:
Vladimir Y. Chernyak,
Michael Chertkov
Abstract:
This paper is the first in the series devoted to evaluation of the partition function in statistical models on graphs with loops in terms of the Berezin/fermion integrals. The paper focuses on a representation of the determinant of a square matrix in terms of a finite series, where each term corresponds to a loop on the graph. The representation is based on a fermion version of the Loop Calculus…
▽ More
This paper is the first in the series devoted to evaluation of the partition function in statistical models on graphs with loops in terms of the Berezin/fermion integrals. The paper focuses on a representation of the determinant of a square matrix in terms of a finite series, where each term corresponds to a loop on the graph. The representation is based on a fermion version of the Loop Calculus, previously introduced by the authors for graphical models with finite alphabets. Our construction contains two levels. First, we represent the determinant in terms of an integral over anti-commuting Grassman variables, with some reparametrization/gauge freedom hidden in the formulation. Second, we show that a special choice of the gauge, called BP (Bethe-Peierls or Belief Propagation) gauge, yields the desired loop representation. The set of gauge-fixing BP conditions is equivalent to the Gaussian BP equations, discussed in the past as efficient (linear scaling) heuristics for estimating the covariance of a sparse positive matrix.
△ Less
Submitted 20 November, 2008; v1 submitted 19 September, 2008;
originally announced September 2008.
-
Irreversible Monte Carlo Algorithms for Efficient Sampling
Authors:
Konstantin S. Turitsyn,
Michael Chertkov,
Marija Vucelja
Abstract:
Equilibrium systems evolve according to Detailed Balance (DB). This principe guided development of the Monte-Carlo sampling techniques, of which Metropolis-Hastings (MH) algorithm is the famous representative. It is also known that DB is sufficient but not necessary. We construct irreversible deformation of a given reversible algorithm capable of dramatic improvement of sampling from known distr…
▽ More
Equilibrium systems evolve according to Detailed Balance (DB). This principe guided development of the Monte-Carlo sampling techniques, of which Metropolis-Hastings (MH) algorithm is the famous representative. It is also known that DB is sufficient but not necessary. We construct irreversible deformation of a given reversible algorithm capable of dramatic improvement of sampling from known distribution. Our transformation modifies transition rates keeping the structure of transitions intact. To illustrate the general scheme we design an Irreversible version of Metropolis-Hastings (IMH) and test it on example of a spin cluster. Standard MH for the model suffers from the critical slowdown, while IMH is free from critical slowdown.
△ Less
Submitted 23 September, 2008; v1 submitted 4 September, 2008;
originally announced September 2008.
-
Provably efficient instanton search algorithm for LP decoding of LDPC codes over the BSC
Authors:
Shashi Kiran Chilappagari,
Michael Chertkov,
Bane Vasic
Abstract:
We consider Linear Programming (LP) decoding of a fixed Low-Density Parity-Check (LDPC) code over the Binary Symmetric Channel (BSC). The LP decoder fails when it outputs a pseudo-codeword which is not a codeword. We design an efficient algorithm termed the Instanton Search Algorithm (ISA) which, given a random input, generates a set of flips called the BSC-instanton. We prove that: (a) the LP d…
▽ More
We consider Linear Programming (LP) decoding of a fixed Low-Density Parity-Check (LDPC) code over the Binary Symmetric Channel (BSC). The LP decoder fails when it outputs a pseudo-codeword which is not a codeword. We design an efficient algorithm termed the Instanton Search Algorithm (ISA) which, given a random input, generates a set of flips called the BSC-instanton. We prove that: (a) the LP decoder fails for any set of flips with support vector including an instanton; (b) for any input, the algorithm outputs an instanton in the number of steps upper-bounded by twice the number of flips in the input. Repeated sufficient number of times, the ISA outcomes the number of unique instantons of different sizes.
△ Less
Submitted 2 September, 2008; v1 submitted 18 August, 2008;
originally announced August 2008.
-
Belief Propagation and Beyond for Particle Tracking
Authors:
Michael Chertkov,
Lukas Kroc,
Massimo Vergassola
Abstract:
We describe a novel approach to statistical learning from particles tracked while moving in a random environment. The problem consists in inferring properties of the environment from recorded snapshots. We consider here the case of a fluid seeded with identical passive particles that diffuse and are advected by a flow. Our approach rests on efficient algorithms to estimate the weighted number of…
▽ More
We describe a novel approach to statistical learning from particles tracked while moving in a random environment. The problem consists in inferring properties of the environment from recorded snapshots. We consider here the case of a fluid seeded with identical passive particles that diffuse and are advected by a flow. Our approach rests on efficient algorithms to estimate the weighted number of possible matchings among particles in two consecutive snapshots, the partition function of the underlying graphical model. The partition function is then maximized over the model parameters, namely diffusivity and velocity gradient. A Belief Propagation (BP) scheme is the backbone of our algorithm, providing accurate results for the flow parameters we want to learn. The BP estimate is additionally improved by incorporating Loop Series (LS) contributions. For the weighted matching problem, LS is compactly expressed as a Cauchy integral, accurately estimated by a saddle point approximation. Numerical experiments show that the quality of our improved BP algorithm is comparable to the one of a fully polynomial randomized approximation scheme, based on the Markov Chain Monte Carlo (MCMC) method, while the BP-based scheme is substantially faster than the MCMC scheme.
△ Less
Submitted 6 June, 2008;
originally announced June 2008.
-
Belief Propagation and Loop Series on Planar Graphs
Authors:
Michael Chertkov,
Vladimir Y. Chernyak,
Razvan Teodorescu
Abstract:
We discuss a generic model of Bayesian inference with binary variables defined on edges of a planar graph. The Loop Calculus approach of [1, 2] is used to evaluate the resulting series expansion for the partition function. We show that, for planar graphs, truncating the series at single-connected loops reduces, via a map reminiscent of the Fisher transformation [3], to evaluating the partition f…
▽ More
We discuss a generic model of Bayesian inference with binary variables defined on edges of a planar graph. The Loop Calculus approach of [1, 2] is used to evaluate the resulting series expansion for the partition function. We show that, for planar graphs, truncating the series at single-connected loops reduces, via a map reminiscent of the Fisher transformation [3], to evaluating the partition function of the dimer matching model on an auxiliary planar graph. Thus, the truncated series can be easily re-summed, using the Pfaffian formula of Kasteleyn [4]. This allows to identify a big class of computationally tractable planar models reducible to a dimer model via the Belief Propagation (gauge) transformation. The Pfaffian representation can also be extended to the full Loop Series, in which case the expansion becomes a sum of Pfaffian contributions, each associated with dimer matchings on an extension to a subgraph of the original graph. Algorithmic consequences of the Pfaffian representation, as well as relations to quantum and non-planar models, are discussed.
△ Less
Submitted 11 April, 2008; v1 submitted 27 February, 2008;
originally announced February 2008.
-
Exactness of Belief Propagation for Some Graphical Models with Loops
Authors:
Michael Chertkov
Abstract:
It is well known that an arbitrary graphical model of statistical inference defined on a tree, i.e. on a graph without loops, is solved exactly and efficiently by an iterative Belief Propagation (BP) algorithm convergent to unique minimum of the so-called Bethe free energy functional. For a general graphical model on a loopy graph the functional may show multiple minima, the iterative BP algorit…
▽ More
It is well known that an arbitrary graphical model of statistical inference defined on a tree, i.e. on a graph without loops, is solved exactly and efficiently by an iterative Belief Propagation (BP) algorithm convergent to unique minimum of the so-called Bethe free energy functional. For a general graphical model on a loopy graph the functional may show multiple minima, the iterative BP algorithm may converge to one of the minima or may not converge at all, and the global minimum of the Bethe free energy functional is not guaranteed to correspond to the optimal Maximum-Likelihood (ML) solution in the zero-temperature limit. However, there are exceptions to this general rule, discussed in \cite{05KW} and \cite{08BSS} in two different contexts, where zero-temperature version of the BP algorithm finds ML solution for special models on graphs with loops. These two models share a key feature: their ML solutions can be found by an efficient Linear Programming (LP) algorithm with a Totally-Uni-Modular (TUM) matrix of constraints. Generalizing the two models we consider a class of graphical models reducible in the zero temperature limit to LP with TUM constraints. Assuming that a gedanken algorithm, g-BP, funding the global minimum of the Bethe free energy is available we show that in the limit of zero temperature g-BP outputs the ML solution. Our consideration is based on equivalence established between gapless Linear Programming (LP) relaxation of the graphical model in the $T\to 0$ limit and respective LP version of the Bethe-Free energy minimization.
△ Less
Submitted 2 September, 2008; v1 submitted 2 January, 2008;
originally announced January 2008.
-
Non-equilibrium thermodynamics for functionals of current and density
Authors:
Vladimir Y. Chernyak,
Michael Chertkov,
Sergey V. Malinin,
Razvan Teodorescu
Abstract:
We study a stochastic many-body system maintained in an non-equilibrium steady state. Probability distribution functional of the time-integrated current and density is shown to attain a large-deviation form in the long-time asymptotics. The corresponding Current-Density Cramer Functional (CDCF) is explicitly derived for irreversible Langevin dynamics and discrete-space Markov chains. We also sho…
▽ More
We study a stochastic many-body system maintained in an non-equilibrium steady state. Probability distribution functional of the time-integrated current and density is shown to attain a large-deviation form in the long-time asymptotics. The corresponding Current-Density Cramer Functional (CDCF) is explicitly derived for irreversible Langevin dynamics and discrete-space Markov chains. We also show that the Cramer functionals of other linear functionals of density and current, like work generated by a force, are related to CDCF in a way reminiscent of variational relations between different thermodynamic potentials. The general formalism is illustrated with a model example.
△ Less
Submitted 20 December, 2007;
originally announced December 2007.
-
Loop Calculus and Belief Propagation for q-ary Alphabet: Loop Tower
Authors:
Vladimir Y. Chernyak,
Michael Chertkov
Abstract:
Loop Calculus introduced in [Chertkov, Chernyak '06] constitutes a new theoretical tool that explicitly expresses the symbol Maximum-A-Posteriori (MAP) solution of a general statistical inference problem via a solution of the Belief Propagation (BP) equations. This finding brought a new significance to the BP concept, which in the past was thought of as just a loop-free approximation. In this pa…
▽ More
Loop Calculus introduced in [Chertkov, Chernyak '06] constitutes a new theoretical tool that explicitly expresses the symbol Maximum-A-Posteriori (MAP) solution of a general statistical inference problem via a solution of the Belief Propagation (BP) equations. This finding brought a new significance to the BP concept, which in the past was thought of as just a loop-free approximation. In this paper we continue a discussion of the Loop Calculus. We introduce an invariant formulation which allows to generalize the Loop Calculus approach to a q-are alphabet.
△ Less
Submitted 9 September, 2008; v1 submitted 12 January, 2007;
originally announced January 2007.
-
Pseudo-codeword Landscape
Authors:
Michael Chertkov,
Mikhail Stepanov
Abstract:
We discuss the performance of Low-Density-Parity-Check (LDPC) codes decoded by means of Linear Programming (LP) at moderate and large Signal-to-Noise-Ratios (SNR). Utilizing a combination of the previously introduced pseudo-codeword-search method and a new "dendro" trick, which allows us to reduce the complexity of the LP decoding, we analyze the dependence of the Frame-Error-Rate (FER) on the S…
▽ More
We discuss the performance of Low-Density-Parity-Check (LDPC) codes decoded by means of Linear Programming (LP) at moderate and large Signal-to-Noise-Ratios (SNR). Utilizing a combination of the previously introduced pseudo-codeword-search method and a new "dendro" trick, which allows us to reduce the complexity of the LP decoding, we analyze the dependence of the Frame-Error-Rate (FER) on the SNR. Under Maximum-A-Posteriori (MAP) decoding the dendro-code, having only checks with connectivity degree three, performs identically to its original code with high-connectivity checks. For a number of popular LDPC codes performing over the Additive-White-Gaussian-Noise (AWGN) channel we found that either an error-floor sets at a relatively low SNR, or otherwise a transient asymptote, characterized by a faster decay of FER with the SNR increase, precedes the error-floor asymptote. We explain these regimes in terms of the pseudo-codeword spectra of the codes.
△ Less
Submitted 22 April, 2007; v1 submitted 12 January, 2007;
originally announced January 2007.
-
Growing condensate in two-dimensional turbulence
Authors:
M. Chertkov,
C. Connaughton,
I. Kolokolov,
V. Lebedev
Abstract:
We report a numerical study, supplemented by phenomenological explanations, of ``energy condensation'' in forced 2D turbulence in a biperiodic box. Condensation is a finite size effect which occurs after the standard inverse cascade reaches the size of the system. It leads to emergence of a coherent vortex dipole. We show that the time growth of the dipole is self-similar, and it contains most o…
▽ More
We report a numerical study, supplemented by phenomenological explanations, of ``energy condensation'' in forced 2D turbulence in a biperiodic box. Condensation is a finite size effect which occurs after the standard inverse cascade reaches the size of the system. It leads to emergence of a coherent vortex dipole. We show that the time growth of the dipole is self-similar, and it contains most of the injected energy, thus resulting in an energy spectrum which is markedly steeper than the standard $k^{-5/3}$ one. Once the coherent component is subtracted, however, the remaining fluctuations have a spectrum close to $k^{-1}$. The fluctuations decay slowly as the coherent part grows.
△ Less
Submitted 28 February, 2007; v1 submitted 22 December, 2006;
originally announced December 2006.
-
Statistics of Entropy Production in Linearized Stochastic System
Authors:
K. Turitsyn,
M. Chertkov,
V. Y. Chernyak,
A. Puliafito
Abstract:
We consider a wide class of linear stochastic problems driven off the equilibrium by a multiplicative asymmetric force. The force brakes detailed balance, maintained otherwise, thus producing entropy. The large deviation function of the entropy production in the system is calculated explicitly. The general result is illustrated using an example of a polymer immersed in a gradient flow and subjec…
▽ More
We consider a wide class of linear stochastic problems driven off the equilibrium by a multiplicative asymmetric force. The force brakes detailed balance, maintained otherwise, thus producing entropy. The large deviation function of the entropy production in the system is calculated explicitly. The general result is illustrated using an example of a polymer immersed in a gradient flow and subject to thermal fluctuations.
△ Less
Submitted 20 February, 2007; v1 submitted 21 September, 2006;
originally announced September 2006.
-
Loop Calculus Helps to Improve Belief Propagation and Linear Programming Decodings of Low-Density-Parity-Check Codes
Authors:
Michael Chertkov,
Vladimir Y. Chernyak
Abstract:
We illustrate the utility of the recently developed loop calculus for improving the Belief Propagation (BP) algorithm. If the algorithm that minimizes the Bethe free energy fails we modify the free energy by accounting for a critical loop in a graphical representation of the code. The log-likelihood specific critical loop is found by means of the loop calculus. The general method is tested using…
▽ More
We illustrate the utility of the recently developed loop calculus for improving the Belief Propagation (BP) algorithm. If the algorithm that minimizes the Bethe free energy fails we modify the free energy by accounting for a critical loop in a graphical representation of the code. The log-likelihood specific critical loop is found by means of the loop calculus. The general method is tested using an example of the Linear Programming (LP) decoding, that can be viewed as a special limit of the BP decoding. Considering the (155,64,20) code that performs over Additive-White-Gaussian-Noise channel we show that the loop calculus improves the LP decoding and corrects all previously found dangerous configurations of log-likelihoods related to pseudo-codewords with low effective distance, thus reducing the code's error-floor.
△ Less
Submitted 28 September, 2006;
originally announced September 2006.
-
Path-integral analysis of fluctuation theorems for general Langevin processes
Authors:
Vladimir Y. Chernyak,
Michael Chertkov,
Christopher Jarzynski
Abstract:
We examine classical, transient fluctuation theorems within the unifying framework of Langevin dynamics. We explicitly distinguish between the effects of non-conservative forces that violate detailed balance, and non-autonomous dynamics arising from the variation of an external parameter. When both these sources of nonequilibrium behavior are present, there naturally arise two distinct fluctuati…
▽ More
We examine classical, transient fluctuation theorems within the unifying framework of Langevin dynamics. We explicitly distinguish between the effects of non-conservative forces that violate detailed balance, and non-autonomous dynamics arising from the variation of an external parameter. When both these sources of nonequilibrium behavior are present, there naturally arise two distinct fluctuation theorems.
△ Less
Submitted 29 June, 2006; v1 submitted 18 May, 2006;
originally announced May 2006.
-
Loop series for discrete statistical models on graphs
Authors:
Michael Chertkov,
Vladimir Y. Chernyak
Abstract:
In this paper we present derivation details, logic, and motivation for the loop calculus introduced in \cite{06CCa}. Generating functions for three inter-related discrete statistical models are each expressed in terms of a finite series. The first term in the series corresponds to the Bethe-Peierls (Belief Propagation)-BP contribution, the other terms are labeled by loops on the factor graph. Al…
▽ More
In this paper we present derivation details, logic, and motivation for the loop calculus introduced in \cite{06CCa}. Generating functions for three inter-related discrete statistical models are each expressed in terms of a finite series. The first term in the series corresponds to the Bethe-Peierls (Belief Propagation)-BP contribution, the other terms are labeled by loops on the factor graph. All loop contributions are simple rational functions of spin correlation functions calculated within the BP approach. We discuss two alternative derivations of the loop series. One approach implements a set of local auxiliary integrations over continuous fields with the BP contribution corresponding to an integrand saddle-point value. The integrals are replaced by sums in the complimentary approach, briefly explained in \cite{06CCa}. A local gauge symmetry transformation that clarifies an important invariant feature of the BP solution, is revealed in both approaches. The partition function remains invariant while individual terms change under the gauge transformation. The requirement for all individual terms to be non-zero only for closed loops in the factor graph (as opposed to paths with loose ends) is equivalent to fixing the first term in the series to be exactly equal to the BP contribution. Further applications of the loop calculus to problems in statistical physics, computer and information sciences are discussed.
△ Less
Submitted 1 July, 2007; v1 submitted 7 March, 2006;
originally announced March 2006.
-
An Efficient Pseudo-Codeword Search Algorithm for Linear Programming Decoding of LDPC Codes
Authors:
Michael Chertkov,
Mikhail G. Stepanov
Abstract:
In Linear Programming (LP) decoding of a Low-Density-Parity-Check (LDPC) code one minimizes a linear functional, with coefficients related to log-likelihood ratios, over a relaxation of the polytope spanned by the codewords \cite{03FWK}. In order to quantify LP decoding, and thus to describe performance of the error-correction scheme at moderate and large Signal-to-Noise-Ratios (SNR), it is impo…
▽ More
In Linear Programming (LP) decoding of a Low-Density-Parity-Check (LDPC) code one minimizes a linear functional, with coefficients related to log-likelihood ratios, over a relaxation of the polytope spanned by the codewords \cite{03FWK}. In order to quantify LP decoding, and thus to describe performance of the error-correction scheme at moderate and large Signal-to-Noise-Ratios (SNR), it is important to study the relaxed polytope to understand better its vertexes, so-called pseudo-codewords, especially those which are neighbors of the zero codeword. In this manuscript we propose a technique to heuristically create a list of these neighbors and their distances. Our pseudo-codeword-search algorithm starts by randomly choosing the initial configuration of the noise. The configuration is modified through a discrete number of steps. Each step consists of two sub-steps. Firstly, one applies an LP decoder to the noise-configuration deriving a pseudo-codeword. Secondly, one finds configuration of the noise equidistant from the pseudo codeword and the zero codeword. The resulting noise configuration is used as an entry for the next step. The iterations converge rapidly to a pseudo-codeword neighboring the zero codeword. Repeated many times, this procedure is characterized by the distribution function (frequency spectrum) of the pseudo-codeword effective distance. The effective distance of the coding scheme is approximated by the shortest distance pseudo-codeword in the spectrum. The efficiency of the procedure is demonstrated on examples of the Tanner $[155,64,20]$ code and Margulis $p=7$ and $p=11$ codes (672 and 2640 bits long respectively) operating over an Additive-White-Gaussian-Noise (AWGN) channel.
△ Less
Submitted 4 July, 2007; v1 submitted 26 January, 2006;
originally announced January 2006.
-
Instanton analysis of Low-Density-Parity-Check codes in the error-floor regime
Authors:
M. G. Stepanov,
M. Chertkov
Abstract:
In this paper we develop instanton method introduced in [1], [2], [3] to analyze quantitatively performance of Low-Density-Parity-Check (LDPC) codes decoded iteratively in the so-called error-floor regime. We discuss statistical properties of the numerical instanton-amoeba scheme focusing on detailed analysis and comparison of two regular LDPC codes: Tanner's (155, 64, 20) and Margulis' (672, 33…
▽ More
In this paper we develop instanton method introduced in [1], [2], [3] to analyze quantitatively performance of Low-Density-Parity-Check (LDPC) codes decoded iteratively in the so-called error-floor regime. We discuss statistical properties of the numerical instanton-amoeba scheme focusing on detailed analysis and comparison of two regular LDPC codes: Tanner's (155, 64, 20) and Margulis' (672, 336, 16) codes. In the regime of moderate values of the signal-to-noise ratio we critically compare results of the instanton-amoeba evaluations against the standard Monte-Carlo calculations of the Frame-Error-Rate.
△ Less
Submitted 16 January, 2006;
originally announced January 2006.
-
Loop Calculus in Statistical Physics and Information Science
Authors:
Michael Chertkov,
Vladimir Y. Chernyak
Abstract:
Considering a discrete and finite statistical model of a general position we introduce an exact expression for the partition function in terms of a finite series. The leading term in the series is the Bethe-Peierls (Belief Propagation)-BP contribution, the rest are expressed as loop-contributions on the factor graph and calculated directly using the BP solution. The series unveils a small parame…
▽ More
Considering a discrete and finite statistical model of a general position we introduce an exact expression for the partition function in terms of a finite series. The leading term in the series is the Bethe-Peierls (Belief Propagation)-BP contribution, the rest are expressed as loop-contributions on the factor graph and calculated directly using the BP solution. The series unveils a small parameter that often makes the BP approximation so successful. Applications of the loop calculus in statistical physics and information science are discussed.
△ Less
Submitted 7 March, 2006; v1 submitted 20 January, 2006;
originally announced January 2006.
-
The error-floor of LDPC codes in the Laplacian channel
Authors:
M. G. Stepanov,
M. Chertkov
Abstract:
We analyze the performance of Low-Density-Parity-Check codes in the error-floor domain where the Signal-to-Noise-Ratio, s, is large, s >> 1. We describe how the instanton method of theoretical physics, recently adapted to coding theory, solves the problem of characterizing the error-floor domain in the Laplacian channel. An example of the (155,64,20) LDPC code with four iterations (each iteratio…
▽ More
We analyze the performance of Low-Density-Parity-Check codes in the error-floor domain where the Signal-to-Noise-Ratio, s, is large, s >> 1. We describe how the instanton method of theoretical physics, recently adapted to coding theory, solves the problem of characterizing the error-floor domain in the Laplacian channel. An example of the (155,64,20) LDPC code with four iterations (each iteration consisting of two semi-steps: from bits-to-checks and from checks-to-bits) of the min-sum decoding is discussed. A generalized computational tree analysis is devised to explain the rational structure of the leading instantons. The asymptotic for the symbol Bit-Error-Rate in the error-floor domain is comprised of individual instanton contributions, each estimated as ~ \exp(-l_{inst;L} s), where the effective distances, l_{inst;L}, of the the leading instantons are 7.6, 8.0 and 8.0 respectively. (The Hamming distance of the code is 20.) The analysis shows that the instantons are distinctly different from the ones found for the same coding/decoding scheme performing over the Gaussian channel. We validate instanton results against direct simulations and offer an explanation for remarkable performance of the instanton approximation not only in the extremal, s -> \infty, limit but also at the moderate s values of practical interest.
△ Less
Submitted 3 October, 2005; v1 submitted 11 July, 2005;
originally announced July 2005.
-
Diagnosis of weaknesses in modern error correction codes: a physics approach
Authors:
M. G. Stepanov,
V. Chernyak,
M. Chertkov,
B. Vasic
Abstract:
One of the main obstacles to the wider use of the modern error-correction codes is that, due to the complex behavior of their decoding algorithms, no systematic method which would allow characterization of the Bit-Error-Rate (BER) is known. This is especially true at the weak noise where many systems operate and where coding performance is difficult to estimate because of the diminishingly small…
▽ More
One of the main obstacles to the wider use of the modern error-correction codes is that, due to the complex behavior of their decoding algorithms, no systematic method which would allow characterization of the Bit-Error-Rate (BER) is known. This is especially true at the weak noise where many systems operate and where coding performance is difficult to estimate because of the diminishingly small number of errors. We show how the instanton method of physics allows one to solve the problem of BER analysis in the weak noise range by recasting it as a computationally tractable minimization problem.
△ Less
Submitted 1 June, 2005;
originally announced June 2005.
-
Statistics of Polymer Extension in a Random Flow with Mean Shear
Authors:
M. Chertkov,
I. Kolokolov,
V. Lebedev,
K. Turitsyn
Abstract:
Considering the dynamics of a polymer with finite extensibility placed in a chaotic flow with large mean shear, we explain how the statistics of polymer extension changes with Weissenberg number, ${\it Wi}$, defined as the product of the polymer relaxation time and the Lyapunov exponent of the flow. Four regimes, of the ${\it Wi}$ number, are identified. One below the coil-stretched transition a…
▽ More
Considering the dynamics of a polymer with finite extensibility placed in a chaotic flow with large mean shear, we explain how the statistics of polymer extension changes with Weissenberg number, ${\it Wi}$, defined as the product of the polymer relaxation time and the Lyapunov exponent of the flow. Four regimes, of the ${\it Wi}$ number, are identified. One below the coil-stretched transition and three above the coil-stretched transition. Specific emphasis is given to explaining these regimes in terms of the polymer dynamics.
△ Less
Submitted 28 November, 2004;
originally announced November 2004.
-
Tumbling of Polymers in a Random Flow with Mean Shear
Authors:
M. Chertkov,
I. Kolokolov,
V. Lebedev,
K. Turitsyn
Abstract:
A polymer placed in chaotic flow with large mean shear tumbles, making a-periodic flips. We describe the statistics of angular orientation, as well as of tumbling time (separating two subsequent flips) of polymers in this flow. The probability distribution function (PDF) of the polymer orientation is peaked around a shear-preferred direction. The tails of this angular PDF are algebraic. The PDF…
▽ More
A polymer placed in chaotic flow with large mean shear tumbles, making a-periodic flips. We describe the statistics of angular orientation, as well as of tumbling time (separating two subsequent flips) of polymers in this flow. The probability distribution function (PDF) of the polymer orientation is peaked around a shear-preferred direction. The tails of this angular PDF are algebraic. The PDF of the tumbling time, $τ$, has a maximum at the value estimated as inverse Lyapunov exponent of the flow. This PDF shows an exponential tail for large $τ$ and a small-$τ$ tail determined by the simultaneous statistics of velocity PDF.
△ Less
Submitted 28 November, 2004;
originally announced November 2004.
-
Probability of anomalously large Bit-Error-Rate in long haul optical transmission
Authors:
Vladimir Chernyak,
Michael Chertkov,
Igor Kolokolov,
Vladimir Lebedev
Abstract:
We consider a linear model of optical pulse transmission through fiber with birefringent disorder in the presence of amplifier noise. Both disorder and noise are assumed to be weak, i.e. the average bit-error rate (BER) is small. The probability of rare violent events leading to the values of BER much larger than its typical value is estimated. We show that the probability distribution has a lon…
▽ More
We consider a linear model of optical pulse transmission through fiber with birefringent disorder in the presence of amplifier noise. Both disorder and noise are assumed to be weak, i.e. the average bit-error rate (BER) is small. The probability of rare violent events leading to the values of BER much larger than its typical value is estimated. We show that the probability distribution has a long algebraic tail.
△ Less
Submitted 18 March, 2003; v1 submitted 4 March, 2003;
originally announced March 2003.
-
Polymer Stretching by Turbulence
Authors:
Michael Chertkov
Abstract:
The stretching of a polymer chain by a large scale chaotic flow is considered. The steady state which emerges as a balance of the turbulent stretching and anharmonic resistance of the chain is quantitatively described, i.e. the dependency on the flow parameters (Lyapunov exponent statistics) and the chain characteristics (the number of beads and the inter-bead elastic potential) is made explicit…
▽ More
The stretching of a polymer chain by a large scale chaotic flow is considered. The steady state which emerges as a balance of the turbulent stretching and anharmonic resistance of the chain is quantitatively described, i.e. the dependency on the flow parameters (Lyapunov exponent statistics) and the chain characteristics (the number of beads and the inter-bead elastic potential) is made explicit.
△ Less
Submitted 1 March, 2000; v1 submitted 8 November, 1999;
originally announced November 1999.
-
Passive advection in nonlinear medium
Authors:
Michael Chertkov
Abstract:
Forced advection of passive tracer, $θ$, in nonlinear relaxational medium by large scale (Batchelor problem) incompressible velocity field at scales less than the correlation length of the flow and larger than the diffusion scale is considered. Effective theory explaining small scale scalar fluctuations is proven to be linear, asymptotic free (downscales from the scale of the pumping) and univer…
▽ More
Forced advection of passive tracer, $θ$, in nonlinear relaxational medium by large scale (Batchelor problem) incompressible velocity field at scales less than the correlation length of the flow and larger than the diffusion scale is considered. Effective theory explaining small scale scalar fluctuations is proven to be linear, asymptotic free (downscales from the scale of the pumping) and universal. Only three parameters are required to decribe exhaustively the small scale statistics of scalar difference: two velocity-dependent ones, average and dispersion ($\barλ$ and $Δ$ respectively) of the exponential stretching rate of a trial line element, and $α$, standing for average rate of linear damping of small scale scalar fluctuations. $α$ is an explicit functional of potential chracterized medium nonlinearity and amplitude of $θ^{2}$ flux pumped into the system. Structure functions show an extremely anomalous, intermittent behavior: $<|δθ_{r}|^{q}> \sim r^{ξ_{q}}, ξ_{q} = \min {q,\sqrt{[ \frac{\barλ}Δ] ^{2} + \frac{2αq}Δ} - \frac{\barλ}Δ}$. No dissipative anomaly is found in the problem.
△ Less
Submitted 10 September, 1998;
originally announced September 1998.
-
On how a joint interaction of two innocent partners (smooth advection & linear damping) produces a strong intermittency
Authors:
Michael Chertkov
Abstract:
Forced advection of passive scalar by a smooth $d$-dimensional incompressible velocity in the presence of a linear damping is studied. Acting separately advection and dumping do not lead to an essential intermittency of the steady scalar statistics, while being mixed together produce a very strong non-Gaussianity in the convective range: $q$-th (positive) moment of the absolute value of scalar d…
▽ More
Forced advection of passive scalar by a smooth $d$-dimensional incompressible velocity in the presence of a linear damping is studied. Acting separately advection and dumping do not lead to an essential intermittency of the steady scalar statistics, while being mixed together produce a very strong non-Gaussianity in the convective range: $q$-th (positive) moment of the absolute value of scalar difference, $<|θ(t;{\bf r})-θ(t;0)|^{q}> $ is proportional to $r^{ξ_{q}}$, $ξ_{q}=\sqrt{d^{2}/4+αdq/[ (d-1)D]}-d/2$, where $α/D$ measures the rate of the damping in the units of the stretching rate. Probability density function (PDF) of the scalar difference is also found.
△ Less
Submitted 5 March, 1998;
originally announced March 1998.