-
Analysis and Optimization of Probabilities of Beneficial Mutation and Crossover Recombination in a Hamming Space
Authors:
Roman V. Belavkin
Abstract:
Inspired by Fisher's geometric approach to study beneficial mutations, we analyse probabilities of beneficial mutation and crossover recombination of strings in a general Hamming space with arbitrary finite alphabet. Mutations and recombinations that reduce the distance to an optimum are considered as beneficial. Geometric and combinatorial analysis is used to derive closed-form expressions for tr…
▽ More
Inspired by Fisher's geometric approach to study beneficial mutations, we analyse probabilities of beneficial mutation and crossover recombination of strings in a general Hamming space with arbitrary finite alphabet. Mutations and recombinations that reduce the distance to an optimum are considered as beneficial. Geometric and combinatorial analysis is used to derive closed-form expressions for transition probabilities between spheres around an optimum giving a complete description of Markov evolution of distances from an optimum over multiple generations. This paves the way for optimization of parameters of mutation and recombination operators. Here we derive optimality conditions for mutation and recombination radii maximizing the probabilities of mutation and crossover into the optimum. The analysis highlights important differences between these evolutionary operators. While mutation can potentially reach any part of the search space, the probability of beneficial mutation decreases with distance to an optimum, and the optimal mutation radius or rate should also decrease resulting in a slow-down of evolution near the optimum. Crossover recombination, on the other hand, acts in a subspace of the search space defined by the current population of strings. However, probabilities of beneficial and deleterious crossover are balanced, and their characteristics, such as variance, are translation invariant in a Hamming space, suggesting that recombination may complement mutation and boost the rate of evolution near the optimum.
△ Less
Submitted 13 June, 2025;
originally announced June 2025.
-
Relation between the Kantorovich-Wasserstein metric and the Kullback-Leibler divergence
Authors:
Roman V. Belavkin
Abstract:
We discuss a relation between the Kantorovich-Wasserstein (KW) metric and the Kullback-Leibler (KL) divergence. The former is defined using the optimal transport problem (OTP) in the Kantorovich formulation. The latter is used to define entropy and mutual information, which appear in variational problems to find optimal channel (OCP) from the rate distortion and the value of information theories.…
▽ More
We discuss a relation between the Kantorovich-Wasserstein (KW) metric and the Kullback-Leibler (KL) divergence. The former is defined using the optimal transport problem (OTP) in the Kantorovich formulation. The latter is used to define entropy and mutual information, which appear in variational problems to find optimal channel (OCP) from the rate distortion and the value of information theories. We show that OTP is equivalent to OCP with one additional constraint fixing the output measure, and therefore OCP with constraints on the KL-divergence gives a lower bound on the KW-metric. The dual formulation of OTP allows us to explore the relation between the KL-divergence and the KW-metric using decomposition of the former based on the law of cosines. This way we show the link between two divergences using the variational and geometric principles.
△ Less
Submitted 24 August, 2019;
originally announced August 2019.
-
Asymmetric Topologies on Statistical Manifolds
Authors:
Roman V. Belavkin
Abstract:
Asymmetric information distances are used to define asymmetric norms and quasimetrics on the statistical manifold and its dual space of random variables. Quasimetric topology, generated by the Kullback-Leibler (KL) divergence, is considered as the main example, and some of its topological properties are investigated.
Asymmetric information distances are used to define asymmetric norms and quasimetrics on the statistical manifold and its dual space of random variables. Quasimetric topology, generated by the Kullback-Leibler (KL) divergence, is considered as the main example, and some of its topological properties are investigated.
△ Less
Submitted 6 October, 2015; v1 submitted 29 July, 2015;
originally announced July 2015.
-
Asymmetry of Risk and Value of Information
Authors:
Roman V. Belavkin
Abstract:
The von Neumann and Morgenstern theory postulates that rational choice under uncertainty is equivalent to maximization of expected utility (EU). This view is mathematically appealing and natural because of the affine structure of the space of probability measures. Behavioural economists and psychologists, on the other hand, have demonstrated that humans consistently violate the EU postulate by swi…
▽ More
The von Neumann and Morgenstern theory postulates that rational choice under uncertainty is equivalent to maximization of expected utility (EU). This view is mathematically appealing and natural because of the affine structure of the space of probability measures. Behavioural economists and psychologists, on the other hand, have demonstrated that humans consistently violate the EU postulate by switching from risk-averse to risk-taking behaviour. This paradox has led to the development of descriptive theories of decisions, such as the celebrated prospect theory, which uses an $S$-shaped value function with concave and convex branches explaining the observed asymmetry. Although successful in modelling human behaviour, these theories appear to contradict the natural set of axioms behind the EU postulate. Here we show that the observed asymmetry in behaviour can be explained if, apart from utilities of the outcomes, rational agents also value information communicated by random events. We review the main ideas of the classical value of information theory and its generalizations. Then we prove that the value of information is an $S$-shaped function, and that its asymmetry does not depend on how the concept of information is defined, but follows only from linearity of the expected utility. Thus, unlike many descriptive and `non-expected' utility theories that abandon the linearity (i.e. the `independence' axiom), we formulate a rigorous argument that the von Neumann and Morgenstern rational agents should be both risk-averse and risk-taking if they are not indifferent to information.
△ Less
Submitted 19 May, 2014;
originally announced May 2014.
-
Monotonicity of Fitness Landscapes and Mutation Rate Control
Authors:
Roman V. Belavkin,
Alastair Channon,
Elizabeth Aston,
John Aston,
Rok Krasovec,
Christopher G. Knight
Abstract:
A common view in evolutionary biology is that mutation rates are minimised. However, studies in combinatorial optimisation and search have shown a clear advantage of using variable mutation rates as a control parameter to optimise the performance of evolutionary algorithms. Much biological theory in this area is based on Ronald Fisher's work, who used Euclidean geometry to study the relation betwe…
▽ More
A common view in evolutionary biology is that mutation rates are minimised. However, studies in combinatorial optimisation and search have shown a clear advantage of using variable mutation rates as a control parameter to optimise the performance of evolutionary algorithms. Much biological theory in this area is based on Ronald Fisher's work, who used Euclidean geometry to study the relation between mutation size and expected fitness of the offspring in infinite phenotypic spaces. Here we reconsider this theory based on the alternative geometry of discrete and finite spaces of DNA sequences. First, we consider the geometric case of fitness being isomorphic to distance from an optimum, and show how problems of optimal mutation rate control can be solved exactly or approximately depending on additional constraints of the problem. Then we consider the general case of fitness communicating only partial information about the distance. We define weak monotonicity of fitness landscapes and prove that this property holds in all landscapes that are continuous and open at the optimum. This theoretical result motivates our hypothesis that optimal mutation rate functions in such landscapes will increase when fitness decreases in some neighbourhood of an optimum, resembling the control functions derived in the geometric case. We test this hypothesis experimentally by analysing approximately optimal mutation rate control functions in 115 complete landscapes of binding scores between DNA sequences and transcription factors. Our findings support the hypothesis and find that the increase of mutation rate is more rapid in landscapes that are less monotonic (more rugged). We discuss the relevance of these findings to living organisms.
△ Less
Submitted 24 August, 2019; v1 submitted 3 September, 2012;
originally announced September 2012.
-
Optimal measures and Markov transition kernels
Authors:
Roman V. Belavkin
Abstract:
We study optimal solutions to an abstract optimization problem for measures, which is a generalization of classical variational problems in information theory and statistical physics. In the classical problems, information and relative entropy are defined using the Kullback-Leibler divergence, and for this reason optimal measures belong to a one-parameter exponential family. Measures within such a…
▽ More
We study optimal solutions to an abstract optimization problem for measures, which is a generalization of classical variational problems in information theory and statistical physics. In the classical problems, information and relative entropy are defined using the Kullback-Leibler divergence, and for this reason optimal measures belong to a one-parameter exponential family. Measures within such a family have the property of mutual absolute continuity. Here we show that this property characterizes other families of optimal positive measures if a functional representing information has a strictly convex dual. Mutual absolute continuity of optimal probability measures allows us to strictly separate deterministic and non-deterministic Markov transition kernels, which play an important role in theories of decisions, estimation, control, communication and computation. We show that deterministic transitions are strictly sub-optimal, unless information resource with a strictly convex dual is unconstrained. For illustration, we construct an example where, unlike non-deterministic, any deterministic kernel either has negatively infinite expected utility (unbounded expected error) or communicates infinite information.
△ Less
Submitted 5 September, 2012; v1 submitted 1 December, 2010;
originally announced December 2010.
-
Conservation Law of Utility and Equilibria in Non-Zero Sum Games
Authors:
Roman V. Belavkin
Abstract:
This short note demonstrates how one can define a transformation of a non-zero sum game into a zero sum, so that the optimal mixed strategy achieving equilibrium always exists. The transformation is equivalent to introduction of a passive player into a game (a player with a singleton set of pure strategies), whose payoff depends on the actions of the active players, and it is justified by the law…
▽ More
This short note demonstrates how one can define a transformation of a non-zero sum game into a zero sum, so that the optimal mixed strategy achieving equilibrium always exists. The transformation is equivalent to introduction of a passive player into a game (a player with a singleton set of pure strategies), whose payoff depends on the actions of the active players, and it is justified by the law of conservation of utility in a game. In a transformed game, each participant plays against all other players, including the passive player. The advantage of this approach is that the transformed game is zero-sum and has an equilibrium solution. The optimal strategy and the value of the new game, however, can be different from strategies that are rational in the original game. We demonstrate the principle using the Prisoner's Dilemma example.
△ Less
Submitted 13 October, 2010; v1 submitted 12 October, 2010;
originally announced October 2010.