Adaptation, fitness landscape learning and fast evolution

John Reinitz; Sergey Vakulenko; Dmitri Grigoriev; Andreas Weber

doi:10.12688/f1000research.18575.2

Research Article

Revised


[version 2; peer review: 2 approved]

John Reinitz¹, Sergey Vakulenko², Dmitri Grigoriev³, Andreas Weber⁴

PUBLISHED 13 Sep 2019


¹ Departments of Statistics, Ecology and Evolution, Molecular Genetics and Cell Biology, University of Chicago, Chicago, IL, USA
² Saint Petersburg National Research University of Information Technologies, Mechanics and Optics, Saint Petersburg, Russian Federation
³ CNRS, Mathématiques, Université de Lille, Villeneuve d'Ascq, France
⁴ Department of Computer Science, University of Bonn, Bonn, Germany

John Reinitz
Roles: Conceptualization, Methodology, Writing – Review & Editing

Sergey Vakulenko
Roles: Conceptualization, Formal Analysis, Investigation, Methodology, Software, Writing – Original Draft Preparation, Writing – Review & Editing

Dmitri Grigoriev
Roles: Formal Analysis, Methodology, Validation, Writing – Review & Editing

Andreas Weber
Roles: Software, Visualization, Writing – Review & Editing

Abstract

We consider the evolution of a large population, where the fitness of each organism is defined by many phenotypic traits. These traits result from the expression of many genes. Under some assumptions on fitness we prove that such model organisms are capable, to some extent, of recognizing the fitness landscape. That fitness landscape learning sharply reduces the number of mutations needed for adaptation. Moreover, this learning increases phenotype robustness with respect to mutations, i.e., canalizes the phenotype. We show that learning and canalization work only when evolution is gradual. Organisms can be adapted to many constraints associated with a hard environment, if that environment becomes harder step by step. Our results explain why evolution can involve genetic changes of a relatively large effect and why the total number of changes is surprisingly small.

Keywords

evolution, gene networks, fitness landscape learning

Corresponding authors: John Reinitz, Sergey Vakulenko

Competing interests: No competing interests were disclosed.

Grant information: The second author was supported by the grant of Russian Ministry of Education, 2012-1.2.1-12-000-1013-016. Additionally, the second author was financially supported by Government of Russian Federation, Grant 08-08.

D. Grigoriev is grateful to the grant RSF 16-11-10075 and to both MCCME and MPI für Mathematik for wonderful working conditions and inspiring atmosphere.

J. Reinitz and S. Vakulenko were supported by US NIH grant RO1 OD010936 (formerly RO1 RR07801).
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2019 Reinitz J et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Reinitz J, Vakulenko S, Grigoriev D and Weber A. Adaptation, fitness landscape learning and fast evolution [version 2; peer review: 2 approved]. F1000Research 2019, 8:358 (https://doi.org/10.12688/f1000research.18575.2) First published: 01 Apr 2019, 8:358 (https://doi.org/10.12688/f1000research.18575.1) Latest published: 13 Sep 2019, 8:358 (https://doi.org/10.12688/f1000research.18575.2)

Revised Amendments from Version 1

To respond to the reviewers' comments, in this revision we extended the discussion. For the key point that the network modifies the thresholds by a simple feedback mechanism, we point to regulation via enhancers.
In order to explain the regulation in more detail, we added a new Section 2.5 on "Gene regulation networks". Two new figures (Figure 1 and Figure 2) show results of numerical simulations based on the "strong selection weak mutation" (SSWM) algorithm.
In the discussion of Theorem 3.2 we added a paragraph on rare mutants that gives some intuition for the proof of Theorem 3.2, which is based on estimates of the accuracy of the Nagylaki equations. For the main Lemma 4.5, used in the proof of Theorem 3.2, we added a transparent interpretation.
Additional references pointed out by the reviewers are incorporated into version 2 of the paper.


1 Introduction

A central idea of modern biology is that evolution proceeds by mutation and selection. This process may be represented as a walk in a fitness landscape leading to fitness increase and slow adaptation¹. According to classical ideas this walk can be considered a sequence of small random steps with small phenotypic effects. Nevertheless, there is a limited amount of experimental support for this idea² and some experimental evidence that evolution can involve genetic changes of a relatively large effect and that the total number of changes is surprisingly small³. Another intriguing fact is that organisms are capable of making adaptive predictions of environmental changes⁴.

To explain those facts, new evolutionary concepts have been suggested (see the review 5 and references therein). The main idea is that a population can “learn” (recognize) fitness landscapes⁵⁻⁷. The approach developed in these works is a generalization of ideas from machine learning in which learning (regression to data) is viewed as selection and generalization (interpolation or extrapolation) is viewed as adaptation.

A mathematical basis for the investigation of evolution learning problems has been developed by 8. However, this work uses a simplified model, where organisms are represented as Boolean circuits seeking an “ideal answer” to environmental challenges. These circuits involve N_g Boolean variables that can be interpreted as genes, and the ideal circuit answer maximizes the fitness. A similar model was studied numerically by 7 to confirm the theory of “facilitated variation”, which explains the appearance of genetic variations that can lead to large phenotypic ones. In the work by 9 a theory of the evolution of these Boolean circuits was advanced. It was shown that, under some conditions (weak selection, see 10), a polynomially large population over polynomially many generations (polynomial in N_g) will end up almost surely consisting exclusively of assignments that satisfy all constraints. This theorem can shed light on the problem of the evolution of complex adaptations, since that satisfiability problem can be considered as a rough mathematical model of adaptation to many constraints.

In 6 it is shown that, in the regime of weak selection, population evolution can be described by the multiplicative weight update algorithm (MWUA), which is a powerful tool, well known in theoretical computer science and a generalization of such famous algorithms as Adaboost and others¹¹. Note that in 6 infinitely large populations are investigated whereas the results of 9 hold only for finite populations and take into account genetic drift.

Evolutionary computation (EC) problems have recently been considered in many papers¹²⁻¹⁶, mainly for artificial test fitness functions such as OneMax or LeadingOnes (for an overview of EC problems, see 17).

In this paper, we investigate adaptation and the fitness landscape learning problem for a more realistic fitness function. This fitness can model adaptation for insects and is connected with a fundamental hard combinatorial problem: K-SAT.

The main results can be outlined as follows. We show that, in a fixed environment, genes can serve as learners in the machine learning sense. Indeed, if an organism has survived for a long period, this fact alone constitutes important information, which can be used. The biological interpretation of this fact is simple: if a population is large enough and mutations are sufficiently rare, deleterious mutations are eliminated by purifying selection. Hence, those non-neutral mutant alleles which have become fixed in natural populations will, with probability close to 1, be adaptive and cause a positive increment of fitness (see Theorem 3.1 and Theorem 3.2 in Subsection 3.1 and Subsection 3.2). We obtain mathematical results that allow us to estimate the reduction in the number of mutations due to this landscape learning procedure. Learning can sharply reduce the number of mutations needed to form a phenotypic trait useful for adaptation, which is consistent with the experimental data mentioned above (see 3).

Another important result is as follows. We estimate the accuracy of the fundamental Nagylaki equations⁶,¹⁰ for a realistic population model, where the population size is bounded and a non-zero mutation rate is taken into account (in the case of asexual reproduction). Those accuracy estimates hold for all possible values of mutation rates and population sizes.

2 Model

In this section, we describe our model and mathematical approach.

2.1 Genome

We assume that the genotype can be described by Boolean strings of length N_g, where N_g is the number of genes. Then

s = (s_{1}, s_{2}, \dots, s_{N_{g}}), s_{i} \in S = {0, 1}, s \in S^{N_{g}}, (2.1)

where s_i = 1 means that gene i is activated (switched on) and s_i = 0 means that it is repressed (switched off). The correspondence between the Boolean hypercube and genotypes is considered, for example, in 12.

2.2 Phenotypic traits

Although phenotype is controlled by genes, it is also influenced by environmental conditions and various epigenetic processes. In this paper, we suppose that phenotypic traits are controlled by genotype only. We consider levels f_j of expressions of those traits as real variables in the interval (0, 1). Then the vector f = (f₁, . . . , f_{N_b}) can be considered to represent the organismal phenotype. We suppose that

f_{j} = f_{j} (s), j = 1, \dots, N_{b}, (2.2)

where f_j ∈ (0, 1) is a real valued function of the Boolean string s, the genotype.

Only some of the s_i are involved in f_j. Namely, for each j we have a set of indices K_j = {i₁, i₂, . . ., i_{n_j}} such that f_j depends on s_i with i ∈ K_j, so that

f_{j} (s) = f_{j} (s_{i_{1}}, s_{i_{2}}, \dots, s_{i_{n_{j}}}),

where i_l ∈ K_j and n_j is the number of genes involved in the control of the trait expression.

The representation of phenotype by the quantities f_j is suggestive of quantitative traits because the f_j are real valued. The limiting values of 0 or 1 suggest another interpretation, however, in terms of cell type. Multicellular organisms consist of cells of different types. One can suppose that the organismal phenotype is defined completely by the corresponding cell pattern. The cell type j is determined by morphogenes, which can be identified as gene products or signaling molecules that can change cell type (or genes that code for signaling molecules that can determine cell types or cell-cell interactions and then finally the cell pattern). The morphogene activity is defined by (2.2).

We further suppose:

Assumption M. Assume activities f_j have the following properties.

The sets K_j are independent uniformly random subsets of S_g = {1, . . . N_g}

K_{j} = {i_{1}, \dots, i_{n_{j}}}, i_{l} \in S_{g}, l = 1, \dots, n_{j} . (2.3)

We denote the total number of genes involved in regulation of all f_j by N_r, where N_{r} = \sum_{j} n_{j} \leq K N_{b}.

Assumption M implies that the genetic control of the phenotype is organized, in a sense, randomly, and that only a portion of the full set of genes controls phenotypic traits. That modularity of gene control is well known from experimental data (see 18, 19) and for evolution computation problems it was studied, for example, in 16.

Consider an example in which assumption M holds and the expression saturates, inspired by earlier work²⁰,²¹. Let

f_{j} = σ (\sum_{i = 1}^{N_{g}} w_{j i} s_{i} - h_{j}), (2.4)

where j = 1, . . . , N_b. Here σ(z) is a sigmoidal function of real z such that

σ (+ \infty) = 1, σ (- \infty) = 0, σ^{'} (z) > 0 \forall z (2.5)

and w_ji, h_j are some coefficients (their meaning will be explained below). As an example, we can take σ(z) = (1 + exp(−bz))⁻¹, where b > 0 is a sharpness parameter. Note that for large b this sigmoidal function tends to the step function and for b = +∞ our model becomes a Boolean one. The parameters h_j define thresholds for trait expression²⁰. The relation (2.4) can be interpreted as a simple mathematical model for quantitative trait locus (QTL) action.

To understand the role of h_j, consider a trait f_j and suppose that for a well adapted organism f_j ≈ 1. Let, for simplicity, w_ji take the values 1, 0, or −1. Then the parameter h_j defines how many genes involved in the control of the f_j expression should be activators and how many should be repressors. Let the numbers of activator and repressor genes be n_j^+ and n_j^-, respectively.

Then f_j ≈ 1 if n_j^+ − n_j^- ≫ h_j.

One can suppose that h_j describes a direct influence of the environment on phenotype, such as stress, that can exert epigenetic effects. In Section 2.6, using data from 22, we will show that the model defined by (2.4) is capable of describing the main topological characteristics of empirically observed fitness functions in the case of mimicry, camouflage and thermoregulation for insects.

Let us introduce the matrix W of size N_b × N_g with the entries w_ji. The coefficients w_ji determine the effects of terminal differentiation genes (see 23), and hence encode the genotype-phenotype map. We assume that the coefficients w_ji are random, and that each of the events w_ji > 0 and w_ji < 0 has probability β/(2N_g), where β > 0 is a parameter. This quantity β ≪ N_g defines the genetic redundancy, i.e., the average number of genes that control a trait. Note that for large β ≫ 1 one has n_j < 4β with a probability Pr_β that is exponentially close to 1: Pr_β > 1 − exp(−0.1β); thus, the numbers n_j are bounded.
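The trait model (2.4) with a sparse random matrix can be sketched in a few lines. This is a minimal illustration, not the authors' simulation code: the function names, the logistic choice of σ, the value of the sharpness b, and the reading of N as N_g in the sparsity probability are all assumptions made for the example.

```python
import math
import random

def random_weights(n_traits, n_genes, beta, rng):
    """Sparse random matrix: each w_ji is +1 or -1 with probability
    beta / (2 * n_genes) each, and 0 otherwise (assumed reading of the text)."""
    p = beta / (2.0 * n_genes)
    rows = []
    for _ in range(n_traits):
        row = []
        for _ in range(n_genes):
            u = rng.random()
            row.append(1 if u < p else (-1 if u < 2 * p else 0))
        rows.append(row)
    return rows

def sigmoid(z, b):
    """Logistic sigmoid with sharpness b, written to avoid overflow."""
    x = b * z
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)

def traits(s, w, h, b=5.0):
    """Trait expressions f_j = sigma(sum_i w_ji * s_i - h_j), as in (2.4)."""
    return [sigmoid(sum(wji * si for wji, si in zip(row, s)) - hj, b)
            for row, hj in zip(w, h)]

rng = random.Random(0)
n_genes, n_traits, beta = 200, 50, 4
w = random_weights(n_traits, n_genes, beta, rng)
s = [rng.randint(0, 1) for _ in range(n_genes)]
f = traits(s, w, h=[0.5] * n_traits)
# n_j: number of genes that actually control trait j (expected value: beta)
n_j = [sum(1 for x in row if x != 0) for row in w]
print(max(n_j), min(f), max(f))
```

Increasing b pushes every f_j towards 0 or 1, which corresponds to the Boolean limit mentioned in the text.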

2.3 Fitness

We know little about the details of how fitness relates to the phenotype of multicellular organisms, and for that reason classic neo-Darwinian theory takes fitness to be a function of genotype. Some models that take account of epistasis have been proposed²⁴. The random field models assign fitness values to genotypes independently from a fixed probability distribution. They are close to the mutation-selection models introduced by 25, and can be called House of Cards (HoC) models. The best known model of this kind is the NK model introduced by Kauffman and Weinberger²⁶, where each locus interacts with K other loci. Rough Mount Fuji (RMF) models are obtained by combining a random HoC landscape with an additive landscape model²⁷. In evolutionary computation (EC), some artificial fitness models, for example OneMax and LeadingOnes, have been used to test evolutionary algorithms (see, for example, 15).

In this work, we use the classical approach of R. Fisher by introducing an explicit representation of phenotype, f, and allow it to determine fitness through interaction with an environment b. That is, we assume that the phenotype is completely determined by the phenotype trait expression, and thus the fitness depends on the genotype s via f_j.

We express the relative fitness F and its dependence on the environment b through an auxiliary function W by the relation

F (s, b) = K_{F} \exp (W (s, b)), (2.6)

where K_F is a positive constant and b = (b₁,..., b_{N_b}) is a vector of coefficients b_j. Below we refer to W as a fitness potential, and we assume that

W (s, b) = \sum_{j = 1}^{N_{b}} b_{j} f_{j} (s) . (2.7)

Sometimes, if the parameter b is fixed, we shall omit the corresponding argument in notation for W and F.
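Relations (2.6) and (2.7) translate directly into code. The sketch below is illustrative only; the function names, the default K_F = 1 and the three-trait example values are assumptions chosen for the example.

```python
import math

def fitness_potential(f, b):
    """W(s, b) = sum_j b_j * f_j(s), as in (2.7); f holds the trait values f_j(s)."""
    return sum(bj * fj for bj, fj in zip(b, f))

def fitness(f, b, k_f=1.0):
    """F(s, b) = K_F * exp(W(s, b)), as in (2.6)."""
    return k_f * math.exp(fitness_potential(f, b))

# Hypothetical environment: traits 1 and 2 are beneficial, trait 3 is harmful.
b = [1.0, 1.0, -1.0]
well_adapted = [1.0, 1.0, 0.0]    # expresses exactly the beneficial traits
poorly_adapted = [0.0, 0.0, 1.0]  # expresses only the harmful trait

print(fitness(well_adapted, b), fitness(poorly_adapted, b))
```

Because F is an exponential of a sum, each trait contributes a multiplicative factor exp(b_j f_j) to the fitness.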

We consider fitness as a numerical measure of interactions between the phenotype and an environment. For a fixed environment, this idea gives us the fitness of classical population genetics. A part of the fitness, however, depends on the organism developing properly and for now we represent it as independent of the environment, although we are aware that this is not always the case. Note that some coefficients b_j may be negative and others may be positive, and that the model (2.7) can describe gene epistatic effects via dependence of f_j on s if f_j are nonlinear in s.

The expression (2.7) can serve as a rough approximation of the fitness function in the case of insects such as grasshoppers or fruit flies. In fact, important factors that determine insect survival are thermoregulation, mimicry and camouflage levels¹⁸,²²,²⁸. All those factors depend on the colour pigmentation pattern. Black-and-white pigmentation patterns can be roughly described by vectors f = (f₁, f₂,..., f_{N_b}), where f_j ≈ 1 and f_j ≈ 0 mean that the cell j is black or white, respectively. Then thermoregulation depends on $\sum_{j} f_{j}$. The mimicry level can be approximately defined by the expression $\sum_{j} | f_{j} - f_{j}^{*} |$, where f^* is a target pattern corresponding to the insect to be mimicked. Colour patterns can also be described by the classical RGB formalism.

The representation of the fitness as a sum of terms (2.7) is of course a rough approximation; however, if assumption M holds, that representation is consistent with important observed facts. First, mutations have been identified that alter one part of the pigment pattern without affecting any other. This independence of different pattern parts can be explained by the modular organization of the genetic regulation that controls pigmentation. In the course of evolution, different aspects of the pigment pattern have clearly evolved independently of each other¹⁸. Second, the topology of the fitness landscapes was studied in 22 by field experiments in the case of insect mimicry. The main conclusions are as follows. A number of studies of fitness landscapes in natural populations have demonstrated low fitness of intermediate phenotypes, i.e., the existence of valleys in the fitness landscape. It is found²² that natural selection promotes genetic architecture preventing the expression of intermediate phenotypes. Close fitness peaks are separated by ridges, favouring colour pattern switches and allowing drift from local peaks.

In Section 2.6 we will show that the fitness model defined by (2.4) and (2.7) has those topological properties.

2.4 Population dynamics model

For simplicity, we consider populations with asexual reproduction (although a part of the results remains valid for sexual reproduction, as we discuss at the end of this subsection). We choose initial genotypes randomly from a gene pool and assign them to organisms. This choice is invariant with respect to the population member, i.e., the probability of assigning a given genotype s to a member of the population does not depend on that member.

In each generation, there are N_pop(t) individuals, the genome of each of which is denoted by s(t), where t = 0, 1, 2,... stands for the evolution step number. Following the classical Wright-Fisher ideas, we suppose that generations do not overlap. In each generation (i.e., for each t), the following three steps are performed:

1. Each individual s at each evolution step can mutate with probability p_mut per gene;
2. At evolution step t each individual with a genotype s produces k progeny, where k is a random non-negative integer, distributed according to the Poisson law

P_{k} = \frac{q^{k}}{k!} \exp (- q), (2.8)

where q = F(s) is the fitness of that individual;
3. To take into account ecological restrictions on the population size, we introduce the maximal population size N_popmax. If N'(t) > N_popmax, where N'(t) is the number of progeny produced by the population at step t, we kill randomly selected individuals in a population-dependent manner. The probability of the death of an individual is given by p_kill(N') = 1 − (N_popmax/N'(t)). If N'(t) ≤ N_popmax, we do nothing. We refer to this as the “massacre procedure.”

Conditions 1 and 2 imply that mutations in the genotypes create a new genetic pool and then a new round of selection starts. Condition 3 expresses the fundamental ecological limitation that all environments can only support populations of a limited size. If N_popmax ≫ 1 then by (2.8) and the Central Limit Theorem one can show, under some additional conditions, (see Section 4) that fluctuations of the population size are small, and thus the population is ecologically stable and N_pop(t) ≈ N_popmax.
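One generation of this mutation, reproduction and massacre cycle can be sketched as follows. This is a minimal illustration under stated assumptions: a mutation is modelled as a flip of a Boolean gene, the Poisson sampler is Knuth's simple method (adequate for moderate q), and `toy_fitness` is a hypothetical stand-in for F(s), not the fitness (2.6).

```python
import math
import random

def poisson(q, rng):
    """Sample k ~ Poisson(q) by Knuth's multiplication method."""
    limit = math.exp(-q)
    k, prod = 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k

def generation(population, fitness, p_mut, n_popmax, rng):
    """One non-overlapping generation: mutation, reproduction, massacre."""
    # Step 1: each gene flips independently with probability p_mut (assumed).
    mutated = [[1 - g if rng.random() < p_mut else g for g in s]
               for s in population]
    # Step 2: each individual leaves a Poisson number of progeny, mean F(s).
    progeny = []
    for s in mutated:
        progeny.extend(list(s) for _ in range(poisson(fitness(s), rng)))
    # Step 3: if N' > N_popmax, each individual dies with probability
    # p_kill = 1 - N_popmax / N'.
    n_prime = len(progeny)
    if n_prime > n_popmax:
        p_kill = 1.0 - n_popmax / n_prime
        progeny = [s for s in progeny if rng.random() >= p_kill]
    return progeny

def toy_fitness(s):
    """Hypothetical fitness: grows with the number of active genes."""
    return 1.5 * math.exp(0.1 * (sum(s) - len(s) / 2))

rng = random.Random(42)
n_genes, n_popmax = 10, 200
pop = [[rng.randint(0, 1) for _ in range(n_genes)] for _ in range(50)]
for t in range(30):
    pop = generation(pop, toy_fitness, p_mut=0.01, n_popmax=n_popmax, rng=rng)
print(len(pop))
```

Because the massacre kills individuals independently, the population size after step 3 fluctuates around N_popmax rather than being capped exactly at it.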

In the limit case of infinitely large populations we will write the discrete dynamical equation for the time evolution of the frequency X(s, t) of the genotype s in the population as

X (s, t + 1) = \bar{F} {(t)}^{- 1} X (s, t) F (s), X (s, 0) = X_{0} (s), (2.9)

where $\bar{F}$ (t) is the average fitness of the population at the moment t defined by

\bar{F} (t) = \sum_{s \in S (t)} X (s, t) F (s), (2.10)

where S(t) is the set of genotypes existing in the population at time t (the genetic pool) and X(s, t) = N(s, t)/N_pop(t) is the frequency of the genotype s. Here N(s, t) denotes the number of the population members with the genotype s at the step t.

The equations (2.9) do not take mutations into account. They only describe changes in the genotype frequencies because of selection at the t-th time step. The same equations govern evolution in the case of sexual reproduction in the limit of weak selection⁶,¹⁰. Note that for an evolution defined by (2.9), the average fitness $\bar{F}$ (t) defined by (2.10) satisfies Fisher’s theorem, so that this function increases at each time step t: $\bar{F}$ (t + 1) ≥ $\bar{F}$ (t).
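The selection dynamics (2.9) and (2.10) can be iterated directly for a small genetic pool. The sketch below uses three hypothetical genotypes with made-up fitness values and initial frequencies; only the update rule itself comes from the text. The monotonicity of the mean fitness follows from the identity that the new mean equals the mean of F² divided by the mean of F, which is at least the mean of F.

```python
def selection_step(x, fit):
    """One step of (2.9): X(s, t+1) = X(s, t) * F(s) / mean fitness.
    Returns the new frequencies and the mean fitness (2.10) at time t."""
    mean_f = sum(x[s] * fit[s] for s in x)
    return {s: x[s] * fit[s] / mean_f for s in x}, mean_f

# Hypothetical pool of three genotypes (labels and values are illustrative).
fit = {"00": 1.0, "01": 1.2, "11": 1.5}
x = {"00": 0.6, "01": 0.3, "11": 0.1}

means = []
for t in range(20):
    x, mean_f = selection_step(x, fit)
    means.append(mean_f)

print(means[0], means[-1])  # the mean fitness never decreases along the run
print(x)                    # the fittest genotype comes to dominate
```

Without mutation the genetic pool never grows, so the mean fitness can only approach the largest fitness value already present in the initial pool.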

2.5 Gene regulation network

In this section, we follow ideas of the classic paper²⁹: the model should include a regulatory network, which evolves itself.

Regulatory genes as well as environmental factors, such as temperature, can influence the trait expression. This effect can be realized via thresholds h_j (we shall describe it below), or via a regulation of coefficients w_ij (see 30). In fact, these approaches are similar for sharp sigmoidal functions σ that are close to step functions, as can be shown by ideas from 31. Consider the expression

S_{i} = \sum_{j = 1}^{N_{g}} w_{i j} s_{j} - h_{i}, (2.11)

which is the argument of σ in relation (2.4); in this subsection the trait index is denoted by i and the gene index by j. Suppose, following 31, that the w_ij take the values γ, 0 or −γ, where γ is a parameter that can be regulated. Let h_i ≪ γ be fixed.

Assume that at some evolution step we have S_i > d_S, where d_S > γ is a positive parameter. According to our theorems, this indicates that for a well adapted organism the trait f_i(s) ≈ 1 and the corresponding coefficient b_i = 1. Conversely, if S_i < −d_S, then one can expect that b_i = −1 and that f_i(s) must be 0.

In both cases, we can regulate the trait expression by feedback: whenever S_i attains a critical level, i.e., a trait is well expressed, γ should be increased. In our numerical simulations we use an alternative model, in which we change h_i instead. We suppose that, depending on the activity of some regulatory genes or proteins (such as Hsp90), the threshold can take three values $h_{i}^{(-)}$, $h_{i}^{(+)}$ and $h_{i}^{(0)}$ such that

h_{i}^{(-)} \leq - D_{h}, h_{i}^{(+)} \geq D_{h}, | h_{i}^{(0)} | \approx 0, (2.12)

where D_h > 1 is a large parameter that defines the number of genes involved in trait specification (see Subsection 2.2). Thus, h_i can take large negative or positive values, and also a neutral value close to 0. The feedback can be described as follows: if S_i > 0 is large enough and h_i is small, then we set h_i = −D_h; if S_i < 0, $| S_{i} |$ is large enough and h_i is small, then we set h_i = D_h; otherwise, we do not change h_i.

In our simulations, every ΔT evolution steps we change h_i from 0 to −D_h or D_h for the trait with the maximal value $| S_{i} |$.
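The threshold feedback can be sketched as a single update applied every ΔT steps. This is an illustration under assumptions, not the authors' implementation: the function name is hypothetical, and the choice to search only among traits whose threshold is still neutral (close to 0) is one reading of the rule above.

```python
def regulate_threshold(s_values, h, d_h, eps=1e-9):
    """Freeze the threshold of the neutral trait with the largest |S_i|.

    s_values[i] is the current value of S_i, h[i] the current threshold.
    A trait with S_i > 0 gets h_i = -d_h (expression locked on);
    a trait with S_i < 0 gets h_i = +d_h (expression locked off).
    Returns a new list; the input list is left unchanged.
    """
    neutral = [i for i, hi in enumerate(h) if abs(hi) < eps]
    if not neutral:
        return list(h)
    i = max(neutral, key=lambda k: abs(s_values[k]))
    new_h = list(h)
    if s_values[i] > 0:
        new_h[i] = -d_h
    elif s_values[i] < 0:
        new_h[i] = d_h
    return new_h

s_values = [0.5, -3.0, 2.0]   # hypothetical values of S_i for three traits
h0 = [0.0, 0.0, 0.0]
h1 = regulate_threshold(s_values, h0, d_h=10.0)
h2 = regulate_threshold(s_values, h1, d_h=10.0)
print(h1)  # [0.0, 10.0, 0.0]
print(h2)  # [0.0, 10.0, -10.0]
```

Once a threshold has magnitude D_h, flipping a few genes changes the argument of σ by far less than D_h, so the frozen trait becomes insensitive to mutations.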

Such a regulation produces a good adaptation even when the number of genes is substantially smaller than the number of traits. The plot in Figure 3 shows a difference between an evolution without any regulation and with the regulation by (2.12) described above.

Moreover, the evolution of gene regulation via h_i has an advantage: it makes the phenotype more robust. In fact, the traits with large $| h_{i} |$ are insensitive to mutations. The effect produced by this robustness is shown in Figure 1.

Figure 1. This graph illustrates the difference between adaptation without evolution of gene regulation via thresholds (the red curve) and with that evolution according to (2.12) (the green curve).

The parameters are N_g = 50, M = 300, the mutation rate p_mut = 0.01, γ = 1 and K = 4. Non-zero coefficients w_ij are random numbers distributed according to the standard normal law. The initial genome is a random binary string, where each value is 0 or 1 with probability 1/2. The coefficients b_i are either 1 or −1, where the probability of 1 is p_b = 0.8.

This regulation is even more effective if we consider modular evolution, following the recent paper³². Indeed, biological systems are characterized by a high degree of modularity. This modularity allows biological systems to vary only in a small subset of traits at each evolution round. Figure 2 shows the effect of this modularity. We consider a toy example, where a system with only 10 genes should be adapted to 200 constraints. Without regulation, there is no chance of adaptation (only about 10 traits are correctly adapted, see the green curve). This is a consequence of a formidable pleiotropy (20 traits per gene). However, if at each evolution stage an organism should be adapted to 4 traits whereas the remaining ones are made robust by high $| h_{i} |$, then already 66 traits are correctly expressed after 20000 evolution steps. Note that learning plays a key role in the regulation. In fact, the sign of the regulation threshold h_i depends on the sign of the corresponding coefficient b_i, and the knowledge of that sign gives us important information on the fitness landscape.

Figure 2. This graph shows that the modular evolution of gene regulation allows adaptation with a few genes, in that case N_g = 10 and the number of traits M = 200.

This means that pleiotropy is very high. Evolution proceeds in 20000 steps. The green curve corresponds to a random walk with fixed small h without any evolution of gene regulation. We observe that with 10 genes 9 traits are correctly expressed. If evolution proceeds in 50 rounds then 66 traits are correctly adapted, and if evolution proceeds in 190 rounds then 129 traits are correctly adapted (the red curve). In that last case, at each step we make adaptation to at most one trait. Parameters are as follows. The mutation rate p_mut = 0.01 and K = 5. Nonzero coefficients w_ij are random numbers distributed according to the standard normal law. The initial genome is a random binary string, where each value is 0 or 1 with probability 1/2. The coefficients b_i are either 1 or –1, where the probability of 1 is p_b = 0.8.

Figure 3. Frequency distributions of degree of gene pleiotropy for model (2.4) with the parameters N_g = 4000, β = 4, h = 0, N_b = 3000.

2.6 Adaptation as a hard combinatorial problem

Adaptation (i.e., maximization of fitness in a changing environment) is a very hard problem since over evolutionary history we observe the coevolution of many traits accompanied by changes in many genes. In its general context, this is a problem in the theory of macroevolution, which in general requires the integration of population genetics and developmental biology for its full understanding. There are two key components of this problem. First, development is itself a dynamical process operating over time. Second, there is a combinatorial component of development wherein different combinations of genes must be expressed in different cell types. This combinatorial aspect of the problem means that straightforward theoretical methods of considering the relationship between gene expression and a changing environment that have been very successful in single celled organisms³³ cannot be applied to metazoa. In this work, for the sake of tractability, we focus on the combinatorial aspect of the problem and neglect developmental dynamics. Even at the highly simplified level of our model, adaptation is a hard computational problem, as we now demonstrate.

Consider the case, where f_j are defined by relations (2.4) and assume that

i) σ is the step function;
ii) b_j > 0.

As a consequence of the second assumption, F attains its maximum for f₁ = 1, f₂ = 1,..., f_{N_b} = 1. Let us show that, even in this particular case, the problem of fitness maximization with respect to s is very complex. In fact, for a suitable choice of h_j it reduces to the famous NP-complete problem known as K-SAT, which has received a great deal of attention in the last few decades (see 34–39). The K-SAT problem can be formulated as follows.

K-SAT problem. Let us consider the set V_n = {s₁,..., s_n} of Boolean variables s_i ∈ {0, 1} and a set 𝒞_m of m clauses. The clauses C_j are disjunctions (logical ORs) involving K literals z_{i_1}, z_{i_2},..., z_{i_K}, where each z_i is either s_i or the negation $\bar{s}$_i of s_i. The problem is to test whether one can satisfy all of the clauses by an assignment of Boolean variables.

Cook and Levin³⁴,³⁵ have shown that the K-SAT problem is NP-complete (for K ≥ 3), and therefore, in general, no algorithm is known that solves it in a reasonable (polynomial) running time. In subsequent studies (for instance, 36) it was shown that K-SAT of a random structure is feasible, i.e., satisfiable with high probability, under the condition that N_b < α_c(K)N_g, where α_c(K) ≈ 2^K log 2 for large K.

The set 𝒞_K of solutions of random K-SAT has a nontrivial structure depending on the parameter α = N_b/N_g (see 37, 39). For sufficiently small α < α_g(K), where α_g(K) ≈ 2^K log(K)/K is some critical parameter, the set 𝒞_K forms a giant cluster, where nearest solutions are connected by a single flip and one can go from one solution to another by a sequence of single flips (point mutations)³⁹. For α ∈ (α_g, α_d), where α_d(K) < α_c is another critical value, solutions form a set of disconnected clusters. Local search algorithms do not work in the domain α > α_g.

In an evolutionary context, K-SAT was probably first applied in 40, where it was used to investigate the speciation problem.

To see the connection of our model with K-SAT, consider equation (2.4), supposing that σ is the step function, w_ji ∈ {1, 0, −1} and h_j = −C_j + 0.5, where C_j is the number of negative w_ji in the sum $S_{j} = \sum_{i = 1}^{N_{g}} w_{j i} s_{i}$. We set m = N_b and n = N_g. Under this choice of h_j, the terms σ(S_j − h_j) can be represented as disjunctions of literals z_i. Each literal z_i equals either s_i (if w_ji = 1) or $\bar{s}$_i (if w_ji = −1), where $\bar{s}$_i denotes the negation of s_i. Indeed, S_j ≥ −C_j, with equality exactly when every literal of the clause is false, so S_j − h_j > 0 if and only if at least one literal is true. To maximize the fitness, we must assign the s_i such that all disjunctions are satisfied. If we fix the number n_j of the literals participating in each disjunction (clause) and set n_j = K, this assignment problem is precisely the K-SAT problem formulated above.
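The reduction can be checked by brute force on a small instance. The sketch below builds random clauses, converts each into a weight row with entries in {1, 0, −1} and a threshold h_j = −C_j + 0.5, and compares the step-function trait with the truth value of the clause for every assignment. The clause encoding as (index, negated) pairs and all function names are assumptions made for the example.

```python
import itertools
import random

def clause_to_weights(clause, n):
    """clause: list of (variable index, negated?) pairs.
    Returns the weight row w_j and the threshold h_j = -C_j + 0.5,
    where C_j is the number of negated literals."""
    row = [0] * n
    for i, negated in clause:
        row[i] = -1 if negated else 1
    c_j = sum(1 for _, negated in clause if negated)
    return row, -c_j + 0.5

def clause_satisfied(clause, s):
    """Truth value of the disjunction under the assignment s."""
    return any((s[i] == 0) if negated else (s[i] == 1)
               for i, negated in clause)

def step_trait(row, h, s):
    """f_j with sigma equal to the step function: 1 if S_j - h_j > 0, else 0."""
    return 1 if sum(w * si for w, si in zip(row, s)) - h > 0 else 0

rng = random.Random(1)
n, k, m = 6, 3, 8   # variables (genes), literals per clause, clauses (traits)
clauses = [[(i, rng.random() < 0.5) for i in rng.sample(range(n), k)]
           for _ in range(m)]

mismatches = 0
for s in itertools.product((0, 1), repeat=n):
    for clause in clauses:
        row, h = clause_to_weights(clause, n)
        if step_trait(row, h, s) != int(clause_satisfied(clause, s)):
            mismatches += 1
print(mismatches)  # 0: the trait equals 1 exactly when the clause is satisfied
```

With all b_j > 0, maximizing the fitness potential is then the same as maximizing the number of satisfied clauses, and the maximum N_b is reached only by satisfying assignments.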

Reduction to the K-SAT problem is a transparent way of representing the idea that multiple constraints need to be satisfied. The number K defines the gene redundancy and the probability of gene pleiotropy. Remind that pleiotropy occurs when one gene influences two or more seemingly unrelated phenotypic traits. The threshold h_j and K define the number of genes which need be flipped in order to attain a high expression of the trait f_j. Note that gene pleiotropy is a fundamental characteristics⁴¹, which is studied for real organisms only recently (see 19). We can compare experimental observations and consequence of model (2.4), which is a generalisation of K-SAT (compare plot Figure 3 and plots on Figure 1 in 19). So, we can fit our model parameters using real data. Moreover, we can check validity of our model by the following arguments.

We note that, in the case of giant cluster formation, the topological properties of the solution set 𝒞_K mentioned above reproduce the properties of actually observed fitness landscapes²²: the existence of many peaks, valleys, and ridges connecting peaks. Namely, the existence of many solutions of K-SAT, when a giant cluster exists, means that the landscape has a number of peaks separated by valleys. On the other hand, the connectivity of solutions within the giant cluster can be interpreted as the existence of ridges that connect peaks.

Note that there are important differences between K-SAT in Theoretical Computer Science and fitness maximization problems. First, the signs of b_j are unknown for real biological situations since the fitness landscape is unknown. Second, our adaptation problem involves the threshold parameters h_j (see (2.4)). In contrast to K-SAT, in our case the Boolean circuit is plastic, because the h_j are not fixed.

If the b_j are unknown, the adaptation (fitness maximization) problem becomes even harder because we do not know the function to optimize. Therefore, many algorithms for K-SAT are useless for biological adaptation problems. Below we will nonetheless obtain some analytical results based on the assumption that b_j are random.

3. Main theorems

The subsequent material is organized as follows. First we formulate a result on regulation mechanism power. Furthermore, we prove two fitness landscape learning theorems.

3.1 Fitness landscape learning theorems

For simplicity, we consider asexual reproduction. To obtain similar results for sexual reproduction, one can consider a weak selection regime and use the results of 10, where equations (2.9) are derived.

Let us introduce two sets of indices I₊ and I₋, such that I₊ ∪ I₋ = {1,..., N_b}. We refer to these sets in the sequel as positive and negative sets, respectively. We have

I_{+} = \{ j \in \{1, \dots, N_{b}\} \mid b_{j} > 0 \}, (3.1)

I_{-} = \{ j \in \{1, \dots, N_{b}\} \mid b_{j} < 0 \} . (3.2)

The biological interpretation of that definition is transparent: the expression of the traits f_j with j ∊ I₊ increases the fitness and for j ∊ I_– expression of the trait decreases the fitness.

Let s and $\bar{s}$ be two genotypes. Then we denote by Diff(s, $\bar{s}$ ) the set of positions i such that s_i ≠ $\bar{s}$ _i:

Diff (s, \bar{s}) = \{ i \in \{1, \dots, N_{g}\} \mid s_{i} \neq \bar{s}_{i} \} .

The set Diff(s, $\bar{s}$ ) indicates which genes in s should be flipped in order to obtain $\bar{s}$ .

We formulate two theorems on fitness landscape learning. First we consider the case of infinitely large populations.

Theorem 3.1. Suppose that the evolution of the genotype frequencies X(s, t) is determined by equations (2.9) and (2.10). Moreover, assume that

I for all t ∊ [T₁, T₁ + T_c], where T₁, T_c > 0 are integers, the population contains two genotypes s and $\bar{s}$ such that the frequencies X(s, t) and X( $\bar{s}$ , t) satisfy

X (s, T_{1}) = p_{0} > 0, X (\bar{s}, T_{1} + T_{c}) = p_{1} > 0, (3.3)

II we have

Diff (s, \bar{s}) \subset K_{j}, (3.4)

for some j. In other words, the genes s_i such that s_i ≠ $\bar{s}$ _i are involved in a single regulation set K_j; and finally,

III Let

δ_{j} = | f_{j} (s) - f_{j} (\bar{s}) | > 0, | b_{j} | > 0, (3.5)

and

T_{c} > \frac{- \log (p_{0} p_{1})}{| b_{j} | δ_{j}} . (3.6)

Then, if

f_{j} (s) < f_{j} (\bar{s}), (3.7)

we have j ∊ I₊. If f_j(s) > f_j( $\bar{s}$ ), then j ∊ I_–.

Before proving this, let us make some comments. The biological meaning of the theorem is simple: for simple fitness models, where unknown parameters b_j are involved in a linear way, in the limit of infinitely large populations fitness landscape learning is possible.

Moreover, note that we do not make any specific assumptions about the nature of mutation, but only that all genetic variation between s and $\bar{s}$ is contained in a single regulatory set K_j.

The assertion of Theorem 3.1 is not valid if the set Diff(s, $\bar{s}$ ) belongs to a union of different regulation sets K_j , j = j₁, . . . , j_p with p > 1. This effect of belonging to different sets K_j is pleiotropy in gene regulation. Note that if N_b ≪ N_g then the pleiotropy probability is small for large genome lengths N_g. On the contrary, if N_b ≫ N_g then assumption II is invalid.

Assumption II looks natural when we deal with point mutations. In fact, if $\bar{s}$ is obtained from s by a single point mutation then condition (3.4) always holds for some j. For small mutation rates the probability of two point mutations is much smaller than the probability of a single mutation.

To conclude, let us note that Theorem 3.1 gives a rough estimate for the learning time T_c:

T_{c} = O (- \frac{\log (p_{0} p_{1})}{| b_{j} | δ_{j}}) .

Proof. The main idea is simple. Negative mutations lead to elimination of mutant genotypes from the population, and the corresponding frequencies become, for large times, exponentially small.

Assume that (3.7) holds and suppose, to the contrary, that j ∊ I₋, so that b_j < 0. Consider the quantity

Q (t) = \frac{X (s, t)}{X (\bar{s}, t)} = \frac{N (s, t)}{N (\bar{s}, t)} . (3.8)

According to assumption II

Δ W = W (s) - W (\bar{s}) = b_{j} (f_{j} (s) - f_{j} (\bar{s})) . (3.9)

Assumption III entails that

Δ W \geq | b_{j} | δ_{j} . (3.10)

Relations (2.6) and (3.10) imply

\frac{F (s)}{F (\bar{s})} = \exp (Δ W) \geq \exp (| b_{j} | δ_{j}) .

By (2.9) and the last inequality we find that for T > T₁

Q (T) \geq Q (T_{1}) \exp (| b_{j} | δ_{j} (T - T_{1})) . (3.11)

Consider inequality (3.11) for T = T₁ + T_c. Let us note that in the relation Q(T₁) = X(s, T₁)/X( $\bar{s}$ , T₁) the numerator is p₀ whereas the denominator is at most 1. Thus, Q(T₁) ≥ p₀. The same arguments show that Q(T₁ + T_c ) ≤ 1/p₁. Therefore, by (3.11) one obtains that

\frac{1}{p_{1} p_{0}} \geq \exp (| b_{j} | δ_{j} T_{c}) . (3.12)

This inequality leads to a contradiction for T_c satisfying (3.6), thus completing the proof.
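The argument can be illustrated numerically. The sketch below is not the authors' code: it assumes that (2.9) has the replicator form X(s, t+1) = F(s) X(s, t)/F̄(t) with F = K_F exp(W), uses a population of only two genotypes, and picks hypothetical values for b_j, δ_j, p₀ and p₁. It shows that when b_j < 0 and the mutant expresses the trait more strongly, the mutant frequency falls below p₁ once T_c exceeds the bound (3.6).

```python
import math

def learning_time(p0, p1, b_j, delta_j):
    """Right-hand side of (3.6): T_c must exceed this value."""
    return -math.log(p0 * p1) / (abs(b_j) * delta_j)

def evolve(x, fitness, steps):
    """Iterate X(s, t+1) = F(s) X(s, t) / mean fitness over a dict of genotypes."""
    for _ in range(steps):
        mean_f = sum(fitness[g] * x[g] for g in x)
        x = {g: fitness[g] * x[g] / mean_f for g in x}
    return x

# Hypothetical numbers: b_j < 0 and the mutant s_bar expresses trait j more strongly.
b_j, delta_j, p0, p1 = -1.0, 0.5, 0.1, 0.5
fitness = {"s": 1.0, "s_bar": math.exp(b_j * delta_j)}      # F = exp(W), taking K_F = 1
t_c = math.floor(learning_time(p0, p1, b_j, delta_j)) + 1   # an integer above the bound
final = evolve({"s": p0, "s_bar": 1.0 - p0}, fitness, t_c)
print(t_c, final["s_bar"])
```

In this toy setting a mutant that is still present at frequency p₁ after T_c steps cannot have b_j < 0, which is the content of the theorem.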

3.2 The case of finite populations

Theorem 3.1 can be extended to the case of finite populations and non-zero mutation rates, provided that the mutation rate is small. To formulate this generalization, we need an additional assumption about the fitness function. Suppose that

1 < c_{F} < \min_{s \in S (t)} F (s), \max_{s \in S (t)} F (s) < C_{F}, \forall t \in [T_{1}, T_{1} + T_{c}], (3.13)

where c_F, C_F > 0 are constants independent of t. For example, if

\sum_{j = 1}^{N_{b}} | b_{j} | < γ,

then c_F = K_F exp(–γ) and C_F = K_F exp(γ) and (3.13) holds if K_F > exp(γ).

Condition (3.13) means that each individual gives birth to at least c_F and at most C_F descendants, where those bounds do not depend on the population size and evolution step.

Let

N_{pop} (T_{1}) = N_{popmax} . (3.14)

Note that for simplicity in the next Theorem 3.2 we consider point mutations (bit flipping) only. The model used here cannot represent mutations of arbitrarily small effect, but it can include insertions or deletions. In contrast to Theorem 3.2, Theorem 3.1 is valid for all kinds of mutations.

Then we have

Theorem 3.2. Consider the population dynamics defined by model 1-3 in Subsection 2.4. Assume conditions (3.14) and M hold, and assumptions (3.3), (3.4), (3.5), (3.7) of Theorem 3.1 are satisfied. Suppose

X (s, t) \geq p_{0} \forall t \in [T_{1}, T_{1} + T_{c}], (3.15)

X (\bar{s}, t) \geq p_{1} \forall t \in [T_{1}, T_{1} + T_{c}] . (3.16)

Then if j ∊ I_– the inequality

p_{1} < p_{0}^{- 1} \exp (- 0.5 | b_{j} | δ_{j} T_{c}) (3.17)

is fulfilled with the probability Pr_v such that

{Pr}_{v} > {(1 - ρ (p_{0}) - ρ (p_{1}))}^{T_{c}}, (3.18)

where for large N_popmax and p_mut → 0

ρ (p) = exp (- (ln 2 - 1 / 2) p_{mut} c_{F} p κ N_{popmax}) .

Interpretation of Theorem 3.2

It is interesting to compare Theorem 3.1 and Theorem 3.2. The former asserts that for infinite populations the probability of the event j ∊ I₋ is zero, whereas the latter claims that this probability becomes exponentially small as the population size increases.

This theorem also shows that evolution can perform a statistical test of the hypothesis H_– that j ∊ I_– against the hypothesis H₊ that j ∊ I₊. Suppose that H_– is true. Let V be the event that the frequency X( $\bar{s}$ , t) of the genotype $\bar{s}$ in the population stays larger than p₁ over a sufficiently long time T_c. According to estimate (3.18), the probability of the event V is so small that it is almost unbelievable. Therefore, the hypothesis H_– should be rejected. We will refer to T_c as the checking time.
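To get a feeling for the orders of magnitude, one can evaluate the asymptotic expression for ρ(p) and the lower bound (3.18) directly. The following Python sketch uses purely hypothetical parameter values (p_mut, c_F, κ, p₀, p₁, T_c are not taken from the paper) and simply evaluates the two formulas as stated in Theorem 3.2.

```python
import math

def rho(p, p_mut, c_f, kappa, n_popmax):
    """Asymptotic rho(p) from Theorem 3.2 (valid for large N_popmax, small p_mut)."""
    return math.exp(-(math.log(2) - 0.5) * p_mut * c_f * p * kappa * n_popmax)

def confidence_bound(p0, p1, t_c, p_mut, c_f, kappa, n_popmax):
    """Lower bound (3.18): (1 - rho(p0) - rho(p1)) ** T_c, clipped at zero."""
    z = 1.0 - rho(p0, p_mut, c_f, kappa, n_popmax) - rho(p1, p_mut, c_f, kappa, n_popmax)
    return max(z, 0.0) ** t_c

# Hypothetical parameters; kappa lies in (1 / c_F, 1) as required by (4.23).
for n_popmax in (1e4, 1e5, 1e6):
    bound = confidence_bound(0.1, 0.1, 20, p_mut=1e-3, c_f=1.5, kappa=0.8,
                             n_popmax=n_popmax)
    print(int(n_popmax), bound)
```

For these values the bound is vacuous for the smallest population and approaches one for the largest, which is the sense in which the test becomes reliable as N_popmax grows.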

Rare mutants. In this theorem we assume that the frequencies p₀ and p₁ of the genotypes (wild type and mutant) are fixed and our estimate is valid as p_mut → 0. That is, we do not consider mutants with very small frequencies (fractions). Of course, a large population always contains a small number of such mutants. In numerical simulations we assume that evolution is successful and the population is perfectly adapted if, say, 95 or 99 percent of the population members have the maximal fitness.

Ideas for the proof. The main idea is the same as that for the previous theorem: we compare the frequencies of the organisms with the genotype $\bar{s}$ and the organisms with the genotype s. However, the proof includes a number of technical details connected with estimates of mutation effects and fluctuations. The formal proof can be found in Section 4. It is based on estimates of the accuracy of the Nagylaki equations (2.9). The main Lemma 4.5 for the proof of Theorem 3.2 admits a transparent interpretation. We show that the fraction X(s, t) of genotype s evolves in time in such a way that the estimates

X (s, t + 1) < (F (s) + r (p_{mut}, N_{popmax}, s)) X (s, t) / \bar{F} (t)

X (s, t + 1) > (F (s) - r (p_{mut}, N_{popmax}, s)) X (s, t) / \bar{F} (t)

are satisfied, where F(s) is the fitness of genotype s, $\bar{F}$ is the average population fitness, and r(p_mut, N_popmax, s) are small corrections, which converge to zero uniformly in s as the mutation rate p_mut → 0 and the population size N_popmax → ∞. This means that in the limit p_mut → 0, N_popmax → ∞ we have equation (2.9). The main question in applying Theorem 3.2 is how it allows fitness landscape learning to be performed. This can be done by regulation, as is detailed in the following section.

4 Proof of theorems

Let us prove Theorem 3.2.

4.1 Main tools and auxiliary Lemmas

Let us introduce notation and make some preliminary remarks. Recall that we denote by N(s, t) the number of the population members with the genotype s at the moment t. Let X(t) be the set of all population members at the moment t. For each x ∊ X(t) let us denote by $N^{'}$ (x, t) the number of progeny produced by the individual x at the moment t before the massacre (see point 3 of the model from Subsection 2.4). Let s_g (x) be the genotype of x. Then, according to (2.8), the mean of $N^{'}$ (x, t) is

E N^{'} (x, t) = F (s_{g} (x)), (4.1)

where EX denotes the expected value of X. By $\bar{N}$ (s, t) we denote the number of all progeny born by individuals with the genotype s at the moment t before the massacre. Since all progeny are produced independently and randomly, the previous relation gives

E \bar{N} (s, t) = N (s, t) F (s) . (4.2)

Our main analytical tools are the Chernoff bounds and the Hoeffding inequalities. We also use the Markov inequality: for a positive random quantity X and a > 0 one has

P r {X > a} \leq \frac{E X}{a} . (4.3)

Moreover, we use two elementary estimates. Let 𝒜 be an event in stochastic population dynamics. We denote by Not𝒜 the negation (complement) of 𝒜 and by Pr(𝒜|ℬ) the conditional probability of 𝒜 under the condition ℬ. For events 𝒜 , ℬ₁, . . . , ℬ_n we have

\begin{array}{l} Pr (A) = Pr (A ℬ_{1} \dots ℬ_{n}) + \\ Pr (A Not (ℬ_{1} \dots ℬ_{n})) \\ \leq Pr (A | ℬ_{1} \dots ℬ_{n}) + \sum_{j = 1}^{n} Pr (Not ℬ_{j}) . (4.4) \end{array}

For two events 𝒜, ℬ one has

Pr (A ℬ) \geq 1 - Pr (Not A) - Pr (Not ℬ) . (4.5)

Lemma 4.1. Let X_i be independent random quantities, where i = 1, . . . , n. Let each X_i be distributed according to the Poisson law with the average EX_i = μ_i. Let us denote

X = \sum_{j = 1}^{n} X_{j}, \bar{μ} = \frac{\sum_{j = 1}^{n} μ_{j}}{n} .

Then for all δ > 0

Pr {X > (1 + δ) \bar{μ} n} \leq exp (- \bar{μ} d (δ) n), (4.6)

where

d (δ) = (1 + δ) ln (1 + δ) - δ .

Similarly,

Pr {X < (1 - δ) \bar{μ} n} \leq exp (- \bar{μ} d (- δ) n) . (4.7)

Proof. Note that for any λ > 0

\begin{array}{l} Pr {X > (1 + δ) \bar{μ} n} = \\ Pr {exp (λ X) > exp (λ (1 + δ) \bar{μ} n))} (4.8) \end{array}

Since X_j are independent quantities, we have

E exp (λ X) = \prod_{j = 1}^{n} E exp (λ X_{j}) .

A straightforward computation shows that

E exp (λ X_{j}) = exp ((e^{λ} - 1) μ_{j}) .

Therefore, due to the Markov inequality (4.3) and estimate (4.8) one has

Pr {X > (1 + δ) \bar{μ} n} \leq exp (n \bar{μ} f (λ)),

where

f (λ) = exp (λ) - 1 - λ (1 + δ) .

We minimize f with respect to λ and obtain (4.6). To derive (4.7), we use

\begin{array}{l} Pr {X < (1 - δ) \bar{μ} n} = \\ Pr {exp (- λ X) > \\ exp (- λ (1 - δ) \bar{μ} n))} (4.9) \end{array}

and repeat the same arguments. The Lemma is proved.
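The bounds of Lemma 4.1 can be compared with exact tail probabilities, because a sum of independent Poisson quantities is again Poisson with mean n μ̄. The sketch below is a numerical illustration only; the list of means and the value of δ are hypothetical.

```python
import math

def d(delta):
    """Rate function d(delta) = (1 + delta) ln(1 + delta) - delta."""
    return (1 + delta) * math.log(1 + delta) - delta

def poisson_cdf(mean, k):
    """Exact Pr{X <= k} for X ~ Poisson(mean), accumulated via log-probabilities."""
    log_pmf, total = -mean, 0.0
    for i in range(k + 1):
        if i > 0:
            log_pmf += math.log(mean) - math.log(i)
        total += math.exp(log_pmf)
    return total

def check_lemma_4_1(mus, delta):
    """Return (upper tail, bound (4.6), lower tail, bound (4.7))."""
    total_mean = sum(mus)                        # equals n * mu_bar
    upper = max(0.0, 1.0 - poisson_cdf(total_mean,
                                       math.floor((1 + delta) * total_mean)))
    lower = poisson_cdf(total_mean, math.ceil((1 - delta) * total_mean) - 1)
    return (upper, math.exp(-total_mean * d(delta)),
            lower, math.exp(-total_mean * d(-delta)))

mus = [1.2, 2.0, 0.8, 1.5] * 25                  # hypothetical means, n = 100
print(check_lemma_4_1(mus, 0.2))
```

Each exact tail should lie below the corresponding bound; the gap indicates how conservative the Chernoff estimate is for moderate n.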

Lemma 4.2. Let X_i be independent random quantities, where i = 1, . . . , n such that X_i ∊ {0, 1} and EX_i = p. Then

Pr {2 X < p n} \leq exp (- g (p) n), (4.10)

where

g (p) = - \frac{p ln 2}{2} - ln (1 - \frac{p}{2}) . (4.11)

Proof. Note that for any λ > 0

Pr {X < p n / 2} = Pr {exp (- λ X) > exp (- λ p n / 2))} . (4.12)

Since X_j are independent quantities, we have

E exp (- λ X) = \prod_{j = 1}^{n} E exp (- λ X_{j}) .

Note that E exp(−λX_j) = p exp(−λ) + 1 − p. Let

G (λ, p) = λ p / 2 + ln (p exp (- λ) + 1 - p) .

We take λ = ln 2 and find that G(ln 2, p) = −g(p). Now by using the Markov inequality (4.3) and estimate (4.12) one obtains (4.10). The Lemma is proved.
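As a sanity check, bound (4.10) can be compared with the exact binomial probability. The Python sketch below does this for a few hypothetical pairs (n, p); the function names are illustrative only.

```python
import math

def g(p):
    """Exponent g(p) from (4.11)."""
    return -p * math.log(2) / 2 - math.log(1 - p / 2)

def check_lemma_4_2(n, p):
    """Return (exact Pr{X < p n / 2}, bound exp(-g(p) n)) for X ~ Binomial(n, p)."""
    exact = sum(math.comb(n, k) * p**k * (1 - p)**(n - k)
                for k in range(n + 1) if k < p * n / 2)
    return exact, math.exp(-g(p) * n)

for n, p in ((200, 0.3), (100, 0.5), (500, 0.1)):   # hypothetical test cases
    print(n, p, check_lemma_4_2(n, p))
```

The choice λ = ln 2 in the proof is not the optimal Chernoff parameter, so the bound is expected to be loose but still valid.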

We also use the following Chernoff-Hoeffding theorem. Let X_i be i.i.d. quantities such that X_i ∊ {0, 1} and EX_i = p, where i = 1, . . . , n. Then for $X = \sum_{j = 1}^{n} X_{j}$ one has

Pr {X > (p + ε) n} \leq exp (- D (p + ε ‖ p) n), (4.13)

where D(x||y) is the Kullback-Leibler divergence

D (x ‖ y) = x ln (x / y) + (1 - x) ln ((1 - x) / (1 - y)) . (4.14)

Moreover, we will use the Hoeffding theorem: if X_i, i = 1, . . . , n, are i.i.d. quantities such that X_i ∊ [0, 1] with probability 1, then for $X = \sum_{j = 1}^{n} X_{j}$ one has

Pr {| X - E X | > a} \leq 2 exp (- 2 a^{2} / n) . (4.15)
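Both inequalities can be illustrated on a binomial sum, for which the tails are computable exactly. The following sketch is an illustration with hypothetical values of n, p and ε; it evaluates the Chernoff-Hoeffding bound (4.13) with the divergence (4.14) and the Hoeffding bound (4.15) with a = εn.

```python
import math

def kl(x, y):
    """Kullback-Leibler divergence D(x || y) between Bernoulli laws, as in (4.14)."""
    return x * math.log(x / y) + (1 - x) * math.log((1 - x) / (1 - y))

def binom_pmf(n, p, k):
    return math.comb(n, k) * p**k * (1 - p)**(n - k)

def check_tail_bounds(n, p, eps):
    """Return (upper tail, bound (4.13), two-sided tail, bound (4.15) with a = eps n)."""
    upper = sum(binom_pmf(n, p, k) for k in range(n + 1) if k > (p + eps) * n)
    two_sided = sum(binom_pmf(n, p, k) for k in range(n + 1)
                    if abs(k - n * p) > eps * n)
    return (upper, math.exp(-kl(p + eps, p) * n),
            two_sided, 2 * math.exp(-2 * (eps * n) ** 2 / n))

print(check_tail_bounds(200, 0.3, 0.1))   # hypothetical n, p, eps with p + eps < 1
```

The divergence-based bound is sharper than the Hoeffding bound for the one-sided tail, which is presumably why it is the one applied to the mutant counts later in the proof.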

4.2 Main lemmas

First we estimate the population size fluctuations.

Lemma 4.3. Let $\bar{N}$ (t) be the number of all progeny, born in the population at the moment t before the massacre, and ε₁ > 0 be a small number. Then

\begin{array}{l} \bar{N} (t) \in J_{ε_{1}} (t) = \\ [(1 - ε_{1}) \bar{F} (t) N_{pop} (t), (1 + ε_{1}) \bar{F} (t) N_{pop} (t)] (4.16) \end{array}

with probability

{Pr}_{\bar{N}} > 1 - η_{0} (ε_{1}), (4.17)

where

\begin{array}{l} η_{0} (ε_{1}) = exp (- d (ε_{1}) c_{F} N_{pop} (t)) + \\ exp (- d (- ε_{1}) c_{F} N_{pop} (t)) . (4.18) \end{array}

Proof. Let $N^{'}$ (x, t) denote the number of progeny produced by the individual x before the massacre at the t-th evolution step. The number $\bar{N}$ (t) is the sum

\bar{N} (t) = \sum_{x \in X (t)} N^{'} (x, t)

of mutually independent random quantities. According to (4.1), the average E $N^{'}$ (x, t) is F(s_g (x)). Therefore,

E \bar{N} (t) = \sum_{x \in X (t)} E N^{'} (x, t) (4.19)

= \sum_{x \in X (t)} F (s_{g} (x)) (4.20)

= N_{pop} (t) \bar{F} (t) . (4.21)

We set

n = N_{pop} (t), μ_{x} = F (s_{g} (x)), \bar{μ} = \bar{F} (t)

and use Lemma 4.1, which gives (4.17).

Lemma 4.4. Let ε₂ ∊ (0, 1) be fixed and condition (3.13) be fulfilled. Assume, moreover, that

2 N_{popmax} \geq N_{pop} (t) \geq κ N_{popmax}, (4.22)

where

κ \in (c_{F}^{- 1}, 1) (4.23)

and c_F > 1 is defined by (3.13). Let us define the event 𝒟_ε₂ (t) by

𝒟_{ε_{2}} (t) = {| N_{pop} (t + 1) - N_{popmax} | < ε_{2} N_{popmax}} . (4.24)

Then one has

Pr (𝒟_{ε_{2}} (t)) > 1 - η (ε_{2}), (4.25)

where

\begin{array}{l} η (ε_{2}) = \\ exp (- d (\tilde{ε}) κ N_{popmax}) + \\ exp (- d (- \tilde{ε}) κ N_{popmax}) + \\ 2 exp (- \frac{2 ε_{2}^{2} N_{popmax}}{2 (1 + \tilde{ε}) C_{F}}) (4.26) \end{array}

and

\tilde{ε} = 1 - {(κ c_{F})}^{- 1} . (4.27)

Proof. Let ξ(x) be random quantities defined as follows: ξ(x) = 1 if the individual x survives the massacre (see point 3 of our model from Subsection 2.4), and ξ(x) = 0 otherwise. Let $X^{'}$ (t) be the set of progeny produced by all individuals from the population. Then the number N_sur(t) = N_pop(t + 1) of surviving progeny can be computed as follows:

N_{sur} (t) = \sum_{x \in X^{'} (t)} ξ (x) .

Note that | $X^{'}$ (t)| = $\bar{N}$ (t). Moreover, Eξ(x ) = N_popmax/ $\bar{N}$ (t) for $\bar{N}$ (t) ≥ N_popmax. Therefore, if $\bar{N}$ (t) ≥ N_popmax then

E N_{sur} (t) = N_{popmax} . (4.28)

Let us define the event

ℬ (t) = {\bar{N} (t) \in J_{\tilde{ε}} (t)}, (4.29)

where the interval J_ε(t) is defined by (4.16) and $\tilde{ε}$ is defined by (4.27). By (4.4) we have

\begin{array}{l} Pr (Not 𝒟_{ε_{2}} (t)) \leq Pr (Not 𝒟_{ε_{2}} (t) | ℬ (t)) + \\ Pr (Not ℬ (t)) . (4.30) \end{array}

Now we apply the Hoeffding inequality (4.15). For each ε₂ > 0 we obtain

Pr (Not 𝒟_{ε_{2}} (t)) < 2 exp (- \frac{2 ε_{2}^{2} E N_{sur} {(t)}^{2}}{\bar{N} (t)}) .

If ℬ (t) takes place, then $\bar{N}$ (t) ≥ N_popmax and consequently

\frac{2 ε_{2}^{2} E N_{sur} {(t)}^{2}}{\bar{N} (t)} > \frac{2 ε_{2}^{2} N_{popmax}}{2 (1 + \tilde{ε}) C_{F}} . (4.31)

Therefore,

Pr (Not 𝒟_{ε_{2}} (t) | ℬ (t)) < 2 exp (- \frac{2 ε_{2}^{2} N_{popmax}}{2 (1 + \tilde{ε}) C_{F}}) . (4.32)

Moreover, by Lemma 4.3

\begin{array}{l} Pr (Not ℬ (t)) < exp (- d (\tilde{ε}) κ N_{popmax}) + \\ exp (- d (- \tilde{ε}) κ N_{popmax}) . (4.33) \end{array}

Inequalities (4.30), (4.32) and (4.33) prove (4.25).
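The content of Lemma 4.4 can be illustrated by a small stochastic simulation. The sketch below is not the authors' code and rests on assumptions that should be checked against Subsection 2.4: every individual produces a Poisson number of progeny with mean F (as the use of Lemma 4.1 suggests), and in the massacre each of the \bar{N}(t) progeny survives independently with probability N_popmax/\bar{N}(t) whenever \bar{N}(t) exceeds N_popmax. The fitness values, seed and sizes are hypothetical.

```python
import math
import random

def poisson(rng, mean):
    """Knuth's multiplication method for a Poisson sample (adequate for small means)."""
    limit, k, prod = math.exp(-mean), 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k

def next_population_size(rng, fitnesses, n_popmax):
    """One generation: Poisson progeny, then the massacre in its assumed form."""
    progeny = sum(poisson(rng, f) for f in fitnesses)    # total progeny before massacre
    if progeny <= n_popmax:
        return progeny                                   # nobody is removed
    keep = n_popmax / progeny                            # survival probability E xi(x)
    return sum(1 for _ in range(progeny) if rng.random() < keep)

def simulate_sizes(seed=0, n_popmax=2000, generations=20):
    rng = random.Random(seed)
    n_pop, sizes = n_popmax, []
    for _ in range(generations):
        # Hypothetical fitness values in [1.5, 2.5], so that min F > c_F > 1 as in (3.13).
        fitnesses = [1.5 + 0.5 * (i % 3) for i in range(n_pop)]
        n_pop = next_population_size(rng, fitnesses, n_popmax)
        sizes.append(n_pop)
    return sizes

sizes = simulate_sizes()
print(min(sizes), max(sizes))
```

In runs of this kind the population size stays within a few percent of N_popmax, in line with the exponentially small failure probability η(ε₂).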

The following lemma, in particular, allows us to obtain equations (2.9) and (2.10) in the limit of infinite populations and for small mutation probabilities.

Recall that $\bar{N} (s, t)$ denotes the number of non-mutated progeny generated by the individuals with the genotype s before the massacre. Let N_sur(s, t) be the number of those progeny that survived after that massacre.

Lemma 4.5. Let ε₀ be a positive number satisfying (4.75) and

κ N_{popmax} < N_{pop} (t) < 2 N_{popmax} . (4.34)

Then one has

N (s, t + 1) > (1 - ε_{0}) F (s) \bar{F} {(t)}^{- 1} N (s, t) (4.35)

with the probability Pr_s,t,+ such that

{Pr}_{s, t, +} > 1 - \sum_{i = 1}^{5} R_{i} (s, t), (4.36)

where

\begin{array}{l} R_{1} (s, t) = exp (- d (1) c_{F} N_{pop} (t)) + \\ exp (- d (- 1) c_{F} N_{pop} (t)), \end{array} (4.37)

\begin{array}{l} R_{2} (s, t) = exp (- d (0.5) c_{F} N (s, t)) + \\ exp (- d (- 0.5) c_{F} N (s, t)), \end{array} (4.38)

R_{3} (s, t) = 2 exp (\frac{- ε_{0}^{2}}{16 C_{F}^{2}} c_{F} N (s, t)), (4.39)

\begin{array}{l} R_{4} (s, t) = exp (- 0.5 (2 ln 2 p_{mut} + \\ (1 - p_{mut}) ln (\frac{1 - 2 p_{mut}}{1 - p_{mut}})) \cdot \\ c_{F} N (s, t)), (4.40) \end{array}

\begin{array}{l} R_{5} (s, t) = exp (- d (1) c_{F} N (s, t)) + \\ exp (- d (- 1) c_{F} N (s, t)), (4.41) \end{array}

Similarly,

N (s, t + 1) < (1 + ε_{0}) F (s) \bar{F} {(t)}^{- 1} N (s, t) (4.42)

with the probability Pr_s,t,− such that

{Pr}_{s, t, -} > 1 - \sum_{i = 1}^{5} R_{i} (s, t) . (4.43)

Proof. Step 1, estimates of fluctuations. First let us estimate the fluctuations of the number $\bar{N} (s, t)$ . For each ε₂ > 0 let us define the event

A_{s, ε_{2}} (t) = {| \bar{N} (s, t) - E \bar{N} (s, t) | > ε_{2} E \bar{N} (s, t)} . (4.44)

By Lemma 4.1 one has

\begin{array}{l} Pr (A_{s, ε_{2}} (t)) < exp (- d (ε_{2}) E \bar{N} (s, t)) + \\ exp (- d (- ε_{2}) E \bar{N} (s, t)) . \end{array} (4.45)

Note that

E \bar{N} (s, t) = F (s) N (s, t) > c_{F} N (s, t) . (4.46)

As a result, by (4.46) we obtain

\begin{array}{l} Pr (A_{s, ε_{2}} (t)) < exp (- d (ε_{2}) c_{F} N (s, t)) + \\ exp (- d (- ε_{2}) c_{F} N (s, t)) . (4.47) \end{array}

Step 2. Here we estimate the number of progeny that survive the massacre procedure (point 3 of the population dynamics model, see Subsection 2.4). Let X' (s, t) be the set of progeny produced by individuals with the genotype s. Then the number N_sur(s, t) of surviving progeny x belonging to the set X' (s, t) is

N_{sur} (s, t) = \sum_{x \in X^{'} (s, t)} ξ (x),

where ξ(x) are defined in the proof of the previous Lemma. For ε₃ > 0 we consider the event

\begin{array}{l} A_{sur, s, ε_{3}} (t) = {| N_{sur} (s, t) - E N_{sur} (s, t) | \\ > ε_{3} E [N_{sur} (s, t)]}, (4.48) \end{array}

Let us estimate the probability Pr(𝒜_sur,s(t)). According to the Hoeffding Theorem (4.15)

\begin{array}{l} Pr (A_{sur, s, ε_{3}} (t)) < \\ 2 exp (- 2 ε_{3}^{2} E {[N_{sur} (s, t)]}^{2} \bar{N} {(s, t)}^{- 1}) . (4.49) \end{array}

Note that ξ(x) and ξ(y) are independent quantities for different x and y, thus under the condition $\bar{N} (t)$ > N_popmax

E N_{sur} (s, t) = \sum_{x \in X^{'} (s, t)} E ξ (x) = \bar{N} (s, t) \frac{N_{popmax}}{\bar{N} (t)},

therefore,

Pr (A_{sur, s, ε_{3}} (t)) < 2 exp (- 2 ε_{3}^{2} \bar{N} (s, t) (\frac{N_{popmax}}{\bar{N} (t)})^{2}) . (4.50)

Let us define the events ℬ_s(t) and ℬ(t) by

\begin{array}{l} ℬ_{s} (t) = {\bar{N} (s, t) \\ > (1 - ε_{2}) E \bar{N} (s, t)}, (4.51) \end{array}

\begin{array}{l} ℬ (t) = {\bar{N} (t) \\ < (1 + ε_{1}) E \bar{N} (t)} . (4.52) \end{array}

Then using (4.4) one has

\begin{array}{l} \Pr (A_{sur, s, ε_{3}} (t)) \leq \Pr (A_{sur, s} (t) | ℬ_{s} (t) ℬ (t)) + \\ \Pr (Not ℬ_{s} (t)) + \\ \Pr (Not ℬ (t)) . (4.53) \end{array}

We observe that under conditions ℬ_s(t) and ℬ(t)

\bar{N} (s, t) {(\frac{N_{popmax}}{\bar{N} (t)})}^{2} > (1 - ε_{2}) {(1 + ε_{1})}^{- 2} E \bar{N} (s, t) {(\frac{N_{popmax}}{E \bar{N} (t)})}^{2} . (4.54)

In that estimate let us set ε₂ = 0.5 and ε₁ = 1. Taking into account that E $\bar{N} (t)$ = $\bar{F} (t)$ N_pop(t) < 2C_FN_popmax, we have that

\Pr (A_{sur, s, ε_{3}} (t) | ℬ_{s} (t) ℬ (t)) < {\bar{R}}_{3} (ε_{3}, s, t), (4.55)

where

{\bar{R}}_{3} (ε_{3}, s, t) = 2 exp (- 0.25 ε_{3}^{2} C_{F}^{- 2} c_{F} N (s, t)) . (4.56)

Moreover, according to (4.47)

\Pr (Not ℬ_{s} (t)) < R_{2} (s, t), (4.57)

and due to (4.17)

\Pr (Not ℬ (t)) < R_{1} (s, t), (4.58)

where R₁,R₂ are defined by (4.37) and (4.38). Finally,

\Pr (A_{sur, s, ε_{3}} (t)) < R_{1} (s, t) + R_{2} (s, t) + {\bar{R}}_{3} (ε_{3}, s, t) . (4.59)

Step 3, estimate of the number of mutants.

Let us estimate how many individuals with genotypes s can mutate. The probability of mutation is p_mut. Let N_mut(s, t) be the number of such mutants. Let us define the event 𝒜_mut,s(t) by

A_{mut, s} (t) = {N_{mut} (s, t) > 2 p_{mut} \bar{N} (s, t)}, (4.60)

Since the random quantity N_mut(s, t) is subject to the Bernoulli law, we can apply the Chernoff-Hoeffding inequality (4.13). Then we obtain that

\Pr (A_{mut, s} (t)) < exp (- D (2 p_{mut} ‖ p_{mut}) \bar{N} (s, t)), (4.61)

where, according to definition (4.14) of D(x||y), one has

D (2 p_{mut} ‖ p_{mut}) = g (p_{mut})

and g is defined by (4.11).

Using (4.4) one has

\Pr (A_{mut, s} (t)) \leq \Pr (A_{mut, s} (t) | ℬ_{s} (t)) + \Pr (Not ℬ_{s} (t)) . (4.62)

As a result, by Lemma 4.3 one finds

\Pr (A_{mut, s} (t)) \leq R_{4} + R_{5}, (4.63)

where R₄,R₅ are defined by (4.40) and (4.41).

To prove (4.35), we set ε₃ = ε₀/2. Taking into account condition (4.75) for ε₀, we see that if both events Not 𝒜_mut,s(t) and Not 𝒜_{sur,s,ε₀/2}(t) take place, then inequality (4.35) is fulfilled. Thus

\begin{array}{l} \Pr (Not A_{mut, s} (t) Not A_{sur, s, ε_{0} / 2} (t)) \geq \\ 1 - \Pr (A_{mut, s} (t)) - \Pr (A_{sur, s, ε_{0} / 2} (t)) > \\ 1 - \sum_{i = 1}^{5} R_{i}, \end{array}

where R_i are defined by (4.37)–(4.41).

Finally, taking into account the results of steps 1, 2 and 3, we see that estimate (4.35) holds with the probability Pr_s,t,+. This completes the proof of (4.35). The second inequality (4.42) can be obtained in the same way.

4.3 Remaining part of the proof of Theorem 3.2

We use the same idea as in the proof of Theorem 3.1, but first we establish uniform bounds for the population size and other quantities involved in the proof.

Step 1 Here we estimate the population size. Let us set

ε_{2} = 1 - κ > 0

in Lemma 4.4. Let us consider the events 𝒟_ε₂(t) defined by (4.24) in Lemma 4.4. If the events 𝒟_ε₂(t) take place for all t ∈ [T₁, T₁ + T_c] and N_pop(0) = N_popmax, we have that

2 N_{popmax} > N_{pop} (t) (4.64)

> κ N_{popmax} \forall t \in [T_{1}, T_{1} + T_{c}] . (4.65)

Then conditions (3.15), (3.16) of Theorem 3.2 imply

\begin{array}{l} N (s, t) > κ p_{0} N_{popmax}, N (\bar{s}, t) > κ p_{1} N_{popmax} . \end{array} (4.66)

Those inequalities imply the following estimates for the quantities R_i defined by (4.37)–(4.41):

R_{i} (s, t) < q_{i} (p_{0}), R_{i} (\bar{s}, t) < q_{i} (p_{1}), (4.67)

where q_i are defined by

\begin{array}{l} q_{1} = \exp (- (2 \ln 2 - 1) c_{F} κ N_{popmax}) + \\ \exp (- c_{F} κ N_{popmax}), (4.68) \end{array}

\begin{array}{l} q_{2} (p) = 2 \exp (- (3 / 2 \ln (3 / 2) - 1 / 2) \\ c_{F} p κ N_{popmax}) + \\ \exp (- 1 / 2 (1 - \ln 2) c_{F} p \\ κ N_{popmax}), (4.69) \end{array}

q_{3} (p) = 2 \exp (- \frac{ε_{0}^{2}}{16 C_{F}^{2}} κ c_{F} p N_{popmax}), (4.70)

q_{4} (p) = \exp (- 0.5 U (p_{mut}) c_{F} p κ N_{popmax}), (4.71)

\begin{array}{l} q_{5} (p) = \exp (- (2 \ln 2 - 1) c_{F} κ p N_{popmax}) + \\ \exp (- c_{F} p κ N_{popmax}), \end{array} (4.72)

where

U (p) = 2 \ln 2 p + (1 - p) \ln ((1 - 2 p) / (1 - p)), (4.73)

and

\begin{array}{l} \tilde{q} = \exp (- d (\tilde{ε}) κ N_{popmax}) + \\ \exp (- d (- \tilde{ε}) κ N_{popmax}) + \\ 2 \exp (- \frac{2 {(1 - κ)}^{2} N_{popmax}}{2 (1+ \tilde{ε}) C_{F}}) . (4.74) \end{array}

where $\tilde{ε}$ = 1 – (κc_F)^–1, and

\begin{array}{l} ε_{0} = \frac{1 - \exp (- b_{j} δ_{j} T_{c} / 2)}{1 + \exp (- b_{j} δ_{j} T_{c} / 2)} \\ > 4 p_{m u t} C_{F} . (4.75) \end{array}

For each p ∈ (0, 1) let us define an auxiliary function

ρ (p) = q_{1} + q_{2} (p) + q_{3} (p) + q_{4} (p) + q_{5} (p) + \tilde{q}, (4.76)

where q_i, $\tilde{q}$ are defined by relations (4.68)–(4.74). We can find the asymptotics of ρ under the natural assumptions that p_mut → 0 and N_popmax → ∞ while all other parameters are fixed. Then the leading term in the right hand side of (4.36) is q₄ and $U (p_{mut}) = (2 \ln 2 - 1) p_{mut} + O (p_{mut}^{2}) .$ As a result, we have

ρ (p) = \exp (- (\ln 2 - 1 / 2) p_{mut} c_{F} p κ N_{popmax}) (1 + o (1)), p_{mut} \to 0. (4.77)
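The expansion of U used here is easy to confirm numerically. The short Python check below evaluates U from (4.73) for a few hypothetical small values of p_mut and compares U(p_mut)/p_mut with the leading coefficient 2 ln 2 − 1; half of that coefficient is the factor ln 2 − 1/2 in the exponent of (4.77), since q₄ contains 0.5 U(p_mut).

```python
import math

def u(p):
    """U(p) from (4.73)."""
    return 2 * math.log(2) * p + (1 - p) * math.log((1 - 2 * p) / (1 - p))

leading = 2 * math.log(2) - 1        # coefficient of p_mut in the expansion of U
for p_mut in (1e-2, 1e-3, 1e-4):     # hypothetical mutation probabilities
    print(p_mut, u(p_mut) / p_mut, leading)
```

The difference between the two printed columns shrinks proportionally to p_mut, as expected from the O(p_mut²) remainder.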

Step 2. Let Q(t) be defined by (3.8) and, moreover, let j ∈ I₋. We use Lemma 4.5 inductively for genotypes s and $\bar{s}$ . Let us set

θ = \frac{1 - ε_{0}}{1 + ε_{0}},

where ε₀ is defined by (4.75). We remark that the inequality

Q (T_{c} + t + 1) \geq Q (T_{c} + t) θ \exp (b_{j} δ_{j} T_{c}) (4.78)

holds with a probability Pr_Q,t > 0. Let us obtain a uniform estimate of that probability. Let ℰ(t) be the event that (4.78) holds at the step t. Using (4.4) we have

\begin{array}{l} Pr (Not ℰ (t)) \leq Pr (Not ℰ (t) | 𝒟_{ε_{2}} (t)) + \\ Pr (Not 𝒟_{ε_{2}} (t)), (4.79) \end{array}

where, according to Lemma 4.4, the probability of the event Not 𝒟_ε₂(t) is less than η, where η is defined by (4.26), and

Pr (Not ℰ (t) | 𝒟_{ε_{2}} (t)) < \tilde{q} + ρ (p_{0}) + ρ (p_{1}) .

We conclude by (4.5) that

Pr_{Q, t} > Z, Z = 1 - \tilde{q} - ρ (p_{0}) - ρ (p_{1}), (4.80)

where $\tilde{q}$ is defined by (4.74). This estimate is uniform in t ∈ [1, . . . , T_c]. By (4.80) we obtain then that the inequality

Q (T_{c} + T_{1}) \geq Q (T_{1}) \bar{θ}, \bar{θ} = \frac{1 - ε_{0}}{1 + ε_{0}} exp (b_{j} δ_{j} T_{c}) . (4.81)

is satisfied with the probability Pr_v such that

{Pr}_{v} > Z^{T_{c}} . (4.82)

For ε₀ defined by (4.75) one has

Q (T_{c} + T_{1}) \geq Q (T_{1}) \exp (b_{j} δ_{j} T_{c} / 2) .
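The last step relies on an elementary identity: for ε₀ chosen as in (4.75), the factor (1 − ε₀)/(1 + ε₀) equals exp(−b_j δ_j T_c/2), so that the constant in (4.81) reduces to exp(b_j δ_j T_c/2). The Python sketch below checks this numerically; the variable a is a positive stand-in for the product b_j δ_j T_c as it appears in (4.75), and the sample values are hypothetical.

```python
import math

def eps0(a):
    """epsilon_0 as in (4.75), with a standing for b_j * delta_j * T_c."""
    x = math.exp(-a / 2)
    return (1 - x) / (1 + x)

def theta_bar(a):
    """The factor in (4.81): (1 - eps0) / (1 + eps0) * exp(a)."""
    e = eps0(a)
    return (1 - e) / (1 + e) * math.exp(a)

for a in (0.1, 1.0, 5.0):            # hypothetical values of the product
    print(a, theta_bar(a), math.exp(a / 2))
```

Equivalently, ε₀ = tanh(a/4), which makes it clear that ε₀ lies in (0, 1) for every positive a.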

Now, repeating the same arguments as at the end of the proof of Theorem 3.1 and taking into account asymptotics (4.77), we obtain the conclusion of Theorem 3.2.

5 Discussion

In this paper, we proposed a model for fitness landscape learning, which extends earlier work by 7–9 in two ways. First, we use hybrid circuits involving two kinds of variables. The first class of variables are real valued in the interval (0, 1) and can be interpreted as relative levels of phenotypic traits, other variables are Boolean and can be interpreted as genes. Second, we use a threshold scheme of regulation, which is inspired by ideas of the paper by 20. All variables are involved in gene regulation via thresholds.

The work presented here is a major extension of a long term effort to explicitly model the effects of phenotypic buffering in evolution by considering a class of Boolean and mixed Boolean-continuous models in which the phenotype is represented explicitly and the degree of phenotypic buffering can be controlled in various ways. For example, we have demonstrated that the idea of an “evolutionary capacitor”^42,43 can be implemented by explicit control of phenotypic buffering in a hub-and-spokes architecture²³ and that in a more general class of genetic architecture numerical simulations show that an intermediate level of buffering is optimal for evolution in a changing environment³¹.

The results reported here are very promising, since they are consistent with the results of recent experiments by 44 and 45 on heat shock stress. The essential mechanism is the exploration of the fitness landscape by the genetic network in such a way that future mutations are more likely to be adaptive. We have shown that, at least for some fitness landscapes, rapid evolutionary changes, perhaps instances of the “hopeful monsters” of Goldschmidt⁴⁶, can be created by a combination of random small mutations and epigenetic effects. The main idea is that small mutations pave the way for large epigenetic or genetic changes. The hypothetical mechanism that we propose can be outlined as follows (see Figure 4, Figure 5).

Figure 4. This graph illustrates Goldschmidt’s leaps.

At the initial moment the trait expressions take the values x = 0.5, y = 0.5. According to Fisher’s ideas, random large mutations decrease the fitness F = K_F exp(W). (Changes of x = f₁, y = f₂, which are induced by mutations, are shown by red vectors.) Thus such mutations produce non-viable organisms.

Figure 5. This plot illustrates the main ideas of evolution based on the fitness landscape learning.

At the initial time the trait expressions take the values x = 0.5, y = 0.5. Evolutionary changes go in two stages. First we make small random mutations (shown by red vectors), which explore the fitness landscape. If such a mutation is not eliminated from the population, this means that a correct evolution direction is found, and the gene regulation system makes a big leap (shown by the green vector) in the direction of that small mutation. Such a two-step model can be called clever Goldschmidt leaps. Note that evolution is gradual, and the existence of clusters of almost identical genes involved in the same QTL increases the chances to create a clever Goldschmidt hopeful monster.

This network modifies the thresholds by a simple feedback mechanism. Although we are not aware of clear experimental evidence for the existence of such a mechanism, we nevertheless think that it can be connected with regulation via enhancers^47,48, where enhancer action is described by deep network models based on thermodynamics and chemical kinetics, and those models contain threshold parameters. Alternative variants involve modifications of weights. To some extent, both mechanisms are mathematically equivalent. However, regulation via thresholds has an important advantage: it makes phenotypes robust with respect to mutations. In this second version of the paper we included a subsection about regulation and two pictures, which show results of numerical simulations based on the “strong selection weak mutation” (SSWM) algorithm. As for the thresholds, they of course should have different signs. The key question is how an individual obtains information about the fitness landscape. In our model that information is the sign of b_j. If b_j is positive the corresponding threshold must be negative; otherwise, it should be positive. Actually, we think that the genes of individuals have that information! According to the main theorems proved in this paper, the fact that a mutated individual survives for a sufficiently long period of time gives us that information. The very fact of the existence of the individual carries the most important information. Look at the mutations of its genotype and you will know where evolution should go!

As described in Subsection 2.5, we assume that the expression of genes involved in the expression of phenotypic traits depends on threshold parameters h_j, which take three values: a large negative one, a neutral value close to zero and a large positive one. Initially the threshold parameter h_j is close to zero, and thus the phenotypic trait is sensitive to even small mutations. Those mutations play a fundamental role, working as scouts that explore the environment (see Figure 4). If a mutation has occurred and the corresponding mutant has survived for T_c ≫ 1 generations, then according to Theorem 3.1 and Theorem 3.2 these events mean that the mutation increases fitness, which allows the network to estimate the correct direction of evolution. Gene regulation then detects that increase and changes the threshold according to simple rules. Namely, if the trait is less expressed in the mutant than in its wild-type parent, the gene regulation system decreases the threshold to the large negative value. Conversely, if the trait is more strongly expressed in the mutant, the gene regulation system increases the threshold to the large positive value. This simple regulatory control not only sharply reduces the number of mutations needed for adaptation, but also canalizes the phenotype, since for large thresholds the trait expression level becomes insensitive to mutations. We suppose that these threshold modifications can be inherited.
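The canalization claim, that a large threshold makes trait expression insensitive to mutations, can be checked numerically. The sketch below assumes that a trait is a logistic sigmoid of a weighted sum of Boolean gene states minus the threshold; the sign convention, the weights, the genotype and the value ±10 for a "large" threshold are assumptions chosen for illustration only. The conclusion does not depend on the sign convention.

```python
import math

def sigmoid(u):
    """Logistic function, used here as the assumed sigmoidal response."""
    return 1.0 / (1.0 + math.exp(-u))

def trait_expression(genes, weights, threshold):
    """Assumed form: sigmoid(sum_i w_i * s_i - h)."""
    drive = sum(w * s for w, s in zip(weights, genes))
    return sigmoid(drive - threshold)

def mutational_sensitivity(genes, weights, threshold):
    """Largest change in trait expression caused by flipping a single gene."""
    base = trait_expression(genes, weights, threshold)
    largest = 0.0
    for i in range(len(genes)):
        mutant = list(genes)
        mutant[i] = 1 - mutant[i]  # point mutation: flip one Boolean gene
        change = abs(trait_expression(mutant, weights, threshold) - base)
        largest = max(largest, change)
    return largest

# Hypothetical genotype and weights.
genes = [1, 0, 1, 0]
weights = [0.8, -0.6, 0.5, 0.7]
sensitivity = {h: mutational_sensitivity(genes, weights, h) for h in (-10.0, 0.0, 10.0)}
```

With the neutral threshold a single gene flip shifts the trait noticeably, whereas at either large threshold the sigmoid is saturated and the same flips have almost no effect.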

We therefore propose the following mechanism: small mutations serve as scouts that find the way for large epigenetic or genetic changes, which can be performed by the gene regulatory system.

The mechanism may also explain the results of Mitchell et al.^4 on the prediction of environmental changes. Indeed, suppose that the environment varies in time. The first, perhaps relatively small, variations can trigger the threshold mechanism described above. As a result, the population will be adapted to the subsequent changes in advance.

Our results show that evolution can proceed rapidly because fitness landscape learning reduces the number of mutations required for adaptive change.

The primary limitation of our results is that the representation of the evolving genetic network is limited to the network of genes controlling the phenotype, represented here by the Boolean strings s. Other model variables represent the coarse-grained activities of genes. One class is the terminal differentiation genes represented by w_ij, and another is the genes or epigenetic factors controlling the thresholds h and their associated learning rules. A more careful consideration of the relationship of these moieties to observable molecular entities is an important objective of future work. At the mathematical level, the key analytical results were obtained in a simplified context that falls short of a realistic level of pleiotropy and thus of the level of NP-hard complexity exhibited by fully pleiotropic forms of our model. We believe that our analytical results can be generalized, which we plan to address in future work.

Data availability

All data underlying the results are available as part of the article and no additional source data are required.

References

1. Orr HA: The genetic theory of adaptation: a brief history. Nat Rev Genet. 2005; 6(2): 119–127.
2. Orr HA, Coyne JA: The genetics of adaptation: a reassessment. Am Nat. 1992; 140(5): 725–742.
3. Zeyl C: The number of mutations selected during adaptation in a laboratory population of Saccharomyces cerevisiae. Genetics. 2005; 169(4): 1825–1831.
4. Mitchell A, Romano GH, Groisman B, et al.: Adaptive prediction of environmental changes by microorganisms. Nature. 2009; 460(7252): 220–224.
5. Watson RA, Szathmáry E: How Can Evolution Learn? Trends Ecol Evol. 2016; 31(2): 147–157.
6. Chastain E, Livnat A, Papadimitriou C, et al.: Algorithms, games, and evolution. Proc Natl Acad Sci U S A. 2014; 111(29): 10620–10623.
7. Parter M, Kashtan N, Alon U: Facilitated variation: how evolution learns from past environments to generalize to new environments. PLoS Comput Biol. 2008; 4(11): e1000206.
8. Valiant LG: Evolvability. Journal of the ACM. 2009; 56(1): 1–21.
9. Livnat A, Papadimitriou C, Rubinstein A, et al.: Satisfiability and evolution. In Annu IEEE Symp Found Comput Sci FOCS. 2014; 524–530.
10. Nagylaki T: The evolution of multilocus systems under weak selection. Genetics. 1993; 134(2): 627–647.
11. Arora S, Hazan E, Kale S: The Multiplicative Weights Update Method: A Meta-Algorithm and Applications. Theory Comput. 2012; 8: 121–164.
12. Paixão T, Badkobeh G, Barton N, et al.: Toward a unifying framework for evolutionary processes. J Theor Biol. 2015; 383: 28–43.
13. Mühlenbein H, Schlierkamp-Voosen D: Predictive Models for the Breeder Genetic Algorithm I. Continuous Parameter Optimization. Evol Comput. 1993; 1(1): 25–49.
14. Chatterjee K, Pavlogiannis A, Adlam B, et al.: The time scale of evolutionary innovation. PLoS Comput Biol. 2014; 10(9): e1003818.
15. Heredia JP, Trubenová B, Sudholt D, et al.: Selection Limits to Adaptive Walks on Correlated Landscapes. Genetics. 2017; 205(2): 803–825.
16. Doerr B, Doerr C, Kötzing T: Solving Problems with Unknown Solution Length at (Almost) No Extra Cost. In Proceedings of the 2015 on Genetic and Evolutionary Computation Conference GECCO ’15. Madrid, Spain, ACM Press, 2015; 831–838.
17. Rudolph G: Finite Markov Chain Results in Evolutionary Computation: A Tour d’Horizon. Fundamenta Informaticae. 1998; 35(1–4): 67–89.
18. Wittkopp PJ, Carroll SB, Kopp A: Evolution in black and white: genetic control of pigment patterns in Drosophila. Trends Genet. 2003; 19(9): 495–504.
19. Wang Z, Liao BY, Zhang J: Genomic patterns of pleiotropy and the evolution of complexity. Proc Natl Acad Sci U S A. 2010; 107(42): 18034–18039.
20. Stern C: Selection for subthreshold differences and the origin of pseudoexogenous adaptations. Am Nat. 1958; 92(866): 313–316.
21. Mjolsness E, Sharp DH, Reinitz J: A connectionist model of development. J Theor Biol. 1991; 152(4): 429–453.
22. Arias M, le Poul Y, Chouteau M, et al.: Crossing fitness valleys: empirical estimation of a fitness landscape associated with polymorphic mimicry. Proc Biol Sci. 2016; 283(1829): pii: 20160391.
23. Grigoriev D, Reinitz J, Vakulenko S, et al.: Punctuated evolution and robustness in morphogenesis. Biosystems. 2014; 123: 106–113.
24. Franke J, Klözer A, de Visser JA, et al.: Evolutionary accessibility of mutational pathways. PLoS Comput Biol. 2011; 7(8): e1002134.
25. Kingman JFC: A simple model for the balance between selection and mutation. J Appl Probab. 1978; 15(1): 1–12.
26. Kauffman SA, Weinberger ED: The NK model of rugged fitness landscapes and its application to maturation of the immune response. J Theor Biol. 1989; 141(2): 211–245.
27. Aita T, Iwakura M, Husimi Y: A cross-section of the fitness landscape of dihydrofolate reductase. Protein Eng. 2001; 14(9): 633–638.
28. Leimar O, Tullberg B, James M: Mimicry, saltational evolution, and the crossing of fitness valleys. Chapter 16 in: E.I. Svensson and R. Calsbeek, eds. The Adaptive Landscape in Evolutionary Biology, 2012; 257–270.
29. Hinton GE, Nowlan SJ: How learning can guide evolution. Complex Systems. 1987; 1: 495–502.
30. Watson RA, Wagner GP, Pavlicev M, et al.: The evolution of phenotypic correlations and "developmental memory". Evolution. 2014; 68(4): 1124–1138.
31. Jiang P, Kreitman M, Reinitz J: The relationship between robustness and evolution. bioRxiv. 2018; 268862.
32. Kucharavy A, Rubinstein B, Zhu J, et al.: Robustness and evolvability of heterogeneous cell populations. Mol Biol Cell. 2018; 29(11): 1400–1409.
33. Savageau MA: Demand theory of gene regulation. II. Quantitative application to the lactose and maltose operons of Escherichia coli. Genetics. 1998; 149(4): 1677–1691.
34. Cook SA: The complexity of theorem-proving procedures. In Proc third Annu ACM Symp Theory Comput. STOC ’71. New York, USA, ACM Press. 1971; 151–158.
35. Levin LA: Universal enumeration problems (Russian). Probl Peredachi Inf. 1973; 9(3): 115–116.
36. Friedgut E: Sharp thresholds of graph properties, and the k-sat problem. J Am Math Soc. 1999; 12(04): 1017–1055.
37. Moore C, Mertens S: The Nature of Computation. Oxford University Press, 2011.
38. Mertens S, Mézard M, Zecchina R: Threshold values of random k-sat from the cavity method. Random Struct Algor. 2006; 28(3): 340–373.
39. Achlioptas D, Coja-Oghlan A: Algorithmic barriers from phase transitions. In Proceedings 49th FOCS. 2008; 793–802.
40. Gravner J, Pitman D, Gavrilets S: Percolation on fitness landscapes: effects of correlation, phenotype, and incompatibilities. J Theor Biol. 2007; 248(4): 627–645.
41. Wagner GP, Zhang J: The pleiotropic structure of the genotype-phenotype map: the evolvability of complex organisms. Nat Rev Genet. 2011; 12(3): 204–213.
42. Rutherford SL, Lindquist S: Hsp90 as a capacitor for morphological evolution. Nature. 1998; 396(6709): 336–342.
43. Masel J, Siegal ML: Robustness: mechanisms and consequences. Trends Genet. 2009; 25(9): 395–403.
44. Fanti L, Piacentini L, Cappucci U, et al.: Canalization by Selection of de Novo Induced Mutations. Genetics. 2017; 206(4): 1995–2006.
45. Klosin A, Casas E, Hidalgo-Carcedo C, et al.: Transgenerational transmission of environmental information in C. elegans. Science. 2017; 356(6335): 320–323.
46. Goldschmidt R: The Material Basis of Evolution. Yale Univ. Press, New Haven CT, 1940.
47. Barr KA, Martinez C, Moran JR, et al.: Synthetic enhancer design by in silico compensatory evolution reveals flexibility and constraint in cis-regulation. BMC Syst Biol. 2017; 11(1): 116.
48. Shen J, Petkova MD, Liu F, et al.: Toward deciphering developmental patterning with deep neural network. bioRxiv. 2018.

Competing interests

No competing interests were disclosed.

Grant information

The second author was supported by a grant from the Russian Ministry of Education, 2012-1.2.1-12-000-1013-016. Additionally, the second author was financially supported by the Government of the Russian Federation, Grant 08-08.

D. Grigoriev is grateful to the grant RSF 16-11-10075 and to both MCCME and MPI für Mathematik for wonderful working conditions and inspiring atmosphere.

J. Reinitz and S. Vakulenko were supported by US NIH grant RO1 OD010936 (formerly RO1 RR07801).

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Article Versions (2)

version 2 (revised). Published: 13 Sep 2019, 8:358. https://doi.org/10.12688/f1000research.18575.2

version 1. Published: 01 Apr 2019, 8:358. https://doi.org/10.12688/f1000research.18575.1

Copyright

© 2019 Reinitz J et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite this article

Reinitz J, Vakulenko S, Grigoriev D and Weber A. Adaptation, fitness landscape learning and fast evolution [version 2; peer review: 2 approved]. F1000Research 2019, 8:358 (https://doi.org/10.12688/f1000research.18575.2)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

Open Peer Review

Version 2 (revised), published 13 Sep 2019

Reviewer Report 16 Oct 2019

Eors Szathmary, Evolutionary Systems Research Group, MTA Centre for Ecological Research, Tihany, Hungary; Parmenides Center for the Conceptual Foundations of Science, Pullach, Germany; Department of Plant Systematics, Ecology and Theoretical Biology, Eötvös University, Budapest, Hungary

Approved

https://doi.org/10.5256/f1000research.22586.r53938

Reviewer Report 27 Sep 2019

Aviv Bergman, Systems & Computational Biology Department, Albert Einstein College of Medicine, New York, NY, USA

Approved

https://doi.org/10.5256/f1000research.22586.r53939

Version 1, published 01 Apr 2019

Reviewer Report 13 Jun 2019

Yehonatan Sella, Systems & Computational Biology Department, Albert Einstein College of Medicine, New York, NY, USA

Aviv Bergman, Systems & Computational Biology Department, Albert Einstein College of Medicine, New York, NY, USA

Approved with Reservations

https://doi.org/10.5256/f1000research.20332.r48858

In the current work the authors study a population genetics model in which fitness is a linear function of a set of phenotypic traits, and where the genotype-to-phenotype map is given by a linear transformation composed with sigmoidal functions. Despite the seeming simplicity of the fitness function, the authors make the case that optimizing fitness is a hard, NP-complete, problem. Under this model, they study the extent to which the fitness landscape (that is, the question of which phenotypic traits contribute positively to fitness, and which contribute negatively) can be inferred from the distribution of these traits in the population after being subject to evolution for a moderate span of time. They then connect this question with that of whether learning algorithms (potentially epigenetic in nature) can help optimize and speed up evolution by learning the fitness landscape.

Toward the goal of inferring the fitness landscape, the authors prove two theorems. Theorem 3.1 concerns a simplified model of an infinite population with no mutations, while Theorem 3.2 concerns a more complex model of a finite population with mutation, stochastic number of descendants and a culling process. In either case, the result is that under certain assumptions, if a mutant genotype is present in the population with high enough frequency after a long enough period of elapsed time, then we can confidently infer that any phenotypic trait differential between the wildtype and the mutant is associated with a higher fitness.

While Theorem 3.1 ignores mutation, even Theorem 3.2 seems at odds with mutation-selection balance. Even a mutant with lower fitness will be present in the population at a frequency that is on the order of the rate of mutation, while the theorem seems to claim that the frequency of such a mutant will be exponentially small with high probability. The authors should resolve this apparent discrepancy. In addition, the main ideas of the proof would be more clearly communicated if the authors would include a treatment of the intermediate model of an infinite population with mutation.

As for the discussion of the way learning can speed up the evolutionary process, this part of the paper remains unclear and underdeveloped. The authors discuss a two-step evolutionary process in which the first step consists of small mutations in order to explore the fitness landscape, and the next step involves changing the thresholds involved in the genotype-to-phenotype map in a way that promotes phenotypic traits associated with higher fitness. While this idea is interesting and worth exploring, a few issues arise. A conceptual issue remaining to be addressed is whether the threshold h_i is part of the genotype and what mechanisms are needed to alter its value. According to the authors, the h_i’s can be modified genetically or epigenetically. If epigenetically, it is not apparent what are the environmental cues that will lead to such learning, let alone the actual mechanism of modifying them. If genetically, it is similarly unclear in what way the learning of the fitness landscape is being stored, if at all, in the genotype, and what is the connection to the Theorems of chapter 3. The theorems in chapter 3 rely on observing the frequency of a genotype in the population, but such information is not stored in individual genotypes. Additionally, if the thresholds are understood to be variable and subject to selection, then the fitness-maximization problem in fact becomes easy (in contrast to the prior analysis of it as NP-complete), unless we impose restrictions on the range of the threshold. Indeed, one can set the thresholds at positive or negative infinity depending on whether the corresponding trait is positive or negative in order to effectively keep the trait on or off regardless of genotype.

Finally, as this manuscript addresses the relation between learning and the rate of evolution, it would benefit from including a reference to one of the most relevant and intuitive articles written in the late 80’s by Geoffrey E. Hinton & Steven J. Nowlan, “How Learning Can Guide Evolution” Complex Systems, 1, 495-502¹. In it, Hinton and Nowlan showed how learning alters the shape of the fitness landscape and thereby provides easier evolutionary paths towards sets of co-adapted alleles. Hinton and Nowlan demonstrated that this effect allows learning individuals to evolve faster than non-learners. Though the learning model presented by Hinton and Nowlan operates at “somatic” timescale, the analogy to mutations at the evolutionary timescale can be drawn.

To conclude, though it can be improved by the above suggestions, this article touches upon very interesting and important issues in the field of evolutionary biology which are still only lightly investigated, and highlights what might be a fruitful path towards better understanding.

Is the work clearly and accurately presented and does it cite the current literature? Partly
Is the study design appropriate and is the work technically sound? Yes
Are sufficient details of methods and analysis provided to allow replication by others? Yes
If applicable, is the statistical analysis and its interpretation appropriate? Not applicable
Are all the source data underlying the results available to ensure full reproducibility? No source data required
Are the conclusions drawn adequately supported by the results? Partly

References

1. Hinton GE, Nowlan SJ: How Learning Can Guide Evolution. Complex Systems. 1987; 1.

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Theoretical/Mathematical Systems and Evolutionary Biology.

We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however we have significant reservations, as outlined above.

Reviewer Report 23 Apr 2019

Eors Szathmary, Evolutionary Systems Research Group, MTA Centre for Ecological Research, Tihany, Hungary; Parmenides Center for the Conceptual Foundations of Science, Pullach, Germany; Department of Plant Systematics, Ecology and Theoretical Biology, Eötvös University, Budapest, Hungary

Approved with Reservations

https://doi.org/10.5256/f1000research.20332.r46602

This is a new paper in a series of exciting papers about phenotypic evolution, based on models of genes, genetic regulatory networks, and phenotypes.
There are many technical details in the paper that most biologists will find difficult to follow – I would put at least some of these items into an Appendix. Also, I am wondering how many (even theoretical) biologists would readily grasp the meaning of the K-SAT problem as defined. A more accessible formulation (in addition to the strictly technical one) could do a lot for easier understanding (one of the papers by the authors contains a didactic figure, for example).
I consider this paper a serious attempt at broadening our views of the role of learning dynamics in evolution, but I think the message could be made a lot clearer. The punch line in a way is in the Discussion that I find both intriguing and baffling.
First, as it is portrayed in this paper, the effect of genes is filtered through a genetic regulatory network. It seems, just like in Ref 40¹, that this filter itself does not evolve, or at least the dynamics are not given. In other words, how do the weights in the network evolve? Note that in the Watson-Wagner paper in Evolution² it was the evolution of the weights that was related to Hebbian dynamics. So, how do we stand on this?
Second, in this paper the genetic control of the thresholds is also kept implicit, but the evolution of threshold values does play an important role in the Goldschmidtian argument. Is the change in thresholds genetic or epigenetic? If the latter, then it ought to be part of the regulatory network. I guess this is what the statement “gene regulation detects that increase to change the threshold according to simple rules” means. How these rules would be implemented in mechanistic terms remains obscure. And how would they arise in evolution? The simple rule implies feedback from expression levels to threshold values.
Third, do I understand it correctly that here, in contrast to Ref 40¹, thresholds can be negative? But a large negative threshold value would mean that expression levels will increase, since negative times negative is positive (Eq. 2.4). If the statement “take three values: a large negative one, a neutral value close to zero and a large positive one” instead refers to gene expression levels rather than threshold values (the latter would in this case be nonnegative) then I do not understand the thought example. Increasing the threshold would reduce the expression level that was selected to go up in the first place!
These points seem to me crucial, so their clarification is badly needed. I would appreciate a few numerical examples along with at least hints to answers to the more conceptual questions on the dynamics.
Also, according to the cited Nagylaki dynamics³ the population evolves almost as if it were in linkage equilibrium – which cannot hold for the asexual population considered by the authors.
There are also some minor issues in the paper: in Eq. 2.4 “s” should be replaced by a sigma, and there are also examples where plurals and singulars in the same sentence do not match.

Is the work clearly and accurately presented and does it cite the current literature? Yes
Is the study design appropriate and is the work technically sound? Partly
Are sufficient details of methods and analysis provided to allow replication by others? Partly
If applicable, is the statistical analysis and its interpretation appropriate? Not applicable
Are all the source data underlying the results available to ensure full reproducibility? No source data required
Are the conclusions drawn adequately supported by the results? Partly

References

1. Jiang P, Kreitman M, Reinitz J: The relationship between robustness and evolution. bioRxiv. 2018.
2. Watson RA, Wagner GP, Pavlicev M, Weinreich DM, et al.: The evolution of phenotypic correlations and "developmental memory". Evolution. 2014; 68(4): 1124–1138.
3. Nagylaki T: The evolution of multilocus systems under weak selection. Genetics. 1993; 134(2): 627–647.

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Theoretical evolutionary biology

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Reviewer Report

16 Oct 2019 | for Version 2

Eors Szathmary, Evolutionary Systems Research Group, MTA Centre for Ecological Research, Tihany, Hungary; Parmenides Center for the Conceptual Foundations of Science, Pullach, Germany; Department of Plant Systematics, Ecology and Theoretical Biology, Eötvös University, Budapest, Hungary

Approved

This is a thought-provoking, hypothetical paper with considerable potential, duly revised.

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Theoretical evolutionary biology

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Reviewer Report

27 Sep 2019 | for Version 2

Aviv Bergman, Systems & Computational Biology Department, Albert Einstein College of Medicine, New York, NY, USA

Approved

Competing Interests

No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

17 Views

13 Jun 2019 | for Version 1

Yehonatan Sella, Systems & Computational Biology Department, Albert Einstein College of Medicine, New York, NY, USA

Aviv Bergman, Systems & Computational Biology Department, Albert Einstein College of Medicine, New York, NY, USA

17 Views Cite this report Responses(0)

Approved With Reservations

In the current work the authors study a population genetics model in which fitness is a linear function of a set of phenotypic traits, and where the genotype-to-phenotype map is given by a linear transformation composed with sigmoidal functions. Despite the seeming simplicity of the fitness function, the authors make the case that optimizing fitness is a hard, NP-complete, problem. Under this model, they study the extent to which the fitness landscape (that is, the question of which phenotypic traits contribute positively to fitness, and which contribute negatively) can be inferred from the distribution of these traits in the population after being subject to evolution for a moderate span of time. They then connect this question with that of whether learning algorithms (potentially epigenetic in nature) can help optimize and speed up evolution by learning the fitness landscape.

Toward the goal of inferring the fitness landscape, the authors prove two theorems. Theorem 3.1 concerns a simplified model of an infinite population with no mutations, while Theorem 3.2 concerns a more complex model of a finite population with mutation, stochastic number of descendants and a culling process. In either case, the result is that under certain assumptions, if a mutant genotype is present in the population with high enough frequency after a long enough period of elapsed time, then we can confidently infer that any phenotypic trait differential between the wildtype and the mutant is associated with a higher fitness.

While Theorem 3.1 ignores mutation, even Theorem 3.2 seems at odds with mutation-selection balance. Even a mutant with lower fitness will be present in the population at a frequency that is on the order of the rate of mutation, while the theorem seems to claim that the frequency of such a mutant will be exponentially small with high probability. The authors should resolve this apparent discrepancy. In addition, the main ideas of the proof would be more clearly communicated if the authors would include a treatment of the intermediate model of an infinite population with mutation.

As for the discussion of the way learning can speed up the evolutionary process, this part of the paper remains unclear and underdeveloped. The authors discuss a two-step evolutionary process in which the first step consists of small mutations in order to explore the fitness landscape, and the next step involves changing the thresholds involved in the genotype-to-phenotype map in a way that promotes phenotypic traits associated with higher fitness. While this idea is interesting and worth exploring, a few issues arise. A conceptual issue remaining to be addressed is whether the threshold h_iis part of the genotype and what mechanism are needed to alter its value. According to the authors, the h_i’s can be modified genetically or epigenetically. If epigenetically, it is not apparent what are the environmental cues that will lead to such learning let alone the actual mechanism of modifying them. If genetically, it is similarly unclear in what way the learning of the fitness landscape is being stored, if at all, in the genotype, and what is the connection to the Theorems of chapter 3. The theorems in chapter 3 rely on observing the frequency of a genotype in the population, but such information is not stored in individual genotypes. Additionally, if the thresholds are understood to be variable and subject to selection, then the fitness-maximization problem in fact becomes easy (in contrast to the prior analysis of it as NP-complete), unless we impose restrictions on the range of the threshold. Indeed, one can set the thresholds at positive or negative infinity depending on whether the corresponding trait is positive or negative in order to effectively keep the trait on or off regardless of genotype.

Finally, as this manuscript addresses the relation between learning and the rate of evolution, it would benefit from including a reference to one of the most relevant and intuitive articles written in the late 80’s by Geoffrey E. Hinton & Steven J. Nowlan, “How Learning Can Guide Evolution” Complex Systems, 1, 495-502¹. In it, Hinton and Nowlan showed how learning alters the shape of the fitness landscape and thereby provides easier evolutionary paths towards sets of co-adapted alleles. Hinton and Nowlan demonstrated that this effect allows learning individuals to evolve faster than non-learners. Though the learning model presented by Hinton and Nowlan operates at “somatic” timescale, the analogy to mutations at the evolutionary timescale can be drawn.

To conclude, though it can be improved by the above suggestions, this article touches upon very interesting and important issues in the field of evolutionary biology which are still only lightly investigated, and highlights what might be a fruitful path towards better understanding.

Is the work clearly and accurately presented and does it cite the current literature?

Partly
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Not applicable
Are all the source data underlying the results available to ensure full reproducibility?

No source data required
Are the conclusions drawn adequately supported by the results?

Partly

References

1. Hinton GE, Nowlan SJ: How Learning Can Guide Evolution. Complex Systems. 1987; 1.

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Theoretical/Mathematical Systems and Evolutionary Biology.

We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however we have significant reservations, as outlined above.

Reviewer Report

23 Apr 2019 | for Version 1

Eors Szathmary, Evolutionary Systems Research Group, MTA Centre for Ecological Research, Tihany, Hungary; Parmenides Center for the Conceptual Foundations of Science, Pullach, Germany; Department of Plant Systematics, Ecology and Theoretical Biology, Eötvös University, Budapest, Hungary

Approved With Reservations

This is a new paper in a series of exciting papers about phenotypic evolution, based on models of genes, genetic regulatory networks, and phenotypes.
There are many technical details in the paper that most biologists will find difficult to follow – I would put at least some of these items into an Appendix. Also, I am wondering how many (even theoretical) biologists would readily grasp the meaning of the K-SAT problem as defined. A more accessible formulation (in addition to the strictly technical one) could do a lot for easier understanding (one of the papers by the authors contains a didactic figure, for example).
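One way to make K-SAT more accessible is a toy instance. The sketch below is an illustration under stated assumptions, not the formulation used in the paper: Boolean variables stand in for binary gene states, each clause lists K = 3 literals as (variable index, required value) pairs, and an assignment is acceptable when every clause contains at least one satisfied literal. The formula and the names `satisfies` and `satisfying_assignments` are hypothetical.

```python
from itertools import product

# Each clause is a tuple of K = 3 literals; a literal is (variable_index, required_value).
formula = [
    ((0, True), (1, False), (2, True)),
    ((0, False), (1, True), (3, True)),
    ((1, True), (2, False), (3, False)),
]


def satisfies(assignment, formula):
    """True if every clause has at least one literal matched by the assignment."""
    return all(
        any(assignment[index] == value for index, value in clause)
        for clause in formula
    )


def satisfying_assignments(formula, n_vars):
    """Brute-force search over all 2**n_vars assignments (feasible only for tiny n_vars)."""
    return [
        assignment
        for assignment in product([False, True], repeat=n_vars)
        if satisfies(assignment, formula)
    ]


solutions = satisfying_assignments(formula, n_vars=4)
print(len(solutions), "satisfying assignments out of", 2 ** 4)
```

The brute-force search doubles in cost with every added variable, which is the intuition behind treating adaptation to many simultaneous constraints as a hard combinatorial problem.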
I consider this paper a serious attempt at broadening our views of the role of learning dynamics in evolution, but I think the message could be made a lot clearer. The punch line in a way is in the Discussion that I find both intriguing and baffling.
First, as it is portrayed in this paper, the effect of genes is filtered through a genetic regulatory network. It seems that, just like in Ref 40¹, this filter itself does not evolve, or at least its dynamics are not given. In other words, how do the weights in the network evolve? Note that in the Watson-Wagner paper in Evolution² it was the evolution of the weights that was related to Hebbian dynamics. So, how do we stand on this?
Second, in this paper the genetic control of the thresholds is also kept implicit, but the evolution of threshold values does play an important role in the Goldschmidtian argument. Is the change in thresholds genetic or epigenetic? If the latter, then it ought to be part of the regulatory network. I guess this is what the statement “gene regulation detects that increase to change the threshold according to simple rules” refers to. How these rules would be implemented in mechanistic terms remains obscure. And how would they arise in evolution? The simple rule implies feedback from expression levels to threshold values.
Third, do I understand it correctly that here, in contrast to Ref 40¹, thresholds can be negative? But a large negative threshold value would mean that expression levels will increase, since negative times negative is positive (Eq. 2.4). If the statement “take three values: a large negative one, a neutral value close to zero and a large positive one” instead refers to gene expression levels rather than threshold values (the latter would in this case be nonnegative) then I do not understand the thought example. Increasing the threshold would reduce the expression level that was selected to go up in the first place!
These points seem to me crucial, so their clarification is badly needed. I would appreciate a few numerical examples along with at least hints to answers to the more conceptual questions on the dynamics.
Also, according to the cited Nagylaki dynamics³ the population evolves almost as if it were in linkage equilibrium – which cannot hold for the asexual population considered by the authors.
There are also some minor issues in the paper: in Eq. 2.4 “s” should be replaced by a sigma, and there are also examples where plurals and singulars in the same sentence do not match.

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Partly
Are sufficient details of methods and analysis provided to allow replication by others?

Partly
If applicable, is the statistical analysis and its interpretation appropriate?

Not applicable
Are all the source data underlying the results available to ensure full reproducibility?

No source data required
Are the conclusions drawn adequately supported by the results?

Partly

References

1. Jiang P, Kreitman M, Reinitz J: The relationship between robustness and evolution. bioRxiv. 2018.
2. Watson RA, Wagner GP, Pavlicev M, Weinreich DM, et al.: The evolution of phenotypic correlations and "developmental memory". Evolution. 2014; 68 (4): 1124-38.
3. Nagylaki T: The evolution of multilocus systems under weak selection. Genetics. 1993; 134 (2): 627-47.

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Theoretical evolutionary biology

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

[1] Orr HA: The genetic theory of adaptation: a brief history. Nat Rev Genet. 2005; 6(2): 119–127.

[2] Orr HA, Coyne JA: The genetics of adaptation: a reassessment. Am Nat. 1992; 140(5): 725–742.

[3] Zeyl C: The number of mutations selected during adaptation in a laboratory population of Saccharomyces cerevisiae. Genetics. 2005; 169(4): 1825–1831.

[4] Mitchell A, Romano GH, Groisman B, et al.: Adaptive prediction of environmental changes by microorganisms. Nature. 2009; 460(7252): 220–224.

[5] Watson RA, Szathmáry E: How Can Evolution Learn? Trends Ecol Evol. 2016; 31(2): 147–157.

[6] Chastain E, Livnat A, Papadimitriou C, et al.: Algorithms, games, and evolution. Proc Natl Acad Sci U S A. 2014; 111(29): 10620–10623.

[7] Parter M, Kashtan N, Alon U: Facilitated variation: how evolution learns from past environments to generalize to new environments. PLoS Comput Biol. 2008; 4(11): e1000206.

[8] Valiant LG: Evolvability. Journal of the ACM. 2009; 56(1): 1–21.

[9] Livnat A, Papadimitriou C, Rubinstein A, et al.: Satisfiability and evolution. In Annu IEEE Symp Found Comput Sci FOCS. 2014; 524–530.

[10] Nagylaki T: The evolution of multilocus systems under weak selection. Genetics. 1993; 134(2): 627–647.

[11] Arora S, Hazan E, Kale S: The Multiplicative Weights Update Method: A Meta-Algorithm and Applications. Theory Comput. 2012; 8: 121–164.

[12] Paixão T, Badkobeh G, Barton N, et al.: Toward a unifying framework for evolutionary processes. J Theor Biol. 2015; 383: 28–43.

[13] Mühlenbein H, Schlierkamp-Voosen D: Predictive Models for the Breeder Genetic Algorithm I. Continuous Parameter Optimization. Evol Comput. 1993; 1(1): 25–49.

[14] Chatterjee K, Pavlogiannis A, Adlam B, et al.: The time scale of evolutionary innovation. PLoS Comput Biol. 2014; 10(9): e1003818.

[15] Heredia JP, Trubenová B, Sudholt D, et al.: Selection Limits to Adaptive Walks on Correlated Landscapes. Genetics. 2017; 205(2): 803–825.

[16] Doerr B, Doerr C, Kötzing T: Solving Problems with Unknown Solution Length at (Almost) No Extra Cost. In Proceedings of the 2015 on Genetic and Evolutionary Computation Conference GECCO - ’15. Madrid, Spain, ACM Press, 2015; 831–838.

[17] Rudolph G: Finite Markov Chain Results in Evolutionary Computation: A Tour d’Horizon. Fundamenta Informaticae. 1998; 35(1–4): 67–89.

[18] Wittkopp PJ, Carroll SB, Kopp A: Evolution in black and white: genetic control of pigment patterns in Drosophila. Trends Genet. 2003; 19(9): 495–504.

[19] Wang Z, Liao BY, Zhang J: Genomic patterns of pleiotropy and the evolution of complexity. Proc Natl Acad Sci U S A. 2010; 107(42): 18034–18039.

[20] Stern C: Selection for subthreshold differences and the origin of pseudoexogenous adaptations. Am Nat. 1958; 92(866): 313–316.

[21] Mjolsness E, Sharp DH, Reinitz J: A connectionist model of development. J Theor Biol. 1991; 152(4): 429–453.

[22] Arias M, le Poul Y, Chouteau M, et al.: Crossing fitness valleys: empirical estimation of a fitness landscape associated with polymorphic mimicry. Proc Biol Sci. 2016; 283(1829): pii: 20160391.

[23] Grigoriev D, Reinitz J, Vakulenko S, et al.: Punctuated evolution and robustness in morphogenesis. Biosystems. 2014; 123: 106–113.

[24] Franke J, Klözer A, de Visser JA, et al.: Evolutionary accessibility of mutational pathways. PLoS Comput Biol. 2011; 7(8): e1002134.

[25] Kingman JFC: A simple model for the balance between selection and mutation. J Appl Probab. 1978; 15(1): 1–12.

[26] Kauffman SA, Weinberger ED: The NK model of rugged fitness landscapes and its application to maturation of the immune response. J Theor Biol. 1989; 141(2): 211–245.

[27] Aita T, Iwakura M, Husimi Y: A cross-section of the fitness landscape of dihydrofolate reductase. Protein Eng. 2001; 14(9): 633–638.

[28] Leimar O, Tullberg B, James M: Mimicry, saltational evolution, and the crossing of fitness valleys. Chapter 16 in: E.I. Svensson and R. Calsbeek, eds. The Adaptive Landscape in Evolutionary Biology, 2012; 257–270.

[29] Hinton GE, Nowlan SJ: How learning can guide evolution. Complex Systems. 1987; 1: 495–502.

[30] Watson RA, Wagner GP, Pavlicev M, et al.: The evolution of phenotypic correlations and "developmental memory". Evolution. 2014; 68(4): 1124–1138.

[31] Jiang P, Kreitman M, Reinitz J: The relationship between robustness and evolution. bioRxiv. 2018; 268862.

[32] Kucharavy A, Rubinstein B, Zhu J, et al.: Robustness and evolvability of heterogeneous cell populations. Mol Biol Cell. 2018; 29(11): 1400–1409.

[33] Savageau MA: Demand theory of gene regulation. II. Quantitative application to the lactose and maltose operons of Escherichia coli. Genetics. 1998; 149(4): 1677–1691.

[34] Cook SA: The complexity of theorem-proving procedures. In Proc third Annu ACM Symp Theory Comput. – STOC ’71. New York, USA, ACM Press. 1971; 151–158.

[35] Levin LA: Universal enumeration problems (Russian). Probl Peredachi Inf. 1973; 9(3): 115–116.

[36] Friedgut E: Sharp thresholds of graph properties, and the k-sat problem. J Am Math Soc. 1999; 12(04): 1017–1055.

[37] Moore C, Mertens S: The Nature of Computation. Oxford University Press, 2011.

[38] Mertens S, Mézard M, Zecchina R: Threshold values of random k-sat from the cavity method. Random Struct Algor. 2006; 28(3): 340–373.

[39] Achlioptas D, Coja-Oghlan A: Algorithmic barriers from phase transitions. In Proceedings 49th FOCS. 2008; 793–802.

[40] Gravner J, Pitman D, Gavrilets S: Percolation on fitness landscapes: effects of correlation, phenotype, and incompatibilities. J Theor Biol. 2007; 248(4): 627–645.

[41] Wagner GP, Zhang J: The pleiotropic structure of the genotype-phenotype map: the evolvability of complex organisms. Nat Rev Genet. 2011; 12(3): 204–213.

[42] Rutherford SL, Lindquist S: Hsp90 as a capacitor for morphological evolution. Nature. 1998; 396(6709): 336–342.

[43] Masel J, Siegal ML: Robustness: mechanisms and consequences. Trends Genet. 2009; 25(9): 395–403.

[44] Fanti L, Piacentini L, Cappucci U, et al.: Canalization by Selection of de Novo Induced Mutations. Genetics. 2017; 206(4): 1995–2006.

[45] Klosin A, Casas E, Hidalgo-Carcedo C, et al.: Transgenerational transmission of environmental information in C. elegans. Science. 2017; 356(6335): 320–323.

[46] Goldschmidt R: The Material Basis of Evolution. Yale Univ. Press, New Haven CT, 1940.

[47] Barr KA, Martinez C, Moran JR, et al.: Synthetic enhancer design by in silico compensatory evolution reveals flexibility and constraint in cis-regulation. BMC Syst Biol. 2017; 11(1): 116.

[48] Shen J, Petkova MD, Liu F, et al.: Toward deciphering developmental patterning with deep neural network. bioRxiv. 2018.

Figure 1. This graph illustrates the difference in adaptation between evolution without evolution of gene regulation via thresholds (the red curve) and with that evolution according to (2.12) (the green curve).

Figure 2. This graph shows that the modular evolution of gene regulation allows adaptation with only a few genes; in this case N_g = 10 and the number of traits is M = 200.

Figure 3. Frequency distributions of the degree of gene pleiotropy for model (2.4) with the parameters N_g = 4000, β = 4, h = 0, N_b = 3000.

Figure 4. This graph illustrates Goldschmidt’s leaps.

Figure 5. This plot illustrates the main ideas of evolution based on the fitness landscape learning.
