The authors have declared that no competing interests exist.
¤ Current address: Department of Biostatistics, University of Florida, Gainesville, Florida, United States of America
A population’s spatial structure affects the rate of genetic change and the outcome of natural selection. These effects can be modeled mathematically using the Birth-death process on graphs. Individuals occupy the vertices of a weighted graph, and reproduce into neighboring vertices based on fitness. A key quantity is the probability that a mutant type will sweep to fixation, as a function of the mutant’s fitness. Graphs that increase the fixation probability of beneficial mutations, and decrease that of deleterious mutations, are said to amplify selection. However, fixation probabilities are difficult to compute for an arbitrary graph. Here we derive an expression for the fixation probability, of a weakly-selected mutation, in terms of the time for two lineages to coalesce. This expression enables weak-selection fixation probabilities to be computed, for an arbitrary weighted graph, in polynomial time. Applying this method, we explore the range of possible effects of graph structure on natural selection, genetic drift, and the balance between the two. Using exhaustive analysis of small graphs and a genetic search algorithm, we identify families of graphs with striking effects on fixation probability, and we analyze these families mathematically. Our work reveals the nuanced effects of graph structure on natural selection and neutral drift. In particular, we show how these notions depend critically on the process by which mutations arise.
When a new mutation appears in a population, it may ultimately spread to all individuals, or it may go extinct. Which outcome occurs depends on how the mutation affects the organism’s fitness (i.e., natural selection), but also on random chance. The spatial arrangement of organisms in the population can alter the balance between selection and random chance: amplifying one, suppressing the other. However, these effects can be difficult to predict or compute, even in simple, idealized mathematical models. We develop a method to efficiently calculate the effects of spatial structure on natural selection, in the case that mutations have only a weak effect on fitness. We use this method to comb through millions of distinct spatial structures, identifying those with the most extreme effects on natural selection. The question we study has applications to cancer research, with an individual’s cells considered as a population, of which cancer cells are an invading mutant type.
Evolution proceeds by the arrival and fixation of mutations. The fate of each new mutation depends on selection (how the mutation affects the organism’s fitness) as well as drift (random chance). The combined effects of selection and drift determine how a population evolves, with selection driving adaptation to the environment, and drift maintaining genetic variety.
Spatial population structure can alter the balance between these two forces [1–13]. Some spatial structures amplify selection, so that fitness plays a larger role in which mutations become fixed. Other structures suppress selection, reducing the role of fitness and increasing the role of random drift. Spatial structure can also change the accumulation rate of neutral mutations, which do not affect fitness [14]. The effects of spatial structure on selection have consequences for microbial evolution [15], cancer [16–20], aging [19, 20], and infectious disease [21].
These effects can be probed mathematically by modeling population structure as a graph. Each vertex is occupied by a single haploid individual. Reproduction occurs along the graph’s edges, according to a specified update rule (see below). The key quantity of interest is the fixation probability ρ(r), defined as the probability that a new mutation of fitness r will take over a population of wild-type fitness 1.
The update rules most commonly considered in this context are Birth-death (Bd) and death-Birth (dB). In this naming convention, the ordering indicates which event (birth or death) occurs first, while the capitalization indicates which event(s) are affected by fitness (birth, in this case). Most studies [1, 3–5, 8–11, 22–26] focus on Birth-death updating, in which an individual is first selected to reproduce, proportionally to its fitness. The offspring then replaces a neighbor chosen at random (independently of fitness). A minority of works [2, 6, 12, 13, 27–29] have considered death-Birth updating, in which an individual is first selected for death, uniformly at random. A neighbor is then chosen, proportionally to fitness, to produce an offspring, which fills the vacancy.
Determining fixation probability on an arbitrary graph is computationally intensive. Current methods [6, 9, 30, 31] require solving a system of linear equations whose size grows exponentially with the graph size. This is prohibitive except for graphs that are small [6, 7, 9–11, 31], highly symmetric [1, 3–5, 22–25], or have other special properties [32].
However, most nonlethal biological mutations are either neutral (r = 1) or weakly selected (r ≈ 1). Our previous work [12, 14, 33] has shown that, in these cases, fixation probabilities can be computed in polynomial time. For neutral mutations, the fixation probability determines the population’s “molecular clock”—the rate at which neutral genetic substitutions accrue over time. Allen et al. [14] showed that spatial structure can either accelerate or slow this molecular clock rate.
For weakly selected mutations, fixation probabilities can be computed by combining perturbative methods [33–37] with coalescing random walks [38–41]. Allen et al. [12] applied this method to dB updating, and showed how it allows for the efficient identification of amplifiers and suppressors of weak selection.
Here we apply these methods to Birth-death updating on arbitrary weighted graphs. We show that fixation probabilities under weak selection can be expressed in terms of coalescence times—the expected times for two independent random walks to meet. These coalescence times—and hence the fixation probability for weak selection—can be computed in polynomial time. While our methods apply to arbitrary placement of the initial mutant, we focus in particular on temperature initialization (mutations arise only in new offspring) and uniform initialization (mutations arise uniformly in all individuals).
Using these methods, we compute weak-selection fixation probabilities for all simple connected graphs up to size 10. For larger sizes, we employ a previously-developed genetic algorithm [10] to identify graphs with extreme effects on fixation probability. These investigations reveal a family of “Cartwheel” graphs, which strongly amplify selection under temperature initialization. They also show that a family of “Detour” graphs [10] can significantly decrease the ratio of beneficial to neutral mutations accruing over time. Our results highlight previously-unexamined subtleties in the notions of amplifier and suppressor, and in the way that spatial structure affects neutral and selective genetic change.
We represent spatial structure by a weighted graph G. The edge weight from vertex i to j is wij ≥ 0. The graph may be directed (wij not necessarily equal to wji) and may contain self-loops (wii may be positive). We require that the graph is strongly connected, meaning that there is a path of directed edges with nonzero weight from any vertex to any other; for undirected graphs (wij = wji), this reduces to the usual notion of connected.
We define the weighted (out-)degree of vertex i as wi = ∑j∈G wij. The probability that a random walker at vertex i will step to vertex j is pij = wij/wi. We also define the temperature of vertex i as Ti = ∑j∈G pji. We note that the total temperature is equal to the total population size: ∑i∈G Ti = ∑i,j∈G pji = N.
We consider a well-studied model of natural selection [1–11, 22–30]. There are two types of individuals: residents (or wild-types), which have fitness 1, and mutants, which have fitness r. The mutant is advantageous if r > 1, deleterious if r < 1, and neutral if r = 1.
In each state of the process, each vertex is occupied by a single individual, either mutant or resident. Selection proceeds according to the Birth-death (Bd) update rule. First, an individual i is selected at random, proportionally to its fitness. The chosen individual i produces an offspring; this offspring replaces another individual j, chosen with probability pij. Offspring inherit the type of the parent.
This process is a finite Markov chain with two absorbing states: one in which only residents are present, and one in which only mutants are present.
The key biological question is, if a new mutation arises in a single individual, how likely is this type to take over the resident population? The answer depends on where the initial mutant arises. In the general case, we can consider an arbitrary probability distribution {μi} over the vertices of G, such that the initial mutation arises at vertex i with probability μi.
Most previous works [1, 3, 4, 6, 7, 9, 10, 22–25, 30] consider only uniform initialization, meaning that the mutant is equally likely to appear at each vertex, μi = 1/N for all i. Uniform initialization corresponds to a biological assumption that heritable mutations arise primarily in adult individuals, with constant probability per unit time.
If we instead suppose that mutations occur primarily in new offspring, then mutations will be more likely to arise in sites that are replaced more often. This leads to temperature initialization [5, 8, 11, 14, 42], meaning that the initial mutant’s location i is chosen proportionally to the temperature Ti, μi = Ti/N.
Here we consider both initialization schemes, with a particular focus on temperature initialization because it leads to a rich interplay of selection and drift. More generally, the methods we present apply to any probability distribution {μi} of initial mutant locations.
We define the fixation probability ρG(r), for a mutant of fitness r on a graph G, as the probability that the state of all mutants is reached, from an initial state chosen according to the specified initialization scheme.
For the complete graph KN, representing a well-mixed population of size N, the fixation probability is [43]


Although calculating fixation probability is computationally intensive in general [6, 7, 30, 31], it simplifies when selection is weak; that is when mutations are nearly neutral (r ≈ 1). In this case, we may form the Taylor expansion



Classifying the effects of graph structure on weak selection.
(A) The fixation probability ρG(r), for a mutation of fitness r on a graph G, can be expanded under weak selection as
For isothermal graphs, Taylor expansion of Eq (2) gives

Results of Allen et al. [14] imply that, for uniform initialization, ρ∘ = 1/N for all graphs. Thus spatial structure does not affect the rate of neutral drift when mutations appear uniformly. In contrast, for temperature initialization, Result 3 of Allen et al. [14] implies that ρ∘ ≤ 1/N, with equality if and only if the graph is isothermal.
Our goal is to understand how spatial structure affects selection, neutral drift, and the relationship between the two. More specifically, we are interested whether a graph amplifies or suppresses selection, compared to the baseline of a well-mixed population. However, there are (at least) two distinct ways of making this comparison.
First, we can examine how the probability of fixation increases with fitness, as quantified by the first-order term, ρ′. It is intuitive that the rate of increase should be greater for amplifiers, and less for suppressors, relative to the well-mixed value of ρ′ = (N − 1)/(2N). This leads us to define a graph G as an absolute amplifier of weak selection if ρ′ > (N − 1)/(2N), and an absolute suppressor of weak selection if ρ′ < (N − 1)/(2N).
Second, to quantify the relationship between selection and neutral drift, we can compute the ratio of fixation probabilities for selected versus neutral mutations:

For a family of graphs of unbounded size (N → ∞), we can compare limN→∞ ρ′ to 1/2 to determine absolute amplifiers or suppressors, and limN→∞ ρ′/(Nρ∘) to 1/2 to determine relative amplifiers or suppressors, in the large-population limit.
For uniform initialization, there is no distinction between the relative and absolute definitions, since ρ∘ = 1/N for all graphs. But for temperature initialization, the two notions are distinct. Moreover, since ρ∘ ≤ 1/N for all graphs [14], we have ρ′/ρ∘ ≥ Nρ′. This rules out the possibility that a graph can be simultaneously an absolute amplifier and a relative suppressor, leaving three possible classifications for non-isothermal graphs: (i) absolute and relative amplifier; (ii) absolute suppressor and relative amplifier; and (iii) absolute and relative suppressor. We illustrate these cases in Fig 1B.
Here we present our analytical and numerical results. Derivations are given in S1 Text
To calculate the zeroth- and first-order coefficients, ρ∘ and ρ′ respectively, in the weak-selection expansion, Eq (3), we apply methods developed in previous works [14, 33, 45]. We address the zeroth-order (neutral drift) and first-order (weak selection) terms separately.
To obtain the neutral fixation probability, ρ∘, we must first derive the fixation probability πi of a neutral mutation arising at a particular vertex i. This probability πi can also be understood as the reproductive value of vertex i [45, 46]. For Birth-death updating, the reproductive values πi are the unique solution to the system of equations


In the undirected case (wij = wji), Eq (6) has an explicit solution in which reproductive value is inversely proportional to weighted degree:
The overall neutral fixation probability ρ∘, from an arbitrary probability distribution {μi} of initial mutant locations, is then given by


To obtain the first-selection coefficient, ρ′, we turn to a method developed by McAvoy and Allen [33]. Consider an arbitrary initialization, characterized by a probability distribution {μi} of initial mutant locations. Let τij be the expected time, from initialization to fixation, that vertices i and j have different types. We prove in S1 Text that these τij are uniquely determined by the recurrence relation


In the more general case of Eq (9), τij can be thought of as a rescaled coalescence time, with the time spent at each vertex k ∈ G scaled proportionally to μk. For uniform initialization (μi = 1/N) the recurrence becomes

For any initialization, we show in S1 Text that the weak-selection coefficient, ρ′, can be expressed in terms of the τij as

To explore the variety of possible effects of graph structure on fixation probabilities, we performed an exhaustive analysis of all connected simple graphs (unweighted, undirected, and with no self-loops) up to size 10, obtained from an online database [48]. For each graph, we calculated ρ′ and ρ∘ numerically, for both temperature and uniform initialization, by solving Eqs (10) or (11) and applying Eqs (8) and (12). We provide code to compute ρ′ and ρ∘ in Ref. [49].
Our results for temperature initialization are summarized in Table 1 and Fig 2. We find that the vast majority of small unweighted graphs (99.6%, for N = 10) are relative amplifiers but absolute suppressors of weak selection. After that, for 8 ≤ N ≤ 10, absolute and relative suppressors are the next most abundant, followed by absolute and relative amplifiers, and finally isothermal graphs. (Note that an unweighted, undirected graph is isothermal if and only if it is regular.) Overall, there is a positive relationship between ρ∘ and ρ′, meaning that the graphs that slow the neutral molecular clock are also likely to suppress the (absolute) effects of weak selection.


Exhaustive analysis of fixation probabilities under weak selection for small graphs.
(A) The values of ρ∘ and ρ′ are plotted for all 11,716,571 connected unweighted graphs of size 10. Colors correspond to the classification of graphs as shown in Fig 1B. (B) Scatter plot of ρ′ versus Nρ∘ for all graphs up to size 10. Note that Nρ∘ ≤ 1 for all graphs, with equality if only if the graph is isothermal (or regular, in the context of unweighted graphs).

| Size, N | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|---|
| Absolute and relative suppressors | 0 | 0 | 0 | 0 | 5 | 51 | 1,035 | 43,249 |
| Absolute suppressors, relative amplifiers | 1 | 4 | 19 | 106 | 838 | 11,006 | 259,776 | 11,671,038 |
| Absolute and relative amplifiers | 0 | 0 | 0 | 1 | 6 | 43 | 247 | 2,117 |
| Isothermal (regular) | 1 | 2 | 2 | 5 | 4 | 17 | 22 | 167 |
| Total | 2 | 6 | 21 | 112 | 853 | 11,117 | 261,080 | 11,716,571 |
We are particularly interested in the graphs that maximize or minimize ρ′ and ρ′/ρ∘ (Fig 3); these are the graphs with the most pronounced effects of spatial structure on natural selection. We find that the graphs that maximize ρ′ consist of several “islands” that are joined by single edges.


Small graphs with extreme effects for temperature initialization.
The graphs with the largest or smallest values of ρ′ (which characterizes the likelihood of selected mutations to become fixed), and ρ′/ρ∘ (which quantifies the balance of selection versus drift) are shown for sizes 7 to 10. The Star graph minimizes ρ′ but maximizes ρ′/ρ∘ for these sizes. The largest ρ′ values arise for graphs with multiple components joined by single edges, while the smallest ρ′/ρ∘ ratios occur for Detour graphs.
Interestingly, for all graphs up to size 10, the Star, Sn, minimizes ρ′ and also maximizes the ratio ρ′/ρ∘. This suggests that in a star-structured population (with Bd updating and constant probability of mutation per birth), both neutral and advantageous mutations accrue much more slowly—but the ratio of advantageous to neutral is much larger—than in a well-mixed population of the same size.
The graphs that minimize the ratio ρ′/ρ∘ belong to a particular family, termed “Detours” by Möller et al. [10]. Detours are formed by starting with a complete graph and replacing one of the edges with a path of length ≥2.
Our results for uniform initialization are presented in Table 2 and Fig 4. We find that the vast majority of small graphs (94%, in the case N = 10) are amplifiers of weak selection. This is consistent with previous numerical analyses [6, 9–11] which found that most small unweighted graphs are amplifiers for Bd with uniform initialization. In particular, the results for N = 6 and N = 7 agree with those from an exhaustive analysis performed by Cuesta et al. [9], who computed fixation probabilities for arbitrary mutant fitness r.


Small graphs with extreme effects for uniform initialization.
The graphs with the largest or smallest values of ρ′ are shown for graphs of size 7 to 10. (Only ρ′ is shown because, for uniform initialization, ρ∘ = 1/N for every graph.) Star graphs have the largest ρ′, while the smallest ρ′ values appear for graphs with a highly connected component linked to a tail.

| Size, N | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|---|
| Suppressors | 0 | 0 | 1 | 7 | 55 | 671 | 14,890 | 659,784 |
| Amplifiers | 1 | 4 | 18 | 100 | 794 | 10,429 | 246,168 | 11,056,620 |
| Isothermal (regular) | 1 | 2 | 2 | 5 | 4 | 17 | 22 | 167 |
| Total | 2 | 6 | 21 | 112 | 853 | 11,117 | 261,080 | 11,716,571 |
The Star emerged as the strongest amplifier of weak selection, again consistent with previous results [10, 11, 50]. Stronger unweighted amplifiers of selection have been found for much larger populations [25], but not for population sizes N ≤ 100. Meanwhile, the strongest suppressors of weak selection have a well-connected portion joined by one or two edges to a tail—a structure we explore further below.
For graphs of size greater than 10, an exhaustive analysis is no longer feasible. To identify larger graphs that have extreme effects on fixation probability, we employed a genetic algorithm previously developed by Möller et al. [10]. The idea is to begin with a random ensemble of graphs, and select a subset with the largest or smallest values of a quantity of interest. These graphs are then “mated” with each other to produce an ensemble of “offspring” graphs, and the process is repeated. A formal description of the genetic algorithm and the parameters used is provided in S1 Text.
Based on our analysis of small graphs, we chose three targets to explore using the genetic algorithm: (i) maximal ρ′ for temperature initialization, (ii) minimal ρ′/ρ∘ for temperature initialization, and (iii) minimal ρ′ for uniform initialization. The other possible targets (minimal ρ′ and maximal ρ′/ρ∘ for temperature initialization, maximal ρ′ for uniform initialization) were optimized by the Star graph for all N ≤ 10, and so are presumably less interesting to explore.
The results from the genetic algorithm extend and illuminate the patterns that were seen in the exhaustive analysis of smaller graphs. For temperature initialization, searching for large ρ′ (strong absolute amplifiers; Fig 5) leads to the emergence of a densely connected “hub”, joined by single edges to a number of “islands” consisting of two or more vertices each. Detour graphs, on the other hand, continue to appear when searching for small ρ′/ρ∘ (strong relative suppressors; Fig 6). We analyze both of these structures in-depth in later sections.


Discovering absolute amplifiers for temperature initialization.
We used a genetic algorithm to discover graphs with large weak-selection effect ρ′. The resulting graphs (middle column) have a central “hub” joined by single links to outlying “islands”. To formalize this structure, we introduce a family of “Cartwheel” graphs CWn,m,h, consisting of a hub of size h and n islands of m vertices each (rightmost column). We find that the optimal Cartwheel graph has ρ′ exceeding that found by the genetic algorithm, except for N = 12 for which the same graph was identified by both methods. All graphs found by both methods are absolute amplifiers of weak selection, meaning ρ′ > (N − 1)/(2N).


Discovering relative suppressors for temperature initialization.
When seeking to minimize the ratio ρ′/ρ∘, the genetic algorithm, in all cases, found Detour graphs [10], consisting of a complete graph with one edge replaced by a path. For comparison, we calculated ρ′/ρ∘ for all Detour graphs of the given sizes. The results were identical to those of the genetic algorithm except in the case N = 14, for which the genetic algorithm found a Detour graph with a non-optimal number of ring vertices.
For uniform initialization, searching for small ρ′ (strong suppressors; Fig 7) led to graphs with a well-connected part and a tail. The tails are longer than those found in the exhaustive search for N ≤ 10 (Fig 4). In some cases (N = 11, 12, 13, and also N = 7 in the exhaustive analysis) the tail connects at a single vertex to a clique, forming what is known as a Lollipop graph (also called a “standard kite” by Möller et al. [10]). Random walks on Lollipop graphs are known to have maximal hitting times [51], cover times [52], and commute times [53], so it is unsurprising that they have extreme effects on ρ′, which Eq (12) expresses in terms of the closely-related notion of coalescence times. However, in other cases (N = 14, 15, and N = 8, 9, 10 in the exhaustive search), the tail connects to two of the well-connected vertices, which are not connected to each other, forming what might be termed a “Balloon graph”. Additionally, other exploratory analyses (not shown) revealed some Balloon-like graphs for which the tail ends in a star; we call this a “Balloon-star”. We numerically computed the minimal ρ′ for all three of these families (Lollipop, Balloon, and Balloon-Star); the results are compared to the genetic algorithm results in Fig 7.


Discovering suppressors for uniform initialization.
When seeking to minimize the ratio ρ′/ρ∘, the genetic algorithm produced structures consisting of a well-connected part and a tail. We compared these to Lollipop graphs (known for their random walk properties [51–53]), and two new families, which we call Balloons and Balloon-Stars. The minimal ρ′ from these families improved on the genetic algorithm results for N = 12, 13, 14 (albeit by less than 10−4 for N = 14), and matched the genetic algorithm results for N = 11 and N = 15.
Here we analyze particular families of graphs that are found to have interesting properties. Derivations and proofs are given in S1 Text.
The Star, Sn (Fig 8), consists of a single hub connected to each of n ≥ 2 leaves by an edge of weight 1. The Star was one of the first-identified amplifiers of selection for uniform initialization [1, 22]. For graphs of size N ≤ 10, our numerical investigation found Stars to be extremal in three different ways: the best absolute suppressors and relative amplifiers for temperature initialization, and the best amplifiers for uniform initialization.


Star.
Fixation probability, ρ(r) is plotted against mutant fitness r, for the Star and the complete graph. Dotted lines show the linear approximation ρ(1+ δ) ≈ ρ∘ + δρ′, accurate for weak selection (δ ≪ 1). (A) For temperature initialization, the Star is an absolute suppressor of weak selection, ρ′ < (N − 1)/(2N), but a relative amplifier, ρ′/ρ∘ > (N − 1)/2. (B) For uniform initialization, the Star is an amplifier of weak selection, ρ′ > (N − 1)/(2N). Note that ρ(1) = ρ∘ = 1/N for both graphs, as is true for any graph under uniform initialization.
For temperature initialization, solving Eqs (8), (10), and (12) yields


For uniform initialization, ρ∘ = 1/N (as for any graph), and Eqs (11), and (12) yield

The complete bipartite graph,
In S1 Text we use our method to derive


Both the exhaustive search and the genetic algorithm identified strong absolute amplifiers (large ρ′), for temperature initialization, consisting of a highly intraconnected “hub” joined by single edges to a number of “islands”. To formalize this pattern, we define a family of “Cartwheel” graphs CWn,m,h (Fig 9A), consisting of an h-vertex hub and n islands of m vertices each. The hub and each island are cliques [55], meaning that within each subpopulation, each vertex is connected to each other. Each island is connected, by a single edge, to a distinct hub vertex. We generalized to a weighted graph by setting the hub-to-island edge weight to be a free parameter ϵ > 0; all other edge weights are 1. We consider only temperature initialization here, since this was the context in which Cartwheel graphs arose.


Cartwheel.
(A) The Cartwheel graph CWn,m,h contains n islands of m vertices each and h ≥ n hub vertices. Each island is connected to a distinct hub vertex by an edge of weight ϵ; vertices within the hub and within each island are joined by edges of weight 1. (B) The special case n = h and m = 2 has a “spider” structure; this graph has the largest ρ′ in the ϵ → 0 limit. (C) Plot of ρ∘ vs ρ′ for various Cartwheel graphs of size N = 30, with temperature initialization. Points are shown for each ϵ = 2k, where k varies from −5 to 9 in increments of 0.2. Larger points correspond to ϵ = 1 and the ϵ → 0 limit, as derived in Eq (19). Note that limϵ→0 ρ∘ = 1/N for all Cartwheel graphs. CW10,2,10 (the “spider” case) has by far the largest ρ′ in the ϵ → 0 limit; it also has the largest ρ/ρ∘, for all ϵ, among the graphs displayed. However, CW6,3,12 has the largest ρ′ for ϵ = 1 among Cartwheels of size 30. CW4,6,6 and CW2,10,10 both have h = m and therefore have the same fixation probability as a well-mixed population in the ϵ → 0 limit, according to Eq (18). CW3,8,6 has h < m and is therefore a suppressor of weak selection in the ϵ → 0 limit.
We have obtained closed-form expressions for ρ∘ and ρ′ for the Cartwheel using Mathematica. The formula for ρ∘ is given in S1 Text, while the formula for ρ′ is too lengthy to print. The behavior of ρ∘ and ρ′ as ϵ varies is illustrated in Fig 9C. As shown in Fig 5, there are unweighted Cartwheel graphs whose ρ′ values exceed those found by the genetic algorithm, suggesting that this family contains very strong absolute absolute amplifiers of weak selection. Table 3 shows the unweighted (ϵ = 1) Cartwheel graphs with the largest values of ρ′ for fixed sizes N. These ρ′ values eventually surpass 1, which is an upper bound for ρ′ values on the Star. We have not determined whether the ρ′ values for the unweighted Cartwheel are bounded, or whether they diverge to infinity as N increases.

| Size, N | n | m | h | ρ′ |
|---|---|---|---|---|
| 9 | 3 | 2 | 3 | 0.4622 |
| 10 | 2 | 3 | 4 | 0.4744 |
| 11 | 2 | 3 | 5 | 0.4835 |
| 12 | 4 | 2 | 4 | 0.4937 |
| 13 | 3 | 3 | 4 | 0.5030 |
| 14 | 3 | 3 | 5 | 0.5171 |
| 15 | 3 | 3 | 6 | 0.5273 |
| 20 | 4 | 3 | 8 | 0.5732 |
| 40 | 7 | 3 | 19 | 0.7082 |
| 60 | 10 | 3 | 30 | 0.7999 |
| 80 | 9 | 4 | 44 | 0.8827 |
| 100 | 11 | 4 | 56 | 0.9569 |
| 120 | 13 | 4 | 68 | 1.0183 |
| 140 | 12 | 5 | 80 | 1.0775 |
For arbitrary mutant fitness r, we derive (in S1 Text) the following closed-form expression for fixation probability in the ϵ → 0 limit:



Nonweak selection on the Cartwheel graph.
In the limit as the hub-to-island weight ϵ goes to zero, the fixation probability for arbitrary mutant fitness r is expressed in closed form by Eq (17). Temperature and uniform initialization are equivalent in this limit. Dashed lines show the weak-selection approximation, with slope ρ′ given by Eq (19). (A) Fixation probability is plotted against r for two Cartwheels of size 12. CW4,2,4 has h > m and is therefore an amplifier of weak selection. CW2,5,2 has h < m and is therefore a suppressor of weak selection (but appears to amplify selection for r > 2.4219). Cartwheels with h = m have the same fixation probability as the well-mixed population, for all r, in the ϵ → 0 limit. (B) In the limits ϵ → 0 and n = h → ∞, with m = 2, the fixation probability jumps discontinuously from 0 to 1/3 as r crosses 1; the expression for r > 1 is given in Eq (20). For comparison we also show the upper bound ρ(r) ≤ 1 − (r + 1)−1, derived by Pavlogiannis et al. [8], which applies to all weighted graphs with no self-loops under temperature initialization.
If the hub is the same size as the islands, h = m, then Eq (17) reduces to

Performing a Taylor expansion of Eq (17) around r = 1, we obtain ρ∘ = 1/N and

The Cartwheel most strongly amplifies selection in the case ϵ → 0, m = 2, and h = n ≫ 1. This results in a “spider” structure, in which a large number of fully interconnected hub vertices are each joined (by an edge of vanishingly small weight) to a two-vertex “leg”. In this case, the fixation probability ρ(r) becomes (in the ϵ → 0 limit) discontinuous as a function of r: ρ(r) = 0 for disadvantageous mutations (r < 0), but for advantageous mutations (r > 0) we have

We also show in S1 Text that the Cartwheel CWn,2,n, with n ≥ 7 and sufficiently small ϵ, is a stronger relative amplifier of weak selection than a Star of the same size.
In the exhaustive analysis and genetic algorithm, Detour graphs [10] (Fig 11A) emerged as the strongest relative suppressors of weak selection (minimal ρ′/ρ∘) under temperature initialization. The Detour graph Dc,d is constructed by starting with a complete graph of size c, and replacing one edge with a path containing d vertices. The total size is N = c + d.


Detour.
(A) The Detour graph Dc,d begins with a complete graph of size c, and replaces the edge between two vertices by a path with d interior vertices. (B) Values of ρ∘ and ρ′ are plotted as d varies, for N = 100. The Detour Dc,d is a relative suppressor of weak selection except for d = 1. The minimal ratio ρ′/ρ∘ is achieved for d = 15 (marked with a star). (C) The minimal ρ′/(Nρ∘) ratio, and the value of d achieving this ratio, is plotted for each N. Note that the minimizing d grows sub-linearly with N.
We have numerically determined the minimal ρ′/ρ∘ ratio for all sizes up to 100 (Fig 11B and 11C). As N increases beyond 8, the minimal ρ′/ρ∘ decreases, making these graphs increasingly strong relative suppressors. We also observe that the minimizing number of detour vertices d increases sublinearly with N.
According to our small graph analysis in Table 1, there are no absolute amplifiers of weak selection of size ≤5, and only one of size 6. This graph of size 6 has a bowtie shape (Fig 12A) and is the minimal absolute amplifier of weak selection.


Minimal absolute amplifier.
(A) This bowtie-shaped graph is the smallest absolute amplifier of weak selection under temperature initialization. (B) The difference in fixation probabilities between this graph and the complete graph K6 is plotted against mutant fitness, r. The dashed line shows the linear approximation to this difference at r = 1, computed using our weak-selection methods. Although this difference is increasing at r = 1 (because the graph is an absolute amplifier of weak selection), the difference is negative for all values of r. Thus this graph does not amplify selection in the usual sense of
Due to its small size and symmetry, the fixation probability for this graph can be computed in closed form for arbitrary mutant fitness r. Employing an algorithm developed by Cuesta et al. [7, 9], adapted for temperature initialization, we obtained the fixation probability ρG(r) as a ratio of 17th-degree polynomials (shown in S1 Text).
We find (Fig 12B) that, despite being an absolute amplifier of weak selection, this graph has fixation probability less than that of the complete graph K6 for all r > 0. This is possible because, although this graph increases the first-order term ρ′, relative to K6, it decreases the neutral term ρ∘, leading to a smaller overall fixation probability.
Finally, we exhibit a graph family that displays all possible classifications of behavior as a particular edge weight is varied. The Fan, Fn,m [12] (or Windmill [56]), consists of a hub vertex attached to n ≥ 2 blades (Fig 13A). Each blade is a clique of m ≥ 2 vertices joined by edges of weight 1. The hub is connected to each blade vertex by an edge of weight ϵ. The total population size is N = mn + 1.


Fan.
(A) The Fan Fn,m (or Windmill [56]) consists of one hub and n blades containing m vertices each. We consider a weighted version, with edge weights as shown. Pictured here is the case n = m = 3. (B,C) The neutral fixation probability ρ∘ and weak selection coefficient ρ′, plotted as ϵ varies from 0 to infinity, for all Fan graphs of sizes N = 13 and N = 101. As ϵ increases, the behavior changes from absolute and relative suppressor, to absolute and relative amplifier, to relative amplifier but absolute suppressor. For ϵ → ∞, the ρ∘ and ρ′ values approach those of the Star graph Snm (marked by a red star). (D) As n → ∞, there are three regimes of behavior, depending whether ϵ is held constant (blue line) or scales as n−1 (purple line), or as n−2 (green line). The maximal ρ′ for fixed m is ρ′ = (m + 1)/(2m), achieved for n → ∞ with
For the Fan Fn,m with temperature initialization, we find a neutral fixation probability of



We find that, as ϵ increases, Fn,m displays all three classifications of behavior. First, for 0 < ϵ < (m − 1)/(mn − 1), the Fan is both an absolute and relative suppressor. At ϵ = (m − 1)/(mn − 1), the graph is isothermal, and therefore has the same fixation probability as the well-mixed population. The graph is both an absolute and relative amplifier for (m − 1)/(mn − 1) < ϵ < ϵ*, where ϵ* is a particular cubic root for which ρ′ = (N − 1)/(2N). Then, for ϵ > ϵ*, the graph is a relative amplifier but absolute suppressor. As ϵ → ∞, the values of ρ∘ and ρ′ approach those of the star Snm, as given by Eq (13).
In the limit of many islands, n → ∞, the curve formed by Nρ∘ and ρ′ approaches a triangle (Fig 13D), the sides of which correspond to three scalings of ϵ with n: constant, inverse (ϵ = kn−1), and inverse square (ϵ = kn−2). The maximum value of ρ′ is achieved when n → ∞ with
For uniform initialization, ρ∘ = 1/N and ρ′ = num/denom, where


The problem of identifying spatial structures that amplify or suppress selection has been actively investigated for the past decade and a half [1–13]. Our work introduces new methods for studying this problem in the weak selection regime. These methods allow for the effects of spatial structure on fixation probability (for r ≈ 1) to be computed in polynomial time, enabling analytical and computational investigations that would previously have been infeasible. However, this weak-selection approach does not tell us the behavior of ρ(r) away from r = 1, which may include complex phenomena such as multiple transitions between amplification and suppression [4, 9, 57]. We also have not explored the question of fixation time [10, 11, 26], which can significantly affect the overall rate of evolution [58].
Our work underscores the critical role that initialization schemes play in the determining how spatial structure affects selection [5, 11]. Different initialization schemes arise from different assumptions on how mutations occur. Uniform initialization, which is taken as the default in most previous works, corresponds to an assumption that mutation strikes all individuals equally regardless of age. Temperature initialization arises from instead considering a constant probability of mutation in each new offspring [14].
With temperature initialization comes new considerations, since the neutral fixation probability ρ∘ is affected in addition to the weak-selection coefficient ρ′. This leads to a distinction between relative versus absolute amplification and suppression. The absolute notions refer to the rate of increase fixation probability with respect to mutant fitness, while the relative notions refer to the expected ratio of beneficial to neutral mutations that fix over time. Of these, the relative notions are arguably more empirically relevant since they are directly related to observable genetic change.
A caveat with these definitions is that, for temperature initialization, absolute amplifiers of weak selection do not necessarily have larger fixation probability than the well-mixed population for any particular value of r. The minimal absolute amplifier (Fig 12) shows why: under temperature initialization, the neutral fixation probability ρ∘ is typically reduced from the well-mixed value, which may outweigh any increase in ρ′. On the other hand, it is certainly possible for a graph to have larger fixation probability than a well-mixed population under temperature initialization; the Cartwheel family provides examples of this.
Other initialization schemes aside from temperature and uniform may be considered. Since mutations can strike both existing organisms as well as new offspring, the relevant scheme for a given population may be a blend of uniform and temperature. Alternatively, for unicellular populations, mutations may occur in the parent cell upon reproduction; this would lead to initialization schemes that depend on the birth rate as well as the death rate. Fortunately, our method can be applied to any initialization scheme, using Eq (9) for coalescence times.
We have found that, for temperature initialization, the vast majority of small (unweighted, undirected) graphs are relative amplifiers but absolute suppressors of weak selection. This means that, for a spatially structured population satisfying the relevant modeling assumptions, one would typically see a greater ratio of beneficial-to-neutral mutations, but a slower overall rate of genetic change, compared to a well-mixed population. For uniform initialization, we find that the vast majority of graphs are amplifiers of weak selection, which accords with findings from other works using different methods [6, 10, 11].
Our findings reveal new families of graphs with interesting effects on selection, and shed new light on known families. In particular, our work has identified Cartwheel graphs (Fig 9) as strong absolute amplifiers for temperature initialization—with the “spider” case, n = h and m = 2, having especially interesting properties. In one limit, the fixation probability jumps discontinuously, from zero for all deleterious mutations, to more than one-third for all beneficial mutations (Fig 10). Stronger amplifiers for temperature initialization were found by Pavlogiannis et al. [8], but only with self-loops, meaning that new offspring can displace their own parents. Among graphs with no self-loops, Cartwheels appear to be the strongest family of amplifiers for temperature initialization that have been discovered thus far.
Detour graphs, in contrast, are shown to strongly suppress the ρ′/ρ∘ ratio for temperature initialization. This family of graphs was previously identified by Möller et al. [10] for their extreme properties under uniform initialization. In that context, Möller et al. found that Detours are powerful suppressors of selection for fitness values of r close to 1, but transition into amplifiers as r increases beyond a threshold value r* > 1. Overall, Detours appear to be a particularly interesting example for evolutionary graph theory. We have not come up with closed-form solutions for ρ∘ or ρ′ for Detours; doing so appears difficult although perhaps not impossible.
Our analysis of the Fan, meanwhile, highlights the determinative role that edge weight can play. By increasing the weight of a single type of edge, one can change the graph from a relative suppressor, to an absolute amplifier, to an absolute suppressor but relative amplifier. Interestingly, both ρ∘ and ρ′ are maximized for intermediate values of this edge weight. Attention to edge weight will therefore be crucial in connecting this theory to applications.
Although our general results apply to arbitrary (strongly connected) weighted digraphs, we focused our numerical exploration and examples on undirected graphs without self-loops. Extending this analysis to directed graphs would allow for many more possible structures, and may reveal new phenomena with regard to the amplification or suppression of selection.
Our work and other recent contributions [5, 6, 9, 11–13, 27, 28] make clear that whether a population structure amplifies for suppresses selection depends not only on the graph, but also on the update rule, the initialization scheme, and the regime of fitness values being considered, and whether absolute or relative genetic change is of interest. This combinatorial explosion of modeling choices may seem daunting, but it also suggests an increased opportunity for applications of the theory. Different modeling choices will be relevant to animal, plant, and microbial populations, to somatic tissue [18–20], and to infectious diseases [15, 21]. Given recent advances in theory [8–13, 26, 33], the toolkit now exists to apply evolutionary graph theory in a wide range of biological settings.
We are grateful to Alex McAvoy for insightful conversations, and to Álvaro Lozano Rojo for adapting his code to temperature initialization and helping with computation for the “Bowtie” graph.
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58