Drift as constitutive: conclusions from a formal reconstruction of population genetics

Roffé, Ariel Jonathan

doi:10.1007/s40656-019-0294-6

Published: 20 November 2019

Volume 41, article number 55, (2019)

History and Philosophy of the Life Sciences

Ariel Jonathan Roffé ORCID: orcid.org/0000-0002-0051-2028¹

Abstract

This article elaborates on McShea and Brandon’s idea that drift is unlike the rest of the evolutionary factors because it is constitutive rather than imposed on the evolutionary process. I show that the way they spelled out this idea renders it inadequate and is the reason why it received some (good) objections. I propose a different way in which their point could be understood, that rests on two general distinctions. The first is a distinction between the underlying mathematical apparatus used to formulate a theory and a concept proposed by that theory. With the aid of a formal reconstruction of a population genetic model, I show that drift belongs to the first category. That is, that drift is constitutive of population genetics in the same sense that multiplication is constitutive in classical mechanics, or that circle is constitutive in Ptolemaic astronomy. The second distinction is between eliminating a concept from a theory and setting its value to zero. I will show that even though drift can be set to zero just like the rest of the evolutionary factors (as others have noted in their criticism of McShea and Brandon), eliminating drift is much harder than eliminating those other factors, since it would require changing the entire mathematical apparatus of standard population genetic theory. I conclude by drawing some other implications from the proposed formal reconstruction.

Notes

There is quite a large literature discussing the adequacy of the analogy in various different aspects (see Sect. 5 for some examples). In this article I focus on only one of those aspects, which is presented in what follows.
In their 2010 book, and in Brandon and McShea (2012, p. 739), they also treat mutation as a constitutive constraint. I will have more to say about mutation and its status in Sect. 4.
For example, as I show below, because considering it allows us to get a better understanding of how certain models within PG are put together, and how they explain.
For more on their definition of constitutive and imposed constraints, see Brandon and McShea (2012).
Again, this is for clarity. The axioms could be established in an entirely formal manner.
Alternative presentations, for example, calculate the probability of every possible type of mating in the population, together with the sampling probabilities for the descendants of each of those types. Generational transitions are then modeled with recourse to more than two sampling processes. There are other formal reconstructions that focus on this kind of presentation; see for example Lloyd (1994) and Lorenzano (2014).
Following usual talk, I call the type of a gene an allele-type (for example, when one says that “gene g₁ is of allele-type A, while gene g₂ is of allele-type a”), and the type of a pair of genes (of an individual) a genotype, even though "genotype" would have been a better-fitting name for the type of a gene. The term "genotype" is also used ambiguously in the literature to refer to a particular pair of genes (not to its type); here, I use it exclusively for the type of a pair of genes, never for the particular pair itself.
I also assume, for simplicity, that fitness coefficients remain constant over the generations, though this could be easily modified in a more complex version of the reconstruction.
In a sampling process without replacement, individuals that have already been chosen from the population to form part of the sample cannot be chosen again. In the typical examples, where one samples marbles from an urn, marbles that have already been sampled are not put back inside the urn. In contrast, in a sampling process with replacement, marbles are put back into the urn after being sampled and can be chosen again. Obviously, when one speaks of biological sampling processes there is no one choosing the individuals for the samples; for instance, in parental sampling, the sample consists simply of the individuals who survived.
Probability assignments for sampling processes with and without replacement are different; however, it can be mathematically proven that, when the samples are large, the probabilities of the latter approximate those of the former (and are equal in the limit); see Feller (1971, Chapters II, VI).
Notice that, for this second process, it is not the case that I_i+1 ⊆ I_i* (the "sample" is not a subset of the original population), because that would violate Axiom 1. In fact (because of Axiom 1) I_i+1 and I_i* do not share any elements. However, the genotype distribution of I_i+1 depends on that found in I_i* in exactly the same way as if it were a sample (a subset) properly speaking (see the constraints listed below).
Fitness coefficients do not play a role here because, as said before, I am only considering selection by viability. In a more complete version, fitnesses should appear.
I assume that migration plays a role especially in the first sampling process, and mutation in the second.
Logicians (and to a lesser extent, mathematicians) usually do specify the language in which their theories are built, while empirical scientists typically do not do this. However, formal reconstructions of empirical theories, done usually by philosophers, do make the language explicit. For example, for a formal reconstruction of CM that makes explicit all the terms used in its language, see Balzer et al. (1987).
Of course, there is also the issue of whether the theory itself remains the same if one of its concepts is eliminated. I will not go into this problem here.
Moreover, the concept of drift was originally introduced into PG to account for certain empirical phenomena that could not be explained purely by selection (e.g. Gulick 1872, discusses the puzzling geographical distribution of the genera within a family of land snails in the Hawaiian islands, which are phenotypically very distinct, but live in environments that are very similar to one another; see also Hagedoorn and Hagedoorn 1921; Brooks 1899, for other antecedents). None of this would make much sense if drift was just a purely mathematical phenomenon.
It might be thought that my conclusions regarding drift as part of the background mathematical vocabulary lend some credibility to the statisticalist position, defended chiefly by Walsh, Ariew, Lewens and Matthen (I thank an anonymous reviewer for this suggestion). I have some reservations about this, but for reasons of space I cannot explore this issue here, since going into it would require introducing that debate more fully. Instead, I leave it open as a suggestion.

References

Balzer, W., Moulines, C. U., & Sneed, J. D. (1987). An architectonic for science: The structuralist program. Dordrecht: Reidel.
Baravalle, L., & Vecchi, D. (forthcoming). Drift as a force of evolution: A manipulationist account. In Life and evolution. Berlin: Springer.
Beatty, J. (1984). Chance and natural selection. Philosophy of Science, 51(2), 183–211.
Brandon, R. (2005). The difference between selection and drift: A reply to Millstein. Biology and Philosophy, 20(1), 153–170.
Brandon, R. (2006). The principle of drift: Biology’s first law. The Journal of Philosophy, 103(7), 319–335.
Brandon, R. N., & McShea, D. W. (2012). Four solutions for four puzzles. Biology and Philosophy, 27(5), 737–744.
Brooks, W. K. (1899). The foundations of zoology. Oxford: Macmillan Co.
Carnap, R. (1950). Logical foundations of probability. Chicago: University of Chicago Press.
Crow, J. F., & Kimura, M. (1970). An introduction to population genetics theory. New York: Burgess Pub. Co.
Earnshaw, E. (2015). Evolutionary forces and the Hardy–Weinberg equilibrium. Biology and Philosophy, 30(3), 423–437.
Feller, W. (1971). An introduction to probability theory and its applications. New York: Wiley.
Gillespie, J. H. (2004). Population genetics: A concise guide. New York: JHU Press.
Gulick, J. T. (1872). On the variation of species as related to their geographical distribution, illustrated by the achatinellinæ. Nature, 6, 222–224.
Hagedoorn, A. L., & Hagedoorn, A. C. (1921). The relative value of the processes causing evolution. The Hague: Nijhoff.
Hartl, D. L., & Clark, A. G. (2007). Principles of population genetics. New York: Sinauer Associates.
Hitchcock, C., & Velasco, J. D. (2014). Evolutionary and Newtonian forces. Ergo, an Open Access Journal of Philosophy, 1(2), 39–77.
Lewens, T. (2010). The natures of selection. The British Journal for the Philosophy of Science, 61(2), 313–333.
Lloyd, E. A. (1994). The structure and confirmation of evolutionary theory. Princeton: Princeton University Press.
Lorenzano, P. (2014). What is the status of the Hardy–Weinberg law within population genetics? In M. Galavotti, E. Nemeth, & F. Stadler (Eds.), European philosophy of science—Philosophy of science in Europe and the Viennese heritage (Vol. 17, pp. 159–172). Berlin: Springer.
Luque, V. J. (2016a). The principle of stasis: Why drift is not a zero-cause law. Studies in History and Philosophy of Science Part C: Studies in History and Philosophy of Biological and Biomedical Sciences, 57, 71–79.
Luque, V. J. (2016b). Drift and evolutionary forces. THEORIA. An International Journal for Theory, History and Foundations of Science, 31(3), 397.
Matthen, M., & Ariew, A. (2002). Two ways of thinking about fitness and natural selection. Journal of Philosophy, 99(2), 55–83.
McShea, D., & Brandon, R. (2010). Biology’s first law. Chicago: The University of Chicago Press.
McShea, D. W., Wang, S. C., & Brandon, R. N. (2019). A quantitative formulation of biology’s first law. Evolution, 73(6), 1101–1115.
Millstein, R. L. (2002). Are random drift and natural selection conceptually distinct? Biology and Philosophy, 17(1), 33–53.
Millstein, R. L., Skipper, R. A., & Dietrich, M. R. (2009). (Mis)Interpreting mathematical models: Drift as a physical process. Philosophy, Theory, and Practice in Biology, 31(4), 459–482.
Otsuka, J. (2016). A critical review of the statisticalist debate. Biology and Philosophy, 31(4), 459–482.
Pence, C. H. (2017). Is genetic drift a force? Synthese, 193(6), 1967–1988.
Reisman, K., & Forber, P. (2005). Manipulation and the causes of evolution. Philosophy of Science, 72(5), 1113–1123.
Roffé, A. J. (2017). Genetic drift as a directional factor: Biasing effects and a priori predictions. Biology and Philosophy, 32(4), 535–558.
Shapiro, L. A., & Sober, E. (2007). Epiphenomenalism: The do’s and the don’ts. In G. Wolters & P. K. Machamer (Eds.), Thinking about causes: From Greek philosophy to modern physics (pp. 235–264). Pittsburgh: University of Pittsburgh Press.
Sober, E. (1984). The nature of selection: Evolutionary theory in philosophical focus. Chicago: University of Chicago Press.
Stephens, C. (2004). Selection, drift, and the “forces” of evolution. Philosophy of Science, 71, 550–570.
Stephens, C. (2010). Forces and causes in evolutionary theory. Philosophy of Science, 77(5), 716–727.
Wakeley, J. (2005). The limits of theoretical population genetics. Genetics, 169(1), 1–7.
Walsh, D. M., Ariew, A., & Lewens, T. (2002). The trials of life: Natural selection and random drift. Philosophy of Science, 69(3), 452–473.
Williams, M. B. (1970). Deducing the consequences of evolution: A mathematical model. Journal of Theoretical Biology, 29(3), 343–385.

Acknowledgements

This work has been funded by the research projects SAI 827-223/19 and PUNQ 1401/15 (National University of Quilmes, Argentina), UNTREF 32/15 255 (Universidad Tres de Febrero, Argentina) and UBACyT 20020170200106BA (Universidad de Buenos Aires, Argentina).

Author information

Authors and Affiliations

Centro de Estudios de Filosofía e Historia de la Ciencia (CEFHIC-UNQ-CONICET), Universidad de Buenos Aires (UBA), Universidad Tres de Febrero (UNTREF), Roque Sáenz Peña 352, B1876BXD, Bernal, Buenos Aires, Argentina
Ariel Jonathan Roffé

Corresponding author

Correspondence to Ariel Jonathan Roffé.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: Proofs of theorems

Theorem A1

Let R be the relation $\left\{ {\left\langle {x,\left\{ {x,y} \right\}} \right\rangle /x \in G_{i} \,\& \,\left\{ {x,y} \right\} \in I_{i} } \right\}$. Note that R is a function, since Axiom 3 guarantees that R satisfies the existence and uniqueness requirements for functions (each gene has one and only one corresponding individual to which it belongs). Thus, R can be seen as a function from G_i to I_i. Additionally, Axiom 2 implies that each gene is present only once inside each individual (this is the g_j ≠ g_k part). All of this tells us that, for each gene, there exists one and only one individual to which it belongs, and which also contains a different gene. In that way, every individual from generation i is the value assigned by R to two different genes (arguments) from that generation. Therefore, if |G_i| is the number of genes in generation i, R partitions its domain G_i into |G_i|/2 cells, one for each value it takes. Notice also that R is surjective (every element of the codomain, in this case I_i, is the value of some argument), since, by Axiom 3, no gene can be “loose”. Therefore, the number of cells coincides with the number of individuals. Hence |I_i| = |G_i|/2, which immediately gives us the desired result.
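
The counting argument can be checked on a toy population. The following is a minimal Python sketch, not part of the original reconstruction: it assumes that genes are represented by integers and individuals by two-element frozensets, and the helper names `owner_map` and `check_theorem_a1` are hypothetical.

```python
def owner_map(individuals):
    """Build the relation R: gene -> the individual that contains it.

    Raises ValueError if an individual is not a pair of distinct genes
    (the g_j != g_k condition) or if a gene sits in more than one
    individual (the uniqueness condition).
    """
    R = {}
    for ind in individuals:
        if len(ind) != 2:
            raise ValueError(f"{set(ind)} is not a pair of distinct genes")
        for gene in ind:
            if gene in R:
                raise ValueError(f"gene {gene} belongs to two individuals")
            R[gene] = ind
    return R


def check_theorem_a1(genes, individuals):
    """Return True when |G_i| = 2 * |I_i| for a population satisfying the conditions."""
    R = owner_map(individuals)
    # Existence: R must be defined on exactly the genes of the generation.
    if set(R) != set(genes):
        raise ValueError("R is not a total function on the given gene set")
    return len(genes) == 2 * len(individuals)


genes = {1, 2, 3, 4, 5, 6}
individuals = {frozenset({1, 2}), frozenset({3, 4}), frozenset({5, 6})}
print(check_theorem_a1(genes, individuals))  # True
```

Any input that passes the two checks returns `True`, which is what the theorem asserts; inputs that violate the conditions are rejected rather than counted.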

Theorem A2

For simplicity’s sake, I only prove this for finite populations, and for allele type A (the proof for a is almost identical). By applying the definitions of FreqAT and FreqGT, what needs to be proved is that:

$$\frac{\left| \left\{ g_{k} \in G_{i} / f_{1}(g_{k}) = A \right\} \right|}{\left| G_{i} \right|} = \frac{\left| \left\{ i_{k} \in I_{i} / f_{2}(i_{k}) = \{A,A\} \right\} \right| + \tfrac{1}{2}\left| \left\{ i_{k} \in I_{i} / f_{2}(i_{k}) = \{A,a\} \right\} \right|}{\left| I_{i} \right|}.$$

It will be convenient to call G_i(x) the set $\left\{ g_{k} \in G_{i} / f_{1}(g_{k}) = x \right\}$ (i.e. the set of genes of type x present in generation i), and I_i({x, y}) the set $\left\{ i_{k} \in I_{i} / f_{2}(i_{k}) = \{x,y\} \right\}$ (the set of individuals of genotype {x, y} in generation i). With this terminology, what needs to be proved is that:

$$\frac{\left| G_{i}(A) \right|}{\left| G_{i} \right|} = \frac{\left| I_{i}(\{A,A\}) \right| + \tfrac{1}{2}\left| I_{i}(\{A,a\}) \right|}{\left| I_{i} \right|}$$

Theorem A1 establishes that |G_i| = 2 × |I_i|. Thus, for the previous equality to hold, what needs to happen is that:

$$\begin{aligned} \left| G_{i}(A) \right| & = 2\left( \left| I_{i}(\{A,A\}) \right| + \tfrac{1}{2}\left| I_{i}(\{A,a\}) \right| \right) \\ & = 2\left| I_{i}(\{A,A\}) \right| + \left| I_{i}(\{A,a\}) \right| \end{aligned}$$

To prove this, I define two new sets, called G_i(A, {A, A}) and G_i(A, {A, a}). These sets will represent the set of A genes present in an {A, A} kind of individual, and in an {A, a} kind of individual. Formally,

$$G_{i} \left( {x, \, \{ x,y\} } \right) = \, \{ g_{k} \in G_{i} /f_{1} (g_{k} ) = x{\text{ and }}\exists g_{j} \in G_{i} {\text{ such that }}f_{1} (g_{j} ) = y\quad \& \quad \{ g_{k} ,g_{j} \} \in I_{i} \}$$

Axioms 3 and 4 imply that $G_{i} (A,\{ A,A\} ) \cap G_{i} (A,\{ A,a\} ) = \varnothing$ and that $G_{i} (A,\{ A,A\} ) \cup G_{i} (A,\{ A,a\} ) = G_{i} (A)$ (the first follows from the fact that no gene is inside more than one individual, while the second from the fact that every gene is inside at least one individual, along with another gene—which must be either of the same or different type). Both of these facts imply that $\left| {G_{i} (A,\{ A,A\} )} \right| + \left| {G_{i} (A,\{ A,a\} )} \right| = \left| {G_{i} (A)} \right|$.

The following two facts, along with the equation just derived, directly give us the desired result.

1. $2\left| I_{i}(\{A,A\}) \right| = \left| G_{i}(A,\{A,A\}) \right|$
This follows from the fact that the members of I_i({A, A}) are pairs of different members of G_i(A,{A, A}), since every member of G_i(A,{A, A}) belongs to an AA individual, along with another G_i(A,{A, A}) member.
2. $\left| I_{i}(\{A,a\}) \right| = \left| G_{i}(A,\{A,a\}) \right|$
This follows from the fact that every member of G_i(A,{A, a}) belongs to a (different) Aa individual (i.e. a member of I_i({A, a})). This individual also contains a gene that belongs not to G_i(A,{A, a}) but to G_i(a,{A, a}). Thus, a bijection can be established between G_i(A,{A, a}) and I_i({A, a}), which means that they have the same number of elements.
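
The identity can be illustrated by computing both sides on a small population. This is a sketch under stated assumptions rather than the reconstruction's own machinery: genes are integers, individuals are two-element frozensets, `f1` is a plain dictionary from genes to allele-types, and the function names are hypothetical.

```python
from fractions import Fraction


def freq_at(individuals, f1, allele):
    """Allele-type frequency counted directly over the genes (left-hand side)."""
    genes = [g for ind in individuals for g in ind]
    return Fraction(sum(1 for g in genes if f1[g] == allele), len(genes))


def freq_gt(individuals, f1, genotype):
    """Genotype frequency over individuals; `genotype` is a sorted tuple of allele-types."""
    count = sum(
        1 for ind in individuals
        if tuple(sorted(f1[g] for g in ind)) == genotype
    )
    return Fraction(count, len(individuals))


def freq_at_from_genotypes(individuals, f1):
    """Right-hand side for allele-type A: freq(AA) + 1/2 * freq(Aa)."""
    return (freq_gt(individuals, f1, ("A", "A"))
            + Fraction(1, 2) * freq_gt(individuals, f1, ("A", "a")))


# Toy generation: one AA individual, two Aa individuals, one aa individual.
f1 = {1: "A", 2: "A", 3: "A", 4: "a", 5: "a", 6: "a", 7: "A", 8: "a"}
individuals = [frozenset({1, 2}), frozenset({3, 4}),
               frozenset({5, 6}), frozenset({7, 8})]

print(freq_at(individuals, f1, "A"))            # 1/2
print(freq_at_from_genotypes(individuals, f1))  # 1/2
```

Exact fractions are used so that the two sides can be compared with `==` without floating-point tolerance.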

Theorem A3

This theorem follows from the following two facts. First, that if:

$$\frac{n!}{{x!y!\left( {n - x - y} \right)!}}p^{x} \times q^{y} \times r^{n - x - y}$$

is a multinomial distribution (with p, q, r being the frequencies of three kinds of objects in the population), then the expected value of the frequencies in the sample is exactly p: q: r (i.e. that all kinds of objects maintain their original frequency). Note that the probability assignment for the first sampling process is multinomially distributed.

The second fact is the probabilistic “Law of large numbers”. This law states (with terminology modified to fit the one I am using) that:

$$\lim_{n \to \infty} P\left( \left| \hat{p} - E_{\hat{p}}(Sampl(x)) \right| > \epsilon \right) = 0$$

where n is the size of a sample in a probabilistic sampling process Sampl(x), $\hat{p}$ is the frequency of one kind of object in the sample, $E_{{\hat{p}}} \left( {Sampl\left( x \right)} \right)$ is the expected value of $\hat{p}$ in Sampl(x), and ϵ is an arbitrary positive number (thus, possibly an arbitrarily small one). In other words, as the sample size goes to infinity, the probability that the frequency of one kind of object deviates from its expected value goes to zero (for more on this law, see Feller 1971, chapter X). If this is applied to the first sampling process, the result is that actual outcomes should equal their expected outcomes. These two facts together directly imply the desired result.
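
Both facts can be checked numerically for small samples. The sketch below is an illustration under assumptions of my own, not the proof used in the article: it enumerates the multinomial distribution exactly, confirms that the expected sample frequency of the first kind of object is p, and shows that its variance is p(1 − p)/n. Linking a vanishing variance to vanishing deviations via Chebyshev's inequality is one standard route to the law of large numbers; the function names are hypothetical.

```python
from fractions import Fraction
from math import factorial


def multinomial_pmf(n, x, y, p, q, r):
    """Probability of drawing x, y and n - x - y objects of the three kinds."""
    coeff = factorial(n) // (factorial(x) * factorial(y) * factorial(n - x - y))
    return coeff * p**x * q**y * r**(n - x - y)


def frequency_mean_and_variance(n, p, q, r):
    """Exact mean and variance of the sample frequency x/n of the first kind."""
    mean = Fraction(0)
    second_moment = Fraction(0)
    for x in range(n + 1):
        for y in range(n - x + 1):
            prob = multinomial_pmf(n, x, y, p, q, r)
            freq = Fraction(x, n)
            mean += prob * freq
            second_moment += prob * freq**2
    return mean, second_moment - mean**2


p, q, r = Fraction(1, 2), Fraction(1, 3), Fraction(1, 6)
for n in (4, 16):
    mean, var = frequency_mean_and_variance(n, p, q, r)
    print(n, mean, var)  # mean stays at 1/2; variance is 1/16, then 1/64
```

Since the variance shrinks as 1/n, Chebyshev's inequality bounds the probability of a deviation larger than ϵ by p(1 − p)/(nϵ²), which goes to zero as n grows.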

Theorem A4

The proof of this result uses the same two facts as the proof from above. Since |I_i*| is infinite, by Theorem A3, we get that for every genotype {x, y}:

$$FreqGT\left( {I_{i} *, \, \{ x,y\} } \right) \, = \frac{{w_{x,y} \times FreqGT(I_{i} ,\{ x,y\} )}}{{\bar{w}(i)}}$$

Now, since we assume that all fitnesses are equal, then there exists a number m, such that m = w_A,A = w_A,a = w_a,a. Thus (by definition of $\bar{w}(i)$):

$$\begin{aligned} FreqGT\left( {I_{i} *, \, \{ x,y\} } \right) \, & = \frac{{m \times FreqGT(I_{i} ,\{ x,y\} )}}{{m \times FreqGT(I_{i} ,\{ A,A\} ) + m \times FreqGT(I_{i} ,\{ A,a\} ) + m \times FreqGT(I_{i} ,\{ a,a\} )}} \\ & = \frac{{m \times FreqGT(I_{i} ,\{ x,y\} )}}{{m \times \left( {FreqGT(I_{i} ,\{ A,A\} ) + FreqGT(I_{i} ,\{ A,a\} ) + FreqGT(I_{i} ,\{ a,a\} )} \right)}} \\ \end{aligned}$$

Since the m’s cancel out in the numerator and denominator, and what is left in the denominator (the sum of all the genotype frequencies) equals 1, all of this equals FreqGT(I_i, {x, y}). Thus, after the first sampling process, the frequencies of the genotypes remain identical. Additionally, Theorem A2 implies that allele-type frequencies will also be identical to the originals (since they can be calculated from genotype frequencies, which are themselves identical to the originals). Thus, for every allele-type x, FreqAT($\cup I_{i} *$, x) = FreqAT(G_i, x) (the union set of a set of individuals equals the set of genes present in those individuals).

By using the same two facts as in the demonstration of Theorem A3, and the probability distribution of the second sampling process, we get the desired result.
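
The cancellation of m can be seen in a few lines of code. This is a minimal sketch covering only the first (viability selection) step in the infinite-population limit, where frequencies equal their expected values; the dictionary keys and the function name `select` are illustrative assumptions.

```python
from fractions import Fraction


def select(freqs, fitness):
    """Expected genotype frequencies after selection: w * freq / mean fitness."""
    w_bar = sum(fitness[g] * freqs[g] for g in freqs)
    return {g: fitness[g] * freqs[g] / w_bar for g in freqs}


freqs = {"AA": Fraction(1, 4), "Aa": Fraction(1, 2), "aa": Fraction(1, 4)}

# Equal fitnesses (every w equals m = 3/5): the m's cancel, nothing changes.
m = Fraction(3, 5)
equal = {g: m for g in freqs}
print(select(freqs, equal) == freqs)  # True

# Unequal fitnesses, for contrast: frequencies shift away from aa.
unequal = {"AA": Fraction(1), "Aa": Fraction(1), "aa": Fraction(1, 2)}
print(select(freqs, unequal))  # AA: 2/7, Aa: 4/7, aa: 1/7
```

The contrast case is included only to show that the function is not trivially the identity; the theorem itself concerns the equal-fitness case.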

About this article

Cite this article

Roffé, A.J. Drift as constitutive: conclusions from a formal reconstruction of population genetics. HPLS 41, 55 (2019). https://doi.org/10.1007/s40656-019-0294-6

Received: 04 May 2019
Accepted: 12 November 2019
Published: 20 November 2019
DOI: https://doi.org/10.1007/s40656-019-0294-6
