john hawks weblog

paleoanthropology, genetics and evolution

Iceland

  • mtDNA selection in Iceland?

    Wed, 2009-01-21 17:11 -- John Hawks

    Leave it to me to have readers unwilling to ignore selection in recent populations! Here's an e-mail:

    Why couldn't the Icelandic genetic changes have been the result of selection that favored some mtDNA lineages rather than others? We know the population of Iceland derived from settlers that were transplanted into a relatively alien climate and ecology, and had to adjust agriculture and subsistence activity to survive there. We know that there were dramatic environmental insults to the population: disease, starvation, eruptions. At least some of these insults would have likely been more severe than the ancestral populations would have encountered, whether they were Scandinavian or "Celtic".

    So why isn't there at least a token mention of selection, either by you or by the authors? Is "genetic drift" that much more likely than selection? Is selection a more academically risky proposition than the comforting mathematics of "drift"?

    This refers to my post earlier this week about the loss of mtDNA haplotypes from medieval Icelanders. Iceland has around half the number of unique haplotypes that other European samples have, despite similar sampling intensity. I went through some quick simulations of genetic drift in a relatively small, medieval Iceland-sized population to show that the loss of these alleles by drift was quite likely. I also pointed out that very large changes in frequency of common alleles were not a likely outcome of drift, even in the small population. This went along with another post, ("Could genetic drift really break your heart?"), which claimed that a relatively new heart-disease risk mutation was not likely to have become common by genetic drift. In that case, I thought that some selection was probably necessary -- both because the South Asian population has been very large for a long time, and because the mutation has a deleterious effect that would impede it from drifting upward.

    But showing that genetic drift could explain the Iceland results is not the same as rejecting the hypothesis of selection.

    In other ancient European samples, it is pretty clear that selection must explain differences in mtDNA haplotype frequencies over time. Neolithic Germans were dominated by a haplotype that is vanishingly rare in Europe today ("Early European mtDNA: only mysterious if you want it to be"). Similar but less extreme results are also true of medieval British and medieval Danes. All over Europe, some mtDNA haplotypes have been proliferating in the last few thousand years, and others have been declining. That is in accordance with other evidence that mtDNA has been selected in recent populations (Mitochondrial DNA adaptations in living human populations"), as well as the associations of various mtDNA haplotypes with human chronic diseases.

    Humans are not exceptional here, either -- for example medieval skeletons of Scandinavian dogs show mtDNA selection has happened to them, too.

    In the case of Iceland, we have a lot of mtDNA haplotypes that have been lost. It is possible that purifying selection took out some number of these. Probably more likely, positive selection may have increased the frequency of one or more common haplotypes, inevitably reducing the collective frequency of the others. Under that hypothesis, the loss of any particular mtDNA haplotype is a random event, but the fact that an unexpectedly large number were lost would be attributable to selection on common haplotypes.

    We could test this hypothesis by considering the frequencies of the more common haplotypes. That would also allow us to examine whether the same mtDNA haplotypes have been favored in other European populations.

    In the meantime, the loss of mtDNA haplotypes in Iceland does appear to be consistent with drift, given what we know about Iceland historical demography. That's the most important part: Genetic drift is not a null hypothesis, it involves many assumptions about demography. When we know information about the demography, we shouldn't pretend that we don't.

    One final point: Is selection an "academically risky proposition"?

    There's no question that I'm the leading proponent of the idea that natural selection has been widespread, powerful and important in human evolution. I didn't exactly elbow my way into this position. Very few people were willing to work with the idea that natural selection materially affected human genetic variation. This has given me some success, partly because I've been able to test easy and obvious hypotheses that R. A. Fisher might have thought up.

    I find all of this to be very puzzling. Why is it, for example, that Rush Limbaugh and the Drudge Report could pick up on the importance of natural selection, but so many human geneticists can't?

    Whatever the reasons, I've been talking about mtDNA selection for ten years. I reviewed the issue here in 2005 ("Selection, nuclear genetic variation, and mtDNA"):

    Within the past 30,000 to 1 million years, human populations have changed radically in longevity (Caspari and Lee 2004), brain size (Lee and Wolpoff 2003; Ruff et al. 1997), diet, and energetics (Leonard and Robertson 1997; Sorensen and Leonard 2001). Human mtDNA variants have been found to be associated with chronic diseases of aging , brain disorders (Zhu et al. 2004), performance in athletes (Niemi and Majamaa 2005), and longevity itself (Niemi et al. 2005). The present pattern of variation also appears to be correlated with climate (Ruiz-Pesini et al. 2004), and may affect the dietary energetics and insulin metabolism (Lowell and Schulman 2005).

    Simply put, variation in mtDNA is a strong target for further research into the effects of aging, metabolism, and disorders of the brain for a reason: it impacts all these areas strongly.

    I've revisited the issue many times in the last four years. At one point, it got to the stage where I wondered if I was writing about mtDNA selection too much. I wrote:

    I'm sure some readers are beginning to think this is mtDNA Selection Central. Believe it or not, I've gotten a lot of requests to cover this topic, which of course is one of the central issues in the Neandertal problem as well as the unraveling of human origins.

    And it's an exciting developing story: it shows how medical genetics is steamrolling the human genetics of the past thirty years. Finding mutations that actually do things has great medical interest, and the search is accelerating. This work is being undertaken by people who have no investment in the idea that variation among humans should be completely neutral.

    After all, what's more important: that a neutral mtDNA lets us trace human migrations, or that understanding mtDNA selection helps us find treatments for Alzheimer's disease?

    Has it hurt my career? Well, I have to admit that some of my colleagues think I'm a total wackaloon. Ten years ago, when I started talking about mtDNA selection, I heard snickers from high-ranking scientists when I ascended the podium. Among other things, I fought peer reviews that said that positive selection in humans must be "rare, if it happens at all."

    All I can say is, if the idea of natural selection has damaged my career, I hope for much more damage like this in the future!

  • Genetic drift eliminated rare mtDNA haplotypes from Iceland

    Tue, 2009-01-20 12:33 -- John Hawks

    How powerful has genetic drift been in recent human evolution? That's the question I raised the other day with reference to the claim that a heart disease risk-inducing allele had become common by drift in India during the last 30,000 years.

    Another paper released earlier this week in PLoS Genetics claims that mtDNA haplotypes have been recently lost from the Icelandic population by strong genetic drift. The evidence for such changes in haplotypes comes from sequencing the mtDNA of thousand-year-old skeletons unearthed in Iceland during the last 150 years. These ancient remains have haplotypes that are found elsewhere in Europe today, but not in Iceland. The conclusion is that the modern-day descendants of these early Iceland settlers have experienced genetic drift within the last 1000 years, relieving them of of a load of rare mtDNA haplotypes.

    Could genetic drift have accomplished this loss of haplotypes? Although the paper does not present any analysis of this question, a quick consideration of some theory will show that genetic drift could easily have caused the observed results. It also shows a contrast between this case and others where genetic drift has been described as "strong". Even in this case, on an island with a limited human population, genetic drift is only "strong" in the sense of eliminating alleles that are already quite rare in the population.

    Today, Iceland is a large population of more than 300,000 individuals. As in many countries, the current size is a result of twentieth-century population growth. From Wikipedia:

    The population of the island is believed to have varied from 40,000 to 60,000 in the period from initial settlement until the mid-19th century. During that time, cold winters, ashfall from volcanic eruptions, and bubonic plagues adversely affected the population several times. The first census was carried out in 1703 and revealed that the population was then 50,358. After the destructive volcanic eruptions of the Laki volcano during 1783–1784 the population reached a low of about 40,000. Improving living conditions have triggered a rapid increase in population since the mid-19th century - from about 60,000 in 1850 to 320,000 in 2008.

    Forty thousand is a large population, in evolutionary terms. But not all these 40,000 people would have counted toward the variance in reproduction of gene lineages (See my discussion of the Wright-Fisher model). If we took a census of the medieval Iceland population, we would find that a large fraction of individuals, maybe half, were children. A rather small fraction would be postreproductive adults. So at most half the population, and probably less, were of reproductive age at any given time. At a first approximation, this population of 40,000 individuals would reproduce like a Wright-Fisher population of less than 20,000. Around half that number will be females -- although probably a bit more than half, since reproductive variance is usually higher among men. So less than 10,000 mating females per generation. Other factors, such as within-family fitness correlations, may limit the effective number even further. I'll assume a value of 5000 effective women, which is certainly an underestimate for the last 300 years, but may not be too much of an underestimate for earlier times.

    We should keep in the back of our minds that the population did not grow instantaneously to this value -- it would have taken a couple hundred years to reach its ultimate medieval size. So genetic drift might have been substantially stronger when these skeletons were being buried than it would have been 300 or 400 years later.

    Helgason et al. (2009) applied an unusual test to demonstrate the genetic difference between early and current Icelanders. They don't consider the frequencies of haplotypes. Nor do they use a measure that would ultimately be influenced by frequencies, like FST. Instead, they consider the number of shared haplotypes between the two samples as a measure of similarity.

    The statistical question is whether the modern and ancient samples share significantly fewer haplotypes than expected if the ancient and modern samples had been randomly drawn from a single population. The authors tested this hypothesis with a randomization test: they repeatedly drew random samples of the same sizes as the ancient and modern samples, from a distribution including both samples. It's a clever technique.

    It doesn't yield an easy answer to the question of how strong genetic drift needs to be to explain the data. But we can address this question by considering the data and nature of the comparison.

    Here, we're talking about the presence or absence of haplotypes that each have a very low frequency. For example, the modern Scandinavian sample in the paper includes 337 distinct haplotypes. The modern Icelandic sample, with a slightly larger same sample size (947 versus 898 individuals), has many fewer haplotypes -- only 172. That's a substantial deficit in the Icelandic sample.

    Now, let's consider the ancient sample, with 73 individuals and 58 haplotypes. Obviously, nearly all the haplotypes were present in only a single sampled individual. This implies that many haplotypes existed in that population that were not sampled in these 73 individuals. It also implies that the average frequency of a unique mtDNA haplotype in that population was less than 1.4 percent (less, in other words, than 1/73).

    We don't know that these alleles were lost in Iceland; what we know is that they weren't sampled today. So a real answer to this question would include some considerations of sampling theory.

    We can do a quick calculation to figure out the chances that an allele of 1.5 percent would be eliminated by drift in less than 1000 years (around 40 generations). Here's a small piece of Mathematica code that begins with an allele at 0.015 frequency, a population of 5000 effective individuals, runs it under drift for 40 generations:

    drift := Module[{popSize, a, result, count, b},
    count = 0;
    For[i = 1, i popSize = 5000;
    a = N[RandomInteger[BinomialDistribution[popSize, 0.015]]];
    For[j = 1, j a = N[RandomInteger[BinomialDistribution[popSize, a/popSize]]]];
    If[a == 0, count++]];
    count]

    The variable count returns the number of trials out of 10,000 that the allele disappeared. For an initial frequency of 1.5 percent, roughly 3 percent of trials (308 out of 10000) end in loss of the allele. That's not a very high proportion of loss by drift.

    Still, we're not interested in loss from the population; we're interested in why haplotypes might not show up in the present-day sample. That might happen for two reasons:

    1. The alleles weren't so common to begin with -- many haplotypes were present in the early population that weren't in the skeletal sample. We can use the simulation to see what happens to rarer haplotypes. For example, if the initial frequency was only half a percent (0.005), then they are lost from the population after 40 generations nearly a third of the time (3142 out of 10000).

    2. The alleles are still in the Icelandic population, but rare enough they didn't get sampled. A sample of 900 people will miss roughly half of haplotypes with frequencies of 0.005 or less. Altering the simulation again, we find that drift achieves this result in roughly 5 percent (526 out of 10000) of cases, beginning at an initial frequency of 0.015.

    So altogether, the observed result is not at all unlikely. Genetic drift in Iceland had the power to eliminate rare alleles, even after the population was founded and grew to its medieval size.

    Still, this "strength" of genetic drift is manifested in its ability to eliminate initially rare alleles. If we ask what effect it would have on a common allele, we see a somewhat different dynamic. For mtDNA, the variance of allele frequency after one generation of drift is p(1-p)/(N), where N is the female effective population size (for autosomal genes with two copies, this expression would include 2N instead of N). After 40 generations, we expect a variance of 40p(1-p)/(N). What that means is that in 95 percent of cases, an allele that begins at 20 percent frequency will be between 13 and 27 percent after 40 generations.

    In other words, even in the small population of medieval Iceland, genetic drift is not strong enough to cause large increases or losses of common mtDNA haplotypes. And the effect of drift is four times greater for mtDNA than for autosomal genes.

    As a result of the loss of rare haplotypes, the medieval Iceland sample shares more alleles with Eastern Europe, Russia, and Spain and Portugal than with today's Icelanders. But if we look at a measure like FST, which is dominated by common alleles, the differences between these populations are very slight.

    I've gone through the example to point out that some kinds of observations are explained well by drift, and others aren't. Genetic drift has been an important cause of some evolutionary changes, even in recent human populations. But in any given case it is a hypothesis. We test this hypothesis by applying our knowledge of demographic history, mathematical models, and other genes. The hypothesis of drift might well be useful to explain one kind of observation (loss of rare alleles), while it is useless to explain others (elevation of one rare allele to a high frequency).

    References:

    Helgason A, Nicholson G, Stefánsson K, Donnelly P. 2003. A reassessment of genetic diversity in Icelanders: Strong evidence from multiple loci for relative homozygosity caused by genetic drift. Ann Hum Genet 67:281-297. doi:10.1046/j.1469-1809.2003.00046.x

    Helgason A, Lalueza-Fox C, Ghosh S, Sigurðardóttir S, Sampietro ML, Gigli E, Baker A, Bertranpetit J, Arnadottir L, Þorsteinsdotter U, Stefánsson K. 2009. Sequences From First Settlers Reveal Rapid Evolution in Icelandic mtDNA Pool. PLoS Genet 5(1): e1000343. doi:10.1371/journal.pgen.1000343

Subscribe to Iceland

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.