john hawks weblog

paleoanthropology, genetics and evolution

gene flow

  • Denisovan DNA in the islands, and an Australian genome

    Thu, 2011-09-22 18:09 -- John Hawks

    David Reich and colleagues today report on the persistence of Denisova-like ancestry in island Southeast Asia and Australia (citation not yet available). Meanwhile, Morten Rasmussen and colleagues (citation not yet available) report on the whole-genome sequencing of hair from an Aboriginal Australian who lived some 100 years ago.

    The most obvious story: These data utterly destroy the hypothesis of a single out-of-Africa colonization of Southeast Asia by modern humans. Many human geneticists have argued our present pattern of diversity originated in a wave of successive founder effects coming from a single recent African origin. They were wrong.

    Instead, we can turn to a complex model with successive dispersals and episodes of population mixture. This is not a static model of isolation-by-distance; it is a dynamic model in which populations grow and spread across large spans of the Old World, again and again and again. By my count, at least three massive episodes of population dispersal and mixture are necessary in Reich and colleagues' model. A picture of their admixture hypothesis:

    Denisova admixture model from Reich et al. 2011

    This model depicts (a) an early divergence of an African (represented by Yoruba) and Asian/Australasian populations. These mix with first Neandertals and then (for the Australian/New Guinea/Mamanwa populations) with Denisova-like people. Later (b), after the initial habitation of the Philippines by the ancestors of Mamanwa, a population like Andamanese Onge pushes into the islands, mixing with the ancestors of New Guinea and Australian populations. Later still (c), a population ancestral to today's Chinese people mixes with Philippines and other Southeast Asian people.

    As complicated as it looks, even this model must be a vast oversimplification. I don't like or attribute much belief to mixture models like this, as they assume too much about relative population sizes and the timing of mixture. Many recent hunting and gathering populations of Southeast Asia are not included in the current samples, and the Chinese sample is itself the result of very recent demographic events, covering what once may have been a wider diversity of peoples. Depicting Australian and New Guinean populations as monolithic is an artifact of the small sample; these places themselves housed a tremendous diversity of peoples. Nevertheless, the true model won't be simpler than this one; it will involve many more events that the data cannot yet resolve.

    Hints of that complexity emerge from the Aboriginal Australian whole genome. Rasmussen and colleagues show that this individual shares some ancestry with East Asian peoples, but on the whole populations in Europe and East Asia are much more genetically similar to each other than to this genome. The picture from the whole genome is essentially the same as that drawn by the SNP comparisons by Reich and colleagues, but with the potential (in the long run) to actually trace the histories of individual genes. And I think the gene-by-gene account of history will be important, because we already have some evidence that a few Denisovan genes do persist in mainland Asia, even though most are gone.

    To explain why, we can look at the proportion of Denisovan ancestry in different populations as depicted in a map by Reich and colleagues. The pie charts are confusing here, because they report the fraction of ancestry from Denisovans in each population relative to the 5% estimate for New Guinea. So Australians also have 5% in this figure, Timorese have around 2.5%, and Bougainville has more than 4%.

    Notice the apparent lack of Denisovan ancestry in anyone who lives anywhere that was once connected by land with mainland Asia. I say "apparent" deliberately: Abi-Rached and colleagues reported last month on the widespread distribution of Denisovan HLA types among today's Asian populations, and those may well be products of Denisovan genes that were later selected. I've already identified a handful of other loci that seem to reflect Denisovan ancestry in mainland Asian people. According to the comparisons by Reich and colleagues, such loci must be exceptions.

    At the same time, the mixture model presents an important idea: Once there were people in Southeast Asia who had much more Denisovan ancestry than any populations still remaining today. Both Australian/New Guinea populations and Philippine populations like the Mamanwa have subsequently mixed with new immigrants who lacked any sign of Denisovan ancestry. Prior to this later mixture, the ancestors of those populations must have been more Denisovan -- Reich and colleagues estimate 7%. This is the first evidence that ancestry from archaic people of Eurasia was diluted to a lower value by later population movements. If the population mixture originally happened somewhere in mainland Asia, any traces of Denisovan ancestry in those areas has been diluted almost to nonexistence. But the persistence of some genes would be predicted if natural selection were maintaining them in the face of demographic pressure from elsewhere.

    About the Australian genome, there will be much more interesting analyses to come, I expect. As whole-genome data come to represent more of the variation within human populations, we get a larger store of information about how we came to be variable. Variation traces not only to population movements and demography, but also to natural selection. Australia's population history has been very different from many populations of the Old World, and this genome should give us new perspective on the effects of that demographic history.

    Synopsis: 
    The hypothesis of a single out-of-Africa dispersal is rejected by new data about Denisovan mixture and whole-genome sequencing of an Aboriginal Australian.
  • Migration thinking

    Fri, 2010-08-20 08:30 -- John Hawks

    Murray Cox and Michael Hammer have a short commentary piece in the current BMC Biology, titled, "A question of scale: Human migrations writ large and small" [1]. They review a few recent papers concerning human migration and intermixture -- including the Neandertal genome draft [2], the paper by Chuanxiang Li and colleagues showing Bronze Age admixture in the Tarim Basin [3], and their own work quantifying historical gene flow inside and outside Africa [4].

    It's a short review, but I thought their conclusion serves some thought -- they discuss some of the theoretical complexity of estimating ancient rates of gene flow. The simple model assumes constant rates, but human populations aren't simple.

    We expand on just one of these points for illustration (Figure 3). Even when gene flow is inferred explicitly, existing methods invariably assume that it has remained constant through time. However, it seems more reasonable that two diverging populations might share more migrants initially (due to shared geography or existing social relationships), with gene flow subsequently decreasing exponentially as the two populations move apart (Figure 3a). Or gene flow might increase exponentially as two geographically separated populations begin to move closer together (Figure 3b). Alternatively, gene flow might suddenly resume between two long separated populations; for instance, where geographically disconnected populations came back into contact, either as hunter-gatherer groups during the late Pleistocene (Figure 3d), or as human mobility increased following the development of farming in the Holocene (Figure 3c). The important point is this: two populations can look very similar (FST = 0) or very different (FST = 0.3) even when they have exchanged the same number of migrants (that is, graph lines with the same color in figure 3). It is therefore insufficient to consider only how many migrants have moved between populations; we also need to know when these movements occurred.

    I don't reproduce the figure, because it's complicated and I think the text is sufficient to establish the point. Averages aren't very meaningful. I'll point out that there is some hope of testing these hypotheses, if we consider selected genes -- which have a time that they originated.


    References

  • Quote: Peter Heather, migration is the great Satan

    Tue, 2010-06-29 20:08 -- John Hawks

    Peter Heather's Empires and Barbarians begins with a chapter summarizing grand theories of demography and social transformation among near-prehistoric peoples of Europe.

    Both classical sources and pre-1960 scholarship tended to explain events in terms of the wholesale migration of demes of people. In later years, it has become more common to deny the importance of demic migration, instead invoking elite migration and dominance, demic diffusion, or other schemes.

    I'm not reviewing the chapter here but I wanted to record a quote from Heather's page 19:

    [A] basic equation has grown up in the minds of some archaeologists between any model of the past involving population movement, and simple-mindedness. As a recent introduction to early medieval cemeteries put it, avoiding migration in explanations of archaeological change 'is simply to dispose of an always simplistic and usually groundless supposition in order to enable its replacement with a more subtle interpretation of the period'. Note the language, particularly the contrast between 'simplistic' and 'groundless' (the world dominated by migration) with 'more subtle' (any other kind of explanation). The message here is loud and clear. Anyone dealing with the geographical displacement of archaeologically observable artefact types or habits, who wants to produce an account of the past that is at all 'subtle' or 'complex', should avoid migration at all costs. The tables have turned. From a position of overwhelming dominance before the 1960s, migration has become the great Satan of archaeological explanation.

    What a way of capturing the sneers of critics following a fad. Better to be "subtle" and "complex" than "simplistic"!

  • Libidinous Neandertal men and the women who loved them

    Mon, 2010-05-10 21:54 -- John Hawks

    I keep seeing people, who really ought to know better, saying that the new Neandertal genome results show that the gene flow must have been Neandertal men mating with modern human women, and not the other way around.

    You see, they're fixated on the idea that the mtDNA showed no signs that the Neandertal clade survived into the present-day population. That result really convinced some people that interbreeding was impossible. They're flummoxed that some of the rest of the genome has significant signs of intermixture. It's like their world is spinning out of control. I'm not naming any names, but if you've followed much of the press around the Neandertal genome, you've probably seen this suggestion.

    I don't know why it hasn't occurred to them that the Neandertal mtDNA type was probably lost because of natural selection.

    To avoid raising the awful specter of Darwin, they've been talking about weird mating restrictions. Well, I suppose that if you really have to find a way to get Neandertal nuclear genes into us, without bringing mtDNA along, a total lack of Neandertal women contributing genes is formally one way to get that.

    I'd just like to see these people explain how exactly we managed not to get any Neandertal Y chromosomes, either.

    Is it safe to talk about selection, now?

    UPDATE (2010-05-11): A reader writes:

    With regard to your latest blog post on lack of neanderthal mitochondrial and Y chromosome DNA in humans: yes, it's possible natural selection had a part. However, given that only a small proportion of our ancestors seem to have been neanderthals at the appropriate time, it strikes me that this is a case where drift could be the correct explanation - despite the fact that I'm usually not a big fan of drift as an explanation.

    Much depends on the size of the ancestral population and the pace of population growth in the generations surrounding the pickup of Neandertal genes. Drift is less likely to eliminate alleles in a growing population, but it depends how many copies there were to begin with. The key questions -- where and when the population was growing -- are unlikely to be the same as assumed by the modeling that showed drift couldn't have eliminated the Neandertal mtDNA, as most assumed the location of contact would be Europe and the time would be late.

    There were other deficiencies with the modeling, also. Here we've been working on a source-sink model as a possible demographic scenario for Pleistocene humans; that kind of metapopulation dynamic might easily explain allele losses without selection, and becomes more and more credible as we learn the variance of contribution of Neandertal-like alleles across the genome. It's a different world this week than last week.

    These are all mathematically tricky answers, clever, but academic unless we have good matches to genome-wide variation. Meanwhile a very simple answer, easy to explain to anyone, lies fallow. Exceedingly curious.

    I'd be happy to be proven wrong about the Y chromosome, by the way -- we don't really know that Neandertals didn't have a human-like type, although we do now that today's human population has an exceedingly recent coalescent time. Could be bad estimates of mutation rate. Maybe we'll have more surprises in store.

  • NEANDERTALS LIVE!

    Thu, 2010-05-06 12:53 -- John Hawks

    I, for one, welcome my Neandertal ancestry.

    It may not sound like a lot -- between 1 and 4 percent. But that's the equivalent of one great-great-great grandparent's DNA contribution. In the case of the Neandertal contribution, more than 1500 generations ago, it's an enduring legacy of an ancient group of people, spread across many lines of the genealogies of living people. Beyond their genealogical interest, Neandertal genes might have made a big difference to our evolutionary potential.

    In case you wonder what the heck I'm talking about, here's the story: Two new papers in Science describe the full draft sequence of the Neandertal genome, and perform additional analyses to understand the pattern of adaptive evolution in the population ancestral to living people.

    Richard Green and colleagues report on the genome, demonstrating very convincingly that present-day people have Neandertal ancestors. It is not entirely obvious when and where the gene flow between Neandertals and other ancient populations happened -- whether it was associated with the dispersal of most of our ancestry from Africa, or whether it may have been earlier. The gene flow was not limited to Europe, and evidence for Neandertal ancestry occurs in East Asian and Australasian populations.

    The paper is full of other good stuff, including some evidence about which gene regions changed under selection in the ancestral human population.

    Meanwhile, the second paper by Burbano and colleagues applies new microarray techniques to assess how much of the human legacy of amino acid changes has arisen in the latest, post-Neandertal period of our evolution.

    So there's a lot about the pattern of evolution and gene flow leading to living people, and a lot about adaptive and functional evolution. That makes a lot for me to cover -- and while I have the papers a little early, time is short. Let's see how much I can help clarify what's in this new research.

    If you had to sum up in a few words, what does this mean for paleoanthropology?

    These scientists have given an immense gift to humanity.

    I've been comparing it to the pictures of Earth that came back from Apollo 8. The Neandertal genome gives us a picture of ourselves, from the outside looking in. We can see, and now learn about, the essential genetic changes that make us human -- the things that made our emergence as a global species possible.

    And in doing so, they've taken a forgotten group of people -- whom even most anthropologists had given up on -- and they've restored them to their rightful place in our heritage.

    Beyond that, they've taken all of their data and deposited it in a public database, so that the rest of us can inspect them, replicate results, and learn new things from them. High school kids can download this stuff and do science fair projects on Neandertal genomics.

    This is what anthropology ought to be.

    What did they sequence?

    The Max Planck group obtained most of their genomic sequence from three specimens from Vindija -- Vi33.16, Vi33.25, and Vi33.26. These are all postcranial fragments with minimal anatomical information. Green and colleagues were able to establish that the three bones represent different women, and that Vi33.16 and Vi33.26 may represent maternal relatives.

    From these skeletons they got 5.3 billion bases of sequence. All this from an amount of bone powder about equal in mass to an aspirin pill.

    Amazing. I mean, I know the folks at Max Planck are reading this. It's inspiring to see what they've been able to do. These are three pieces of barely diagnostic hominin bone, and they've obtained literally hundreds of times more information than we have ever gotten from the fossil record of Neandertals.

    I'll describe the analyses of genetic similarity with humans in more detail below. As a brief summary, of those positions where the human genome differs from chimpanzees, Neandertals have the chimpanzee version around 12.7 percent of the time -- meaning that across the genome, a Neandertal and a human will share a genetic ancestor an average of around 800,000 years ago. This is a couple hundred thousand years higher than the same number if we compare two humans to each other. The higher age of genetic common ancestors reflects partial isolation between the Neandertal population and the African populations that gave rise to most of our current genetic variation.

    The team were able to identify 111 candidate duplications, almost all of which have some evidence of copy number variation in humans or other primates. They tentatively show that Neandertals have a bit more copy number variation than present-day humans, and identify a few loci with substantially higher copy numbers in one group or the other.

    A substantial part of the paper is dedicated to finding evidence of positive selection on the human lineage after the emergence of Neandertals. The idea is to look for fixed selective sweeps -- regions where humans are likely to have SNPs absent in Neandertals and a relatively shallow gene tree. They identify 212 regions like this -- as I discuss below, a surprisingly low number.

    The second paper, by Hernán Burbano and colleagues, describes the application of a targeted microarray to probe Neandertal genetic samples for protein-coding variants that separate humans from chimpanzees. They identify 88 amino acid substitutions that seem fixed in the known sample of living humans, but not present in the Neandertal sequence. Those 88 are not necessarily all functionally important, although this list will include a number of "structural" genetic changes that make a difference to proteins expressed worldwide today. There is much to come in analyzing the categories and genes represented in both lists, which may tell us very interesting things about our Late Pleistocene evolution.

    What is the evidence for interbreeding?

    From their initial work sequencing the nuclear genome in Neandertals, the Max Planck group has followed a clever strategy: Don't look at the Neandertal sequence to see what humans share, look at human variation to see which version the Neandertal sequence has.

    The strategy is smart because it helps to obviate some major problems with ancient DNA -- you don't have all the parts, and the parts you do have probably contain a lot of sequencing errors of various kinds. By looking first at sites that vary within humans (or, in some comparisons, between humans and chimpanzees), we can focus on a very simple question -- did the Neandertal have one version, or the other?

    Applied to human variation today, there are several ways we might use a Neandertal genome test the hypothesis of no interbreeding. Green and colleagues focus on two complementary approaches.

    1. If Neandertals contributed no genes to living populations, then they should be equally related to all living people, no matter where in the world those people live.

    Green and colleagues show that the Neandertal genome is closer to some humans than others. People whose ancestry lies outside Africa are significantly more like Neandertals than are people who live in Africa today. In this study, the authors include whole genomes from people in France, China and Papua New Guinea outside Africa, and Yoruba and San inside Africa. The Africans are not as close to the Neandertal as any of the non-Africans.

    That doesn't mean that non-Africans derive most of their genes from Neandertals -- in fact, as I describe below, the proportion is quite small. Living people are more like each other -- even non-Africans and Africans -- than any of them are like Neandertals.

    The point is that despite this great similarity of living people, we have genetic variants that we share with the Neandertal genome, and that proportion is a lot higher outside Africa than inside it. The natural conclusion is the Neandertals contributed more genes to non-Africans than to Africans.

    One thing is for sure: You can't explain this observation under the hypothesis that a small, African population expanded out of Africa without interbreeding with Neandertals along the way.

    2. Look at the genes most likely to represent ancient population structure, the ones with deep roots outside Africa.

    This is an idea that we came up with to look for genes in living humans that might have come in from Neandertals or other ancient populations (for example, we described it in our 2008 review). Look for the parts of the genome with the deepest genealogical roots outside of Africa. Those are candidates for Neandertal gene flow -- a high chance that one of the two sides of that deep root was present outside of Africa for hundreds of thousands of years.

    Green and colleagues took this idea to the next level. They found parts of the genome where non-Africans have a deep root and Africans don't. Then they looked at the Neandertal sequence. Out of the 12 regions they identified with deep roots outside Africa, they found that the Neandertals had the deep, non-African specific version in 10 of those.

    I mean, there's really not any other way you can explain this. We got those genes from Neandertals. Every one of those loci is a region where some people have a Neandertal-derived allele, and others don't. Those particular 10 loci are a small fraction of the overall Neandertal-derived element of our heritage -- because they used Perlegen SNPs to find them, they ended up with regions that are fairly long (100 kb or more in length). Those are probably all really interesting, but there will be more of them when we can reliably identify smaller segments with deep genealogies.

    Could the results have been caused by contamination?

    Green and colleagues are utterly convincing about the level of contamination in their sequence. They have employed several independent checks, all of which arrive at the same conclusion: The modern human contamination in almost all their comparisons is limited to significantly less than one percent -- and for autosomal sequence they can give a tight estimate of 0.7 percent contaminating sequence.

    The methods that Green and colleagues used to test for a Neandertal contribution to non-African populations are not likely to be strongly influenced by contamination. The probe for deep roots in particular is extremely unlikely to be influenced by contamination in the Neandertal sequence.

    The very low contamination rate, and methods that should be robust to some contamination, means that we can be very confident in their result.

    How much Neandertal ancestry do we have?

    The Neandertal contribution does not make up a major proportion of any population, even outside of Africa. Green and colleagues apply a population model that involves isolation between ancestral Neandertal and African populations, a dispersal from Africa into Eurasia, and subsequent mixture with the Neandertals. Under this model, the estimated fraction of Neandertal ancestry for non-African populations today is between 1 and 4 percent.

    Now, let's put on our skeptics' hats. Is this the right model?

    If Neandertal and African populations had not been isolated, then the amount of mixture after an out-of-Africa dispersal would be lower. On the other hand, the dispersing African population would already be part Neandertal, because of genetic mixture. The proportion of ancestry from ancestral Neandertals would be around the same amount, it would just be distributed across a longer time.

    They did not examine the question of how much of the genome came in from Neandertals because of selection. The estimate they have, between 1 and 4 percent, is so high that this is not just a few genes introgressing in from Neandertals -- it is a big fraction of the neutral, non-coding part of the genome. So selection doesn't explain the similarity, nor can parallelism -- the similarity is genome-wide, not just coding or functional changes, and not as far as we know clustered into regions that might have hitchhiked with adaptive alleles.

    But there's clearly a lot more to do, characterizing the functional implications of some regions, testing for selection, and finding Neandertal variants that might have reached very high frequencies in later populations. To the extent that selection has influenced the pattern, it will also throw off the simple population model. But it doesn't throw off the fraction of Neandertal ancestry -- if it's three percent, it doesn't matter whether it was selected or neutral, it's still three percent.

    So the bottom line is, the fraction is going to be about right, regardless of the mechanism by which the genetic mixture happened.

    Can we please take off our skeptics' hats? It's getting in the way of my Neandertal victory dance.

    No. All the cool paleoanthropologists wear hats.

    What about population structure within Africa? Could that explain the apparent Neandertal contribution?

    We've known about the occasional deep-rooted genealogies outside Africa for a long time (and Jeff Wall's work, as an example among others, has explained that pattern as archaic human mixture into non-Africans). They've been talking about something like five percent of the human genome coming from admixture with ancient groups outside of Africa. So this shouldn't come as a shock.

    Until now, though, it has been possible for some people to wave these results away. We didn't really know that any of those deep roots were in archaic humans, and after all, who's to say that they aren't variants that originated in Africa and have since been lost there, or that we haven't found them yet? African variation is great, and if you imagine that some variation might have once existed in northeastern Africa and was subsequently lost within African populations, that might look like admixture with archaic humans outside of Africa.

    This line of argument is now special pleading. Why would we posit a cryptic mystery population in Africa, which happens to look genetically identical to Neandertals, but has subsequently disappeared? A big fraction of deep genealogies outside Africa really are in Neandertals. By far the simplest explanation is that today's non-Africans got them from ancient non-Africans. This is no surprise -- that's where the data have been pointing now for five years.

    Yet Africans are a lot more diverse than other populations, and this diversity itself does reflect the dynamics of the ancient African population. The Neandertals aren't so different from that pattern that now still exists within Africa -- they're extending the notion that "modern" is something that's been evolving for a long time. I expect we'll be able to come to a better understanding of ancient population interactions within Africa, by understanding the parts of the genome that have come from Neandertals outside of Africa.

    Could the gene flow be due to ancient interactions between West Asia and Africa?

    Green and colleagues suggest that at most few genes from modern humans ended up in Neandertals.

    That is, although they find lots of evidence of old-looking genes in us that are shared with the Neandertal genome, they find few cases of new-looking genes in us that are shared with that genome.

    That might suggest several things about interactions between Africa and West Asia and Europe during the Middle to Late Pleistocene. For example, if there had been high gene flow from Africa into West Asia after the first appearance of a distinct Neandertal population, maybe 200,000 to 400,000 years ago, we might expect to find some new-looking genes in humans that Neandertals also got.

    On the other hand, the data are from European Neandertals, who are at the end of a fairly long chain of populations from Northeast Africa. If gene flow had been ongoing into the Levant or further into West Asia during the last 200,000 years, it's not obvious how many of these genes would have made it into Europe. The rapid mitochondrial DNA coalescence of Neandertals does suggest substantial mobility in the population across Central Asia to Western Europe. But maybe that apparent dynamism had a boost from mtDNA selection.

    So just on the data, I don't think we know yet whether this is gene flow in the Levant 200,000 or 100,000 years ago, or whether it's genes coming from West Asian Neandertals into dispersing Africans after 100,000 years ago. I expect all are likely. I have some ideas how to test some of these things, and we will get started immediately.

    The lack of apparent mixture of "modern" genes into Neandertals -- what does it mean?

    It means that a model of one-way gene flow from Neandertals into us can explain the pattern of genetic similarity.

    The authors explain this as a function of population expansion. The expanding population (us) picks up some Neandertal genes that expand in numbers, while the contracting population (Neandertals) doesn't have a chance to pick up as many genes because it is declining in numbers. That model seems plausible, particularly in comparison with historical cases of population contact.

    On the other hand, the three Neandertals from which most of the genome sequence was derived all date to before 40,000 years ago. There weren't any modern humans around for them to have interacted with around Vindija at that time. So should we be surprised that they don't have genes of modern humans?

    A more interesting question was posed to me by a very sharp journalist: What would we expect the result to have been if they had sequenced a Near Eastern Neandertal, like Amud, for example?

    The answer seems obvious -- the admixture fraction should have been higher. That population, which is the most likely to have been the source of mixture, must have been somewhat genetically different from the European Neandertals. Any extent of genetic differentiation between them would make the European Neandertals look less like non-Africans today than the Near Eastern ones.

    I'll have more to say about these Near Eastern Neandertals in the next few days.

    But wait a minute. I thought the mitochondrial DNA proved that Neandertals are extinct!

    Selection. Selection. Selection.

    I've been saying it for years. I've published it. Will you learn to listen to me, already?

    The mtDNA of Neandertals is gone because it conferred some disadvantage. There are many reasons to suspect this -- the Neandertal variation is itself apparently recently derived; the human variation is clearly in disequilibrium, especially outside Africa; the mtDNA genes affect functions that differ greatly in Neandertal and recent populations, including energetics, longevity, and brain; there are clear signs of mtDNA selection in many recent human populations.

    Mitochondrial DNA is useful for a lot of reasons, but nobody should ever have relied on it alone as evidence of Neandertal population dynamics.

    Is it really true that there is no variation in Neandertal ancestry outside Africa?

    The comparisons in the paper are highly convincing because of the sheer amount of sequence taken from the sampled individuals. A single gene locus from an individual may be unrepresentative of the person's population, but averaged across the whole genome, the difference between two people from distant populations is very, very close to the difference between the two populations.

    But they sampled very few individuals. So we are left with a question -- do we really know we've sampled variation outside Africa enough to make regional estimates of Neandertal gene flow?

    I think we could do better with more genomes. For example, when it comes to finding deep genealogies, we need to be able to find shorter regions than the ones used by Green and colleagues. That will expand the sample of candidate loci, and will catch some Neandertal-derived genes that we're missing now. Moreover, if gene flow was really around 1-4 percent, many SNPs that came in from Neandertals will be rare enough to be missing from the big SNP genotyping samples. We may find some variants with whole-genome sequencing on larger samples that will be worth examining.

    But most important, we'll be able to develop strategies based on this success to find ancient population structure involving groups where we don't yet have the DNA -- like populations of South and East Asia. Some of those may give us the chance to test those methods soon, as for the Denisova individual.

    Is this multiregional evolution, or just out-of-Africa with some leakage of earlier Eurasian genes?

    Out-of-Africa movement was a major mechanism of recent human evolution. The genetic ancestry of living people is multiregional.

    I see no contradiction between those statements. From now on, we are all multiregionalists trying to explain the out-of-Africa pattern.

    There was clearly a dispersal of African genes into the rest of the world during the Late Pleistocene, sometime between 50,000 and 100,000 years ago. Living people everywhere on Earth derive more than 90 percent of their genes from African populations who lived 100,000 years ago. That much is plain.

    (Why did I not write "more than 96 percent?" See below.)

    These genetic observations require some kind of out-of-Africa event. This event was not limited to a few genes, and selection of a few genes even with substantial hitchhiking of surrounding genome cannot account for the pattern. There must have been some kind of demographic expansion including African-derived populations and preferentially excluding the genes of Eurasian populations like the Neandertals. Selection on a gene network might have mediated the expansion, as suggested by Eswaran (2002). Or the expansion might have been culturally or technologically mediated, as many other people have suggested.

    Those are hypotheses about mechanisms. How did it come to be that living people trace the overwhelming majority of their ancestry to Africa within the last 100,000 years? These explanations may answer that question.

    The present study shows that Neandertals were at a minimum partially isolated from their contemporaries in Africa, and that the genetic divergence between those populations was larger than the genetic differences between European, Asian, and African populations today.

    Yet those Neandertals are among our ancestors. Late Pleistocene humans had multiregional origins, and the evolution of the Neandertals was itself a case of relatively recent population dispersal from Africa or West Asia. Human and Neandertal genes mostly derive from common genetic ancestors between 400,000 and a million years ago -- much, much later than the initial habitation of Eurasia 1.8 million years ago.

    But 1-4 percent is so minor, can it be an important part of our evolution?

    There are three things you have to ask about the fraction of Neandertal ancestry.

    1. How much gene flow would it take to guarantee that anything adaptive in the Neandertal population survived into later people?

    The answer to that question is simple -- it takes a few dozen matings to get most adaptive genes into our population. If there was a lot of interference with the genetic background, it might take more -- just to make sure that the advantageous alleles had a chance to be de-linked from the genetic background.

    If Neandertals are one percent of the ancestry of non-Africans, we can be very sure that any gene in a Neandertal that had adaptive value in the later population is here now. That means they were important in an evolutionary sense.

    2. What fraction of the human population 50,000 years ago were Neandertals?

    This is very important -- when it comes to neutral genetic loci, the essential question is how much the Neandertals may be underrepresented today relative to their numbers in the past. Is three percent too low? It seems very unlikely that the fraction of Neandertals compared to the rest of humans was as high as 10 percent -- we know that Africa already had a large population 50,000 years ago, and everything we know about Neandertals suggests a very low population density, an effective size much smaller than 10,000 individuals. Were five percent of the people on Earth 50,000 years ago Neandertals?

    We don't really know the answers, but now we have a chance to test hypotheses about ancient population size and expansion in Neandertals. My point at the moment is only this: If today Neandertal genes make up only one percent of the gene pool of the 5 billion people outside Africa, that's the genetic equivalent of 50 million Neandertals.

    In relative terms, their contribution to our population may be a reduction from their fraction of the Late Pleistocene population. Not that great a reduction, not a massive crash to zero. A reduction in the wake of the out-of-Africa movement, possibly from five percent to three.

    You might think the answer to this is obviously zero. But in genetic terms, we can ask, how many times has the average Neandertal-derived gene been replicated in our present gene pool? Those aren't Neandertal individuals -- that is, a forensic anthropologist wouldn't classify them as Neandertals. They're the genetic equivalent.

    The answer to this is also simple: In absolute terms, the Neandertals are here around us, yawping from the rooftops.

    There are more than five billion people living outside of Africa today. If they are one percent Neandertal, that's the genetic equivalent of fifty million Neandertals walking the Earth around us.

    Does that sound minor? If I told you that your average gene would be replicated into fifty million copies in the future, would you be satisfied? Maybe your ambition is greater, but I think the Neandertals have done very well for themselves.

    Does this mean that Neandertals belong in our species, Homo sapiens?

    Yes.

    Interbreeding with fertile offspring in nature. That's the biological species concept.

    Now, some paleontologists might still disagree -- maintaining that species are units that can be distinguished morphologically, or by one or more derived features, or any number of other definitions. That's fine with me, as long as they're clear. But understand: It does define all non-Africans today as an interspecific hybrid population.

    So maybe they want to rethink that one?

    If Eurasians got less than 4 percent from Neandertals, doesn't that mean that they got more than 96 percent from Africa?

    I look at the 1-4 percent estimate as a minimum, for several reasons. As I'll note below, this estimate mainly refers to the excess Neandertal ancestry outside Africa, which means there may be some additional amount that both recent African and non-African populations share.

    But more important, Neandertals weren't the only people living in Eurasia 100,000 years ago. China didn't have Neandertals, nor did Southeast Asia and Java. India was full of hominins, which might or might not have shared substantial genetic similarity with Neandertals. They're close enough to the known Neandertal range to speculate that they may have been close, but the only available fossil, the Middle Pleistocene Narmada skull, is not very informative. Any of these populations might have been genetically different from Neandertals, and might have also contributed genes to present-day human populations -- genes that wouldn't show up by scanning the Neandertal genome.

    The recent genetic sequencing of the Denisova pinky (a.k.a. the X-woman) from the Altai Mountains reminds us that these populations outside of Africa may have been quite a bit closer to us, genetically, than we might have expected from the 1.8-million-year record of humans outside Africa. These populations were dynamic in ways that many paleoanthropologists haven't yet appreciated.

    Do living Africans have Neandertal ancestry, too?

    I think that the present study doesn't have the power to answer this question, at least with the design that the authors used. The fact that living Africans are less genetically similar to the Neandertals is extremely important evidence of the Neandertals' genetic contribution to populations outside Africa. But it doesn't bear on how much back-migration into Africa may have happened.

    We know that the answer is nonzero, because Africa has received immigrants from other parts of the world during historic times. The same genetic patterns that reflect population contacts up and down the East African coast, and across the Sahara into West Africa, show the possible conduits for the flow of Neandertal-derived genes into African populations.

    But how much genetic dispersal into Africa happened in LSA or late MSA times? Mitochondrial and Y chromosome distributions in Northeast Africa suggest there was been some. Nevertheless, Africa would have been a very difficult place to return, for humans who had begun adapting to different ecological and disease environment.

    I think that some Neandertal genes might have made it back into Africa, even in ancient times, but I wouldn't be surprised if that number was small.

    The big shoe left to drop is the extent of population differentiation within Africa during MSA times. So far we've seen hints that these populations might have been nearly as differentiated from each other as they were from Neandertals, with substantial gene flow homogenizing them in the last 30,000 years. This paper includes an additional Bushman genome, after the four published earlier this year. Comparing that new genome to the Neandertals, its modal difference from the human reference (Hg18) genome is between the other humans and the Neandertal. Not quite halfway between, but nearly so. There's a lot of genomic variation within Africa, and exploring the population history that explains that variation may turn up some surprises.

    What about recent selection?

    One of the really exciting aspects of this work is that both Green and colleagues and Burbano and colleagues look for things that all humans today share but Neandertals lack.

    You might call these "the genes that make us modern," although functionally we have little idea what any of them do.

    Both papers show one thing that is extremely interesting: There aren't very many such genetic changes.

    Burbano and colleagues put together a microarray including all the amino acid changes inferred to have happened on the human lineage. They used this to genotype the Neandertal DNA, and show that out of more than 10,000 amino acid changes that happened in human evolution, only 88 of them are shared by humans today but not present in the Neandertals.

    That's amazingly few.

    Green and colleagues did a similar exercise, except they went looking for "selective sweeps" in the ancestors of today's' humans. These are regions of the genome that have an unusually low amount of incomplete lineage sorting with Neandertals, and therefore represent shallow genealogies for all living people. They identify 212 regions that seem to be new selected genes present in humans and not in Neandertals. This number is probably fairly close to the real number of selected changes in the ancestry of modern humans, because it includes non-coding changes that might have been selected.

    Again, that's really a small number. We have roughly 200,000-300,000 years for these to have occurred on the human lineage -- after the inferred population divergence with Neandertals, but early enough that one of these selected genes could reach fixation in the expanding and dispersing human population. That makes roughly one selected substitution per 1000 years.

    Which is more or less the rate that we infer by comparing humans and chimpanzees. What this means is simple: The origin of modern humans was nothing special, in adaptive terms. To the extent that we can see adaptive genetic changes, they happened at the basic long-term rate that they happened during the rest of our evolution.

    Now from my perspective, this means something even more interesting. In our earlier work, we inferred a recent acceleration of human evolution from living human populations. That is a measure of the number of new selected mutations that have arisen very recently, within the last 40,000 years. And most of those happened within the past 10,000 years.

    In that short time period, more than a couple thousand selected changes arose in the different human populations we surveyed. We demonstrated that this was a genuine acceleration, because it is much higher than the rate that could have occurred across human evolution, from the human-chimpanzee ancestor.

    What we now know is that this is a genuine acceleration compared to the evolution of modern humans, within the last couple hundred thousand years.

    Our recent evolution, after the dispersal of human populations across the world, was much faster than the evolution of Late Pleistocene populations. In adaptive terms, it is really true -- we're more different from early "modern" humans today, than they were from Neandertals. Possibly many times more different.

    More?

    That's what I have time for now, if I want to get this posted. There is much, much more to say on the topic, and you can bet it will be all Neandertals all the time here for the foreseeable future.

    References:

    Green RE and many others. 2010. A draft sequence of the Neandertal genome. Science (in press) doi:10.1126/science.1188021

    Burbano HA and many others. 2010. Targeted investigation of the Neandertal genome by array-based sequence capture. Science (in press) doi:10.1126/science.1188046

  • What kangaroos do...

    Fri, 2010-04-23 11:07 -- John Hawks

    In the current issue of Heredity, Neaves and colleagues describe the results of their analysis of 12 microsatellite loci and the mtDNA of two kangaroo species -- western and eastern grey kangaroos. The two species are sympatric across part of Australia, basically a swath through western New South Wales. Neaves and colleagues describe substantial evidence for introgression of both autosomal loci and mtDNA into both populations:

    A total of 7.6% of grey kangaroos sampled from the region of sympatry displayed evidence of introgression. Although no F1 hybrids were identified, 14 M. giganteus backcrosses and 3 M. fuliginosus backcrosses were detected. In addition to introgression at nuclear microsatellite loci, a single individual also exhibited introgression of mtDNA. The two phenotypic groups apparent within the region of sympatry corresponded (in 95% of individuals) to the two clusters identified by genetic analyses. Furthermore, the two phenotypic/genetic groups within the region of sympatry corresponded to representative allopatric samples of M. giganteus and M. fuliginosus from elsewhere in the distribution. Five of the M. giganteus backcrosses identified by genetic analyses were classified as M. fuliginosus based on overall phenotype. Geographically, hybrids were located throughout the region of sympatry.

    This introgression has happened between the kangaroos despite the presence of prezygotic barriers that interrupt mating even in captivity:

    Physical differences in the structure of the cloacal eminence as well as the production of species-specific odours by females may allow for species recognition (Kirsch and Poole, 1972). These characteristic differences are potentially among the features that result in the unidirectional hybridization observed in captivity, with male M. giganteus frequently failing to recognize female M. fuliginosus in oestrus.

    In addition, there was male sterility in captive F1 hybrids. The authors expected a unidirectional bias in introgression owing to these factors, but the evidence says that gene flow apparently has gone both directions in the wild.

    Sort of interesting -- I would actually have expected there to be fewer postzygotic isolating mechanisms in marsupials because the placenta-uterus interaction isn't there complicating matters. But cases of interspecific hybridization have apparently been rarely noted -- maybe that's because Australia is small enough that phylogeographic differentiation doesn't go as far for large species. In any event, this case is another one where F1 hybrids are basically absent in the area of sympatry, yet substantial historical introgression has clearly happened. That's based on a restricted sample of 12 autosomal loci -- we would expect to see much more significant effects at a few genes if the introgressive variant had a high adaptive value.

    A model for ancient humans? Well, here's a case where 12 microsatellite loci seem sufficient to document substantial historical gene flow -- whereas in the human case described last week, there are more than 600 microsatellite loci to test the hypothesis. So the human case should have more power, all things being equal.

    But the humans probably don't have as simple a prior population structure. The kangaroos have two well-defined lineages with a large zone of sympatry. Ancient humans may not have been highly differentiated (given the low Neandertal-human mtDNA coalescence time, for example) and may not have had substantial zones of sympatry -- they may have been much more similar populations interacting along a narrow boundary or cline. So the phylogeography in humans will be much more subtle.

    References:

    Neaves LE, Zenger KR, Cooper DW, Eldridge MDB. 2010. Molecular detection of hybridization between sympatric kangaroo species in south-eastern Australia. Heredity 104:502-512. doi:10.1038/hdy.2009.137

  • East meets West, and vice versa

    Tue, 2010-02-02 19:04 -- John Hawks

    Here's a nice, symmetrical pair of stories:

    DNA testing on 2,000-year-old bones in Italy reveal East Asian ancestry

    ...

    Prowse's team cannot say how recently he, or his ancestors, left East Asia: he could have made the journey alone, or his East Asian genes might have come from a distant maternal ancestor. However, the oxygen isotope evidence indicates that he was definitely not born in Italy and likely came here from elsewhere in the Roman Empire.

    ...

    In addition to the mystery the find uncovers, Prowse sees the broader scientific impact for archaeologists, physical anthropologists, and classicists: The grave goods from this individual's burial gave no indication that he was foreign-born or of East Asian descent.

    OK, that's one way. Now the other:

    Skeleton of Western Man Found In Ancient Mongolian Tomb

    Consider an older gentleman whose skeleton lay in one of more than 200 tombs recently excavated at a 2,000-year-old cemetery in eastern Mongolia, near China’s northern border. DNA extracted from this man’s bones pegs him as a descendant of Europeans or western Asians. Yet he still assumed a prominent position in ancient Mongolia’s Xiongnu Empire, say geneticist Kyung-Yong Kim of Chung-Ang University in Seoul, South Korea, and his colleagues.

    ...

    This long-dead individual possessed a set of genetic mutations on his Y chromosome, which is inherited from paternal ancestors, that commonly appears today among male speakers of Indo-European languages in eastern Europe, central Asia and northern India, Kim’s team reports in an upcoming American Journal of Physical Anthropology. The same man displayed a pattern of mitochondrial DNA mutations, inherited from maternal ancestors, characteristic of speakers of modern Indo-European languages in central Asia, the researchers say.

    Hmmm... it's almost like they're reading from the same script...

    It's not obvious what these finds are uncovering. Is this evidence of very rare migration across very long distances? Is it a weak pattern of long-distance genetic similarity that has been partially masked by later expansions of populations?

    It would help if the stories gave some assessment of how unexpected such finds would be today. Both with regard to the mtDNA-Y chromosome "ancestry" axis, and with respect to the autosomes. Bower mentions work on Kurgan burials which is more informative:

    Add to those discoveries a report in the September 2009 Human Genetics. Geneticist Christine Keyser of the University of Strasbourg in France and her colleagues found that nine of 26 skeletons previously excavated at 11 Kurgan sites in northeastern Russia possess a Y chromosome mutation pattern thought to mark the eastward expansion of early Indo-Europeans. That same genetic signature characterizes the Duurlig Nars man.

    That's a frequency. The more singular finds are much harder to deal with statistically. I also worry about PCR errors when a result is only present in one or two specimens. Looking at dozens of individuals, low-likelihood errors start to become more and more likely.

  • Cultural impedance, demographic growth, effective population size

    Wed, 2009-01-07 01:09 -- John Hawks

    This is a complicated story with many interlocking parts. Telling the whole story may well take me fifty posts. There's a lot of new science hiding in here waiting to get out.

    I'm starting now because of the new paper by Luke Premo and Jean-Jacques Hublin, titled "Culture, population structure, and low genetic diversity in Pleistocene hominins." This paper is not the final word on its topic, nor is it the first word. But it is very much worth reading.

    It makes an excellent point of departure to explain what we know and don't know about the genetics of prehistoric humans. Premo and Hublin propose an interesting model with interaction between culture and natural selection, as an explanation for a 35-year-old problem in human evolution: Our low level of genetic variation.

    Their model may be right. I certainly think there's a kernel of truth in it, shared with a number of other models, as I'll describe below. And it's testable -- a project to which we'll be returning in the next few months.

    Explaining a small effective size

    Humans today have relatively low genetic diversity compared to other hominoids. Chimpanzees, gorillas, and orangutans each harbor more genetic variation than humans worldwide.

    This observation is strange because under a simple genetic model, the amount of genetic diversity in a population should be proportional to the number of individuals. Since there are many more humans in the world than gorillas, chimpanzees, or orangutans, it seems like we ought to have more genetic diversity. But we don't. Strange.

    Or maybe not so strange. Many assumptions are floating under that "under a simple genetic model." My work, and the work of many other geneticists, has been focused on uncovering and examining these hidden assumptions.

    Genetic variation is only indirectly related to demography. Essentially, a population will be genetically diverse because many different alleles survive across generations. This genetic survival is less likely when there are few individuals. It is also less likely when most individuals are close relatives -- that is, when they are inbred. Natural selection can cause inbreeding. Certain kinds of mating behavior can cause inbreeding also.

    One simple explanation for low genetic diversity is simply that there aren't very many individuals. Few individuals means few chances for an allele to reproduce itself in the population. Rare alleles will therefore be rapidly lost in a small population. But of course, we know that there are a lot of people in the world. That explanation doesn't work.

    Bottlenecks

    The first people to point out that humans were short of genetic variation were John Maynard Smith and John Haigh, in 1974. They looked at the allelic variation of the beta globin gene and determined that it was consistent with a population of only 10,000 individuals. Since there are more than 10,000 people now, they needed some other explanation.

    They proposed a historical scenario, in which humans had been limited to very small numbers in prehistoric times. This scenario is a population bottleneck: a restriction for an unknown and unspecified length of time, followed by a recent expansion to the human population's present large size.

    The bottleneck scenario was revived again and again during the next 20 years. When human mtDNA -- like beta globin -- was found to have relatively low diversity, a bottleneck was the preferred explanation. Since diversity was highest in Africa, many authors proposed that Africa had been the location of this bottleneck population. And so, the Out of Africa hypothesis gained its genetic force.

    Meanwhile, in the last fifteen years, a number of people have set about finding other explanations for human genetic variation. A bottleneck can explain some observations well, but seems inconsistent with others. One of these inconvenient observations -- as Premo and Hublin point out -- is that Pleistocene human groups had low genetic variation, just like humans. We know this now because of the Neandertal genome work -- not only Neandertals, but also our common ancestor with Neandertals had low genetic variation. This coincidence of three hominid populations, two of which no longer exist, can't be the product of a single out-of-Africa bottleneck.

    So either we need three distinct bottlenecks, or we need something else. That, among other observations (such as the continuity of features in regions of the Old World outside of Africa), causes us to consider mechanisms that can reduce genetic variation without a bottleneck.

    Population structure, inbreeding, and diversity

    The fastest way to induce inbreeding is the same way that animal breeders do it: take one big horde, divide it up into little herds, and force each individual to mate only within her tiny group. After many generations, each of these little herds will be inbred. Each tiny herd will retain only a very small subset of the big horde's alleles. The genetic diversity of each tiny herd will be low.

    Here's a problem: We still have a bunch of these little herds. Sure, each one of them has low genetic diversity. But if we look at all of them, they probably still collectively retain most of the alleles that had been in the big horde. The variation in the total population will be great, even as the variation in the average subpopulation has been reduced. The imbalance between these values -- the total variation and the average subpopulation variation -- is measured by Wright's FST: a ratio measuring the reduction in diversity due to inbreeding.

    If one of these little herds expanded and wiped out all the others, it would be just like a population bottleneck. The original genetic variation of the horde would be gone, and only the variation of one single herd would remain. That's the Out of Africa hypothesis.

    The frequent extinction and recolonization model

    Consider the population of E. coli in your gut. There are billions of individuals, but all are descendants of a relatively small number of clones -- maybe only a handful. These clones migrated into your body from other people or animals, which each harbor their own population of billions. The global population of E. coli contains untold numbers of individuals -- upward of 1020.

    E. coli cannot really maintain so much variation. When you die, a few individuals of your E. coli population might make it into the gut of a lion or bear. But most of them are hosed. Your gut population will become extinct. Maybe a few lucky individuals will escape your body during your life and colonize a new host -- maybe your child, or the neighbor's dog. The mechanism that retains variation is not the billions of individuals in your gut, but instead the few that move into and out of your gut.

    Maruyama and Kimura realized that this mode of subpopulation extinctions might vastly reduce genetic variation. Takahata (1994) examined this as a mechanism for human genetic variation. The logic is that Pleistocene humans lived in small bands, and each small band of hunter-gatherers had a substantial risk of extinction. If these truly died and were replaced by new colonists from neighboring bands, then the genetic variation might be very small, even though the human population was spread across the Old World.

    Together with Elise Eller and John Relethford, I examined this model in a 2004 paper. We looked at the relation of different parameters in the model, and whether realistic values for hunter-gatherers would have a substantial effect on human genetic variation.

    If we want to reduce genetic variation with this model, then two things have to be true. First, groups need to be quite genetically different from each other. That is, they need to be inbred. And second, they really need to go extinct and be replaced.

    Recent hunter-gatherers tend not to simply die when times are tough. They may disappear from an area, but some numbers of them survive to move into other populations. And there are high levels of intermarriage among hunter-gatherer bands, and between hunter-gatherers and their neighbors. The values that are realistic for living hunter-gatherers will reduce genetic variation by a substantial amount -- perhaps by half. But not by a huge amount. We concluded that values in the Pleistocene may have been more extreme than in the present day, depending on the culture of prehistoric foragers.

    Notice the two factors important to the model. The groups need to be inbred. That means that some force must impede gene flow between them. And the groups need to be replaced with some regularity. That means that some mechanism must cause groups to die.

    The diffusion wave model

    Vinayak Eswaran (2002) proposed that the low genetic diversity of humans could be explained by selection. In his explanation, a coadapted gene complex arose within ancient Africans and dispersed through the Old World population within the last 100,000 years. It is economical to suppose that this coadapted gene complex generated some anatomical or behavioral trait of modern humans. Hence, a dispersal of an anatomy or behavior would lead to genetic dispersal.

    Yet, in this model local genes of populations outside of Africa would survive into the present day. The spread of the key phenotype in this model is not a replacement, it is a diffusion.

    The diffusion of a single advantageous gene will have relatively little effect on genetic variation across the genome. A small area near the selected gene may hitchhike to fixation as a result of selection. But most of the genome will be completely unaffected.

    But Eswaran proposed that several genes were required to work together to generate the adaptive phenotype. Hence, the selective advantage would need to push all these genes simultaneously for the adaptive phenotype to spread. Further, Eswaran supposed that individuals might mate assortatively based on the presence of the adaptive phenotype. This assortative mating is a kind of inbreeding, and would tend to impede the flow of genes from local populations into the growing population with the adaptive phenotype.

    In other words, the diffusion wave model can restrict genetic variation. It does so with the same two conditions as the extinction and recolonization model: Some force causes inbreeding within populations, and another force pushes some of those populations to expand while others contract. In this model, assortative mating and epistasis are the factors that promote inbreeding, while natural selection causes demographic imbalance.

    Premo and Hublin's model

    Now, we can consider the new paper by Premo and Hublin. As in the two models above, their model has a force that promotes inbreeding and another force that causes demographic flux.

    The inbreeding force is "culturally mediated migration" -- the idea that cultural differences between populations tend to impede gene flow between them. If the global population were divided into relatively small herds, each possessing a distinct culture, then we might expect these herds to be inbred. Premo and Hublin performed simulations in which the effects of culture on migration rates were allowed to vary. If individuals demand to settle down in groups with nearly identical cultures to the group of their birth, the inbreeding within populations will be very high.

    The demographic force in Premo and Hublin's model is natural selection. They suppose that advantageous mutations arise spontaneously, and that these mutations are sufficient to drive demographic expansion, as long as gene flow is impeded by cultural differences:

    In a panmictic population, a selectively advantageous mutation evolves to fixation with a probability and at a rate that share a simple relationship to population size and the strength of selection. The manner in which a favorable mutation spreads through a structured population is not so simple (25). In a structured population, gene flow between subpopulations is required for an advantageous mutation to spread beyond the boundaries of the group in which it first appears. However, [culturally mediated migration] can inhibit the spread of beneficial mutations by restricting gene flow to short cultural distances. One consequence of cultural isolation is that offspring inherit only those novel, beneficial mutations that spread to fixation within, but not beyond, the culturally defined boundaries of the group into which they are born. Another is that, when migration between groups is rare, the fate of each beneficial mut ation—its frequency in the metapopulation— depends upon the rate at which its carrier’s group fissions relative to other groups. Variance in groups’ fission rates depends on how relative indiv idual fitness is partitioned within and bet ween groups. A group-level selective sweep, whereby 1 group (and its daughter and granddaughter groups) fissions more rapidly than other groups, requires low within-group variance and high bet ween-group variance in relative individual fitness (26, 27). As long as these conditions persist, members of the group(s) that has accrued the most favorable mutations will contribute disproportionately more offspring to the metapopulation (28, 29) (Premo and Hublin 34-35).

    It may seem obvious that I would really like this idea -- in fact without knowing about Premo and Hublin's work I was lecturing in November about the demographic effects of selection impeded by cultural differences!

    But as in the case of extinction and recolonization, and the case of the diffusion wave with epistasis, the question is whether realistic parameters for humans will work with the model.

    Premo and Hublin don't answer this question. Their paper explores the interaction of several parameters across their entire range, finding some regions of the parameter space in which culturally mediated migration and selection may combine to exert a strong effect reducing neutral genetic variation. But aside from a general claim that cultural distinction among Pleistocene humans is plausible, they do not attempt to demonstrate the importance of these factors for ancient human groups.

    Given our lack of knowledge about the number of selective events and their timing during human evolution, their caution may be appropriate.

    Still, I think there is a great potential for testing this model as applied to the archaeological and genetic record. Taking the culture areas that appear to have characterized MSA/Middle Paleolithic populations and later, are those areas (and the populations contained within them) suitable for culturally mediated migration as predicted by this model? Given the number of selected mutations on the human lineage, within an order of magnitude, are there enough to generate the demographic flux predicted by the model?

    Despite the lack of attention to real Pleistocene population parameters, Premo and Hublin succeed in putting their model into a very interesting context. They connect the idea to Sewall Wright's shifting balance model, suggesting that an appropriately divided human population might give rise to favorable gene combinations -- small and repeated versions of Eswaran's diffusion wave model. And the spatial aspect of the model lends itself naturally to a comparison with spatial dynamics of group selection, which has been a topic of great theoretical interest in the last few years.

    Premo and Hublin claim that this process will only work in species where cultural factors are significant in mediating gene flow. For a narrow construal of the model -- which depends on culture -- that is of course true. But culture is not the only force that could mediate gene flow in this way. Humans set up similar breeding systems in domesticated animals by imposing artificial barriers to gene flow. And natural barriers to gene flow, such as fitness-reducing epistasis depending on genetic background, might do the same. At the extreme, natural barriers such as lakes or islands would lead to a similar consequence to the extinction and recolonization model.

    Next

    This post has added some additional context to Premo and Hublin's paper, connecting the model to other models that are formally similar in many ways. It is natural now to consider the general model that includes all these as special cases, and develop more specific cases that might have influenced human genetic evolution.

    However, that exercise will take some more background. I started out by writing that this is a complicated problem with many interlocking parts. You can now see the boundaries of the problem. But to take it further, we'll have to consider the quantitative analysis of movement.

    That means differential equations.

    References:

    Premo LS, Hublin J-J. 2009. Culture, population structure, and low genetic diversity in Pleistocene hominins. Proc Nat Acad Sci USA 106:33-37.doi:10.1073/pnas.0809194105

  • Genetic differentiation within Europe

    Fri, 2008-10-24 17:57 -- John Hawks

    Larry Moran tells an interesting personal story about long-distance gene flow among Roman-era elites in Europe (What does Marcus Antonius tell us about evolution?). He describes the genealogical connection between Mark Antony and the dark-age Irish warlord, Niall of the Nine Hostages, Y-chromosomal progenitor of a large proportion of Irish (and British) men.

    But the strange thing is that after this story, describing how one man's descendants covered more than a thousand miles in a few generations, Moran gives this conclusion:

    New beneficial alleles will not make much headway in 2000 years because gene flow between subpopulations is very low. There's no reason to assume that it was any different in the ancient past—it may even have been worse. Think about that the next time you hear about some hypothetical allele that arose 50,000 years ago and became fixed in the entire species. That's not very likely.

    I would conclude just the opposite from the story. Garlic mustard has spread across North America after being introduced from Europe less than 500 years ago. It is currently invading formerly "wilderness" spaces such as the remaining patches of prairie here in Wisconsin. This has not happened by a slow, plodding spread from one square meter to the next. It has happened because every so often a few mustard seeds get stuck in the tread of someone's shoe, or tire, or in mud stuck in wheel-wells of cars and four-wheelers. Those seeds get carried into wilderness areas, many miles from their sources.

    Only a very small fraction of garlic mustard seeds get themselves stuck in shoes. We might think that surely this small number should be no threat to Wisconsin prairies. It would take hundreds or thousands of generations for them to make any difference, right?

    But garlic mustard grows exponentially, particularly if that area has been disturbed by fire, plowing, traffic or overbrowsing. A tiny number of seeds are all it takes to spread invasively into a new place. A small amount of long-distance movement has been sufficient to permeate almost every suitable mustard habitat in North America, in less than 500 years.

    A selected gene is like garlic mustard. We may say that only a few members of the Roman elite intermarried with Britons. But if a single Roman married a Briton, carrying an advantageous gene, that gene has the chance to grow exponentially. That chance is not a guarantee, any more than a single garlic mustard seed is a guarantee. A single copy of an advantageous gene still has a very high probability of being lost by chance. But selected genes have a much higher chance of spreading than neutral ones. A very slight amount of long-distance gene flow can cause a selected gene to spread vastly faster than diffusion across a population.

    Besides that, in this case, the history is incomplete. Roman legions occupied Britain for more than 400 years. Those legions were not only Italian, but included soldiers from across the empire, including in one famous instance thousands of Sarmatians. Sarmatians carried with them genes from the steppes of Central Asia, much farther than Rome. Soldiers were stationed for years, and many left the service and became local merchants, landowners, or minor nobles. They were not celibate. For that matter, neither were the early Latin clergy...

    This massive flow of genes into the British Isles did not erase the standing genetic variation, some of which persisted from Neolithic and Paleolithic Britons. But the immigrants were more than enough to spread advantageous genes into the British population. We need not imagine one hitchhiking like mustard seed in the grandchildren of Mark Antony, although that is certainly possible. Antony's descendants were joined by thousands of lonely Roman soldiers stationed for years in backwater British towns, horny Vikings, pillaging Saxons, conquering Normans, and the occasional German prince.

    Early gene flow would have been more influential on the present composition of the British population than later gene flow. But if the question is whether a gene could traverse the European population in a few thousand years, there have been ample opportunities. And if we go back 50,000 years, even relative isolates like Australia and the Americas had their chance to get such genes.

    All this just says that it is plausible for genes to have spread widely through the human population recently. It's no proof that they actually did so, or that they had substantial effects on human similarities or differences. For that we must turn to empirical evidence.

    In that vein, here's a question that I know is of interest to a number of people: How similar should the selected genes in Britain, or Northern Europe generally, be to those of Central Asia, or the Near East, or Italy? We have samples of genetic variation in each of these places (and many others) that would answer the question empirically. We know that the majority of the genome, presumably neutral to selection, shows significant population differentiation among those places. But what does theory tell us? Should we expect selected genes to have a different pattern?

    On this, I'll have to save my answer for later....

Subscribe to gene flow

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.