john hawks weblog

paleoanthropology, genetics and evolution

malaria

  • Malaria death toll estimate rises

    Fri, 2012-02-03 07:42 -- John Hawks

    Notable, from the Guardian: "Malaria kills twice as many people as previously thought, research finds".

    The study demolishes conventional thinking on malaria – that almost all the deaths are in babies and small children under the age of five. The study found that 42% were in older children and adults.

    "You learn in medical school that people exposed to malaria as children develop immunity and rarely die from malaria as adults," said Murray, IHME director and the study's lead author. "What we have found in hospital records, death records, surveys and other sources shows that just is not the case."

    It will take some figuring to work out what that means relative to births in areas where malaria is endemic, but we can add to the higher death toll the clear long-term costs of malaria even to its survivors.

  • Spatial dispersal, parallel adaptation, and the "Stooge effect"

    Thu, 2010-10-14 00:06 -- John Hawks

    Peter Ralph and Graham Coop have an interesting paper in the current Genetics, titled, "Parallel Adaptation: One or Many Waves of Advance of an Advantageous Allele?" [1]

    Fisher [2] famously considered the case in which an advantageous allele is dispersing through a spatially dispersed population, showing that the dispersal forms a "wave of advance". This work was the foundation for a lot of progress in understanding spatial dynamics of organisms.

    As I discussed in 2008 ("Overstating the obvious"), one of the consequences of the Fisher wave model for human evolution is that advantageous alleles will spread very slowly through the population. During the course of the Holocene, a strongly selected mutation might move only across a radius of a thousand or so kilometers. That provides one explanation for why new advantageous alleles haven't spread very far beyond their points of origin -- they just haven't had time yet.

    Another reason why an allele might not have spread widely is interference from other alleles with similar effects. I mentioned this process last year ("Spatial variation and near-fixed selected alleles"):

    Greg Cochran and I have been discussing this idea for some time. We call it the "Stooge effect". Think of the Three Stooges all trying to run through a door at the same time and getting stuck in the middle. That's what these genes are doing -- all of them are competing to respond to selection, but each is slowed by the presence of the others.

    Ralph and Coop have cleverly combined the "Stooge effect" phenomenon with spatial dispersal. They suppose a case in which two separate advantageous mutations arise in different geographic locations, each affecting the same trait. Each begins to spread independently as a Fisher wave of advance. What happens when they meet?

    As they show, the dynamics in this case give rise to a static equilibrium -- once the "waves of advance" meet, they stop moving, forming a stable boundary. A new favorable mutation makes headway only so long as it has no equally favorable mutation to compete against.

    I like the way they used both analytical approaches and simulations to come to this outcome. The appearance of stable boundaries in a reaction-diffusion system has long been known (demonstrated first by Alan Turing, actually!). But to my knowledge, no one has considered this specific case from an analytical perspective.

    The Fisher equation is not all that simple for most students to work with. If you become familiar with the equation, you will notice the key aspect is that it has two separate components -- a logistic (or reaction) component representing the increase in frequency at a single point in space, and a diffusion component representing the dispersal across space.

    The muscle of the dispersal process comes from the logistic component. Without the intrinsic growth of the selected allele, the dispersal of individuals along the boundary would not carry many copies of the selected allele into new geographic areas. If the local selective advantage dies, the wave of advance rapidly stalls. A static equilibrium arises, with the frequency of the selected allele forming a cline that correlates with the local selection pressure.

    Ralph and Coop's model approximates this case, in a dynamical sense. Each new selected mutation forms an increasing zone in which the selective advantage of other mutations is zero. When those other mutations encounter this zone, they form a stable cline. The cline is stable in the short term, but the diffusion component still disperses copies of an allele; they just lack the muscle to continue their deterministic expansion.

    The most interesting simulations by Ralph and Coop show the two-dimensional case, in which the stable boundaries emerge in a "tesselation" pattern.

    Tesselations

    Figure 6 from Ralph and Coop (2010), showing "tesselations" in 2-d simulations of waves of advance.

    The lower three panes in the figure show the stability of the boundaries between the selected alleles. They proceed to fixation locally, but their dispersal stops where they come into contact with other adaptive alleles. Over the very long term, the population will mix -- the diffusion process will slowly carry all these alleles throughout the species' range. Look at the process after a million generations and the entire zone will be gray. But this dispersal occurs at the neutral rate, where the diffusion term is the only factor driving the dispersal.

    What about humans?

    My graduate student Zach Throckmorton and I have been working in this area for a while now. One of the things that impresses us is the way that much more interesting dynamics can emerge when you alter the assumptions. I learned some of this stuff by talking to Frank Livingstone, who gave a lot of thought to these issues of spatial dispersal and selection as applied to malaria resistance alleles.

    In particular, Frank thought about the case where one allele has a slightly larger advantage than another. In some contexts, this allows the "better" allele to overtake and swamp the expansion of the "weaker" (but nonetheless adaptive) one. In others, the two come to a near standstill, one displacing the other only very gradually. Much depends on the timing of the two mutations and the local conditions controlling their initial dispersal.

    Ralph and Coop briefly consider this case in their paper, noting that the difference in fitness advantage of two alleles will allow one to advance into the range of the other, albeit at a slower rate. In humans, we may be seeing a smaller subset of cases, where one or more of the alleles have not yet established a wavefront. In these cases, the arrival of another wave can disrupt the spatial pattern of the rarer allele. The diploid case gives rise to the possibility of more complex epistases. Well-defined boundaries between selected alleles are rare, and where they occur (as may be the case with HbC and HbS in Africa), many have focused on negative epistasis as an explanation.

    Also, alleles are unlikely to substitute perfectly for each other. In many cases, they may work synergistically -- individuals carrying two selected alleles that affect the same function may outperform those carrying only one such allele. At some point, new selected mutations may start to have diminishing returns, even on a trait like skin pigmentation where dozens of alleles may have been selected in widespread human populations. So the current distribution may to some extent be "frozen", but by a more complicated dynamic than the simple intersection of waves of advance.

    As Coop and colleagues showed last year [3], and we discussed in 2007 [4], there are really only few genes that have approached local fixation in recent human evolution. The current spatial pattern of recently selected alleles doesn't look like a tesselation with many alleles near local fixation. Over most of the Old World, it looks like populations have a very large number of very new alleles, far from fixation, and few up over 70 percent in frequency.

    So the specific scenario in this paper by itself probably does not explain the overall empirical pattern in humans. But if we consider the current pattern as a transient, approximating the early stages of dispersal for many selected alleles, we may not be terribly far off the mark.

    Mutation-limited evolution

    This is a long dense paper and there's a lot in it. One further aspect of the paper that I think is essential is the way that Ralph and Coop reiterate the basic point that more people means more mutations. In their case, they focus on population density over space (population number, when you multiply them) as a constraint on the number of possible adaptive mutations. They apply this idea as a hypothesis to account for parallel adaptations that may have emerged in recent human evolution.

    Multiple mutational origins are likely if the characteristic length is shorter than the physical dimensions of the region. Eurasia measures >8000 km across, and so Table 1 suggests that multiple origins at a single base pair are very unlikely at the lower population density. On the other hand, if the mutational target is large, then multiple origins are likely at low densities, while at high densities independent origins are ubiquitous. The complementary cases of (rho = 2, µ = 10–8) and (rho = 0.002, µ = 10–5) give identical characteristic lengths of 3000 km, although the timescale on which the mutations spread differs. Thus for these two parameter combinations we can expect a few mutations to dominate within continents and for multiple mutations to be common in a population spread across an area the size of Eurasia. Obviously these calculations are very crude, as population densities vary through space and time, and dispersal across continents is not simply a function of geographic distance and individual dispersal. Nevertheless, these calculations suggest that it is plausible that for adaptive traits with reasonable mutational targets (e.g., a change anywhere within a gene or pathway) even low population densities can lead to parallel adaptation across an area the size of Eurasia, and higher densities almost certainly will.

    We note that as human population densities have increased dramatically over time, so too has the probability of parallel adaptation. It is interesting therefore to note that a number of recent human adaptations (e.g., sickle cell alleles) involve repeated changes at very small mutational targets in relatively small geographic areas, while older adaptations from single changes (e.g., skin pigmentation) are more broadly spread.

    They are describing a scenario in which small human populations would have been mutation-limited -- that is, the number of new mutations is small, making it unlikely that adaptive mutations will happen in any given generation. In such populations, the rate of adaptation is limited by the availability of new mutations. In an extreme -- in the very small effective sizes of Pleistocene human populations -- the rate of adaptation may be extremely slow and regional populations may come to differ at many weakly selected loci, which spread very slowly.

    As the population grows, strongly adaptive mutations become more and more likely to happen somewhere in the species' range. Yet they are still relatively rare -- meaning that they have an opportunity to spread fairly far before encountering another equally strongly selected mutation affecting the same trait.

    This process can give rise to very large differences on a continental scale, even when the selection pressures in different regions do not differ. In humans, the dispersal of selected alleles across space may have been significantly accelerated by actual dispersals of populations. It is not a mere coincidence that very widespread alleles in Eurasia also tend to be much older than 20,000 years old -- long-distance dispersals prior to that time had a higher chance of leaving a lasting influence on subsequent populations.

    But as the population gets bigger and bigger, parallel mutations are more and more likely to happen. As Ralph and Coop point out, at the extreme of large population size and likely mutations, you shouldn't see any new mutations emerging and spreading over very large areas. Any of these mutations would be very likely to encounter other new mutations that do the same thing.

    Is this likely in humans? Clearly some mutations have happened recurrently. Making a broken gene is easy -- there's a large mutational target, since a large fraction of nonsynonymous substitutions might do the job. So if there's a net selective advantage to breaking a gene, we ought to see that happen recurrently in human populations.

    In contrast, if the mutational target is very small, then mutations will still be rare even in a very large population. If only one base change can have an adaptive effect, that precise change will happen less than once in 109 births (remember that not just any mutation at a site, but some particular mutation is what we may need). If a rare duplication or gene conversion is the necessary change, then it may be much rarer.

    Looking across the last few million years, when human population numbers were much smaller than the Holocene, we can be pretty sure that some aspects of our evolution were mutation-limited. The changes that took hold in our ancestors were the ones that happened, and that survived the winnowing of genetic drift. Many changes that would have been adaptive didn't happen in our ancestors. They just weren't lucky enough.

    But some of those changes would still be adaptive now, if we could get them. And we have had much larger numbers in the last 10,000 years. Homo erectus needed these mutations, but we only now are seeing them selected in the human population.

    Malaria adaptation

    Hemoglobinopathies are among the cases of easy mutations -- where breaking a gene is adaptive. It's not just any broken version of alpha- or beta-globin that does the job, though. The hemoglobin needs to be impaired in certain ways to impede the parasites while maintaining blood function. This provides many of the classic cases of human adaptation, and Ralph and Coop turn to this system for examples of parallel adaptation:

    The sickle cell allele HbS at the β-globin gene in humans provides a particularly interesting case of putative parallel adaptation. The HbS allele (β6 Glu-Val) has been driven to intermediate frequencies by selection within the past 10,000 years due to increased resistance to malaria of heterozygotes for the allele (HALDANE 1949; ALLISON 1954; CURRAT et al. 2002; KWIATKOWSKI 2005). The HbS allele is present on at least four major distinct haplotypes in Africa, each at intermediate frequency within a different geographic region; the haplotypes are named after the population sample where they were first discovered (Central African Republic, Senegal, Benin, and Cameroon). This is consistent with multiple origins of this single-base-pair change. Note that a distinct, malaria resistance allele, HbC (β6 Glu-Lys), has also arisen in Africa at the same codon as the HbS allele (TRABUCHET et al. 1991; AGARWAL et al. 2000; WOOD et al. 2005a), increasing our confidence that the mutational input was high enough to allow multiple types to arise. However, FLINT et al. (1998) thought the hypothesis of multiple new mutations arising at a single base pair was extremely unlikely and proposed that it was more likely that gene conversion had spread a single mutation across multiple haplotypes.

    The theory we have developed can be used to assess the plausibility of the multiple mutational origins of the sickle cell allele, by exhibiting parameter combinations that yield characteristic lengths consistent with the separation of the sample locations. [Recall that the wave of advance, and thus also our model, works in the case of heterozygote advantage (ARONSON and WEINBERGER 1975).] The different HbS haplotypes co-occur within a few thousand kilometers of each other (see Table 5 of FLINT et al. 1998) (noting that these locations are unlikely to reflect the geographic mutational origins, and mutations will have been spread by large population movements). As the HbS changes occur at a single base pair, the mutation rate would have been 10–8, and we take an s = 0.05 (as in CURRAT et al. 2002). If human dispersal at that time was well approximated by a Gaussian kernel with sigma = 100 km, then a characteristic length of 1000 km would require an effective density of individuals of rho = 25 km–2, while if sigma = 10 km, then we would require only rho = 2.5 km–2. This latter set of parameters does not seem unrealistic, considering our knowledge of population density and dispersal parameters, so our model suggests that the hypothesis of multiple origins is not unreasonable.

    I think they've got the basic idea correct here, but there are some additional details to consider. The distribution of HbE is not quite so easy to understand if parallel mutations are really so likely, and of course there is the negative epistasis of different alleles (and the thalassemias) which impacts their dispersal ability when they become moderately common. The dynamic may be of similar form to the one described here, but boundaries between alleles may be reinforced by the fitness costs of carrying multiple ones.

    This situation raises the issue of path dependence. Some mutations have "first mover" advantages. Once they are common, other adaptive mutations may still occur -- even mutations that are better from the standpoint of fitness -- but be lost or grow very slowly because their net fitness advantage over the common mutant is slight. Where HbE is common, new HbS alleles are unlikely to invade quickly. Where HbS is common, new HbE mutants are similarly unlikely to invade -- even though HbE has a higher fitness.

    Network effects among genes may also dominate the spatial dynamics. HbS spread most widely in the context of populations that were already Duffy null, and in which G6PD deficiency was rapidly increasing. The first conditioned the parasite environment -- P. vivax had a strong disadvantage in Duffy null populations, P. falciparum made up most of the parasite load. G6PD deficiency should have impacted the relative advantage of HbS, more and more as it became more common. Those are two loci among many that alter malaria dynamics in Africa compared to South and Southeast Asia.

    Conclusions

    There is much more to say about this paper -- it's 22 journal pages. But I think I've given an impression of what's there and how the ideas may impact our interpretation of recent human evolution. Many of the central concepts were presaged by earlier work in 2007 and 2008, as reviewed here on the blog. The new analytical and simulation work, I really like.

    Hopefully we can get out some shorter papers that will focus on aspects of these problems as applied to humans. A message that comes across very clearly in our work and this new paper is that different time periods in our evolutionary history must have had very different selection dynamics. Pleistocene humans were not only in a different ecology than us, they experienced a radically lower potential for adaptation.


    References

  • Falciparum malaria came from gorillas

    Wed, 2010-09-22 15:38 -- John Hawks

    Malaria in humans is caused by one of five different species of Plasmodium parasites. The deadliest of these is P. falciparum, especially within Africa where native resistance to P. vivax is high. Where the vivax parasites seem to have been around for at least tens of thousands of years, P. falciparum in many ways looks relatively young. Its comparative lack of genetic variation suggests either a recent origin from some other primate species, or an intense bottleneck or selective sweep affecting the parasite's demography. In either case, the falciparum history seems to indicate that its present widespread distribution is a very recent phenomenon -- possibly within the last 5000 years.

    Because P. falciparum is phenotypically similar to the major chimpanzee malaria parasite, P. reichenowi, most scientists have assumed that we got falciparum malaria from chimpanzees. But in a new report, Weimin Liu and colleagues [1] have surveyed parasite variation in gorillas, bonobos and chimpanzees across Africa, finding that human falciparum parasites all group in with a single small clade of gorilla parasites. The other primates carry many varieties of parasites, with typical individuals being highly heteroplasmic -- that is, carrying several different strains.

    From the discussion:

    Using single-template amplification strategies and a much larger collection of ape specimens than previously analysed, we show here that wild-living chimpanzees and western gorillas are naturally infected with at least nine Plasmodium species. Among more than 1,100 SGA-derived mitochondrial, apicoplast and nuclear gene sequences from 80 chimpanzee and 55 gorilla samples, we found a total of nine sequences that were related to P. malariae, P. ovale or P. vivax (Supplementary Table 5). All others grouped within one of six chimpanzee- or gorilla-specific lineages representing distinct Plasmodium species, three of which had not previously been described. Significantly, all currently available human P. falciparum sequences constitute a single lineage nested within the G1 clade of gorilla parasites. This indicates that human P. falciparum is of gorilla origin, and not of chimpanzee9, 10, 12, bonobo11 or ancient human5 origin, and that all known human strains may have resulted from a single cross-species transmission event. What is still unclear is when gorilla P. falciparum entered the human population and whether present-day ape populations represent a source for recurring human infection. It has been suggested that the limited levels of genetic diversity seen at many loci in human P. falciparum reflect a relatively recent selective sweep8. Our data suggest that this bottleneck or ‘Eve event’ was instead the consequence of cross-species transmission of a gorilla parasite. It is difficult to date this event without having reliable dates with which to calibrate the Plasmodium phylogenetic trees.

    What's interesting about the study is the sheer coverage of wild primates, and the application of multiple gene trees, which suggests that this is a recent origin of human parasites instead of introgression and selection of a single gene. I don't know if it makes any difference whether the disease came from gorillas or chimpanzees, but it certainly helps to confirm that it is new and not a long-time coevolution. That explains the burst of recent selection associated with resistance genes, especially within Africa.


    References

  • Primate genomics: the Duffy (FY) gene, malaria, and baboons

    Wed, 2009-06-24 20:55 -- John Hawks

    Jenny Tung of Duke University and colleagues report in Nature (online early) that yellow baboons have evolved a Duffy antigen-related defense against a baboon relative of malaria.

    Most Africans carry a null allele for the Duffy antigen, coded by the DARC gene, which functions to protect them from vivax malaria. It's not the worst kind of malaria (that would be falciparum), but it is a major cause of disease outside those regions where the Fy*O allele is near fixation.

    The baboon version of DARC is not convergent with the human null allele; the paper reports that it actually increases the gene activity, whereas the baboon variant allele actually increases the activity.

    Presumably, the different defense in baboons is because they're fighting a different parasite:

    Baboons are not generally infected by Plasmodium in the wild, but are vulnerable to infection by several closely related haematoprotozoans (4, 5) including Hepatocystis kochi, a blood parasite nested within the paraphyletic Plasmodium genus (15). Hepatocystis parasites do not produce the cyclical fever spikes typical of malaria in humans, but do produce anaemia and visible merocyst formation, followed by scarring on the liver (4).

    They were able to show that the infection rate with Hepatocystis is significantly lower in baboons that cary the protective allele.

    Based on their comparisons, the locus looks like a case of balancing selection:

    We detected an increased level of population differentiation among East African baboon populations around FY, by comparing a FY-linked microsatellite with 35 neutral microsatellites (Fst = 0.31, P Fst for the neutral markers was 0.008–0.346; Fst is a metric describing genetic divergence between populations based on allele frequency differences at variable sites; Supplementary Fig. 3). We also detected a higher value for the Tajima's D statistic (D = 1.26) in this region relative to nine of nine other resequenced putative cis-regulatory regions in the Amboseli population and 11 of 12 resequenced transcribed regions (range of D for all other loci was -1.60 to 2.12). The only locus with a higher value of D, a transcribed portion of the gene MSR1, exhibited an even more extreme value than that identified for the MHC DQA1 promoter in baboons (22), which is known to evolve under strong trans-specific balancing selection (20).

    A balance really would be necessary for them to be likely to have any evidence of selection. There's no reason to think that baboons are in the kind of demographic and disease transient that humans are in, so if a protective allele were always beneficial, it would likely be fixed. Still, it's not obvious what the disadvantage of the protective allele would be, although in humans it has been suggested that altering Duffy expression may impair immune response by reducing white blood cell count (Reich et al. 2009). Considering the high stress and cortisol levels of wild baboons, it may be that changes in immune activity have even more disadvantages.

    One important aspect of the study is that the allele affects the cis-regulatory region of the gene; that's the general research topic covered by Gregory Wray's research group. I think that it's important because it raises the prospect that targeted sequencing of cis-regulatory elements in primate genomes might lead to the discovery of more adaptive variations within primate species. In their concluding paragraph, the authors emphasize the strengths of a combined field and in vitro approach to characterizing functional variants:

    In vivo gene expression measurements are complicated by variation in genetic background and in the environment, both of which can modify functional cis-regulatory effects (25, 26). Indeed, our results show that even baboons that are homozygotes at the C/T site sometimes exhibit allelic imbalance in FY expression, suggesting that other, unidentified functional cis-regulatory variants are also segregating in the population. In contrast, in the in vitro comparisons, only a single cis-regulatory site differed between the experimental constructs, thus controlling for both environment and genetic backgrounds. Using both approaches in tandem can be synergistic: while in vitro experiments can help pin down specific functional sites, in vivo results demonstrate that these effects are relevant to the biology of individuals in the wild.

    When it comes to primate genetics, looking for defenses to infectious diseases should be low-hanging fruit. Just take human genes that have alleles that defend against diseases, and sequence them for variations. Hopefully we'll find many others -- and in a few cases, those variations may prove useful in human health contexts, as they may reveal new pathways to deter or defeat pathogen infections.

    References:

    Tung J, Primus A, Bouley AJ, Severson TF, Alberts SC, Wray GA. 2009. Evolution of a malaria resistance gene in wild primates. Nature (advance online publication) doi:10.1038/nature08149

    Reich D, Nalls MA, Kao WHL, Akylbekova EL, Tandon A, et al. 2009. Reduced Neutrophil Count in People of African Descent Is Due To a Regulatory Variant in the Duffy Antigen Receptor for Chemokines Gene. PLoS Genet 5(1): e1000360. doi:10.1371/journal.pgen.1000360

  • Bill Gates, bioterrorist wannabe

    Thu, 2009-02-05 00:54 -- John Hawks

    Well, when you've got a captive audience....

    LONG BEACH, Calif. - "Bill Gates just released mosquitos into the audience at TED and said, 'Not only poor people should experience this.'"

    That was the post by Facebook's Senior Platform Manager Dave Morin on social networking site Twitter.

    ...

    The mosquito incident was confirmed by the media office of the Bill and Melinda Gates Foundation, which also noted that the insects released were not carrying malaria.

    Because that would be wrong. Although it would be interesting to see the reaction to that. Maybe Gates was trying to emulate Steve Jobs' famous "reality distortion field" and got it horribly wrong.

Subscribe to malaria

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.