john hawks weblog

paleoanthropology, genetics and evolution

genealogy

  • Effective size through genealogy

    Thu, 2011-11-24 23:48 -- John Hawks

    Sandwalk: "What William the Conqueror's Companions Teach Us about Effective Population Size".

    Let's assume that there are 20 well-documented companions. Only one of these (William Mallet) has possibly passed on his Y chromosome to the present time and even that male line of descent is disputed. This is fully consistent with our understanding of genetics when you consider that most male lines are likely to die out in a few generations. Those that survive ten generations or so are unlikely to become extinct since there will likely be several male lines at that time.

    Only 10 of the companions have descendants who are alive today. This could be due to the fact that genealogists don't have perfect records for all the companions and their families but it's also quite in line with expectations.

    A nice illustration, with a link to my own review article (available free here), "From genes to numbers: Effective population sizes in human evolution".

  • Genomes unzipped, ancestry revealed

    Mon, 2010-11-01 11:25 -- John Hawks

    Last week I linked to Genomes Unzipped participant Joe Pickrell ("Ancestry unzipped"), who was working through the ancestry calculations that made his genome appear to have been partially Ashkenazi Jewish in origin ("Am I partly Jewish?").

    Now, Pickrell updates the story ("Am I partly Jewish? An unexpected turn of events"):

    As I was mulling over these sorts of issues, I sent the link to my previous analysis to a family member. I didn’t really expect this person to find it that interesting, but hey, you never know. I then got a phone call. I’ll summarize a couple days worth of moderate confusion, second-hand reports of conversations with distant relatives, and family intrigue with this: as it turns out, one of my great-grandparents was indeed a Polish Ashkenazi Jew who immigrated to the United States around the turn of the century. I, obviously, was completely unaware of this.

    Before coming to this deeper genealogical discovery, Pickrell summarizes some additional comparisons that made it difficult to explain the genetic results by errors in assumptions of the methods. This is the kind of outcome many people are hoping they will get from their genetic information -- a prompt for distant relatives to uncover family histories that, in many cases, they didn't know would be interesting. Or that in the past some wanted to forget.

    Razib ponders the question "Do ancestors matter?" Obviously they matter deeply to some, to others not so much. Unexpected discoveries from genetic information are not new, they're at least as old as blood typing. There is already a large community of people who find meaning from genealogical research and comparisons with others -- research that ultimately can illuminate only a very small part of any individual's genealogical history. Genetics doesn't necessarily offer any more than this. Any one person's genes come from only a small fraction of her ancestors.

    But the DNA may inform about a different part of one's genealogy than oral and written history. And it may give rise to a deeper idea of ancestry than cultural reckoning -- which in a mere fifty years can drive people to forget some of their ancestors and promote fictive ones.

    Synopsis: 
    A researcher gets a genealogical surprise after making his genome public.
  • More X-Woman thoughts

    Sat, 2010-03-27 13:08 -- John Hawks

    I had a great session with my advanced students yesterday running through different evolutionary scenarios for the X-Woman. This and some later posts will follow up on my initial thoughts ("Hobbit version 2.0: the undiscovered hominin").

    May I just say, "X-Woman" is one of the more dopey nicknames for an ancient piece of bone? I mean, it's better than "Twiggy", but jeesh. I can't be the only one who thinks of John Singer Sargent:

    Madame X

    "Madame X", the once-shocking salon portrait by John Singer Sargent. Fulfilling my lifelong dream of bringing Sargent together with Neandertals.

    Meanwhile, I have some great e-mails about Madame X, some of which I can share. First, an exchange on the topic of incomplete lineage sorting:

    I'm confused by your suggestion of an ancient divergence among Neanderthals. Wouldn't that lead to a tree with the Siberian DNA and other Neanderthal DNA samples forming their own clade, to the exclusion of human DNA? As things stand, the Neanderthals are closer to humans than to the Siberian DNA.

    Not at all; it could be either way.

    Consider humans today. Africans have mtDNA lineages (the L clades) that are deeper in the human tree than any outside of Africa and basically absent elsewhere except for recent migration. But Africa also contains many of the mtDNA lineages that are present in Europe, India and West Asia.

    Now imagine that the human population divides into two species, Africans and non-Africans, and those species persist for 100,000 years. Assuming no huge bottlenecks in either of these species, they both ought to retain the major clades present today. If we sample their genes at that time, 100,000 years in the future, we'll discover that Africans will be more genetically diverse than non-Africans. And the Africans will have L clades that are outgroups to the clades (M and others) that include *some* Africans and *all* non-Africans.

    Subsequent population bottlenecks or selection could eliminate those ancient clades, but they will hang around unless they are eliminated. That's also the explanation for why humans and gorillas are more genetically similar at some loci than either is to chimpanzees, even though humans and chimpanzees speciated more recently. The variation in that ancestral H-C-G population was retained in the ancestral H-C population, to some extent, and lineage sorting sometimes gave humans the more gorilla-like clade.

    An interesting question is whether the rest of the Neandertal sample would be so relatively invariant, if some part of their population included this quite divergent mtDNA haplotype.

    It's quite hard to answer that question given the small sample of Neandertal mtDNA -- only less than twenty individuals, sampled from a range of times. A "lopsided" tree, with a lot of similar sequences and a few divergent ones, is not an unlikely genealogy in a small sample. The variance in the lengths of the deep branches in a genealogy is intrinsically high, even in the simple Wright-Fisher model with no population structure or selection. A "lopsided" tree is just one possibility on a continuum, in which the deepest coalescence time in the sample is high relative to the next deepest -- not an unlikely event at all.

    For those who would like to explore this process, I put together a Mathematica demonstration ("Coalescent Gene Genealogies") that generates random gene trees under the neutral Wright-Fisher model. Strange-looking trees are normal, in the sense that they occur often enough that they are not statistically unlikely for a single gene locus.

    Obviously what you'd want to do is compare multiple gene loci -- in this case, to get nuclear genomic sequence. Since the Max Planck group is actively pursuing further sequencing (and already has had some success, according to their press conference), I expect they're already making progress toward testing the neutral hypothesis.

    If mtDNA proves to be unusual compared to other loci, then it's either intrinsic coalescent variability, or selection. Testing those two alternatives would require a larger sample of Neandertal mtDNA.

    If, on the other hand, the nuclear genetic diversity is also substantially not shared with Neandertals (or living people), then the hypothesis of population structure in Late Pleistocene-age Eurasia would be strongly supported. It's a bit more complicated to test whether a speciation had occurred, but with whole genomes such a test can almost certainly be done.

    Synopsis: 
    What does it mean for an ancient specimen from the Altai to have a highly divergent mtDNA?
  • Genealogy and genetics

    Thu, 2009-10-15 18:03 -- John Hawks

    Larry Moran writes, "Are you a descendant of Charlemagne?"

    Thousands of amateur genealogists have contributed to a huge database of family relationships, including genetic analyses. What does this teach us about human populations and evolution?

    It touches on some issues covered in more detail in Steve Olson's book, Mapping Human History: Genes, Race, and Our Common Origins, which remains surprisingly relevant today despite the explosion in genetic data. That's because Olson did a good job on the population genetics side.

    Oh, and yes I am a descendant of Charlemagne. Woo-hoo!

  • Sending a cheek swab?

    Sat, 2009-02-07 21:04 -- John Hawks

    A reader wrote me today:

    I am wondering if you could suggest a general article for me to give to people who tell me how exited they are about sending a cheek swab to National Geographic for analysis. I imagine you have written something like this, but don't remember where it is. Any thoughts?

    I wrote back, and thought I should share. I have an article in Slate from a couple of years ago that covers that topic:

    http://www.slate.com/id/2138059/

    Also, there was a NY Times article in late 2007 that basically covers the same difficulties:

    http://www.nytimes.com/2007/11/25/business/25dna.html

    I blogged about that article at the time. I also have a post from the time of the Slate article.

    And my "testing" tag leads to a bunch of posts on the topic of genetic testing. I've never really sat down to write an FAQ on the topic, but I think it would be worth doing. Genetic Future does a very good job of covering current commercial offerings in genetic testing, a beat also covered at Eye on DNA.

    If someone was thinking of sending $100 to Genographic, I have nothing against it as a form of entertainment. I think the information you get that way is, for most people, very superficial and unsurprising. Basically, they're giving you your paternal Y chromosome lineage if you're a man, and your mtDNA if you're a woman. That fee pays for your test, the information they give out to you as a participant, and an extra amount that they will use to subsidize testing of various populations around the world.

    But if you're looking for the entertainment value, you will be much better off saving up $400 to send to 23andMe. That pays for a whole-genome SNP survey, including various phenotype predictions. They have all the genealogical information from the Y and mtDNA that you would get from Genographic, and it's better-presented. Overall, it's a much better deal, and more likely to generate something interesting to share with your friends. Of course, it costs more, but it's a better value.

    Now, for serious medical purposes, or for trying to do phenotype predictions of your children, I don't recommend 23andMe. I honestly think that gene testing now is best treated as expensive entertainment. You might get a laugh, or learn something about your ancestry that you didn't know. But you won't likely learn anything that would be worthwhile to your children. Not yet, anyway. This is not an investment.

    In other words, don't spend your food money. Think of it as eight months of cable TV.

  • Genetic differentiation within Europe

    Fri, 2008-10-24 17:57 -- John Hawks

    Larry Moran tells an interesting personal story about long-distance gene flow among Roman-era elites in Europe (What does Marcus Antonius tell us about evolution?). He describes the genealogical connection between Mark Antony and the dark-age Irish warlord, Niall of the Nine Hostages, Y-chromosomal progenitor of a large proportion of Irish (and British) men.

    But the strange thing is that after this story, describing how one man's descendants covered more than a thousand miles in a few generations, Moran gives this conclusion:

    New beneficial alleles will not make much headway in 2000 years because gene flow between subpopulations is very low. There's no reason to assume that it was any different in the ancient past—it may even have been worse. Think about that the next time you hear about some hypothetical allele that arose 50,000 years ago and became fixed in the entire species. That's not very likely.

    I would conclude just the opposite from the story. Garlic mustard has spread across North America after being introduced from Europe less than 500 years ago. It is currently invading formerly "wilderness" spaces such as the remaining patches of prairie here in Wisconsin. This has not happened by a slow, plodding spread from one square meter to the next. It has happened because every so often a few mustard seeds get stuck in the tread of someone's shoe, or tire, or in mud stuck in wheel-wells of cars and four-wheelers. Those seeds get carried into wilderness areas, many miles from their sources.

    Only a very small fraction of garlic mustard seeds get themselves stuck in shoes. We might think that surely this small number should be no threat to Wisconsin prairies. It would take hundreds or thousands of generations for them to make any difference, right?

    But garlic mustard grows exponentially, particularly if that area has been disturbed by fire, plowing, traffic or overbrowsing. A tiny number of seeds are all it takes to spread invasively into a new place. A small amount of long-distance movement has been sufficient to permeate almost every suitable mustard habitat in North America, in less than 500 years.

    A selected gene is like garlic mustard. We may say that only a few members of the Roman elite intermarried with Britons. But if a single Roman married a Briton, carrying an advantageous gene, that gene has the chance to grow exponentially. That chance is not a guarantee, any more than a single garlic mustard seed is a guarantee. A single copy of an advantageous gene still has a very high probability of being lost by chance. But selected genes have a much higher chance of spreading than neutral ones. A very slight amount of long-distance gene flow can cause a selected gene to spread vastly faster than diffusion across a population.

    Besides that, in this case, the history is incomplete. Roman legions occupied Britain for more than 400 years. Those legions were not only Italian, but included soldiers from across the empire, including in one famous instance thousands of Sarmatians. Sarmatians carried with them genes from the steppes of Central Asia, much farther than Rome. Soldiers were stationed for years, and many left the service and became local merchants, landowners, or minor nobles. They were not celibate. For that matter, neither were the early Latin clergy...

    This massive flow of genes into the British Isles did not erase the standing genetic variation, some of which persisted from Neolithic and Paleolithic Britons. But the immigrants were more than enough to spread advantageous genes into the British population. We need not imagine one hitchhiking like mustard seed in the grandchildren of Mark Antony, although that is certainly possible. Antony's descendants were joined by thousands of lonely Roman soldiers stationed for years in backwater British towns, horny Vikings, pillaging Saxons, conquering Normans, and the occasional German prince.

    Early gene flow would have been more influential on the present composition of the British population than later gene flow. But if the question is whether a gene could traverse the European population in a few thousand years, there have been ample opportunities. And if we go back 50,000 years, even relative isolates like Australia and the Americas had their chance to get such genes.

    All this just says that it is plausible for genes to have spread widely through the human population recently. It's no proof that they actually did so, or that they had substantial effects on human similarities or differences. For that we must turn to empirical evidence.

    In that vein, here's a question that I know is of interest to a number of people: How similar should the selected genes in Britain, or Northern Europe generally, be to those of Central Asia, or the Near East, or Italy? We have samples of genetic variation in each of these places (and many others) that would answer the question empirically. We know that the majority of the genome, presumably neutral to selection, shows significant population differentiation among those places. But what does theory tell us? Should we expect selected genes to have a different pattern?

    On this, I'll have to save my answer for later....

  • Genetic GPS in Popular Science

    Mon, 2008-09-15 09:30 -- John Hawks

    Popular Science has a short article covering recent research into European population structure:

    To the delight of genealogy buffs like me, scientists recently announced in the journal Nature that they can trace European ancestry to within 192 miles by analyzing tiny inherited markers in DNA. That means someday we'll need look no further than our own genes to locate our motherlands.

    The study, and another much like it in a recent issue of Current Biology, harnessed the stream of human genetic data now being gathered by pharmaceutical companies. Using modern “gene chip” technology, researchers can screen 500,000 units of DNA at once. The companies use the data to investigate the genetic basis for adverse drug responses. But population geneticists are taking advantage of the high-resolution databases too, scouring them for trends in human evolution that are otherwise hard to find.

    That's me, I guess. They're calling it a "DNA GPS", which is pretty clever.

Subscribe to genealogy

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.