john hawks weblog

paleoanthropology, genetics and evolution

23andMe

  • Postmodernists are genetic determinists?

    Tue, 2012-05-01 19:37 -- John Hawks

    An article in The Awl by Russell Brandom sighs disappointedly about commercially available personal genome testing ("Everything I Didn't Learn From Taking A Personal Genome Test"). Misha Angrist, early personal genomics adopter, reacts to the piece on his GenomeBoy blog, "Of hairballs and long hauls".

    I agree with most of Misha's post, and I especially started cheering when I read his final point:

    Some of my postmodernist friends tend to look down their noses at genetic ancestry testing. I would argue that they are genetic determinists. Why assume that genetic information is so omnipotent as to irrevocably unravel one’s identity? Why must one narrative trump another? “Because it’s TECHNOLOGY! It’s GENETICS! It is ALL POWERFUL!” Please. It’s just another way of looking at one’s ancestry. And learning about genetics: I would argue that Henry Louis Gates has done as much to stir public interest in genetics as anyone or anything since the Human Genome Project. For realz.

    Data demystifies. The ancestry determinations aren't great, but you know what? People -- including nonprofessionals with an avocation in genetics -- are improving them every day, both showing their limits and inventing new ways to sift information out of genomes.

  • Ancestry perspective from 23andMe

    Sun, 2012-03-04 13:42 -- John Hawks

    Stanford geneticist Joanna Mountain recounts some of the experience she brings to 23andMe in her role as Senior Director of Research: "Solving mysteries via DNA". Much of her interests are the anthropological aspects of DNA and ancestry.

    Now that we know how DNA aligns with prehistoric migrations, we can trace the DNA of individuals to northern Europe or Central Asia, South America or the Near East, western Africa or Oceania. That information about where DNA is from can, in turn, answer questions about our ancestors. Were they struggling to feed their children through hunting red deer in northern Europe, harvesting shellfish in southeastern Asia, raising alpacas in the highland plateaus of western South America, or digging for tubers in eastern Africa? DNA shows that some of us have ancestors who faced the challenge of survival using several of these strategies.

    A new round of "Finding Your Roots With Henry Louis Gates, Jr" is going to begin on PBS this month, using 23andMe services as part of the program.

  • A quick look at your Neandertal fraction

    Fri, 2011-12-16 15:13 -- John Hawks

    The 23andMe blog, the Spittoon, has a description of their new technique to use 23andMe SNPs to estimate any customer's fraction of Neandertal: "Find your inner Neanderthal".

    The result is a rough-and-ready numerical estimate of your Neandertal ancestry fraction. For me it's 2.5 percent. Gretchen is 3 percent, and she's been lording it over me all day.

    The estimate is the work of Eric Durand, who broke ground on the D-statistic method for finding introgression from archaic genomes [1]. He has made public a short white paper describing the application.

    So far, all estimates of Neandertal (or other archaic human) ancestry have come from the proportion of a genome (or genotypes from a genome) that are shared and derived with Neandertals. That includes the results I've been posting here for the 1000 Genomes Project samples this week.

    The next step is to uncover exactly which parts of a person's genome have come from Neandertal ancestors. To discover this, we have to further determine which shared alleles come from recent introgression as opposed to ancient incomplete lineage sorting. We have been working very hard on that problem here, as you'll see, and it has been an important aspect of our work in pigmentation genes in the archaic genomes.

    If you have been considering getting your genotypes from 23andMe, it has become a very good time to do this. The overall fraction of your DNA derived from Neandertals is only the beginning. Soon we'll be able to specify which parts, and in a few cases we'll have a good guess as to what difference it makes. If you want to participate in this research, I'm hoping to gather as many interested people as I can -- so keep your eyes here over the next month.

    And if you are interested in having your genotypes done, feel free to use my link to the 23andMe promotion. I've been very happy with their way of presenting the genotypes and their updates, and know many other people who have also found it interesting. As I wrote a couple of years ago, it's not something to spend your food money on, but it does have an entertainment value. And the potential to be an active research participant.


    References

  • Over coffee

    Fri, 2011-12-16 08:03 -- John Hawks

    G: Guess what Daddy and I learned last night? I'm more Neandertal than he is!

    S: How did you find that out?

    G: Our genes.

    S: That's creepy.

    G: What do you mean, creepy? We think it's awesome!

    S: Awesome... in a creepy way.

  • Anodyne DTC genetics

    Sun, 2011-04-24 16:51 -- John Hawks

    The Wall Street Journal has an op-ed by Matt Ridley, on the topic of possible regulation of consumer genetic testing. He writes that after years of relative non-interest in such tests, he ordered his own because of the likelihood that the FDA will limit their ability in the near future.

    The champions of regulation respond that some firms in the direct-to-consumer genetic-testing industry are sometimes much exaggerating the health benefits of genotyping. As I said above, most results are anodyne and close to useless in terms of telling you how to live your life, but that is not how it sounds on the websites. However, this is not an argument for FDA medical-device regulation or requiring doctors' prescriptions before testing. It is an argument for plain, old-fashioned truth-in-advertising regulation of the kind effected by the Federal Trade Commission.

    The AMA always seems to think it's fighting Doc Brinkley. Personally, I'd say the supplement industry is a far greater threat to the public health than DTC genetic testing, and is surely a better use of the FDA's time.

    UPDATE (2011-04-24): More on Gene Expression. Much tweeting of the final line of the op-ed, as well:

    Genetic knowledge, whether the high priests like it or not, is going to be a crowd-sourced phenomenon.

  • I'm a genetic libertarian

    Thu, 2011-03-10 13:53 -- John Hawks

    Much news coming out of the FDA public meeting on direct-to-consumer (DTC) genetics. Dan Vorhaus was at the proceedings and reports on them ("Looking Ahead After the FDA’s DTC Meeting").

    I believe that I have a fundamental right to my own biological information. What I mean is that, if anybody has biological information about me, I should be able to access and use it. Additionally, I think it is immoral for anyone to charge me excessive rates to access my own information. So that's where I'm coming from. I'm a genetic libertarian.

    In the current proceedings, two issues have arisen of some interest. The first is a relative sideshow but has for good reasons absorbed much attention. Some public figures have adopted a deliberate strategy to portray DTC genomics companies, such as 23andMe, as parasites on the human genetic research otherwise conducted by academics and pharmaceutical companies. This assertion is obviously false -- some DTC companies now involve thousands of participants in active research projects.

    But more immediately important, there's a video that effectively shows the "big lie" -- Congressmen and an FDA official claiming that no research is done by DTC companies only two days after that very official participated in a meeting that highlighted this kind of research! It is incredible, and widely linked (see, for example, Razib's post or Joe Pickrell's post).

    I'm not the fountain of information about this topic, but I do want to link and promote others with whom I mostly agree. Joe Pickrell takes a practical perspective: DTC testing is good for research, because there's an awful lot of research that would never be done without it.

    You can think what you want about the value of the research done to date by 23andme [1], but in my mind, there’s one simple reason why the sorts of participant-driven research they’re doing can only be a good thing: all research is driven by curiosity, and the people most curious about a disease or trait are those who have it. While people may think of the academic research community as a machine with endless resources and limitless motivation, it’s not. People work on things they think are interesting; they sometimes follow “trendy” topics, or move into fields with more grant money, or get bored of a given problem and move on. So if the research in the trait you’re most interested in isn’t moving fast enough for you, well, tough luck.

    Some of my research on Neandertal genetics surely falls into that category, as does almost all genealogical research. The widespread availability of genomes is already leading to much research of anthropological interest.

    Many of my readers will already have seen Razib's post, "Your genes, your rights – FDA’s Jeffrey Shuren not a fan". Working from the video, he tackles what I see as the second and more important issue: Whether the interpretation of genomes should be subject to regulation.

    The online community needs to get organized. We’re not as powerful as a million doctors and a Leviathan government, but we have right on our side. They’re trying to take from us what is ours.

    According to Vorhaus, some regulation is likely a foregone conclusion from the FDA. This is to be expected; it is the ordinary process of rent-seeking by the political class.

  • DNA relatives

    Mon, 2011-03-07 15:07 -- John Hawks

    Steve Mount works through the math of "relative finder" predictions from 23andMe (and by extension, other personal genome tests): "Genetic genealogy and the single segment".

    He does a nice short explanation of a point that is counter-intuitive to many people. You don't actually share much DNA with your relatives by descent, and because chromosomes are inherited in chunks, you quickly (within 6 generations) get to a point where you're not likely to have any DNA in common at all. Yet...you do have to have DNA from somebody, which means that if you do share DNA, you'll probably share a big chunk of it.

  • Ancestry unzipped

    Fri, 2010-10-22 08:30 -- John Hawks

    One of the incredible benefits of the open source approach to genomics is that non-practitioners have a chance to see how interpretations are built. Sometimes it's a real "warts and all" picture of science, as statistical and historical details come into conflict with each other.

    The group at Genomes Unzipped includes a group of forward-thinking geneticists and related professionals who have made their 23andMe genotype data public. Soon after their data release, some other folks went to work on the data. Dienekes Pontikos applied ancestry prediction algorithm, finding that the Genomes Unzipped authors were, no surprise, mostly European -- but two of them were predicted to have a high component of Ashkenazi Jewish ancestry.

    Genomes Unzipped participant Joe Pickrell was surprised to discover he might have a high fraction of ancestry tracing back to Ashkenazi Jews. So he did some investigation of his own:

    Several hours after we released our data, however, I was pointed to a post where Dienekes Pontikos wrote about the results of running all our data through his ancestry prediction program. While just about everyone was quite confidently predicted to be almost entirely of northwestern European descent, this analysis gave me a point estimate of 20% Ashkenazi Jewish ancestry. Within hours, several people had asked me about this, and I had no real response. So I decided to take a look at the data myself; some basic analyses are below.

    The post is a great summary of some basic methods, including the strengths and weaknesses of the assumptions that underlie them.

    I have found over the last several years that this "surprised to discover" reaction is very common among people who have ancestry testing or other genotyping done. Sometimes the surprises end up being well supported by other historical evidence, of which the subject may not have been aware. But more often, the "surprising genealogy" is just an artificial result of applying erroneous or simplified assumptions in the course of the analysis. I think it is tremendously important to write up case studies where the process leading to a result is explicated, where the sensitivity of the analysis to various assumptions can be probed.

  • Sergey Brin and genetic research

    Wed, 2010-07-07 08:30 -- John Hawks

    While I was out of town, Wired ran a long article about Google cofounder Sergey Brin and his quest to find the genetic causes of Parkinson's disease. There is much of interest here. The piece gives an account of present-day genomic research from a unique point of view.

    Brin is a smart person, with a family history of Parkinson's and knowledge that he carries a risk allele. So he is directing a lot of money and attention toward new ways of approaching gene-disease associations. He is one of the major financial backers of the direct-to-consumer genomics company 23andMe, and the husband of founder Ann Wojcicki. Google, of course, has prospered by making unconventional uses of data. That's an approach that many are starting to apply to science:

    Increasingly, though, scientists—especially those with a background in computing and information theory—are starting to wonder if that model could be inverted. Why not start with tons of data, a deluge of information, and then wade in, searching for patterns and correlations?

    This is what Jim Gray, the late Microsoft researcher and computer scientist, called the fourth paradigm of science, the inevitable evolution away from hypothesis and toward patterns. Gray predicted that an “exaflood” of data would overwhelm scientists in all disciplines, unless they reconceived their notion of the scientific process and applied massive computing tools to engage with the data. “The world of science has changed,” Gray said in a 2007 speech—from now on, the data would come first.

    I think that "fourth paradigm" probably overdignifies the approach, which looks like a regression to a naive positivism. As described in the book, The Fourth Paradigm: Data-Intensive Scientific Discovery, the idea is rather more -- a unification of theory and massive amounts of data. Data really do speak for themselves, say Fourth Paradigmers, but they speak quietly with a lot of noise drowning them out. So if you collect vast amounts of data, you have a chance to sort out the whispers of real associations from all the junk.

    The article gives a vivid example:

    Langston offers a case in point. Last October, the New England Journal of Medicine published the results of a massive worldwide study that explored a possible association between people with Gaucher’s disease—a genetic condition where too much fatty substances build up in the internal organs—and a risk for Parkinson’s. The study, run under the auspices of the National Institutes of Health, hewed to the highest standards and involved considerable resources and time. After years of work, it concluded that people with Parkinson’s were five times more likely to carry a Gaucher mutation.

    Langston decided to see whether the 23andMe Research Initiative might be able to shed some insight on the correlation, so he rang up 23andMe’s Eriksson, and asked him to run a search. In a few minutes, Eriksson was able to identify 350 people who had the mutation responsible for Gaucher’s. A few clicks more and he was able to calculate that they were five times more likely to have Parkinson’s disease, a result practically identical to the NEJM study. All told, it took about 20 minutes. “It would’ve taken years to learn that in traditional epidemiology,” Langston says. “Even though we’re in the Wright brothers early days with this stuff, to get a result so strongly and so quickly is remarkable.”

    But there are a few stumbling blocks. Unknown associations are relatively weak. Most of the phenotypes are polygenic. With heritabilities less than one, there remain unknown environmental causes of most phenotypes, which are not captured in genetic data and which may interact with different genes.

    At present, I don't think anyone in genetics is really operating on a "Fourth Paradigm" level. The massive datasets are building, but few authors are working with existing population genetic theory in ways that would enhance the pattern-matching exercise. If you look through papers describing genome-wide association studies, there are a lot of bivariate statistics, and some multivariate descriptive statistics (like principal components analysis). About all the theory is case-control statistical design. Look through a paper on genetic variation and you're likely to see a STRUCTURE analysis and some coalescent simulations.

    The article puts this well:

    'We have no grand unified theory,' says Nicholas Eriksson, a 23andMe scientist. 'We have a lot of data.'

    Genetic data today are a huge contrast from the past, both in sample size and in coverage. There are a lot of new low-hanging fruit. In the future, the easy stuff will be gone and theory will become more and more important. It remains unclear to me how much progress on health may be made by pattern-matching alone, and how much will require new theoretical advances. Given the problems explaining heritability so far, it may be that we'll need new theory sooner rather than later.

    Genetic data are slowly being joined by environment data of various kinds. The article contextualizes the study of environmental variables by telling the story of the initial discovery and long use of aspirin. After it had been common in the population for a long time, researchers started to realize that it had health interactions besides followed by the slow realization that long-time use has health interactions of its own.

    The second coming of aspirin is considered one of the triumphs of contemporary medical research. But to Brin, who spoke of the drug in a talk at the Parkinson’s Institute last August, the story offers a different sort of lesson—one drawn from that period after the drug was introduced but before the link to heart disease was established. During those decades, Brin notes, surely “many millions or hundreds of millions of people who took aspirin had a variety of subsequent health benefits.” But the association with aspirin was overlooked, because nobody was watching the patients. “All that data was lost,” Brin said.

    The answer is simple: Collect all the data and see what percolates out of them. Heck, probably Google already has enough data about everybody based on their web searches, if they could just connect those to the 23andMe database. If you have a few years of web searches, I wonder just how much that tells you about a person's other phenotypes?

    Remember (as the article points out), Google is the company that can predict flu outbreaks faster than the CDC.

    Still, aside from the obvious technological progress, I'm a little more sober about the prospects of making rapid health improvements. Consider:

    This approach—huge data sets and open questions—isn’t unknown in traditional epidemiology. Some of the greatest insights in medicine have emerged from enormous prospective projects like the Framingham Heart Study, which has followed 15,000 citizens of one Massachusetts town for more than 60 years, learning about everything from smoking risks to cholesterol to happiness. Since 1976, the Nurses Health Study has tracked more than 120,000 women, uncovering risks for cancer and heart disease. These studies were—and remain—rigorous, productive, fascinating, even lifesaving. They also take decades and demand hundreds of millions of dollars and hundreds of researchers. The 23andMe Parkinson’s community, by contrast, requires fewer resources and demands far less manpower. Yet it has the potential to yield just as much insight as a Framingham or a Nurses Health. It automates science, making it something that just … happens. To that end, later this month 23andMe will publish several new associations that arose out of their main database, which now includes 50,000 individuals, that hint at the power of this new scientific method.

    Today's sequencing techniques make it much cheaper to do some things that used to be very expensive. But we've done a lot of gold-plated medical studies, and have more coming soon. The most important barrier to progress is not the lack of money; it is the difficulty of altering biological systems without adverse complications.

  • Impacts of gene mix-ups

    Mon, 2010-06-07 09:17 -- John Hawks

    Daniel MacArthur: "Sample swaps at 23andMe: a cautionary tale".

    It appears that a single 96-well plate of customer DNA was affected by the mix-up. This resulted in incorrect results being sent to customers, with some alarming consequences; one mother posted on the 23andMe community about her distress upon discovering that her son's results were incompatible with the rest of the family:

    Not a happy story of genetic discovery, but in the end her son is in fact her own. You hear about genetic information making people question their identity -- this is the depth of impact genetic testing can have.

    It's also amazing to me that there are already families submitting swabs for both parents and children. Talk about disposable income!

Pages

Subscribe to 23andMe

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.