john hawks weblog

paleoanthropology, genetics and evolution

metascience

  • Tenured inertia on publishing

    Wed, 2012-01-04 16:52 -- John Hawks

    Danah Boyd rants "Save Scholarly Ideas, Not the Publishing Industry". This is a well-worn topic here on my blog, but she hits on a useful theme: People with tenure should be leading the charge, but instead it's mainly young scholars who are working for change in the way we publish research and scholarship:

    What pisses me off to no end is that the same Marxist academics who pooh-pooh corporations justify their own commitment to this blood-sucking process with one word: tenure. Not like that is the end of the self-justifications. Even once scholars get tenure, they continue down the same path – even when not publishing with students – by telling themselves it’s for promotion or because grants require it or because of any other status-seeking process.

    WTF? How did academia become so risk-adverse? The whole point of tenure was to protect radical thinking. But where is the radicalism in academia? I get that there are more important things to protest in the world than scholarly publishing, but why the hell aren’t academics working together to resist the corporatization and manipulation of the knowledge that they produce? Why aren’t they collectively teaming up to challenge the status quo? Journal articles aren’t nothing… they’re the very product of our knowledge production process.

    Coming from corporate research, Boyd lacks information on this topic. She doesn't seem aware of the immensity of the open access movement underway or its notable successes. But the comment stream is full of interesting anecdotes and suggestions from academics.

    In my view, substituting open access for closed access journals is a necessary but not sufficient change to our system of academic communication. We need to recognize new modes of publication and dissemination of knowledge that are relevant beyond the academy, and we need to formalize credibility in this new, broader context. That would be truly radical.

    (via Neuroanthropology)

  • Should science reporting have a standardized checklist?

    Wed, 2012-01-04 10:08 -- John Hawks

    An interesting read this morning from Fiona Fox, chief executive of Britain's Science Media Centre: "What If There Were Rules for Science Journalism?"

    She proposes a "checklist" for science reporting, which sounds to me a bit like the "Nutrition Facts" that the government puts on a box of cereal.

    A checklist would look something like the following. Every story on new research should include the sample size and highlight where it may be too small to draw general conclusions. Any increase in risk should be reported in absolute terms as well as percentages: For example, a "50 percent increase" in risk or a "doubling" of risk could merely mean an increase from 1 in 1,000 to 1.5 or 2 in 1,000. A story about medical research should provide a realistic time frame for the work's translation into a treatment or cure. It should emphasize what stage findings are at: If it is a small study in mice, it is just the beginning; if it's a huge clinical trial involving thousands of people, it is more significant. Stories about shocking findings should include the wider context: The first study to find something unusual is inevitably very preliminary; the 50th study to show the same thing may be justifiably alarming. Articles should mention where the story has come from: a conference lecture, an interview with a scientist, or a study in a peer-reviewed journal, for example.

    I think these are good recommendations for health reporting. An awful lot of people have adopted diet recommendations that at best can lower disease risk by a small fraction. Meanwhile, many continue smoking despite much larger and repeatedly demonstrated risks. Science and health reporting have not historically helped people to understand relative risks, and they do a poor job of informing people how scientific conclusions are produced. This lack of transparency has enabled a large niche for "health advisors" who are essentially quacks. People are poorly informed about how to distinguish quack advice from science.

    Nevertheless, I think some of Fox's recommendations verge on censorship -- their aim is to stop the public from being misdirected to unreliable findings, but the solutions are all oriented toward stopping the reporting of unreliable findings. I would prefer to see a change in emphasis away from reporting findings and toward reporting process. Scientists trust science without trusting every result, because they understand the process of science. The public will be better informed about scientific results when they see the process in action. A sharp reporter should not only attend to the immediate result of a study but the process underway to test and possibly reject today's findings.

  • “He had a sufficiently high opinion of himself"

    Tue, 2012-01-03 23:20 -- John Hawks

    Gina Kolata profiles Eric Lander, director of Harvard and MIT's Broad Institute and advisor to President Obama, in the New York Times. It's a good read for those interested in the recent history of genetics, and where it may be going from the perspective of one of the largest sequencing centers.

    I also learned a lot from the descriptions of Lander in Jamie Shreeve's recent book, The Genome War: How Craig Venter Tried to Capture the Code of Life and Save the World. I really enjoyed the book, and if I have time I'll do a full review.

  • Is Nature Genetics something more than the GWAS Catalog?

    Tue, 2012-01-03 23:03 -- John Hawks

    I always look through the table of contents of Nature Genetics, which I have delivered to my inbox. Over the last couple of years, the journal has included a high fraction of papers that are either original genome-wide association studies or meta-analyses of multiple studies. These are substantial studies that have dozens of authors, on conditions of broad interest -- for example, this month there is a meta-analysis paper about type 2 diabetes. So I have no criticism of the journal, these studies need to be published somewhere.

    But others might be impatient with this course of research. The studies are formulaic: put together a large set of cases and controls, run them across a genotyping chip, and report the results. In the current issue, the journal's editorial board enters an op/ed suggesting that the current situation will not continue forever, because GWAS studies just aren't that interesting anymore [1]:

    Which Mendelian variants produce results suitable for publication in the journal? Our general principles are and have always been to select papers for review by the amount of new data and new ideas and the resource value contained within. Papers must meet current field-specific standards set by our latest benchmark papers and referee advice. Finally, we consider the value of the paper as a research tool, prioritizing those that will motivate larger numbers of scientists to do their research differently as a consequence. In principle it should be possible to find a phenotype for each of the tens of thousands of genetic elements in the human genome, but not all such results will be equally informative. However, if, say, 50 other labs will drop everything and instead use the results of your work, that paper is certainly suitable for this journal!

    Well, there you go. The editorial also addresses pedigree research, stating that new identifications of Mendelian disorders in single families will not be sent for review.

    I think this all is appropriate, it's just interesting that research has advanced to the point that finding a genetic cause for a disorder is no longer a sufficient reason for publication. If you look through the GWAS Catalog, you find study after study published in Nature Genetics. Those days are probably numbered.


    References

    1. Anonymous. Full spectrum genetics. Nature genetics. 2011;44(1):1.
  • Schools of fish, schools of thought

    Thu, 2011-12-22 00:17 -- John Hawks

    Kate Shaw enters a report in the science section of Wired on a paper that modeled decision-making in animal groups: "How ignorance could improve group decisions." The paper itself by Iain Couzin and colleagues was in Science [1]. Jevin West and Carl Bergstrom authored a companion perspective piece in Science explicating some of the paper's findings. Here's the paper's abstract:

    Conflicting interests among group members are common when making collective decisions, yet failure to achieve consensus can be costly. Under these circumstances individuals may be susceptible to manipulation by a strongly opinionated, or extremist, minority. It has previously been argued, for humans and animals, that social groups containing individuals who are uninformed, or exhibit weak preferences, are particularly vulnerable to such manipulative agents. Here, we use theory and experiment to demonstrate that, for a wide range of conditions, a strongly opinionated minority can dictate group choice, but the presence of uninformed individuals spontaneously inhibits this process, returning control to the numerical majority. Our results emphasize the role of uninformed individuals in achieving democratic consensus amid internal group conflict and informational constraints.

    In other words, they have generated an agent-based model where each individual may have a marginal effect on group behavior relative to an "intensity" parameter, and the group's decision is dictated by the collective center of gravity. "Uninformed" in the title is a misnomer; the study examines what happens to the group as individuals with little or no bias (that is, low "intensity") are added to the group. Adding a high fraction of such individuals tends to reduce the influence of a small minority of individuals with an intense bias. In other words, the addition of low-bias individuals in the model skews the group decision in favor of a low-intensity majority.

    Not surprising; the model generates the results expected given the assumptions. The study's most interesting aspect is its application of the model to a problem of schooling in fish.

    So why do so many of the press accounts of this study claim that it shows that "ignorance" can support democratic decisions? The "uninformed" individuals in the model lack information only in the sense that they lack a strong preference. In no other sense does information come into the model, other than as a measure of decision bias. This is "information" more or less in the sense that someone can "ask the audience" on "Who Wants to Be a Millionaire?" The contestant basically has to hope that a minority of the audience knows the correct answer, while the majority has no bias toward a false answer.

    I've been thinking a lot about modern human origins lately, as I'm finishing up an e-book project. For many years I have worked very closely with the details in this area, really pushing up my sleeves and getting into both the genetic and morphological aspects of the problem. What always struck me was how the "scientific consensus" emerged in favor of an Out-of-Africa replacement hypothesis, and against the multiregional evolution hypothesis. The protagonists on either side of the modern human origins controversy didn't really change very much over time. There were few high-profile "defections" from one side to the other. Initially, many of the best-known paleoanthropologists and geneticists sat on the fence. After all, few people work so directly with the evidence of modern human origins, when there is a debate it is reasonable to be cautious. But over a decade, like wildebeest waiting to jump into the crocodile-infested river, paleoanthropologists and geneticists whose work has little to do with modern human origins began to tip in favor of the replacement scenario.

    They jumped too soon.

    But why? I've been tackling this question: How did the majority of paleoanthropologists and human geneticists get this one so wrong?

    I feel so fortunate to have been engaged in this problem, because it says so much about the process of science. Science is always a process where progress requires an opinionated minority to recruit support among peers who are not specialists in the same area. Such a minority may forge consensus through consistent and repeated demonstration of facts. More likely -- as in the case of modern human origins, where new evidence was often equivocal -- a motivated minority will apply a broader range of rhetorical strategies. Over the years, I saw people pull out every trick in the book to persuade the uncommitted to their point of view.

    Rapid visual processing allows schooling fish to signal and comprehend the direction of movement within milliseconds. Scientists signal each other through their publications, grant reviews, and the press. It takes a lot longer for scientists to school.


    References

    Synopsis: 
    A paper on decision making in groups of animals prompts me to think about science.
  • "Transformative" research can't come from milquetoast

    Sat, 2011-12-10 15:25 -- John Hawks

    Philip Ball writes in The Guardian about another new initiative from NSF to fund "potentially transformative" research ("Science funding tends to favor mediocrity over grand ideas".

    He begins his essay with this:

    The kind of idle pastime that might amuse physicists is to imagine drafting Einstein's grant applications in 1905. "I propose to investigate the idea that light travels in little bits," one might say. "I will explore the possibility that time slows down as things speed up," goes another. Imagine what comments these would have elicited from reviewers for the German Science Funding Agency, had such a thing existed. Instead, Einstein just did the work anyway while drawing his wages as a technical expert third-class at the Bern patent office. And that is how he invented quantum physics and relativity.

    The moral seems to be that really innovative ideas don't get funded – that the system is set up to exclude them.

    The system is set up to exclude really innovative ideas. But Einstein is a really misleading example.

    For one thing, Einstein didn't need much grant funding for his research. Yes, if somebody had given the poor guy a postdoc, he might have had an easier time being productive in physics. But his theoretical work didn't need expensive lab equipment, RA and postdoc salaries, and institutional overhead to fund secretarial support, building maintenance and research opportunities for undergraduates.

    It is a better question whether we would have wanted Einstein to spend 1905 applying for grants instead of publishing. But even this is terribly misleading. Most scientists who are denied grants are not Einstein. Most ideas that appear to be transformative in the end turn out to be bunk. Someone who compares himself to Einstein is overwhelmingly likely to be a charlatan. There should probably be a "No Einsteins need apply" clause in every federal grant program.

    Setting aside the misleading Einstein comparison, our current grant system still has some severe problems. Is it selecting against "transformative" research, or big breakthroughs? I would put the problem differently. "Transformative" is in the eye of the beholder. Our grant system does what it has been designed for: it picks winners and losers, with a minimum of accountability for the people who set funding priorities.

    We might be perfectly happy if the winners were scientists who all go on to make important breakthroughs. But in reality our system picks winners in a way that often selects against creativity and significance, and selects for established networks of institutions and senior scientists, and above all "grantsmanship". What is the difference between a 35-year-old assistant professor who becomes the manager of 1.5 million dollars of federal money, and his 37-year-old colleague who has been denied twice for the same grant before applying for tenure? In a system where fewer than 20% of grants are funded, the difference may be luck. The "transformative" value of either person's ideas hardly comes into this calculation. Yet this is the system we are currently using to staff the next generation of senior science positions.

    Anyone who has submitted grant applications in multiple years can see this in action. The reviews in one year often completely contradict those of the previous year, even for the same project. An application's chances of being funded are based on the luck of who reviews the application and who is in the room. The way to ensure bad luck is to be an outlier. I have applied for federal grants several times, and have often had strong reviews but have never been funded. A good application can take weeks of effort to prepare, as much as a research paper. Yet only a small fraction of applications are funded. For me, each time has been a costly training for writing the next unsuccessful grant application.

    I don't want there to be a pool of money set aside for "high-risk" or "transformative" work. I want the agencies to set transformative objectives and to fund projects in accordance with them. If my scientific objectives don't match those of the agency for my research area, I want to know that so I don't spend time on fruitless applications. If a grant agency has milquetoast objectives, I want a transparent process by which ordinary scientists can participate in changing those objectives.

    Anthropology has its own unique grantwriting challenges, and we can't easily generalize across fields. Some of the very large grants available from NSF for human evolution research are strongly interdisciplinary, and much of the budget of funded projects is spent outside of anthropology. "Transformative" research in this context may be conceived as the collaboration of scientists from different areas, even when the results themselves may be quite conventional. In my opinion, funding productive field projects is the most effective use of federal money in paleoanthropology. Every new fossil discovery might be the one that transforms our understanding of the important events in human evolution. But those field projects may actually inhibit "transformative" research if they do not make their results available to other scientists in a timely manner, if they do not openly archive and document their activities in the open, and if they do not contribute to the education of future scientists.

    Many have pointed to the problem of transitional funding for early career scientists. Few have noticed that this is a decay curve. As researchers get older, some of them get their first grant, but many others stop trying. Personally, I'm still quite a ways from the age when the average grant recipient receives his first federal grant. But I'm very glad that I've chosen a research area where I can do great research without that federal money.

    Synopsis: 
    Grant agencies should set transformative objectives, not set aside money for transformative research
  • XMRV saga develops

    Mon, 2011-12-05 23:01 -- John Hawks

    John Timmer's reporting on the rise and fall of the hypothesis that XMRV causes chronic fatigue syndrome is the best I've seen so far on the topic: "How a Collapsing Scientific Hypothesis Ended in an Arrest"

    Something worth reminding: basic analysis of lab samples is often based on evolutionary theory:

    The key piece of evidence came in an evolutionary analysis of XMRV origins. Researchers found that the most diverse group of XMRV sequences come from a single prostate cancer cell line called 22Rv1 that was grown in lab dishes. All of the XMRV sequences isolated from patients clustered within the evolutionary tree derived from the cancer cell line, meaning the ancestors of the viruses supposedly found in patients had all come from a single lab-grown cancer cell line. The clear implication is that the sequences came from the cell lines rather than patients.

    There is much to say about the errors, retractions, and fraud involved in the story. I'll just point out that some of the key revelations of fraudulent analysis came from the blog erv, whose author Abbie Smith first reported that figures from a Science paper had been used in conference talks, relabeled with entirely different contexts added.

  • Value within, not unto

    Tue, 2011-11-22 07:58 -- John Hawks

    Anthropologies continues to publish some provocative essays. This month's edition focuses on anthropologists working in Appalachia. One of the themes of my essay, "What's wrong with anthropology?" is how we must produce work that is valuable to the communities that enable our research. Sarah Raskin's essay in this month's edition shows that work needs to address concerns within communities, not only about communities: "When the stereotype is the research topic".

    “I was thinking about what you said yesterday,” the woman said to me, “and I have some feedback. I wish you’d have said some good things about Appalachia. Our sense of family, our perseverance through adversity, our generosity and care for community, our faith in God. I don’t know why everyone always focuses on the negative stuff. Why doesn’t anyone talk about the positives? My mother passed her knowledge of the land and handcrafts to me. I haven’t lived there for decades but I still remember that. I’m still proud of it. I don’t know why people don’t talk about that. Why you didn’t talk about that. Unless you didn’t see these things in your research in which case, of course, I understand.”

    Raskin's research concerns health disparities and in particular access to dental care among rural people in southwest Virginia. As she explains in this case, the stereotype contributes to keeping people out of clinics. And yet, an anthropologist who describes people as disempowered, or who describes only social negatives and stereotypes, may do little to make her work valuable within a community.

  • "False-positive psychology"

    Fri, 2011-11-11 21:05 -- John Hawks

    Razib Khan conveys a list of suggestions from a recent paper by Joseph Simmons and colleagues [1], concerned with reducing reporting biases in research papers. The article is directed toward psychology research, but many of the observations hold true in paleoanthropology and genetics.

    The central point is that every research paper is a product of a course of inquiry that may come to include many kinds of questions, most of which are unanswered or answered negatively by results and data. When scientists report results, they focus on those that meet some statistical threshold. The threshold ostensibly makes results "significant" but the actual probability of seeing such a result depends on how many things the scientists looked at, not only on those they choose to report:

    In this article, we show that despite the nominal endorsement of a maximum false-positive rate of 5% (i.e., p ≤ .05), current standards for disclosing details of data collection and analyses make false positives vastly more likely. In fact, it is unacceptably easy to publish “statistically significant” evidence consistent with any hypothesis.

    The culprit is a construct we refer to as researcher degrees of freedom. In the course of collecting and analyzing data, researchers have many decisions to make: Should more data be collected? Should some observations be excluded? Which conditions should be combined and which ones compared? Which control variables should be considered? Should specific measures be combined or transformed or both?

    It is rare, and sometimes impractical, for researchers to make all these decisions beforehand. Rather, it is common (and accepted practice) for researchers to explore various analytic alternatives, to search for a combination that yields “statistical significance,” and to then report only what “worked.” The problem, of course, is that the likelihood of at least one (of many) analyses producing a falsely positive finding at the 5% level is necessarily greater than 5%.

    "Researcher degrees of freedom" sounds erudite, but all they're really describing is tinkering. When the data lead you to a result, they do so by leading you along a drunkard's path of new analytical biases.

    Razib has presented the authors' suggestions for researchers and reviewers, to try to reduce the tinkering bias. I think if we followed those suggestions in paleoanthropology, our discipline would be stronger in some ways, weaker in others. For example, the authors suggest rejecting any paper with fewer than 20 observations in a cell of a test of association. Clearly, if we rigidly enforced such a rule, we'd have a lot more work done on comparative collections, and that would be a good thing. On the other hand, we'd have a lot more papers like the ones I like to write, about how the data are insufficient to test a hypothesis. Our science would shift even further toward description, which would benefit some kinds of research and punish others.

    One may object that there are many cases in paleoanthropology where a single observation is fundamentally important. I would just point out that such cases are most evident where the single observation is a many-sigma outlier to some pre-existing hypothesis. If we have a new radiocarbon date that's a three-sigma outlier above previous dates, it will either cause us to change our hypothesis or challenge the date's accuracy. There are biases even so -- for example, when we find outlier radiocarbon dates on otherwise-uncontroversial things, we tend to just ignore the outliers.

    What I most liked about this paper was that the authors anticipated various objections. For example, many researchers would claim that a Bayesian statistical approach would eliminate or reduce the bias from "researcher degrees of freedom". Here's the authors' response:

    Although the Bayesian approach has many virtues, it actually increases researcher degrees of freedom. First, it offers a new set of analyses (in addition to all frequentist ones) that authors could flexibly try out on their data. Second, Bayesian statistics require making additional judgments (e.g., the prior distribution) on a case-by-case basis, providing yet more researcher degrees of freedom.

    That's my observation as well. Researchers adopt Bayesian methods for more ways to tinker. I also appreciate this comment:

    We are strongly supportive of all journals requiring authors to make their original materials and data publicly available. However, this is not likely to address the problem of interest, as this policy would impose too high a cost on readers and reviewers to examine, in real time, the credibility of a particular claim. Readers should not need to download data, load it into their statistical packages, and start running analyses to learn the importance of controlling for father’s age; nor should they need to read pages of additional materials to learn that the researchers simply dropped the “Hot Potato” condition.

    Furthermore, if a journal allows the redaction of a condition from the report, for example, it would presumably also allow its redaction from the raw data and “original” materials, making the entire transparency effort futile.

    All in all, the article is a good reminder of Feynman's first principle, "You must not fool yourself, and you are the easiest person to fool."


    References

    Synopsis: 
    Three psychologists look for solutions to researcher biases in publishing results.
  • How many scholars are copyright pirates of their own work?

    Fri, 2011-11-11 10:00 -- John Hawks

    Ryan Anderson has been interviewing anthropologist Jason Baird Jackson about open access publication ("Anthropology & Open Access: An Interview with Jason Baird Jackson (Part 2 of 3)". I like his description of the coordinated action problem of moving to open access models of publication.

    Most of us do not understand journal business models or how it is that librarians have made so much (expensive) information so easily available to those of us with the luxury of university affiliations. In the face of much confusion and anxiety, just sending our manuscripts to the editors and journals that we know in the way that we have always done has seemed sensible and prudent.

    Related is the situation in which we perceive that we understand the changing landscape better than we do. A clear instance is when we post the final published versions of our writings online because we wrongly believe ourselves to have the right to do so. The increasing prevalence of such accidental piracy fosters the misunderstanding that such practices are the right way to do open access. Such piracy is counter-productive on many levels and is unnecessary given that there are legal and technically better ways to pursue OA.

    I don't make reprints available on my website when I don't have the copyright permission to do so.

    The interview has been wide-ranging so far and this installment discusses the problem of scholarly societies in the open access era. In the old days, societies supported their journals with high member dues, and often required a paper journal subscription with membership. Many societies still do so. Today, there's no reason to ship paper journals to the vast majority of society members. Societies claim that the cost of preparing content for publication is still high, but high-cost pre-publication processing of submissions is transparently unnecessary, considering the number of open access journals run by small societies at relatively low cost, using open access tools.

Pages

Subscribe to metascience

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.