john hawks weblog

paleoanthropology, genetics and evolution

politics

  • Public interests in data from federally funded research

    Thu, 2012-01-12 20:20 -- John Hawks

    I submitted the following essay in response to the Request for Information on Public Access to Digital Data Resulting from Federally Funded Research from the National Science and Technology Council's Interagency Working Group on Digital Data.

    This RFI is not the same as the current bill before Congress ("Open access op/ed in NY Times"), which would restrict public access to research articles based on federally funded research. Research articles are a very important issue, but I hope that the access to digital data will not be overshadowed by the attention to published results. As a paleoanthropologist, I believe that access to digital data from federally funded research projects is a fundamentally important issue, as I remark below.

    Introduction

    The United States provides grant funding to scientists through many federal programs. This funding advances work of public interest that might not happen without federal assistance.

    The creation of scientific knowledge may serve the public interest directly by enabling useful inventions or supplying actionable information on issues of public importance. A funded project may also serve the public interest indirectly, by (1) finding negative results that prevent wasted effort or public harm; (2) building the scientific infrastructure that enables future discoveries and advances; (3) training new and established scientists in effective research techniques; (4) enhancing international cooperation and public/private partnerships.

    Congress and the Executive Branch have recognized that access to the published results of scientific research is not sufficient to advance the direct and indirect public interests served by federally funded projects. Facilitating the indirect benefits of research is a major aim of federal agencies' "Broader Impacts" and data access rules. These policies have been a qualified success since their implementation, limited mainly by the exceptions carved out by programs and agencies to avoid requiring certain kinds of data to be reported along with research reports.

    I argue that open public access to digital data should be a requirement for all federally funded scientific research. Digital data can be maintained by federal agencies as a part of the reporting requirement of federal grant funding. Doing so will advance the interest of the public and ensure that today's science generates a continuing heritage of research excellence.

    Data access and transparency

    Transparency is essential to public trust. Scientific conclusions are formed by observation and replication, and for this process to be transparent, all data must be available for independent inspection. The possibility of such inspection should not be limited to qualified researchers, because the very existence of special access requirements blocks transparency of the scientific process.

    Changing technology has shifted the public's expectations about transparency. Digital technology enables most research data to be shared rapidly and at low cost. If data are produced in digital form, and digital data can be shared at low cost, researchers and agencies cannot credibly claim that the difficulty of reproducing and disseminating data is a sufficient reason to restrict access. Where no competing interest argues for restricted access (such as human subjects protections), a lack of access to digital data itself can now be a compelling reason for public distrust.

    Therefore, federally funded researchers should release digital data to the public by default. Federal agencies should facilitate this public reporting by requiring digital data to be supplied as part of final project reporting.

    Data access has a well-established record of success

    The recent history of human genetics demonstrates that open access to data has unforeseen benefits that can spawn innovation, support more effective education, and catalyze new discovery. In genetics, both federal and journal policies require release of data; raw data from federally funded projects are often available as they are generated, long before publication.

    My own laboratory has no federal research funding to date, but is actively engaged in research using data from federally funded projects. Today my laboratory trains undergraduate students in genetics with new data from ongoing federally funded genetic projects such as the 1000 Genomes Project. We use open access data from archaic human genomes to investigate the variation of ancient people and their relationships to living humans. This kind of work would be impractical without clearly established open data access policy.

    The open access to data from the Human Genome Project facilitated the rapid development of microarrays that are now used on a broad scale in human genetics to investigate the genetic correlates of human health and disease. Access to data from these studies has enabled other scientists to independently replicate many genetic associations. More important, meta-analysis of such data has shown that many associations cannot be replicated, while also showing some cases in which nonsignificant results across different samples give rise to a significant finding when pooling those samples. Access to negative results and raw data is necessary, in other words, to establish the facts in subsequent research. This goes beyond access to published research results and requires open access to unpublished digital data.

    Intellectual property protections and data access

    Research data are somewhat distinct from the intellectual property issues relating to research publications. Some kinds of data do not meet the standard of originality necessary for copyright protection, such as sequence data, CT or MRI data, or data from measurement instruments. For raw data from instruments, there is no intellectual property reason why federal agency should not maintain an open archive for the public.

    Much research data is unquestionably subject to copyright protection, such as lab notebooks, written descriptions, photographs, and original reconstructions. Yet there is still a substantial public and scientific interest in inspecting such data. For example, photographic documentation of archaeological sites and specimens are of particular scientific value and are today routinely produced by digital technologies and stored in digital form. Some primary digital records are unique products that cannot be recreated at another time and place: for example, in situ photographs of specimens, photographs and records of sites before excavation, and digital reconstructions. The scientific record would be incomplete without such contributions, and maintaining an archive of such data over the long term is a difficult task for a single investigator, beyond the scope of a grant term.

    In cases where it is impracticable to obtain Creative Commons or other open licenses to such content, a funding agency should at a minimum require that a copy of all such archival information be deposited along with the final project report and a limited-use non-commercial license permitting electronic dissemination of these materials to the public as part of the report.

    Metadata and data access

    Many have noted that raw data may be useless in the absence of additional information about how the data were obtained. Such information is known as "metadata". Researchers generate instrumental data using particular instrument settings and recording standards. They gather observational data under particular research protocols. These standards are may change quickly as instrumentation, technology, and scientific results themselves demand new practices.

    Some scientists note the problem of incompatible metadata, using it as an argument against to delay the establishment of open public access to data. In their view, the public are likely to misunderstand or misuse scientific data where metadata are not clearly indicated. Meta-analyses combining data from multiple research projects are an important secondary use of digital data, and such meta-analyses are impossible when data cannot be reconciled into common observational or instrumental frameworks. Performing original work with data collected in heterogeneous contexts is a research speciality of its own, and is itself sometimes targeted by federal grants.

    However, meta-analysis is only one purpose of data access. Transparency, replicability, and education are central public interests that do not require the reconciliation of data collection methods from multiple studies. They require only clear description of the methods under which data were obtained. At a minimum, final research reports on federally funded projects must describe the standards of data collection with sufficient detail to allow independent replication, including all unpublished results and data.

    Successes of data access in paleoanthropology

    I am an anthropologist, and am most familiar with the scientific data relating to human evolution. These data include genetic observations on living and skeletal samples of humans. They also include fossil and archaeological evidence such as photographs, CT scans, isotopic records, anatomical measurements and descriptions.

    For many years, nearly all genetic data resulting from federally funded research have been made available for public download. Much genetic data generated by non-federally funded research programs, including foreign and domestic institutes, has also been free for public download. These data have resulted in a massive acceleration of research on recent human evolution and human origins. They have also led to unexpected discoveries and a burgeoning contribution of other disciplines to understanding our evolution.

    Data from radiocarbon dating and other isotopic sampling has also been made available to the public. Human occupation sites are among the best sources of evidence about past climates. The investment of federal resources in human evolution research has generated a temporal record that is now essential to studying changes in the faunal and plant compositions of past environments. Free access to records has enabled stronger calibration of radiocarbon dates, the development of a more secure chronology, and a more highly replicable scientific record correlating different regions of the world. Our understanding of such events changes is vastly stronger when data are made public.

    Institutions and data access in paleoanthropology

    By contrast, CT scans and photographs pertaining to human origins are typically not made freely accessible to the public. The United States funding agencies are not the only parties with an interest in such data. In particular, museums and institutes that curate specimens often permit data collection under agreements that restrict the dissemination of the resulting data. Such agreements may be equated to "non-disclosure agreements" with respect to scientific data.

    An institution has a legitimate interest in controlling the public use of images and access to curated materials. Nevertheless, the lack of access to digital data results in reduplication of effort, overapplication of destructive sampling and measurement techniques, and unnecessary handling of precious and fragile specimens. Where it is practical, the United States should facilitate agreements with institutions that allow the release of digital data produced by public funding. Where release is not possible, funding should be granted only for those activities that will result in the release of data under a limited-use non-commercial license. Non-disclosure of data from instruments such as CT scanners, electron microscopes, or mass spectrometers is incompatible with scientific replication.

    Scientific careers and data access in paleoanthropology

    The economy of federal funding for scientific production sometimes leads to perverse incentives for high-ranking researchers that prevent public access to research data. Some scientists believe that their own future research will require exclusive access to data. Others want to impede research achievements by their academic rivals, or to maintain prestige and future funding opportunities.

    Scientific data in some areas may constitute "trade secrets" until they are protected by patents. Even in noncommercial research, federally funded scientists sometimes claim exclusive ownership over data that they plan to use in future research. In my own field of paleoanthropology, data secrecy supports a clandestine "quid pro quo" economy among researchers, in which established researchers and institutions allow furtive looks at unpublished data, to support and consolidate their power and influence.

    This is a game that the United States should simply decline to play. When federal research supports scientific results that are not subject to independent replication, it betrays the public interest in science.

    Established collaborations and centers of scientific research will always exert a strong influence upon the future of science, irrespective of federal data access policies. But established players should not use federal funding to construct barriers to open inquiry.

    Conclusion

    Open public access to data is one indication that a research project is following scientific principles. Making digital data available to the public would be good practice for any researcher, irrespective of funding source. Data access mitigates the risk that negative data will be unreported. Data access facilitates broader stewardship of research projects, in particular where collaborations create data that are distributed across many institutions. Data access and reporting standards enable other researchers to fill in for those who cannot complete scientific project due to health or other personal reasons.

    Federal grant agencies already have successful repositories for many kinds of digital data. Such data are shared with the public at minimal cost relative to the overall budget for federal research grants. Supporting digital data repositories has itself been an important granting aim for several federal agencies and continues to be an active part of scientific infrastructure. Limiting such repositories for the exclusive use of a small cadre of researchers is enormously wasteful of resources, when they can be opened to an interested public for a small incremental cost.

    The public has repeatedly invented surprising uses for digital data that can complement or enhance the scientific record. But much more important, open access to digital data serves the scientific values of transparency and independent replication, essential to maintaining public trust and investment in the research enterprise.

    Synopsis: 
    My response to a federal Request for Information on the topic of digital data access to federally funded research
  • Open access op/ed in NY Times

    Tue, 2012-01-10 23:43 -- John Hawks

    Molecular biologist Michael Eisen, writing in the New York Times: "Research bought, then paid for".

    THROUGH the National Institutes of Health, American taxpayers have long supported research directed at understanding and treating human disease. Since 2009, the results of that research have been available free of charge on the National Library of Medicine’s Web site, allowing the public (patients and physicians, students and teachers) to read about the discoveries their tax dollars paid for.

    But a bill introduced in the House of Representatives last month threatens to cripple this site. The Research Works Act would forbid the N.I.H. to require, as it now does, that its grantees provide copies of the papers they publish in peer-reviewed journals to the library.

    Three years ago, a similar bill was introduced into Congress and did not proceed into law ("Congress to repeal open access science provisions?"). Today's NIH repository and the data access provisions of NSF grants were established by acts of Congress in the late 1990s. In my opinion, the agencies have in many areas gotten away with the bare minimum of compliance with these regulations. Worse, far from strengthening open access to publications and data, some in Congress want to reverse them. The current effort owes much to lobbying by academic publishers, and large campaign donations from officers and employees of those publishers to key Congressmen. Eisen shares more information on his blog ( "Elsevier-funded NY Congresswoman Carolyn Maloney Wants to Deny Americans Access to Taxpayer Funded Research").

    Again, public comment on access to federally funded research ends this Thursday, January 12.

  • Florida: Anthropologists not wanted

    Tue, 2011-10-11 21:26 -- John Hawks

    Last week I linked to my essay, "What's wrong with anthropology?" My theme was that anthropology has been a failure over the past two decades at engaging with policymakers and the public, and that the field can only look forward to decline unless we take immediate action to improve this situation.

    Well...today the governor of Florida, Rick Scott, gave a convincing proof of my thesis on a radio program:

    We don’t need a lot more anthropologists in the state. It’s a great degree if people want to get it, but we don’t need them here. I want to spend our dollars giving people science, technology, engineering, and math degrees. That’s what our kids need to focus all their time and attention on, those types of degrees, so when they get out of school, they can get a job.

    And in the Herald-Tribune:

    “I got accused of not liking anthropologists the other day,” Scott said. “But just think about it, how many more jobs do think there are for anthropologists in the state?

    “Do you want to use your tax dollars to educate more people who can’t get jobs in anthropology? I don’t. I want to make sure that we spend our dollars where people can get jobs when they get out.”

    Daniel Lende has a roundup of stories and responses by anthropologists. It's very difficult to come up with a rapid and effective reply from an organization or department, so I understand these aren't as punchy as they might be. Still, it seems to me a vastly more effective response would describe the economic impact of anthropologists in Florida, the dollar amounts of federal and private grants they bring to Florida universities, their role as custodians of natural and cultural history, and their history of engagement with indigenous and immigrant peoples in the state.

    Oh, and the major associations could mention that the state will not be considered for national meetings. The AAA meeting in particular drives millions of dollars of direct and indirect revenue to its host city.

    Florida anthropologists have a great opportunity moving forward to get attention for their work in public engagement. The attention of the press will never be directed as closely to the value of anthropology within the state.

    UPDATE (2011-10-12): According to the AP and Tampa Bay Online, Governor Scott's daughter took a degree in anthropology. Let me just say, that reinforces the message. We can't even communicate the importance of our field to the parents of our students!

  • Engaging with the public

    Wed, 2011-04-13 20:30 -- John Hawks

    Alice Bell raises an essential question: "What’s this public ‘engagement’ with science thing then?"

    I’m similarly sceptical about lumping this whole ‘science’ thing together (and in particular, lumping together ‘scientists). Science is big and complex, its ideas about itself vary and change over time. Maybe it should be pluralised to sciences, like publics. Or again, maybe we could just talk about specific people, ideas and approaches. Leave loose talk about ‘science’ to philosophers and advertising executives, and instead focus on sharing what you have particular expertise in, be honest about what you don’t know and think about all the new things you might learn from engaging in a bit of broader discussion about your work.

    Like Bell, I favor much more specificity about the "public" we're addressing. I like "people" much better than "publics" plural, because the effective point of contact is the individual, not the committee. People have many different goals in their interactions and experiences with science. When I bring Sophie to the local planetarium for a show, neither she nor I is the "public". We are people with a pre-existing relationship, looking to deepen that by engaging with the particulars of a science both of us have some knowledge about. Other people have their own goals and experiences -- many of them intent on avoiding science. No form of engagement can bring together all these people without addressing their distinct goals and interests.

    On that note, I very much like Bell's final suggestions -- particularly being receptive to serendipity:

    Don’t be silly about ‘the public’. Remember: knowing your audience and targeting specific groups can be very powerful, but so can the serendipitous connections made by packaging your work as accessibly as audience as possible.

    ...

    There is nothing wrong with a bit of ambition, but be realistic. This means keeping in mind the limitations of your project, including pragmatic concerns like money, time, your professional image and the weather. You are unlikely to change the world. You may not even change any minds, but that doesn’t mean it’s not worthwhile, you may well have helped move towards a bit of world/ mind changing. These things take time. None of them are easy.

    I would add one thing. This final point may sound a little nihilistic. I mean, if you can't change minds, why even bother?

    But at an early career stage, very few people have the moxie to change minds. The point of engagement is to become a better scientist. Like all things, it takes practice to master. It may take many failed efforts to arrive at success.

    You are only a reed. But you are a thinking reed.

  • Science Pub, day of creationism

    Tue, 2011-03-29 18:52 -- John Hawks

    I had a wonderful afternoon Sunday at the Madison Science Pub. The featured guest was Ron Numbers, the historian of science at UW-Madison whose research has focused on the origins and history of the creationism movement in the U.S. and worldwide. Ron drew a crowd well over 50 people -- I didn't count, but the large salon at Brocach was packed high and tight.

    The conversation was a rollicking exchange -- Ron told a story about his meeting with Turkish creationist guru Harun Yahya, gave some insight on the very earliest origins of the creationism movement, and shared his good humor. He reminded us that the fraction of Americans who claim belief that humans were created within the last 10,000 years has basically remained unchanged above 40 percent for three decades.

    Attendees came from every walk of life and many had their own stories. My favorite was the inside view of the home-schooling movement, with some groups banning publishers that print science curricula, others banning prominent creationists.

    Invariably, when a group of well-thinking people get together and start talking about creationism, the discussion drifts toward speculation about why folks would turn to creationism -- especially young Earth creationism -- when it means they must reject the most basic principles of almost every branch of natural science. In a crowd of people there will always be several who turn to stupidity as an explanation for different beliefs. Very few things irritate me more than witnessing people's biases coming out in this way. Yet there were many at the Pub who had close and direct experience working with and confronting creationism, and I think we gave some needed perspective.

    Science Pub organizer Skip Evans, speaking from experience at the NCSE and as an organizer of the Wisconsin Citizens for Science, noted that most students who resist the idea of evolution are actually driven by convictions about what will happen to them after they die. Many perceive that religious doctrine about eternal life and personal salvation can be maintained only if other literalist aspects of religion are accepted without question.

    Explaining the history and diversity of life is simply not an issue of great concern to most people except as a marker of belief system. On that score, many "evolution believers" have knowledge that is just as shallow as creationists. They simply nod and smile in response to different cues. Professing a belief in evolution or creation is a not-so-secret handshake that signals membership in a loose clan. That's why the press is so insistent that presidential candidates take some position on the issue; it marks them like a scarlet letter.

    Some committed creationists are simply ignorant of biology -- not stupid, but unschooled in the facts. These can be foiled, and sometimes even persuaded, with a few simple, widely-known examples.

    But many are well practiced in the art of debate and will not easily play into your hands. They will have taken your measure and they know the ground well. The stakes are higher for the creationist, souls hanging in the balance. Wrestling with skunks, you'd be a fool to think you'll keep the stink off.

    As with most things, becoming skilled at advocating for evolution requires much practice. When it comes to debate, many trained students of science are not merely wet behind the ears, they are still tadpoles breathing with gills. Producing simple, effective examples of evolution does not come easily to those untutored in the skills of rhetoric. Yet few things serve a teacher so well as a handful of two-minute examples, told with some style. Saying something credibly means saying it easily and self-evidently, in terms that are familiar to the audience.

    This is the essential skill for every kind of science communication.

  • I'm a genetic libertarian

    Thu, 2011-03-10 13:53 -- John Hawks

    Much news coming out of the FDA public meeting on direct-to-consumer (DTC) genetics. Dan Vorhaus was at the proceedings and reports on them ("Looking Ahead After the FDA’s DTC Meeting").

    I believe that I have a fundamental right to my own biological information. What I mean is that, if anybody has biological information about me, I should be able to access and use it. Additionally, I think it is immoral for anyone to charge me excessive rates to access my own information. So that's where I'm coming from. I'm a genetic libertarian.

    In the current proceedings, two issues have arisen of some interest. The first is a relative sideshow but has for good reasons absorbed much attention. Some public figures have adopted a deliberate strategy to portray DTC genomics companies, such as 23andMe, as parasites on the human genetic research otherwise conducted by academics and pharmaceutical companies. This assertion is obviously false -- some DTC companies now involve thousands of participants in active research projects.

    But more immediately important, there's a video that effectively shows the "big lie" -- Congressmen and an FDA official claiming that no research is done by DTC companies only two days after that very official participated in a meeting that highlighted this kind of research! It is incredible, and widely linked (see, for example, Razib's post or Joe Pickrell's post).

    I'm not the fountain of information about this topic, but I do want to link and promote others with whom I mostly agree. Joe Pickrell takes a practical perspective: DTC testing is good for research, because there's an awful lot of research that would never be done without it.

    You can think what you want about the value of the research done to date by 23andme [1], but in my mind, there’s one simple reason why the sorts of participant-driven research they’re doing can only be a good thing: all research is driven by curiosity, and the people most curious about a disease or trait are those who have it. While people may think of the academic research community as a machine with endless resources and limitless motivation, it’s not. People work on things they think are interesting; they sometimes follow “trendy” topics, or move into fields with more grant money, or get bored of a given problem and move on. So if the research in the trait you’re most interested in isn’t moving fast enough for you, well, tough luck.

    Some of my research on Neandertal genetics surely falls into that category, as does almost all genealogical research. The widespread availability of genomes is already leading to much research of anthropological interest.

    Many of my readers will already have seen Razib's post, "Your genes, your rights – FDA’s Jeffrey Shuren not a fan". Working from the video, he tackles what I see as the second and more important issue: Whether the interpretation of genomes should be subject to regulation.

    The online community needs to get organized. We’re not as powerful as a million doctors and a Leviathan government, but we have right on our side. They’re trying to take from us what is ours.

    According to Vorhaus, some regulation is likely a foregone conclusion from the FDA. This is to be expected; it is the ordinary process of rent-seeking by the political class.

  • FDA-DTC genetics meeting

    Tue, 2011-03-08 19:31 -- John Hawks

    The U.S. Food and Drug Administration has been holding a meeting about Direct-to-Consumer genetic testing. Daniel MacArthur has been following the proceedings remotely and his summary of the first day hits the highlights of the Twitter feed over the course of the day. An excerpt:

    My overall impression from reports on the panel discussion, though, is somewhat ominous: both Dan and Alicia noted an apparent consensus among the panel members that genetic results should be returned via a physician

    One of the heavy-hitters in favor of greater regulation was Nancy Wexler, famous for her work on the genetics of Huntington's Disease.

  • Stuxnet story

    Thu, 2011-03-03 23:29 -- John Hawks

    Fascinating detective story in Vanity Fair about how computer security researchers ferreted out the workings of the Stuxnet worm. I especially enjoyed the psychological sketch of one of the main figures in the story, who works in industrial control software:

    “If I did not have the background that I had, I don’t think I would have had the guts to say what I said about Stuxnet,” Langner says now, finishing his second glass of wine during lunch at a Viennese restaurant in Hamburg. Langner studied psychology and artificial intelligence at the Free University of Berlin. He fell into control systems by accident and found that he loved the fiendishly painstaking work. Every control system is like a bespoke suit made from one-of-a-kind custom fabric—tailored precisely for the conditions of that industrial installation and no other. In a profession whose members have a reputation for being unable to wear matching socks, Langner is a bona fide dandy. “My preference is for Dolce & Gabbana shoes,” he says. “Did you notice, yesterday I wore ostrich?” Langner loves the attention that his theories have gotten. He is waiting, he says, for “an American chick,” preferably a blonde, and preferably from California, to notice his blog and ask him out.

  • Neandertal anti-defamation files, 9

    Fri, 2011-02-11 00:09 -- John Hawks

    Why do they have to bring poor Neandertals into it?

    The eyebrow-raising slap came in response to [former senator Rick] Santorum's recent comments that Palin was likely skipping an annual gathering of conservatives in Washington this weekend because of other "business opportunities" and her mothering responsibilities.

    ...

    "I will not call him the knuckle-dragging Neanderthal," Palin continued. "I'll let his wife call him that instead."

    Oh, well, this is so easy a caveman could do it.

  • Genomes to the people

    Wed, 2011-02-02 18:29 -- John Hawks

    Misha Angrist has written a strong guest post at Daniel MacArthur's "Genetic Future", taking a clear stand in favor of disclosure of genetic information from research studies. In response to the argument that full access to genetic information might "disturb" people:

    If you told me I had an inoperable tumor, don’t you think that would mess with my head? In the old days, doctors wouldn’t tell their patients such things (I know, I know—you’re not my doctor). But anyway, who the hell are you to decide what might or might not mess with my head? How do you know I won’t be more upset if you DON’T tell me? And why would it jeopardize my ability to get insurance? Because I’m only asking you to disclose it to ME, not to append it to my medical record or my Facebook page. Your consent form says you will take all kinds of steps to keep my information confidential, but that it might be examined by the FDA, the NIH, etc. So…that means it would be okay for some bureaucrat to peruse it but not me?

    Well, of course. Bureaucrats mind your own business so you don't have to!

Pages

Subscribe to politics

Neandertals

For years, I've worked on their bones. Now I'm working on their genes. Read more about the science studying these ancient people.

Denisova

From a finger bone of an ancient human came the record of a completely unexpected population. My lab is working on the science of the Denisova genome.

Acceleration

The advent of agriculture caused natural selection to speed up greatly in humans. We're uncovering some of the ways that populations have rapidly changed during the last 10,000 years.

Malapa

Just outside Johannesburg, the Malapa site is producing some of the most exciting finds in human evolution. This site is the headquarters of the Malapa Soft Tissue Project.