Reproducible research can still be wrong: adopting a prevention approach

JT Leek, RD Peng - … of the National Academy of Sciences, 2015 - National Acad Sciences
Proceedings of the National Academy of Sciences, 2015National Acad Sciences
Reproducibility—the ability to recompute results—and replicability—the chances other
experimenters will achieve a consistent result—are two foundational characteristics of
successful scientific research. Consistent findings from independent investigators are the
primary means by which scientific evidence accumulates for or against a hypothesis. Yet, of
late, there has been a crisis of confidence among researchers worried about the rate at
which studies are either reproducible or replicable. To maintain the integrity of science …
Reproducibility—the ability to recompute results—and replicability—the chances other experimenters will achieve a consistent result—are two foundational characteristics of successful scientific research. Consistent findings from independent investigators are the primary means by which scientific evidence accumulates for or against a hypothesis. Yet, of late, there has been a crisis of confidence among researchers worried about the rate at which studies are either reproducible or replicable. To maintain the integrity of science research and the public’s trust in science, the scientific community must ensure reproducibility and replicability by engaging in a more preventative approach that greatly expands data analysis education and routinely uses software tools. We define reproducibility as the ability to recompute data analytic results given an observed dataset and knowledge of the data analysis pipeline. The replicability of a study is the chance that an independent experiment targeting the same scientific question will produce a consistent result (1). Concerns among scientists about both have gained significant traction recently due in part to a statistical argument that suggested most published scientific results may be false positives (2). At the same time, there have been some very public failings of reproducibility across a range of disciplines from cancer genomics (3) to economics (4), and the data for many publications have not been made publicly available, raising doubts about the quality of data analyses. Popular press articles have raised questions about the reproducibility of all scientific research (5), and the US Congress has convened hearings focused on the transparency of scientific research (6). The result is that much of the scientific enterprise has been called into question, putting funding and hard won scientific truths at risk.
From a computational perspective, there are three major components to a reproducible and replicable study:(i) the raw data from the experiment are available,(ii) the statistical code and documentation to reproduce the analysis are available, and (iii) a correct data analysis must be performed. Recent cultural shifts in genomics and other areas have had a positive impact on data and code availability. Journals are starting to require data availability as a condition for publication (7), and centralized databases such as the National Center for Biotechnology Information’s Gene Expression Omnibus are being created for depositing data generated by publicly funded scientific experiments. New
National Acad Sciences