Graduate School of Library and Information Science
University of Illinois at Urbana-Champaign
Reproducibility and Reliability in Statistical and Data Driven Research
The reproducibility of computational inferences from data is widely recognized as an emerging issue in the scientific reliability of results. This talk will explain the rationale for this shift and outline the problem of reproducibility. I will then present ongoing research on several solutions: empirical research on data and code publication; the pilot project http://ResearchCompendia.org for large-scale validation of statistical findings; and the Reproducible Research Standard for ensuring the distribution of legally re-usable data and code. If time permits, I will present early results assessing the reproducibility of published computational findings. Some of this research is described in my co-edited books, Implementing Reproducible Research and Privacy, Big Data, and the Public Good.