Motivation
Four general principles
Case studies
Costs and benefits
20 July 2017
Motivation
Four general principles
Case studies
Costs and benefits
Motivation
Reproducibilty is necessary for scientific progress
Computers wrangle the data, but also obscure it
Especially point-and-click actions
Technical solutions available in open source/format/data/access
Four general principles of reproducible research that have emerged in other fields
✓ Make openly available the data and methods that generated the published result
✓ Write scripts to conduct analyses
✓ Use version control to track changes
✓ Describe and archive the computational environment
All files on figshare, OSF, university data repo, or similar
Data in CSV format
Organised as an R package
All files tracked with Git, hosted on GitHub
Collaboration occurred via GitHub's 'flow'
Docker image and Dockerfile to contain RStudio, packages, code and external dependencies
Based on Rocker image and templates
Case Studies of Compendia
VCS repository
containing…
README.md
R package & manuscript
code CI
environment CI
Costs & benefits
Time learning the tools
Time doing new things
Built-in vs Bolt-on
Comfort of knowing that I am right & have no secrets
Save time by reusing my previous code
Open data confers citation advantages, but magnitude is highly variable
Open Source community membership provides access to high-quality help
Open methods and materials, scripted workflow, version control and environment control are generic principles suitable for most fields of research
The specific details will change over time, but the principles will endure
For most people, the technical problems already have good solutions, the remaining challenge is cultural
Presentation written in R Markdown using ioslides
Compiled into HTML5 using RStudio & knitr
Source code hosting: https://github.com/benmarwick/
ORCID: http://orcid.org/0000-0001-7879-4531
Licensing: