9-10th October 2018

About me

Imposter syndrome

Hadley Wickham: Practioner/Programmer

Reproducible Science

"Reproducibility involves being able to recalculate the exact numbers in a data analysis using the code and raw data provided by the analyst…Reproducibility should not be confused with “correctness” of a data analysis. A data analysis can be fully reproducible and recreate all numbers in an analysis and still be misleading or incorrect."

Jeff Leek, The Elements of Data Analytic Style

Official lesson materials

All the official software carpentry lesson materials can be found here

The official materials mostly use base R, we used a mixture of base R and the tidyverse


  • Intro to R and RStudio
  • Importing data
  • Transforming data with dplyr
  • Functions in R
  • Visualising data with ggplot

R is 25 years old

Ross Ihaka and Robert Gentleman.

R: A language for data analysis and graphics. Journal of Computational and Graphical Statistics, 5(3):299–314, 1996

Writing code is frustrating

“There are only two kinds of languages: the ones people complain about and the ones nobody uses”

Bjarne Stroustrup, C++ creator and developer

Reproducible R

Don't save your workspace, save your code