Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Reproducible Project Workflow in R
(with ProjectTemplate)
Caitlin Hudon, Sr. Data Analyst @ web.com
Why reproducible workflow matters
Things I care about (in structuring projects)
+ Easy to find things (reports, data, etc.)
+ Easy to share
+ Ability to upd...
Enter:
Technical Benefits of ProjectTemplate
+ Easy to create new projects
+ Organizes and standardizes your projects
+ Automatic...
Non-technical benefits
+ Automates the thoughtless parts of your project (so you
can use your energy on the important stuf...
Create a project
library(‘ProjectTemplate’)
create.project(‘new_project’)
New ProjectTemplate Project
Munge Folder
Data Folder
Config Folder → global.dcf file
Load a project
library(‘ProjectTemplate’)
setwd(‘~/projects/new_project’)
load.project()
Advice
+ Number files in the munge folder so they run in order
+ Avoid manually updating raw data files (do this in code!)...
Cheat Sheet
ProjectTemplate home page
+ Don’t miss the tutorial!
Great discussion on best practices for managing analysis ...
Questions? Ideas?
caitlinmhudon@gmail.com
@beeonaposy
Upcoming SlideShare
Loading in …5
×

of

Reproducible Project Workflow in R (with ProjectTemplate) Slide 1 Reproducible Project Workflow in R (with ProjectTemplate) Slide 2 Reproducible Project Workflow in R (with ProjectTemplate) Slide 3 Reproducible Project Workflow in R (with ProjectTemplate) Slide 4 Reproducible Project Workflow in R (with ProjectTemplate) Slide 5 Reproducible Project Workflow in R (with ProjectTemplate) Slide 6 Reproducible Project Workflow in R (with ProjectTemplate) Slide 7 Reproducible Project Workflow in R (with ProjectTemplate) Slide 8 Reproducible Project Workflow in R (with ProjectTemplate) Slide 9 Reproducible Project Workflow in R (with ProjectTemplate) Slide 10 Reproducible Project Workflow in R (with ProjectTemplate) Slide 11 Reproducible Project Workflow in R (with ProjectTemplate) Slide 12
Upcoming SlideShare
CAN I TAKE A PHOTO
Next
Download to read offline and view in fullscreen.

0 Likes

Share

Download to read offline

Reproducible Project Workflow in R (with ProjectTemplate)

Download to read offline

ProjectTemplate is an R package that makes it easy to create and run projects so that you can spend less time thinking about organization and more time analyzing data.

  • Be the first to like this

Reproducible Project Workflow in R (with ProjectTemplate)

  1. 1. Reproducible Project Workflow in R (with ProjectTemplate) Caitlin Hudon, Sr. Data Analyst @ web.com
  2. 2. Why reproducible workflow matters
  3. 3. Things I care about (in structuring projects) + Easy to find things (reports, data, etc.) + Easy to share + Ability to update with new data + Reproducibility + Being able to quickly verify results + The sanity of my future self
  4. 4. Enter:
  5. 5. Technical Benefits of ProjectTemplate + Easy to create new projects + Organizes and standardizes your projects + Automatically loads data and R packages needed + Automatically runs data munging scripts
  6. 6. Non-technical benefits + Automates the thoughtless parts of your project (so you can use your energy on the important stuff) + Easier for someone new to your project to understand steps / data / preparation involved to execute it
  7. 7. Create a project library(‘ProjectTemplate’) create.project(‘new_project’)
  8. 8. New ProjectTemplate Project Munge Folder Data Folder Config Folder → global.dcf file
  9. 9. Load a project library(‘ProjectTemplate’) setwd(‘~/projects/new_project’) load.project()
  10. 10. Advice + Number files in the munge folder so they run in order + Avoid manually updating raw data files (do this in code!) + Edit the global.dcf file (in config folder) to add libraries you use, load libraries automatically, and adjust “strings as factors” settings + Lots of other ways to make analysis reproducible; ProjectTemplate is one part of larger ecosystem
  11. 11. Cheat Sheet ProjectTemplate home page + Don’t miss the tutorial! Great discussion on best practices for managing analysis projects The talk that got me interested in ProjectTemplate (thanks, Hilary Parker!)
  12. 12. Questions? Ideas? caitlinmhudon@gmail.com @beeonaposy

ProjectTemplate is an R package that makes it easy to create and run projects so that you can spend less time thinking about organization and more time analyzing data.

Views

Total views

867

On Slideshare

0

From embeds

0

Number of embeds

20

Actions

Downloads

6

Shares

0

Comments

0

Likes

0

×