12.
thedatacollective@danwwilson #useR2018
Where to start?
13.
thedatacollective@danwwilson #useR2018
It’s all about the data
14.
thedatacollective@danwwilson #useR2018
Set your requirements…
…then be flexible
15.
thedatacollective@danwwilson #useR2018
Make it yours
16.
thedatacollective@danwwilson #useR2018
*
Organise things*
• Name things well http://bit.ly/Jenny_naming
• Determine your folder structure and use it consistently
17.
thedatacollective@danwwilson #useR2018
Identify points of friction
18.
thedatacollective@danwwilson #useR2018
Workflows take thinking
• What work is repetitive?
• What data can you standardize across projects?
• How should you name things?
• How should you structure projects?
• What steps in a project slow you down?
19.
thedatacollective@danwwilson #useR2018
Building skills
20.
thedatacollective@danwwilson #useR2018
Example Project
Receive
Data
Standardise
Data
Review
Summary
Client
Thedata
collective
Analyse
Data
Review data
Zip Outputs
25.
thedatacollective@danwwilson #useR2018
Start at your level, and improve
• Start simple
• Re-usable code snippets
• Keep them together
• Build a file of commonly used functions (99_functions.R)
• Build a package
• When you can make the time, or skills are ready
27.
thedatacollective@danwwilson #useR2018
What slows you down?
• R is great even with its quirks
> paste(“Dan”, NA, “Wilson”)
# Dan NA Wilson
paste_na()
> paste_na(“Dan”, NA, “Wilson”)
# Dan Wilson
28.
thedatacollective@danwwilson #useR2018
Standardised data
> data(package = "segmentr")
Data sets in package 'segmentr’:
ask_conversion Ask Conversion Table
lookup_channel Channel Code Lookup
lookup_classification Classification Code Lookup
segments Segments
29.
thedatacollective@danwwilson #useR2018
Streamlined process*
• Build templates
• Start the project quickly
• Next step is client specific templates
30.
thedatacollective@danwwilson #useR2018
Getting data out of R*
• Copying and pasting from the console is painful (or impossible)
• Writing to CSV and opening to copy/paste from is too many steps
• copy_clip()
31.
thedatacollective@danwwilson #useR2018
Build solutions to common problems
• Limit manual intervention
• Make objects (data, functions, etc) accessible
• Reduce repetition
• Simplify integration with other tools/software
32.
thedatacollective@danwwilson #useR2018
Key take outs
1. Workflows require a little bit of forethought… take the time
2. Start at your level and build your skills
3. Ease the friction