SlideShare a Scribd company logo
Submit Search
Upload
NY R Conference talk
Report
E
Emily Robinson
Follow
•
1 like
•
702 views
1
of
45
NY R Conference talk
•
1 like
•
702 views
Download Now
Download to read offline
Report
Technology
My talk in April 2018 at the NY R Conference on the Lesser Known Stars of the Tidyverse.
Read more
E
Emily Robinson
Follow
Recommended
The Lesser Known Stars of the Tidyverse
Emily Robinson
999 views
•
48 slides
Building an A/B Testing Analytics System with R and Shiny
Emily Robinson
7.8K views
•
43 slides
Introduction to r
Alberto Labarga
476 views
•
37 slides
Getting started with R
CCAFS | CGIAR Research Program on Climate Change, Agriculture and Food Security
557 views
•
37 slides
Object Calisthenics
KLabCyscorpions-TechBlog
2.1K views
•
56 slides
Distributed processing of large graphs in python
Jose Quesada (hiring)
963 views
•
115 slides
More Related Content
Similar to NY R Conference talk
Startup Data Science
Misha Lisovich
290 views
•
13 slides
LECTURE 6 DESIGN, DEBasd, INTERFACES.pdf
ShashikantSathe3
3 views
•
51 slides
2022 - Delivering Powerful Technical Presentations.pdf
Wesley Reisz
105 views
•
80 slides
LECTURE 6 DESIGN, DEBUGGING, INTERFACES.pdf
SHASHIKANT346021
8 views
•
51 slides
Developing in R - the contextual Multi-Armed Bandit edition
Robin van Emden
653 views
•
50 slides
The Essential Perl Hacker's Toolkit
Stephen Scaffidi
3K views
•
229 slides
Similar to NY R Conference talk
(20)
Startup Data Science
Misha Lisovich
•
290 views
LECTURE 6 DESIGN, DEBasd, INTERFACES.pdf
ShashikantSathe3
•
3 views
2022 - Delivering Powerful Technical Presentations.pdf
Wesley Reisz
•
105 views
LECTURE 6 DESIGN, DEBUGGING, INTERFACES.pdf
SHASHIKANT346021
•
8 views
Developing in R - the contextual Multi-Armed Bandit edition
Robin van Emden
•
653 views
The Essential Perl Hacker's Toolkit
Stephen Scaffidi
•
3K views
Lotusphere 2007 AD505 DevBlast 30 LotusScript Tips
Bill Buchan
•
804 views
2014 pycon-talk
c.titus.brown
•
3.1K views
Don't let your tests slow you down
Daniel Irvine
•
259 views
Final grasp ASE
babak danyal
•
1.4K views
Applying an intersectionality lens in data science
Data Con LA
•
45 views
Ad505 dev blast
Bill Buchan
•
503 views
Unicorn or Rhino? Agility in the Data Realm?
Daniel Davis
•
35 views
Qualitative Data Analysis using NVivo10 - A workshop for facilitators
Brenda Cecilia Padilla Rodríguez
•
818 views
Five Ways To Do Data Analytics "The Wrong Way"
Discover Pinterest
•
1.4K views
How To Contribute To Drupal Drupal - DrupalCon Paris
Drupalcon Paris
•
2.6K views
Data Visualization: A Quick Tour for Data Science Enthusiasts
Krist Wongsuphasawat
•
50.6K views
Data Visualization
Vera Kovaleva
•
339 views
Beginner's Guide to UI Design
Máirín Duffy
•
3.6K views
DataHub
Aditya Parameswaran
•
2K views
Recently uploaded
AI: mind, matter, meaning, metaphors, being, becoming, life values
Twain Liu 刘秋艳
32 views
•
16 slides
How to reduce cold starts for Java Serverless applications in AWS at JCON Wor...
Vadym Kazulkin
56 views
•
64 slides
JCon Live 2023 - Lice coding some integration problems
Bernd Ruecker
62 views
•
85 slides
.conf Go 2023 - Data analysis as a routine
Splunk
85 views
•
12 slides
Web Dev - 1 PPT.pdf
gdsczhcet
49 views
•
45 slides
ChatGPT and AI for Web Developers
Maximiliano Firtman
161 views
•
82 slides
Recently uploaded
(20)
AI: mind, matter, meaning, metaphors, being, becoming, life values
Twain Liu 刘秋艳
•
32 views
How to reduce cold starts for Java Serverless applications in AWS at JCON Wor...
Vadym Kazulkin
•
56 views
JCon Live 2023 - Lice coding some integration problems
Bernd Ruecker
•
62 views
.conf Go 2023 - Data analysis as a routine
Splunk
•
85 views
Web Dev - 1 PPT.pdf
gdsczhcet
•
49 views
ChatGPT and AI for Web Developers
Maximiliano Firtman
•
161 views
Liqid: Composable CXL Preview
CXL Forum
•
120 views
GigaIO: The March of Composability Onward to Memory with CXL
CXL Forum
•
122 views
"Role of a CTO in software outsourcing company", Yuriy Nakonechnyy
Fwdays
•
38 views
Photowave Presentation Slides - 11.8.23.pptx
CXL Forum
•
120 views
Samsung: CMM-H Tiered Memory Solution with Built-in DRAM
CXL Forum
•
101 views
Combining Orchestration and Choreography for a Clean Architecture
ThomasHeinrichs1
•
67 views
"Fast Start to Building on AWS", Igor Ivaniuk
Fwdays
•
34 views
Data-centric AI and the convergence of data and model engineering:opportunit...
Paolo Missier
•
25 views
Throughput
Moisés Armani Ramírez
•
31 views
MemVerge: Past Present and Future of CXL
CXL Forum
•
110 views
Transcript: The Details of Description Techniques tips and tangents on altern...
BookNet Canada
•
106 views
"AI Startup Growth from Idea to 1M ARR", Oleksandr Uspenskyi
Fwdays
•
25 views
Empathic Computing: Delivering the Potential of the Metaverse
Mark Billinghurst
•
422 views
METHOD AND SYSTEM FOR PREDICTING OPTIMAL LOAD FOR WHICH THE YIELD IS MAXIMUM ...
Prity Khastgir IPR Strategic India Patent Attorney Amplify Innovation
•
24 views
NY R Conference talk
1.
The Lesser Known
s of the Tidyverse Emily Robinson @robinson_es
2.
About Me ➔ R
User ~ 6 years ➔ Data Scientist at DataCamp ➔ Enjoy talking about: ◆ A/B testing ◆ Building and finding data science community ◆ R
3.
Talk Goals
4.
1. Keep you
hip to the lingo
5.
2. Stop you
from doing this ….
6.
…. by sharing
useful functions
7.
3. Point you
to resources
8.
The Tidyverse
9.
Coherent system of
packages for data manipulation, exploration, and visualization that share a common design philosophy
11.
Tidyverse = ?
12.
Tidyverse = !
13.
Tidyverse != Hadleyverse
14.
Tidyverse != Hadleyverse Many
other contributors
15.
Demo
17.
Some steps of
a data analysis workflow ➔ View dataset in console ➔ Inspect missing values ➔ Examine some columns ➔ Make a plot ➔ Do something cool and new!
18.
Problem: it takes
over the console Step 1: print your dataset!
19.
Prints only 10
rows and the columns that fit on the screen Solution: as_tibble()
20.
Problem: how do
you do this for every column? Step 2: examine your NAs
21.
Problem: missing values
aren’t actually NA Answer: purrr::map_df() to “map” function over each column
22.
Solution: na_if() to
replace certain values with NA
23.
Problem: how I
can I do this quickly? + Skimr Solution: dplyr::select_if() + skimr::skim() Step 3: examine your numeric columns
24.
Problem: it has
multiple answers in each row Step 4: examine a single column
25.
Solution: stringr::str_split() …
26.
Solution: stringr::str_split() and
tidyr::unnest() +
27.
Problem: it’s a
mess Step 5: make a plot!
28.
Solution: coord_flip … But
they’re not ordered
29.
+ forcats::fct_reorder
30.
Final step: do
something cool and new! Problem:
31.
One solution: make
a minimal reproducible example +
32.
Part 0 (optional):
use tribble() to make a toy dataset
33.
Part 1: Use
reprex() to find any problems Credit: Nick Tiernay, https://www.njtierney.com/post/2017/01/11/magic-reprex/
34.
Part 2: Use
reprex() to post your question or issue Credit: Nick Tiernay, https://www.njtierney.com/post/2017/01/11/magic-reprex/
35.
Review stringr::str_split tidyr::unnest coord_flip() forcats::fct_reorder tibble::tribble reprex::reprex tibble::as_tibble purrr:map_df dplyr::na_if dplyr::select_if skimr::skim
36.
Resources
37.
R4ds.had.co.nz
38.
#rstats twitter
39.
#rstats twitter
40.
Rstudio.com/resources/cheatsheets
41.
DataCamp.com
42.
Learn | https://datacamp.com/courses
43.
Conclusion
44.
The tidyverse Come for
the stickers and package names … Stay for the friendly community and happy workflow
45.
Thank you! tiny.cc/nyrtalk hookedondata.org @robinson_es