Applications of R 
How companies use data science to succeed 
David Smith @revodavid 
Chief Community Officer 
Revolution Analytics 
DataWeek San Francisco, September 17 2014
What is R? 
 Most widely used data analysis software 
• Used by 2M+ data scientists, statisticians and analysts 
 Most powerful statistical programming language 
• Flexible, extensible and comprehensive for productivity 
 Create beautiful and unique data visualizations 
• As seen in New York Times, Twitter and Flowing Data 
 Thriving open-source community 
• Leading edge of analytics research 
 Fills the talent gap 
• New graduates prefer R 
www.revolutionanalytics.com/what-r
3 
R’s popularity is growing rapidly 
R Usage Growth 
Rexer Data Miner Survey, 2007-2013 
• Rexer Data Miner Survey • IEEE Spectrum, July 2014 
#9: R 
Language Popularity 
IEEE Spectrum Top Programming Languages
4 
R is among the highest-paid IT skills in the US 
• Dice Tech Salary Survey, January 
2014 
• O’Reilly Strata 2013 Data Science 
Salary Survey
Applications of R 
5
Facebook 
• Exploratory Data 
Analysis 
• Experimental Analysis 
“Generally, we use R to move 
fast when we get a new data 
set. With R, we don’t need to 
develop custom tools or write 
a bunch of code. Instead, we 
can just go about cleaning 
and exploring the data.” — 
Solomon Messing, data 
scientist at Facebook
Facebook 
• Big-Data Visualization 
“It resonated with 
many people. It's not 
just a pretty picture, 
it's a reaffirmation of 
the impact we have 
in connecting 
people, even across 
oceans and 
borders.” — Paul 
Butler, data 
scientist, Facebook
Google 
“The great beauty of R 
is that you can modify 
it to do all sorts of 
things.” 
— Hal Varian 
Chief Economist, 
Google 
8 
“R is really 
important to the 
point that it's hard 
to overvalue it.” — 
Daryl Pregibon 
Head of 
Statistics, 
Google 
• Advertising 
Effectiveness 
• Economic forecasting
9 
 Calculating ROI for 
Marketing campaigns 
 CausalImpact: Bayesian 
structural time-series 
models
10 
The New York Times 
Interactive Features 
• Election Forecast 
• Dialect Quiz 
Data Journalism 
• NFL Draft Picks 
• Wealth distribution in USA
11 
The New York Times 
Data Visualization 
• Facebook IPO 
• Baseball legends
12 
Twitter 
“A common pattern for me is that I'll code a MapReduce 
job in Scala, do some simple command-line munging on 
the results, pass the data into Python or R for further 
analysis, pull from a database to grab some extra fields, 
and so on, often integrating what I find into some 
machine learning models in the end” — Ed Chen, Data 
Scientist, Twitter 
• Data Visualization • Semantic clustering
13 
Public Affairs 
• Casualty estimation in Warzones • Political Analysis
14 
Weather and Climate 
• Climate change forecasts • Flood Warnings
15 
Video Gaming 
• Multiplayer Matchmaking 
• Player Churn 
• Game design 
• Difficulty curve 
• Level trouble-spots 
• In-game purchase optimization 
• Fraud detection 
• Player communities 
• Game Analysis 
Video Games
16 
Finance and Banking 
• Credit Risk Analysis • Financial Networks
Companies Using R 
17 
Social media 
Google 
Facebook 
Twitter 
Foursquare 
Kickstarter 
eHarmony 
Media 
New York 
Times 
Economist 
New Scientist 
XBox 
Finance 
American 
Century 
ANZ 
Credit Suisse 
Nationwide 
Lloyds 
BofA 
Software 
Vendors 
Revolution 
Analytics 
Rstudio 
Zementis 
Alteryx 
SAP 
IBM 
HP 
SAS 
Teradata 
TIBCO 
Oracle 
OneTick 
DataCamp 
Services 
Mango 
Accenture 
Deloitte 
Scientific Revenue 
OpenBI 
Coursera 
Government 
FDA 
CPFB 
City of Chicago 
NOAA 
NIST 
Analytics 
Zillow 
Trulia 
DataSong 
Exelate 
X+1 
PredictWise 
Public Affairs 
HRDAG 
Sunlight 
Foundation 
Benetech 
RealClimate 
Other 
Ford 
John Deere 
Monsanto 
Nordstrom 
Uber 
Etsy 
www.revolutionanalytics.com/companies-using-r
18 
OUR COMPANY 
The leading provider 
of advanced analytics 
software and services 
based on open source R, 
since 2007 
OUR SOFTWARE 
The only Big Data, Big 
Analytics software platform 
based on the data science 
language R 
SOME KUDOS 
Visionary 
Gartner Magic Quadrant 
for Advanced Analytics 
Platforms, 2014
Thank You 
David Smith 
david@revolutionanalytics.com, @revodavid 
Download these slides from: 
blog.revolutionanalytics.com/2014/09/dataweek.html

Applications of R (DataWeek 2014)

  • 1.
    Applications of R How companies use data science to succeed David Smith @revodavid Chief Community Officer Revolution Analytics DataWeek San Francisco, September 17 2014
  • 2.
    What is R?  Most widely used data analysis software • Used by 2M+ data scientists, statisticians and analysts  Most powerful statistical programming language • Flexible, extensible and comprehensive for productivity  Create beautiful and unique data visualizations • As seen in New York Times, Twitter and Flowing Data  Thriving open-source community • Leading edge of analytics research  Fills the talent gap • New graduates prefer R www.revolutionanalytics.com/what-r
  • 3.
    3 R’s popularityis growing rapidly R Usage Growth Rexer Data Miner Survey, 2007-2013 • Rexer Data Miner Survey • IEEE Spectrum, July 2014 #9: R Language Popularity IEEE Spectrum Top Programming Languages
  • 4.
    4 R isamong the highest-paid IT skills in the US • Dice Tech Salary Survey, January 2014 • O’Reilly Strata 2013 Data Science Salary Survey
  • 5.
  • 6.
    Facebook • ExploratoryData Analysis • Experimental Analysis “Generally, we use R to move fast when we get a new data set. With R, we don’t need to develop custom tools or write a bunch of code. Instead, we can just go about cleaning and exploring the data.” — Solomon Messing, data scientist at Facebook
  • 7.
    Facebook • Big-DataVisualization “It resonated with many people. It's not just a pretty picture, it's a reaffirmation of the impact we have in connecting people, even across oceans and borders.” — Paul Butler, data scientist, Facebook
  • 8.
    Google “The greatbeauty of R is that you can modify it to do all sorts of things.” — Hal Varian Chief Economist, Google 8 “R is really important to the point that it's hard to overvalue it.” — Daryl Pregibon Head of Statistics, Google • Advertising Effectiveness • Economic forecasting
  • 9.
    9  CalculatingROI for Marketing campaigns  CausalImpact: Bayesian structural time-series models
  • 10.
    10 The NewYork Times Interactive Features • Election Forecast • Dialect Quiz Data Journalism • NFL Draft Picks • Wealth distribution in USA
  • 11.
    11 The NewYork Times Data Visualization • Facebook IPO • Baseball legends
  • 12.
    12 Twitter “Acommon pattern for me is that I'll code a MapReduce job in Scala, do some simple command-line munging on the results, pass the data into Python or R for further analysis, pull from a database to grab some extra fields, and so on, often integrating what I find into some machine learning models in the end” — Ed Chen, Data Scientist, Twitter • Data Visualization • Semantic clustering
  • 13.
    13 Public Affairs • Casualty estimation in Warzones • Political Analysis
  • 14.
    14 Weather andClimate • Climate change forecasts • Flood Warnings
  • 15.
    15 Video Gaming • Multiplayer Matchmaking • Player Churn • Game design • Difficulty curve • Level trouble-spots • In-game purchase optimization • Fraud detection • Player communities • Game Analysis Video Games
  • 16.
    16 Finance andBanking • Credit Risk Analysis • Financial Networks
  • 17.
    Companies Using R 17 Social media Google Facebook Twitter Foursquare Kickstarter eHarmony Media New York Times Economist New Scientist XBox Finance American Century ANZ Credit Suisse Nationwide Lloyds BofA Software Vendors Revolution Analytics Rstudio Zementis Alteryx SAP IBM HP SAS Teradata TIBCO Oracle OneTick DataCamp Services Mango Accenture Deloitte Scientific Revenue OpenBI Coursera Government FDA CPFB City of Chicago NOAA NIST Analytics Zillow Trulia DataSong Exelate X+1 PredictWise Public Affairs HRDAG Sunlight Foundation Benetech RealClimate Other Ford John Deere Monsanto Nordstrom Uber Etsy www.revolutionanalytics.com/companies-using-r
  • 18.
    18 OUR COMPANY The leading provider of advanced analytics software and services based on open source R, since 2007 OUR SOFTWARE The only Big Data, Big Analytics software platform based on the data science language R SOME KUDOS Visionary Gartner Magic Quadrant for Advanced Analytics Platforms, 2014
  • 19.
    Thank You DavidSmith david@revolutionanalytics.com, @revodavid Download these slides from: blog.revolutionanalytics.com/2014/09/dataweek.html

Editor's Notes

  • #2 Companies using R in 2013 http://blog.revolutionanalytics.com/2013/05/companies-using-open-source-r-in-2013.html Title: How the growth of R helps data-driven organizations succeed Abstract: Adoption of the R language has grown rapidly in the last few years, and is ranked as the number-one data science language in several surveys. This accelerating R adoption curve has been driven by the Big Data revolution, and the fact that so many data scientists — having learned R at university — are actively unlocking the secrets hidden in these new, vast data troves. In more than 6 years of writing for the Revolutions blog, I’ve discovered hundreds of applications of R in business, in government, and in the non-profit sector. Sometimes the use of R is obvious, and sometimes it takes a little bit of detective work to learn how R is operating behind the scenes. In this talk, I’ll begin by presenting some recent statistics on the growth of R. Then I’ll recount some of my favourite applications of R, and show how R is behind some amazing innovations in today’s world.
  • #3 Image reference: http://www.facebook.com/notes/facebook-engineering/visualizing-friendships/469716398919
  • #5 Dice Tech Salary Survey, January 2014 O’Reilly Strata 2013 Data Science Salary Survey
  • #9 A
  • #11 Fantasy Football: http://blog.revolutionanalytics.com/2013/10/fantasy-football-modeling-with-r.html
  • #13 http://blog.revolutionanalytics.com/2013/05/the-arteries-of-the-world-in-tweets.html http://blog.revolutionanalytics.com/2012/03/r-twitter-and-mcdonalds.html
  • #16 Xbox: http://blog.revolutionanalytics.com/2014/05/microsoft-uses-r-for-xbox-matchmaking.html Other gaming http://blog.revolutionanalytics.com/2013/06/how-big-data-and-statistical-modeling-are-changing-video-games.html
  • #17 Credit Suisse: http://blog.revolutionanalytics.com/2013/05/sheftel-on-r-on-the-trading-desk.html
  • #18 EHarmony: http://blog.revolutionanalytics.com/2013/11/strata-hadoop-world-2013-recap.html Ford: http://www.zdnet.com/fords-big-data-chief-sees-massive-possibilities-but-the-tools-need-work-7000000322/ CFPB http://blog.revolutionanalytics.com/2012/04/r-at-the-consumer-financial-protection-bureau.html