Applications in R
Success and Lessons Learned from the Marketplace
David Smith
Chief Community Officer
Revolution Analytics
July 29, 2014
Neera Talbert
VP Professional Services
Revolution Analytics
Agenda
Introduction to R
Growth of R
Applications of R
Q&A
David Smith
Chief Community Officer
@revodavid
Editor, blog.revolutionanalytics.com
Co-author, “Introduction to R”
OUR COMPANY
The leading provider
of advanced analytics
software and services
based on open source R,
since 2007
OUR SOFTWARE
The only Big Data, Big
Analytics software platform
based on the data science
language R
SOME KUDOS
Visionary
Gartner Magic Quadrant
for Advanced Analytics
Platforms, 2014
What is R?
• Most widely used data analysis software
– Used by 2M+ data scientists, statisticians and analysts
• Most powerful statistical programming language
– Flexible, extensible and comprehensive for productivity
• Create beautiful and unique data visualizations
– As seen in New York Times, Twitter and Flowing Data
• Thriving open-source community
– Leading edge of analytics research
• Fills the talent gap
– New graduates prefer R
www.revolutionanalytics.com/what-r
Poll #1
What data analysis software is used where you work?
R’s popularity is growing rapidly
• R Usage Growth: Rexer Data Miner Survey, 2007-2013
• Language Popularity: IEEE Spectrum Top Programming Languages, July 2014 (#9: R)
R is among the highest-paid IT skills in the US
• Dice Tech Salary Survey, January 2014
• O’Reilly Strata 2013 Data Science Salary Survey
Technical Support for Open Source R
AdviseR™ from Revolution Analytics
Technical support for open source R, from the R experts.
• Email and phone support 8AM-6PM, Mon-Fri
• Support for R, validated packages, and third-party software connections
• On-line case management and knowledgebase
• Access to technical resources, documentation and user forums
• Exclusive on-line webinars from community experts
• Guaranteed response times
Also available: expert hands-on and on-line training for R, from
Revolution Analytics AcademyR.
www.revolutionanalytics.com/AdviseR
R SUPPORT: 12 MONTHS, $795 PER USER
Applications of R
Facebook
• Exploratory Data Analysis
• Experimental Analysis
“Generally, we use R to move fast when we get a new data set. With R, we don’t need to develop custom tools or write a bunch of code. Instead, we can just go about cleaning and exploring the data.”
- Solomon Messing, data scientist at Facebook
Facebook
• Big-Data Visualization
“It resonated with many people. It's not just a pretty picture, it's a reaffirmation of the impact we have in connecting people, even across oceans and borders.”
- Paul Butler, data scientist, Facebook
Google
“The great beauty of R is that you can modify it to do all sorts of things.”
- Hal Varian, Chief Economist, Google
• Advertising Effectiveness
“R is really important to the point that it's hard to overvalue it.”
- Daryl Pregibon, Head of Statistics, Google
• Economic forecasting
Twitter
• Data Visualization
• Semantic clustering
“A common pattern for me is that I'll code a MapReduce job in Scala, do some simple command-line munging on the results, pass the data into Python or R for further analysis, pull from a database to grab some extra fields, and so on, often integrating what I find into some machine learning models in the end”
- Ed Chen, Data Scientist, Twitter
City of Chicago
Public Health
• Food poisoning monitor
The New York Times
Interactive Features
• Election Forecast
• Dialect Quiz
Data Journalism
• NFL Draft Picks
• Wealth distribution in USA
The New York Times
Data Visualization
• Facebook IPO
• Baseball legends
Video Gaming
• Multiplayer Matchmaking
• Player Churn
• Game design
• Difficulty curve
• Level trouble-spots
• In-game purchase optimization
• Fraud detection
• Player communities
• Game Analysis
Video Games
Housing
• Crime mapping
• choroplethr package
“The core innovation that Zillow offers are its advanced statistical predictive products, including the Zestimate®, the Rent Zestimate and the ZHVI® family of real estate indexes. By using R in production as well as research, Zillow maximizes flexibility and minimizes the latency in rolling out updates and new products.”
• Statistical forecasting
Real Estate
Finance and Banking
• Credit Risk Analysis
• Financial Networks
Pharmaceuticals
“R use at the FDA is completely acceptable and has not caused any problems.”
- Dr Jae Brodsky, Office of Biostatistics, Food and Drug Administration
Regulatory Drug Approvals
• Reproducible research
• Accurate, reliable and consistent statistical analysis
• Internal reporting (Section 508 compliance)
Power
“We’ve combined Revolution R Enterprise and Hadoop to build and deploy customized exploratory data analysis and GAM survival models for our marketing performance management and attribution platform. Given that our data sets are already in the terabytes and are growing rapidly, we depend on Revolution R Enterprise’s scalability and power – we saw about a 4x performance improvement on 50 million records. It works brilliantly.”
- CEO, John Wallace, DataSong
4X performance
50M records scored daily
Scalability
“We’ve been able to scale our solution to a problem that’s so big that most companies could not address it. If we had to go with a different solution we wouldn’t be as efficient as we are now.”
- SVP Analytics, Kevin Lyons, eXelate
Terabytes of data from 200+ data sources
Tens of thousands of attributes
Hundreds of millions of scores daily
2X data
2X attributes
no impact on performance
Performance
“We need a high-performance analytics infrastructure because marketing optimization is a lot like a financial trading. By watching the market constantly for data or market condition updates, we can now identify opportunities for our clients that would otherwise be lost.”
- Chief Analytics Officer, Leon Zemel, [x+1]
Marketing Analytics
All of Open Source R plus:
• Big Data scalability
• High-performance analytics
• Development and deployment tools
• Data source connectivity
• Application integration framework
• Multi-platform architecture
• Technical Support
• Available training and services
Revolution R Enterprise is the Big Data Big Analytics Platform
Poll #2
What kinds of R projects are underway where you work?
Neera Talbert, VP Big Data & Advanced Analytic Services
• Leads Services at Revolution Analytics
• Fifteen years of experience in the business analytics software industry
• Works with Fortune 500 companies to define analytics strategy, implement analytics-based decision making, reduce decision latency, and increase speed of decision making
– Analytics, business intelligence, big data analytics, risk
– Customer intelligence, supply chain, manufacturing, retail, oil & gas, public sector
Organizational Readiness
“There will be almost half a million jobs in five years, and a shortage of up to 190,000 qualified data scientists, plus a need for 1.5 million executives and support staff who have an understanding of data”
- McKinsey Global Institute, April 2013
Opportunity to develop talent
Data Science: “the sexiest job in the 21st century” - Harvard Business Review
A cross between computer engineers, statisticians and business analysts: people who ask good questions and are open to working with unstructured information
Universities can’t produce them fast enough: need 60% more resources - McKinsey Global Institute
Our Philosophy
“The hands-on exercises were the best part of Revolution Analytics training”
- A participant from a global telecom company
Course Catalog
www.revolutionanalytics.com/AcademyR
RRE Certification Testing
• Demonstrate your R and RRE programming knowledge
– Fundamentals in R Language
– Data Management in Revolution R Enterprise
– Modeling in Revolution R Enterprise
• Independently proctored exam, online and onsite
Training Data Science team for Big Data Analytics
“Given that our data sets are already in the terabytes and are growing rapidly, we depend on Revolution R Enterprise’s scalability and power. We saw about a 4x performance improvement on 50 million records. It works brilliantly.”
- CEO, John Wallace (DataSong, formerly named UpStream)
4X performance
50M+ records scored daily
Key Technology: Revolution R Enterprise and Hadoop, replacing SAS and Open Source R
Outcomes: Massively scalable infrastructure to support attribution and optimization at an individual customer level (segments of one) for clients such as Williams-Sonoma. Client saved $250K in one campaign. Rapid development and deployment of customer-specific models, using innovative analytic techniques such as big data GAM Survival models.
Bottom Line: Driving revenue lift and cost savings through marketing optimization
Profile: Multi-channel marketing attribution and analytics software developer and service provider. Growing, innovative, cost-conscious.
Model Development for Supply Chain Analytics with Hadoop
Profile: The Application Development team worked with Revolution Analytics Consultants to build a cloud-based supply chain analytics platform
Key Technology and Services: R for Big Data Analytics, Consulting, Training
Analytic Approach: Aggregate data from 15 data sources including ERP data, store sales data and sales forecast data to 25,000 store locations, 50 SKUs nightly across 6 forecast models, order planning models, running back tests and validation. Worked with client to establish big data environment and models that will generate 6.5 billion computations daily by end of the year (in a 4-hour window for processing). Scale and performance will allow new capabilities such as seasonality, promotions and incentives.
>Sales and Demand Data Analysis
>R/RRE Model Development
Bottom line: Work with client to develop predictive models, starting with rigorous forecasts across various models, generating forecast statistics and scoring each model against historical data to come up with the best fit. The forecast is input into an order-planning model that generates recommendations to optimize product distribution and ensure in-stock rate targets are achieved so that the right amount of product is in the right location at the right time.
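The model-selection step described above (score each forecast model against historical data and keep the best fit) can be sketched as a walk-forward backtest. This is a minimal illustration in Python, not the client's R/RRE implementation: the three toy models, the function names, the mean-absolute-error metric and the sales figures are all assumptions made for the example.

```python
from statistics import mean


def naive_forecast(history):
    """Predict the next value as the last observed value."""
    return history[-1]


def moving_average_forecast(history, window=3):
    """Predict the next value as the mean of the last `window` values."""
    return mean(history[-window:])


def seasonal_naive_forecast(history, season=4):
    """Predict the next value as the value observed one season earlier."""
    return history[-season] if len(history) >= season else history[-1]


def backtest(model, series, start):
    """Walk forward from index `start`: forecast each point using only the
    data before it, and return the mean absolute error of those forecasts."""
    errors = [abs(model(series[:t]) - series[t]) for t in range(start, len(series))]
    return mean(errors)


def best_model(models, series, start):
    """Score every candidate model and return (name of best model, all scores)."""
    scores = {name: backtest(fn, series, start) for name, fn in models.items()}
    return min(scores, key=scores.get), scores


# Hypothetical candidate models (the slide mentions 6; three are enough here).
MODELS = {
    "naive": naive_forecast,
    "moving_average": moving_average_forecast,
    "seasonal_naive": seasonal_naive_forecast,
}

# Hypothetical unit sales for one SKU at one store, with a period-4 pattern.
sales = [10, 20, 30, 40, 10, 20, 30, 40, 10, 20, 30, 40]

winner, scores = best_model(MODELS, sales, start=4)
print(winner)
print(scores)
```

In a pipeline like the one on the slide, the winning model's forecast would then be passed to the order-planning step; that step is not sketched here.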
“The amount of analytic horsepower required for this application cannot be supported in traditional means; it would require millions of dollars of hardware. R + Hadoop is allowing us to have the compute capacity to run 6.5 billion computations on nightly basis to generate order plans for our clients.”
- VP Application Development
Confidential – Do Not Distribute
Model Development for Vehicle Data Analysis
Profile: The Analytics R&D team of the multinational automobile manufacturer worked with Revolution Analytics Consultants to perform Survival Analysis, and to build and deploy Decision Trees and Time Series models
Key Technology and Services: Revolution R Enterprise for Big Data Analytics, Consulting, Training
Analytic Approach – Warranty Data Analysis: Estimating the life of an automobile component using Survival Analysis with Cox proportional hazards. Models are trained using historical data, consisting of warranty claims and region- and weather-related variables such as snow, rain and temperature.
Outcome: New analytics paradigm for existing processes introduced, with potential for millions of dollars in cost savings through improved warranty contracts and re-designed automobile components.
>Warranty & Sensor Data Analysis
>R/Revolution R Enterprise Training
Analytic Approach – Sensor Data Analysis: Use sensor data from vehicle components to build Decision Trees for classification, and to establish range of predicted values for sensor readings so that actual readings can be analyzed for outliers.
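The range check described above can be sketched as follows. The slide describes Decision Trees; this illustration swaps in a much simpler stand-in, grouping historical readings by a single operating condition (roughly what a one-level tree would do) and treating the group mean plus or minus three standard deviations as the expected range. The function names, the condition labels, the threshold `k` and the readings are all assumptions made for the example.

```python
from collections import defaultdict
from statistics import mean, stdev


def fit_ranges(history, k=3.0):
    """Build an expected (low, high) range per operating condition.

    `history` is an iterable of (condition, reading) pairs. The range is the
    group mean +/- k standard deviations (an assumed rule, not the slide's).
    """
    groups = defaultdict(list)
    for condition, reading in history:
        groups[condition].append(reading)

    ranges = {}
    for condition, values in groups.items():
        centre = mean(values)
        spread = stdev(values) if len(values) > 1 else 0.0
        ranges[condition] = (centre - k * spread, centre + k * spread)
    return ranges


def find_outliers(ranges, readings):
    """Return the readings that fall outside the expected range for their
    condition. Readings for conditions never seen in training are skipped."""
    flagged = []
    for condition, reading in readings:
        if condition not in ranges:
            continue
        low, high = ranges[condition]
        if not (low <= reading <= high):
            flagged.append((condition, reading))
    return flagged


# Hypothetical historical sensor readings, labelled by ambient condition.
history = [
    ("cold", 70), ("cold", 72), ("cold", 71), ("cold", 69),
    ("hot", 95), ("hot", 97), ("hot", 96), ("hot", 94),
]
ranges = fit_ranges(history)

# Hypothetical new readings to screen.
new_readings = [("cold", 71), ("cold", 95), ("hot", 96), ("hot", 60)]
flagged = find_outliers(ranges, new_readings)
print(flagged)
```

A fitted tree would replace the single `condition` key with the leaf a reading falls into; the screening step stays the same.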
Bottom line: New analytics initiative for building an intelligent automobile system that’s capable of guiding the driver upon detection of anomalies in driving patterns.
“The consultants and training instructors from Revolution Analytics were very knowledgeable and supported me very well. I am looking forward to taking my learnings to the larger analytics team at my company.”
- Senior Researcher, Analytics R&D
R Package Validation
Profile: The Clinical Trials Analytics team at the multinational biopharmaceutical company moved from SAS to R to develop big data analytics for Clinical Trials
Key Technology and Services: RUnit testing framework, Revolution R Enterprise (RRE) and open source R
Approach: Validate third-party (user-contributed) R packages from CRAN by executing unit and regression tests for functions both in the stated base package and its dependent packages.
Outcome: Client moving from SAS to RRE for new analytics initiatives for improved performance and cost savings, and requires validation for user-contributed packages for reliability and compliance.
Challenge: The Clinical Trials Analytics team had “big data” and “big computation” challenges, and needed a centralized, scalable, and high-performance platform to concurrently run the analytic models for faster analysis.
Bottom Line: Revolution R Enterprise acts as their statistical analytics platform, providing a centralized and scalable platform for tens of data scientists and analysts.
User-contributed, Open Source R package validation for Clinical Trial compliance to support move from SAS to R & RRE
Model Optimization for Customer Analytics
Profile: The advanced analytics & IT Infrastructure teams at the Las Vegas-based gaming corporation build and deploy analytical models for internal customers such as Marketing & Sales.
Key Technology and Services: Hadoop, Open Source R, Consulting and Training
Analytic Approach: Assess the end-to-end flow of the current Guest Scoring model, and re-write the existing rmr/R code using optimization techniques.
Outcome: 84% reduction in run time of the Guest Scoring model, which helps the gaming company target their customers with a customized marketing campaign within minutes of performing a new activity such as checking into the hotel or buying tickets to a show.
Challenge: The IT Infrastructure team at the company was challenged to support innovative, R-powered big data analytics initiatives and needed to optimize their Analytics and Visualization architecture.
Bottom line: Revolution Analytics consultants helped re-write R analytics running inside Hadoop to achieve superior performance and, as a second project, designed a big data architecture incorporating Cloudera, Teradata, Alteryx and Tableau.
“Excellent work, Revolution!! We’re very glad that you came on board to help us. Revolution Consultants get an A+.”
- Technical Program Manager, Big Data Initiatives
> 84% improvement in performance & reliability of Guest Scoring model
> Multi-layer big data infrastructure architecture design
Revolution Analytics Services Overview
Training
• On-Site or Remote Classes
• Classroom or Self Paced
• Standard or Tailored
Project Services
• Analytics Strategy
• Analytics Architecture
• Full Life Cycle Projects
• Application Migration
• Proof of concept
• Staff Augmentation
• Package Certification
Quick Start Services
• Pre-production
• Jumpstart value
• Combines software, training, and services
Post Go-Live Support
• Technical Account Management
• On-going Training
Poll #3
What's the biggest R need at your company?
Why are so many companies using R?
Big Data
Data Science
Competition and Innovation
Open Source
Ecosystem
Q&A / Resources
What is R?
revolutionanalytics.com/what-is-r
Companies using R
revolutionanalytics.com/companies-using-r
AcademyR training
revolutionanalytics.com/AcademyR
AcademyR Certification
revolutionanalytics.com/AcademyR-certification
Contact Revolution Analytics
revolutionanalytics.com/contact-us
Thank you
Join us August 7th at 10:00 AM, Pacific, for our Moving from SAS to R webinar. Please visit our website to register.
www.revolutionanalytics.com, 1.855.GET.REVO, Twitter: @RevolutionR
39

More Related Content

What's hot

DeployR: Revolution R Enterprise with Business Intelligence Applications
DeployR: Revolution R Enterprise with Business Intelligence ApplicationsDeployR: Revolution R Enterprise with Business Intelligence Applications
DeployR: Revolution R Enterprise with Business Intelligence ApplicationsRevolution Analytics
 
Big Data Analytics with R
Big Data Analytics with RBig Data Analytics with R
Big Data Analytics with RGreat Wide Open
 
Revolution R Enterprise 7.4 - Presentation by Bill Jacobs 11Jun15
Revolution R Enterprise 7.4 - Presentation by Bill Jacobs 11Jun15Revolution R Enterprise 7.4 - Presentation by Bill Jacobs 11Jun15
Revolution R Enterprise 7.4 - Presentation by Bill Jacobs 11Jun15Revolution Analytics
 
The Keys to Digital Transformation
The Keys to Digital TransformationThe Keys to Digital Transformation
The Keys to Digital TransformationMapR Technologies
 
Stephen Cantrell, kdb+ Developer at Kx Systems “Kdb+: How Wall Street Tech c...
Stephen Cantrell, kdb+ Developer at Kx Systems  “Kdb+: How Wall Street Tech c...Stephen Cantrell, kdb+ Developer at Kx Systems  “Kdb+: How Wall Street Tech c...
Stephen Cantrell, kdb+ Developer at Kx Systems “Kdb+: How Wall Street Tech c...Dataconomy Media
 
Insight Platforms Accelerate Digital Transformation
Insight Platforms Accelerate Digital TransformationInsight Platforms Accelerate Digital Transformation
Insight Platforms Accelerate Digital TransformationMapR Technologies
 
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachAgile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachSoftServe
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Dataconomy Media
 
Risk Analysis in the Financial Services Industry
Risk Analysis in the Financial Services IndustryRisk Analysis in the Financial Services Industry
Risk Analysis in the Financial Services IndustryRevolution Analytics
 
The Rise of the DataOps - Dataiku - J On the Beach 2016
The Rise of the DataOps - Dataiku - J On the Beach 2016 The Rise of the DataOps - Dataiku - J On the Beach 2016
The Rise of the DataOps - Dataiku - J On the Beach 2016 Dataiku
 
Turning an idea into a Data-Driven Production System: An Energy Load Forecas...
 Turning an idea into a Data-Driven Production System: An Energy Load Forecas... Turning an idea into a Data-Driven Production System: An Energy Load Forecas...
Turning an idea into a Data-Driven Production System: An Energy Load Forecas...Big Data Spain
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataMapR Technologies
 
LinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchLinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchSheetal Pratik
 
MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR Technologies
 
Data Discoverability at SpotHero
Data Discoverability at SpotHeroData Discoverability at SpotHero
Data Discoverability at SpotHeroMaggie Hays
 
Big data bi-mature-oanyc summit
Big data bi-mature-oanyc summitBig data bi-mature-oanyc summit
Big data bi-mature-oanyc summitOpen Analytics
 
SplunkSummit 2015 - Real World Big Data Architecture
SplunkSummit 2015 -  Real World Big Data ArchitectureSplunkSummit 2015 -  Real World Big Data Architecture
SplunkSummit 2015 - Real World Big Data ArchitectureSplunk
 
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureEnabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureMapR Technologies
 

What's hot (20)

DeployR: Revolution R Enterprise with Business Intelligence Applications
DeployR: Revolution R Enterprise with Business Intelligence ApplicationsDeployR: Revolution R Enterprise with Business Intelligence Applications
DeployR: Revolution R Enterprise with Business Intelligence Applications
 
Big Data Analytics with R
Big Data Analytics with RBig Data Analytics with R
Big Data Analytics with R
 
Revolution R Enterprise 7.4 - Presentation by Bill Jacobs 11Jun15
Revolution R Enterprise 7.4 - Presentation by Bill Jacobs 11Jun15Revolution R Enterprise 7.4 - Presentation by Bill Jacobs 11Jun15
Revolution R Enterprise 7.4 - Presentation by Bill Jacobs 11Jun15
 
The Keys to Digital Transformation
The Keys to Digital TransformationThe Keys to Digital Transformation
The Keys to Digital Transformation
 
Stephen Cantrell, kdb+ Developer at Kx Systems “Kdb+: How Wall Street Tech c...
Stephen Cantrell, kdb+ Developer at Kx Systems  “Kdb+: How Wall Street Tech c...Stephen Cantrell, kdb+ Developer at Kx Systems  “Kdb+: How Wall Street Tech c...
Stephen Cantrell, kdb+ Developer at Kx Systems “Kdb+: How Wall Street Tech c...
 
Insight Platforms Accelerate Digital Transformation
Insight Platforms Accelerate Digital TransformationInsight Platforms Accelerate Digital Transformation
Insight Platforms Accelerate Digital Transformation
 
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachAgile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric Approach
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
 
Risk Analysis in the Financial Services Industry
Risk Analysis in the Financial Services IndustryRisk Analysis in the Financial Services Industry
Risk Analysis in the Financial Services Industry
 
The Rise of the DataOps - Dataiku - J On the Beach 2016
The Rise of the DataOps - Dataiku - J On the Beach 2016 The Rise of the DataOps - Dataiku - J On the Beach 2016
The Rise of the DataOps - Dataiku - J On the Beach 2016
 
Turning an idea into a Data-Driven Production System: An Energy Load Forecas...
 Turning an idea into a Data-Driven Production System: An Energy Load Forecas... Turning an idea into a Data-Driven Production System: An Energy Load Forecas...
Turning an idea into a Data-Driven Production System: An Energy Load Forecas...
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your Data
 
LinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchLinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbench
 
MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -
 
Data Discoverability at SpotHero
Data Discoverability at SpotHeroData Discoverability at SpotHero
Data Discoverability at SpotHero
 
Managing a Multi-Tenant Data Lake
Managing a Multi-Tenant Data LakeManaging a Multi-Tenant Data Lake
Managing a Multi-Tenant Data Lake
 
Big data bi-mature-oanyc summit
Big data bi-mature-oanyc summitBig data bi-mature-oanyc summit
Big data bi-mature-oanyc summit
 
SplunkSummit 2015 - Real World Big Data Architecture
SplunkSummit 2015 -  Real World Big Data ArchitectureSplunkSummit 2015 -  Real World Big Data Architecture
SplunkSummit 2015 - Real World Big Data Architecture
 
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureEnabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data Capture
 
Executive Intro to R
Executive Intro to RExecutive Intro to R
Executive Intro to R
 

Similar to Applications of R in Business Analytics and Big Data

05Nov13 Webinar: Introducing Revolution R Enterprise 7 - The Big Data Big Ana...
05Nov13 Webinar: Introducing Revolution R Enterprise 7 - The Big Data Big Ana...05Nov13 Webinar: Introducing Revolution R Enterprise 7 - The Big Data Big Ana...
05Nov13 Webinar: Introducing Revolution R Enterprise 7 - The Big Data Big Ana...Revolution Analytics
 
Big data analytics on teradata with revolution r enterprise bill jacobs
Big data analytics on teradata with revolution r enterprise   bill jacobsBig data analytics on teradata with revolution r enterprise   bill jacobs
Big data analytics on teradata with revolution r enterprise bill jacobsBill Jacobs
 
Are You Ready for Big Data Big Analytics?
Are You Ready for Big Data Big Analytics? Are You Ready for Big Data Big Analytics?
Are You Ready for Big Data Big Analytics? Revolution Analytics
 
R for SAS Users Complement or Replace Two Strategies
R for SAS Users Complement or Replace Two StrategiesR for SAS Users Complement or Replace Two Strategies
R for SAS Users Complement or Replace Two StrategiesRevolution Analytics
 
Batter Up! Advanced Sports Analytics with R and Storm
Batter Up! Advanced Sports Analytics with R and StormBatter Up! Advanced Sports Analytics with R and Storm
Batter Up! Advanced Sports Analytics with R and StormRevolution Analytics
 
Crawl, Walk, Run: How to Get Started with Hadoop
Crawl, Walk, Run: How to Get Started with HadoopCrawl, Walk, Run: How to Get Started with Hadoop
Crawl, Walk, Run: How to Get Started with HadoopInside Analysis
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017Neo4j
 
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEnWCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEnWCIT 2014
 
Are you getting the most out of your data?
Are you getting the most out of your data?Are you getting the most out of your data?
Are you getting the most out of your data?SAS Canada
 
How to Identify, Train or Become a Data Scientist
How to Identify, Train or Become a Data ScientistHow to Identify, Train or Become a Data Scientist
How to Identify, Train or Become a Data ScientistInside Analysis
 
Gain a Holistic View of your Customer's Journey
Gain a Holistic View of your Customer's JourneyGain a Holistic View of your Customer's Journey
Gain a Holistic View of your Customer's JourneyPlatfora
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US InformationJulian Tong
 
Riding the wave of analytics revolution
Riding the wave of analytics revolutionRiding the wave of analytics revolution
Riding the wave of analytics revolutionTanuj Poddar
 
Kristof Coussement - The Debate: the Future of (Big) Data Analytics Software
Kristof Coussement - The Debate: the Future of (Big) Data Analytics SoftwareKristof Coussement - The Debate: the Future of (Big) Data Analytics Software
Kristof Coussement - The Debate: the Future of (Big) Data Analytics SoftwareBAQMaR
 
SIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess QlikSIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess QlikBardess Group
 
Revolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleRevolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleBardess Group
 
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big DataInfochimps, a CSC Big Data Business
 

Similar to Applications of R in Business Analytics and Big Data (20)

05Nov13 Webinar: Introducing Revolution R Enterprise 7 - The Big Data Big Ana...
05Nov13 Webinar: Introducing Revolution R Enterprise 7 - The Big Data Big Ana...05Nov13 Webinar: Introducing Revolution R Enterprise 7 - The Big Data Big Ana...
05Nov13 Webinar: Introducing Revolution R Enterprise 7 - The Big Data Big Ana...
 
Big data analytics on teradata with revolution r enterprise bill jacobs
Big data analytics on teradata with revolution r enterprise   bill jacobsBig data analytics on teradata with revolution r enterprise   bill jacobs
Big data analytics on teradata with revolution r enterprise bill jacobs
 
Revolution Analytics Podcast
Revolution Analytics PodcastRevolution Analytics Podcast
Revolution Analytics Podcast
 
Are You Ready for Big Data Big Analytics?
Are You Ready for Big Data Big Analytics? Are You Ready for Big Data Big Analytics?
Are You Ready for Big Data Big Analytics?
 
R for SAS Users Complement or Replace Two Strategies
R for SAS Users Complement or Replace Two StrategiesR for SAS Users Complement or Replace Two Strategies
R for SAS Users Complement or Replace Two Strategies
 
Batter Up! Advanced Sports Analytics with R and Storm
Batter Up! Advanced Sports Analytics with R and StormBatter Up! Advanced Sports Analytics with R and Storm
Applications of R in Business Analytics and Big Data

  • 1. Applications in R: Success and Lessons Learned from the Marketplace. David Smith, Chief Community Officer, Revolution Analytics. Neera Talbert, VP Professional Services, Revolution Analytics. July 29, 2014
  • 2. Agenda: Introduction to R; Growth of R; Applications of R; Q&A. David Smith, Chief Community Officer, @revodavid. Editor, blog.revolutionanalytics.com. Co-author, “Introduction to R”
  • 3. OUR COMPANY: The leading provider of advanced analytics software and services based on open source R, since 2007. OUR SOFTWARE: The only Big Data, Big Analytics software platform based on the data science language R. SOME KUDOS: Visionary, Gartner Magic Quadrant for Advanced Analytics Platforms, 2014
  • 4. What is R? • Most widely used data analysis software: used by 2M+ data scientists, statisticians and analysts • Most powerful statistical programming language: flexible, extensible and comprehensive for productivity • Create beautiful and unique data visualizations: as seen in New York Times, Twitter and Flowing Data • Thriving open-source community: leading edge of analytics research • Fills the talent gap: new graduates prefer R. www.revolutionanalytics.com/what-r
  • 5. Poll #1: What data analysis software is used where you work?
  • 6. R’s popularity is growing rapidly. R Usage Growth: Rexer Data Miner Survey, 2007-2013. Language Popularity: IEEE Spectrum Top Programming Languages, July 2014 (#9: R)
  • 7. R is among the highest-paid IT skills in the US. Dice Tech Salary Survey, January 2014; O’Reilly Strata 2013 Data Science Salary Survey
  • 8. Technical Support for Open Source R. AdviseR™ from Revolution Analytics: technical support for open source R, from the R experts. • Email and phone support 8AM-6PM, Mon-Fri • Support for R, validated packages, and third-party software connections • On-line case management and knowledgebase • Access to technical resources, documentation and user forums • Exclusive on-line webinars from community experts • Guaranteed response times. Also available: expert hands-on and on-line training for R, from Revolution Analytics AcademyR. www.revolutionanalytics.com/AdviseR R SUPPORT: 12 MONTHS, $795 PER USER
  • 10. Facebook • Exploratory Data Analysis • Experimental Analysis “Generally, we use R to move fast when we get a new data set. With R, we don’t need to develop custom tools or write a bunch of code. Instead, we can just go about cleaning and exploring the data.” — Solomon Messing, data scientist at Facebook
  • 11. Facebook • Big-Data Visualization “It resonated with many people. It's not just a pretty picture, it's a reaffirmation of the impact we have in connecting people, even across oceans and borders.” — Paul Butler, data scientist, Facebook
  • 12. Google • Advertising Effectiveness • Economic forecasting “The great beauty of R is that you can modify it to do all sorts of things.” - Hal Varian, Chief Economist, Google. “R is really important to the point that it's hard to overvalue it.” - Daryl Pregibon, Head of Statistics, Google
  • 13. Twitter • Data Visualization • Semantic clustering “A common pattern for me is that I'll code a MapReduce job in Scala, do some simple command-line munging on the results, pass the data into Python or R for further analysis, pull from a database to grab some extra fields, and so on, often integrating what I find into some machine learning models in the end.” - Ed Chen, Data Scientist, Twitter
  • 14. City of Chicago. Public Health • Food poisoning monitor
  • 15. The New York Times. Interactive Features • Election Forecast • Dialect Quiz. Data Journalism • NFL Draft Picks • Wealth distribution in USA
  • 16. The New York Times. Data Visualization • Facebook IPO • Baseball legends
  • 17. Video Games • Multiplayer Matchmaking • Player Churn • Game design • Difficulty curve • Level trouble-spots • In-game purchase optimization • Fraud detection • Player communities • Game Analysis
  • 18. Real Estate. Housing • Crime mapping • choroplethr package • Statistical forecasting “The core innovation that Zillow offers are its advanced statistical predictive products, including the Zestimate®, the Rent Zestimate and the ZHVI® family of real estate indexes. By using R in production as well as research, Zillow maximizes flexibility and minimizes the latency in rolling out updates and new products.”
  • 19. Finance and Banking • Credit Risk Analysis • Financial Networks
  • 20. Pharmaceuticals. Regulatory Drug Approvals • Reproducible research • Accurate, reliable and consistent statistical analysis • Internal reporting (Section 508 compliance) “R use at the FDA is completely acceptable and has not caused any problems.” - Dr Jae Brodsky, Office of Biostatistics, Food and Drug Administration
  • 21. Marketing Analytics. Power: “We’ve combined Revolution R Enterprise and Hadoop to build and deploy customized exploratory data analysis and GAM survival models for our marketing performance management and attribution platform. Given that our data sets are already in the terabytes and are growing rapidly, we depend on Revolution R Enterprise’s scalability and power – we saw about a 4x performance improvement on 50 million records. It works brilliantly.” - CEO, John Wallace, DataSong. 4X performance; 50M records scored daily. Scalability: “We’ve been able to scale our solution to a problem that’s so big that most companies could not address it. If we had to go with a different solution we wouldn’t be as efficient as we are now.” - SVP Analytics, Kevin Lyons, eXelate. Terabytes of data from 200+ data sources; tens of thousands of attributes; hundreds of millions of scores daily; 2X data and 2X attributes with no impact on performance. Performance: “We need a high-performance analytics infrastructure because marketing optimization is a lot like a financial trading. By watching the market constantly for data or market condition updates, we can now identify opportunities for our clients that would otherwise be lost.” - Chief Analytics Officer, Leon Zemel, [x+1]
  • 22. Revolution R Enterprise is the Big Data Big Analytics Platform. All of Open Source R plus: • Big Data scalability • High-performance analytics • Development and deployment tools • Data source connectivity • Application integration framework • Multi-platform architecture • Technical Support • Available training and services
  • 23. Poll #2: What kinds of R projects are underway where you work?
  • 24. Neera Talbert, VP Big Data & Advanced Analytic Services • Leads Services at Revolution Analytics • Fifteen years of experience in the business analytics software industry • Works with Fortune 500 companies to define analytics strategy, implement analytics-based decision making, reduce decision latency, and increase speed of decision making – Analytics, business intelligence, big data analytics, risk – Customer intelligence, supply chain, manufacturing, retail, oil & gas, public sector
  • 25. Organizational Readiness “There will be almost half a million jobs in five years, and a shortage of up to 190,000 qualified data scientists, plus a need for 1.5 million executives and support staff who have an understanding of data” McKinsey Global Institute April 2013
  • 26. Opportunity to develop talent. Data Science: “the sexiest job in the 21st century,” - Harvard Business Review. A cross between computer engineers, statisticians and business analysts – people who ask good questions and are open to working with unstructured information. Universities can’t produce them fast enough – need 60% more resources – McKinsey Global Institute
  • 27. Our Philosophy “The Hands-on exercises were the best part of Revolution Analytics training” - A participant from a global telecom company
  • 29. RRE Certification Testing • Demonstrate your R and RRE programming knowledge: Fundamentals in R Language; Data Management in Revolution R Enterprise; Modeling in Revolution R Enterprise • Independently proctored exam, online and onsite
  • 30. Training Data Science team for Big Data Analytics. Profile: Multi-channel marketing attribution and analytics software developer and service provider. Growing, innovative, cost-conscious. Key Technology: Revolution R Enterprise and Hadoop, replacing SAS and Open Source R. Outcomes: Massively scalable infrastructure to support attribution and optimization at an individual customer level (segments of one) for clients such as Williams-Sonoma. Client saved $250K in one campaign. Rapid development and deployment of customer-specific models, using innovative analytic techniques such as big data GAM Survival models. Bottom Line: Driving revenue lift and cost savings through marketing optimization. 4X performance; 50M+ records scored daily. “Given that our data sets are already in the terabytes and are growing rapidly, we depend on Revolution R Enterprise’s scalability and power. We saw about a 4x performance improvement on 50 million records. It works brilliantly.” - CEO, John Wallace (DataSong, formerly named UpStream)
  • 31. Model Development for Supply Chain Analytics with Hadoop. Profile: The Application Development team worked with Revolution Analytics Consultants to build a cloud-based supply chain analytics platform. Key Technology and Services: R for Big Data Analytics, Consulting, Training. Analytic Approach: Aggregate data from 15 data sources including ERP data, store sales data and sales forecast data to 25,000 store locations, 50 SKUs nightly across 6 forecast models, order planning models, running back tests and validation. Worked with client to establish big data environment and models that will generate 6.5 billion computations daily by end of the year (in a 4-hour window for processing). Scale and performance will allow new capabilities such as seasonality, promotions and incentives. Bottom line: Work with client to develop predictive models, starting with rigorous forecasts across various models, generating forecast statistics and scoring each model against historical data to come up with the best fit. The forecast is input into an order-planning model that generates recommendations to optimize product distribution and ensure in-stock rate targets are achieved so that the right amount of product is in the right location at the right time. > Sales and Demand Data Analysis > R/RRE Model Development “The amount of analytic horsepower required for this application cannot be supported in traditional means; it would require millions of dollars of hardware. R + Hadoop is allowing us to have the compute capacity to run 6.5 billion computations on nightly basis to generate order plans for our clients.” - VP Application Development. Confidential – Do Not Distribute
  • 32. Model Development for Vehicle Data Analysis. Profile: The Analytics R&D team of the multinational automobile manufacturer worked with Revolution Analytics Consultants to perform Survival Analysis, and to build and deploy Decision Trees and Time Series models. Key Technology and Services: Revolution R Enterprise for Big Data Analytics, Consulting, Training. Analytic Approach – Warranty Data Analysis: Estimating the life of an automobile component using Survival Analysis with Cox proportional hazards. Models are trained using historical data, consisting of warranty claims, and region and weather related variables such as snow, rain, temperature etc. Analytic Approach – Sensor Data Analysis: Use sensor data from vehicle components to build Decision Trees for classification, and to establish range of predicted values for sensor readings so that actual readings can be analyzed for outliers. Outcome: New analytics paradigm for existing processes introduced, with potential for millions of dollars in cost savings through improved warranty contracts, and re-designed automobile components. Bottom line: New analytics initiative for building an intelligent automobile system that’s capable of guiding the driver upon detection of anomalies in driving patterns. > Warranty & Sensor Data Analysis > R/Revolution R Enterprise Training “The consultants and training instructors from Revolution Analytics were very knowledgeable and supported me very well. I am looking forward to taking my learnings to the larger analytics team at my company.” - Senior Researcher, Analytics R&D
  • 33. R Package Validation: user-contributed, open source R package validation for Clinical Trial compliance to support move from SAS to R & RRE. Profile: The Clinical Trials Analytics team at the multinational biopharmaceutical company moved from SAS to R to develop big data analytics for Clinical Trials. Key Technology and Services: RUnit testing framework, Revolution R Enterprise (RRE) and open source R. Challenge: The Clinical Trials Analytics team had “big data” and “big computation” challenges, and needed a centralized, scalable, and high-performance platform to concurrently run the analytic models for faster analysis. Approach: Validate third party (user-contributed) R packages from CRAN by executing unit and regression tests for functions both in the stated base package and its dependent packages. Outcome: Client moving from SAS to RRE for new analytics initiatives for improved performance and cost savings, and requires validation for user-contributed packages for reliability and compliance. Bottom Line: Revolution R Enterprise acts as their statistical analytics platform providing a centralized and scalable platform for tens of data scientists and analysts.
  • 34. Model Optimization for Customer Analytics. Profile: The advanced analytics & IT Infrastructure teams at the Las Vegas-based gaming corporation build and deploy analytical models for internal customers such as Marketing & Sales. Key Technology and Services: Hadoop, Open Source R, Consulting and Training. Challenge: The IT Infrastructure team at the company was challenged to support innovative, R-powered big data analytics initiatives and needed to optimize their Analytics and Visualization architecture. Analytic Approach: Assess the end-to-end flow of the current Guest scoring model, and re-write the existing rmr/R code using optimization techniques. Outcome: 84% reduction in run time of the Guest Scoring model, which helps the gaming company target their customers with a customized marketing campaign within minutes of performing a new activity such as checking into the hotel, and buying tickets to a show. Bottom line: Revolution Analytics consultants helped re-write R analytics running inside Hadoop to achieve superior performance and, as a second project, designed a big data architecture incorporating Cloudera, Teradata, Alteryx and Tableau. > 84% improvement in performance & reliability of Guest Scoring model > Multi-layer big data infrastructure architecture design “Excellent work, Revolution!! We’re very glad that you came on board to help us. Revolution Consultants get an A+.” - Technical Program Manager, Big Data Initiatives
  • 35. Revolution Analytics Services Overview. Training • On-Site or Remote Classes • Classroom or Self Paced • Standard or Tailored. Project Services • Analytics Strategy • Analytics Architecture • Full Life Cycle Projects • Application Migration • Proof of concept • Staff Augmentation • Package Certification. Quick Start Services • Pre-production • Jumpstart value • Combines software, training, and services. Post Go-Live Support • Technical Account Management • On-going Training
  • 36. Poll #3: What's the biggest R need at your company?
  • 37. Why are so many companies using R? • Big Data • Data Science • Competition and Innovation • Open Source Ecosystem
  • 38. Q&A / Resources • What is R? revolutionanalytics.com/what-is-r • Companies using R: revolutionanalytics.com/companies-using-r • AcademyR training: revolutionanalytics.com/AcademyR • AcademyR Certification: revolutionanalytics.com/AcademyR-certification • Contact Revolution Analytics: revolutionanalytics.com/contact-us
  • 39. Thank you. Join us August 7th at 10:00 AM, Pacific, for our Moving from SAS to R webinar. Please visit our website to register. www.revolutionanalytics.com, 1.855.GET.REVO, Twitter: @RevolutionR
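
Slide 31 describes generating forecasts from several models and scoring each model against historical data to find the best fit. A minimal sketch of that selection step is shown here, written in Python rather than the R/RRE used in the project. The three toy models, the mean-absolute-error score, the `sales` series and every function name are illustrative assumptions, not the client's actual models or data.

```python
from statistics import mean

# Three toy one-step-ahead forecasters (hypothetical stand-ins for real models).
def naive_last(history):
    """Forecast the next value as the most recent observation."""
    return history[-1]

def moving_average(history, window=3):
    """Forecast the next value as the mean of the last `window` observations."""
    return mean(history[-window:])

def seasonal_naive(history, period=7):
    """Forecast the next value as the observation one season (period) ago."""
    return history[-period] if len(history) >= period else history[-1]

def backtest(model, series, start):
    """Mean absolute error of one-step-ahead forecasts for series[start:]."""
    errors = [abs(model(series[:t]) - series[t]) for t in range(start, len(series))]
    return mean(errors)

def best_fit(models, series, start):
    """Score every model on the same history and return the lowest-error one."""
    scores = {name: backtest(fn, series, start) for name, fn in models.items()}
    return min(scores, key=scores.get), scores

MODELS = {
    "naive_last": naive_last,
    "moving_average": moving_average,
    "seasonal_naive": seasonal_naive,
}

# Made-up daily sales with an exact weekly pattern, repeated for four weeks.
sales = [10, 12, 14, 16, 18, 30, 32] * 4

winner, scores = best_fit(MODELS, sales, start=7)
print(winner, scores)
```

Because the made-up series repeats exactly every seven days, the seasonal model scores best here; on real data the ranking would depend on the series and on the error metric chosen.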