Introduction to R
For Big Data Analysis
Wednesday, October 13, 2015
6:00 pm – 6:45 pm
Raastech, Inc.
2201 Cooperative Way, Suite 600
Herndon, VA 20171
+1-703-884-2223
info@raastech.com
© Raastech, Inc. 2015 | All rights reserved. Slide 2 of 51@Raastech
About Me
 Harold Dost III @hdost
 7+ years of Oracle Middleware experience
 OCE (SOA Foundation Practitioner)
 Oracle ACE Associate
 From Michigan
 blog.raastech.com
About Raastech
 Small systems integrator founded in 2009
 Headquartered in the Washington DC area
 Specializes in Oracle Fusion Middleware
 Oracle Platinum Partner – 1 in 3,000 worldwide
 Oracle SOA Specialized – 1 in 1,500 worldwide
 Oracle ACE – 2 of 500 worldwide
 100% of consultants are Oracle certified
 100% of consultants present at major Oracle conferences
 100% of consultants have published books, whitepapers, or articles
Outline
1. Getting Started
 Installing R
 Installing Tools
 Getting Data
2. Understanding R
 Data Types
 Functions
 Data Import Mechanisms
Outline (Cont.)
3. Manipulating Data (Large Data Sets)
 Deriving Simple Statistics
 Graphing
4. Demo
5. Incorporating into an Enterprise
 Using Enterprise Data Sources
 Running R in your environment
 Getting familiar with Oracle's R offerings
Know CRAN
 Comprehensive
 R
 Archive
 Network
Installing R
 Windows https://cran.r-project.org/bin/windows/
 Mac https://cran.r-project.org/bin/macosx/
 Linux https://cran.r-project.org/bin/linux/
Development Tools
 RStudio - http://www.rstudio.com/products/rstudio/
 Open Source Edition
 Commercial License - $995
 Eclipse
 Sublime, TextPad, Other Simple Text Editors,…
Installing Packages
 Anything From CRAN
 Anywhere
install.packages(c("first", "second"))
> sudo R CMD INSTALL package-version.tar.gz
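A minimal sketch of installing only what is missing, so a script can be re-run without contacting CRAN every time. The helper name `missing_packages` and the package names are placeholders chosen for illustration, not part of the original slides.

```r
# Hypothetical helper: report which requested packages are not installed yet.
missing_packages <- function(pkgs) {
  installed <- rownames(installed.packages())
  setdiff(pkgs, installed)
}

# "stats" and "utils" ship with R; they stand in for real package names here.
needed <- c("stats", "utils")
to_install <- missing_packages(needed)

# Only call install.packages() when something is actually missing.
if (length(to_install) > 0) {
  install.packages(to_install)
}
```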
Data Types
 Vectors
 Matrices
 Arrays
 Data Frames
 Lists
 Factors
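As a rough illustration of the types listed above, the following sketch builds one small object of each kind. All variable names and values are made up for the example.

```r
v  <- c(1.5, 2.5, 3.5)                  # vector: one type, one dimension
m  <- matrix(1:6, nrow = 2, ncol = 3)   # matrix: one type, two dimensions
a  <- array(1:24, dim = c(2, 3, 4))     # array: one type, any number of dimensions
df <- data.frame(name  = c("a", "b"),   # data frame: columns may differ in type
                 score = c(90, 85))
l  <- list(id = 1L, tags = c("x", "y")) # list: elements of any type and length
f  <- factor(c("low", "high", "low"))   # factor: categorical values with levels

str(df)     # compact summary of the data frame's structure
levels(f)   # the distinct categories of the factor
```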
Special Values
 Infinity, Positive and Negative: Inf and -Inf
 Not A Number: NaN
 Not Available: NA
 Complex Numbers, 1+9i
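One way to see where these special values come from is to produce each one directly. This is a small sketch; the variable names are illustrative only.

```r
pos_inf   <- 1 / 0     # division by zero gives Inf
neg_inf   <- -1 / 0    # and -Inf for a negative numerator
not_num   <- 0 / 0     # an undefined result gives NaN
not_avail <- NA        # NA marks a missing value
z <- complex(real = 1, imaginary = 9)   # the complex number 1+9i

is.infinite(c(pos_inf, neg_inf))
is.nan(not_num)
is.na(not_avail)
Re(z)   # real part
Im(z)   # imaginary part
```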
Use Case for Infinities
 Finding Maximums and Minimums
 Placeholder values when others won’t work
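A sketch of the maximum-finding use case: starting from -Inf guarantees that any real value replaces the placeholder. The function name `running_max` is hypothetical; in practice the built-in max() does the same job.

```r
# Hypothetical helper: find the largest value by scanning, starting from -Inf.
running_max <- function(values) {
  best <- -Inf                 # placeholder smaller than every real number
  for (v in values) {
    if (v > best) best <- v
  }
  best
}

running_max(c(3, -7, 12, 5))   # 12
running_max(numeric(0))        # -Inf, because nothing replaced the placeholder
```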
Not a Number (NaN)
 It means something went wrong somewhere
 A missing argument
 Invalid number
 Check for it with is.nan(x) to keep it from propagating
 Don't use "==" to find NaN; the comparison returns NA, not TRUE
Assigning NaN
> a = NaN
> a
[1] NaN
Adding NaN
> b = 1
> c = a + b
> c
[1] NaN
Adding a number to NaN ("Not a Number") yields NaN.
Comparing NaN to Regular Number
> d = b == c
> d
[1] NA
Comparing a number to NaN ("Not a Number") with == yields NA
rather than TRUE or FALSE.
Comparing NaN to NaN
> e = c == a
> e
[1] NA
Comparing NaN ("Not a Number") to NaN with == also yields NA.
Detecting NaN
> a
[1] NaN
> is.nan(a)
[1] TRUE
> is.na(a)
[1] TRUE
Because NaN is not a proper number, use is.nan() to detect it;
is.na() also returns TRUE for NaN. NaN is usually the result of
an undefined calculation such as 0/0.
Detecting NA
> e = c == a
> e
[1] NA
> is.nan(e)
[1] FALSE
> is.na(e)
[1] TRUE
As with NaN, a special function (is.na) must be used to detect NA.
NA generally indicates that information is missing.
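The detection functions above can be applied to a whole vector at once. The following sketch uses made-up readings to show the difference between is.na() and is.nan(), and how to exclude both kinds of value from a calculation.

```r
readings <- c(4.2, NA, 0 / 0, 5.8)   # one NA and one NaN among real values

is.na(readings)    # TRUE for both the NA and the NaN entry
is.nan(readings)   # TRUE only for the NaN entry

# Many summary functions accept na.rm = TRUE to skip NA and NaN.
mean(readings, na.rm = TRUE)

# Or drop the unusable entries explicitly.
clean <- readings[!is.na(readings)]
```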
Operators
 Assignment ( ->, <-)
 Addition (+)
 Subtraction (-)
 Division (/)
 Multiplication (*)
 Exponent (^)
 Parentheses ( (, ) )
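A short sketch of the operators listed above; the variable names are illustrative only.

```r
x <- 10                    # leftward assignment
3 -> y                     # rightward assignment

total      <- x + y        # addition
difference <- x - y        # subtraction
ratio      <- x / y        # division
product    <- x * y        # multiplication
power      <- y ^ 2        # exponent

grouped   <- (x + y) * 2   # parentheses force the addition first
ungrouped <- x + y * 2     # without them, multiplication binds tighter
```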
Math Functions
 max()
 min()
 log()
 sqrt()
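For illustration, the functions above applied to a small made-up vector. Note that log() is the natural logarithm unless a base is given.

```r
values <- c(1, 4, 9, 16)

max(values)             # largest element
min(values)             # smallest element
sqrt(values)            # element-wise square root
log(values)             # element-wise natural logarithm
log(values, base = 2)   # logarithm to another base
```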
Deriving Simple Statistics
 Minimum
 Maximum
 Median
 Arithmetic Mean
 Function estimation
 Linear
 Log
 Exponential
 R-Values
 Standard Deviation
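A sketch of how these statistics might be computed with base R on made-up data. It assumes "R-Values" refers to the correlation coefficient, and it uses lm() for the linear, log, and exponential fits; the slides do not say which functions the talk used.

```r
# Made-up sample data for illustration.
x <- c(1, 2, 3, 4, 5)
y <- c(2.1, 3.9, 6.2, 7.8, 10.1)

min(y); max(y); median(y); mean(y); sd(y)

linear_fit <- lm(y ~ x)        # y = a + b * x
log_fit    <- lm(y ~ log(x))   # y = a + b * log(x)
exp_fit    <- lm(log(y) ~ x)   # log(y) = a + b * x, i.e. y = exp(a) * exp(b * x)

coef(linear_fit)               # intercept and slope of the linear fit
r <- cor(x, y)                 # correlation coefficient between x and y
```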
How to define your own functions
firstfunction <- function(arg1, arg2, ...) {
  statements
  return(someoutput)
}
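A concrete instance of the template above, as a sketch. The function name `value_range` and its `na.rm` argument are hypothetical choices for the example.

```r
# Hypothetical example: spread between the largest and smallest value.
value_range <- function(values, na.rm = TRUE) {
  if (na.rm) {
    values <- values[!is.na(values)]   # drop NA and NaN before computing
  }
  spread <- max(values) - min(values)
  return(spread)
}

value_range(c(3, NA, 10, 7))   # 7
```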
Twitter Example
 First Install the Package
install.packages("twitteR")
Twitter Example
 Authenticate
library(twitteR)
consumer = "CONSUMER KEY"
secret = "SECRET KEY"
setup_twitter_oauth(consumer, secret)
Twitter Example
 Get Trend Locations
 Choose a WOEID (Where On Earth ID) from the resulting list
availableTrendLocations()
Twitter Example
 Get Trends
trends = getTrends(SOMEWOEID)
Twitter Example
 Retrieve Tweets
tweets <- searchTwitter(trends[XX,XX],n=1500)
tweetdf <- do.call("rbind",lapply(tweets,as.data.frame))
Twitter Example
 Filter
 complete.cases is used to keep only rows whose value is neither NA nor NaN
tweetdf <- tweetdf[complete.cases(tweetdf[,15]),]
tweetdf <- tweetdf[tweetdf[,15] != 0,]
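The same two filtering steps can be tried without a Twitter account on a toy data frame. The data below is made up for illustration, and the column is selected by name rather than by position 15.

```r
# Toy stand-in for the tweet data frame; all values are invented.
toydf <- data.frame(
  screenName = c("a", "b", "c", "d"),
  longitude  = c(-77.4, NA, 0, -83.0),
  latitude   = c(38.9, 40.1, 0, 42.3)
)

# Step 1: keep rows where longitude is present (drops user "b").
toydf <- toydf[complete.cases(toydf[, "longitude"]), ]

# Step 2: drop rows where longitude is exactly 0 (drops user "c").
toydf <- toydf[toydf[, "longitude"] != 0, ]

nrow(toydf)   # 2
```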
Twitter Example
 Simplify the dataframe
simpledf <- tweetdf[c("screenName","longitude","latitude")]
Twitter Example
 Create Matrix from Dataframe
tweetMatrix <- data.matrix(simpledf[2:3],rownames.force = FALSE)
Twitter Example
 Plot the Latitude and Longitude
plot(tweetMatrix)
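A self-contained sketch of the last three steps (simplify, convert to a matrix, plot) on made-up coordinates. The null PDF device is an assumption added so the example also runs without a display; in an interactive session it can be omitted.

```r
# Invented data with the same three columns as the simplified data frame.
simple <- data.frame(
  screenName = c("a", "b", "c"),
  longitude  = c(-77.4, -83.0, -122.3),
  latitude   = c(38.9, 42.3, 47.6)
)

# Columns 2 and 3 (longitude, latitude) become a numeric matrix.
coords <- data.matrix(simple[2:3], rownames.force = FALSE)

pdf(NULL)   # null device: draw without opening a window or writing a file
plot(coords, xlab = "longitude", ylab = "latitude")
dev.off()
```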
Graphing
 Image
 Contour
 Box Plot
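Each of these plot types has a base R function. The sketch below uses the `volcano` and `mtcars` datasets that ship with R; the choice of datasets and the null PDF device are assumptions made for the example.

```r
pdf(NULL)   # null device so the example runs without a display

image(volcano)                      # heat-map style image of a numeric matrix
contour(volcano)                    # contour lines of the same matrix
boxplot(mpg ~ cyl, data = mtcars)   # box plot of mpg grouped by cylinder count

dev.off()
```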
K-Means
 Essentially a search algorithm
 Divides a dataset into k clusters
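A minimal sketch using the kmeans() function from base R's stats package on two made-up, well-separated groups of points. The data, the seed, and the choice of k = 2 are assumptions for illustration.

```r
set.seed(42)   # make the random data reproducible

# Two invented groups of 20 two-dimensional points, centred near 0 and near 5.
pts <- rbind(
  matrix(rnorm(40, mean = 0, sd = 0.3), ncol = 2),
  matrix(rnorm(40, mean = 5, sd = 0.3), ncol = 2)
)

km <- kmeans(pts, centers = 2, nstart = 5)   # k = 2, five random starts

km$centers          # coordinates of the two cluster centres
table(km$cluster)   # number of points assigned to each cluster
```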
Time Series
 Stock Quotes
 Infection Incidents
 Gas Prices
 Audio
 Etc.
Source: http://www.loc.gov/pictures/resource/hec.23488/
Time Series Analysis
 Regression
 Forecasting
 Time Frequency (FFTs)
Source: http://groups.csail.mit.edu/netmit/sFFT/algorithm.html
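The three kinds of analysis listed above can be sketched with base R on synthetic data. The price series and the sine signal are invented, and a straight-line lm() fit stands in for regression and forecasting; real forecasting would normally use a dedicated time series model.

```r
# Regression and forecasting: fit a trend to twelve made-up monthly prices.
months   <- 1:12
price    <- 2 + 0.1 * months
trend    <- lm(price ~ months)
forecast <- predict(trend, newdata = data.frame(months = 13))   # next month

# Time frequency: a sine wave with 8 cycles across 64 samples.
n         <- 0:63
signal    <- sin(2 * pi * 8 * n / 64)
magnitude <- Mod(fft(signal))

# Bin 0 is the constant term, so subtract 1 from the index of the peak.
peak_bin <- which.max(magnitude[1:32]) - 1
```

Here `peak_bin` recovers the 8 cycles built into the signal.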
Using Enterprise Data Sources
 Database
 Streams
 Files
 Etc.
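For files and streams, base R can read through file paths and connections. The sketch below uses an in-memory CSV string and a temporary file, both invented for the example. Database access needs an additional driver package and is not shown here.

```r
csv_text <- "id,amount\n1,10.5\n2,20.0\n3,NA"   # made-up order data

# Parse CSV held in memory.
orders <- read.csv(text = csv_text)

# Files: write the data to a temporary file and read it back.
path <- tempfile(fileext = ".csv")
write.csv(orders, path, row.names = FALSE)
from_file <- read.csv(path)

# Streams: read from a connection a piece at a time.
con    <- textConnection(csv_text)
header <- readLines(con, n = 1)   # first line only
rest   <- readLines(con)          # whatever remains on the connection
close(con)
```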
Oracle R Distribution
 Available on Oracle Public Yum
 Enhanced dynamic Library loading
 Enterprise Support Available
 Oracle Advanced Analytics
 Oracle Linux
 Oracle Big Data Appliance
 http://www.oracle.com/technetwork/database/database-technologies/r/r-distribution/overview/index.html
Oracle R Enterprise
 Component of the Oracle Advanced Analytics Option on Oracle Database EE
 Allows use of R in the database without SQL
 Save R Objects in the database
 Easily Integrate with OBIEE
 http://www.oracle.com/technetwork/database/database-technologies/r/r-enterprise/overview/index.html
Oracle R Advanced Analytics for Hadoop
 Component of the Oracle Big Data Software Connectors Suite, an option for the BDA
 Provides abstraction from HiveQL through R, just as Oracle R Enterprise does for SQL
 http://www.oracle.com/technetwork/database/database-technologies/bdc/r-advanalytics-for-hadoop/overview/index.html
ROracle
 Open Source Package
 Maintained by Oracle
 Uses OCI Interface to interact with databases
 http://www.oracle.com/technetwork/database/database-technologies/r/r-technologies/overview/index.html
Contact Information
 Harold Dost III
 Principal Consultant
 @hdost
 harold.dost@raastech.com
Resources
 https://en.wikibooks.org/wiki/Statistical_Analysis:_an_Introduction_using_R/R_basics
 http://www.r-project.org/
 https://docs.oracle.com/cd/E57012_01/doc.141/e56973/toc.htm
 http://cran.r-project.org/web/packages/akmeans/index.html
 http://cran.r-project.org/web/packages/twitteR/index.html
 http://en.wikipedia.org/wiki/K-means_clustering
 http://www.rdatamining.com/examples/kmeans-clustering
 http://blog.revolutionanalytics.com/2009/02/how-to-choose-a-random-number-in-r.html
 https://www.packtpub.com/books/content/text-mining-r-part-2
 http://www.eia.gov/totalenergy/data/monthly/index.cfm#consumption
 

Recently uploaded (20)

Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
In-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsIn-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT Professionals
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
Agentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfAgentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdf
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 
Speed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in MinutesSpeed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in Minutes
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG Evaluation
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 

Introduction to R for Big Data Analysis

  • 1. Introduction to R for Big Data Analysis. Wednesday, October 13, 2015, 6:00 pm – 6:45 pm. Raastech, Inc., 2201 Cooperative Way, Suite 600, Herndon, VA 20171; +1-703-884-2223; info@raastech.com
  • 2. About Me: Harold Dost III (@hdost); 7+ years of Oracle Middleware experience; OCE (SOA Foundation Practitioner); Oracle ACE Associate; from Michigan; blog.raastech.com
  • 3. About Raastech: Small systems integrator founded in 2009; headquartered in the Washington DC area; specializes in Oracle Fusion Middleware; Oracle Platinum Partner – 1 in 3,000 worldwide; Oracle SOA Specialized – 1 in 1,500 worldwide; Oracle ACE – 2 of 500 worldwide; 100% of consultants are Oracle certified; 100% of consultants present at major Oracle conferences; 100% of consultants have published books, whitepapers, or articles
  • 4. Outline: 1. Getting Started (Installing R, Installing Tools, Getting Data); 2. Understanding R (Data Types, Functions, Data Import Mechanisms)
  • 5. Outline (Cont.): 3. Manipulating Data (Large Data Sets) (Deriving Simple Statistics, Graphing); 4. Demo; 5. Incorporating into an Enterprise (Using Enterprise Data Sources, Running R in your environment, Familiarize with Oracle's R offerings)
  • 7. Know CRAN: Comprehensive R Archive Network
  • 8. Installing R: Windows, Mac, Linux
  • 9. Installing R: Windows https://cran.r-project.org/bin/windows/; Mac https://cran.r-project.org/bin/macosx/; Linux https://cran.r-project.org/bin/linux/
  • 10. Development Tools: RStudio (http://www.rstudio.com/products/rstudio/), available as an Open Source Edition or under a Commercial License ($995); Eclipse; Sublime, TextPad, and other simple text editors
  • 11. Installing Packages
        Anything from CRAN, from the R prompt:
        install.packages(c("first", "second"))
        Anywhere (a downloaded source archive), from a shell:
        sudo R CMD INSTALL package-version.tar.gz
  • 13. Data Types: Vectors, Matrices, Arrays, Data Frames, Lists, Factors
  • 14. Special Values: Infinity, positive and negative (Inf and -Inf); Not a Number (NaN); Not Available (NA); complex numbers (e.g. 1+9i)
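As a quick illustration of the special values listed above, here is a minimal base R sketch that produces each one and inspects it. The variable names are made up for this example.

```r
# Each special value can come from ordinary arithmetic or be typed literally.
pos_inf <- 1 / 0                               # positive infinity
neg_inf <- -1 / 0                              # negative infinity
not_num <- 0 / 0                               # undefined result: NaN
missing <- NA                                  # a value that is not available
z       <- complex(real = 1, imaginary = 9)    # same as the literal 1+9i

print(c(pos_inf, neg_inf, not_num, missing))
print(z)
print(Mod(z))                                  # modulus: sqrt(1^2 + 9^2)
```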
  • 15. Use Case for Infinities: Finding maximums and minimums; placeholder values when others won't work
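One way to read the "placeholder" use case: when searching for a maximum, starting from -Inf guarantees that any real value replaces the placeholder. The sketch below assumes a hypothetical helper named running_max and made-up readings.

```r
# Hypothetical helper: a running maximum that starts from -Inf,
# so the first real value always beats the placeholder.
running_max <- function(values) {
  best <- -Inf
  for (v in values) {
    if (v > best) best <- v
  }
  best
}

readings <- c(-12.5, -3.0, -40.2)   # all negative, so 0 would be a bad start
print(running_max(readings))
print(running_max(numeric(0)))      # nothing to compare: placeholder survives
```

Starting from 0 instead would silently return 0 for the all-negative readings, which is why an infinity is the safer initial value.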
  • 16. Not a Number (NaN): It means something went wrong somewhere (a missing argument, an invalid number); check for it with is.nan(x) to keep it from leaking into later results; don't use "==" to find NaN, because the comparison returns NA rather than TRUE or FALSE
  • 17. Assigning NaN
        > a = NaN
        > a
        [1] NaN
  • 18. Adding NaN
        > b = 1
        > c = a + b
        > c
        [1] NaN
        When adding a number to NaN ("Not a Number") you will get NaN.
  • 19. Comparing NaN to Regular Number
        > d = b == c
        > d
        [1] NA
        When comparing a number to NaN ("Not a Number") you will get NA.
  • 20. Comparing NaN to NaN
        > e = c == a
        > e
        [1] NA
        When comparing NaN ("Not a Number") to NaN you will get NA.
  • 21. Detecting NaN
        > a
        [1] NaN
        > is.nan(a)
        [1] TRUE
        > is.na(a)
        [1] TRUE
        Since NaN values aren't proper numbers, special functions must be used to detect them. They are the result of math gone wrong.
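The same check works element by element on a vector. The following sketch uses made-up input; sqrt() of a negative number is one ordinary way to end up with NaN.

```r
x <- c(4, -1, 0, 9)

# sqrt(-1) produces NaN (and a warning, silenced here for the example).
roots <- suppressWarnings(sqrt(x))

bad   <- is.nan(roots)      # logical mask marking the NaN entries
print(which(bad))           # positions that went wrong
clean <- roots[!bad]        # keep only the usable results
print(clean)

# Comparing with == does not find NaN: every comparison yields NA.
print(roots == NaN)
```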
  • 22. Detecting NA
        > e = c == a
        > e
        [1] NA
        > is.nan(e)
        [1] FALSE
        > is.na(e)
        [1] TRUE
        Just as with NaN, special functions must be used, but NA generally indicates that there is missing information.
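Because is.na() is TRUE for both NA and NaN while is.nan() is TRUE only for NaN, the two can be combined to tell "missing" apart from "math gone wrong". A small sketch with made-up scores:

```r
scores <- c(10, NA, 0 / 0, 7)     # one missing value, one NaN

print(is.na(scores))              # TRUE for both NA and NaN
print(is.nan(scores))             # TRUE for NaN only

# Entries that are missing but not the result of invalid arithmetic.
only_missing <- is.na(scores) & !is.nan(scores)
print(only_missing)

# na.rm = TRUE drops NA and NaN before the statistic is computed.
print(mean(scores, na.rm = TRUE))
```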
  • 23. Operators: Assignment (->, <-); Addition (+); Subtraction (-); Division (/); Multiplication (*); Exponent (^); Parentheses ( (, ) )
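A short sketch exercising each operator from the list above; the variable names are illustrative only.

```r
x <- 7                     # leftward assignment
3 -> y                     # rightward assignment

total      <- x + y        # addition
difference <- x - y        # subtraction
ratio      <- x / y        # division
product    <- x * y        # multiplication
power      <- x ^ 2        # exponent

grouped   <- (x + y) * 2   # parentheses force the addition first
ungrouped <- x + y * 2     # multiplication binds tighter than addition

print(c(total, difference, product, power, grouped, ungrouped))
print(ratio)
```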
  • 25. Math Functions: max(), min(), log(), sqrt()
  • 26. Deriving Simple Statistics: Minimum; Maximum; Median; Arithmetic Mean; Function estimation (Linear, Log, Exponential); R-Values; Standard Deviation
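The statistics listed above map onto base R functions roughly as follows. This is a sketch on made-up data; using lm() for function estimation and cor() for the R-value is one common choice, not necessarily what the talk demonstrated.

```r
# Made-up sample data for illustration only.
hours  <- c(1, 2, 3, 4, 5, 6)
output <- c(2.1, 3.9, 6.2, 7.8, 10.1, 12.0)

print(min(output))
print(max(output))
print(median(output))
print(mean(output))      # arithmetic mean
print(sd(output))        # sample standard deviation

# Function estimation by least squares.
linear_fit <- lm(output ~ hours)        # output = a + b * hours
log_fit    <- lm(output ~ log(hours))   # output = a + b * log(hours)
exp_fit    <- lm(log(output) ~ hours)   # output = exp(a + b * hours)
print(coef(linear_fit))

# R-value: Pearson correlation between the two variables.
r_value <- cor(hours, output)
print(r_value)
```

For a one-predictor linear fit, r_value squared matches the R-squared reported by summary(linear_fit).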
  • 27. How to define your own functions
        firstfunction <- function(arg1, arg2, ... ){
          statements
          return(someoutput)
        }
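Filling in the template above with something concrete might look like the sketch below. The function name value_range and its na.rm argument are hypothetical choices for this example.

```r
# Hypothetical example: spread between the largest and smallest value,
# ignoring missing entries by default.
value_range <- function(values, na.rm = TRUE) {
  spread <- max(values, na.rm = na.rm) - min(values, na.rm = na.rm)
  return(spread)
}

print(value_range(c(3, NA, 10, 4)))
```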
  • 29. Twitter Example: First install the package
        install.packages("twitteR")
  • 30. Twitter Example: Authenticate
        consumer = "CONSUMER KEY"
        secret = "SECRET KEY"
        setup_twitter_oauth(consumer, secret)
  • 31. Twitter Example: Get trend locations, then choose a WOEID (Where On Earth ID) from the result
        availableTrendLocations()
  • 32. Twitter Example: Get trends
        trends = getTrends(SOMEWOEID)
  • 33. Twitter Example: Retrieve tweets
        tweets <- searchTwitter(trends[XX,XX], n=1500)
        tweetdf <- do.call("rbind", lapply(tweets, as.data.frame))
  • 34. Twitter Example: Filter (complete.cases is used to check for NA and NaN values)
        tweetdf <- tweetdf[complete.cases(tweetdf[,15]),]
        tweetdf <- tweetdf[tweetdf[,15] != 0,]
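The filtering step can be tried without a Twitter account on a small stand-in data frame. The sketch below uses made-up rows and selects the coordinate columns by name rather than by position (the slide uses column 15); that naming choice is an assumption made for readability.

```r
# Stand-in for the tweet data frame; values are invented for illustration.
tweetdf <- data.frame(
  screenName = c("a", "b", "c", "d"),
  longitude  = c(-77.4, NA, 0, -83.0),
  latitude   = c(38.9, NA, 0, 42.3),
  stringsAsFactors = FALSE
)

# Keep rows where both coordinates are present (neither NA nor NaN).
has_coords <- complete.cases(tweetdf[, c("longitude", "latitude")])
tweetdf <- tweetdf[has_coords, ]

# Drop rows whose longitude is a zero placeholder.
tweetdf <- tweetdf[tweetdf$longitude != 0, ]
print(tweetdf)
```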
  • 35. Twitter Example: Simplify the data frame
        simpledf <- tweetdf[c("screenName","longitude","latitude")]
  • 36. Twitter Example: Create a matrix from the data frame
        tweetMatrix <- data.matrix(simpledf[2:3], rownames.force = FALSE)
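To see what the conversion produces, the sketch below runs the same data.matrix() call on a two-row stand-in for simpledf (invented values).

```r
# Stand-in for the simplified data frame; values are invented.
simpledf <- data.frame(
  screenName = c("a", "d"),
  longitude  = c(-77.4, -83.0),
  latitude   = c(38.9, 42.3),
  stringsAsFactors = FALSE
)

# Columns 2 and 3 hold the coordinates; the result is a numeric matrix.
tweetMatrix <- data.matrix(simpledf[2:3], rownames.force = FALSE)
print(dim(tweetMatrix))
print(colnames(tweetMatrix))
print(is.numeric(tweetMatrix))
```

A two-column numeric matrix like this is what plot() treats as x and y coordinates.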
  • 37. Twitter Example: Plot the latitude and longitude
        plot(tweetMatrix)
  • 38. Graphing: Image, Contour, Box Chart
  • 39. K-Means: Essentially a search algorithm that iteratively looks for the best cluster centers; divides a dataset into k clusters
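A minimal sketch of k-means using the kmeans() function from base R's stats package, run on synthetic points. The data, the seed, and the choice of k = 2 are all assumptions made for the example.

```r
set.seed(42)   # make the synthetic data reproducible

# Two well-separated blobs of 20 points each, around (0, 0) and (5, 5).
pts <- rbind(
  matrix(rnorm(40, mean = 0, sd = 0.3), ncol = 2),
  matrix(rnorm(40, mean = 5, sd = 0.3), ncol = 2)
)
colnames(pts) <- c("x", "y")

# Divide the points into k = 2 clusters; nstart retries from several
# random starting centers and keeps the best result.
fit <- kmeans(pts, centers = 2, nstart = 10)

print(fit$centers)          # estimated cluster centers
print(table(fit$cluster))   # number of points assigned to each cluster
```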
  • 40. Time Series: Stock quotes, infection incidents, gas prices, audio, etc. Source: http://www.loc.gov/pictures/resource/hec.23488/
  • 41. Time Series Analysis: Regression, Forecasting, Time Frequency (FFTs). Source: http://groups.csail.mit.edu/netmit/sFFT/algorithm.html
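For the time-frequency item, base R provides fft(). The sketch below builds a synthetic one-second signal containing a single 5 Hz sine wave and recovers that frequency from the spectrum; the sampling rate and signal are invented for the example.

```r
n      <- 64                       # 64 samples over one second
time_s <- (0:(n - 1)) / n          # sample times in seconds
signal <- sin(2 * pi * 5 * time_s) # pure 5 Hz tone

# Magnitude of each frequency bin; with a one-second window,
# bin k (counting from 0) corresponds to k Hz.
spectrum <- Mod(fft(signal))

# For a real-valued signal only the first half of the bins is unique.
half        <- spectrum[1:(n / 2)]
dominant_hz <- which.max(half) - 1 # subtract 1 because index 1 is 0 Hz
print(dominant_hz)
```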
  • 43. Using Enterprise Data Sources: Database, Streams, Files, etc.
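Of the sources above, files are the simplest to show with base R alone. The sketch below parses inline CSV text so that it runs anywhere; with a real file, the path would be passed to read.csv() instead of the text argument. The column names and figures are invented.

```r
# Inline CSV standing in for a file exported from an enterprise system.
csv_text <- "region,sales\nEast,120\nWest,95\nNorth,143"

sales <- read.csv(text = csv_text, stringsAsFactors = FALSE)
str(sales)                   # column types inferred by read.csv

total <- sum(sales$sales)    # simple aggregate over the imported column
print(total)
```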
  • 45. Oracle R Distribution: Available on Oracle Public Yum; enhanced dynamic library loading; enterprise support available; Oracle Advanced Analytics; Oracle Linux; Oracle Big Data Appliance; http://www.oracle.com/technetwork/database/database-technologies/r/r-distribution/overview/index.html
  • 46. Oracle R Enterprise: Component of the Oracle Advanced Analytics Option on Oracle Database EE; allows use of R in the database without SQL; save R objects in the database; easily integrate with OBIEE; http://www.oracle.com/technetwork/database/database-technologies/r/r-enterprise/overview/index.html
  • 47. Oracle R Advanced Analytics for Hadoop: Component of the Oracle Big Data Software Connectors Suite, an option for the BDA; provides abstraction from HiveQL through R, just as Oracle R Enterprise does for SQL; http://www.oracle.com/technetwork/database/database-technologies/bdc/r-advanalytics-for-hadoop/overview/index.html
  • 48. ROracle: Open source package; maintained by Oracle; uses the OCI interface to interact with databases; http://www.oracle.com/technetwork/database/database-technologies/r/r-technologies/overview/index.html
  • 50. Contact Information: Harold Dost III; Principal Consultant; @hdost; harold.dost@raastech.com
  • 51. Resources
        https://en.wikibooks.org/wiki/Statistical_Analysis:_an_Introduction_using_R/R_basics
        http://www.r-project.org/
        https://docs.oracle.com/cd/E57012_01/doc.141/e56973/toc.htm
        http://cran.r-project.org/web/packages/akmeans/index.html
        http://cran.r-project.org/web/packages/twitteR/index.html
        http://en.wikipedia.org/wiki/K-means_clustering
        http://www.rdatamining.com/examples/kmeans-clustering
        http://blog.revolutionanalytics.com/2009/02/how-to-choose-a-random-number-in-r.html
        https://www.packtpub.com/books/content/text-mining-r-part-2
        http://www.eia.gov/totalenergy/data/monthly/index.cfm#consumption