SlideShare a Scribd company logo
1 of 34
BIG DATA ANALYTICS
USING R
Big Data & IoT
Umair Shafique (03246441789)
Scholar MS Information Technology - University of Gujrat
BIG DATA ANALYTICS
USING R
TABLE OF CONTENTS:
• WHAT IS BIG DATAANALYTICS?
• DATA SOURCES OF BIG DATA
• WHY DO WE NEED BIG DATAANALYTICS?
• STAGES OF BIG DATAANALYTICS
• TYPES OF BIG DATAANALYTICS
• TOOLS USED IN BIG DATAANALYTICS
• DOMAINS USING BIG DATAANALYTICS
• HISTORY OF R
• ABOUT R LANGUAGE
• FEATURES OF R
• REASONS TO LEARN R
• APPLICATIONS OF R PROGRAMMING
• INSTALLATION OF R
• COMPANIES USING R
• R VS PYTHON
• SKILLS FOR DATAANALYST
WHAT IS BIG DATA ANALYTICS?
• Big data analytics is the often complex process of
examining big data to uncover information, such as
hidden patterns, correlations, market trends and
customer preferences, that can help organizations
make informed business decisions.
• Big data analytics helps businesses to get insights
from today's huge data resources.
• Social media, cloud applications, and machine
sensor data are just some examples.
DATA SOURCES OF BIG DATA
WHY DO WE NEED BIG DATA ANALYTICS?
• Making Smarter and More Efficient
Organization
• Optimize Business Operations by
Analyzing Customer Behavior
• Cost Reduction
• New Generation Products
• Detect Risks and Check Frauds
Use Case 1
Use Case 2
STAGES OF BIG DATA ANALYTICS
TYPES OF BIG DATA ANALYTICS
1. Descriptive Analytics
2. Diagnostic Analytics
3. Predictive Analytics
4. Prescriptive Analytics
TOOLS USED IN BIG DATA ANALYTICS
DOMAINS USING BIG DATA ANALYTICS
History OF R
• R was created by Ross Ihaka and Robert Gentleman
in the University of Auckland, New Zealand, 1993.
• This programming language name is taken from the
name of both the developers.
• The R language was closely modeled on the S
Language for Statistical Computing conceived by
John Chambers, Rick Becker, Trevor Hastie, Allan
Wilks and others at Bell Labs in the mid 1970s.
• In 1995, statistician Martin Mächler convinced Ihaka
and Gentleman to make R free and open-source
software under the General Public License.
About R language
• R is a interpreted computer programming language.
• R is a popular choice for data analysis, statistical
computing and graphical representation.
• R is a programming language and software
environment for statistical computing and graphics.
• The R programming language comprises packages
and environments making analytics easier.
• R can be downloaded and installed from CRAN
website , CRAN stands for Comprehensive R Archive
Network.
Features of R
• Open source: R is an open source programming language. It is
completely free for anybody to use.
• Varity of packages: There are more than 15,000 packages for R
on online repositories like CRAN, GitHub.
• Powerful Graphics: R’s graphical capabilities are amazing. It
can make graphs of any type with its packages.
• Cross platform support: R is cross platform supportive that can
run on any Operating system and any software environment
without any hassle.
• No need for a Compiler: the R is interpreted language. It does
not need a compiler to convert the code into a program.
• Perform Fast Calculation: Through R, you can perform a wide
variety of complex operations on arrays, data frames, vectors
and other data objects of varying sizes.
Reason to learn R
• Open-source and Free Tool
• Strong Graphical Capabilities
• Highly Active Community
• A Wide Selection of Packages
• Comprehensive Environment
• Can Perform Complex Statistical Calculations
• Running Code Without a Compiler
• Interacting with Databases
• Cross-platform Support
• 2 Million jobs are opening for R programmer
Applications of R Programming
• R is used in finance and banking sectors for detecting fraud, reducing customer
churn rate and for making future decisions.
• R is also used by bioinformatics to analyze strands of genetic sequences, for
performing drug discovery and also in computational neuroscience.
• R is used in social media analysis to discover potential customers in online
advertising. Companies also use social media information to analyze customer
sentiments for making their products better.
• E-Commerce companies make use of R to analyze the purchases made by the
customers as well as their feedbacks.
• Manufacturing companies use R to analyze customer feedback. They also use it
to predict future demand to adjust their manufacturing speeds and maximize
profits.
Companies Using R
Some of the companies that
are using R programming
are as follows:
• Facebook
• Google
• Ford
• Twitter
• ANZ
• Microsoft
INSTALLATION OF R
R PYTHON
First appeared in 1993 First appeared in 1991
It has more functions and packages It has less functions and packages
It is an interpreter base language It is an interpreter base language
It is statistical design and graphics
programming language
It is general purpose language
It is difficult to learn and understand It is easy to understand
R is mostly use for data analysis Generic programming, tasks such as
design of software
Skills for Data Analyst
MACHINE
LEARNIG
MS OFFICE
SQL
PRESENTATI
ON SKILLS
CRITICAL
THINKING
R or PYTHON
DATA
VISUALIZATI
ON
BIG DATA ANALYTICS USING  R
BIG DATA ANALYTICS USING  R
BIG DATA ANALYTICS USING  R

More Related Content

What's hot

Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...Simplilearn
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data scienceSampath Kumar
 
Data Science with Python Libraries
Data Science with Python LibrariesData Science with Python Libraries
Data Science with Python Librariessabafarheen
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataVipin Batra
 
Big data and data science overview
Big data and data science overviewBig data and data science overview
Big data and data science overviewColleen Farrelly
 
Big Data Analytics with R
Big Data Analytics with RBig Data Analytics with R
Big Data Analytics with RGreat Wide Open
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolutionitnewsafrica
 
03. Data Exploration.pptx
03. Data Exploration.pptx03. Data Exploration.pptx
03. Data Exploration.pptxSarojkumari55
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessingankur bhalla
 
Introduction to Graph Databases
Introduction to Graph DatabasesIntroduction to Graph Databases
Introduction to Graph DatabasesMax De Marzi
 
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...Simplilearn
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyRohit Dubey
 
Big Data: The 4 Layers Everyone Must Know
Big Data: The 4 Layers Everyone Must KnowBig Data: The 4 Layers Everyone Must Know
Big Data: The 4 Layers Everyone Must KnowBernard Marr
 

What's hot (20)

Map Reduce
Map ReduceMap Reduce
Map Reduce
 
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
Deep Learning Frameworks 2019 | Which Deep Learning Framework To Use | Deep L...
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Data Analysis in Python
Data Analysis in PythonData Analysis in Python
Data Analysis in Python
 
Data Mining with R programming
Data Mining with R programmingData Mining with R programming
Data Mining with R programming
 
Big data Analytics
Big data AnalyticsBig data Analytics
Big data Analytics
 
Data Science with Python Libraries
Data Science with Python LibrariesData Science with Python Libraries
Data Science with Python Libraries
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Big_data_ppt
Big_data_ppt Big_data_ppt
Big_data_ppt
 
Overview of Big data(ppt)
Overview of Big data(ppt)Overview of Big data(ppt)
Overview of Big data(ppt)
 
Analysis vs reporting
Analysis vs reportingAnalysis vs reporting
Analysis vs reporting
 
Big data and data science overview
Big data and data science overviewBig data and data science overview
Big data and data science overview
 
Big Data Analytics with R
Big Data Analytics with RBig Data Analytics with R
Big Data Analytics with R
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
03. Data Exploration.pptx
03. Data Exploration.pptx03. Data Exploration.pptx
03. Data Exploration.pptx
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 
Introduction to Graph Databases
Introduction to Graph DatabasesIntroduction to Graph Databases
Introduction to Graph Databases
 
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit Dubey
 
Big Data: The 4 Layers Everyone Must Know
Big Data: The 4 Layers Everyone Must KnowBig Data: The 4 Layers Everyone Must Know
Big Data: The 4 Layers Everyone Must Know
 

Similar to BIG DATA ANALYTICS USING R

Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...
Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...
Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...Data Con LA
 
R and Big Data using Revolution R Enterprise with Hadoop
R and Big Data using Revolution R Enterprise with HadoopR and Big Data using Revolution R Enterprise with Hadoop
R and Big Data using Revolution R Enterprise with HadoopRevolution Analytics
 
2 it unit-1 start learning r
2 it   unit-1 start learning r2 it   unit-1 start learning r
2 it unit-1 start learning rNetaji Gandi
 
Data science using r multisoft systems
Data science using r  multisoft systemsData science using r  multisoft systems
Data science using r multisoft systemsMultisoft Systems
 
Are You Ready for Big Data Big Analytics?
Are You Ready for Big Data Big Analytics? Are You Ready for Big Data Big Analytics?
Are You Ready for Big Data Big Analytics? Revolution Analytics
 
Batter Up! Advanced Sports Analytics with R and Storm
Batter Up! Advanced Sports Analytics with R and StormBatter Up! Advanced Sports Analytics with R and Storm
Batter Up! Advanced Sports Analytics with R and StormRevolution Analytics
 
Robert Luong: Analyse prédictive dans Excel
Robert Luong: Analyse prédictive dans ExcelRobert Luong: Analyse prédictive dans Excel
Robert Luong: Analyse prédictive dans ExcelMSDEVMTL
 
An introduction to R is a document useful
An introduction to R is a document usefulAn introduction to R is a document useful
An introduction to R is a document usefulssuser3c3f88
 
Extending the Reach of R to the Enterprise with TERR and Spotfire
Extending the Reach of R to the Enterprise with TERR and SpotfireExtending the Reach of R to the Enterprise with TERR and Spotfire
Extending the Reach of R to the Enterprise with TERR and SpotfireLou Bajuk
 
Applications in R - Success and Lessons Learned from the Marketplace
Applications in R - Success and Lessons Learned from the MarketplaceApplications in R - Success and Lessons Learned from the Marketplace
Applications in R - Success and Lessons Learned from the MarketplaceRevolution Analytics
 
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...Experfy
 
Extending the R language to BI and Real-time Applications JSM 2015
Extending the R language to BI and Real-time Applications JSM 2015Extending the R language to BI and Real-time Applications JSM 2015
Extending the R language to BI and Real-time Applications JSM 2015Lou Bajuk
 
R in BI and Streaming Applications for useR 2016
R in BI and Streaming Applications for useR 2016R in BI and Streaming Applications for useR 2016
R in BI and Streaming Applications for useR 2016Lou Bajuk
 
Big Data Predictive Analytics with Revolution R Enterprise (Gartner BI Summit...
Big Data Predictive Analytics with Revolution R Enterprise (Gartner BI Summit...Big Data Predictive Analytics with Revolution R Enterprise (Gartner BI Summit...
Big Data Predictive Analytics with Revolution R Enterprise (Gartner BI Summit...Revolution Analytics
 
SWE-610-Lec-1-Software-Intro duction(1).pptx
SWE-610-Lec-1-Software-Intro duction(1).pptxSWE-610-Lec-1-Software-Intro duction(1).pptx
SWE-610-Lec-1-Software-Intro duction(1).pptxnohaaalrajhi
 

Similar to BIG DATA ANALYTICS USING R (20)

Introduction to R
Introduction to RIntroduction to R
Introduction to R
 
Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...
Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...
Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...
 
R and Big Data using Revolution R Enterprise with Hadoop
R and Big Data using Revolution R Enterprise with HadoopR and Big Data using Revolution R Enterprise with Hadoop
R and Big Data using Revolution R Enterprise with Hadoop
 
2 it unit-1 start learning r
2 it   unit-1 start learning r2 it   unit-1 start learning r
2 it unit-1 start learning r
 
UNIT-1 Start Learning R.pdf
UNIT-1 Start Learning R.pdfUNIT-1 Start Learning R.pdf
UNIT-1 Start Learning R.pdf
 
Data science using r multisoft systems
Data science using r  multisoft systemsData science using r  multisoft systems
Data science using r multisoft systems
 
Are You Ready for Big Data Big Analytics?
Are You Ready for Big Data Big Analytics? Are You Ready for Big Data Big Analytics?
Are You Ready for Big Data Big Analytics?
 
Revolution Analytics Podcast
Revolution Analytics PodcastRevolution Analytics Podcast
Revolution Analytics Podcast
 
Batter Up! Advanced Sports Analytics with R and Storm
Batter Up! Advanced Sports Analytics with R and StormBatter Up! Advanced Sports Analytics with R and Storm
Batter Up! Advanced Sports Analytics with R and Storm
 
Robert Luong: Analyse prédictive dans Excel
Robert Luong: Analyse prédictive dans ExcelRobert Luong: Analyse prédictive dans Excel
Robert Luong: Analyse prédictive dans Excel
 
LSESU a Taste of R Language Workshop
LSESU a Taste of R Language WorkshopLSESU a Taste of R Language Workshop
LSESU a Taste of R Language Workshop
 
An introduction to R is a document useful
An introduction to R is a document usefulAn introduction to R is a document useful
An introduction to R is a document useful
 
Extending the Reach of R to the Enterprise with TERR and Spotfire
Extending the Reach of R to the Enterprise with TERR and SpotfireExtending the Reach of R to the Enterprise with TERR and Spotfire
Extending the Reach of R to the Enterprise with TERR and Spotfire
 
Applications in R - Success and Lessons Learned from the Marketplace
Applications in R - Success and Lessons Learned from the MarketplaceApplications in R - Success and Lessons Learned from the Marketplace
Applications in R - Success and Lessons Learned from the Marketplace
 
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
 
Extending the R language to BI and Real-time Applications JSM 2015
Extending the R language to BI and Real-time Applications JSM 2015Extending the R language to BI and Real-time Applications JSM 2015
Extending the R language to BI and Real-time Applications JSM 2015
 
R in BI and Streaming Applications for useR 2016
R in BI and Streaming Applications for useR 2016R in BI and Streaming Applications for useR 2016
R in BI and Streaming Applications for useR 2016
 
Big Data Predictive Analytics with Revolution R Enterprise (Gartner BI Summit...
Big Data Predictive Analytics with Revolution R Enterprise (Gartner BI Summit...Big Data Predictive Analytics with Revolution R Enterprise (Gartner BI Summit...
Big Data Predictive Analytics with Revolution R Enterprise (Gartner BI Summit...
 
SWE-610-Lec-1-Software-Intro duction(1).pptx
SWE-610-Lec-1-Software-Intro duction(1).pptxSWE-610-Lec-1-Software-Intro duction(1).pptx
SWE-610-Lec-1-Software-Intro duction(1).pptx
 
Executive Intro to R
Executive Intro to RExecutive Intro to R
Executive Intro to R
 

More from Umair Shafique

Exploratory Data Analysis
Exploratory Data AnalysisExploratory Data Analysis
Exploratory Data AnalysisUmair Shafique
 
Pre-Processing and Data Preparation
Pre-Processing and Data PreparationPre-Processing and Data Preparation
Pre-Processing and Data PreparationUmair Shafique
 
Big Data Analytics With Hadoop
Big Data Analytics With HadoopBig Data Analytics With Hadoop
Big Data Analytics With HadoopUmair Shafique
 
BIG DATA AND MACHINE LEARNING
BIG DATA AND MACHINE LEARNINGBIG DATA AND MACHINE LEARNING
BIG DATA AND MACHINE LEARNINGUmair Shafique
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataUmair Shafique
 
Handling and Processing Big Data
Handling and Processing Big DataHandling and Processing Big Data
Handling and Processing Big DataUmair Shafique
 

More from Umair Shafique (6)

Exploratory Data Analysis
Exploratory Data AnalysisExploratory Data Analysis
Exploratory Data Analysis
 
Pre-Processing and Data Preparation
Pre-Processing and Data PreparationPre-Processing and Data Preparation
Pre-Processing and Data Preparation
 
Big Data Analytics With Hadoop
Big Data Analytics With HadoopBig Data Analytics With Hadoop
Big Data Analytics With Hadoop
 
BIG DATA AND MACHINE LEARNING
BIG DATA AND MACHINE LEARNINGBIG DATA AND MACHINE LEARNING
BIG DATA AND MACHINE LEARNING
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Handling and Processing Big Data
Handling and Processing Big DataHandling and Processing Big Data
Handling and Processing Big Data
 

Recently uploaded

X-rays from a Central “Exhaust Vent” of the Galactic Center Chimney
X-rays from a Central “Exhaust Vent” of the Galactic Center ChimneyX-rays from a Central “Exhaust Vent” of the Galactic Center Chimney
X-rays from a Central “Exhaust Vent” of the Galactic Center ChimneySérgio Sacani
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptxArvind Kumar
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIADr. TATHAGAT KHOBRAGADE
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxMohamedFarag457087
 
Understanding Partial Differential Equations: Types and Solution Methods
Understanding Partial Differential Equations: Types and Solution MethodsUnderstanding Partial Differential Equations: Types and Solution Methods
Understanding Partial Differential Equations: Types and Solution Methodsimroshankoirala
 
Cot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNACot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNACherry
 
Site specific recombination and transposition.........pdf
Site specific recombination and transposition.........pdfSite specific recombination and transposition.........pdf
Site specific recombination and transposition.........pdfCherry
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptxCherry
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxRenuJangid3
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.Cherry
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Cherry
 
Taphonomy and Quality of the Fossil Record
Taphonomy and Quality of the  Fossil RecordTaphonomy and Quality of the  Fossil Record
Taphonomy and Quality of the Fossil RecordSangram Sahoo
 
Lipids: types, structure and important functions.
Lipids: types, structure and important functions.Lipids: types, structure and important functions.
Lipids: types, structure and important functions.Cherry
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxCherry
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxDiariAli
 
ONLINE VOTING SYSTEM SE Project for vote
ONLINE VOTING SYSTEM SE Project for voteONLINE VOTING SYSTEM SE Project for vote
ONLINE VOTING SYSTEM SE Project for voteRaunakRastogi4
 
GBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of AsepsisGBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of AsepsisAreesha Ahmad
 
Pteris : features, anatomy, morphology and lifecycle
Pteris : features, anatomy, morphology and lifecyclePteris : features, anatomy, morphology and lifecycle
Pteris : features, anatomy, morphology and lifecycleCherry
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Cherry
 

Recently uploaded (20)

X-rays from a Central “Exhaust Vent” of the Galactic Center Chimney
X-rays from a Central “Exhaust Vent” of the Galactic Center ChimneyX-rays from a Central “Exhaust Vent” of the Galactic Center Chimney
X-rays from a Central “Exhaust Vent” of the Galactic Center Chimney
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
Understanding Partial Differential Equations: Types and Solution Methods
Understanding Partial Differential Equations: Types and Solution MethodsUnderstanding Partial Differential Equations: Types and Solution Methods
Understanding Partial Differential Equations: Types and Solution Methods
 
Cot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNACot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNA
 
Site specific recombination and transposition.........pdf
Site specific recombination and transposition.........pdfSite specific recombination and transposition.........pdf
Site specific recombination and transposition.........pdf
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
 
Taphonomy and Quality of the Fossil Record
Taphonomy and Quality of the  Fossil RecordTaphonomy and Quality of the  Fossil Record
Taphonomy and Quality of the Fossil Record
 
Lipids: types, structure and important functions.
Lipids: types, structure and important functions.Lipids: types, structure and important functions.
Lipids: types, structure and important functions.
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptx
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
 
ONLINE VOTING SYSTEM SE Project for vote
ONLINE VOTING SYSTEM SE Project for voteONLINE VOTING SYSTEM SE Project for vote
ONLINE VOTING SYSTEM SE Project for vote
 
GBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of AsepsisGBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of Asepsis
 
Pteris : features, anatomy, morphology and lifecycle
Pteris : features, anatomy, morphology and lifecyclePteris : features, anatomy, morphology and lifecycle
Pteris : features, anatomy, morphology and lifecycle
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.
 

BIG DATA ANALYTICS USING R

  • 1. BIG DATA ANALYTICS USING R Big Data & IoT Umair Shafique (03246441789) Scholar MS Information Technology - University of Gujrat
  • 3. TABLE OF CONTENTS: • WHAT IS BIG DATAANALYTICS? • DATA SOURCES OF BIG DATA • WHY DO WE NEED BIG DATAANALYTICS? • STAGES OF BIG DATAANALYTICS • TYPES OF BIG DATAANALYTICS • TOOLS USED IN BIG DATAANALYTICS • DOMAINS USING BIG DATAANALYTICS • HISTORY OF R • ABOUT R LANGUAGE • FEATURES OF R • REASONS TO LEARN R • APPLICATIONS OF R PROGRAMMING • INSTALLATION OF R • COMPANIES USING R • R VS PYTHON • SKILLS FOR DATAANALYST
  • 4. WHAT IS BIG DATA ANALYTICS? • Big data analytics is the often complex process of examining big data to uncover information, such as hidden patterns, correlations, market trends and customer preferences, that can help organizations make informed business decisions. • Big data analytics helps businesses to get insights from today's huge data resources. • Social media, cloud applications, and machine sensor data are just some examples.
  • 5. DATA SOURCES OF BIG DATA
  • 6. WHY DO WE NEED BIG DATA ANALYTICS? • Making Smarter and More Efficient Organization • Optimize Business Operations by Analyzing Customer Behavior • Cost Reduction • New Generation Products • Detect Risks and Check Frauds
  • 9. STAGES OF BIG DATA ANALYTICS
  • 10.
  • 11. TYPES OF BIG DATA ANALYTICS
  • 16. TOOLS USED IN BIG DATA ANALYTICS
  • 17. DOMAINS USING BIG DATA ANALYTICS
  • 18.
  • 19. History OF R • R was created by Ross Ihaka and Robert Gentleman in the University of Auckland, New Zealand, 1993. • This programming language name is taken from the name of both the developers. • The R language was closely modeled on the S Language for Statistical Computing conceived by John Chambers, Rick Becker, Trevor Hastie, Allan Wilks and others at Bell Labs in the mid 1970s. • In 1995, statistician Martin Mächler convinced Ihaka and Gentleman to make R free and open-source software under the General Public License.
  • 20. About R language • R is a interpreted computer programming language. • R is a popular choice for data analysis, statistical computing and graphical representation. • R is a programming language and software environment for statistical computing and graphics. • The R programming language comprises packages and environments making analytics easier. • R can be downloaded and installed from CRAN website , CRAN stands for Comprehensive R Archive Network.
  • 21. Features of R • Open source: R is an open source programming language. It is completely free for anybody to use. • Varity of packages: There are more than 15,000 packages for R on online repositories like CRAN, GitHub. • Powerful Graphics: R’s graphical capabilities are amazing. It can make graphs of any type with its packages. • Cross platform support: R is cross platform supportive that can run on any Operating system and any software environment without any hassle. • No need for a Compiler: the R is interpreted language. It does not need a compiler to convert the code into a program. • Perform Fast Calculation: Through R, you can perform a wide variety of complex operations on arrays, data frames, vectors and other data objects of varying sizes.
  • 22. Reason to learn R • Open-source and Free Tool • Strong Graphical Capabilities • Highly Active Community • A Wide Selection of Packages • Comprehensive Environment • Can Perform Complex Statistical Calculations • Running Code Without a Compiler • Interacting with Databases • Cross-platform Support • 2 Million jobs are opening for R programmer
  • 23. Applications of R Programming • R is used in finance and banking sectors for detecting fraud, reducing customer churn rate and for making future decisions. • R is also used by bioinformatics to analyze strands of genetic sequences, for performing drug discovery and also in computational neuroscience. • R is used in social media analysis to discover potential customers in online advertising. Companies also use social media information to analyze customer sentiments for making their products better. • E-Commerce companies make use of R to analyze the purchases made by the customers as well as their feedbacks. • Manufacturing companies use R to analyze customer feedback. They also use it to predict future demand to adjust their manufacturing speeds and maximize profits.
  • 24. Companies Using R Some of the companies that are using R programming are as follows: • Facebook • Google • Ford • Twitter • ANZ • Microsoft
  • 26.
  • 27.
  • 28.
  • 29.
  • 30. R PYTHON First appeared in 1993 First appeared in 1991 It has more functions and packages It has less functions and packages It is an interpreter base language It is an interpreter base language It is statistical design and graphics programming language It is general purpose language It is difficult to learn and understand It is easy to understand R is mostly use for data analysis Generic programming, tasks such as design of software
  • 31. Skills for Data Analyst MACHINE LEARNIG MS OFFICE SQL PRESENTATI ON SKILLS CRITICAL THINKING R or PYTHON DATA VISUALIZATI ON

Editor's Notes

  1. Analytics is the combination of mathematical, statistical, and heuristic techniques to glean useful insights from data and to implement actions derived from those insights. Big Data Analytics services We offer our service of Big Data Analytics for you to be able to see further progress and business prospects. To gain an insight into marketing trends and always be one step ahead of your business rivals, we resort to the following tools: Data mining. We make your data meaningful to predict future outcomes. Statistics We use statistics to measure the quality of data, define uncertainties and extract only accurate data. Data modeling We structure data in order so that it can feet the needs of application Machine learning We use machine learning to gather, integrate and process huge volumes of data. Database management Our services also include database management, which allows to collect, track and store stream of data, build data warehouses and make a data processing efficient. More than that, you can also receive support and maintenance of your database software, if there is such a need. Big data visualization Big data visualization promotes better understanding of the whole data, by breaking it into pieces with the help of colors, graphs, symbols etc. Business Intelligence Use Business Intelligence services to receive the assessment and summary of current situations from the point of view of market trends, financial reporting, budget planning, customer analysis and many more.
  2. Enterprise resource planning (ERP) refers to a type of software that organizations use to manage day-to-day business activities such as accounting, procurement, project management, risk management and compliance, and supply chain operations.
  3. Let me tell you about one such organization, the New York Police Department (NYPD). The NYPD brilliantly uses Big Data analytics to detect and identify crimes before they occur. They analyze historical arrest patterns and then maps them with events such as federal holidays, paydays, traffic flows, rainfall etc. This aids them in analyzing the information immediately by utilizing these data patterns. Big Data analytics strategy helps them identify crime locations, through which they deploy their officers to these locations. Thus by reaching these locations before the crimes were committed, they prevent the occurrence of crime.
  4. Most organizations use behavioral analytics of customers in order to provide customer satisfaction and hence, increase their customer base. The best example of this is Amazon. Amazon is one of the best and most widely used e-commerce websites with a customer base of about 300 million. They use customer click-stream data and historical purchase data to provide them with customized results on customized web pages. Analyzing the clicks of every visitor on their website aids them in understanding their site-navigation behavior, paths the user took to buy the product, paths that led them to leave the site and more. All this information helps Amazon to improve their user experience, thereby improving their sales and marketing.
  5. Descriptive Analytics: It uses data aggregation and data mining to provide insight into the past and answer: “What has happened?” The descriptive analytics does exactly what the name implies they “describe” or summarize raw data and make it interpretable by humans. 
  6. Diagnostic Analytics: It is used to determine why something happened in the past. It is characterized by techniques such as drill-down, data discovery, data mining and correlations. Diagnostic analytics takes a deeper look at data to understand the root causes of the events. 
  7. Predictive Analytics: It uses statistical models and forecasts techniques to understand the future and answer: “What could happen?” Predictive analytics provides companies with actionable insights based on data. It provides estimates about the likelihood of a future outcome. 
  8. Prescriptive Analytics: It uses optimization and simulation algorithms to advice on possible outcomes and answers: “What should we do?” It allows users to “prescribe” a number of different possible actions and guide them towards a solution. In a nutshell, this analytics is all about providing advice.  
  9. R and Python are the top programming languages used in the Data Analytics field. R is an open-source tool used for Statistics and Analytics whereas Python is a high level, an interpreted language that has an easy syntax and dynamic semantics.  QlikView is a Self-Service Business Intelligence, Data Visualization, and Data Analytics tool. Being named a leader in Gartner Magic Quadrant 2020 for Analytics and BI platforms, it aims to accelerate business value through data by providing features such as Data Integration, Data Literacy, and Data Analytics. Power BI is a Microsoft product used for business analytics. Named as a leader for the 13th consecutive year in the Gartner 2020 Magic Quadrant, it provides interactive visualizations with self-service business intelligence capabilities, where end users can create dashboards and reports by themselves, without having to depend on anybody. Apache Spark is one of the most successful projects in the Apache Software Foundation and is a cluster computing framework that is open-source and is used for real-time processing. Being the most active Apache project at the moment, it comes with a fantastic open-source community and an interface for programming. This interface makes sure of fault tolerance and implicit data parallelism. Tableau is a market-leading Business Intelligence tool used to analyze and visualize data in an easy format. Being named as a leader in the Gartner Magic Quadrant 2020 For the eighth consecutive year, Tableau allows you to work on live data-set and spend more time on Data Analysis rather than Data Wrangling.
  10. Healthcare: Healthcare is using data analytics to reduce costs, predict epidemics, avoid preventable diseases and improve the quality of life in general. One of the most widespread applications of Big Data in healthcare is Electronic Health Record(EHRs). Almost the majority of the Healthcare industry knows about the importance of  Big data Analysis in recent years. Telecom: They are one of the most significant contributors to Big Data. Telecom industry improves the quality of service and routes traffic more effectively. By analyzing call data records in real-time, these companies can identify fraudulent behavior and act on them immediately. The marketing division can modify its campaigns to better target its customers and use insights gained to develop new products and services.  Insurance: These companies use Big Data analytics for risk assessment, fraud detection, marketing, customer insights, customer experience and more.  Government: The government use data analytics to get an estimate of trade in the country. They used Central sales tax invoices to analyze the extent to which states trade with each other.  Finance: Banks and financial services firms use analytics to differentiate fraudulent interactions from legitimate business transactions. The analytics systems suggest immediate actions, such as blocking irregular transactions, which stops fraud before it occurs and improves profitability.  Automobile: Rolls Royce which has embraced Big Data analysis by fitting hundreds of sensors into its engines and propulsion systems, which record every tiny detail about their operation. The changes in data in real-time are reported to engineers who will decide the best course of action such as scheduling maintenance or dispatching engineering teams. Education: This is one field where Big Data Analytics is being absorbed slowly and gradually. Opting for big data powered technology as a learning tool instead of traditional lecture methods, enhanced the learning of students as well as aided the teachers to track their performance better. Retail: Retail including e-commerce and in-stores are widely using Big Data Analytics to optimize their business. For example, Amazon, Walmart etc.
  11. RStudio is an integrated development environment for R, a programming language for statistical computing and graphics. It is available in two formats: RStudio Desktop is a regular desktop application while RStudio Server runs on a remote server and allows accessing RStudio using a web browser. 
  12. R is a programming language for statistical computing and graphics supported by the R Core Team and the R Foundation for Statistical Computing. R is a free software environment for statistical computing and graphics. It compiles and runs on a wide variety of UNIX platforms, Windows and MacOS. To download R, please choose your preferred CRAN mirror. Python is a high-level, interpreted, general-purpose programming language. Its design philosophy emphasizes code readability with the use of significant indentation. Python is dynamically-typed and garbage-collected.  KEY DIFFERENCES: R is mainly used for statistical analysis while Python provides a more general approach to data science. The primary objective of R is Data analysis and Statistics whereas the primary objective of Python is Deployment and Production. In the end, the choice between R or Python depends on: The objectives of your mission: Statistical analysis or deployment The amount of time you can invest Your company/industry most-used tool
  13. Data visualization is the practice of translating information into a visual context, such as a map or graph, to make data easier for the human brain to understand and pull insights from. The main goal of data visualization is to make it easier to identify patterns, trends and outliers in large data sets. It is very important to know how to visualize the data in order to be able to understand the insights and apply it in action. Machine learning (ML) is a type of artificial intelligence (AI) that allows software applications to become more accurate at predicting outcomes without being explicitly programmed to do so. Machine learning algorithms use historical data as input to predict new output values. Microsoft Office, or simply Office, is a family of client software, server software, and services developed by Microsoft. It was first announced by Bill Gates on August 1, 1988, at COMDEX in Las Vegas.  Structured Query Language (SQL) is a standardized programming language that is used to manage relational databases and perform various operations on the data in them. Presentation skills are the abilities one needs in order to deliver compelling, engaging, informative, transformative, educational, enlightening, and/or instructive presentations. Central to effective presentation skills are public speaking, tone of voice, body language, creativity, and delivery. Critical thinking is the analysis of available facts, evidence, observations, and arguments to form a judgement. The subject is complex; several different definitions exist, which generally include the rational, skeptical, and unbiased analysis or evaluation of factual evidence.