Data analytics all about data v5
Harish Dixit
Information about Data Analytics and Big Data.
Data & Analytics
1. Data Analytics – All about data
Harish Dixit
2. Getting the context
• Examining raw data.
• Discovering useful information.
• Making better business decisions.
3. Life cycle stages
4. What is Big Data?
5. Where is the data coming from?
6. Complex ecosystem Cloud
7. Hadoop Architecture
8. HDFS
• Distributed file system.
• Partitioning of data.
• Fault tolerant.
• Java API.
• High scalability.
• Master-slave paradigm.
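The partitioning and fault-tolerance bullets above can be illustrated with a toy sketch. It is not how HDFS itself is implemented: the tiny block size, the replication factor of 3, the node names, and the round-robin placement are all assumptions chosen to keep the example small.

```python
def split_into_blocks(data: bytes, block_size: int) -> list[bytes]:
    """Partition a byte string into fixed-size blocks (the last may be shorter)."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def place_replicas(num_blocks: int, nodes: list[str], replication: int) -> dict[int, list[str]]:
    """Assign each block to `replication` distinct nodes.

    Round-robin placement is a simplification for illustration only.
    """
    if replication > len(nodes):
        raise ValueError("replication factor cannot exceed the number of nodes")
    placement = {}
    for block_id in range(num_blocks):
        start = block_id % len(nodes)
        placement[block_id] = [nodes[(start + r) % len(nodes)] for r in range(replication)]
    return placement


def readable_after_failure(placement: dict[int, list[str]], failed_node: str) -> bool:
    """True if every block still has at least one replica on a surviving node."""
    return all(
        any(node != failed_node for node in replicas)
        for replicas in placement.values()
    )


# Hypothetical cluster and file, for illustration.
data = b"x" * 1000
blocks = split_into_blocks(data, block_size=256)
nodes = ["node-a", "node-b", "node-c", "node-d"]
placement = place_replicas(len(blocks), nodes, replication=3)
```

Because each block lives on several nodes, losing any single node leaves every block readable, which is the sense of "fault tolerant" in this sketch.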
9. HDFS
10. MapReduce
• Parallel processing model.
• Move operations, not data.
• Distributed computations.
• User-defined functions:
  – Map()
  – Reduce()
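The two user-defined functions can be sketched with a single-process word count. This is only a model of the programming pattern, not Hadoop's API: the names `map_fn`, `shuffle`, `reduce_fn`, and `run_job` are hypothetical, and a real job would run the map and reduce calls in parallel on the nodes that hold the data.

```python
from collections import defaultdict
from typing import Iterable, Iterator


def map_fn(line: str) -> Iterator[tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in one input line."""
    for word in line.split():
        yield word.lower(), 1


def shuffle(pairs: Iterable[tuple[str, int]]) -> dict[str, list[int]]:
    """Group all emitted values by key, as the framework does between the phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def reduce_fn(word: str, counts: list[int]) -> tuple[str, int]:
    """Reduce: combine all values for one key into a single result."""
    return word, sum(counts)


def run_job(lines: Iterable[str]) -> dict[str, int]:
    """Run map, shuffle, and reduce sequentially over the input lines."""
    mapped = (pair for line in lines for pair in map_fn(line))
    grouped = shuffle(mapped)
    return dict(reduce_fn(word, counts) for word, counts in grouped.items())


result = run_job(["big data", "Big ideas"])
```

Each `map_fn` call depends only on its own line and each `reduce_fn` call only on its own key, which is what lets a framework distribute them across machines.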
11. Map/Reduce Operations
12. Example
13. R - Add analytic power to Hadoop
14. Data modeling techniques
• Regression
• Classification
• Clustering
• Recommendation
• Text mining
15. Regression
Regression can be formulated as follows: y = ax + e

x     y
---------
63    3.1
64    3.6
65    3.8
66    4
---------
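As a worked example, the slope a in y = ax + e can be estimated from the four data points above. The slide does not say how the model is fitted; ordinary least squares is assumed here, which for a model with no intercept gives a = Σxy / Σx².

```python
# Data points from the slide.
xs = [63, 64, 65, 66]
ys = [3.1, 3.6, 3.8, 4.0]


def fit_slope(xs: list[float], ys: list[float]) -> float:
    """Least-squares slope for y = a*x + e (no intercept term): a = sum(xy) / sum(x^2)."""
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)


a = fit_slope(xs, ys)

# The error term e for each observation is whatever the line fails to explain.
residuals = [y - a * x for x, y in zip(xs, ys)]

print(f"a = {a:.4f}")
```

A model with an intercept term (y = ax + b + e) would fit these points more closely; the version above follows the formula exactly as written on the slide.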
16. Classification
17. Clustering
18. Recommendation
19. Text Mining
20. NoSQL
• Non-relational.
• Distributed environment.
• Large volume.
• No fixed schemas.
• Horizontally scalable.
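Two of these properties, "no fixed schemas" and "horizontally scalable", can be sketched with a toy in-memory store. The `ShardedStore` class and its hash-modulo routing are hypothetical and do not represent any particular NoSQL product.

```python
import hashlib
from typing import Any, Optional


class ShardedStore:
    """Toy key-value store that spreads keys across several shards."""

    def __init__(self, num_shards: int) -> None:
        if num_shards < 1:
            raise ValueError("need at least one shard")
        # Each dict stands in for a separate machine.
        self.shards: list[dict[str, Any]] = [{} for _ in range(num_shards)]

    def _shard_for(self, key: str) -> int:
        """Route a key to a shard using a hash that is stable across runs."""
        digest = hashlib.sha256(key.encode("utf-8")).digest()
        return int.from_bytes(digest[:8], "big") % len(self.shards)

    def put(self, key: str, record: Any) -> None:
        self.shards[self._shard_for(key)][key] = record

    def get(self, key: str) -> Optional[Any]:
        return self.shards[self._shard_for(key)].get(key)


store = ShardedStore(num_shards=3)
# No fixed schema: the two records carry different fields.
store.put("user:1", {"name": "Asha"})
store.put("user:2", {"name": "Ravi", "tags": ["admin"], "age": 41})
```

Plain modulo routing remaps most keys whenever the shard count changes, so systems that need to add nodes smoothly often use consistent hashing instead.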
21. CAP Theorem (any 2 of 3)
22. Variants of NoSQL
• Key-Value Systems.
• Document-based Systems.
• Column-based Systems.
• Graph-based Systems.
23. Distributed Key-Value Systems
24. Column-based Systems
25. Applications of Big Data
26. Think Big
Q & A