SlideShare a Scribd company logo
1 of 25
By: Milind Zodge
Big Data to
Analytics
Agenda
2
Big Data01
Big Data Forecast02
Data Analytics Process03
Embedded Analytics
Use Case
04
Dynamic Cloud
Computing
05
Big Data Lake06
Milind Zodge
Small Data vs Big Data
3
Small Data
 Low Volumes
 Batch Velocities
 Structured Varieties
Big Data
 Into Petabyte Volumes
 Real-time Velocities
 Multistructured Varieties
Vs
Milind Zodge
What is Big Data
4Milind Zodge
3 Vs of Big Data
5
3 Vs of
Big Data
Velocity
Variety
Volume
 Terabytes
 Records
 Transactions
 Tables, files
 Batch
 Near time
 Semi structured
 Streams  Structured
 Unstructured
 Semi structured
Milind Zodge
Forms/ Type of Big Data
6
Structured
01
Enterprise
systems
Data
warehouses
Databases
Unstructured
02
Audio/ video
streams
Analog data
GPS tracking
information
Semi-Structured
03
Xml
E- Mail
EDI
Milind Zodge
How Big is Big Data
7
Number of emails
sent every second
2.9 Million
Data consumed by
households each
day
375 Megabytes
Video upload to
YouTube every
minute
20 Hours
Data per day
processed by Google
24 Petabytes
Tweets per day
50 Million
Total minutes spent on
Facebook each month
700 Billion
Data sent and
received by mobile
internet users
1.3 Exabytes
Products ordered
on amazon per
second
72.9 Items
Milind Zodge
Big Data Market Forecast
8
$58.08 B
$61.16 B
$12.25 B
$48.79 B
$54.05 B
2019
04
2020
05
2012
01
2017
02
2018
03
Milind Zodge
9Milind Zodge
10Milind Zodge
11Milind Zodge
Data Analytics Process
12
Data
Data can be stored in data lake
environment on various different
technologies
Decision
Recommendations will be
generated based on insights
which will help for decision
making
Info
From this harmonized data
analytics can be determined
which will generate information
Insight
Using the information and the
historical outcomes insights can
be formed using machine
learning algorithms
Milind Zodge
Embedded Analytics
13Milind Zodge
14Milind Zodge
15Milind Zodge
Data
16Milind Zodge
Info
Data
17Milind Zodge
Info
Data
Insight
18Milind Zodge
Insight
Data
Info
Decision
19Milind Zodge
Dynamic Cloud Computing and Big Data Lake
Lambda
Function
20Milind Zodge
Dynamic Cloud Computing and Big Data Lake
Lambda
Function
21
S3 Glue
Crawler
Glue
Catalog
Redshift
Spectrum
Kenesis
Firehose
JS-Tracker
Recorder
Milind Zodge
Dynamic Cloud Computing and Big Data Lake
Lambda
Function
22
S3 Glue
Crawler
Glue
Catalog
Redshift
Spectrum
Kenesis
Firehose
JS-Tracker
Recorder
External Data
Lambda
Function Glue ETL S3
Milind Zodge
Dynamic Cloud Computing and Big Data Lake
Lambda
Function
23
S3 Glue
Crawler
Glue
Catalog
Redshift
Spectrum
Kenesis
Firehose
JS-Tracker
Recorder
External Data
Lambda
Function Glue ETL S3
Analytics
Milind Zodge
24Milind Zodge
THANK YOU
milzod milzod@gmail.com

More Related Content

Similar to Big Data to Analytics

Big Data Is Not Enough - Real-Time Analytics Needs Streaming Archtectures
Big Data Is Not Enough - Real-Time Analytics Needs Streaming ArchtecturesBig Data Is Not Enough - Real-Time Analytics Needs Streaming Archtectures
Big Data Is Not Enough - Real-Time Analytics Needs Streaming ArchtecturesDr. Tim Frey
 
Introduction to Vespa – The Open Source Big Data Serving Engine, Jon Bratseth...
Introduction to Vespa – The Open Source Big Data Serving Engine, Jon Bratseth...Introduction to Vespa – The Open Source Big Data Serving Engine, Jon Bratseth...
Introduction to Vespa – The Open Source Big Data Serving Engine, Jon Bratseth...Yahoo Developer Network
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Hritika Raj
 
Big data 2017 final
Big data 2017   finalBig data 2017   final
Big data 2017 finalAmjid Ali
 
Demystify big data data science
Demystify big data  data scienceDemystify big data  data science
Demystify big data data scienceMahesh Kumar CV
 
CSF18 - Through a Mirror Darkly- a journey to the dark side of metadata - Sas...
CSF18 - Through a Mirror Darkly- a journey to the dark side of metadata - Sas...CSF18 - Through a Mirror Darkly- a journey to the dark side of metadata - Sas...
CSF18 - Through a Mirror Darkly- a journey to the dark side of metadata - Sas...NCCOMMS
 
08_-_Masamichi_Tanaka_-_Bigdata_and_AI_in_IOT.pdf
08_-_Masamichi_Tanaka_-_Bigdata_and_AI_in_IOT.pdf08_-_Masamichi_Tanaka_-_Bigdata_and_AI_in_IOT.pdf
08_-_Masamichi_Tanaka_-_Bigdata_and_AI_in_IOT.pdfDrAdeelAkram2
 
Big data use cases in the cloud presentation
Big data use cases in the cloud presentationBig data use cases in the cloud presentation
Big data use cases in the cloud presentationTUSHAR GARG
 
Big data - Basics
Big data - BasicsBig data - Basics
Big data - BasicsRohit Gupta
 
Cisco niels vd berg
Cisco niels vd bergCisco niels vd berg
Cisco niels vd bergBigDataExpo
 

Similar to Big Data to Analytics (20)

Big Data 101
Big Data 101Big Data 101
Big Data 101
 
Ictam big data
Ictam big dataIctam big data
Ictam big data
 
Big data
Big dataBig data
Big data
 
Big Data
Big DataBig Data
Big Data
 
Introduction to BIG DATA part 01
Introduction to BIG DATA   part 01Introduction to BIG DATA   part 01
Introduction to BIG DATA part 01
 
Bigdata " new level"
Bigdata " new level"Bigdata " new level"
Bigdata " new level"
 
Big Data Is Not Enough - Real-Time Analytics Needs Streaming Archtectures
Big Data Is Not Enough - Real-Time Analytics Needs Streaming ArchtecturesBig Data Is Not Enough - Real-Time Analytics Needs Streaming Archtectures
Big Data Is Not Enough - Real-Time Analytics Needs Streaming Archtectures
 
Introduction to Vespa – The Open Source Big Data Serving Engine, Jon Bratseth...
Introduction to Vespa – The Open Source Big Data Serving Engine, Jon Bratseth...Introduction to Vespa – The Open Source Big Data Serving Engine, Jon Bratseth...
Introduction to Vespa – The Open Source Big Data Serving Engine, Jon Bratseth...
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
 
Big data 2017 final
Big data 2017   finalBig data 2017   final
Big data 2017 final
 
Demystify big data data science
Demystify big data  data scienceDemystify big data  data science
Demystify big data data science
 
Big data.pptx
Big data.pptxBig data.pptx
Big data.pptx
 
CSF18 - Through a Mirror Darkly- a journey to the dark side of metadata - Sas...
CSF18 - Through a Mirror Darkly- a journey to the dark side of metadata - Sas...CSF18 - Through a Mirror Darkly- a journey to the dark side of metadata - Sas...
CSF18 - Through a Mirror Darkly- a journey to the dark side of metadata - Sas...
 
08_-_Masamichi_Tanaka_-_Bigdata_and_AI_in_IOT.pdf
08_-_Masamichi_Tanaka_-_Bigdata_and_AI_in_IOT.pdf08_-_Masamichi_Tanaka_-_Bigdata_and_AI_in_IOT.pdf
08_-_Masamichi_Tanaka_-_Bigdata_and_AI_in_IOT.pdf
 
Big Data
Big DataBig Data
Big Data
 
Big data use cases in the cloud presentation
Big data use cases in the cloud presentationBig data use cases in the cloud presentation
Big data use cases in the cloud presentation
 
Big data - Basics
Big data - BasicsBig data - Basics
Big data - Basics
 
Big Data.pptx
Big Data.pptxBig Data.pptx
Big Data.pptx
 
Cisco niels vd berg
Cisco niels vd bergCisco niels vd berg
Cisco niels vd berg
 
Big data
Big dataBig data
Big data
 

More from Milind Zodge

Cassandra one page
Cassandra one pageCassandra one page
Cassandra one pageMilind Zodge
 
Open source information architecture
Open source information architectureOpen source information architecture
Open source information architectureMilind Zodge
 
Data Staging Strategy
Data Staging StrategyData Staging Strategy
Data Staging StrategyMilind Zodge
 

More from Milind Zodge (6)

Cassandra one page
Cassandra one pageCassandra one page
Cassandra one page
 
Mongo db onepage
Mongo db onepageMongo db onepage
Mongo db onepage
 
H base one page
H base one pageH base one page
H base one page
 
Open source information architecture
Open source information architectureOpen source information architecture
Open source information architecture
 
Big datawarehouse
Big datawarehouseBig datawarehouse
Big datawarehouse
 
Data Staging Strategy
Data Staging StrategyData Staging Strategy
Data Staging Strategy
 

Recently uploaded

Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 

Big Data to Analytics

  • 1. By: Milind Zodge Big Data to Analytics
  • 2. Agenda 2 Big Data01 Big Data Forecast02 Data Analytics Process03 Embedded Analytics Use Case 04 Dynamic Cloud Computing 05 Big Data Lake06 Milind Zodge
  • 3. Small Data vs Big Data 3 Small Data  Low Volumes  Batch Velocities  Structured Varieties Big Data  Into Petabyte Volumes  Real-time Velocities  Multistructured Varieties Vs Milind Zodge
  • 4. What is Big Data 4Milind Zodge
  • 5. 3 Vs of Big Data 5 3 Vs of Big Data Velocity Variety Volume  Terabytes  Records  Transactions  Tables, files  Batch  Near time  Semi structured  Streams  Structured  Unstructured  Semi structured Milind Zodge
  • 6. Forms/ Type of Big Data 6 Structured 01 Enterprise systems Data warehouses Databases Unstructured 02 Audio/ video streams Analog data GPS tracking information Semi-Structured 03 Xml E- Mail EDI Milind Zodge
  • 7. How Big is Big Data 7 Number of emails sent every second 2.9 Million Data consumed by households each day 375 Megabytes Video upload to YouTube every minute 20 Hours Data per day processed by Google 24 Petabytes Tweets per day 50 Million Total minutes spent on Facebook each month 700 Billion Data sent and received by mobile internet users 1.3 Exabytes Products ordered on amazon per second 72.9 Items Milind Zodge
  • 8. Big Data Market Forecast 8 $58.08 B $61.16 B $12.25 B $48.79 B $54.05 B 2019 04 2020 05 2012 01 2017 02 2018 03 Milind Zodge
  • 12. Data Analytics Process 12 Data Data can be stored in data lake environment on various different technologies Decision Recommendations will be generated based on insights which will help for decision making Info From this harmonized data analytics can be determined which will generate information Insight Using the information and the historical outcomes insights can be formed using machine learning algorithms Milind Zodge
  • 20. Dynamic Cloud Computing and Big Data Lake Lambda Function 20Milind Zodge
  • 21. Dynamic Cloud Computing and Big Data Lake Lambda Function 21 S3 Glue Crawler Glue Catalog Redshift Spectrum Kenesis Firehose JS-Tracker Recorder Milind Zodge
  • 22. Dynamic Cloud Computing and Big Data Lake Lambda Function 22 S3 Glue Crawler Glue Catalog Redshift Spectrum Kenesis Firehose JS-Tracker Recorder External Data Lambda Function Glue ETL S3 Milind Zodge
  • 23. Dynamic Cloud Computing and Big Data Lake Lambda Function 23 S3 Glue Crawler Glue Catalog Redshift Spectrum Kenesis Firehose JS-Tracker Recorder External Data Lambda Function Glue ETL S3 Analytics Milind Zodge