Analysing data analytics use cases
to understand the purpose of big data ecosystem components
by
The purpose of any data platform (big or not)
is to enable analytics on data
dataeaze
Why?
Different analytics use cases expect different sets of
features from a data platform
The components of the big data ecosystem
are built
to provide the features that analytics use cases need
So, to understand a data platform
and its components,
it is necessary to know their purpose:
the needs of the analytics use cases
that the data platform serves
Here
we take a look at all categories of analytics use
cases on a data platform
What?
Analytics data processing use case categories
We analyse each use case in terms of:
Nature of data processing needed to serve the use case
Expectations from the data platform to enable the required data processing
Static Reports
are summary reports prepared to give
a status update to decision makers
Example
An end-of-day report for top management covering
daily sales, transactions, revenue and total traffic
Nature of data processing
Static reports
are scheduled to execute at a fixed time interval,
generate analysis for a given time period,
can execute on raw data directly or on an intermediate store
Expectations from data platform
Scheduled data processing
Static reports are executed repeatedly on a predefined schedule
Timely arrival of data
Generated reports should represent the complete picture of the given
timeframe and should be generated before the deadline.
Process raw data to get result
Capability to generate a report from raw data if it cannot be
derived from an intermediate data form
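As a rough illustration of a static report computed directly from raw data, the sketch below summarises one day of transactions. The record layout, the sample data and the function name `daily_report` are assumptions made for this example; in practice a scheduler would invoke such a job at the end of each day.

```python
from datetime import date, datetime

# Hypothetical raw transaction records; the field names are assumptions.
RAW_TRANSACTIONS = [
    {"ts": datetime(2024, 1, 1, 9, 0), "amount": 10.0},
    {"ts": datetime(2024, 1, 1, 17, 30), "amount": 20.0},
    {"ts": datetime(2024, 1, 2, 8, 0), "amount": 5.0},
]


def daily_report(records, day):
    """Summarise the transactions that fall on the given calendar day."""
    todays = [r for r in records if r["ts"].date() == day]
    return {
        "date": day.isoformat(),
        "transactions": len(todays),
        "revenue": sum(r["amount"] for r in todays),
    }


if __name__ == "__main__":
    # A scheduler (cron, Oozie, ...) would trigger this once per day.
    print(daily_report(RAW_TRANSACTIONS, date(2024, 1, 1)))
```

The report only reflects the complete day if all of that day's records have arrived before the job runs, which is why timely arrival of data is listed as an expectation.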
Dashboard Reports
A dashboard is a reporting user interface where users can interactively
choose their own view of the data with a limited set of filters.
Example
An e-commerce company providing a dashboard for sellers, where
sellers can see how much inventory was sold across demographics,
product categories and time ranges.
Nature of data processing
Periodically process raw data to
bring it into the form required by dashboards
Populate the transformed data into the interactive
store that backs the dashboards
Expectations from data platform
ETL
To convert raw data into the format required by the dashboard
Scheduled data processing
Timely, repeated execution of ETL jobs to populate
dashboards with the latest updates
Interactive data store
Dashboard reports are interactive in nature, so the backend store
is expected to return results in near real time
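The ETL-plus-interactive-store pattern can be sketched as follows: an ETL step pre-aggregates raw sales rows by the dashboard's filter dimensions, and the dashboard then answers filter selections from the small aggregated store instead of scanning raw data. The row layout, the in-memory `Counter` standing in for the interactive store, and the function names are all assumptions for illustration.

```python
from collections import Counter

# Hypothetical raw sales rows; the field names are assumptions.
RAW_SALES = [
    {"seller": "s1", "category": "books", "region": "north", "units": 3},
    {"seller": "s1", "category": "books", "region": "south", "units": 2},
    {"seller": "s1", "category": "toys", "region": "north", "units": 3},
    {"seller": "s2", "category": "books", "region": "north", "units": 7},
]


def run_etl(raw_rows):
    """Aggregate units sold per (seller, category, region) combination."""
    store = Counter()
    for row in raw_rows:
        store[(row["seller"], row["category"], row["region"])] += row["units"]
    return store


def units_sold(store, seller, category=None, region=None):
    """Answer a dashboard query; a filter left as None matches everything."""
    return sum(
        units
        for (s, c, r), units in store.items()
        if s == seller and category in (None, c) and region in (None, r)
    )


if __name__ == "__main__":
    store = run_etl(RAW_SALES)  # would be re-run on a schedule
    print(units_sold(store, "s1", category="books"))
```

Because the dashboard exposes only a limited set of filters, the aggregated store stays small, which is what makes near-real-time responses feasible.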
Ad Hoc data analysis
This covers business queries that are raised as the need arises.
It is not scheduled and is executed once, whenever necessary
Example
A product manager wanting a detailed analysis of
customer behavior on a navigation panel, in order to define optimised
ad placements.
Nature of data processing
Steps to serve an ad hoc report:
Identify the data sources that will satisfy the given
request
Execute data processing (preferably a SQL-like
query) on the identified source
Load the results into a data representation tool
Expectations from data platform
SQL data processing engine
A SQL query engine makes it easy to express the required analysis
as a SQL query, which saves the analyst's time
Complex data processing
A platform that supports writing custom, complex data
analysis that is not possible through SQL
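To show why a SQL engine saves the analyst's time, here is a minimal sketch of an ad hoc question ("which navigation panel items get the most clicks?") answered with a single query. It uses Python's built-in `sqlite3` module as a stand-in for the platform's SQL engine; the `clicks` table and its contents are invented for the example.

```python
import sqlite3

# In-memory database standing in for the identified data source.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE clicks (user_id TEXT, item TEXT)")
conn.executemany(
    "INSERT INTO clicks VALUES (?, ?)",
    [
        ("u1", "home"),
        ("u1", "deals"),
        ("u2", "deals"),
        ("u3", "deals"),
        ("u2", "cart"),
    ],
)

# The whole analysis is one declarative query, written once and run once.
clicks_per_item = conn.execute(
    "SELECT item, COUNT(*) AS n FROM clicks "
    "GROUP BY item ORDER BY n DESC, item"
).fetchall()

if __name__ == "__main__":
    for item, n in clicks_per_item:
        print(item, n)
```

Analyses that do not fit this declarative shape are the cases where the second expectation, custom complex data processing, comes in.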
BI Reporting
Business Intelligence tools provide advanced general-purpose
dashboards backed by a data store that hosts a wide array of dimensions.
Users can define and save transformations and analysis queries
through the BI tool and get back reports in tabular or graphical form.
Example
A BI report showing weekly sales stats across multiple regions
for the previous 6 months. The report is created and saved once. Users
execute the saved report whenever they want.
Nature of data processing
Scheduled ETL jobs convert raw data to
the required intermediate data form
Data is loaded into interactive SQL data stores
BI tools connect to the SQL data store as their
backend
Expectations from data platform
ETL
Raw data should be transformed to the required format and
loaded into a SQL data warehouse
Scheduling of ETL
Defined ETL jobs should be scheduled to execute at a fixed time
interval.
SQL data processing engine
A SQL query engine makes it easy to extract data, which saves
time. BI tools can connect to this SQL data store.
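The "define once, run whenever needed" nature of a BI report can be sketched as a saved query executed against the SQL store that the ETL job loads. Here `sqlite3` stands in for the SQL data warehouse, and the `sales` table, the `SAVED_REPORTS` registry and the function names are assumptions for illustration only, not how any particular BI tool works.

```python
import sqlite3


def load_warehouse(rows):
    """ETL load step: put transformed rows into a SQL store."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (week TEXT, region TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    return conn


# Reports are defined and saved once, then executed on demand.
SAVED_REPORTS = {
    "weekly_sales_by_region": (
        "SELECT week, region, SUM(amount) FROM sales "
        "GROUP BY week, region ORDER BY week, region"
    ),
}


def run_report(conn, name):
    """Execute a saved report against the current warehouse contents."""
    return conn.execute(SAVED_REPORTS[name]).fetchall()


if __name__ == "__main__":
    conn = load_warehouse([("2024-W01", "east", 100.0), ("2024-W01", "west", 70.0)])
    print(run_report(conn, "weekly_sales_by_region"))
```

Each run of the saved report reflects whatever the most recent scheduled ETL load put into the store.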
Data Processing for Applications
This is data processing done to feed results back into business
applications. Business applications make better decisions based on
the latest data feedback.
Example
Ad servers being periodically updated with the latest minimum
eCPM to expect for an ad placement that is filled dynamically.
Nature of data processing
Complex data processing (machine learning) on raw
data
Scheduled data processing
Update results into an interactive key-value store, from which they are
fetched directly by applications
Expectations from data platform
Capability to implement custom complex data processing
Users should be able to easily define custom complex data processing
algorithms (such as machine learning)
Scheduled data processing
Required for periodic execution of data processing jobs
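The feedback loop can be sketched as a periodic job that recomputes a value per ad placement and writes it to a key-value store, which the application then reads directly. In this sketch a plain `dict` stands in for the key-value store, and taking the minimum observed eCPM is a deliberately simple stand-in for the machine learning step; the data layout and function names are assumptions.

```python
def refresh_min_ecpm(observations, kv_store):
    """Periodic job: recompute the minimum observed eCPM per placement.

    `observations` is an iterable of (placement, ecpm) pairs taken from
    raw data. A real job might run a learned model here instead of min().
    """
    by_placement = {}
    for placement, ecpm in observations:
        by_placement.setdefault(placement, []).append(ecpm)
    for placement, values in by_placement.items():
        kv_store[placement] = min(values)


def lookup_min_ecpm(kv_store, placement, default=0.0):
    """Application side: a single key lookup, no data processing."""
    return kv_store.get(placement, default)


if __name__ == "__main__":
    store = {}
    refresh_min_ecpm([("top", 2.5), ("top", 1.5), ("side", 0.75)], store)
    print(lookup_min_ecpm(store, "top"))
```

Keeping the heavy computation in the scheduled job means the application only ever pays for a key lookup at request time.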
Real time stream data processing
This means analysing an event as soon as it happens. The sooner the analysis,
the greater the value obtained from it.
Example
A stock ticker displayed on Yahoo Finance
Nature of data processing
As soon as an event happens, its log entry is
collected
All log entries are buffered and made available
to the processing layer.
Records are pulled from the message buffer and
processed.
Expectations from data platform
Scalable message buffer
A message buffer that holds received messages until they are pulled
from it for processing
Real time stream processing engine
To pull and process records in real time, and to give users the ability to
define custom data processing.
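The collect, buffer and pull-and-process steps can be sketched in a single process using Python's standard `queue.Queue` as the message buffer. This only illustrates the shape of the flow: a real platform would use a distributed buffer and a stream engine, and the event layout and function names below are assumptions.

```python
import queue


def collect(buffer, event):
    """Collection step: push an event into the buffer as soon as it happens."""
    buffer.put(event)


def process_available(buffer, handler):
    """Processing step: pull every buffered record and apply the handler."""
    results = []
    while True:
        try:
            event = buffer.get_nowait()
        except queue.Empty:
            break
        results.append(handler(event))
    return results


def make_ticker(latest):
    """User-defined processing: keep the latest price seen per symbol."""
    def handle(event):
        latest[event["symbol"]] = event["price"]
        return event["symbol"]
    return handle


if __name__ == "__main__":
    buf, latest = queue.Queue(), {}
    collect(buf, {"symbol": "ABC", "price": 10.0})
    collect(buf, {"symbol": "ABC", "price": 10.5})
    process_available(buf, make_ticker(latest))
    print(latest)
```

The buffer decouples producers from the processing layer, so events can keep arriving while earlier ones are still being processed.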
Let us take a look at the superset of expectations across
all use cases
Expectations from data platform
across all use cases
Summarise all
Superset of expectations
Expectation / Capability
Complex data analysis using query language
Scheduled ETL data processing
Data store for interactive data analysis
Data ingestion with timely arrival of data
Scalable message buffer to be consumed by stream data processing
Streaming data processing platform
Needed by (use case)
Static reports
Ad hoc data analysis
BI reporting
Dashboard reports
App-specific data processing
Real time stream data processing
Let’s conclude
We have identified a common set of features expected
from a data platform
by most analytics use cases
Let us map these to data platform components
Conclude
Capabilities provided by data platform components
Expectation / Capability
Complex data analysis using query language
Scheduled ETL data processing
Data store for interactive data analysis
Data ingestion with timely arrival of data
Scalable message buffer to be consumed by stream data processing
Streaming data processing platform
Supported by (data platform component): data platform tools
Data ingestion: Flume, Kafka, Scribe
Batch data processing: Hive, MapReduce
Workflow scheduler: Oozie
Interactive data stores: HBase, Spark, ..
Message buffers: Kafka
Real time stream engine: Storm, Spark
Data platform components satisfying expectations
Going backwards
Now you know about
Data platform components,
the capabilities they support,
and how those satisfy the needs of analytics use cases
Thank You
Get in touch with us at
contactus@dataeaze.io
Analysing data analytics use cases to understand big data platform

  • 1. Analysing data analytics use cases to understand purpose of big data ecosystem components by
  • 2. Purpose of any data platform (big / not big) is to enable analytics on data dataeaze Why?
  • 3. Different analytics use cases expect different set of features from data platform Components part of big data ecosystem are made to serve needed features of analytics use cases dataeaze Why?
  • 4. So to understand data platform to understand data platform components It is necessary to know purpose It is necessary to know needs of analytics use cases which are served by data platform dataeaze Why?
  • 5. Here We take look at all categories of analytics use cases on data platform dataeaze What?
  • 6. Analytics data processing use case categories dataeaze What?
  • 7. We analyse each use case as Nature of data processing in order to serve this use case Expectations from data platform to enable required data processing dataeaze What?
  • 8. Static Reports are summary reports prepared for the purpose of giving status to decision makers Example Report for top management at end of day specifying daily sales, transactions, revenue, total traffic dataeaze
  • 9. Nature of data processing Static reports are Scheduled to execute at fixed time interval, Generate analysis reports for given time period, Can execute on raw data directly or on intermediate store dataeaze Static Reports
  • 10. Expectations from data platform Scheduled data processing Static reports are executed at predefined schedule repeatedly Timely arrival of data Generated reports should represent complete picture of given timeframe, and should be generated before deadline. Process raw data to get result Capability to generate report from raw data if it cannot be extracted from intermediate data form dataeaze Static Reports
  • 11. Dashboard Reports Dashboard is reporting user interface where user can interactively choose his own view of data with limited set of filters. Example An e-commerce company having dashboard for sellers where sellers get to know how much inventory sold across demographic, across product categories, across time range. dataeaze
  • 12. Nature of data processing Periodically keep on processing raw data to bring it in form required by dashboards Populate transformed data into interactive store backend of dashboards dataeaze Dashboard
  • 13. Expectations from data platform ETL To convert raw data in format required by dashboard Scheduled data processing Timely repeated executions of ETL jobs to populate dashboards with latest updates Interactive data store Dashboard reports are interactive in nature, so backend store is supposed to return results in near real time dataeaze Dashboard
  • 14. Ad Hoc data analysis This is for business queries which are raised as per need, This is not scheduled and is executed one time whenever necessary Example A product manager wanting to know detail analysis about customer behavior on a navigation panel, so as to define optimised ad placements. dataeaze
  • 15. Nature of data processing Steps to serve an ad hoc report, Identify data sources which will satisfy given request Execute data processing (preferable sql like query) on identified source Load results in data representation tool dataeaze Ad Hoc
  • 16. Ad Hoc: Expectations from data platform. Data processing SQL engine: a SQL query engine makes it easy to express the required analysis as a SQL query and saves the analyst's time. Complex data processing: a platform which supports writing custom complex data analysis that is not possible through SQL.
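To show why a SQL engine suits ad hoc questions, here is a small sketch using Python's built-in `sqlite3` module as a stand-in for the platform's query engine. The `clicks` table, its columns, and the sample rows are hypothetical; they loosely mirror the navigation panel example.

```python
import sqlite3

# In-memory database standing in for the identified data source.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE clicks (user_id TEXT, panel_item TEXT)")
conn.executemany(
    "INSERT INTO clicks VALUES (?, ?)",
    [("u1", "home"), ("u2", "home"), ("u1", "deals"), ("u3", "home")],
)

# One-off question: which navigation panel items get the most clicks?
rows = conn.execute(
    "SELECT panel_item, COUNT(*) AS n_clicks "
    "FROM clicks GROUP BY panel_item "
    "ORDER BY n_clicks DESC, panel_item"
).fetchall()
print(rows)
```

The analyst writes only the query; no job has to be developed or scheduled for a question that may never be asked again.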
  • 17. BI Reporting. Business Intelligence tools provide advanced general-purpose dashboards which host a wide array of dimensions in the backend data store. Users can define and save transformations and analysis queries through the BI tool and get back reports in tabular or graphical form. Example: a BI report representing weekly sales stats across multiple regions for the previous 6 months. This report is created and saved once; users execute the saved report whenever they want.
  • 18. BI Reporting: Nature of data processing. Scheduled ETL jobs convert raw data to the required intermediate data form. Data is loaded into interactive SQL data stores. BI tools are connected to the SQL data store as their backend.
  • 19. BI Reporting: Expectations from data platform. ETL: raw data should be transformed to the required format and loaded into a SQL data warehouse. Scheduling of ETL: defined ETL jobs should be scheduled to execute at a fixed time interval. Data processing SQL engine: a SQL query engine makes it easy to extract data and saves time; BI tools can connect to this SQL data store.
  • 20. Data Processing for Applications. This is data processing done to provide feedback input to business applications, which take better decisions based on the latest data feedback. Example: ad servers being periodically updated with the latest minimum eCPM to expect for an ad placement that is filled dynamically.
  • 21. App data processing: Nature of data processing. Complex data processing (machine learning) on raw data. Scheduled data processing. Results are written to an interactive key-value store, from which applications fetch them directly.
  • 22. App data processing: Expectations from data platform. Capability to implement custom complex data processing: users should be able to easily define custom complex data processing algorithms (such as machine learning). Scheduled data processing: required for periodic execution of data processing jobs.
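The application feedback loop above can be sketched as a periodic job that recomputes a value per key and writes it to a key-value store. The example below substitutes a simple minimum over recent fills for the machine learning step, uses a plain dictionary as the key-value store, and invents the placement names and eCPM values; all of these are assumptions for illustration only.

```python
# Hypothetical recent ad fills: (placement, eCPM paid).
recent_fills = [
    ("top_banner", 2.5),
    ("top_banner", 1.8),
    ("sidebar", 0.9),
    ("sidebar", 1.1),
]


def refresh_min_ecpm(fills, kv_store):
    """Recompute the minimum eCPM seen per placement and publish it."""
    latest = {}
    for placement, ecpm in fills:
        latest[placement] = min(ecpm, latest.get(placement, ecpm))
    kv_store.update(latest)  # applications read these keys directly
    return kv_store


kv_store = {}  # stand-in for an interactive key-value store
refresh_min_ecpm(recent_fills, kv_store)  # run periodically by a scheduler
print(kv_store["top_banner"])
```

An ad server would then look up `kv_store[placement]` at request time instead of computing anything itself.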
  • 23. Real time stream data processing. This means analysing an event as soon as it happens; the sooner the analysis, the greater the value obtained from it. Example: a stock ticker displayed on Yahoo Finance.
  • 24. Real time stream: Nature of data processing. As soon as an event happens, its log entry is collected. All log entries are buffered and made available to the processing layer. The processing layer pulls records from the message buffer and processes them.
  • 25. Real time stream: Expectations from data platform. Scalable message buffer: a message buffer that holds received messages until they are pulled for processing. Real time stream processing engine: to pull and process records in real time, and to give users the ability to define custom data processing.
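The collect, buffer, and pull pattern above can be sketched in a single process with Python's standard `queue` module. Here `queue.Queue` stands in for a scalable message buffer, and `collect`, `process_available`, `update_ticker`, and the tick values are hypothetical names and data chosen for the example.

```python
import queue

# Stand-in for the message buffer between collection and processing.
buffer = queue.Queue()


def collect(event):
    """Collector side: push each event into the buffer as it happens."""
    buffer.put(event)


def process_available(handler):
    """Processing side: pull every buffered record and apply custom logic."""
    results = []
    while True:
        try:
            event = buffer.get_nowait()
        except queue.Empty:
            break
        results.append(handler(event))
    return results


latest_price = {}


def update_ticker(event):
    """User-defined processing: keep the latest price per symbol."""
    symbol, price = event
    latest_price[symbol] = price
    return symbol


for tick in [("ACME", 10.0), ("ACME", 10.5), ("XYZ", 7.2)]:
    collect(tick)

processed = process_available(update_ticker)
print(latest_price)
```

The buffer decouples producers from consumers, so events can keep arriving even when processing briefly falls behind.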
  • 26. Let us take a look at the superset of expectations across all use cases.
  • 27. Summarise all: Expectations from data platform across all use cases.
  • 28. Summarise all: Superset of expectations. Expectation / Capability: complex data analysis using a query language; scheduled ETL data processing; data store for interactive data analysis; data ingestion with timely arrival of data; scalable message buffer to be consumed by stream data processing; streaming data processing platform. Needed by (use case): static reports; ad hoc data analysis; BI reporting; dashboard reports; app-specific data processing; real time stream data processing.
  • 30. Conclude. We have identified a common set of features expected from a data platform by most analytics use cases. Let us map these to data platform components.
  • 31. Conclude: Capabilities provided by data platform components. Expectation / Capability: complex data analysis using a query language; scheduled ETL data processing; data store for interactive data analysis; data ingestion with timely arrival of data; scalable message buffer to be consumed by stream data processing; streaming data processing platform. Supported by (data platform component, with data platform tools): data ingestion (Flume, Kafka, Scribe); batch data processing (Hive, MapReduce); workflow scheduler (Oozie); interactive data stores (HBase, Spark, ..); message buffers (Kafka); real time stream engine (Storm, Spark).
  • 32. Conclude: Data platform components satisfying expectations.
  • 33. Conclude: Going backwards. Now you know about data platform components, the capabilities those components support, and the features of analytics use cases that those capabilities satisfy.
  • 34. Thank You. Get in touch with us at contactus@dataeaze.io