Big Data Analytics
from the Rich Cloud
to the Frugal Edge
Inaugural lecture by Ahmed Awad
Table of Contents
• Success of Data-driven analytics
• Characteristics of Big Data (Processing Systems)
• Challenges for Current Big Data Processing Systems
• Directions for Solutions
28.10.2021 | BDA from the Rich Cloud to the Frugal Edge
Success of Data-driven Analytics
Source: Gartner
Pillars for Data-driven Analytics Success
Algorithms: since 1980s
Computing power: Moore’s law
Data Generation Evolution
Characteristics of Big Data
Supporting Architecture
Correlation between BD and CC
[Chart: two series, "Cloud computing" and "Big data"; vertical axis 0 to 120; horizontal axis quarterly from 2010-10 to 2021-10]
Computing Clusters
Big Data Analytics (BDA) Landscape
IoT: the Killer Application for BDA
Why?
• Latency
• Privacy
• Distribution
Let’s take an example
Location 1, Location 2, Location 3
Cloud Data Center/Cluster
Collect raw data, split it by location, process, aggregate, compute, store results
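The centralized flow on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the readings, the `split_by_location` and `aggregate` helpers, and the choice of count and mean as the aggregate are all hypothetical and do not come from the lecture.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical raw readings as (location, value) pairs, as they might
# arrive at a central cluster from devices at three locations.
raw_readings = [
    ("Location 1", 21.0),
    ("Location 2", 19.5),
    ("Location 1", 23.0),
    ("Location 3", 30.0),
    ("Location 2", 20.5),
]


def split_by_location(readings):
    """Group raw values by the location they came from."""
    groups = defaultdict(list)
    for location, value in readings:
        groups[location].append(value)
    return dict(groups)


def aggregate(groups):
    """Compute one summary (count and mean) per location."""
    return {
        location: {"count": len(values), "mean": mean(values)}
        for location, values in groups.items()
    }


# "Store results": here the results are simply kept in a dictionary.
results = aggregate(split_by_location(raw_readings))
```

Every raw reading travels to the cluster before any work happens, which is the property the later slides question on latency and privacy grounds.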
Beyond cloud revolution
• Data networks are growing in size
• Applications become data-intensive
• Data still needs to be gathered in centralized data centers
Data Infrastructure
Source: KEROS
Location 1, Location 2, Location 3
Cloud Data Center/Cluster
Collect anonymized raw data, split it by location, process, aggregate, compute, store results
Paradigm Shift
IoT-Edge-Fog-Cloud Network Architecture
Resources, availability, fault tolerance, latency
Edge
Privacy, locality, heterogeneity, mobility?
[Diagram: layered architecture with Devices, Edge Layers E1 to E3, Fog Layers F1 and F2, and a Cloud Layer]
Each edge layer node: Local Registry, App Manager, Node Manager, State, State/Registry Propagation/Reception, Device Manager, App Containers, Data Receiver
Each fog layer node: Local Registry, App Manager, Node Manager, State, State/Registry Propagation/Reception, Subordinate Manager, App Containers, Data Receiver
Cloud layer node: Local Registry, App Manager, Node Manager, State, Subordinate Manager, App Containers, Data Receiver
Also shown: Master Service, RESTful API
Labels: Location 1, Location 2, Location 3, Locations 2, 3
From Big Data Vs to Edge Us
Source: KEROS
Challenges of Analytics on the Edge
• Application development
• Data identification
• Deployment
• Operator implementation
• Security
Application Development
• QoS and location awareness
  • Unevenness
  • Unboundedness
  • Unchartedness
• Migration constraints
  • Unevenness
  • Unstability
• Semantic representation of data sources and operators
  • Unchartedness
Source: https://ieeexplore.ieee.org/iel7/7578983/7579346/07579390.pdf
Data Identification
• Data Fabric
• Knowledge Graphs
• Data catalog
• Device (source) resolution
  • Unchartedness
• Communication protocol resolution
  • Unchartedness
Deployment
• A decentralized scheduler
  • Leverage the hierarchical nature of the network
• Continuous monitoring
• Autonomy of workers
• A logical-to-physical resolution mechanism
  • DNS-like
  • URI-like
• FaaS and microservices for operators
• Sharing: state, operator
• Operator migration
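One way to picture a DNS-like, logical-to-physical resolution mechanism is a lookup that starts at a node's local registry and walks up the hierarchy on a miss. The sketch below assumes this behaviour; the `Node` class, the URI-style logical names, and the addresses are hypothetical and are not taken from the lecture or from any particular system.

```python
class Node:
    """A node in an edge/fog/cloud hierarchy with a local registry."""

    def __init__(self, name, parent=None):
        self.name = name
        self.parent = parent      # next node towards the cloud, if any
        self.registry = {}        # logical name -> physical address

    def register(self, logical_name, address):
        """Record where a logical source or operator physically lives."""
        self.registry[logical_name] = address

    def resolve(self, logical_name):
        """Resolve locally first, then ask ancestors, as DNS resolvers do."""
        node = self
        while node is not None:
            if logical_name in node.registry:
                return node.registry[logical_name]
            node = node.parent
        return None  # unknown everywhere on the path to the root


# Hypothetical three-level deployment: edge E1 -> fog F1 -> cloud.
cloud = Node("cloud")
fog1 = Node("F1", parent=cloud)
edge1 = Node("E1", parent=fog1)

cloud.register("operator://aggregate/global", "10.0.0.1:9000")
fog1.register("operator://aggregate/location-1", "10.1.0.1:9000")
edge1.register("sensor://location-1/temp", "192.168.1.20:1883")
```

Here `edge1.resolve("sensor://location-1/temp")` is answered locally, while `edge1.resolve("operator://aggregate/global")` is answered by the cloud node. Names registered only at the edge stay invisible to the cloud unless they are propagated, which is one possible reading of the locality concern raised earlier in the deck.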
Operator Implementation
• ML:
  • Embrace learning paradigms that fit the distributed nature of the data (Federated Learning)
  • Embrace ML models that learn from data streams (volatility of data value, concept drift, limited computing resources)
  • Reinforcement learning to underpin automated ML
• Analytical:
  • Embrace data sketches (trade accuracy for lower latency and lower overhead)
• Native versus containerized implementation
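To make the data-sketch trade-off concrete, here is a minimal Count-Min sketch, a well-known structure that estimates item frequencies in fixed memory. It is offered as one example of a data sketch, not as the structure the lecture has in mind; the width, depth, and SHA-256-based hashing are illustrative choices.

```python
import hashlib


class CountMinSketch:
    """Fixed-size frequency estimator: may overestimate, never underestimates."""

    def __init__(self, width=256, depth=4):
        self.width = width
        self.depth = depth
        self.table = [[0] * width for _ in range(depth)]

    def _index(self, item, row):
        # Derive one hash function per row by prefixing the row number.
        digest = hashlib.sha256(f"{row}:{item}".encode("utf-8")).digest()
        return int.from_bytes(digest[:8], "big") % self.width

    def add(self, item, count=1):
        """Increment one counter per row for this item."""
        for row in range(self.depth):
            self.table[row][self._index(item, row)] += count

    def estimate(self, item):
        """The smallest of the item's counters is the least inflated one."""
        return min(
            self.table[row][self._index(item, row)] for row in range(self.depth)
        )

    def merge(self, other):
        """Combine with a sketch of identical shape, e.g. from another node."""
        if (self.width, self.depth) != (other.width, other.depth):
            raise ValueError("sketches must have the same width and depth")
        for row in range(self.depth):
            for col in range(self.width):
                self.table[row][col] += other.table[row][col]


# Two hypothetical edge nodes summarise their local streams.
edge_a, edge_b = CountMinSketch(), CountMinSketch()
for event in ["door", "door", "motion"]:
    edge_a.add(event)
for event in ["door", "motion", "motion", "motion"]:
    edge_b.add(event)

# A fog or cloud node merges the summaries instead of receiving raw events.
edge_a.merge(edge_b)
```

Memory stays at width times depth counters regardless of stream length, and because sketches of the same shape merge by addition, an upstream node can combine summaries without seeing the raw events.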
Computing Model
Osmotic computing
Osmotic Computing
• In chemistry, “osmosis” represents the seamless diffusion of molecules from a higher to a lower concentration solution.
• A fitting metaphor for the migration of operators in a deployment
• Osmotic computing implies the dynamic management of services and microservices across cloud, fog, and edge datacenters
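The metaphor can be illustrated with a toy migration rule in which operators drift from a node under high resource pressure to one under lower pressure. This rule is an assumption made purely for illustration: the `Node` dataclass, the `pressure` measure, and the `migrate_once` function are hypothetical and are not part of any osmotic computing framework.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    name: str
    capacity: float                                # abstract resource units
    operators: dict = field(default_factory=dict)  # operator name -> load

    @property
    def pressure(self):
        """Load relative to capacity: the 'concentration' in the metaphor."""
        return sum(self.operators.values()) / self.capacity


def migrate_once(source, target):
    """Move one operator from source to target if that narrows the pressure gap.

    Returns the name of the migrated operator, or None if no move helps.
    """
    gap = source.pressure - target.pressure
    if gap <= 0:
        return None  # nothing flows from lower to higher pressure
    # Try the smallest operators first so that each move stays cheap.
    for name, load in sorted(source.operators.items(), key=lambda kv: kv[1]):
        new_source = source.pressure - load / source.capacity
        new_target = target.pressure + load / target.capacity
        if abs(new_source - new_target) < gap:
            target.operators[name] = source.operators.pop(name)
            return name
    return None


# Hypothetical overloaded edge node next to a large, idle cloud node.
edge = Node("E1", capacity=2.0, operators={"filter": 1.0, "window": 1.5})
cloud = Node("cloud", capacity=100.0)

moved = migrate_once(edge, cloud)
```

In this run `moved` is `"filter"` and the edge node's pressure drops from 1.25 to 0.75. A realistic policy would also weigh latency, privacy, and data locality, which this sketch deliberately ignores.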
Osmotic Computing As Is
Osmotic Computing To Be
From Rich Cloud to Frugal Edge
Resources, availability, fault tolerance, latency
Edge
Privacy, locality, heterogeneity, mobility?
BDA/Cloud
BDA/Edge
Data sketches
Osmotic computing
FaaS
Native
Containers
Decentralization
Knowledge Graphs
Ad-hoc Cloud
Micro Services
Kubernetes
Start End
Summary
• Moving from cloud to edge
  • Bidirectional: Osmosis
  • Us in addition to Vs
• Data identification
  • Semantics is not a luxury
• Operator implementation
  • Native versus containerization
• Decentralization
  • Local decisions
37
28.10.2021 | BDA from the Rich Cloud to the Frugal Edge
38

More Related Content

Similar to Presentation V5.pptx

Roberto minerva 20181130
Roberto minerva 20181130  Roberto minerva 20181130
Roberto minerva 20181130 Roberto Minerva
 
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...HostedbyConfluent
 
New Design Patterns in Microservice Solutions
New Design Patterns in Microservice SolutionsNew Design Patterns in Microservice Solutions
New Design Patterns in Microservice SolutionsMichel Burger
 
Accelerating a Path to Digital with a Cloud Data Strategy
Accelerating a Path to Digital with a Cloud Data StrategyAccelerating a Path to Digital with a Cloud Data Strategy
Accelerating a Path to Digital with a Cloud Data StrategyMongoDB
 
IoT: An Introduction and Getting Started Session
IoT: An Introduction and Getting Started SessionIoT: An Introduction and Getting Started Session
IoT: An Introduction and Getting Started SessionDebasis Das
 
System Support for Internet of Things
System Support for Internet of ThingsSystem Support for Internet of Things
System Support for Internet of ThingsHarshitParkar6677
 
cncf overview and building edge computing using kubernetes
cncf overview and building edge computing using kubernetescncf overview and building edge computing using kubernetes
cncf overview and building edge computing using kubernetesKrishna-Kumar
 
FIWARE Global Summit - FogFlow Enabled Sharing across LoRa Applications
FIWARE Global Summit - FogFlow Enabled Sharing across LoRa ApplicationsFIWARE Global Summit - FogFlow Enabled Sharing across LoRa Applications
FIWARE Global Summit - FogFlow Enabled Sharing across LoRa ApplicationsFIWARE
 
EDGE SEMINAR.pptx
EDGE SEMINAR.pptxEDGE SEMINAR.pptx
EDGE SEMINAR.pptxSachuS16
 
Time to Talk about Data Mesh
Time to Talk about Data MeshTime to Talk about Data Mesh
Time to Talk about Data MeshLibbySchulze
 
Istio Service Mesh
Istio Service MeshIstio Service Mesh
Istio Service MeshLew Tucker
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsMapR Technologies
 
5G Edge Computing Whitepaper, FCC Advisory Council
5G Edge Computing Whitepaper, FCC Advisory Council5G Edge Computing Whitepaper, FCC Advisory Council
5G Edge Computing Whitepaper, FCC Advisory CouncilDESMOND YUEN
 
Ericsson Technology Review: Creating the next-generation edge-cloud ecosystem
Ericsson Technology Review: Creating the next-generation edge-cloud ecosystemEricsson Technology Review: Creating the next-generation edge-cloud ecosystem
Ericsson Technology Review: Creating the next-generation edge-cloud ecosystemEricsson
 
Cloud Camp Milan 2K9 Telecom Italia: Where P2P?
Cloud Camp Milan 2K9 Telecom Italia: Where P2P?Cloud Camp Milan 2K9 Telecom Italia: Where P2P?
Cloud Camp Milan 2K9 Telecom Italia: Where P2P?Gabriele Bozzi
 
CloudCamp Milan 2009: Telecom Italia
CloudCamp Milan 2009: Telecom ItaliaCloudCamp Milan 2009: Telecom Italia
CloudCamp Milan 2009: Telecom ItaliaGabriele Bozzi
 
PhD Proposal: Toward Open and Programmable Infrastructure for Smarter Wireles...
PhD Proposal: Toward Open and Programmable Infrastructure for Smarter Wireles...PhD Proposal: Toward Open and Programmable Infrastructure for Smarter Wireles...
PhD Proposal: Toward Open and Programmable Infrastructure for Smarter Wireles...Mostafa Uddin
 
presentation_these_141215
presentation_these_141215presentation_these_141215
presentation_these_141215Patrick Raad
 
DimenXional Cloud Technologies (slideshare)
DimenXional Cloud Technologies (slideshare)DimenXional Cloud Technologies (slideshare)
DimenXional Cloud Technologies (slideshare)Rick Goldstein
 

Similar to Presentation V5.pptx (20)

Roberto minerva 20181130
Roberto minerva 20181130  Roberto minerva 20181130
Roberto minerva 20181130
 
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
 
New Design Patterns in Microservice Solutions
New Design Patterns in Microservice SolutionsNew Design Patterns in Microservice Solutions
New Design Patterns in Microservice Solutions
 
Accelerating a Path to Digital with a Cloud Data Strategy
Accelerating a Path to Digital with a Cloud Data StrategyAccelerating a Path to Digital with a Cloud Data Strategy
Accelerating a Path to Digital with a Cloud Data Strategy
 
IoT: An Introduction and Getting Started Session
IoT: An Introduction and Getting Started SessionIoT: An Introduction and Getting Started Session
IoT: An Introduction and Getting Started Session
 
System Support for Internet of Things
System Support for Internet of ThingsSystem Support for Internet of Things
System Support for Internet of Things
 
cncf overview and building edge computing using kubernetes
cncf overview and building edge computing using kubernetescncf overview and building edge computing using kubernetes
cncf overview and building edge computing using kubernetes
 
FIWARE Global Summit - FogFlow Enabled Sharing across LoRa Applications
FIWARE Global Summit - FogFlow Enabled Sharing across LoRa ApplicationsFIWARE Global Summit - FogFlow Enabled Sharing across LoRa Applications
FIWARE Global Summit - FogFlow Enabled Sharing across LoRa Applications
 
EDGE SEMINAR.pptx
EDGE SEMINAR.pptxEDGE SEMINAR.pptx
EDGE SEMINAR.pptx
 
Time to Talk about Data Mesh
Time to Talk about Data MeshTime to Talk about Data Mesh
Time to Talk about Data Mesh
 
The Great IT Migration
The Great IT MigrationThe Great IT Migration
The Great IT Migration
 
Istio Service Mesh
Istio Service MeshIstio Service Mesh
Istio Service Mesh
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
 
5G Edge Computing Whitepaper, FCC Advisory Council
5G Edge Computing Whitepaper, FCC Advisory Council5G Edge Computing Whitepaper, FCC Advisory Council
5G Edge Computing Whitepaper, FCC Advisory Council
 
Ericsson Technology Review: Creating the next-generation edge-cloud ecosystem
Ericsson Technology Review: Creating the next-generation edge-cloud ecosystemEricsson Technology Review: Creating the next-generation edge-cloud ecosystem
Ericsson Technology Review: Creating the next-generation edge-cloud ecosystem
 
Cloud Camp Milan 2K9 Telecom Italia: Where P2P?
Cloud Camp Milan 2K9 Telecom Italia: Where P2P?Cloud Camp Milan 2K9 Telecom Italia: Where P2P?
Cloud Camp Milan 2K9 Telecom Italia: Where P2P?
 
CloudCamp Milan 2009: Telecom Italia
CloudCamp Milan 2009: Telecom ItaliaCloudCamp Milan 2009: Telecom Italia
CloudCamp Milan 2009: Telecom Italia
 
PhD Proposal: Toward Open and Programmable Infrastructure for Smarter Wireles...
PhD Proposal: Toward Open and Programmable Infrastructure for Smarter Wireles...PhD Proposal: Toward Open and Programmable Infrastructure for Smarter Wireles...
PhD Proposal: Toward Open and Programmable Infrastructure for Smarter Wireles...
 
presentation_these_141215
presentation_these_141215presentation_these_141215
presentation_these_141215
 
DimenXional Cloud Technologies (slideshare)
DimenXional Cloud Technologies (slideshare)DimenXional Cloud Technologies (slideshare)
DimenXional Cloud Technologies (slideshare)
 

Recently uploaded

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 

Recently uploaded (20)

Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 

Presentation V5.pptx

  • 1. Big Data Analytics from the Rich Cloud to the Frugal Edge Inaugural lecture by Ahmed Awad 1
  • 2. Table of Contents • Success of Data-driven analytics • Characteristics of Big Data ( Processing Systems) • Challenges for Current Big Data Processing Systems • Directions for Solutions 2 28.10.2021 | BDA from the Rich Cloud to the Frugal Edge
  • 3. Success of Data-driven Analytics 3 Source: Gartner 28.10.2021 | BDA from the Rich Cloud to the Frugal Edge
  • 4. Pillars for Data-driven Analytics Success 4 Algorithms: since 1980s Computing power: Moore’s law 28.10.2021 | BDA from the Rich Cloud to the Frugal Edge
  • 5. Data Generation Evolution 5 28.10.2021 | BDA from the Rich Cloud to the Frugal Edge
  • 6. Characteristics of Big Data 6 28.10.2021 | BDA from the Rich Cloud to the Frugal Edge
  • 7. Supporting Architecture 7 28.10.2021 | BDA from the Rich Cloud to the Frugal Edge
  • 8. Correlation between BD and CC 8 0 20 40 60 80 100 120 2010-10 2011-01 2011-04 2011-07 2011-10 2012-01 2012-04 2012-07 2012-10 2013-01 2013-04 2013-07 2013-10 2014-01 2014-04 2014-07 2014-10 2015-01 2015-04 2015-07 2015-10 2016-01 2016-04 2016-07 2016-10 2017-01 2017-04 2017-07 2017-10 2018-01 2018-04 2018-07 2018-10 2019-01 2019-04 2019-07 2019-10 2020-01 2020-04 2020-07 2020-10 2021-01 2021-04 2021-07 2021-10 Cloud computing Big data 28.10.2021 | BDA from the Rich Cloud to the Frugal Edge
  • 9. Computing Clusters 9 28.10.2021 | BDA from the Rich Cloud to the Frugal Edge
  • 10. Big Data Analytics (BDA) Landscape 10 28.10.2021 | BDA from the Rich Cloud to the Frugal Edge
  • 11. 11
  • 12. IoT the Killer Application for BDA 12 Why? • Latency • Privacy • Distribution 28.10.2021 | BDA from the Rich Cloud to the Frugal Edge
  • 13. Let’s take an example 13 Location 1 Locatio n 2 Location 3 Cloud Data Center/Cluster Collect raw data, split it by location, process, aggregate, compute, store results 28.10.2021 | BDA from the Rich Cloud to the Frugal Edge
  • 14. Beyond cloud revolution • Data networks are growing in size • Applications become data- intensive • Data still needs to be gathered in centralized data centers Data Infrastructure Source: KEROS 14 28.10.2021 | BDA from the Rich Cloud to the Frugal Edge
  • 15. Location 1 Locati on 2 Location 3 Cloud Data Center/Cluster Collect anonymized raw data, split it by location, process, aggregate, compute, store results 15 28.10.2021 | BDA from the Rich Cloud to the Frugal Edge
  • 17. IoT-Edge-Fog-Cloud Network Architecture Resources, availability, fault tolerance, latency, Edge Privacy, locality, Heterogeneity, mobility? 17 28.10.2021 | BDA from the Rich Cloud to the Frugal Edge
  • 18. Devices Devices Devices Edge Layer E3 Local Registry App Manager Node Manager State State/Registry Propagation/Recepti on Device Manager App Container App Container App Containers Data Receiver Edge Layer E2 Local Registry App Manager Node Manager State State/Registry Propagation/Recepti on Device Manager App Container App Container App Containers Data Receiver Edge Layer E1 Local Registry App Manager Node Manager State State/Registry Propagation/Recepti on Device Manager App Container App Container App Containers Data Receiver Fog Layer F2 Local Registry App Manager Node Manager State State/Registry Propagation/Recepti on Subordinate Manager App Container App Container App Containers Data Receiver Fog Layer F1 Local Registry App Manager Node Manager State State/Registry Propagation/Recepti on Subordinate Manager App Container App Container App Containers Data Receiver Cloud Layer Local Registry App Manager Node Manager State Subordinate Manager App Container App Container App Containers Data Receiver Master Service Restful API Location 3 Location 2 Location 1 Locations 2, 3 18
  • 19. [Architecture diagram] Labels: Devices at Location 1, Location 2, and Location 3; Edge Layer nodes E1, E2, E3; Fog Layer nodes F1, F2; Cloud Layer with Master Service and RESTful API. Edge and fog nodes each show: Local Registry, App Manager, Node Manager, State, State/Registry Propagation/Reception, App Containers, Data Receiver, plus a Device Manager (edge) or a Subordinate Manager (fog). The cloud node shows: Local Registry, App Manager, Node Manager, State, Subordinate Manager, App Containers, Data Receiver. Deployment target: Locations 2, 3. 10/16/2023 BDA from the Rich Cloud to the Frugal Edge
  • 20. [Same architecture diagram as slide 19, deployment target Locations 2, 3, with added markers: 3 3 2 2]
  • 21. [Same architecture diagram as slide 19, deployment target Locations 2, 3, with added markers: 2 2 3 3]
  • 22. 22
  • 23. From Big Data Vs to Edge Us Source: KEROS
  • 24. Challenges of Analytics on the Edge • Applications development • Data identification • Deployment • Operator implementation • Security
  • 25. Application Development • QoS and location awareness • Unevenness • Unboundedness • Unchartedness • Migration constraints • Unevenness • Unstability • Semantic representation of data sources and operators • Unchartedness Source: https://ieeexplore.ieee.org/iel7/7578983/7579346/07579390.pdf
  • 26. Data Identification • Data Fabric • Knowledge Graphs • Data catalog • Device (source) resolution • Unchartedness • Communication protocol resolution • Unchartedness
  • 27. Deployment • A decentralized scheduler • Leverage the hierarchical nature of the network • Continuous monitoring • Autonomy of workers • A logical-to-physical resolution mechanism • DNS-like • URI-like • FaaS and microservices for operators • Sharing: state, operator • Operator migration
  • 28. Operator Implementation • ML: • Embrace learning paradigms that fit the distributed nature of the data (Federated Learning) • Embrace ML models that learn from data streams (volatility of data value, concept drift, computing resource limitations) • Reinforcement learning to underpin automated ML • Analytical: • Embrace data sketches (trade accuracy for lower latency and lower overhead) • Native versus containerized implementation
  • 29. Computing Model Osmotic computing
  • 30. Osmotic Computing • In chemistry, “osmosis” represents the seamless diffusion of molecules from a higher to a lower concentration solution. • A fitting metaphor for the migration of operators in a deployment • Osmotic computing implies the dynamic management of services and microservices across cloud, fog, and edge datacenters
  • 31. Osmotic Computing As Is
  • 32. Osmotic Computing To Be
  • 33. From Rich Cloud to Frugal Edge • Resources, availability, fault tolerance, latency • Edge: privacy, locality, heterogeneity, mobility?
  • 34. 34
  • 36. Data sketches Osmotic computing FaaS Native Containers Decentralization Knowledge Graphs Ad-hoc Cloud Micro Services Kubernetes Start End
  • 37. Summary • Moving from cloud to edge • Bidirectional: Osmosis • Us in addition to Vs • Data identification • Semantics is not a luxury • Operator implementation • Native versus containerization • Decentralization • Local decisions
  • 38. 38
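
Slide 28 recommends data sketches, which trade accuracy for lower latency and lower overhead. As an illustration only, here is a minimal Count-Min sketch in Python. The talk does not name a specific sketch; the choice of Count-Min, the class name `CountMinSketch`, the `width` and `depth` defaults, and the sensor names are all assumptions made for this example.

```python
import hashlib


class CountMinSketch:
    """Approximate frequency counter using a fixed width * depth table."""

    def __init__(self, width=256, depth=4):
        # width and depth are illustrative defaults, not tuned values.
        self.width = width
        self.depth = depth
        self.table = [[0] * width for _ in range(depth)]

    def _index(self, item, row):
        # Derive one hash function per row by salting the item with the row id.
        digest = hashlib.sha256(f"{row}:{item}".encode("utf-8")).digest()
        return int.from_bytes(digest[:8], "big") % self.width

    def add(self, item, count=1):
        for row in range(self.depth):
            self.table[row][self._index(item, row)] += count

    def estimate(self, item):
        # Collisions can only inflate a counter, so the minimum over the
        # rows never underestimates the true count.
        return min(
            self.table[row][self._index(item, row)] for row in range(self.depth)
        )


# Hypothetical stream of readings keyed by sensor id.
sketch = CountMinSketch()
readings = ["sensor-a"] * 50 + ["sensor-b"] * 20 + ["sensor-c"] * 5
for reading in readings:
    sketch.add(reading)

estimate_a = sketch.estimate("sensor-a")
```

Memory use stays fixed at `width * depth` counters no matter how many distinct keys arrive, which is the property that makes such structures attractive on resource-constrained edge nodes. The price is that estimates may exceed the true count when keys collide.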

Editor's Notes

  1. Hello everyone and welcome to my talk. Today, I will be talking about big data analytics from the rich cloud to the frugal edge.
  2. I will start with what I see as the pillars behind the success of the data-driven analytics we are witnessing nowadays, followed by the characteristics of big data and the architectures of the systems that process it. Next, I’ll discuss the new challenges and new types of applications that call for new architectures and systems. At the end, I will share a vision towards realizing these architectures and systems.
  3. We are living in a big data era that unleashes our ability to explore sophisticated services affecting every aspect of our lives. Machine learning (ML), a leading example of this data era, has revolutionized business verticals over the past decade. In a way, “Data-driven organizations” have appeared as a planning and management style where decisions are solely driven by data-based evidence and analysis.
  4. In my view, there are three pillars underpinning this success: ML algorithms, the computing hardware, and the fuel for these machines, the data. Notably, deep learning has witnessed leaps in prediction and classification accuracy due to: (i) advanced algorithms, (ii) powerful hardware, and (iii) an explosive growth of collected data. These three pillars are the foundation for big data analytics, which has been leading research and industry practices since the mid-2000s.
  5. Over time, digital data generation has changed in volume, structure, and speed of generation: from structured data generated by ERP systems, to semi-structured data produced by, for example, office-activity support, to totally free-form content at web scale.
  6. These are the main characteristics of what is known as big data. Volume describes amounts of data that need out-of-core computing capacity; variety called for beyond-relational models to store and query the data; and velocity requires on-the-fly processing of the data, unlike the traditional store-then-process approach.
  7. So, to cope with these big data characteristics, distributed and parallel processing architectures are the natural choice. Notably, horizontal scaling proved more successful than vertical scaling. That is, we split the data over more computing machines, each portion of the data is processed in parallel, and then a consolidation step follows. This fits well with the MapReduce computing model. The main principle of computing here is to move the processing logic (a few kilobytes) to where the data reside (gigabytes, terabytes, or petabytes).
  8. In the meantime, around the mid-2000s, cloud computing was evolving as a new pay-as-you-go computing model.
  9. Public data centers have appeared around the globe, offering different levels of control over how to set up your data clusters.
  10. We now have a huge number of different big data analytics systems and services, with more being added constantly. However, the model is still centralized in the end. That is, you have to send the data from wherever they are generated to the cloud for analysis. This might be acceptable for data that are generated at a low rate and owned by the organization.
  11. But, with unprecedented data generation rates by virtually anyone, we cannot afford the latency of sending the data over the network to the cloud, nor can we afford the privacy threats. Velocity is now becoming the dominant processing challenge.
  12. To make the challenge worse, humans are no longer the only data generators: IoT adds billions of data sources to the data sphere. With the increasing adoption of IoT, its applications are killer apps for BDA. Why? Because of the nature of the data generators and the rate at which data are generated. Before, data generation was mainly human-driven. Now, with sensors, devices, and so on, we have machines generating data at unprecedented rates. This makes the round trip from the data source to the processing and back unacceptably slow when we are talking about a few milliseconds.
  13. Here, we take an application like Google Maps. All GPS updates are sent to the cloud for analysis. Besides the delays in updates about the traffic status, there are also privacy issues, at the very least.
  14. But the data communication infrastructure has grown into layers following the capabilities of the network. It is possible to put computing and storage capabilities at the different layers. This has given rise to terms like fog and edge computing. A prominent example is mobile-edge computing, where processing is offloaded from edge devices like mobile phones to nearby servers at the edge of the network.
  15. Back to our traffic data example: it makes more sense, and also keeps the traffic data more recent, if we decentralize and further localize the sharing and processing of the data. We still might need to send data to the cloud, but at this stage we can send summaries, for which delays are acceptable. Add reference to big data 2018 paper
  16. Such applications require another paradigm shift in big data analytics systems. This links to the way processing pipelines are defined, deployed and maintained.
  17. Looking at the network hierarchy, we can conceptually divide it into at least three layers: cloud, fog, and edge. Computing capacity and fault tolerance are virtually unlimited at the topmost layer. But, since that layer is farther away from data generation, latency is also very high. On the other hand, as we move closer to the data generators, we can further localize data processing and guarantee higher privacy. However, we are faced with challenges about the mobility of data generators, and possibly data processors. Additionally, we are faced with the heterogeneity of the processing infrastructure and its instability.
  18. Now, this is a visionary scenario where we want to deploy an analytics job, symbolized by the pipeline on the top left. This could be a job that analyzes video images along with some other sensor readings. It has three operators: a filter for the data, an aggregation, and then storing the data. The rest of the figure is our network. Moreover, the application should be deployed on locations 2 and 3 only.
  19. Here, operator placement takes place: the storage part remains at the cloud, and the rest of the pipeline is routed to the relevant part of the network.
  20. Due to the workload on the network, a decision is made to host the aggregation for location 3 on the fog layer, whereas the filter and aggregation for location 2 are pushed further down, closer to the data, as is the filter operator of location 3. This could be due to resource limitations at edge node E3.
  21. Within the same job, due to the changing workload on the different nodes, a migration of the operators takes place. Such a decision should be made in a decentralized way. That is, local nodes should reach the decision without consulting the cloud. We will refer to this as decentralized scheduling. Moreover, such migration should be reactive. Additionally, the planning and placement of operators should be learned.
  22. If we look at Gartner’s Hype Cycle, we can notice decentralized web, edge analytics, and edge AI on it. Edge AI here serves both service provisioning and internal use by the scheduler.
  23. So, as we deploy beyond the cloud, we bring in more challenges. We are moving from the Vs of big data to the Us of edge computing. Unboundedness: volume, velocity, and variety, but also the deployment landscape. Unevenness: unlike cloud computing, not all workers have comparable computing resources. Unstable: mobility of workers, network stability, autonomy of the workers. Uncharted: there is no global view, and the topology is continuously changing. Unsafe: veracity of the data and communication.
  24. We need to give app developers the means to describe their data needs and processing logic. Ideally, they specify what they need and how tolerant they are of processing delays, in the form of QoS constraints. The system should provide abstract means to identify data sources and a semantic approach to describing the data in a way that allows semi-automated discovery of data sources. Such jobs are of a streaming nature and could be long-running, so the deployment should account for a much less controllable environment. Unlike computing clusters, the deployment landscape is more open, more heterogeneous, and less secure. Moreover, computing resources are not owned by the application developer. As parts of the application might be deployed on resource-constrained devices, we should not treat all implementations of the operators the same. That is, we should consider cases where succinct data structures with approximate results are favored over exact ones in order to cope with resource limitations. Last but not least comes the security of the execution and the protection of the exchanged data.
  25. So, we can see that this is partially addressed. But, in an open environment, the runtime system should be responsible for resolving the exact source address. Moreover, the types of operators and data are predefined, whereas they should be subject to continuous discovery. Last but not least, what should the migration constraints be?
  26. To cope with the instability of the network.
  27. One backup slide about data sketches
  28. So, we moved from cloud to fog and then to edge computing. We need an abstraction to describe the lifecycle of our applications across these layers.
  29. Also, we can have service migration across the same level of fog.
  30. Low-cost single-board computers at the edge, embedded AI, and data fabric.
  31. So, which path to go? Actually, there is no single path. And, there are lots of technologies available. We just need to connect the dots and fill some gaps.
  32. There are several design decisions, and no one size fits all.
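
Notes 18 to 20 describe placing a filter, aggregate, store pipeline across edge, fog, and cloud nodes, with storage kept at the cloud and the other operators pushed as close to the data as resources allow. The following Python sketch illustrates that idea with a simple greedy rule. It is a centralized simplification for illustration, not the decentralized, learned scheduler the talk argues for. The function `place_pipeline`, the capacity units, the operator costs, and the pairing of edge nodes with fog nodes (E2 with F1, E3 with F2) are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    name: str
    layer: str        # "edge", "fog", or "cloud"
    free_units: int   # hypothetical abstract capacity units
    hosted: list = field(default_factory=list)


def place_pipeline(operators, path):
    """Greedily place operators on a path ordered from edge up to cloud.

    operators: list of (name, cost, pinned_layer or None) in pipeline order.
    path: nodes ordered from closest to the data source up to the cloud.
    Returns a dict mapping operator name to node name.
    """
    placement = {}
    start = 0  # an operator never sits below its upstream operator
    for op_name, cost, pinned in operators:
        for i in range(start, len(path)):
            node = path[i]
            if pinned is not None and node.layer != pinned:
                continue
            if node.free_units >= cost:
                node.free_units -= cost
                node.hosted.append(op_name)
                placement[op_name] = node.name
                start = i
                break
        else:
            raise RuntimeError(f"no node on the path can host {op_name}")
    return placement


# Hypothetical costs; storage is pinned to the cloud as in note 19.
pipeline = [("filter", 1, None), ("aggregate", 2, None), ("store", 1, "cloud")]

cloud = Node("Cloud", "cloud", free_units=100)
e2, f1 = Node("E2", "edge", 3), Node("F1", "fog", 4)
e3, f2 = Node("E3", "edge", 1), Node("F2", "fog", 4)  # E3 is resource-limited

loc2 = place_pipeline(pipeline, [e2, f1, cloud])
loc3 = place_pipeline(pipeline, [e3, f2, cloud])
```

With these made-up capacities, location 2 keeps both the filter and the aggregation at its edge node, while location 3 keeps only the filter at E3 and moves the aggregation up to the fog layer, mirroring the situation described in note 20. Migration, as in note 21, would amount to re-running a placement decision when the free capacity of a node changes.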