SlideShare a Scribd company logo
1 of 16
1
OBSERVABILITYand Intelligence
In Real time, at Scale
-By Balvinder Khurana & Sarang Shinde
2
Agenda
1. What is Observability
2. How it is different from Monitoring
3. Why do we need Observability
4. A typical observability pipeline
5. How observability enables intelligence
6. Comparison of tools
7. Questions
3
Discovering the information about System and User
Behaviour that leads to Customer and Business Impact.
4
Monitoring
What data does our system has
1. Facts
Things we are aware of and understand.
E.g . We are running spark jobs on ephemeral clusters.
1. Hypothesis
Things we are aware of but don’t understand
E.g. The VMs were preempted, causing user requests to fail.
1. Assumptions
Things we understand but are not aware of.
E.g. Data flow job will be able to handle increasing load automatically
1. Discoveries
Things we are neither aware of nor understand.
E.g. User leaving from particular page of website too early this is happening
because the microservice pod serving that page restarts many times due to jvm
heap error and memory limit issue.
Observability
5
Explosion of Data Sources
Explosion of Requirements
6
Too many tools capture vital information
7
Typical sources, Insights and Roles
Store Data Point KPI derived (eg.) Personas
DataDog Infrastructure Metrics -Deployment status
-Request/Response time* (micrometer
integration)
-Downtime for Service/Kafka
-Load on a service /Load balancer information
-Users affected due to Infra issues
-Developers
-Client Technical Team
-Product Owners
Kafka Application Events -Number of User
-Orders Placed
-Orders returned
-Revenue Generated
-Price Sensitivity
-CXOs
-Data Scientists
-Analysts
-Product Owners
EFK Application Exceptions
DB Exceptions
UI EXceptions
-Applications affected due to System issues
-Causes of exceptions
-Developers
-Client Technical Team
-Operations
-Security Champions
Istio
Network Rules/Service
Mesh
-Routing
-Service Availability
-Service traceability
-Developers
-Security Team
GTM Click Stream
-Customer Behavior
-Issues(UI) faced by Customer
-Device information
-Developers
-Product Owners
-Data Scientists
Jenkins Value stream -Path to Production -Developers
8
Data Sources
VMs
Elastic
New Relic
Prometheus
Data dog
Mongo
RDBMS
S3
Istio
GTM
Omniture
Observability
Pipeline
Data Sinks
HDFS
S3
GCS
OLAPs
9
10
Why…
Standard Specs?
Lambda/Kappa?
Raw layer?
Compaction?
Partitioning?
Delta Lake?
11
12
Streaming Technologies
Spark Kafka Flink
Processing Model Micro Batch One Record at a time One Record at a time
Deployment Own cluster, supports YARN,
Mesos, or containers
Library that any Java
application can embed.
Own cluster, supports YARN,
Mesos, or containers
Life Cycle Stream processing code is
deployed and run as a job in
the Spark cluster
Stream processing code
runs inside their
application
Stream processing code is
deployed and run as a job in the
Flink cluster
Typically Owned By Data infrastructure or BI
team
BI queries Data infrastructure or BI team
Coordination Yes No Yes
Source of continuous data Kafka, File Systems, other
message queues
Strictly Kafka , Other data
out of Kafka is a problem
Kafka, File Systems, other
message queues
Bounded and Unbounded Data
Streams
Avro, Parquet, JSON, CSV, ORC Text, SequenceFile, RCFile,
ORC, Parquet
Avro, Parquet, JSON, CSV
Semantics Exactly-once end-to-end with
specific Source and Sink
Exactly-once end-to-end
with Kafka
Exactly-once end-to-end with
specific Source and Sink
13
Querying Tools
SparkSQL Presto Drill Druid
Can query petabytes of
Data
Yes Yes Yes No
Used for Complex math, statistics,
ML intensive tasks
BI queries BI queries BI and real time analytics on
event driven data
Fault Tolerance Yes No Yes Yes
In memory processing Yes Yes Yes No
Processing speed Slower than Presto and
Drill
Faster than SparkSQL Faster than SparkSQL Faster for specific type of
Queries than Spark,Preso
and Drill.
File formats Avro, Parquet, JSON, CSV,
ORC
Text, SequenceFile,
RCFile, ORC, Parquet
Avro, Parquet, JSON, CSV Avro, Parquet, JSON, CSV
Schema-free querying
support
Yes No Yes No
Supports ANSI SQL Yes Yes Yes Subset
JDBC / ODBC Support Yes Yes Yes Yes
Performance benefits Catalyst and Tungsten Vectorized columnar
processing
Columnar execution and
Vector Processing
Columnar time based
segments, bitmap indexing
14
Reporting Tools
Tableau Looker
Apache Superset
(Incubating) Pentaho Metabase
Visualizations Drag and drop, SQL Drag and Drop
Spark, SQL, basic drag
and drop Drag and Drop Drag and drop, SQL
Intuitiveness and
Usability
Intuitive, Interactive &
Easy to Use Easy to Use
Intuitive, Interactive &
Easy to Use
Comparatively less ease
of use Easy to Use
Databases Supported
Natively supports all
well-known databases
Most of well-known
databases
Fewer databases (Druid
& DBs supporting SQL
Alchemy)
Fewer databases (JDBC
Compliant DBs,
MongoDB)
Most of well-known
databases
Security and access
control
Kerberos, SSPI, SAML,
OpenID, Active
Directory, LDAP, Local
etc.
Google OAuth, LDAP,
SAML, OpenID Flask AppBuilder (FAB)
Pentaho Security, LDAP,
Single Sign-On, Active
Directory, Kerberos Google OAuth, LDAP
Self Service Visualization Yes Yes Yes Yes Yes
Pricing $245 per month
$3,000 – $5,000 per
month Free
Subscription-based
pricing models Free
Data Science and ML
Support Predictive Analysis Advance analytics Predictive Analysis Predictive Analysis Analytical
Other
Better support for
Advanced analytics
and corresponding
data visualisation No support for OLAP
Advanced analytics is not
as mature as Tableau Easy setup and usability
15
Build it Incrementally.
16
Questions ?

More Related Content

What's hot

Big Data Day LA 2016/ NoSQL track - Architecting Real Life IoT Architecture, ...
Big Data Day LA 2016/ NoSQL track - Architecting Real Life IoT Architecture, ...Big Data Day LA 2016/ NoSQL track - Architecting Real Life IoT Architecture, ...
Big Data Day LA 2016/ NoSQL track - Architecting Real Life IoT Architecture, ...Data Con LA
 
How To Tell if Your Business Needs NoSQL
How To Tell if Your Business Needs NoSQLHow To Tell if Your Business Needs NoSQL
How To Tell if Your Business Needs NoSQLDataStax
 
Dsc 2021 presentation_radovan_bacovic
Dsc 2021 presentation_radovan_bacovicDsc 2021 presentation_radovan_bacovic
Dsc 2021 presentation_radovan_bacovicRadovan Baćović
 
Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017
Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017
Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017Big Data Spain
 
Building Data Science into Organizations: Field Experience
Building Data Science into Organizations: Field ExperienceBuilding Data Science into Organizations: Field Experience
Building Data Science into Organizations: Field ExperienceDatabricks
 
Brokering Data: Accelerating Data Evaluation with Databricks White Label
Brokering Data: Accelerating Data Evaluation with Databricks White LabelBrokering Data: Accelerating Data Evaluation with Databricks White Label
Brokering Data: Accelerating Data Evaluation with Databricks White LabelDatabricks
 
Designing a Distributed Cloud Database for Dummies
Designing a Distributed Cloud Database for DummiesDesigning a Distributed Cloud Database for Dummies
Designing a Distributed Cloud Database for DummiesDataStax
 
Webinar: 2 Billion Data Points Each Day
Webinar: 2 Billion Data Points Each DayWebinar: 2 Billion Data Points Each Day
Webinar: 2 Billion Data Points Each DayDataStax
 
Executive Briefing: What Is Fast Data And Why Is It Important
Executive Briefing: What Is Fast Data And Why Is It ImportantExecutive Briefing: What Is Fast Data And Why Is It Important
Executive Briefing: What Is Fast Data And Why Is It ImportantLightbend
 
Accelerating Data Science and Machine Learning Workflow with Azure Machine Le...
Accelerating Data Science and Machine Learning Workflow with Azure Machine Le...Accelerating Data Science and Machine Learning Workflow with Azure Machine Le...
Accelerating Data Science and Machine Learning Workflow with Azure Machine Le...Aditya Bhattacharya
 
Big Data for Managers: From hadoop to streaming and beyond
Big Data for Managers: From hadoop to streaming and beyondBig Data for Managers: From hadoop to streaming and beyond
Big Data for Managers: From hadoop to streaming and beyondDataWorks Summit/Hadoop Summit
 
Trivadis - Microsoft Transform your data estate with cloud, data and AI
Trivadis - Microsoft Transform your data estate with cloud, data and AITrivadis - Microsoft Transform your data estate with cloud, data and AI
Trivadis - Microsoft Transform your data estate with cloud, data and AITrivadis
 
Introduction: Architecting for Scale
Introduction: Architecting for ScaleIntroduction: Architecting for Scale
Introduction: Architecting for ScaleDataStax
 
Spark in the Wild: An In-Depth Analysis of 50+ Production Deployments-(Arsala...
Spark in the Wild: An In-Depth Analysis of 50+ Production Deployments-(Arsala...Spark in the Wild: An In-Depth Analysis of 50+ Production Deployments-(Arsala...
Spark in the Wild: An In-Depth Analysis of 50+ Production Deployments-(Arsala...Spark Summit
 
ProdSec: A Technical Approach
ProdSec: A Technical ApproachProdSec: A Technical Approach
ProdSec: A Technical ApproachJeremy Brown
 
Part 2: A Visual Dive into Machine Learning and Deep Learning 

Part 2: A Visual Dive into Machine Learning and Deep Learning 
Part 2: A Visual Dive into Machine Learning and Deep Learning 

Part 2: A Visual Dive into Machine Learning and Deep Learning 
Cloudera, Inc.
 

What's hot (20)

Big Data Day LA 2016/ NoSQL track - Architecting Real Life IoT Architecture, ...
Big Data Day LA 2016/ NoSQL track - Architecting Real Life IoT Architecture, ...Big Data Day LA 2016/ NoSQL track - Architecting Real Life IoT Architecture, ...
Big Data Day LA 2016/ NoSQL track - Architecting Real Life IoT Architecture, ...
 
How To Tell if Your Business Needs NoSQL
How To Tell if Your Business Needs NoSQLHow To Tell if Your Business Needs NoSQL
How To Tell if Your Business Needs NoSQL
 
Dsc 2021 presentation_radovan_bacovic
Dsc 2021 presentation_radovan_bacovicDsc 2021 presentation_radovan_bacovic
Dsc 2021 presentation_radovan_bacovic
 
Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017
Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017
Disaster Recovery for Big Data by Carlos Izquierdo at Big Data Spain 2017
 
Building Data Science into Organizations: Field Experience
Building Data Science into Organizations: Field ExperienceBuilding Data Science into Organizations: Field Experience
Building Data Science into Organizations: Field Experience
 
Brokering Data: Accelerating Data Evaluation with Databricks White Label
Brokering Data: Accelerating Data Evaluation with Databricks White LabelBrokering Data: Accelerating Data Evaluation with Databricks White Label
Brokering Data: Accelerating Data Evaluation with Databricks White Label
 
Big Data Usecases
Big Data UsecasesBig Data Usecases
Big Data Usecases
 
Designing a Distributed Cloud Database for Dummies
Designing a Distributed Cloud Database for DummiesDesigning a Distributed Cloud Database for Dummies
Designing a Distributed Cloud Database for Dummies
 
Webinar: 2 Billion Data Points Each Day
Webinar: 2 Billion Data Points Each DayWebinar: 2 Billion Data Points Each Day
Webinar: 2 Billion Data Points Each Day
 
Executive Briefing: What Is Fast Data And Why Is It Important
Executive Briefing: What Is Fast Data And Why Is It ImportantExecutive Briefing: What Is Fast Data And Why Is It Important
Executive Briefing: What Is Fast Data And Why Is It Important
 
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
 
Accelerating Data Science and Machine Learning Workflow with Azure Machine Le...
Accelerating Data Science and Machine Learning Workflow with Azure Machine Le...Accelerating Data Science and Machine Learning Workflow with Azure Machine Le...
Accelerating Data Science and Machine Learning Workflow with Azure Machine Le...
 
Big Data for Managers: From hadoop to streaming and beyond
Big Data for Managers: From hadoop to streaming and beyondBig Data for Managers: From hadoop to streaming and beyond
Big Data for Managers: From hadoop to streaming and beyond
 
Trivadis - Microsoft Transform your data estate with cloud, data and AI
Trivadis - Microsoft Transform your data estate with cloud, data and AITrivadis - Microsoft Transform your data estate with cloud, data and AI
Trivadis - Microsoft Transform your data estate with cloud, data and AI
 
Introduction: Architecting for Scale
Introduction: Architecting for ScaleIntroduction: Architecting for Scale
Introduction: Architecting for Scale
 
Spark in the Wild: An In-Depth Analysis of 50+ Production Deployments-(Arsala...
Spark in the Wild: An In-Depth Analysis of 50+ Production Deployments-(Arsala...Spark in the Wild: An In-Depth Analysis of 50+ Production Deployments-(Arsala...
Spark in the Wild: An In-Depth Analysis of 50+ Production Deployments-(Arsala...
 
ProdSec: A Technical Approach
ProdSec: A Technical ApproachProdSec: A Technical Approach
ProdSec: A Technical Approach
 
Part 2: A Visual Dive into Machine Learning and Deep Learning 

Part 2: A Visual Dive into Machine Learning and Deep Learning 
Part 2: A Visual Dive into Machine Learning and Deep Learning 

Part 2: A Visual Dive into Machine Learning and Deep Learning 

 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 

Similar to Observability in real time at scale

Orion Network Performance Monitor (NPM) Optimization and Tuning Training
Orion Network Performance Monitor (NPM) Optimization and Tuning TrainingOrion Network Performance Monitor (NPM) Optimization and Tuning Training
Orion Network Performance Monitor (NPM) Optimization and Tuning TrainingSolarWinds
 
Application Security Workshop
Application Security Workshop Application Security Workshop
Application Security Workshop Priyanka Aash
 
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010Bhupesh Bansal
 
Hadoop and Voldemort @ LinkedIn
Hadoop and Voldemort @ LinkedInHadoop and Voldemort @ LinkedIn
Hadoop and Voldemort @ LinkedInHadoop User Group
 
Server Monitoring (Scaling while bootstrapped)
Server Monitoring  (Scaling while bootstrapped)Server Monitoring  (Scaling while bootstrapped)
Server Monitoring (Scaling while bootstrapped)Ajibola Aiyedogbon
 
Scalable Apache for Beginners
Scalable Apache for BeginnersScalable Apache for Beginners
Scalable Apache for Beginnerswebhostingguy
 
Big Data Introduction - Solix empower
Big Data Introduction - Solix empowerBig Data Introduction - Solix empower
Big Data Introduction - Solix empowerDurga Gadiraju
 
Azure + DataStax Enterprise (DSE) Powers Office365 Per User Store
Azure + DataStax Enterprise (DSE) Powers Office365 Per User StoreAzure + DataStax Enterprise (DSE) Powers Office365 Per User Store
Azure + DataStax Enterprise (DSE) Powers Office365 Per User StoreDataStax Academy
 
Łukasz Romaszewski on Internet of Things Raspberry Pi and Java Embedded JavaC...
Łukasz Romaszewski on Internet of Things Raspberry Pi and Java Embedded JavaC...Łukasz Romaszewski on Internet of Things Raspberry Pi and Java Embedded JavaC...
Łukasz Romaszewski on Internet of Things Raspberry Pi and Java Embedded JavaC...Tomek Borek
 
Application Performance Troubleshooting 1x1 - Part 2 - Noch mehr Schweine und...
Application Performance Troubleshooting 1x1 - Part 2 - Noch mehr Schweine und...Application Performance Troubleshooting 1x1 - Part 2 - Noch mehr Schweine und...
Application Performance Troubleshooting 1x1 - Part 2 - Noch mehr Schweine und...rschuppe
 
CMG2013 Workshop: Netflix Cloud Native, Capacity, Performance and Cost Optimi...
CMG2013 Workshop: Netflix Cloud Native, Capacity, Performance and Cost Optimi...CMG2013 Workshop: Netflix Cloud Native, Capacity, Performance and Cost Optimi...
CMG2013 Workshop: Netflix Cloud Native, Capacity, Performance and Cost Optimi...Adrian Cockcroft
 
Super Sizing Youtube with Python
Super Sizing Youtube with PythonSuper Sizing Youtube with Python
Super Sizing Youtube with Pythondidip
 
Gavin M
Gavin MGavin M
Gavin MOntico
 
Handling Data in Mega Scale Systems
Handling Data in Mega Scale SystemsHandling Data in Mega Scale Systems
Handling Data in Mega Scale SystemsDirecti Group
 
Succeding with the Apache SOA stack
Succeding with the Apache SOA stackSucceding with the Apache SOA stack
Succeding with the Apache SOA stackJohan Edstrom
 
Web Speed And Scalability
Web Speed And ScalabilityWeb Speed And Scalability
Web Speed And ScalabilityJason Ragsdale
 
R2DBC Reactive Relational Database Connectivity
R2DBC Reactive Relational Database ConnectivityR2DBC Reactive Relational Database Connectivity
R2DBC Reactive Relational Database ConnectivityMaarten Smeets
 
Akka for big data developers
Akka for big data developersAkka for big data developers
Akka for big data developersTaras Fedorov
 

Similar to Observability in real time at scale (20)

Orion Network Performance Monitor (NPM) Optimization and Tuning Training
Orion Network Performance Monitor (NPM) Optimization and Tuning TrainingOrion Network Performance Monitor (NPM) Optimization and Tuning Training
Orion Network Performance Monitor (NPM) Optimization and Tuning Training
 
Application Security Workshop
Application Security Workshop Application Security Workshop
Application Security Workshop
 
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
 
Hadoop and Voldemort @ LinkedIn
Hadoop and Voldemort @ LinkedInHadoop and Voldemort @ LinkedIn
Hadoop and Voldemort @ LinkedIn
 
Server Monitoring (Scaling while bootstrapped)
Server Monitoring  (Scaling while bootstrapped)Server Monitoring  (Scaling while bootstrapped)
Server Monitoring (Scaling while bootstrapped)
 
Scalable Apache for Beginners
Scalable Apache for BeginnersScalable Apache for Beginners
Scalable Apache for Beginners
 
Big Data Introduction - Solix empower
Big Data Introduction - Solix empowerBig Data Introduction - Solix empower
Big Data Introduction - Solix empower
 
Azure + DataStax Enterprise (DSE) Powers Office365 Per User Store
Azure + DataStax Enterprise (DSE) Powers Office365 Per User StoreAzure + DataStax Enterprise (DSE) Powers Office365 Per User Store
Azure + DataStax Enterprise (DSE) Powers Office365 Per User Store
 
Perf 101-lab-slideshare
Perf 101-lab-slidesharePerf 101-lab-slideshare
Perf 101-lab-slideshare
 
Łukasz Romaszewski on Internet of Things Raspberry Pi and Java Embedded JavaC...
Łukasz Romaszewski on Internet of Things Raspberry Pi and Java Embedded JavaC...Łukasz Romaszewski on Internet of Things Raspberry Pi and Java Embedded JavaC...
Łukasz Romaszewski on Internet of Things Raspberry Pi and Java Embedded JavaC...
 
Application Performance Troubleshooting 1x1 - Part 2 - Noch mehr Schweine und...
Application Performance Troubleshooting 1x1 - Part 2 - Noch mehr Schweine und...Application Performance Troubleshooting 1x1 - Part 2 - Noch mehr Schweine und...
Application Performance Troubleshooting 1x1 - Part 2 - Noch mehr Schweine und...
 
CMG2013 Workshop: Netflix Cloud Native, Capacity, Performance and Cost Optimi...
CMG2013 Workshop: Netflix Cloud Native, Capacity, Performance and Cost Optimi...CMG2013 Workshop: Netflix Cloud Native, Capacity, Performance and Cost Optimi...
CMG2013 Workshop: Netflix Cloud Native, Capacity, Performance and Cost Optimi...
 
Os Solomon
Os SolomonOs Solomon
Os Solomon
 
Super Sizing Youtube with Python
Super Sizing Youtube with PythonSuper Sizing Youtube with Python
Super Sizing Youtube with Python
 
Gavin M
Gavin MGavin M
Gavin M
 
Handling Data in Mega Scale Systems
Handling Data in Mega Scale SystemsHandling Data in Mega Scale Systems
Handling Data in Mega Scale Systems
 
Succeding with the Apache SOA stack
Succeding with the Apache SOA stackSucceding with the Apache SOA stack
Succeding with the Apache SOA stack
 
Web Speed And Scalability
Web Speed And ScalabilityWeb Speed And Scalability
Web Speed And Scalability
 
R2DBC Reactive Relational Database Connectivity
R2DBC Reactive Relational Database ConnectivityR2DBC Reactive Relational Database Connectivity
R2DBC Reactive Relational Database Connectivity
 
Akka for big data developers
Akka for big data developersAkka for big data developers
Akka for big data developers
 

Recently uploaded

办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 

Recently uploaded (20)

办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 

Observability in real time at scale

  • 1. 1 OBSERVABILITYand Intelligence In Real time, at Scale -By Balvinder Khurana & Sarang Shinde
  • 2. 2 Agenda 1. What is Observability 2. How it is different from Monitoring 3. Why do we need Observability 4. A typical observability pipeline 5. How observability enables intelligence 6. Comparison of tools 7. Questions
  • 3. 3 Discovering the information about System and User Behaviour that leads to Customer and Business Impact.
  • 4. 4 Monitoring What data does our system has 1. Facts Things we are aware of and understand. E.g . We are running spark jobs on ephemeral clusters. 1. Hypothesis Things we are aware of but don’t understand E.g. The VMs were preempted, causing user requests to fail. 1. Assumptions Things we understand but are not aware of. E.g. Data flow job will be able to handle increasing load automatically 1. Discoveries Things we are neither aware of nor understand. E.g. User leaving from particular page of website too early this is happening because the microservice pod serving that page restarts many times due to jvm heap error and memory limit issue. Observability
  • 5. 5 Explosion of Data Sources Explosion of Requirements
  • 6. 6 Too many tools capture vital information
  • 7. 7 Typical sources, Insights and Roles Store Data Point KPI derived (eg.) Personas DataDog Infrastructure Metrics -Deployment status -Request/Response time* (micrometer integration) -Downtime for Service/Kafka -Load on a service /Load balancer information -Users affected due to Infra issues -Developers -Client Technical Team -Product Owners Kafka Application Events -Number of User -Orders Placed -Orders returned -Revenue Generated -Price Sensitivity -CXOs -Data Scientists -Analysts -Product Owners EFK Application Exceptions DB Exceptions UI EXceptions -Applications affected due to System issues -Causes of exceptions -Developers -Client Technical Team -Operations -Security Champions Istio Network Rules/Service Mesh -Routing -Service Availability -Service traceability -Developers -Security Team GTM Click Stream -Customer Behavior -Issues(UI) faced by Customer -Device information -Developers -Product Owners -Data Scientists Jenkins Value stream -Path to Production -Developers
  • 8. 8 Data Sources VMs Elastic New Relic Prometheus Data dog Mongo RDBMS S3 Istio GTM Omniture Observability Pipeline Data Sinks HDFS S3 GCS OLAPs
  • 9. 9
  • 11. 11
  • 12. 12 Streaming Technologies Spark Kafka Flink Processing Model Micro Batch One Record at a time One Record at a time Deployment Own cluster, supports YARN, Mesos, or containers Library that any Java application can embed. Own cluster, supports YARN, Mesos, or containers Life Cycle Stream processing code is deployed and run as a job in the Spark cluster Stream processing code runs inside their application Stream processing code is deployed and run as a job in the Flink cluster Typically Owned By Data infrastructure or BI team BI queries Data infrastructure or BI team Coordination Yes No Yes Source of continuous data Kafka, File Systems, other message queues Strictly Kafka , Other data out of Kafka is a problem Kafka, File Systems, other message queues Bounded and Unbounded Data Streams Avro, Parquet, JSON, CSV, ORC Text, SequenceFile, RCFile, ORC, Parquet Avro, Parquet, JSON, CSV Semantics Exactly-once end-to-end with specific Source and Sink Exactly-once end-to-end with Kafka Exactly-once end-to-end with specific Source and Sink
  • 13. 13 Querying Tools SparkSQL Presto Drill Druid Can query petabytes of Data Yes Yes Yes No Used for Complex math, statistics, ML intensive tasks BI queries BI queries BI and real time analytics on event driven data Fault Tolerance Yes No Yes Yes In memory processing Yes Yes Yes No Processing speed Slower than Presto and Drill Faster than SparkSQL Faster than SparkSQL Faster for specific type of Queries than Spark,Preso and Drill. File formats Avro, Parquet, JSON, CSV, ORC Text, SequenceFile, RCFile, ORC, Parquet Avro, Parquet, JSON, CSV Avro, Parquet, JSON, CSV Schema-free querying support Yes No Yes No Supports ANSI SQL Yes Yes Yes Subset JDBC / ODBC Support Yes Yes Yes Yes Performance benefits Catalyst and Tungsten Vectorized columnar processing Columnar execution and Vector Processing Columnar time based segments, bitmap indexing
  • 14. 14 Reporting Tools Tableau Looker Apache Superset (Incubating) Pentaho Metabase Visualizations Drag and drop, SQL Drag and Drop Spark, SQL, basic drag and drop Drag and Drop Drag and drop, SQL Intuitiveness and Usability Intuitive, Interactive & Easy to Use Easy to Use Intuitive, Interactive & Easy to Use Comparatively less ease of use Easy to Use Databases Supported Natively supports all well-known databases Most of well-known databases Fewer databases (Druid & DBs supporting SQL Alchemy) Fewer databases (JDBC Compliant DBs, MongoDB) Most of well-known databases Security and access control Kerberos, SSPI, SAML, OpenID, Active Directory, LDAP, Local etc. Google OAuth, LDAP, SAML, OpenID Flask AppBuilder (FAB) Pentaho Security, LDAP, Single Sign-On, Active Directory, Kerberos Google OAuth, LDAP Self Service Visualization Yes Yes Yes Yes Yes Pricing $245 per month $3,000 – $5,000 per month Free Subscription-based pricing models Free Data Science and ML Support Predictive Analysis Advance analytics Predictive Analysis Predictive Analysis Analytical Other Better support for Advanced analytics and corresponding data visualisation No support for OLAP Advanced analytics is not as mature as Tableau Easy setup and usability

Editor's Notes

  1. Hypothesis: A supposition or proposed explanation made on the basis of limited evidence as a starting point for further investigation.
  2. Replication of same for microservices Also serverless or functions as a service Tight coupling to operations What happens when we want to change, how big it is to change tools, how easy it is to experiment with new ones
  3. Decouple the data producers from consumer. (infrastructure from operational systems) Host centric model to service centric model. Pattern that can evolve while still having wins on the way Maps to serverless architecture Empowers teams in siloed organizations
  4. What is Raw layer, its usage and advantages. Building Raw layer with streaming source data introduces challenges such as too many small files. It puts burden on hdfs and hadoop ecosystem and makes difficult to manage. Reducing small files at Raw and other layers can be done using compaction jobs. Delta lake is open sourced by databricks it has build table like abstraction with ACID,versioning and time travel support on file system. Easy to use just change file format. Support hdfs,s3 as well. Lambda or Kappa - Lambda is Speed + Batch layer. Kappa only thinks in terms of Streaming layer with claim of Batch is special case of streaming. Lambda got criticism because maintaining different code base at batch and stream was difficult and this is because during that time different technologies were used for batch and streaming. Now with advantant of apache beam, spark and flink you can write unified pipelines and with clean code practices you can make it less maintainable. Using kappa architecture and trying to fit every problem into streams has it own consequences such as you need to think on checkpointing, reprocessing partial data and it also create hurdles in upgrading your applications quickly. If you are starting new and don’t hold any prior streaming experience choose Lambda over Kappa and slowly used mix approach of both. Lambda architecture is an approach to big data management that provides access to batch processing and near real-time processing with a hybrid approach. The basic architecture of Lambda has three layers: Batch, speed and serving. The batch layer, which typically makes use of Hadoop, is the location where all the data is stored. MapReduce runs regular batch processing on the totality of this data. This information is sent to a data store and is used to gain insights into historic data trends. Alongside this slower layer, new data is captured and processed as it comes in. The speed layer provides business users with the ability to adjust decision making and respond quickly to rapidly emerging trends. Data that passes into this real-time layer is also copied into the larger data set for slower, batch processing. Once the real-time processing is complete, the data is cleared from the speed layer to clear the way for more incoming data. The real-time layer can operate efficiently even with a steady stream of complex data because it only has to handle the volume of data that comes in between rounds of batch processing. The speed and batch layers are merged together for querying through the serving layer which features a massively parallel processing query engine. Having access to this combined data set helps ensure that accurate reporting is available at all times with low latency. Standard Specs - As this platform will help multiple teams unification of structure of events and formats is important.
  5. Explain lambda architecture. What are possible tools/technologies in each layer.
  6. Scalable, fault-tolerant stream to handle the petabytes of data generated.
  7. benefit of using data visualization tools is it enable business people, data scientists and subject matter experts can participate in the application development process. Check skill and level of business users, their needs, also what amount and kind of data you often process keep these things in mind when choosing right tool.
  8. Last thought is build this Incrementally one step at time no need to add complexity from beginning. For small and limited number of applications in enterprise use traditional warehousing and reporting tools they serve well. While incrementally building it you can build it for one small microservice/bounded context at a time. Also if you are building with existing on prem infra then need coordination with other teams to do upfront capacity planning. In cloud you can take advantage of isolation and serverless infra to build and get started with it.