SlideShare a Scribd company logo
1 of 26
Download to read offline
eBay INAR -
“Intelligent
Architecture”
Hanzhang Wang, Sanjeev Katariya
Nov 15, 2019
© 2019 eBay. All rights reserved.
Agenda
eBay INAR Vision and Challenges
INAR Blueprint and Progress
INAR Example: Grano – Cross Level Anomaly Detection
Existing and Future Collaborations
© 2018 eBay. All rights reserved.3
Enables the rapid, frequent and reliable delivery of large, complex applications.
Highly maintainable and testable
Loosely coupled
Independently deployable
Organized around business capabilities
Owned by a small team
Micro-Service Architecture
© 2018 eBay. All rights reserved.4
Enables the rapid, frequent and reliable delivery of large, complex applications.
Highly maintainable and testable
Loosely coupled
Independently deployable
Organized around business capabilities
Owned by a small team
Everything seems perfect, until...
Micro-Service Architecture
Erhh….
Hi, can I change
this …?
eBay INAR Vision
“To build sustainable service architecture by providing
automated visibility, assessment, and governance
Intelligence.”
© 2018 eBay. All rights reserved.6
Open Challenges for Large-Scale Service Ecosystem
Blindness
Limited observability on
architectural
knowledge or issues
Primitiveness
Missing diagnostic, engineering
and run-time automation
Ignorance
Lack of measurability
for service architecture,
or technical debts
© 2018 eBay. All rights reserved.7
Limited observability on architectural knowledge or issues
Cannot see…
Architectural antipattern problems
Incorrect / Inappropriate dependencies
Low tier services dependent by the critical/secure services
Detection of invalid cross domain endpoint cyclic dependencies
Architectural evolution corruption problems which cause low engineering
productivity and poor sustainability
Popular software and services evolve frequently and prone to become monolithic
Redundant services and duplicated functionalities
No recommendation to support architecture evolution
Easy access to the cross-domain data
Root cause visibility on cross-domain incidents
The “Blindness”
© 2018 eBay. All rights reserved.8
Lack of measurability for service architecture quality, or technical debts
Cannot know…
Metrics Suite
Lack of service’s importance and impact metrics
Lack of service quality (e.g., reliability, maintainability) metrics
Automation to generate team / organization product dependencies and symbiosis
System/Ecosystem visualization enhanced/powered by metrics
Estimation of technical debts
The “Ignorance”
© 2018 eBay. All rights reserved.9
Missing real time algorithm-driven automation for diagnostic-ability or productivity
Cannot have…
Algorithmic/Artificial Intelligent IT operations or AI for IT Operations
Anomaly detection for operations
Antipattern detection for evolution
Automated mining software code repository solutions
Lack of automated solution to discover code-level knowledge or to support quality analysis.
Governance in the ecosystem
Service ecosystem evolution governance
Optimization and architectural decision support (e.g., legacy code migration)
The “Primitiveness”
Blindness
ENHANCE DEVOPS
Enhance code intelligence
for quality assurance and
engineering productivity
PROVIDE VISIBILITY
Improve productivity and
understandability of
service ecosystem through
knowledge graph
Ignorance
METRIC SUITE
Quantitatively measure
service behavior and
metrics. “If you can’t
measure it, you can’t
improve it”
PATTERN/ANOMALY
DETECTION
Automate service laying,
service ranking, service
classification; enforce
rule-based
Primitiveness
AIOPs
Enable observability,
unified analysis and
proactive incident/root
cause detection and
remediation
SYMBIOSIS & DISCIPLINE
Automated governance over service ecosystem for evolution
The eBay INAR Blueprint
{
}
© 2018 eBay. All rights reserved.11
Bring DevOPs Into Connected Graph Visibility
Action
Bring in data into graph to connect,
understand, process and visualize
Result: Graph-based Visibility
We can now see by connecting
cloud-native data, hardware,
people, code and business
What kind of data?
● WIRI (What it really is) and WISB
(What it should be)
● Large-scale data within/across
domains
● Analysis (Pattern, Dynamics and
Behavior)
● Recommendation (Improvement)
{
}
© 2018 eBay. All rights reserved.12
Knowledge Graph (Heterogeneous Graph)
Knowledge Graph:
1. Represents knowledge domain.
2. Connects things of different types in a systematic way.
3. Encode knowledge arranged in a network of nodes and edges rather
than tables of rows and columns.
Why graph here?
1. Native method to understand the “Unknown”
2. White box and Visualization
3. The platform and support is ready (e.g. Data, Graph DB, Computation
Power)
4. “Get ready for AI” or “translator of AI” analysis (Pattern)
5. Recommendation/Refactoring
© 2018 eBay. All rights reserved.13
An Initial Design..
© 2018 eBay. All rights reserved.14
A “Baby” Step..
© 2018 eBay. All rights reserved.15
A “Baby” Step...
© 2018 eBay. All rights reserved.16
Build Metrics Suite and Understand by AI
Action
Build Metrics on previous
connected graph, to develop and
understand patterns and
anti-patterns in raw data and metrics
Result: Intelligence Data
We transform industrial SE data which
is often noisy, low-quality, and large into
clean / intelligence data
© 2018 eBay. All rights reserved.17
Metrics (Data-driven Insights)
Structural Metrics (by dependency)
Traffic Metrics (by traffic)
Mid-Activity (13%)
High-Activity Pools
(6%)
Mid-Activity (8%)Low-Activity (72%)
Essential
Consumption
Inbound
Services
Inbound
Endpoints
Traffic Vol.
© 2018 eBay. All rights reserved.18
Metrics (Data-driven Insights)
Essential
Consumption
Automated classification of service layering :
1. Back-End Pools
2. Middle-Tier Pools
3. Customer-Facing Pools (Experience, Public APIs)
Service architecture refactoring support:
1. Classifying external-facing pools to support build “large
team” APIs
2. Impact and importance
© 2018 eBay. All rights reserved.19
Our current progress
Enable AIOPs With SEI Applications
Dependency system
Pre/post release
quality analysis
Graph-based distributed
system anomaly detection
Capacity forecasting
and planning
Service popularity metrics
and layering
recommendation
Invalid service
dependency detection
Action
Leveraging
graph-based visibility
and intelligence data to
build applications to
support site operation
and engineering
productivity
Result: Symbiosis
Automated discipline
and governance among
software engineering
services
© 2018 eBay. All rights reserved.20
Grano Research Motivation Cross Level Anomaly Detection
Univariate Anomaly by Detections Models
(High False-Positive and Busy)
Visibility
Multi-Variate Anomaly Detection
(Black-Box with Bad Interpretability)
ML Models for Anomaly Detection for the components
Ignorance
Topology Example of eBay distributed data
platform
© 2018 eBay. All rights reserved.21
Solution: Grano ML +Graph
Project health signals should be in-line with the lines
ML - First level anomaly detection for
components of distributed data platform.
Consists of three different types of detection
models to handle different anomalies
(clustering-based model, forecast-based
mode, statistical model)
Visibility Ignorance
Graph – Constructing the anomaly graph with different (health) signals
Scoring the significance
of every signal based on
its distribution, and
history frequency
Scoring the system
components based
on their signals
Propagating the
scores through the
topology to enhance
the detection
accuracy
© 2018 eBay. All rights reserved.22
Solution: Grano System
GRANO is an end-to-end graph-based
anomaly detection and root cause analysis
system for distributed cloud-native data
platform
Anomaly Detection
Time series anomaly detection on metrics
Adaptive Anomaly Graph
Adaptive analysis on system topology and all
signals to identify the root cause
Applications
Alerting, interactive Graph-UI, and integration
with monitoring systems
© 2018 eBay. All rights reserved.23
GRANO: Interactive Graph-based Root Cause Analysis for
Cloud-Native Distributed Data Platform link
Hanzhang Wang; Phuong Nguyen; Jun li, Selcuk Kopru, Gene Zhang,
Sanjeev Katariya, Sami Ben-Romdhane
eBay Inc.
E2E Solution: Grano Applications (In Production)
Intelligent anomaly graph view Automated Root cause analysis
Component-based view
Automated RCA graph
High-level comprehensive monitoring
Low-level RCA
© 2018 eBay. All rights reserved.24
University Topics Area
Fudan
University
Industry microservice study
Distributed tracing flow analysis
Drexel
University
Service quality metrics suite
University of
Michigan
Intelligent Software Engineering
Existing INAR Collaborations
eBay SEI Vision
“To build sustainable service architecture by providing automated visibility,
assessment, and governance Intelligence.”
Challenge Blindness Ignorance Primitiveness
Progress
DevOPs, quality dashboards,
knowledge graph and etc.
Service popularity and behavior
metrics, pre/post release quality
analysis and etc.
Anomaly Detection, RCA and etc.
Future work
Intelligent visibility for developers Prioritize and quantify technical debt
using service metrics
Auto remediation
Collaboration
Service dependency analysis and
governance
Extend service metrics suite Critical flow and anomaly detection
on distributed tracing graph
Please Contact: hanzwang@ebay.com
Request for Future INAR Collaboration
eBay Intelligent Architecture

More Related Content

What's hot

Predictive Analytics Using R | Edureka
Predictive Analytics Using R | EdurekaPredictive Analytics Using R | Edureka
Predictive Analytics Using R | EdurekaEdureka!
 
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig..."How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...Data Science Milan
 
Putting together AI pipelines with Acumos
Putting together AI pipelines with AcumosPutting together AI pipelines with Acumos
Putting together AI pipelines with AcumosPantelis Monogioudis
 
Machine Learning in the Real World
Machine Learning in the Real WorldMachine Learning in the Real World
Machine Learning in the Real WorldSrinath Perera
 
Tuning 2.0: Advanced Optimization Techniques Webinar
Tuning 2.0: Advanced Optimization Techniques WebinarTuning 2.0: Advanced Optimization Techniques Webinar
Tuning 2.0: Advanced Optimization Techniques WebinarSigOpt
 
How to design ai functions to the cloud native infra
How to design ai functions to the cloud native infraHow to design ai functions to the cloud native infra
How to design ai functions to the cloud native infraChun Myung Kyu
 
Predictive Analytics - Big Data & Artificial Intelligence
Predictive Analytics - Big Data & Artificial IntelligencePredictive Analytics - Big Data & Artificial Intelligence
Predictive Analytics - Big Data & Artificial IntelligenceManish Jain
 
GITEX Big Data Conference 2014 – SAP Presentation
GITEX Big Data Conference 2014 – SAP PresentationGITEX Big Data Conference 2014 – SAP Presentation
GITEX Big Data Conference 2014 – SAP PresentationPedro Pereira
 
Machinel Learning with spark
Machinel Learning with spark Machinel Learning with spark
Machinel Learning with spark Ons Dridi
 
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...TigerGraph
 
A practical look at how to build & run IoT business logic
A practical look at how to build & run IoT business logicA practical look at how to build & run IoT business logic
A practical look at how to build & run IoT business logicVeselin Pizurica
 
Graph-Based Identity Resolution at Scale
Graph-Based Identity Resolution at ScaleGraph-Based Identity Resolution at Scale
Graph-Based Identity Resolution at ScaleTigerGraph
 
Scalability and Autonomous Analytics
Scalability and Autonomous AnalyticsScalability and Autonomous Analytics
Scalability and Autonomous AnalyticsInspirient
 
MLSEV. Use Case: Smart Energy Management
MLSEV. Use Case: Smart Energy ManagementMLSEV. Use Case: Smart Energy Management
MLSEV. Use Case: Smart Energy ManagementBigML, Inc
 
Sparklyr: Big Data enabler for R users
Sparklyr: Big Data enabler for R usersSparklyr: Big Data enabler for R users
Sparklyr: Big Data enabler for R usersICTeam S.p.A.
 

What's hot (20)

Predictive Analytics Using R | Edureka
Predictive Analytics Using R | EdurekaPredictive Analytics Using R | Edureka
Predictive Analytics Using R | Edureka
 
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig..."How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...
 
Putting together AI pipelines with Acumos
Putting together AI pipelines with AcumosPutting together AI pipelines with Acumos
Putting together AI pipelines with Acumos
 
Real-Time Machine Learning at Industrial scale (University of Oxford, 9th Oct...
Real-Time Machine Learning at Industrial scale (University of Oxford, 9th Oct...Real-Time Machine Learning at Industrial scale (University of Oxford, 9th Oct...
Real-Time Machine Learning at Industrial scale (University of Oxford, 9th Oct...
 
Building a Data Analytics PaaS for Smart Cities
Building a Data Analytics PaaS for Smart CitiesBuilding a Data Analytics PaaS for Smart Cities
Building a Data Analytics PaaS for Smart Cities
 
Machine Learning in the Real World
Machine Learning in the Real WorldMachine Learning in the Real World
Machine Learning in the Real World
 
Tuning 2.0: Advanced Optimization Techniques Webinar
Tuning 2.0: Advanced Optimization Techniques WebinarTuning 2.0: Advanced Optimization Techniques Webinar
Tuning 2.0: Advanced Optimization Techniques Webinar
 
How to design ai functions to the cloud native infra
How to design ai functions to the cloud native infraHow to design ai functions to the cloud native infra
How to design ai functions to the cloud native infra
 
Predictive Analytics - Big Data & Artificial Intelligence
Predictive Analytics - Big Data & Artificial IntelligencePredictive Analytics - Big Data & Artificial Intelligence
Predictive Analytics - Big Data & Artificial Intelligence
 
GITEX Big Data Conference 2014 – SAP Presentation
GITEX Big Data Conference 2014 – SAP PresentationGITEX Big Data Conference 2014 – SAP Presentation
GITEX Big Data Conference 2014 – SAP Presentation
 
Resume_Vignesh_ThulasiDass
Resume_Vignesh_ThulasiDass Resume_Vignesh_ThulasiDass
Resume_Vignesh_ThulasiDass
 
Machinel Learning with spark
Machinel Learning with spark Machinel Learning with spark
Machinel Learning with spark
 
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
 
A practical look at how to build & run IoT business logic
A practical look at how to build & run IoT business logicA practical look at how to build & run IoT business logic
A practical look at how to build & run IoT business logic
 
Graph-Based Identity Resolution at Scale
Graph-Based Identity Resolution at ScaleGraph-Based Identity Resolution at Scale
Graph-Based Identity Resolution at Scale
 
Scalability and Autonomous Analytics
Scalability and Autonomous AnalyticsScalability and Autonomous Analytics
Scalability and Autonomous Analytics
 
MLSEV. Use Case: Smart Energy Management
MLSEV. Use Case: Smart Energy ManagementMLSEV. Use Case: Smart Energy Management
MLSEV. Use Case: Smart Energy Management
 
Meetup Spark UDF performance
Meetup Spark UDF performanceMeetup Spark UDF performance
Meetup Spark UDF performance
 
Sparklyr: Big Data enabler for R users
Sparklyr: Big Data enabler for R usersSparklyr: Big Data enabler for R users
Sparklyr: Big Data enabler for R users
 
KBC Proven Application of Digital Twin
KBC Proven Application of Digital TwinKBC Proven Application of Digital Twin
KBC Proven Application of Digital Twin
 

Similar to eBay Intelligent Architecture

Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-indiaArtificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-indiaCapgemini
 
IoT Update | Hoe implementeer je IoT Schaalbaar in je IT landschap
IoT Update | Hoe implementeer je IoT Schaalbaar in je IT landschapIoT Update | Hoe implementeer je IoT Schaalbaar in je IT landschap
IoT Update | Hoe implementeer je IoT Schaalbaar in je IT landschapIoT Academy
 
AWS O&G Day - Ambyint and AWS
AWS O&G Day - Ambyint and AWSAWS O&G Day - Ambyint and AWS
AWS O&G Day - Ambyint and AWSAWS Summits
 
AI now ron tolido, capgemini cwin18_toulouse
AI now ron tolido, capgemini cwin18_toulouseAI now ron tolido, capgemini cwin18_toulouse
AI now ron tolido, capgemini cwin18_toulouseCapgemini
 
Introducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksIntroducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksAmazon Web Services
 
AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018
AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018
AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018Amazon Web Services
 
AIoT: AI Meets IoT (IOT204) - AWS re:Invent 2018
AIoT: AI Meets IoT (IOT204) - AWS re:Invent 2018AIoT: AI Meets IoT (IOT204) - AWS re:Invent 2018
AIoT: AI Meets IoT (IOT204) - AWS re:Invent 2018Amazon Web Services
 
Smarter Event-Driven Edge with Amazon SageMaker & Project Flogo (AIM204-S) - ...
Smarter Event-Driven Edge with Amazon SageMaker & Project Flogo (AIM204-S) - ...Smarter Event-Driven Edge with Amazon SageMaker & Project Flogo (AIM204-S) - ...
Smarter Event-Driven Edge with Amazon SageMaker & Project Flogo (AIM204-S) - ...Amazon Web Services
 
Operationalizing Your Analysis with AWS IoT Analytics (IOT358-R1) - AWS re:In...
Operationalizing Your Analysis with AWS IoT Analytics (IOT358-R1) - AWS re:In...Operationalizing Your Analysis with AWS IoT Analytics (IOT358-R1) - AWS re:In...
Operationalizing Your Analysis with AWS IoT Analytics (IOT358-R1) - AWS re:In...Amazon Web Services
 
How to Move from Monitoring to Observability, On-Premises and in a Multi-Clou...
How to Move from Monitoring to Observability, On-Premises and in a Multi-Clou...How to Move from Monitoring to Observability, On-Premises and in a Multi-Clou...
How to Move from Monitoring to Observability, On-Premises and in a Multi-Clou...Splunk
 
OpsRamp Platform Winter 2020 Release
OpsRamp Platform Winter 2020 ReleaseOpsRamp Platform Winter 2020 Release
OpsRamp Platform Winter 2020 ReleaseOpsRamp
 
Bitrock manufacturing
Bitrock manufacturing Bitrock manufacturing
Bitrock manufacturing cosma_r
 
AWS Live Panel Discussion: Simplify Large-Scale IoT Device Management
 AWS Live Panel Discussion: Simplify Large-Scale IoT Device Management AWS Live Panel Discussion: Simplify Large-Scale IoT Device Management
AWS Live Panel Discussion: Simplify Large-Scale IoT Device ManagementAmazon Web Services
 
apidays Helsinki & North 2023 - API Security in the era of Generative AI, Mat...
apidays Helsinki & North 2023 - API Security in the era of Generative AI, Mat...apidays Helsinki & North 2023 - API Security in the era of Generative AI, Mat...
apidays Helsinki & North 2023 - API Security in the era of Generative AI, Mat...apidays
 
Revolutionising Testing with the Power of AI - Deepa Mamtani, Pillay Almira &...
Revolutionising Testing with the Power of AI - Deepa Mamtani, Pillay Almira &...Revolutionising Testing with the Power of AI - Deepa Mamtani, Pillay Almira &...
Revolutionising Testing with the Power of AI - Deepa Mamtani, Pillay Almira &...Sogeti Nederland B.V.
 
Real-World AI and Deep Learning for Enterprise with Case Studies
Real-World AI and Deep Learning for Enterprise with Case StudiesReal-World AI and Deep Learning for Enterprise with Case Studies
Real-World AI and Deep Learning for Enterprise with Case StudiesAmazon Web Services
 
Motadata - Unified Product Suite for IT Operations and Big Data Analytics
Motadata - Unified Product Suite for IT Operations and Big Data AnalyticsMotadata - Unified Product Suite for IT Operations and Big Data Analytics
Motadata - Unified Product Suite for IT Operations and Big Data Analyticsnovsela
 

Similar to eBay Intelligent Architecture (20)

Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-indiaArtificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
 
IoT Update | Hoe implementeer je IoT Schaalbaar in je IT landschap
IoT Update | Hoe implementeer je IoT Schaalbaar in je IT landschapIoT Update | Hoe implementeer je IoT Schaalbaar in je IT landschap
IoT Update | Hoe implementeer je IoT Schaalbaar in je IT landschap
 
AWS O&G Day - Ambyint and AWS
AWS O&G Day - Ambyint and AWSAWS O&G Day - Ambyint and AWS
AWS O&G Day - Ambyint and AWS
 
AI now ron tolido, capgemini cwin18_toulouse
AI now ron tolido, capgemini cwin18_toulouseAI now ron tolido, capgemini cwin18_toulouse
AI now ron tolido, capgemini cwin18_toulouse
 
Machine Data Analytics
Machine Data AnalyticsMachine Data Analytics
Machine Data Analytics
 
Introducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksIntroducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech Talks
 
AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018
AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018
AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018
 
Motadata product itsm overview
Motadata product itsm overviewMotadata product itsm overview
Motadata product itsm overview
 
AIoT: AI Meets IoT (IOT204) - AWS re:Invent 2018
AIoT: AI Meets IoT (IOT204) - AWS re:Invent 2018AIoT: AI Meets IoT (IOT204) - AWS re:Invent 2018
AIoT: AI Meets IoT (IOT204) - AWS re:Invent 2018
 
Smarter Event-Driven Edge with Amazon SageMaker & Project Flogo (AIM204-S) - ...
Smarter Event-Driven Edge with Amazon SageMaker & Project Flogo (AIM204-S) - ...Smarter Event-Driven Edge with Amazon SageMaker & Project Flogo (AIM204-S) - ...
Smarter Event-Driven Edge with Amazon SageMaker & Project Flogo (AIM204-S) - ...
 
Operationalizing Your Analysis with AWS IoT Analytics (IOT358-R1) - AWS re:In...
Operationalizing Your Analysis with AWS IoT Analytics (IOT358-R1) - AWS re:In...Operationalizing Your Analysis with AWS IoT Analytics (IOT358-R1) - AWS re:In...
Operationalizing Your Analysis with AWS IoT Analytics (IOT358-R1) - AWS re:In...
 
How to Move from Monitoring to Observability, On-Premises and in a Multi-Clou...
How to Move from Monitoring to Observability, On-Premises and in a Multi-Clou...How to Move from Monitoring to Observability, On-Premises and in a Multi-Clou...
How to Move from Monitoring to Observability, On-Premises and in a Multi-Clou...
 
The future of AIOps
The future of AIOpsThe future of AIOps
The future of AIOps
 
OpsRamp Platform Winter 2020 Release
OpsRamp Platform Winter 2020 ReleaseOpsRamp Platform Winter 2020 Release
OpsRamp Platform Winter 2020 Release
 
Bitrock manufacturing
Bitrock manufacturing Bitrock manufacturing
Bitrock manufacturing
 
AWS Live Panel Discussion: Simplify Large-Scale IoT Device Management
 AWS Live Panel Discussion: Simplify Large-Scale IoT Device Management AWS Live Panel Discussion: Simplify Large-Scale IoT Device Management
AWS Live Panel Discussion: Simplify Large-Scale IoT Device Management
 
apidays Helsinki & North 2023 - API Security in the era of Generative AI, Mat...
apidays Helsinki & North 2023 - API Security in the era of Generative AI, Mat...apidays Helsinki & North 2023 - API Security in the era of Generative AI, Mat...
apidays Helsinki & North 2023 - API Security in the era of Generative AI, Mat...
 
Revolutionising Testing with the Power of AI - Deepa Mamtani, Pillay Almira &...
Revolutionising Testing with the Power of AI - Deepa Mamtani, Pillay Almira &...Revolutionising Testing with the Power of AI - Deepa Mamtani, Pillay Almira &...
Revolutionising Testing with the Power of AI - Deepa Mamtani, Pillay Almira &...
 
Real-World AI and Deep Learning for Enterprise with Case Studies
Real-World AI and Deep Learning for Enterprise with Case StudiesReal-World AI and Deep Learning for Enterprise with Case Studies
Real-World AI and Deep Learning for Enterprise with Case Studies
 
Motadata - Unified Product Suite for IT Operations and Big Data Analytics
Motadata - Unified Product Suite for IT Operations and Big Data AnalyticsMotadata - Unified Product Suite for IT Operations and Big Data Analytics
Motadata - Unified Product Suite for IT Operations and Big Data Analytics
 

Recently uploaded

(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendArshad QA
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about usDynamic Netsoft
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 

Recently uploaded (20)

(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and Backend
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about us
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 

eBay Intelligent Architecture

  • 1. eBay INAR - “Intelligent Architecture” Hanzhang Wang, Sanjeev Katariya Nov 15, 2019 © 2019 eBay. All rights reserved.
  • 2. Agenda eBay INAR Vision and Challenges INAR Blueprint and Progress INAR Example: Grano – Cross Level Anomaly Detection Existing and Future Collaborations
  • 3. © 2018 eBay. All rights reserved.3 Enables the rapid, frequent and reliable delivery of large, complex applications. Highly maintainable and testable Loosely coupled Independently deployable Organized around business capabilities Owned by a small team Micro-Service Architecture
  • 4. © 2018 eBay. All rights reserved.4 Enables the rapid, frequent and reliable delivery of large, complex applications. Highly maintainable and testable Loosely coupled Independently deployable Organized around business capabilities Owned by a small team Everything seems perfect, until... Micro-Service Architecture Erhh…. Hi, can I change this …?
  • 5. eBay INAR Vision “To build sustainable service architecture by providing automated visibility, assessment, and governance Intelligence.”
  • 6. © 2018 eBay. All rights reserved.6 Open Challenges for Large-Scale Service Ecosystem Blindness Limited observability on architectural knowledge or issues Primitiveness Missing diagnostic, engineering and run-time automation Ignorance Lack of measurability for service architecture, or technical debts
  • 7. © 2018 eBay. All rights reserved.7 Limited observability on architectural knowledge or issues Cannot see… Architectural antipattern problems Incorrect / Inappropriate dependencies Low tier services dependent by the critical/secure services Detection of invalid cross domain endpoint cyclic dependencies Architectural evolution corruption problems which cause low engineering productivity and poor sustainability Popular software and services evolve frequently and prone to become monolithic Redundant services and duplicated functionalities No recommendation to support architecture evolution Easy access to the cross-domain data Root cause visibility on cross-domain incidents The “Blindness”
  • 8. © 2018 eBay. All rights reserved.8 Lack of measurability for service architecture quality, or technical debts Cannot know… Metrics Suite Lack of service’s importance and impact metrics Lack of service quality (e.g., reliability, maintainability) metrics Automation to generate team / organization product dependencies and symbiosis System/Ecosystem visualization enhanced/powered by metrics Estimation of technical debts The “Ignorance”
  • 9. © 2018 eBay. All rights reserved.9 Missing real time algorithm-driven automation for diagnostic-ability or productivity Cannot have… Algorithmic/Artificial Intelligent IT operations or AI for IT Operations Anomaly detection for operations Antipattern detection for evolution Automated mining software code repository solutions Lack of automated solution to discover code-level knowledge or to support quality analysis. Governance in the ecosystem Service ecosystem evolution governance Optimization and architectural decision support (e.g., legacy code migration) The “Primitiveness”
  • 10. Blindness ENHANCE DEVOPS Enhance code intelligence for quality assurance and engineering productivity PROVIDE VISIBILITY Improve productivity and understandability of service ecosystem through knowledge graph Ignorance METRIC SUITE Quantitatively measure service behavior and metrics. “If you can’t measure it, you can’t improve it” PATTERN/ANOMALY DETECTION Automate service laying, service ranking, service classification; enforce rule-based Primitiveness AIOPs Enable observability, unified analysis and proactive incident/root cause detection and remediation SYMBIOSIS & DISCIPLINE Automated governance over service ecosystem for evolution The eBay INAR Blueprint { }
  • 11. © 2018 eBay. All rights reserved.11 Bring DevOPs Into Connected Graph Visibility Action Bring in data into graph to connect, understand, process and visualize Result: Graph-based Visibility We can now see by connecting cloud-native data, hardware, people, code and business What kind of data? ● WIRI (What it really is) and WISB (What it should be) ● Large-scale data within/across domains ● Analysis (Pattern, Dynamics and Behavior) ● Recommendation (Improvement) { }
  • 12. © 2018 eBay. All rights reserved.12 Knowledge Graph (Heterogeneous Graph) Knowledge Graph: 1. Represents knowledge domain. 2. Connects things of different types in a systematic way. 3. Encode knowledge arranged in a network of nodes and edges rather than tables of rows and columns. Why graph here? 1. Native method to understand the “Unknown” 2. White box and Visualization 3. The platform and support is ready (e.g. Data, Graph DB, Computation Power) 4. “Get ready for AI” or “translator of AI” analysis (Pattern) 5. Recommendation/Refactoring
  • 13. © 2018 eBay. All rights reserved.13 An Initial Design..
  • 14. © 2018 eBay. All rights reserved.14 A “Baby” Step..
  • 15. © 2018 eBay. All rights reserved.15 A “Baby” Step...
  • 16. © 2018 eBay. All rights reserved.16 Build Metrics Suite and Understand by AI Action Build Metrics on previous connected graph, to develop and understand patterns and anti-patterns in raw data and metrics Result: Intelligence Data We transform industrial SE data which is often noisy, low-quality, and large into clean / intelligence data
  • 17. © 2018 eBay. All rights reserved.17 Metrics (Data-driven Insights) Structural Metrics (by dependency) Traffic Metrics (by traffic) Mid-Activity (13%) High-Activity Pools (6%) Mid-Activity (8%)Low-Activity (72%) Essential Consumption Inbound Services Inbound Endpoints Traffic Vol.
  • 18. © 2018 eBay. All rights reserved.18 Metrics (Data-driven Insights) Essential Consumption Automated classification of service layering : 1. Back-End Pools 2. Middle-Tier Pools 3. Customer-Facing Pools (Experience, Public APIs) Service architecture refactoring support: 1. Classifying external-facing pools to support build “large team” APIs 2. Impact and importance
  • 19. © 2018 eBay. All rights reserved.19 Our current progress Enable AIOPs With SEI Applications Dependency system Pre/post release quality analysis Graph-based distributed system anomaly detection Capacity forecasting and planning Service popularity metrics and layering recommendation Invalid service dependency detection Action Leveraging graph-based visibility and intelligence data to build applications to support site operation and engineering productivity Result: Symbiosis Automated discipline and governance among software engineering services
  • 20. © 2018 eBay. All rights reserved.20 Grano Research Motivation Cross Level Anomaly Detection Univariate Anomaly by Detections Models (High False-Positive and Busy) Visibility Multi-Variate Anomaly Detection (Black-Box with Bad Interpretability) ML Models for Anomaly Detection for the components Ignorance Topology Example of eBay distributed data platform
  • 21. © 2018 eBay. All rights reserved.21 Solution: Grano ML +Graph Project health signals should be in-line with the lines ML - First level anomaly detection for components of distributed data platform. Consists of three different types of detection models to handle different anomalies (clustering-based model, forecast-based mode, statistical model) Visibility Ignorance Graph – Constructing the anomaly graph with different (health) signals Scoring the significance of every signal based on its distribution, and history frequency Scoring the system components based on their signals Propagating the scores through the topology to enhance the detection accuracy
  • 22. © 2018 eBay. All rights reserved.22 Solution: Grano System GRANO is an end-to-end graph-based anomaly detection and root cause analysis system for distributed cloud-native data platform Anomaly Detection Time series anomaly detection on metrics Adaptive Anomaly Graph Adaptive analysis on system topology and all signals to identify the root cause Applications Alerting, interactive Graph-UI, and integration with monitoring systems
  • 23. © 2018 eBay. All rights reserved.23 GRANO: Interactive Graph-based Root Cause Analysis for Cloud-Native Distributed Data Platform link Hanzhang Wang; Phuong Nguyen; Jun li, Selcuk Kopru, Gene Zhang, Sanjeev Katariya, Sami Ben-Romdhane eBay Inc. E2E Solution: Grano Applications (In Production) Intelligent anomaly graph view Automated Root cause analysis Component-based view Automated RCA graph High-level comprehensive monitoring Low-level RCA
  • 24. © 2018 eBay. All rights reserved.24 University Topics Area Fudan University Industry microservice study Distributed tracing flow analysis Drexel University Service quality metrics suite University of Michigan Intelligent Software Engineering Existing INAR Collaborations
  • 25. eBay SEI Vision “To build sustainable service architecture by providing automated visibility, assessment, and governance Intelligence.” Challenge Blindness Ignorance Primitiveness Progress DevOPs, quality dashboards, knowledge graph and etc. Service popularity and behavior metrics, pre/post release quality analysis and etc. Anomaly Detection, RCA and etc. Future work Intelligent visibility for developers Prioritize and quantify technical debt using service metrics Auto remediation Collaboration Service dependency analysis and governance Extend service metrics suite Critical flow and anomaly detection on distributed tracing graph Please Contact: hanzwang@ebay.com Request for Future INAR Collaboration