SlideShare a Scribd company logo
1 of 19
Download to read offline
Diving Deep into the API
Ocean with Open Source
Deep Learning Tools
Paul M. Cray, APImetrics
Who are APImetrics?
Seattle-based startup
Blue chip clients include banks, fintech, carriers, utilities
and vehicle IoT
• APImetrics makes individual or sequences of functional API calls
• Synthetic test calls can be scheduled to be made from any location
in any of the 4 main clouds (AWS, Azure, Google, IBM)
• Codebase written in Python with JavaScript for UI
• Data is analyzed using ML and AI functionality we are developing
using open source tools
Who does APImetrics do?
• to manage your APIs you need to understand how they actually behave
from the end-user’s perspective in the real world
• APImetrics is an API performance and quality monitoring system running as
Software-as-a-Service on Google App Engine
• we provide wizards that allow users to create authentications, test calls and
workflows (back-to-back calls) easily
• test calls can deployed to more than 60 cloud locations on four continents to
make scheduled calls to exercise API endpoints
• we support our own API to facilitate deep integration into higher-level
management systems
What does APImetrics look like?
APImetrics 4.7TB historical dataset
• Over 400M API call records made from multiple clouds and locations
• We retain retained all data associated with each call including
payload to give complete picture of API performance
– Timestamp of call
– API endpoint
– Call cloud location
– HTTP response code
– Payload
– Latency breakdown times
• DNS lookup, Connect, Handshake, Upload, Processing, Download
APImetrics Insights CASC score
• What metric do you use measure API performance?
– Latency? Availability? Pass rate?
• Too many variables to compare and contrast API quality easily
• APImetrics use our own magic sauce to combine metrics into a
single blended credit rating-like score
• CASC score allows at-a-glance like-to-like comparison and trend
analysis of the performance and quality of different API calls
• CASC scores are currently calculated on a weekly and monthly
basis, but daily scores coming soon
Typical APImetrics Insights CASC scores
The CASC score and Machine Learning
CHALLENGE: How do we calculate CASC scores in real time? What
do we need?
• More robust (patent application in progress) method for calculating
CASC score that leverages our unrivalled historical dataset
• Uses supervised learning with linear regression used to calculate
CASC parameters
• Python scikit-learn package also numpy, pandas, scipy and
statsmodel used in APImetrics Insights
It’s 2017. How about a
neural net?
The components to be looked
• Outlier detection
• Handling multimodality
• Identifying clusters of related events
• Anomaly detection
Outlier detection
• Historically:
– Heuristic designated a record an outlier if overall latency exceeded a certain
number of standard deviations from the mean
• Outlier detection is a visual problem
– We can see (some/most of) the outliers by eye
• How to use deep learning techniques to detect outliers?
– Implement Recurrent Neural Net (RNN) to analyze time series data?
– Implement Convolutional Neural Net (CNN) to recognise outlier patterns?
– Use PyTorch as it is emerging as the leading Deep Learning framework and
supports idiomatic Python approach
What outliers look like
Multimodality detection
• Latency distribution are typically neither unimodal nor normal
• Outlier detection heuristics relying on latencies being so are flawed
• Reliable outlier detection must first determine modality
• Easy by eye, but sensitive to binning
– Use a CNN to detect modality?
– Use a clustering algorithm to assign modality?
– How to handle binning problem?
What multimodality looks like
Cluster detection
• Currently using a heuristic to construct clusters of outliers
– Much too simplistic
• Exploring algorithms like k-means implemented in a package such
as scikit-learn
• But a result is more like to be an outlier if it is close to other outliers,
i.e. if it is in a cluster
• We believe outlier and cluster detection should be done
simultaneously
– Investigating if an RNN can identify whether a record is an outlier and whether it
belongs to a cluster
What clusters look like
APIs and AIPIs
• APImetrics has 4.7TB of (semi-)structured data packed with
actionable intelligence
– If we can discover it
• We know what we can look for, but what is hidden in the data
ocean?
• An experienced API support engineer can extrapolate from an issue
with one API to an similar issue with a completely different API
– Ultimate goal is a domain-specific AI that does this automagically: an Artificially
Intelligence Programming Interface (AIPI) that can capture, generate and
manipulate API-related knowledge
Diving Deep into the API
Ocean with Open Source
Deep Learning Tools
Paul M. Cray, APImetrics

More Related Content

What's hot

Add ons for stash
Add ons for stashAdd ons for stash
Add ons for stash
Xpand IT
 
A Spark-Based Intelligent Assistant: Making Data Exploration in Natural Langu...
A Spark-Based Intelligent Assistant: Making Data Exploration in Natural Langu...A Spark-Based Intelligent Assistant: Making Data Exploration in Natural Langu...
A Spark-Based Intelligent Assistant: Making Data Exploration in Natural Langu...
Databricks
 

What's hot (20)

Micro-Servicing Linked Data
Micro-Servicing Linked DataMicro-Servicing Linked Data
Micro-Servicing Linked Data
 
apidays LIVE Paris 2021 - Stargate.io, An OSS Api Layer for your Cassandra by...
apidays LIVE Paris 2021 - Stargate.io, An OSS Api Layer for your Cassandra by...apidays LIVE Paris 2021 - Stargate.io, An OSS Api Layer for your Cassandra by...
apidays LIVE Paris 2021 - Stargate.io, An OSS Api Layer for your Cassandra by...
 
The Pursuit of Happiness: Building a Scalable Pipeline Using Apache Spark and...
The Pursuit of Happiness: Building a Scalable Pipeline Using Apache Spark and...The Pursuit of Happiness: Building a Scalable Pipeline Using Apache Spark and...
The Pursuit of Happiness: Building a Scalable Pipeline Using Apache Spark and...
 
Algolia's Fury Road to a Worldwide API - Take Off Conference 2016
Algolia's Fury Road to a Worldwide API - Take Off Conference 2016Algolia's Fury Road to a Worldwide API - Take Off Conference 2016
Algolia's Fury Road to a Worldwide API - Take Off Conference 2016
 
Add ons for stash
Add ons for stashAdd ons for stash
Add ons for stash
 
Librecon 2016 bilbao: kappa architecture IoT of the cars
Librecon 2016 bilbao:   kappa architecture IoT of the carsLibrecon 2016 bilbao:   kappa architecture IoT of the cars
Librecon 2016 bilbao: kappa architecture IoT of the cars
 
Algolia's Fury Road to a Worldwide API
Algolia's Fury Road to a Worldwide APIAlgolia's Fury Road to a Worldwide API
Algolia's Fury Road to a Worldwide API
 
Evolving the Netflix API
Evolving the Netflix APIEvolving the Netflix API
Evolving the Netflix API
 
Lessons Learned in Deploying the ELK Stack (Elasticsearch, Logstash, and Kibana)
Lessons Learned in Deploying the ELK Stack (Elasticsearch, Logstash, and Kibana)Lessons Learned in Deploying the ELK Stack (Elasticsearch, Logstash, and Kibana)
Lessons Learned in Deploying the ELK Stack (Elasticsearch, Logstash, and Kibana)
 
Global Azure Virtual - Application Autoscaling with KEDA
Global Azure Virtual - Application Autoscaling with KEDAGlobal Azure Virtual - Application Autoscaling with KEDA
Global Azure Virtual - Application Autoscaling with KEDA
 
Big data at AWS Chicago User Group - 2014
Big data at AWS Chicago User Group - 2014Big data at AWS Chicago User Group - 2014
Big data at AWS Chicago User Group - 2014
 
Introduction to Promitor
Introduction to PromitorIntroduction to Promitor
Introduction to Promitor
 
A Spark-Based Intelligent Assistant: Making Data Exploration in Natural Langu...
A Spark-Based Intelligent Assistant: Making Data Exploration in Natural Langu...A Spark-Based Intelligent Assistant: Making Data Exploration in Natural Langu...
A Spark-Based Intelligent Assistant: Making Data Exploration in Natural Langu...
 
Fury road to a worldwide API - API Days - December 2015
Fury road to a worldwide API - API Days - December 2015Fury road to a worldwide API - API Days - December 2015
Fury road to a worldwide API - API Days - December 2015
 
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
 
HBase Meetup @ Cask HQ 09/25
HBase Meetup @ Cask HQ 09/25HBase Meetup @ Cask HQ 09/25
HBase Meetup @ Cask HQ 09/25
 
Scalable Open-Source IoT Solutions on Microsoft Azure
Scalable Open-Source IoT Solutions on Microsoft AzureScalable Open-Source IoT Solutions on Microsoft Azure
Scalable Open-Source IoT Solutions on Microsoft Azure
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
 
AI at Scale
AI at ScaleAI at Scale
AI at Scale
 
Apache flink
Apache flinkApache flink
Apache flink
 

Similar to LF_APIStrat17_Diving Deep into the API Ocean with Open Source Deep Learning Tools

Expose Yourself Without Insecurity: Cloud Breach Patterns
Expose Yourself Without Insecurity: Cloud Breach PatternsExpose Yourself Without Insecurity: Cloud Breach Patterns
Expose Yourself Without Insecurity: Cloud Breach Patterns
Rob Ragan
 
API and Big Data Solution Patterns
API and Big Data Solution Patterns API and Big Data Solution Patterns
API and Big Data Solution Patterns
WSO2
 
Monitoring-Docker-Container-and-Dockerized-Applications
Monitoring-Docker-Container-and-Dockerized-ApplicationsMonitoring-Docker-Container-and-Dockerized-Applications
Monitoring-Docker-Container-and-Dockerized-Applications
Satya Sanjibani Routray
 
API Gateways are going through an identity crisis
API Gateways are going through an identity crisisAPI Gateways are going through an identity crisis
API Gateways are going through an identity crisis
Christian Posta
 
Data Science With Python | Python For Data Science | Python Data Science Cour...
Data Science With Python | Python For Data Science | Python Data Science Cour...Data Science With Python | Python For Data Science | Python Data Science Cour...
Data Science With Python | Python For Data Science | Python Data Science Cour...
Simplilearn
 

Similar to LF_APIStrat17_Diving Deep into the API Ocean with Open Source Deep Learning Tools (20)

ADDO Open Source Observability Tools
ADDO Open Source Observability Tools ADDO Open Source Observability Tools
ADDO Open Source Observability Tools
 
GitOps 101 Presentation.pdf
GitOps 101 Presentation.pdfGitOps 101 Presentation.pdf
GitOps 101 Presentation.pdf
 
DevSecCon Asia 2017 - Abhay Bhargav: Building an Application Vulnerability To...
DevSecCon Asia 2017 - Abhay Bhargav: Building an Application Vulnerability To...DevSecCon Asia 2017 - Abhay Bhargav: Building an Application Vulnerability To...
DevSecCon Asia 2017 - Abhay Bhargav: Building an Application Vulnerability To...
 
Expose Yourself Without Insecurity: Cloud Breach Patterns
Expose Yourself Without Insecurity: Cloud Breach PatternsExpose Yourself Without Insecurity: Cloud Breach Patterns
Expose Yourself Without Insecurity: Cloud Breach Patterns
 
State of Florida Neo4j Graph Briefing - Cyber IAM
State of Florida Neo4j Graph Briefing - Cyber IAMState of Florida Neo4j Graph Briefing - Cyber IAM
State of Florida Neo4j Graph Briefing - Cyber IAM
 
Open source historian
Open source historianOpen source historian
Open source historian
 
Webinar: Fusion 3.1 - What's New
Webinar: Fusion 3.1 - What's NewWebinar: Fusion 3.1 - What's New
Webinar: Fusion 3.1 - What's New
 
Testing APIs in the Cloud
Testing APIs in the CloudTesting APIs in the Cloud
Testing APIs in the Cloud
 
Reduce API Security Risk by Leveraging Graph Analytics Webinar Slides
Reduce API Security Risk by Leveraging Graph Analytics Webinar SlidesReduce API Security Risk by Leveraging Graph Analytics Webinar Slides
Reduce API Security Risk by Leveraging Graph Analytics Webinar Slides
 
API and Big Data Solution Patterns
API and Big Data Solution Patterns API and Big Data Solution Patterns
API and Big Data Solution Patterns
 
Monitoring-Docker-Container-and-Dockerized-Applications
Monitoring-Docker-Container-and-Dockerized-ApplicationsMonitoring-Docker-Container-and-Dockerized-Applications
Monitoring-Docker-Container-and-Dockerized-Applications
 
Monitoring docker container and dockerized applications
Monitoring docker container and dockerized applicationsMonitoring docker container and dockerized applications
Monitoring docker container and dockerized applications
 
Monitoring docker-container-and-dockerized-applications
Monitoring docker-container-and-dockerized-applicationsMonitoring docker-container-and-dockerized-applications
Monitoring docker-container-and-dockerized-applications
 
API Gateways are going through an identity crisis
API Gateways are going through an identity crisisAPI Gateways are going through an identity crisis
API Gateways are going through an identity crisis
 
Open Banking & Open Insurance
Open Banking & Open InsuranceOpen Banking & Open Insurance
Open Banking & Open Insurance
 
Data Science With Python | Python For Data Science | Python Data Science Cour...
Data Science With Python | Python For Data Science | Python Data Science Cour...Data Science With Python | Python For Data Science | Python Data Science Cour...
Data Science With Python | Python For Data Science | Python Data Science Cour...
 
Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...
 
CloudHealth: A Model-Driven Approach to Watch the Health of Cloud Services
CloudHealth: A Model-Driven Approach to Watch the Health of Cloud ServicesCloudHealth: A Model-Driven Approach to Watch the Health of Cloud Services
CloudHealth: A Model-Driven Approach to Watch the Health of Cloud Services
 
DevOps in the Cloud with Microsoft Azure
DevOps in the Cloud with Microsoft AzureDevOps in the Cloud with Microsoft Azure
DevOps in the Cloud with Microsoft Azure
 
Monitoring docker containers and dockerized applications
Monitoring docker containers and dockerized applicationsMonitoring docker containers and dockerized applications
Monitoring docker containers and dockerized applications
 

More from LF_APIStrat

More from LF_APIStrat (20)

LF_APIStrat17_OWASP’s Latest Category: API Underprotection
LF_APIStrat17_OWASP’s Latest Category: API UnderprotectionLF_APIStrat17_OWASP’s Latest Category: API Underprotection
LF_APIStrat17_OWASP’s Latest Category: API Underprotection
 
LF_APIStrat17_Creating Communication Applications using the Asterisk RESTFul ...
LF_APIStrat17_Creating Communication Applications using the Asterisk RESTFul ...LF_APIStrat17_Creating Communication Applications using the Asterisk RESTFul ...
LF_APIStrat17_Creating Communication Applications using the Asterisk RESTFul ...
 
LF_APIStrat17_Super-Powered REST API Testing
LF_APIStrat17_Super-Powered REST API TestingLF_APIStrat17_Super-Powered REST API Testing
LF_APIStrat17_Super-Powered REST API Testing
 
LF_APIStrat17_How Mature are You? A Developer Experience Maturity Model
LF_APIStrat17_How Mature are You? A Developer Experience Maturity ModelLF_APIStrat17_How Mature are You? A Developer Experience Maturity Model
LF_APIStrat17_How Mature are You? A Developer Experience Maturity Model
 
LF_APIStrat17_Connect Your RESTful API to Hundreds of Others in Minutes (Zapi...
LF_APIStrat17_Connect Your RESTful API to Hundreds of Others in Minutes (Zapi...LF_APIStrat17_Connect Your RESTful API to Hundreds of Others in Minutes (Zapi...
LF_APIStrat17_Connect Your RESTful API to Hundreds of Others in Minutes (Zapi...
 
LF_APIStrat17_Things I Wish People Told Me About Writing Docs
LF_APIStrat17_Things I Wish People Told Me About Writing DocsLF_APIStrat17_Things I Wish People Told Me About Writing Docs
LF_APIStrat17_Things I Wish People Told Me About Writing Docs
 
LF_APIStrat17_Lifting Legacy to the Cloud on API Boosters
LF_APIStrat17_Lifting Legacy to the Cloud on API BoostersLF_APIStrat17_Lifting Legacy to the Cloud on API Boosters
LF_APIStrat17_Lifting Legacy to the Cloud on API Boosters
 
LF_APIStrat17_Contract-first API Development: A Case Study in Parallel API Pu...
LF_APIStrat17_Contract-first API Development: A Case Study in Parallel API Pu...LF_APIStrat17_Contract-first API Development: A Case Study in Parallel API Pu...
LF_APIStrat17_Contract-first API Development: A Case Study in Parallel API Pu...
 
LF_APIStrat17_Don't Repeat Yourself - Your API is Your Documentation
LF_APIStrat17_Don't Repeat Yourself - Your API is Your DocumentationLF_APIStrat17_Don't Repeat Yourself - Your API is Your Documentation
LF_APIStrat17_Don't Repeat Yourself - Your API is Your Documentation
 
LF_APIStrat17_How We Doubled the Velocity of Our Developer Experience Team
LF_APIStrat17_How We Doubled the Velocity of Our Developer Experience TeamLF_APIStrat17_How We Doubled the Velocity of Our Developer Experience Team
LF_APIStrat17_How We Doubled the Velocity of Our Developer Experience Team
 
LF_APIStrat17_API Marketing: First Comes Usability, then Discoverability
LF_APIStrat17_API Marketing: First Comes Usability, then DiscoverabilityLF_APIStrat17_API Marketing: First Comes Usability, then Discoverability
LF_APIStrat17_API Marketing: First Comes Usability, then Discoverability
 
LF_APIStrat17_Standing Taller with Technology: APIs, IoT, and the Digital Wor...
LF_APIStrat17_Standing Taller with Technology: APIs, IoT, and the Digital Wor...LF_APIStrat17_Standing Taller with Technology: APIs, IoT, and the Digital Wor...
LF_APIStrat17_Standing Taller with Technology: APIs, IoT, and the Digital Wor...
 
LF_APIStrat17_REST API Microversions
LF_APIStrat17_REST API Microversions LF_APIStrat17_REST API Microversions
LF_APIStrat17_REST API Microversions
 
LF_APIStrat17_I Believe You But My Enterprise Don't: Adopting Open Standards ...
LF_APIStrat17_I Believe You But My Enterprise Don't: Adopting Open Standards ...LF_APIStrat17_I Believe You But My Enterprise Don't: Adopting Open Standards ...
LF_APIStrat17_I Believe You But My Enterprise Don't: Adopting Open Standards ...
 
LF_APIStrat17_Case Study: Cold Decision Trees
LF_APIStrat17_Case Study: Cold Decision TreesLF_APIStrat17_Case Study: Cold Decision Trees
LF_APIStrat17_Case Study: Cold Decision Trees
 
LF_APIStrat17_Getting Your API House In Order
LF_APIStrat17_Getting Your API House In OrderLF_APIStrat17_Getting Your API House In Order
LF_APIStrat17_Getting Your API House In Order
 
LF_APIStrat17_Supporting SDKs in 7 Different Programming Languages While Main...
LF_APIStrat17_Supporting SDKs in 7 Different Programming Languages While Main...LF_APIStrat17_Supporting SDKs in 7 Different Programming Languages While Main...
LF_APIStrat17_Supporting SDKs in 7 Different Programming Languages While Main...
 
LF_APIStrat17_Open Data vs. the World
LF_APIStrat17_Open Data vs. the World LF_APIStrat17_Open Data vs. the World
LF_APIStrat17_Open Data vs. the World
 
LF_APIStrat17_Practical DevSecOps for APIs
LF_APIStrat17_Practical DevSecOps for APIsLF_APIStrat17_Practical DevSecOps for APIs
LF_APIStrat17_Practical DevSecOps for APIs
 
LF_APIStrat17_Bulletproofing Your API's
LF_APIStrat17_Bulletproofing Your API'sLF_APIStrat17_Bulletproofing Your API's
LF_APIStrat17_Bulletproofing Your API's
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Recently uploaded (20)

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 

LF_APIStrat17_Diving Deep into the API Ocean with Open Source Deep Learning Tools

  • 1.
  • 2. Diving Deep into the API Ocean with Open Source Deep Learning Tools Paul M. Cray, APImetrics
  • 3. Who are APImetrics? Seattle-based startup Blue chip clients include banks, fintech, carriers, utilities and vehicle IoT • APImetrics makes individual or sequences of functional API calls • Synthetic test calls can be scheduled to be made from any location in any of the 4 main clouds (AWS, Azure, Google, IBM) • Codebase written in Python with JavaScript for UI • Data is analyzed using ML and AI functionality we are developing using open source tools
  • 4. Who does APImetrics do? • to manage your APIs you need to understand how they actually behave from the end-user’s perspective in the real world • APImetrics is an API performance and quality monitoring system running as Software-as-a-Service on Google App Engine • we provide wizards that allow users to create authentications, test calls and workflows (back-to-back calls) easily • test calls can deployed to more than 60 cloud locations on four continents to make scheduled calls to exercise API endpoints • we support our own API to facilitate deep integration into higher-level management systems
  • 5. What does APImetrics look like?
  • 6. APImetrics 4.7TB historical dataset • Over 400M API call records made from multiple clouds and locations • We retain retained all data associated with each call including payload to give complete picture of API performance – Timestamp of call – API endpoint – Call cloud location – HTTP response code – Payload – Latency breakdown times • DNS lookup, Connect, Handshake, Upload, Processing, Download
  • 7. APImetrics Insights CASC score • What metric do you use measure API performance? – Latency? Availability? Pass rate? • Too many variables to compare and contrast API quality easily • APImetrics use our own magic sauce to combine metrics into a single blended credit rating-like score • CASC score allows at-a-glance like-to-like comparison and trend analysis of the performance and quality of different API calls • CASC scores are currently calculated on a weekly and monthly basis, but daily scores coming soon
  • 9. The CASC score and Machine Learning CHALLENGE: How do we calculate CASC scores in real time? What do we need? • More robust (patent application in progress) method for calculating CASC score that leverages our unrivalled historical dataset • Uses supervised learning with linear regression used to calculate CASC parameters • Python scikit-learn package also numpy, pandas, scipy and statsmodel used in APImetrics Insights
  • 10. It’s 2017. How about a neural net?
  • 11. The components to be looked • Outlier detection • Handling multimodality • Identifying clusters of related events • Anomaly detection
  • 12. Outlier detection • Historically: – Heuristic designated a record an outlier if overall latency exceeded a certain number of standard deviations from the mean • Outlier detection is a visual problem – We can see (some/most of) the outliers by eye • How to use deep learning techniques to detect outliers? – Implement Recurrent Neural Net (RNN) to analyze time series data? – Implement Convolutional Neural Net (CNN) to recognise outlier patterns? – Use PyTorch as it is emerging as the leading Deep Learning framework and supports idiomatic Python approach
  • 14. Multimodality detection • Latency distribution are typically neither unimodal nor normal • Outlier detection heuristics relying on latencies being so are flawed • Reliable outlier detection must first determine modality • Easy by eye, but sensitive to binning – Use a CNN to detect modality? – Use a clustering algorithm to assign modality? – How to handle binning problem?
  • 16. Cluster detection • Currently using a heuristic to construct clusters of outliers – Much too simplistic • Exploring algorithms like k-means implemented in a package such as scikit-learn • But a result is more like to be an outlier if it is close to other outliers, i.e. if it is in a cluster • We believe outlier and cluster detection should be done simultaneously – Investigating if an RNN can identify whether a record is an outlier and whether it belongs to a cluster
  • 18. APIs and AIPIs • APImetrics has 4.7TB of (semi-)structured data packed with actionable intelligence – If we can discover it • We know what we can look for, but what is hidden in the data ocean? • An experienced API support engineer can extrapolate from an issue with one API to an similar issue with a completely different API – Ultimate goal is a domain-specific AI that does this automagically: an Artificially Intelligence Programming Interface (AIPI) that can capture, generate and manipulate API-related knowledge
  • 19. Diving Deep into the API Ocean with Open Source Deep Learning Tools Paul M. Cray, APImetrics