SlideShare a Scribd company logo
1 of 14
Download to read offline
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Machine learning for developers &
data scientists with Amazon SageMaker
Vikrant Kahlir
Enterprise solution architect
Amazon Web Services
A I M 3 0 2
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Put machine learning in the
hands of every developer
Our mission at AWS
Our mission at AWS
Put machine learning in the
hands of every developer
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
The Amazon ML stack:
Broadest and deepest set of capabilities
AI services
Easily add intelligence to applications without machine learning skills
Vision | Documents | Speech | Language | Chatbots | Forecasting | Recommendations
ML services
Build, train, and deploy machine learning models quickly and easily
Data labeling | Pre-built algorithms and notebooks | One-click training and deployment
ML frameworks &
infrastructure
Flexibility and choice, highest-performing infrastructure
Support for ML frameworks | Compute options purpose-built for ML
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
1
2
3
Amazon SageMaker:
Build, train, and deploy ML models at scale
Pre-built notebooks
for common problems
Amazon
SageMaker Ground
Truth
Collect and prepare
training data
Choose and optimize
your ML algorithm
Built-in,
high-performance
algorithms
AWS Marketplace for
machine learning
One-click training on
the highest-performing
infrastructure
Set up and manage
environments for
training
Amazon EC2 P3
instances
Amazon
SageMaker RL
Train and
tune models
Model
optimization
Amazon
SageMaker Neo
Scale and manage
the production
environment
Fully managed with
auto-scaling for 75%
less
Amazon Elastic
Inference
One-click deployment
Deploy models in
production
Deploy everywhere
Flexibility and choice with modules for ML developers and data scientists
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Problem statement
Health care insurance fraud is a pressing problem, causing substantial and
increasing costs in medical insurance programs.
Due to large amounts of claims submitted, review of individual claims becomes a
difficult task and encourages the employment of automated pre-payment controls
and better post-payment decision support tools to enable subject matter expert
analysis.
We will demonstrate the application of unsupervised anomalous outlier
techniques on a minimal set of metrics made available in the CMS Medicare
inpatient claims from 2008.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Dataset
CMS Medicare inpatient claims from 2008
• Medicare inpatient claims from 2008
• Each record is an inpatient claim incurred by a 5% sample of Medicare
beneficiaries
• Beneficiary identities are not provided
• ZIP Codes of facilities where patient was treated are not provided
• The file contains eight variables: one primary key and 7 analytic variables
• Data dictionary required to interpret codes in dataset are provided
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Data variables
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
1
2
3
End-to-end process for workshop
Apply Word2Vec for
feature extraction on
procedure and
diagnosis
Feature
engineering
Train to identify
principal components
in training dataset
Train with Amazon
PCA algorithm
Calculate anomaly
score for test claims
and analyze results
Inference and
analysis
Deploy trained model
Deploy model
Download workshop
resources
Launch Amazon
SageMaker Jupyter
Notebook
Workshop
preparation
Download data
Clean data
Preprocess data
Data
processing
Objective: Flag healthcare claims as a targeting method for further investigation
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Techniques and algorithms used in workshop
• Outlier detection
• Word embedding and Word2Vec
• Principal component analysis
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Outlier detection
“Observation which deviates so much from other observations as to arouse suspicion it was generated
by a different mechanism” 
 Douglas M. Hawkins
• Modeling normal objects and outlier effectively. The border between data normality and abnormality
(outliers) is often not clear-cut.
• Outlier detection methods could be application specific. For example, in clinical data small deviation could
be an outlier, but in a marketing application large deviation is required to justify an outlier.
• Noise in data may be present as deviations in attribute values or even as missing values. Noise may hide an
outlier or may flag deviation as an outlier.
• Providing justification for an outlier from understandability point of view may be difficult.
Challenges
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Word embedding and Word2Vec
Word embedding
Word2Vec
• Text converted into numbers, and there may be different numerical
representations of the same text
• Many techniques exist; Continuous Bag of Words (CBOW) and skip-gram are
popular and effective for large corpus of documents
• Example: CBOW predicts the probability of a word given a context
• Shallow neural networks that map word(s) to the target variable, which is also a
word or words
• Learn weights, which act as word vector representations
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Principal component analysis
When
• Do you want to reduce the number of variables, but aren’t able to identify variables to completely
remove from consideration?
• Do you want to ensure that your variables are independent of one another?
• Are you comfortable making your independent variables less interpretable?
How
• A measure of how each variable is associated with one another (covariance matrix)
• The directions in which our data are dispersed (eigenvectors)
• The relative importance of these different directions (eigenvalues)
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Complete the workshop
http://bit.ly/2LX4sRf
Thank you!
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Vikrant Kahlir
Contact information

More Related Content

What's hot

Mythical Mysfits: Build & collaborate on a modern web application on AWS - MA...
Mythical Mysfits: Build & collaborate on a modern web application on AWS - MA...Mythical Mysfits: Build & collaborate on a modern web application on AWS - MA...
Mythical Mysfits: Build & collaborate on a modern web application on AWS - MA...Amazon Web Services
 
Breaking down monolithic applications into microservices - DEM06-S - Chicago ...
Breaking down monolithic applications into microservices - DEM06-S - Chicago ...Breaking down monolithic applications into microservices - DEM06-S - Chicago ...
Breaking down monolithic applications into microservices - DEM06-S - Chicago ...Amazon Web Services
 
Making CI/CD pipelines safer with application monitoring and tracing - MAD202...
Making CI/CD pipelines safer with application monitoring and tracing - MAD202...Making CI/CD pipelines safer with application monitoring and tracing - MAD202...
Making CI/CD pipelines safer with application monitoring and tracing - MAD202...Amazon Web Services
 
Machine learning at the IoT Edge with AWS IoT Greengrass - SVC203 - Atlanta A...
Machine learning at the IoT Edge with AWS IoT Greengrass - SVC203 - Atlanta A...Machine learning at the IoT Edge with AWS IoT Greengrass - SVC203 - Atlanta A...
Machine learning at the IoT Edge with AWS IoT Greengrass - SVC203 - Atlanta A...Amazon Web Services
 
Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...
Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...
Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...Amazon Web Services
 
Unified monitoring of the container environment, containers, and applications...
Unified monitoring of the container environment, containers, and applications...Unified monitoring of the container environment, containers, and applications...
Unified monitoring of the container environment, containers, and applications...Amazon Web Services
 
Introducing Open Distro for Elasticsearch - ADB201 - Atlanta AWS Summit
Introducing Open Distro for Elasticsearch - ADB201 - Atlanta AWS SummitIntroducing Open Distro for Elasticsearch - ADB201 - Atlanta AWS Summit
Introducing Open Distro for Elasticsearch - ADB201 - Atlanta AWS SummitAmazon Web Services
 
Move desktops & applications to AWS with Amazon WorkSpaces & AppStream 2.0 - ...
Move desktops & applications to AWS with Amazon WorkSpaces & AppStream 2.0 - ...Move desktops & applications to AWS with Amazon WorkSpaces & AppStream 2.0 - ...
Move desktops & applications to AWS with Amazon WorkSpaces & AppStream 2.0 - ...Amazon Web Services
 
Monitoring with container insights & anomaly detection - SVC308 - New York AW...
Monitoring with container insights & anomaly detection - SVC308 - New York AW...Monitoring with container insights & anomaly detection - SVC308 - New York AW...
Monitoring with container insights & anomaly detection - SVC308 - New York AW...Amazon Web Services
 
Lessons from WuXi NextCODE Scales Up To Accelerate Data Sequencing in Their D...
Lessons from WuXi NextCODE Scales Up To Accelerate Data Sequencing in Their D...Lessons from WuXi NextCODE Scales Up To Accelerate Data Sequencing in Their D...
Lessons from WuXi NextCODE Scales Up To Accelerate Data Sequencing in Their D...Amazon Web Services
 
Deploy and manage Kubernetes on AWS from your on-premises environment - DEM04...
Deploy and manage Kubernetes on AWS from your on-premises environment - DEM04...Deploy and manage Kubernetes on AWS from your on-premises environment - DEM04...
Deploy and manage Kubernetes on AWS from your on-premises environment - DEM04...Amazon Web Services
 
Safeguarding the integrity of your code for fast, secure deployments - SVC301...
Safeguarding the integrity of your code for fast, secure deployments - SVC301...Safeguarding the integrity of your code for fast, secure deployments - SVC301...
Safeguarding the integrity of your code for fast, secure deployments - SVC301...Amazon Web Services
 
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...Amazon Web Services
 
Securely deliver applications with AWS - SVC305 - Atlanta AWS Summit
Securely deliver applications with AWS - SVC305 - Atlanta AWS SummitSecurely deliver applications with AWS - SVC305 - Atlanta AWS Summit
Securely deliver applications with AWS - SVC305 - Atlanta AWS SummitAmazon Web Services
 
Deep dive on security in Amazon S3 - STG304 - Chicago AWS Summit
Deep dive on security in Amazon S3 - STG304 - Chicago AWS SummitDeep dive on security in Amazon S3 - STG304 - Chicago AWS Summit
Deep dive on security in Amazon S3 - STG304 - Chicago AWS SummitAmazon Web Services
 
AWS IoT Greengrass Workshop - SVC303 - Anaheim AWS Summit
AWS IoT Greengrass Workshop - SVC303 - Anaheim AWS SummitAWS IoT Greengrass Workshop - SVC303 - Anaheim AWS Summit
AWS IoT Greengrass Workshop - SVC303 - Anaheim AWS SummitAmazon Web Services
 
Detecting and responding to critical events with AWS IoT Events - SVC205 - Ch...
Detecting and responding to critical events with AWS IoT Events - SVC205 - Ch...Detecting and responding to critical events with AWS IoT Events - SVC205 - Ch...
Detecting and responding to critical events with AWS IoT Events - SVC205 - Ch...Amazon Web Services
 
Architecting Digital Media Archive Migrations with AWS - STG301 - Anaheim AWS...
Architecting Digital Media Archive Migrations with AWS - STG301 - Anaheim AWS...Architecting Digital Media Archive Migrations with AWS - STG301 - Anaheim AWS...
Architecting Digital Media Archive Migrations with AWS - STG301 - Anaheim AWS...Amazon Web Services
 
Carry security with you to the cloud - DEM14-SR - New York AWS Summit
Carry security with you to the cloud - DEM14-SR - New York AWS SummitCarry security with you to the cloud - DEM14-SR - New York AWS Summit
Carry security with you to the cloud - DEM14-SR - New York AWS SummitAmazon Web Services
 
CI/CD best practices for building modern applications - MAD302 - Atlanta AWS ...
CI/CD best practices for building modern applications - MAD302 - Atlanta AWS ...CI/CD best practices for building modern applications - MAD302 - Atlanta AWS ...
CI/CD best practices for building modern applications - MAD302 - Atlanta AWS ...Amazon Web Services
 

What's hot (20)

Mythical Mysfits: Build & collaborate on a modern web application on AWS - MA...
Mythical Mysfits: Build & collaborate on a modern web application on AWS - MA...Mythical Mysfits: Build & collaborate on a modern web application on AWS - MA...
Mythical Mysfits: Build & collaborate on a modern web application on AWS - MA...
 
Breaking down monolithic applications into microservices - DEM06-S - Chicago ...
Breaking down monolithic applications into microservices - DEM06-S - Chicago ...Breaking down monolithic applications into microservices - DEM06-S - Chicago ...
Breaking down monolithic applications into microservices - DEM06-S - Chicago ...
 
Making CI/CD pipelines safer with application monitoring and tracing - MAD202...
Making CI/CD pipelines safer with application monitoring and tracing - MAD202...Making CI/CD pipelines safer with application monitoring and tracing - MAD202...
Making CI/CD pipelines safer with application monitoring and tracing - MAD202...
 
Machine learning at the IoT Edge with AWS IoT Greengrass - SVC203 - Atlanta A...
Machine learning at the IoT Edge with AWS IoT Greengrass - SVC203 - Atlanta A...Machine learning at the IoT Edge with AWS IoT Greengrass - SVC203 - Atlanta A...
Machine learning at the IoT Edge with AWS IoT Greengrass - SVC203 - Atlanta A...
 
Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...
Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...
Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...
 
Unified monitoring of the container environment, containers, and applications...
Unified monitoring of the container environment, containers, and applications...Unified monitoring of the container environment, containers, and applications...
Unified monitoring of the container environment, containers, and applications...
 
Introducing Open Distro for Elasticsearch - ADB201 - Atlanta AWS Summit
Introducing Open Distro for Elasticsearch - ADB201 - Atlanta AWS SummitIntroducing Open Distro for Elasticsearch - ADB201 - Atlanta AWS Summit
Introducing Open Distro for Elasticsearch - ADB201 - Atlanta AWS Summit
 
Move desktops & applications to AWS with Amazon WorkSpaces & AppStream 2.0 - ...
Move desktops & applications to AWS with Amazon WorkSpaces & AppStream 2.0 - ...Move desktops & applications to AWS with Amazon WorkSpaces & AppStream 2.0 - ...
Move desktops & applications to AWS with Amazon WorkSpaces & AppStream 2.0 - ...
 
Monitoring with container insights & anomaly detection - SVC308 - New York AW...
Monitoring with container insights & anomaly detection - SVC308 - New York AW...Monitoring with container insights & anomaly detection - SVC308 - New York AW...
Monitoring with container insights & anomaly detection - SVC308 - New York AW...
 
Lessons from WuXi NextCODE Scales Up To Accelerate Data Sequencing in Their D...
Lessons from WuXi NextCODE Scales Up To Accelerate Data Sequencing in Their D...Lessons from WuXi NextCODE Scales Up To Accelerate Data Sequencing in Their D...
Lessons from WuXi NextCODE Scales Up To Accelerate Data Sequencing in Their D...
 
Deploy and manage Kubernetes on AWS from your on-premises environment - DEM04...
Deploy and manage Kubernetes on AWS from your on-premises environment - DEM04...Deploy and manage Kubernetes on AWS from your on-premises environment - DEM04...
Deploy and manage Kubernetes on AWS from your on-premises environment - DEM04...
 
Safeguarding the integrity of your code for fast, secure deployments - SVC301...
Safeguarding the integrity of your code for fast, secure deployments - SVC301...Safeguarding the integrity of your code for fast, secure deployments - SVC301...
Safeguarding the integrity of your code for fast, secure deployments - SVC301...
 
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...
 
Securely deliver applications with AWS - SVC305 - Atlanta AWS Summit
Securely deliver applications with AWS - SVC305 - Atlanta AWS SummitSecurely deliver applications with AWS - SVC305 - Atlanta AWS Summit
Securely deliver applications with AWS - SVC305 - Atlanta AWS Summit
 
Deep dive on security in Amazon S3 - STG304 - Chicago AWS Summit
Deep dive on security in Amazon S3 - STG304 - Chicago AWS SummitDeep dive on security in Amazon S3 - STG304 - Chicago AWS Summit
Deep dive on security in Amazon S3 - STG304 - Chicago AWS Summit
 
AWS IoT Greengrass Workshop - SVC303 - Anaheim AWS Summit
AWS IoT Greengrass Workshop - SVC303 - Anaheim AWS SummitAWS IoT Greengrass Workshop - SVC303 - Anaheim AWS Summit
AWS IoT Greengrass Workshop - SVC303 - Anaheim AWS Summit
 
Detecting and responding to critical events with AWS IoT Events - SVC205 - Ch...
Detecting and responding to critical events with AWS IoT Events - SVC205 - Ch...Detecting and responding to critical events with AWS IoT Events - SVC205 - Ch...
Detecting and responding to critical events with AWS IoT Events - SVC205 - Ch...
 
Architecting Digital Media Archive Migrations with AWS - STG301 - Anaheim AWS...
Architecting Digital Media Archive Migrations with AWS - STG301 - Anaheim AWS...Architecting Digital Media Archive Migrations with AWS - STG301 - Anaheim AWS...
Architecting Digital Media Archive Migrations with AWS - STG301 - Anaheim AWS...
 
Carry security with you to the cloud - DEM14-SR - New York AWS Summit
Carry security with you to the cloud - DEM14-SR - New York AWS SummitCarry security with you to the cloud - DEM14-SR - New York AWS Summit
Carry security with you to the cloud - DEM14-SR - New York AWS Summit
 
CI/CD best practices for building modern applications - MAD302 - Atlanta AWS ...
CI/CD best practices for building modern applications - MAD302 - Atlanta AWS ...CI/CD best practices for building modern applications - MAD302 - Atlanta AWS ...
CI/CD best practices for building modern applications - MAD302 - Atlanta AWS ...
 

Similar to Machine learning for developers & data scientists with Amazon SageMaker - AIM302 - Chicago AWS Summit

Build, train, and deploy ML models with Amazon SageMaker - AIM302 - New York ...
Build, train, and deploy ML models with Amazon SageMaker - AIM302 - New York ...Build, train, and deploy ML models with Amazon SageMaker - AIM302 - New York ...
Build, train, and deploy ML models with Amazon SageMaker - AIM302 - New York ...Amazon Web Services
 
The Zen of DataOps – AWS Lake Formation and the Data Supply Chain Pipeline
The Zen of DataOps – AWS Lake Formation and the Data Supply Chain PipelineThe Zen of DataOps – AWS Lake Formation and the Data Supply Chain Pipeline
The Zen of DataOps – AWS Lake Formation and the Data Supply Chain PipelineAmazon Web Services
 
Two Approaches to Cloud Adoption and Organizational Transformation: Emory Uni...
Two Approaches to Cloud Adoption and Organizational Transformation: Emory Uni...Two Approaches to Cloud Adoption and Organizational Transformation: Emory Uni...
Two Approaches to Cloud Adoption and Organizational Transformation: Emory Uni...Amazon Web Services
 
How Do I Plan for Security, Risk and Compliance when Migrating to AWS?
How Do I Plan for Security, Risk and Compliance when Migrating to AWS?How Do I Plan for Security, Risk and Compliance when Migrating to AWS?
How Do I Plan for Security, Risk and Compliance when Migrating to AWS?Amazon Web Services
 
How to Wrangle Data for Machine Learning on AWS
 How to Wrangle Data for Machine Learning on AWS How to Wrangle Data for Machine Learning on AWS
How to Wrangle Data for Machine Learning on AWSAmazon Web Services
 
AI ﹑大數據媒體應用和利用機器學習與 AWS 媒體服務實現自動化內容生成
AI ﹑大數據媒體應用和利用機器學習與 AWS 媒體服務實現自動化內容生成AI ﹑大數據媒體應用和利用機器學習與 AWS 媒體服務實現自動化內容生成
AI ﹑大數據媒體應用和利用機器學習與 AWS 媒體服務實現自動化內容生成Amazon Web Services
 
Build sophisticated forecasting and recommendation models - AIM202 - New York...
Build sophisticated forecasting and recommendation models - AIM202 - New York...Build sophisticated forecasting and recommendation models - AIM202 - New York...
Build sophisticated forecasting and recommendation models - AIM202 - New York...Amazon Web Services
 
Mythbusting the Federal Cloud Journey
Mythbusting the Federal Cloud JourneyMythbusting the Federal Cloud Journey
Mythbusting the Federal Cloud JourneyAmazon Web Services
 
Mining Intelligent Insights: AI/ML for Financial Services
Mining Intelligent Insights: AI/ML for Financial ServicesMining Intelligent Insights: AI/ML for Financial Services
Mining Intelligent Insights: AI/ML for Financial ServicesAmazon Web Services LATAM
 
Pensi di essere pronto per i microservizi?
Pensi di essere pronto per i microservizi?Pensi di essere pronto per i microservizi?
Pensi di essere pronto per i microservizi?Amazon Web Services
 
Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Sum...
Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Sum...Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Sum...
Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Sum...Amazon Web Services Korea
 
DevSecOps: Instituting Cultural Transformation for Public Sector Organization...
DevSecOps: Instituting Cultural Transformation for Public Sector Organization...DevSecOps: Instituting Cultural Transformation for Public Sector Organization...
DevSecOps: Instituting Cultural Transformation for Public Sector Organization...Amazon Web Services
 
Leveraging AWS ML to Improve Forecasting and Governance
Leveraging AWS ML to Improve Forecasting and GovernanceLeveraging AWS ML to Improve Forecasting and Governance
Leveraging AWS ML to Improve Forecasting and GovernanceAmazon Web Services
 
Introducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksIntroducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksAmazon Web Services
 
From Strategy to Reality: Better Decisions With Data
From Strategy to Reality: Better Decisions With DataFrom Strategy to Reality: Better Decisions With Data
From Strategy to Reality: Better Decisions With DataAmazon Web Services
 
Leveraging Data Analytics in the Cloud to Support Data-Driven Decisions
Leveraging Data Analytics in the Cloud to Support Data-Driven DecisionsLeveraging Data Analytics in the Cloud to Support Data-Driven Decisions
Leveraging Data Analytics in the Cloud to Support Data-Driven DecisionsAmazon Web Services
 
AIM102-S_Cognizant_CognizantCognitive
AIM102-S_Cognizant_CognizantCognitiveAIM102-S_Cognizant_CognizantCognitive
AIM102-S_Cognizant_CognizantCognitivePhilipBasford
 
NLP in Healthcare to Predict Adverse Events with Amazon SageMaker (AIM346) - ...
NLP in Healthcare to Predict Adverse Events with Amazon SageMaker (AIM346) - ...NLP in Healthcare to Predict Adverse Events with Amazon SageMaker (AIM346) - ...
NLP in Healthcare to Predict Adverse Events with Amazon SageMaker (AIM346) - ...Amazon Web Services
 
Streamlining Application Development with AWS Service Catalog (DEV328) - AWS ...
Streamlining Application Development with AWS Service Catalog (DEV328) - AWS ...Streamlining Application Development with AWS Service Catalog (DEV328) - AWS ...
Streamlining Application Development with AWS Service Catalog (DEV328) - AWS ...Amazon Web Services
 

Similar to Machine learning for developers & data scientists with Amazon SageMaker - AIM302 - Chicago AWS Summit (20)

Build, train, and deploy ML models with Amazon SageMaker - AIM302 - New York ...
Build, train, and deploy ML models with Amazon SageMaker - AIM302 - New York ...Build, train, and deploy ML models with Amazon SageMaker - AIM302 - New York ...
Build, train, and deploy ML models with Amazon SageMaker - AIM302 - New York ...
 
The Zen of DataOps – AWS Lake Formation and the Data Supply Chain Pipeline
The Zen of DataOps – AWS Lake Formation and the Data Supply Chain PipelineThe Zen of DataOps – AWS Lake Formation and the Data Supply Chain Pipeline
The Zen of DataOps – AWS Lake Formation and the Data Supply Chain Pipeline
 
Two Approaches to Cloud Adoption and Organizational Transformation: Emory Uni...
Two Approaches to Cloud Adoption and Organizational Transformation: Emory Uni...Two Approaches to Cloud Adoption and Organizational Transformation: Emory Uni...
Two Approaches to Cloud Adoption and Organizational Transformation: Emory Uni...
 
How Do I Plan for Security, Risk and Compliance when Migrating to AWS?
How Do I Plan for Security, Risk and Compliance when Migrating to AWS?How Do I Plan for Security, Risk and Compliance when Migrating to AWS?
How Do I Plan for Security, Risk and Compliance when Migrating to AWS?
 
How to Wrangle Data for Machine Learning on AWS
 How to Wrangle Data for Machine Learning on AWS How to Wrangle Data for Machine Learning on AWS
How to Wrangle Data for Machine Learning on AWS
 
AI ﹑大數據媒體應用和利用機器學習與 AWS 媒體服務實現自動化內容生成
AI ﹑大數據媒體應用和利用機器學習與 AWS 媒體服務實現自動化內容生成AI ﹑大數據媒體應用和利用機器學習與 AWS 媒體服務實現自動化內容生成
AI ﹑大數據媒體應用和利用機器學習與 AWS 媒體服務實現自動化內容生成
 
Build sophisticated forecasting and recommendation models - AIM202 - New York...
Build sophisticated forecasting and recommendation models - AIM202 - New York...Build sophisticated forecasting and recommendation models - AIM202 - New York...
Build sophisticated forecasting and recommendation models - AIM202 - New York...
 
Mythbusting the Federal Cloud Journey
Mythbusting the Federal Cloud JourneyMythbusting the Federal Cloud Journey
Mythbusting the Federal Cloud Journey
 
Mining Intelligent Insights: AI/ML for Financial Services
Mining Intelligent Insights: AI/ML for Financial ServicesMining Intelligent Insights: AI/ML for Financial Services
Mining Intelligent Insights: AI/ML for Financial Services
 
Pensi di essere pronto per i microservizi?
Pensi di essere pronto per i microservizi?Pensi di essere pronto per i microservizi?
Pensi di essere pronto per i microservizi?
 
Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Sum...
Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Sum...Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Sum...
Datarobot, 자동화된 분석 적용 시 분석 절차의 변화 및 효용 - 홍운표 데이터 사이언티스트, DataRobot :: AWS Sum...
 
DevSecOps: Instituting Cultural Transformation for Public Sector Organization...
DevSecOps: Instituting Cultural Transformation for Public Sector Organization...DevSecOps: Instituting Cultural Transformation for Public Sector Organization...
DevSecOps: Instituting Cultural Transformation for Public Sector Organization...
 
Leveraging AWS ML to Improve Forecasting and Governance
Leveraging AWS ML to Improve Forecasting and GovernanceLeveraging AWS ML to Improve Forecasting and Governance
Leveraging AWS ML to Improve Forecasting and Governance
 
Introducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksIntroducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech Talks
 
From Strategy to Reality: Better Decisions With Data
From Strategy to Reality: Better Decisions With DataFrom Strategy to Reality: Better Decisions With Data
From Strategy to Reality: Better Decisions With Data
 
Introduction to Sagemaker
Introduction to SagemakerIntroduction to Sagemaker
Introduction to Sagemaker
 
Leveraging Data Analytics in the Cloud to Support Data-Driven Decisions
Leveraging Data Analytics in the Cloud to Support Data-Driven DecisionsLeveraging Data Analytics in the Cloud to Support Data-Driven Decisions
Leveraging Data Analytics in the Cloud to Support Data-Driven Decisions
 
AIM102-S_Cognizant_CognizantCognitive
AIM102-S_Cognizant_CognizantCognitiveAIM102-S_Cognizant_CognizantCognitive
AIM102-S_Cognizant_CognizantCognitive
 
NLP in Healthcare to Predict Adverse Events with Amazon SageMaker (AIM346) - ...
NLP in Healthcare to Predict Adverse Events with Amazon SageMaker (AIM346) - ...NLP in Healthcare to Predict Adverse Events with Amazon SageMaker (AIM346) - ...
NLP in Healthcare to Predict Adverse Events with Amazon SageMaker (AIM346) - ...
 
Streamlining Application Development with AWS Service Catalog (DEV328) - AWS ...
Streamlining Application Development with AWS Service Catalog (DEV328) - AWS ...Streamlining Application Development with AWS Service Catalog (DEV328) - AWS ...
Streamlining Application Development with AWS Service Catalog (DEV328) - AWS ...
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Machine learning for developers & data scientists with Amazon SageMaker - AIM302 - Chicago AWS Summit

  • 1. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Machine learning for developers & data scientists with Amazon SageMaker Vikrant Kahlir Enterprise solution architect Amazon Web Services A I M 3 0 2
  • 2. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Put machine learning in the hands of every developer Our mission at AWS Our mission at AWS Put machine learning in the hands of every developer
  • 3. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T The Amazon ML stack: Broadest and deepest set of capabilities AI services Easily add intelligence to applications without machine learning skills Vision | Documents | Speech | Language | Chatbots | Forecasting | Recommendations ML services Build, train, and deploy machine learning models quickly and easily Data labeling | Pre-built algorithms and notebooks | One-click training and deployment ML frameworks & infrastructure Flexibility and choice, highest-performing infrastructure Support for ML frameworks | Compute options purpose-built for ML
  • 4. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T 1 2 3 Amazon SageMaker: Build, train, and deploy ML models at scale Pre-built notebooks for common problems Amazon SageMaker Ground Truth Collect and prepare training data Choose and optimize your ML algorithm Built-in, high-performance algorithms AWS Marketplace for machine learning One-click training on the highest-performing infrastructure Set up and manage environments for training Amazon EC2 P3 instances Amazon SageMaker RL Train and tune models Model optimization Amazon SageMaker Neo Scale and manage the production environment Fully managed with auto-scaling for 75% less Amazon Elastic Inference One-click deployment Deploy models in production Deploy everywhere Flexibility and choice with modules for ML developers and data scientists
  • 5. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Problem statement Health care insurance fraud is a pressing problem, causing substantial and increasing costs in medical insurance programs. Due to large amounts of claims submitted, review of individual claims becomes a difficult task and encourages the employment of automated pre-payment controls and better post-payment decision support tools to enable subject matter expert analysis. We will demonstrate the application of unsupervised anomalous outlier techniques on a minimal set of metrics made available in the CMS Medicare inpatient claims from 2008.
  • 6. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Dataset CMS Medicare inpatient claims from 2008 • Medicare inpatient claims from 2008 • Each record is an inpatient claim incurred by a 5% sample of Medicare beneficiaries • Beneficiary identities are not provided • ZIP Codes of facilities where patient was treated are not provided • The file contains eight variables: one primary key and 7 analytic variables • Data dictionary required to interpret codes in dataset are provided
  • 7. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Data variables
  • 8. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T 1 2 3 End-to-end process for workshop Apply Word2Vec for feature extraction on procedure and diagnosis Feature engineering Train to identify principal components in training dataset Train with Amazon PCA algorithm Calculate anomaly score for test claims and analyze results Inference and analysis Deploy trained model Deploy model Download workshop resources Launch Amazon SageMaker Jupyter Notebook Workshop preparation Download data Clean data Preprocess data Data processing Objective: Flag healthcare claims as a targeting method for further investigation
  • 9. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Techniques and algorithms used in workshop • Outlier detection • Word embedding and Word2Vec • Principal component analysis
  • 10. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Outlier detection “Observation which deviates so much from other observations as to arouse suspicion it was generated by a different mechanism”   Douglas M. Hawkins • Modeling normal objects and outlier effectively. The border between data normality and abnormality (outliers) is often not clear-cut. • Outlier detection methods could be application specific. For example, in clinical data small deviation could be an outlier, but in a marketing application large deviation is required to justify an outlier. • Noise in data may be present as deviations in attribute values or even as missing values. Noise may hide an outlier or may flag deviation as an outlier. • Providing justification for an outlier from understandability point of view may be difficult. Challenges
  • 11. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Word embedding and Word2Vec Word embedding Word2Vec • Text converted into numbers, and there may be different numerical representations of the same text • Many techniques exist; Continuous Bag of Words (CBOW) and skip-gram are popular and effective for large corpus of documents • Example: CBOW predicts the probability of a word given a context • Shallow neural networks that map word(s) to the target variable, which is also a word or words • Learn weights, which act as word vector representations
  • 12. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Principal component analysis When • Do you want to reduce the number of variables, but aren’t able to identify variables to completely remove from consideration? • Do you want to ensure that your variables are independent of one another? • Are you comfortable making your independent variables less interpretable? How • A measure of how each variable is associated with one another (covariance matrix) • The directions in which our data are dispersed (eigenvectors) • The relative importance of these different directions (eigenvalues)
  • 13. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Complete the workshop http://bit.ly/2LX4sRf
  • 14. Thank you! S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Vikrant Kahlir Contact information