by Keith Steward, Solutions Architect, AWS
AI services on the AWS cloud bring deep learning technologies like natural language understanding, automatic speech recognition, computer vision, text-to-speech, and machine learning within reach of every developer. For more in-depth deep learning applications, the Deep Learning AMIs let you create managed, auto-scaling clusters of GPUs for large scale training, or run inference on trained models with compute-optimized or general-purpose CPU instances. Whether you’re just getting started with AI or you’re a deep learning expert, this session will provide a meaningful overview of how to improve scale and efficiency with the AWS Cloud. Level 200
3. An Introduction to the AI Services at AWS
Apache
Apache
MXNet
Deep learning framework
4. An Introduction to the AI Services at AWS
Apache
Amazon
Polly
Text-to-Speech
Apache
MXNet
Deep learning framework
5. An Introduction to the AI Services at AWS
Apache
Amazon
Polly
Text-to-Speech
Amazon
Rekognition
Computer Vision
Apache
MXNet
Deep learning framework
6. An Introduction to the AI Services at AWS
Apache
Amazon
Polly
Text-to-Speech
Amazon
Rekognition
Amazon
Lex
Computer Vision ASR & NLU
Apache
MXNet
Deep learning framework
7. An Introduction to the AI Services at AWS
Apache
MXNet
Apache
Deep learning framework
8. Apache MXNet
Programmable Portable High Performance
Near linear scaling
across hundreds of GPUs
Highly efficient
models for mobile
and IoT
Simple syntax,
multiple languages
9. Why Apache MXNet?
Most Open Best On AWS
Optimized for
deep learning on AWS
Accepted into the
Apache Incubator
(Integration with AWS)
10. Apache MXNet is the deep learning framework
of choice for AWS
11. P2, DL AMIs, AND DL TEMPLATE
P2 INSTANCES
Up to 40k
CUDA cores
DL CLOUD FORMATION
TEMPLATE
Deep learning
clusters
DL AMIS
Pre-configured for
deep learning
12. An Introduction to the AI Services at AWS
Amazon
Polly
Text-to-Speech
Apache
13. Amazon Polly: Life-like Text-to-Speech Service
Converts text
to life-like speech
47 voices 24 languages Low latency,
real time
Fully managed
15. Amazon Polly: Voice Quality & Pronunciation
1. Automatic, Accurate Text Processing
“Today in Seattle, WA, it’s 11°F”
‘"We live for the music" live from the Madison Square Garden.
16. Amazon Polly: Voice Quality & Pronunciation
1. Automatic, Accurate Text Processing
2. Intelligible and Easy to Understand
17. Amazon Polly: Voice Quality & Pronunciation
1. Automatic, Accurate Text Processing
2. Intelligible and Easy to Understand
3. Add Semantic Meaning to Text
“Richard’s number is 2122341237“
“Richard’s number is 2122341237“
Telephone Number
18. Amazon Polly: Voice Quality & Pronunciation
1. Automatic, Accurate Text Processing
2. Intelligible and Easy to Understand
3. Add Semantic Meaning to Text
4. Customized Pronunciation
“My daughter’s name is Kaja.”
“My daughter’s name is Kaja.”
19. Amazon Polly: Common Use Cases
• Internet of Things (smart home, connected devices)
• Education (language learning, training videos)
• Voiced Media (news, blogs, email)
• Voiced Chat Bots (Amazon Lex, Alexa skills)
• Gaming (avatars, Amazon Lumberyard)
#VoiceFirst Movement
20. An Introduction to the AI Services at AWS
Amazon
Rekognition
Computer Vision
Apache
21. Amazon Rekognition: Computer Vision Service
Object and Scene
Detection
Facial
Analysis
Facial
Comparison
Facial
Recognition
25. Amazon Rekognition: Facial Search
Facial
verification
Face
Search
Visual Similarity
Search
(compare two faces) (compare many faces) (find similar faces)
26. Amazon Rekognition: A few use cases
Best photo: use the attributes smile and eyesOpen to determine the best photos to post
Demographic detection: collect the age and gender of customers in your store
Sentiment capture: detect the emotions of your customers as they try your product
A/B tuning: identify visually similar alternatives to high-scoring images for A/B testing
Smart filtering: identify images with high visual similarity to ensure only one is displayed
Verify face: compare two faces, receive a confidence score that they are the same person
Protected images: identify visually similar images that are protected by trademarks
31. Amazon Lex ... for Conversational Interactions
Powered by the same deep learning technology as Alexa
Enterprise SaaS Connectors
Deployment to chat platforms, like Slack, Facebook
Messenger, Twilio SMS
Build Voice and Text Chatbots
Interactions on mobile, web, and devices
34. Amazon Lex Use Cases
Informational Bots
Chatbots for everyday consumer requests
Application Bots
Build powerful interfaces to mobile applications
• News updates
• Weather information
• Game scores ….
• Book tickets
• Order food
• Manage bank accounts ….
Enterprise Productivity Bots
Streamline enterprise work activities and improve efficiencies
• Check sales numbers
• Marketing performance
• Inventory status ….
Internet of Things (IoT) Bots
Enable conversational interfaces for device interactions
• Wearables
• Appliances
• Auto ….