2. • Technical Evangelist, Developer Advocate,
… Software Engineer
• Own bed in Finland
• Previously:
• Solutions Architect @AWS
• Lead Cloud Architect @Dreambroker
• Director of Engineering, Software Engineer, DevOps, Manager, ... @Hdm
• Researcher @Nokia Research Center
• and a bunch of other stuff.
• Climber, likes ginger shots.
3. What to Expect from the Session
1. A little bit of history & theory never kills
2. AI in AWS
3. Building AI-powered apps x3
5. Serverless means…
• No servers to provision or manage
• Scales with usage
• Never pay for idle
• Availability and fault tolerance built in
16. Text In, Life-like Speech Out
Amazon Polly
“Today in Seattle, WA it’s 11°F”
“Today in Seattle Washington it’s 11 degrees Fahrenheit”
47 lifelike voices spread across 24 languages
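The expansion shown on this slide (abbreviation and unit symbols turned into speakable words) can be illustrated with a toy normalizer. This is only a sketch of the idea, not how Amazon Polly works internally; the lookup tables and the `normalize` function are hypothetical names made up for this example.

```python
import re

# Tiny illustrative lookup tables (assumptions for this sketch);
# a real text-to-speech front end covers far more cases.
STATE_NAMES = {"WA": "Washington", "NY": "New York", "CA": "California"}
UNIT_NAMES = {"F": "Fahrenheit", "C": "Celsius"}


def normalize(text: str) -> str:
    """Expand 'City, ST' abbreviations and temperatures like '11°F'."""

    def expand_state(match: re.Match) -> str:
        # "Seattle, WA" -> "Seattle Washington"
        return f"{match.group(1)} {STATE_NAMES[match.group(2)]}"

    def expand_temperature(match: re.Match) -> str:
        # "11°F" -> "11 degrees Fahrenheit"
        return f"{match.group(1)} degrees {UNIT_NAMES[match.group(2)]}"

    state_pattern = r"(\w+), (" + "|".join(STATE_NAMES) + r")\b"
    text = re.sub(state_pattern, expand_state, text)
    text = re.sub(r"(-?\d+)°([FC])\b", expand_temperature, text)
    return text


print(normalize("Today in Seattle, WA it's 11°F"))
```

The sketch handles only the two patterns on the slide; anything else passes through unchanged.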
17. A Focus On Voice Quality & Pronunciation
1. Automatic, Accurate Text Processing
“Today in Seattle, WA, it’s 11°F”
‘"We live for the music" live from the Madison Square Garden.’
18. A Focus On Voice Quality & Pronunciation
1. Automatic, Accurate Text Processing
2. Intelligible and Easy to Understand
19. A Focus On Voice Quality & Pronunciation
1. Automatic, Accurate Text Processing
2. Intelligible and Easy to Understand
3. Add Semantic Meaning to Text
“Richard’s number is 2122341237”
“Richard’s number is 2122341237” (Telephone Number)
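Marking the digits as a telephone number is done with SSML. A minimal sketch, assuming the `say-as` tag with `interpret-as="telephone"` and a hypothetical helper called `with_telephone`:

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape


def with_telephone(sentence: str, number: str) -> str:
    """Wrap the first occurrence of `number` in a say-as telephone tag."""
    if number not in sentence:
        raise ValueError("number not found in sentence")
    before, after = sentence.split(number, 1)
    return (
        f"<speak>{escape(before)}"
        f'<say-as interpret-as="telephone">{escape(number)}</say-as>'
        f"{escape(after)}</speak>"
    )


ssml = with_telephone("Richard's number is 2122341237", "2122341237")
ET.fromstring(ssml)  # raises ParseError if the markup is not well-formed
print(ssml)
```

The resulting string would be sent with the text type set to SSML rather than plain text.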
20. A Focus On Voice Quality & Pronunciation
1. Automatic, Accurate Text Processing
2. Intelligible and Easy to Understand
3. Add Semantic Meaning to Text
4. Customized Pronunciation
“My daughter’s name is Kaja.”
“My daughter’s name is Kaja.”
21. A Focus On Voice Quality & Pronunciation
https://www.w3.org/TR/speech-synthesis/
<speak>
The spelling of my last name is
<prosody rate='x-slow'>
<say-as interpret-as="characters">Adrian</say-as>
</prosody>
</speak>
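SSML like the snippet above is easy to get wrong by hand, so it can help to build it in code and check that it parses. A small sketch; `spell_slowly` and `VALID_RATES` are hypothetical names, and the rate list is the set of named prosody rates from the W3C specification.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape, quoteattr

# Named speaking rates defined for the prosody element.
VALID_RATES = {"x-slow", "slow", "medium", "fast", "x-fast"}


def spell_slowly(lead_in: str, word: str, rate: str = "x-slow") -> str:
    """Return SSML that reads `lead_in`, then spells `word` letter by letter."""
    if rate not in VALID_RATES:
        raise ValueError(f"unsupported rate: {rate}")
    return (
        "<speak>"
        f"{escape(lead_in)} "
        f"<prosody rate={quoteattr(rate)}>"
        f'<say-as interpret-as="characters">{escape(word)}</say-as>'
        "</prosody>"
        "</speak>"
    )


ssml = spell_slowly("The spelling of my last name is", "Adrian")
ET.fromstring(ssml)  # raises ParseError if the markup is not well-formed
print(ssml)
```

Escaping the text parts keeps characters such as `&` or `<` in the input from breaking the document.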
22. Duolingo voices its language learning service using Polly
Duolingo is a free language learning service where
users help translate the web and rate translations.
“With Amazon Polly our users benefit from the most lifelike Text-to-Speech voices available on the market.”
Severin Hacker, CTO, Duolingo
• Spoken language crucial for language learning
• Accurate pronunciation matters
• Faster iteration thanks to TTS
• As good as natural human speech
24. <API>
Amazon Polly
</API>
aws polly synthesize-speech \
  --text "It was nice to live such a wonderful live show" \
  --output-format mp3 \
  --voice-id Joanna \
  --text-type text \
  johanna.mp3
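The same request can be made from Python. A minimal sketch, assuming boto3 is installed and AWS credentials are configured; `build_request` and `synthesize_to_file` are hypothetical helper names, while `synthesize_speech` and its `Text`, `VoiceId`, `OutputFormat` and `TextType` parameters are the boto3 Polly client call.

```python
def build_request(text, voice_id="Joanna", output_format="mp3", text_type="text"):
    """Map the CLI flags above onto synthesize_speech keyword arguments."""
    return {
        "Text": text,
        "VoiceId": voice_id,
        "OutputFormat": output_format,
        "TextType": text_type,
    }


def synthesize_to_file(text, path, client=None):
    """Call Polly and write the returned audio stream to `path`."""
    if client is None:
        # Imported lazily so build_request can be used without boto3 installed.
        import boto3
        client = boto3.client("polly")
    response = client.synthesize_speech(**build_request(text))
    with open(path, "wb") as audio_file:
        audio_file.write(response["AudioStream"].read())
    return path


# Usage (needs AWS credentials, so it is left commented out):
# synthesize_to_file("It was nice to live such a wonderful live show", "johanna.mp3")
```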
28. Amazon Rekognition
Deep learning-based image recognition service
Search, verify, and organize millions of images
• Object and Scene Detection
• Facial Analysis
• Face Comparison
• Facial Recognition
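For object and scene detection, a request and a response filter might look like the sketch below. The helper names, bucket, key and the sample response are made up for illustration; the request shape follows the boto3 Rekognition `detect_labels` call (`Image`, `MaxLabels`, `MinConfidence`), whose response carries a `Labels` list with `Name` and `Confidence`.

```python
def detect_labels_request(bucket, key, max_labels=10, min_confidence=80.0):
    """Build keyword arguments for a detect_labels call on an image in S3."""
    return {
        "Image": {"S3Object": {"Bucket": bucket, "Name": key}},
        "MaxLabels": max_labels,
        "MinConfidence": min_confidence,
    }


def label_names(response, min_confidence=80.0):
    """Return label names at or above the confidence threshold."""
    return [
        label["Name"]
        for label in response.get("Labels", [])
        if label["Confidence"] >= min_confidence
    ]


# Hand-written sample response (not real service output), for illustration only.
sample_response = {
    "Labels": [
        {"Name": "Mountain", "Confidence": 97.1},
        {"Name": "Person", "Confidence": 91.4},
        {"Name": "Dog", "Confidence": 42.0},
    ]
}
print(label_names(sample_response))

# With boto3 and credentials in place, the call itself would be:
# boto3.client("rekognition").detect_labels(**detect_labels_request("my-bucket", "photo.jpg"))
```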
36. Amazon Rekognition
Customers
• Digital Asset Management
• Media and Entertainment
• Travel and Hospitality
• Influencer Marketing
• Systems Integration
• Digital Advertising
• Consumer Storage
• Law Enforcement
• Public Safety
• eCommerce
• Education
40. Cognito support for Identity
[Diagram labels: Sign In (Username, Password), SAML Identity Provider, Amazon Cognito, Cognito User Pools, 2. Get AWS credentials, API Gateway, Lambda, DynamoDB, S3, Rekognition, Polly]
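The "Get AWS credentials" step can be sketched as follows, assuming the user has already signed in through a Cognito user pool and holds an ID token. The function names are hypothetical; `get_id` and `get_credentials_for_identity` are calls on the boto3 `cognito-identity` client, and the provider key format is the one used for user pools.

```python
def user_pool_provider(region, user_pool_id):
    """Key used in the Logins map for a Cognito user pool."""
    return f"cognito-idp.{region}.amazonaws.com/{user_pool_id}"


def get_aws_credentials(identity_client, identity_pool_id, provider, id_token):
    """Exchange a sign-in token for temporary AWS credentials."""
    logins = {provider: id_token}
    identity_id = identity_client.get_id(
        IdentityPoolId=identity_pool_id, Logins=logins
    )["IdentityId"]
    response = identity_client.get_credentials_for_identity(
        IdentityId=identity_id, Logins=logins
    )
    # Contains AccessKeyId, SecretKey, SessionToken and Expiration.
    return response["Credentials"]


# Usage (needs boto3, a real identity pool and a real token, so commented out):
# import boto3
# client = boto3.client("cognito-identity")
# creds = get_aws_credentials(client, "REGION:POOL-GUID",
#                             user_pool_provider("REGION", "USER_POOL_ID"), id_token)
```

The temporary credentials are then what the app uses to call API Gateway, S3, Rekognition or Polly.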
45. AWS Step Functions
Parallel Steps
[Diagram labels: Start, Select image converter, RAW to JPEG, RAW to PNG, RAW to TIFF, Unsupported image type, Load in Database, End]
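One way to express the image-converter flow in Amazon States Language, assuming exactly one converter runs per image, is a Choice state. This is a sketch only: the state names, the `$.targetFormat` input field, the placeholder Lambda ARNs, and the decision to end unsupported types in a Fail state are all assumptions, not taken from the slide.

```python
import json


def task(function_name, next_state=None):
    """Task state that invokes a (placeholder) Lambda function."""
    state = {
        "Type": "Task",
        "Resource": f"arn:aws:lambda:REGION:ACCOUNT_ID:function:{function_name}",
    }
    if next_state:
        state["Next"] = next_state
    else:
        state["End"] = True
    return state


choice_definition = {
    "StartAt": "SelectImageConverter",
    "States": {
        "SelectImageConverter": {
            "Type": "Choice",
            "Choices": [
                {"Variable": "$.targetFormat", "StringEquals": "JPEG", "Next": "RawToJpeg"},
                {"Variable": "$.targetFormat", "StringEquals": "PNG", "Next": "RawToPng"},
                {"Variable": "$.targetFormat", "StringEquals": "TIFF", "Next": "RawToTiff"},
            ],
            "Default": "UnsupportedImageType",
        },
        "RawToJpeg": task("raw-to-jpeg", "LoadInDatabase"),
        "RawToPng": task("raw-to-png", "LoadInDatabase"),
        "RawToTiff": task("raw-to-tiff", "LoadInDatabase"),
        "UnsupportedImageType": {
            "Type": "Fail",
            "Error": "UnsupportedImageType",
            "Cause": "No converter is available for the requested format.",
        },
        "LoadInDatabase": task("load-in-database"),
    },
}

print(json.dumps(choice_definition, indent=2))
```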
46. AWS Step Functions
Branching Steps
[Diagram labels: Start, Process photo, Extract metadata, Resize image, Facial recognition, Load in Database, End]
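The photo-processing flow could be written in Amazon States Language with a Parallel state, assuming the three processing steps run side by side before the database load. State names and Lambda ARNs below are placeholders invented for this sketch.

```python
import json


def single_task_branch(state_name, function_name):
    """A branch made of one Task state that invokes a (placeholder) Lambda."""
    return {
        "StartAt": state_name,
        "States": {
            state_name: {
                "Type": "Task",
                "Resource": f"arn:aws:lambda:REGION:ACCOUNT_ID:function:{function_name}",
                "End": True,
            }
        },
    }


parallel_definition = {
    "StartAt": "ProcessPhoto",
    "States": {
        "ProcessPhoto": {
            "Type": "Parallel",
            "Branches": [
                single_task_branch("ExtractMetadata", "extract-metadata"),
                single_task_branch("ResizeImage", "resize-image"),
                single_task_branch("FacialRecognition", "facial-recognition"),
            ],
            # Runs after every branch has finished.
            "Next": "LoadInDatabase",
        },
        "LoadInDatabase": {
            "Type": "Task",
            "Resource": "arn:aws:lambda:REGION:ACCOUNT_ID:function:load-in-database",
            "End": True,
        },
    },
}

print(json.dumps(parallel_definition, indent=2))
```

The next state receives the branch outputs as a list, one entry per branch.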