1 / 78
Multimodal
Voice Development
with Amazon Alexa
2 / 78
Ralf Eggert
CEO Travello GmbH
Alexa Skill Developer
Advisory Board Voice Con
Alexa Champion
3 / 78
Portfolio (excerpt)
4 / 78
Three questions
5 / 78
Question 1:
Who owns an Echo
device with a display?
6 / 78
Question 2:
Who has developed a
Multimodal Skill for Alexa?
7 / 78
Question 3:
Who has an APL based
Multimodal Skill for Alexa
live?
8 / 78
A definition of multimodal
9 / 78
What is this
multimodal stuff
anyway?
10 / 78
Multimodal
Multimodality describes
communication practices in
terms of the textual, aural,
linguistic, spatial, and visual
resources - or modes - used to
compose messages.
Murray, Joddy (2013) / Wikipedia
11 / 78
Textual
Multimodality describes
communication practices in
terms of the textual, aural,
linguistic, spatial, and visual
resources - or modes - used to
compose messages.
Murray, Joddy (2013) / Wikipedia
12 / 78
Aural
Multimodality describes
communication practices in
terms of the textual, aural,
linguistic, spatial, and visual
resources - or modes - used to
compose messages.
Murray, Joddy (2013) / Wikipedia
13 / 78
Linguistic
Multimodality describes
communication practices in
terms of the textual, aural,
linguistic, spatial, and visual
resources - or modes - used to
compose messages.
Murray, Joddy (2013) / Wikipedia
14 / 78
Spatial
Multimodality describes
communication practices in
terms of the textual, aural,
linguistic, spatial, and visual
resources - or modes - used to
compose messages.
Murray, Joddy (2013) / Wikipedia
15 / 78
Visual
Multimodality describes
communication practices in
terms of the textual, aural,
linguistic, spatial, and visual
resources - or modes - used to
compose messages.
Murray, Joddy (2013) / Wikipedia
16 / 78
Let's focus on
Textual, aural, linguistic, visual
17 / 78
Interlude
18 / 78
OTIS REM
Who knows this smart
voice assistant?
Is it multimodal?
19 / 78
Alexa & Multimodal
20 / 78
OK!
How about Alexa?
21 / 78
AWS Lambda /
HTTPS Endpoint
Server
Alexa Voice Service


Headless
22 / 78
Cards
Support headless Echo
devices with visuals.
Are they multimodal?
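A card travels in the same response as the speech output. A minimal sketch in Python, assuming a custom skill that returns the response JSON directly (for example from AWS Lambda); the function name `build_card_response` and the sample texts are illustrative and not from the talk:

```python
import json


def build_card_response(speech_text, card_title, card_content):
    """Build a skill response with spoken output plus a Simple card."""
    return {
        "version": "1.0",
        "response": {
            "outputSpeech": {"type": "PlainText", "text": speech_text},
            # The card is the only visual part of this response.
            "card": {
                "type": "Simple",
                "title": card_title,
                "content": card_content,
            },
            "shouldEndSession": True,
        },
    }


if __name__ == "__main__":
    response = build_card_response("Hello", "Greeting", "Hello from the skill")
    print(json.dumps(response, indent=2))
```

The speech and the card carry the same message in two modes, which is what makes the question on this slide worth asking.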
23 / 78
Echo Buttons
Another input medium.
Are they multimodal?
What do you think?
24 / 78
AWS Lambda /
HTTPS Endpoint
Server
Alexa Voice Service

Multimodal
25 / 78
Headless vs. Multimodal
Headless: Echo Dot, Echo
Multimodal: Echo Show, Echo Spot
26 / 78
Display devices
Only 5.9 % of Alexa users in
the US own an Echo device
with a display
Voicebot.ai, June 24, 2018, see https://goo.gl/7WSkjD
27 / 78
More numbers
56.2 % own Echo Spot
25.0 % own Echo Show
18.7 % own both devices
Voicebot.ai, June 24, 2018, see https://goo.gl/7WSkjD
28 / 78
Mind the trap!
Many Alexa Skills focus on
Echo Show rather than on
Echo Spot.
Yet more than twice as many
Echo Spot as Echo Show
devices were sold.
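The ratio behind this warning can be checked with a little arithmetic. The sketch below assumes that the three Voicebot.ai percentages are shares among owners of a display device, which the slides do not state explicitly; all names are illustrative.

```python
# Figures quoted on the Voicebot.ai slides (June 2018).
DISPLAY_OWNER_SHARE = 5.9  # % of US Alexa users who own a display device
SPOT = 56.2                # % listed as "own Echo Spot"
SHOW = 25.0                # % listed as "own Echo Show"
BOTH = 18.7                # % listed as "own both devices"


def spot_to_show_ratio(spot=SPOT, show=SHOW):
    """Return the number of Echo Spot owners per Echo Show owner."""
    return spot / show


def share_of_all_users(percent_of_display_owners, display_share=DISPLAY_OWNER_SHARE):
    """Convert a share among display owners into a share of all Alexa users.

    Assumption: the input percentage is relative to display-device owners.
    """
    return display_share * percent_of_display_owners / 100


if __name__ == "__main__":
    print(f"Spot per Show: {spot_to_show_ratio():.2f}")
    print(f"Users with a Spot (incl. both): {share_of_all_users(SPOT + BOTH):.2f} %")
```

Under that assumption, a layout that only works on the Echo Show misses the larger group of display owners.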
29 / 78
History: display interface
30 / 78
Echo Show
1st generation Echo Show
was introduced in May 2017.
31 / 78
Echo Spot
1st generation Echo Spot
was introduced in
September 2017.
32 / 78
Display Interface
Support for displays added to
Alexa developer console
33 / 78
Reference
Learn how to build
multimodal Alexa Skills
with the Display
Interface Reference
https://goo.gl/eeSQah
34 / 78
Display Interface Templates
35 / 78
Body Template 1
Echo Show Echo Spot
36 / 78
Body Template 2
Echo Show Echo Spot
37 / 78
Body Template 3
Echo Show Echo Spot
38 / 78
Body Template 6
Echo Show Echo Spot
39 / 78
Body Template 7
Echo Show Echo Spot
40 / 78
List Template 1
Echo Show Echo Spot
41 / 78
List Template 2
Echo Show Echo Spot
42 / 78
Wait!
What happened to
Body Templates
4 and 5?
Does anyone know?
43 / 78
Template Docs
Learn to choose the best
template for your needs.
https://goo.gl/pp4STX
44 / 78
{
  "version": "1.0",
  "session": {},
  "context": {
    "Display": {},
    "System": {
      "device": {
        "deviceId": "...",
        "supportedInterfaces": {
          "Display": {
            "templateVersion": "1.0",
            "markupVersion": "1.0"
          }
        }
      }
    }
  },
  "request": {}
}
Alexa Request
Very simplified
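A skill can read `supportedInterfaces` from the request to find out whether the device has a display. A minimal sketch in Python, working on the request as a plain dict; the helper name `supports_display` and the sample envelopes are illustrative:

```python
def supports_display(request_envelope):
    """Return True if the requesting device reports the Display interface."""
    interfaces = (
        request_envelope.get("context", {})
        .get("System", {})
        .get("device", {})
        .get("supportedInterfaces", {})
    )
    return "Display" in interfaces


if __name__ == "__main__":
    with_display = {
        "context": {
            "System": {
                "device": {
                    "deviceId": "...",
                    "supportedInterfaces": {
                        "Display": {"templateVersion": "1.0", "markupVersion": "1.0"}
                    },
                }
            }
        }
    }
    headless = {
        "context": {"System": {"device": {"deviceId": "...", "supportedInterfaces": {}}}}
    }
    print(supports_display(with_display), supports_display(headless))
```

Chaining `.get()` with empty defaults keeps the check from raising when a headless device omits parts of the structure.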
45 / 78
{
  "body": {
    "version": "1.0",
    "response": {
      "outputSpeech": {},
      "card": {},
      "directives": [
        {
          "type": "Display.RenderTemplate",
          "template": {}
        },
        {
          "type": "Hint",
          "hint": {}
        }
      ],
      "reprompt": {},
      "shouldEndSession": false
    }
  }
}
Alexa Response
Also very simplified
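The display directives should only be sent when there is something to render. A minimal sketch in Python that builds the content of `body` and adds the `Display.RenderTemplate` and `Hint` directives only when a template is passed in; the function name `build_response` and its parameters are illustrative:

```python
def build_response(speech_text, template=None, hint_text=None):
    """Build a response; add display directives only if a template is given."""
    response = {
        "outputSpeech": {"type": "PlainText", "text": speech_text},
        "shouldEndSession": False,
    }
    directives = []
    if template is not None:
        directives.append({"type": "Display.RenderTemplate", "template": template})
        if hint_text:
            # A hint is shown on screen, so it only makes sense with a template.
            directives.append(
                {"type": "Hint", "hint": {"type": "PlainText", "text": hint_text}}
            )
    if directives:
        response["directives"] = directives
    return {"version": "1.0", "response": response}


if __name__ == "__main__":
    voice_only = build_response("Hello")
    with_screen = build_response(
        "Hello", template={"type": "BodyTemplate1"}, hint_text="what's new"
    )
    print("directives" in voice_only["response"], len(with_screen["response"]["directives"]))
```

Passing `template=None` for headless devices keeps a single code path for both device classes.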
46 / 78
{
  "type": "BodyTemplate6",
  "token": "launch",
  "backgroundImage": {
    "contentDescription": "Hello",
    "sources": [
      {
        "url": "https://host.com/image.jpg",
        "widthPixels": 0,
        "heightPixels": 0
      }
    ]
  },
  "textContent": {
    "primaryText": {
      "type": "RichText",
      "text": "<font size=\"7\">Hello</font>"
    }
  },
  "backButton": "HIDDEN"
}
RenderTemplate
Simplified as well
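Building the template as a dict and serialising it with `json.dumps` takes care of escaping the quotes inside the RichText markup. A minimal sketch in Python; the function name `build_body_template6` and its parameters are illustrative, and the optional pixel sizes are left out:

```python
import json


def build_body_template6(token, rich_text, image_url, image_description):
    """Build a BodyTemplate6 payload for a Display.RenderTemplate directive."""
    return {
        "type": "BodyTemplate6",
        "token": token,
        "backButton": "HIDDEN",
        "backgroundImage": {
            "contentDescription": image_description,
            "sources": [{"url": image_url}],
        },
        "textContent": {
            "primaryText": {"type": "RichText", "text": rich_text},
        },
    }


if __name__ == "__main__":
    template = build_body_template6(
        token="launch",
        rich_text='<font size="7">Hello</font>',
        image_url="https://host.com/image.jpg",
        image_description="Hello",
    )
    # json.dumps escapes the double quotes inside the markup.
    print(json.dumps(template, indent=2))
```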
47 / 78
Cons of Display
Interface
Only 7 templates
Distinction Show vs. Spot
Templates on Spot suck
Static layouts
Hard to test
48 / 78
Alexa Presentation Language
49 / 78
Alexa
Presentation
Language
For the win
50 / 78
APL schedule
Private beta June 2018
Public beta October 2018
General availability soon
51 / 78
Echo Show
2nd generation Echo Show
was presented in September
2018.
52 / 78
Alexa
Presentation
Language
Support for APL added to
Alexa developer console
53 / 78
Reference
Learn how to get started with
the Alexa Presentation
Language
https://goo.gl/Zw9m62
54 / 78
Authoring Tool
Design your own templates
https://goo.gl/jbC535
55 / 78
Authoring Tool
Pick from the predefined
samples, start from scratch
or upload your code
56 / 78
APL terminology
Document, package,
layout, resources, styles,
and components
https://goo.gl/ufq6SH
57 / 78
APL document
JSON file containing list of
packages, resources, layouts,
and styles.
Works like a container and is
sent to the device.
58 / 78
APL package
Packages contain APL
documents and images.
Can easily be reused and are
cached on the device.
59 / 78
APL layout
Hierarchical set of components
for rendering on the display.
Can contain text, images,
scrolling regions and even
other layouts.
Can be used to build libraries.
60 / 78
APL resources
Defined constants to be used
for drawing text or images on
the screen.
For example font sizes,
colours or spacing.
61 / 78
APL styles
Collection of grouped
resources to build a style.
Defines size, background
colour, text colour, borders,
etc.
62 / 78
APL components
Components are primitive
types to be added to a layout.
Examples are containers,
text, images, sequences,
scroll views, or touch
wrappers
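The terms above map onto the sections of one APL document. A minimal sketch as a Python dict; the resource, style and layout names (`accentColor`, `headlineSize`, `headlineStyle`, `Headline`) and the data path `payload.greeting.text` are made up for illustration, and the document has not been validated on a device:

```python
import json

APL_DOCUMENT = {
    "type": "APL",
    "version": "1.0",
    # Packages: reusable documents imported by name and version.
    "import": [{"name": "alexa-layouts", "version": "1.0.0"}],
    # Resources: named constants such as colours and dimensions.
    "resources": [
        {
            "colors": {"accentColor": "#00CAFF"},
            "dimensions": {"headlineSize": "40dp"},
        }
    ],
    # Styles: grouped property values that reference resources with "@".
    "styles": {
        "headlineStyle": {
            "values": [{"color": "@accentColor", "fontSize": "@headlineSize"}]
        }
    },
    # Layouts: named, parameterised hierarchies of components.
    "layouts": {
        "Headline": {
            "parameters": ["title"],
            "items": [{"type": "Text", "style": "headlineStyle", "text": "${title}"}],
        }
    },
    # mainTemplate: the entry point, built from components and layouts.
    "mainTemplate": {
        "parameters": ["payload"],
        "items": [
            {
                "type": "Container",
                "items": [{"type": "Headline", "title": "${payload.greeting.text}"}],
            }
        ],
    },
}

if __name__ == "__main__":
    print(json.dumps(APL_DOCUMENT, indent=2))
```

Read top to bottom, the style refers to the resources, the layout uses the style, and `mainTemplate` uses the layout like a component.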
63 / 78
{
  "version": "1.0",
  "session": {},
  "context": {
    "Display": {},
    "System": {
      "device": {
        "deviceId": "...",
        "supportedInterfaces": {
          "Alexa.Presentation.APL": {
            "runtime": {
              "maxVersion": "1.0"
            }
          }
        }
      }
    }
  },
  "request": {}
}
Alexa Request
With active APL support
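With two display technologies in play, a skill has to pick one per request. A minimal sketch in Python that prefers APL, falls back to the Display interface and otherwise stays voice-only; the function name `choose_visual_mode` and its return values are illustrative:

```python
def _supported_interfaces(request_envelope):
    """Return the supportedInterfaces dict, or an empty dict if it is missing."""
    return (
        request_envelope.get("context", {})
        .get("System", {})
        .get("device", {})
        .get("supportedInterfaces", {})
    )


def choose_visual_mode(request_envelope):
    """Return "apl", "display" or "voice" depending on what the device reports."""
    interfaces = _supported_interfaces(request_envelope)
    if "Alexa.Presentation.APL" in interfaces:
        return "apl"
    if "Display" in interfaces:
        return "display"
    return "voice"


if __name__ == "__main__":
    apl_device = {
        "context": {
            "System": {
                "device": {
                    "supportedInterfaces": {
                        "Alexa.Presentation.APL": {"runtime": {"maxVersion": "1.0"}}
                    }
                }
            }
        }
    }
    print(choose_visual_mode(apl_device), choose_visual_mode({}))
```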
64 / 78
{
  "body": {
    "version": "1.0",
    "response": {
      "outputSpeech": {},
      "card": {},
      "directives": [
        {
          "type": "Alexa.Presentation.APL.RenderDocument",
          "version": "1.0",
          "document": {},
          "datasources": {}
        }
      ],
      "reprompt": {},
      "shouldEndSession": false
    }
  }
}
Alexa Response
Also very simplified
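The directive pairs the document (the layout) with the data it displays. A minimal sketch in Python; the function name `build_apl_directive` and the sample data are illustrative. Two details follow Amazon's directive reference rather than the slide: the data key is spelled `datasources`, and the directive carries a `token`.

```python
def build_apl_directive(document, datasources, token):
    """Wrap an APL document and its data in a RenderDocument directive."""
    return {
        "type": "Alexa.Presentation.APL.RenderDocument",
        "token": token,
        "document": document,
        "datasources": datasources,
    }


def build_apl_response(speech_text, document, datasources, token="launch"):
    """Build a response that speaks and renders an APL document."""
    return {
        "version": "1.0",
        "response": {
            "outputSpeech": {"type": "PlainText", "text": speech_text},
            "directives": [build_apl_directive(document, datasources, token)],
            "shouldEndSession": False,
        },
    }


if __name__ == "__main__":
    document = {"type": "APL", "version": "1.0", "mainTemplate": {"items": []}}
    data = {"greeting": {"text": "Hello"}}
    response = build_apl_response("Hello", document, data)
    print(response["response"]["directives"][0]["type"])
```

Keeping the data outside the document is what lets one layout be reused with different content.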
65 / 78
{
  "type": "APL",
  "version": "1.0",
  "import": [],
  "resources": [],
  "styles": {},
  "layouts": {},
  "mainTemplate": {}
}
APL Document
Just the structure
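A quick structural check catches a missing section before the document reaches a device. A minimal sketch in Python; the helper names are illustrative, and treating `type`, `version` and `mainTemplate` as the required keys is an assumption of this sketch, not a statement from the talk:

```python
# Assumption: these top-level keys must be present in every APL document.
REQUIRED_KEYS = ("type", "version", "mainTemplate")


def new_apl_document(version="1.0"):
    """Return an empty APL document with the structure shown on the slide."""
    return {
        "type": "APL",
        "version": version,
        "import": [],
        "resources": [],
        "styles": {},
        "layouts": {},
        "mainTemplate": {},
    }


def missing_keys(document):
    """Return the required top-level keys that the document lacks."""
    return [key for key in REQUIRED_KEYS if key not in document]


if __name__ == "__main__":
    print(missing_keys(new_apl_document()))
    print(missing_keys({"type": "APL"}))
```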
66 / 78
Authoring tool hands-on
67 / 78
Pros of APL
Start with samples
Build from scratch
Build reusable packages
Authoring tool
Easy to test
Split display design from code
68 / 78
Cons of APL
Response is more complex
No saving option in the
authoring tool
Not fully rolled out yet
No graphical editor yet
69 / 78
SDKs / tools with APL support
Third-party tools
Official SDKs
December 2018
January 2019
Alexa.NET
70 / 78
Compare
display interface and APL
71 / 78
Display interface start screen
Thanks to Daniel Mittendorf
72 / 78
APL start screen
Thanks to Daniel Mittendorf
73 / 78
Display interface list screen
Thanks to Daniel Mittendorf
74 / 78
APL list screen
Thanks to Daniel Mittendorf
75 / 78
Conclusion
76 / 78
Start with APL
Public beta phase
You could start today
Consider the number of
display devices sold!
77 / 78
Need more
motivation?
Alexa Skills Challenge
$150K in total prizes
Bonus prize for Germany
Enter until the 22nd of January 2019
https://goo.gl/EETRu5
78 / 78
Any questions?
ralf@travello.audio
https://www.travello.audio

More Related Content

Similar to Multi-Modal Voice Development with Amazon Alexa

Building appsinsilverlight4 part_1
Building appsinsilverlight4 part_1Building appsinsilverlight4 part_1
Building appsinsilverlight4 part_1
Dennis Perlot
 
Web Accessbility
Web AccessbilityWeb Accessbility
Web Accessbility
Hend Al-Khalifa
 
Scalding Big (Ad)ta
Scalding Big (Ad)taScalding Big (Ad)ta
Scalding Big (Ad)ta
b0ris_1
 
Confessions of-a-gadget-holic
Confessions of-a-gadget-holicConfessions of-a-gadget-holic
Confessions of-a-gadget-holic
Tyrell Perera
 
Cross Device Accessibility
Cross Device AccessibilityCross Device Accessibility
Cross Device Accessibility
Chris Mills
 
BarCamp KL H20 Open Social Hackathon
BarCamp KL H20 Open Social HackathonBarCamp KL H20 Open Social Hackathon
BarCamp KL H20 Open Social Hackathon
marvin337
 
OSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications databaseOSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications database
Open Science Fair
 
Accessibility Enterprise
Accessibility EnterpriseAccessibility Enterprise
Accessibility Enterprise
Jacques
 
Mashup Y! widget
Mashup Y! widgetMashup Y! widget
Mashup Y! widget
phornthep khongsathian
 
Creating Interactive Olap Applications With My Sql Enterprise And Mondrian Pr...
Creating Interactive Olap Applications With My Sql Enterprise And Mondrian Pr...Creating Interactive Olap Applications With My Sql Enterprise And Mondrian Pr...
Creating Interactive Olap Applications With My Sql Enterprise And Mondrian Pr...
Indus Khaitan
 
PPT Companion to Android
PPT Companion to AndroidPPT Companion to Android
PPT Companion to Android
Dharani Kumar Madduri
 
Choosing the right software for your research study : an overview of leading ...
Choosing the right software for your research study : an overview of leading ...Choosing the right software for your research study : an overview of leading ...
Choosing the right software for your research study : an overview of leading ...
Merlien Institute
 
Interoperable Ajax Tools And Mashups Ferraiolo
Interoperable Ajax Tools And Mashups FerraioloInteroperable Ajax Tools And Mashups Ferraiolo
Interoperable Ajax Tools And Mashups Ferraiolo
rajivmordani
 
Jarkko Moilanen, APInf, “Get Control of Your IoT Cruisers” - Mindtrek 2017
Jarkko Moilanen, APInf, “Get Control of Your IoT Cruisers” - Mindtrek 2017Jarkko Moilanen, APInf, “Get Control of Your IoT Cruisers” - Mindtrek 2017
Jarkko Moilanen, APInf, “Get Control of Your IoT Cruisers” - Mindtrek 2017
Mindtrek
 
Anatomy of Open edX at DjangoCon 2018 (San Diego)
Anatomy of Open edX at DjangoCon 2018 (San Diego)Anatomy of Open edX at DjangoCon 2018 (San Diego)
Anatomy of Open edX at DjangoCon 2018 (San Diego)
Nate Aune
 
Rich User Interaction with SVG
Rich User Interaction with SVGRich User Interaction with SVG
Rich User Interaction with SVG
stsire
 
Spring into AI presented by Dan Vega 5/14
Spring into AI presented by Dan Vega 5/14Spring into AI presented by Dan Vega 5/14
Spring into AI presented by Dan Vega 5/14
VMware Tanzu
 
POSI Overview
POSI OverviewPOSI Overview
POSI Overview
aindilis
 
2022 APIsecure_Securing APIs with Open Standards
2022 APIsecure_Securing APIs with Open Standards2022 APIsecure_Securing APIs with Open Standards
2022 APIsecure_Securing APIs with Open Standards
APIsecure_ Official
 
Identifying Users Across Platforms with a Universal ID Webinar Slides
Identifying Users Across Platforms with a Universal ID Webinar SlidesIdentifying Users Across Platforms with a Universal ID Webinar Slides
Identifying Users Across Platforms with a Universal ID Webinar Slides
Looker
 

Similar to Multi-Modal Voice Development with Amazon Alexa (20)

Building appsinsilverlight4 part_1
Building appsinsilverlight4 part_1Building appsinsilverlight4 part_1
Building appsinsilverlight4 part_1
 
Web Accessbility
Web AccessbilityWeb Accessbility
Web Accessbility
 
Scalding Big (Ad)ta
Scalding Big (Ad)taScalding Big (Ad)ta
Scalding Big (Ad)ta
 
Confessions of-a-gadget-holic
Confessions of-a-gadget-holicConfessions of-a-gadget-holic
Confessions of-a-gadget-holic
 
Cross Device Accessibility
Cross Device AccessibilityCross Device Accessibility
Cross Device Accessibility
 
BarCamp KL H20 Open Social Hackathon
BarCamp KL H20 Open Social HackathonBarCamp KL H20 Open Social Hackathon
BarCamp KL H20 Open Social Hackathon
 
OSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications databaseOSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications database
 
Accessibility Enterprise
Accessibility EnterpriseAccessibility Enterprise
Accessibility Enterprise
 
Mashup Y! widget
Mashup Y! widgetMashup Y! widget
Mashup Y! widget
 
Creating Interactive Olap Applications With My Sql Enterprise And Mondrian Pr...
Creating Interactive Olap Applications With My Sql Enterprise And Mondrian Pr...Creating Interactive Olap Applications With My Sql Enterprise And Mondrian Pr...
Creating Interactive Olap Applications With My Sql Enterprise And Mondrian Pr...
 
PPT Companion to Android
PPT Companion to AndroidPPT Companion to Android
PPT Companion to Android
 
Choosing the right software for your research study : an overview of leading ...
Choosing the right software for your research study : an overview of leading ...Choosing the right software for your research study : an overview of leading ...
Choosing the right software for your research study : an overview of leading ...
 
Interoperable Ajax Tools And Mashups Ferraiolo
Interoperable Ajax Tools And Mashups FerraioloInteroperable Ajax Tools And Mashups Ferraiolo
Interoperable Ajax Tools And Mashups Ferraiolo
 
Jarkko Moilanen, APInf, “Get Control of Your IoT Cruisers” - Mindtrek 2017
Jarkko Moilanen, APInf, “Get Control of Your IoT Cruisers” - Mindtrek 2017Jarkko Moilanen, APInf, “Get Control of Your IoT Cruisers” - Mindtrek 2017
Jarkko Moilanen, APInf, “Get Control of Your IoT Cruisers” - Mindtrek 2017
 
Anatomy of Open edX at DjangoCon 2018 (San Diego)
Anatomy of Open edX at DjangoCon 2018 (San Diego)Anatomy of Open edX at DjangoCon 2018 (San Diego)
Anatomy of Open edX at DjangoCon 2018 (San Diego)
 
Rich User Interaction with SVG
Rich User Interaction with SVGRich User Interaction with SVG
Rich User Interaction with SVG
 
Spring into AI presented by Dan Vega 5/14
Spring into AI presented by Dan Vega 5/14Spring into AI presented by Dan Vega 5/14
Spring into AI presented by Dan Vega 5/14
 
POSI Overview
POSI OverviewPOSI Overview
POSI Overview
 
2022 APIsecure_Securing APIs with Open Standards
2022 APIsecure_Securing APIs with Open Standards2022 APIsecure_Securing APIs with Open Standards
2022 APIsecure_Securing APIs with Open Standards
 
Identifying Users Across Platforms with a Universal ID Webinar Slides
Identifying Users Across Platforms with a Universal ID Webinar SlidesIdentifying Users Across Platforms with a Universal ID Webinar Slides
Identifying Users Across Platforms with a Universal ID Webinar Slides
 

More from Ralf Eggert

PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
PHP Rewrite: Do the right thing (IPC Berlin 2024)
PHP Rewrite: Do the right thing (IPC Berlin 2024)PHP Rewrite: Do the right thing (IPC Berlin 2024)
PHP Rewrite: Do the right thing (IPC Berlin 2024)
Ralf Eggert
 
ChatGPT: unser täglich' Bot gib uns heute
ChatGPT: unser täglich' Bot gib uns heuteChatGPT: unser täglich' Bot gib uns heute
ChatGPT: unser täglich' Bot gib uns heute
Ralf Eggert
 
Der ultimative PHP Framework Vergleich 2023 Edition
Der ultimative PHP Framework Vergleich 2023 EditionDer ultimative PHP Framework Vergleich 2023 Edition
Der ultimative PHP Framework Vergleich 2023 Edition
Ralf Eggert
 
PHP Module als Rundum-Sorglos-Pakete entwickeln
PHP Module als Rundum-Sorglos-Pakete entwickelnPHP Module als Rundum-Sorglos-Pakete entwickeln
PHP Module als Rundum-Sorglos-Pakete entwickeln
Ralf Eggert
 
Alexa, what's next?
Alexa, what's next?Alexa, what's next?
Alexa, what's next?
Ralf Eggert
 
Alexa, wohin geht die Reise
Alexa, wohin geht die ReiseAlexa, wohin geht die Reise
Alexa, wohin geht die Reise
Ralf Eggert
 
8. Hamburg Voice Interface Meetup
8. Hamburg Voice Interface Meetup8. Hamburg Voice Interface Meetup
8. Hamburg Voice Interface Meetup
Ralf Eggert
 
Welcome Bixby
Welcome BixbyWelcome Bixby
Welcome Bixby
Ralf Eggert
 
Alexa Skill Maintenance
Alexa Skill MaintenanceAlexa Skill Maintenance
Alexa Skill Maintenance
Ralf Eggert
 
Vom Zend Framework zu Laminas
Vom Zend Framework zu LaminasVom Zend Framework zu Laminas
Vom Zend Framework zu Laminas
Ralf Eggert
 
Alexa Skills und PHP? Passt das zusammen?
Alexa Skills und PHP? Passt das zusammen?Alexa Skills und PHP? Passt das zusammen?
Alexa Skills und PHP? Passt das zusammen?
Ralf Eggert
 
Mit Jovo von 0 auf 100
Mit Jovo von 0 auf 100Mit Jovo von 0 auf 100
Mit Jovo von 0 auf 100
Ralf Eggert
 
Vom Zend Framework zu Laminas
Vom Zend Framework zu LaminasVom Zend Framework zu Laminas
Vom Zend Framework zu Laminas
Ralf Eggert
 
Alexa for Hospitality
Alexa for HospitalityAlexa for Hospitality
Alexa for Hospitality
Ralf Eggert
 
Alexa, lass uns Geld verdienen – fünf Geschäftsmodelle, die wirklich funktion...
Alexa, lass uns Geld verdienen – fünf Geschäftsmodelle, die wirklich funktion...Alexa, lass uns Geld verdienen – fünf Geschäftsmodelle, die wirklich funktion...
Alexa, lass uns Geld verdienen – fünf Geschäftsmodelle, die wirklich funktion...
Ralf Eggert
 
Fortgeschrittene Techniken für erfolgreiche Sprachanwendungen
Fortgeschrittene Techniken für erfolgreiche SprachanwendungenFortgeschrittene Techniken für erfolgreiche Sprachanwendungen
Fortgeschrittene Techniken für erfolgreiche Sprachanwendungen
Ralf Eggert
 
Die sieben Projektphasen für Voice Projekte
Die sieben Projektphasen für Voice ProjekteDie sieben Projektphasen für Voice Projekte
Die sieben Projektphasen für Voice Projekte
Ralf Eggert
 
Künstliche Intelligenz – Traum und Wirklichkeit
Künstliche Intelligenz – Traum und WirklichkeitKünstliche Intelligenz – Traum und Wirklichkeit
Künstliche Intelligenz – Traum und Wirklichkeit
Ralf Eggert
 
Mein Haus, mein Auto, mein Backend
Mein Haus, mein Auto, mein BackendMein Haus, mein Auto, mein Backend
Mein Haus, mein Auto, mein Backend
Ralf Eggert
 

More from Ralf Eggert (20)

PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
PHP Rewrite: Do the right thing (IPC Berlin 2024)
PHP Rewrite: Do the right thing (IPC Berlin 2024)PHP Rewrite: Do the right thing (IPC Berlin 2024)
PHP Rewrite: Do the right thing (IPC Berlin 2024)
 
ChatGPT: unser täglich' Bot gib uns heute
ChatGPT: unser täglich' Bot gib uns heuteChatGPT: unser täglich' Bot gib uns heute
ChatGPT: unser täglich' Bot gib uns heute
 
Der ultimative PHP Framework Vergleich 2023 Edition
Der ultimative PHP Framework Vergleich 2023 EditionDer ultimative PHP Framework Vergleich 2023 Edition
Der ultimative PHP Framework Vergleich 2023 Edition
 
PHP Module als Rundum-Sorglos-Pakete entwickeln
PHP Module als Rundum-Sorglos-Pakete entwickelnPHP Module als Rundum-Sorglos-Pakete entwickeln
PHP Module als Rundum-Sorglos-Pakete entwickeln
 
Alexa, what's next?
Alexa, what's next?Alexa, what's next?
Alexa, what's next?
 
Alexa, wohin geht die Reise
Alexa, wohin geht die ReiseAlexa, wohin geht die Reise
Alexa, wohin geht die Reise
 
8. Hamburg Voice Interface Meetup
8. Hamburg Voice Interface Meetup8. Hamburg Voice Interface Meetup
8. Hamburg Voice Interface Meetup
 
Welcome Bixby
Welcome BixbyWelcome Bixby
Welcome Bixby
 
Alexa Skill Maintenance
Alexa Skill MaintenanceAlexa Skill Maintenance
Alexa Skill Maintenance
 
Vom Zend Framework zu Laminas
Vom Zend Framework zu LaminasVom Zend Framework zu Laminas
Vom Zend Framework zu Laminas
 
Alexa Skills und PHP? Passt das zusammen?
Alexa Skills und PHP? Passt das zusammen?Alexa Skills und PHP? Passt das zusammen?
Alexa Skills und PHP? Passt das zusammen?
 
Mit Jovo von 0 auf 100
Mit Jovo von 0 auf 100Mit Jovo von 0 auf 100
Mit Jovo von 0 auf 100
 
Vom Zend Framework zu Laminas
Vom Zend Framework zu LaminasVom Zend Framework zu Laminas
Vom Zend Framework zu Laminas
 
Alexa for Hospitality
Alexa for HospitalityAlexa for Hospitality
Alexa for Hospitality
 
Alexa, lass uns Geld verdienen – fünf Geschäftsmodelle, die wirklich funktion...
Alexa, lass uns Geld verdienen – fünf Geschäftsmodelle, die wirklich funktion...Alexa, lass uns Geld verdienen – fünf Geschäftsmodelle, die wirklich funktion...
Alexa, lass uns Geld verdienen – fünf Geschäftsmodelle, die wirklich funktion...
 
Fortgeschrittene Techniken für erfolgreiche Sprachanwendungen
Fortgeschrittene Techniken für erfolgreiche SprachanwendungenFortgeschrittene Techniken für erfolgreiche Sprachanwendungen
Fortgeschrittene Techniken für erfolgreiche Sprachanwendungen
 
Die sieben Projektphasen für Voice Projekte
Die sieben Projektphasen für Voice ProjekteDie sieben Projektphasen für Voice Projekte
Die sieben Projektphasen für Voice Projekte
 
Künstliche Intelligenz – Traum und Wirklichkeit
Künstliche Intelligenz – Traum und WirklichkeitKünstliche Intelligenz – Traum und Wirklichkeit
Künstliche Intelligenz – Traum und Wirklichkeit
 
Mein Haus, mein Auto, mein Backend
Mein Haus, mein Auto, mein BackendMein Haus, mein Auto, mein Backend
Mein Haus, mein Auto, mein Backend
 

Recently uploaded

AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
IndexBug
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
CAKE: Sharing Slices of Confidential Data on Blockchain
CAKE: Sharing Slices of Confidential Data on BlockchainCAKE: Sharing Slices of Confidential Data on Blockchain
CAKE: Sharing Slices of Confidential Data on Blockchain
Claudio Di Ciccio
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
David Brossard
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Things to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUUThings to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUU
FODUU
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
SitimaJohn
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 

Recently uploaded (20)

AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
CAKE: Sharing Slices of Confidential Data on Blockchain
CAKE: Sharing Slices of Confidential Data on BlockchainCAKE: Sharing Slices of Confidential Data on Blockchain
CAKE: Sharing Slices of Confidential Data on Blockchain
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Things to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUUThings to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUU
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 

Multi-Modal Voice Development with Amazon Alexa

  • 1. 1 / 78 Multimodal Voice Development with Amazon Alexa
  • 2. 2 / 78 Ralf Eggert CEO Travello GmbH Alexa Skill Developer Advisory Board Voice Con Alexa Champion
  • 3. 3 / 78 Portfolio (excerpt)
  • 4. 4 / 78 Three questions
  • 5. 5 / 78 Question 1: Who owns an Echo device with a display?
  • 6. 6 / 78 Question 2: Who has developed a Multimodal Skill for Alexa?
  • 7. 7 / 78 Question 3: Who has an APL based Multimodal Skill for Alexa live?
  • 8. 8 / 78 A definition of multimodal
  • 9. 9 / 78 What is this multimodal stuff anyway?
  • 10. 10 / 78 Multimodal Multimodality describes communication practices in terms of the textual, aural, linguistic, spatial, and visual resources - or modes - used to compose messages. Murray, Joddy (2013) / Wikipedia
  • 11. 11 / 78 Textual Multimodality describes communication practices in terms of the textual, aural, linguistic, spatial, and visual resources - or modes - used to compose messages. Murray, Joddy (2013) / Wikipedia
  • 12. 12 / 78 Aural Multimodality describes communication practices in terms of the textual, aural, linguistic, spatial, and visual resources - or modes - used to compose messages. Murray, Joddy (2013) / Wikipedia
  • 13. 13 / 78 Linguistic Multimodality describes communication practices in terms of the textual, aural, linguistic, spatial, and visual resources - or modes - used to compose messages. Murray, Joddy (2013) / Wikipedia
  • 14. 14 / 78 Spatial Multimodality describes communication practices in terms of the textual, aural, linguistic, spatial, and visual resources - or modes - used to compose messages. Murray, Joddy (2013) / Wikipedia
  • 15. 15 / 78 Visual Multimodality describes communication practices in terms of the textual, aural, linguistic, spatial, and visual resources - or modes - used to compose messages. Murray, Joddy (2013) / Wikipedia
  • 16. 16 / 78 Textual Aural Linguistic Visual Let's focus on
  • 18. 18 / 78 OTIS REM Who knows this smart voice assistant? Is it multimodal?
  • 19. 19 / 78 Alexa & Multimodal
  • 20. 20 / 78 OK! How about Alexa?
  • 21. 21 / 78 AWS Lambda / HTTPS Endpoint Server; Alexa Voice Service; Headless
  • 22. 22 / 78 Cards Support headless Echo devices with visuals. Are they multimodal?
  • 23. 23 / 78 Echo Buttons Another input media. Are they multimodal? What do you think?
  • 24. 24 / 78 AWS Lambda / HTTPS Endpoint Server; Alexa Voice Service; Multimodal
  • 25. 25 / 78 Headless vs. Multimodal: Echo Dot, Echo, Echo Show, Echo Spot
  • 26. 26 / 78 Display devices: Only 5.9 % of Alexa users in the US own an Echo device with a display. Voicebot.ai, June 24, 2018, see https://goo.gl/7WSkjD
  • 27. 27 / 78 More numbers: 56.2 % own Echo Spot; 25.0 % own Echo Show; 18.7 % own both devices. Voicebot.ai, June 24, 2018, see https://goo.gl/7WSkjD
  • 28. 28 / 78 Mind the trap! Many Alexa Skills focus on the Echo Show rather than the Echo Spot, yet more than twice as many Echo Spot as Echo Show devices were sold.
  • 29. 29 / 78 History: display interface
  • 30. 30 / 78 Echo Show 1st generation Echo Show was introduced in May 2017.
  • 31. 31 / 78 Echo Spot 1st generation Echo Spot was introduced in September 2017.
  • 32. 32 / 78 Display Interface Support for displays added to Alexa developer console
  • 33. 33 / 78 Reference Learn how to build multimodal Alexa Skills with the Display Interface Reference https://goo.gl/eeSQah
  • 34. 34 / 78 Display Interface Templates
  • 35. 35 / 78 Body Template 1 Echo Show Echo Spot
  • 36. 36 / 78 Body Template 2 Echo Show Echo Spot
  • 37. 37 / 78 Body Template 3 Echo Show Echo Spot
  • 38. 38 / 78 Body Template 6 Echo Show Echo Spot
  • 39. 39 / 78 Body Template 7 Echo Show Echo Spot
  • 40. 40 / 78 List Template 1 Echo Show Echo Spot
  • 41. 41 / 78 List Template 2 Echo Show Echo Spot
  • 42. 42 / 78 Wait! What happened to Body Templates 4 and 5? Does anyone know?
  • 43. 43 / 78 Template Docs Learn to choose the best template for your needs. https://goo.gl/pp4STX
  • 44. 44 / 78 { "version": "1.0", "session": {}, "context": { "Display": {}, "System": { "device": { "deviceId": "...", "supportedInterfaces": { "Display": { "templateVersion": "1.0", "markupVersion": "1.0" } } } } }, "request": {} } Alexa Request Very simplified
  • 45. 45 / 78 { "body": { "version": "1.0", "response": { "outputSpeech": {}, "card": {}, "directives": [ { "type": "Display.RenderTemplate", "template": {} }, { "type": "Hint", "hint": {} } ], "reprompt": {}, "shouldEndSession": false } } } Alexa Response Also very simplified
  • 46. 46 / 78 { "type": "BodyTemplate6", "token": "launch", "backgroundImage": { "contentDescription": "Hello", "sources": [ { "url": "https://host.com/image.jpg", "widthPixels": 0, "heightPixels": 0 } ] }, "textContent": { "primaryText": { "type": "RichText", "text": "<font size=\"7\">Hello</font>" } }, "backButton": "HIDDEN" } RenderTemplate As well simplified
  • 47. 47 / 78 Cons of Display Interface: Only 7 templates; Distinction Show vs. Spot; Templates on Spot suck; Static layouts; Hard to test
  • 48. 48 / 78 Alexa Presentation Language
  • 50. 50 / 78 APL schedule: Private beta June 2018; Public beta October 2018; General availability soon
  • 51. 51 / 78 Echo Show 2nd generation Echo Show was presented in September 2018.
  • 52. 52 / 78 Alexa Presentation Language Support for APL added to Alexa developer console
  • 53. 53 / 78 Reference Learn how to get started with the Alexa Presentation Language https://goo.gl/Zw9m62
  • 54. 54 / 78 Authoring Tool Design your own templates https://goo.gl/jbC535
  • 55. 55 / 78 Authoring Tool Pick from the predefined samples, start from scratch, or upload your code
  • 56. 56 / 78 APL terminology Document, package, layout, resources, styles, and components https://goo.gl/ufq6SH
  • 57. 57 / 78 APL document JSON file containing a list of packages, resources, layouts, and styles. Works like a container and is sent to the device.
  • 58. 58 / 78 APL package Packages contain APL documents and images. Can easily be reused and are cached on the device.
  • 59. 59 / 78 APL layout Hierarchical set of components for rendering on the display. Can contain text, images, scrolling regions and even other layouts. Can be used to build libraries.
  • 60. 60 / 78 APL resources Defined constants to be used for drawing text or images on the screen. For example font sizes, colours or spacing.
  • 61. 61 / 78 APL styles Collection of grouped resources to build a style. Defines size, background colour, text colour, borders, etc.
  • 62. 62 / 78 APL components Components are primitive types to be added to a layout. Examples are containers, text, images, sequences, scroll views, or touch wrappers
  • 63. 63 / 78 { "version": "1.0", "session": {}, "context": { "Display": {}, "System": { "device": { "deviceId": "...", "supportedInterfaces": { "Alexa.Presentation.APL": { "runtime": { "maxVersion": "1.0" } } } } } }, "request": {} } Alexa Request With active APL support
  • 64. 64 / 78 { "body": { "version": "1.0", "response": { "outputSpeech": {}, "card": {}, "directives": [ { "type": "Alexa.Presentation.APL.RenderDocument", "version": "1.0", "document": {}, "datasources": {} } ], "reprompt": {}, "shouldEndSession": false } } } Alexa Response Also very simplified
  • 65. 65 / 78 { "type": "APL", "version": "1.0", "import": [], "resources": [], "styles": {}, "layouts": {}, "mainTemplate": {} } APL Document Just the structure
  • 66. 66 / 78 Authoring tool hands-on
  • 67. 67 / 78 Pros of APL: Start with samples; Build from scratch; Build reusable packages; Authoring tool; Easy to test; Split display design from code
  • 68. 68 / 78 Cons of APL: Response is more complex; No saving option in the authoring tool; Not fully rolled out yet; No graphical editor yet
  • 69. 69 / 78 SDKs / tools with APL support: Third party tools; Official SDKs; December 2018; January 2019; Alexa.NET
  • 70. 70 / 78 Compare display interface and APL
  • 71. 71 / 78 Display interface start screen Thanks to Daniel Mittendorf
  • 72. 72 / 78 APL start screen Thanks to Daniel Mittendorf
  • 73. 73 / 78 Display interface list screen Thanks to Daniel Mittendorf
  • 74. 74 / 78 APL list screen Thanks to Daniel Mittendorf
  • 76. 76 / 78 Start with APL: Public beta phase; You could start today; Consider the numbers of sold display devices!
  • 77. 77 / 78 Need more motivation? Alexa Skills Challenge: $150K in total prizes; Bonus prize for Germany; Enter until the 22nd of January 2019 https://goo.gl/EETRu5
  • 78. 78 / 78 Any questions? ralf@travello.audio https://www.travello.audio
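
The simplified request and response shapes on slides 44 to 46 and 63 to 65 suggest a simple pattern: read `supportedInterfaces` from the incoming request, then attach an APL directive, a Display Interface directive, or no visual directive at all. The following Python sketch illustrates that idea with plain dictionaries and no SDK. The helper names (`supported_interfaces`, `build_visual_directive`, `build_response`), the `"launch"` token on the APL directive, the `Text` component in `mainTemplate`, and the speech text are assumptions made for illustration and are not taken from the slides. The APL directive uses the key `datasources`, which is the spelling in Amazon's APL documentation as far as I am aware.

```python
from typing import Any, Dict, Optional


def supported_interfaces(request_envelope: Dict[str, Any]) -> Dict[str, Any]:
    """Return context.System.device.supportedInterfaces, or {} if absent."""
    device = (
        request_envelope.get("context", {})
        .get("System", {})
        .get("device", {})
    )
    return device.get("supportedInterfaces", {})


def build_visual_directive(request_envelope: Dict[str, Any]) -> Optional[Dict[str, Any]]:
    """Pick a visual directive based on what the device reports (hypothetical helper)."""
    interfaces = supported_interfaces(request_envelope)

    # Prefer APL when the device announces it (slide 63).
    if "Alexa.Presentation.APL" in interfaces:
        return {
            "type": "Alexa.Presentation.APL.RenderDocument",
            "token": "launch",  # assumed token value
            "document": {
                "type": "APL",
                "version": "1.0",
                "mainTemplate": {
                    # A single Text component, chosen only for illustration.
                    "items": [{"type": "Text", "text": "Hello"}]
                },
            },
            "datasources": {},
        }

    # Fall back to a Display Interface template (slides 44 to 46).
    if "Display" in interfaces:
        return {
            "type": "Display.RenderTemplate",
            "template": {
                "type": "BodyTemplate6",
                "token": "launch",
                "backButton": "HIDDEN",
                "textContent": {
                    "primaryText": {
                        "type": "RichText",
                        "text": '<font size="7">Hello</font>',
                    }
                },
            },
        }

    # Headless device: voice only.
    return None


def build_response(request_envelope: Dict[str, Any], speech: str) -> Dict[str, Any]:
    """Assemble a simplified response; directives are added only when supported."""
    response: Dict[str, Any] = {
        "outputSpeech": {"type": "PlainText", "text": speech},
        "shouldEndSession": False,
    }
    directive = build_visual_directive(request_envelope)
    if directive is not None:
        response["directives"] = [directive]
    return {"version": "1.0", "response": response}
```

Keeping the interface check in one place means a headless Echo or Echo Dot never receives a display directive, while display devices get the richest format they report.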