SlideShare a Scribd company logo
1 of 18
Download to read offline
SSML - Making your skills
sound right
@RichMerrett815
@VeniLoqui
#alexadevscamb
What is it?
● Speech Synthesis Markup Language
● XML Based
● W3C
● Gives you extra control over the speech in your skills
● Uses tags like HTML
● Ever heard anything pronounced wrong by Alexa? They haven’t used SSML!
● One of our brands requires it
○ <phoneme alphabet="ipa" ph="vɒks'ɛl'əɹeɪt">VoxLR8</phoneme>
● Disclaimer: Amazon have a great site guiding you through SSML
https://developer.amazon.com/docs/custom-skills/speech-synthesis-markup-language-ssml-reference.html
What can it be used for?
● Pauses
● Emphasis
● Phonemes
● Prosody
● Saying as
● Tense/Word Type
● Amazon Effects
● Adding short audio files
● Language - New
● Amazon Polly - New
● Speechcons
<speak>
● Need to include all SSML within the <speak></speak> tags
● Within your index.js file you need to set the output speech type to SSML
(default is plain text)
○ Workaround for having to add the <speak> tags to every bit of speech.
<break>
● Break tag adds in pauses.
● Good to use in place of commas, full stops and <p> if your skill is multi modal (including cards)
● Attributes
○ Strength
■ Medium (equivalent of a comma)
■ Strong (equivalent of a full stop or <s>)
■ X-strong (equivalent of a <p>)
This pause <break strength= "medium"/> is a medium pause
○ Time
■ Seconds (s)
■ Milliseconds (ms)
This pause <break time= "2s"/> is a two second pause
This pause <break time= "300ms"/> is a three hundred millisecond pause
<emphasis>
● Changes the rate and volume of the speech
● The more emphasis, the more like you are trying to order a cheeseburger in
France (Louder and Slower)
● Attributes
○ Strong - increase volume and slow down speaking rate
○ Moderate - increase volume and slow down speaking rate (but not as much as strong)
○ Reduced - Decrease volume and speed up speaking rate
I said I would <emphasis level="strong">really like a cheeseburger please </emphasis>
I said I would <emphasis level="reduced">really like a cheeseburger please </emphasis>
<phoneme>
● Allows you to program Alexa how to pronounce something.
● Attributes
○ Alphabet (phonetic alphabet to use)
■ Ipa (International Phonetic Alphabet)
■ X-sampa (Extended Speech Assessment Methods Phonetic Alphabet)
○ ph (the phonetic pronunciation to speak - symbols)
● Symbols have slight variations across countries.
<phoneme>...continued
<prosody>
● Changes the volume, pitch and rate of speech.
● Attributes
○ Rate - change the rate of speech
■ X-slow, slow, medium, fast, x-fast
■ 100% - no change
■ > 100% increase rate
■ < 100% decrease rate (min 20%)
○ Pitch - raise or lower the tone of the speech
■ X-low, low, medium, high, x-high
■ 1% - 51% increase pitch
■ -1% - -33.3% decrease the pitch
○ Volume - change the volume of the speech relative to the current volume level.
■ Silent, x-soft, soft, medium, loud, x-loud
■ +0.01db - 4.08db increase volume
■ -0.01db - -6db decrease volume
<say-as>
● Determines how text should be interpreted.
● Attribute:
○ Interpret-as:
■ Characters, spell-out
■ Cardinal, number
■ Ordinal
■ Digits
■ Fraction
■ Unit
■ Date - can specify the format, using the ‘format’ attribute
■ Time
■ Telephone
■ Address
■ Interjection
■ Expletive
<w>
● Customises the pronunciation of words by specifying the word’s part of
speech.
● Attribute
○ Role
■ amazon:VB - Verb
■ amazon:VBD - past participle
■ amazon:NN - noun
■ amazon:SENSE_1 - where there are different meanings (Homographs) i.e. Bass and
Bass, Bow and Bow, Wind and Wind.
<amazon:effect>
● Specialist Amazon effects. Cannot be used elsewhere.
● Attribute:
○ Name
■ Whispered
<audio>
● Allows you to insert MP3 files into the speech
○ HTTPS endpoint
○ No customer specific information
○ Valid MP3 file (MPEG v2)
○ No longer than 90 seconds
○ Bit rate = 48kbps
○ Sample rate = 16000Hz
● Attribute
○ Src
● Alexa Sound Library - NEW!
<lang> NEW!
● The language tag allows you to make the text spoken as it should be in the
language it is written in
● Attribute
○ xml:lang
■ Supports all Amazon Polly languages
<voice>
● Can use an Amazon Polly voice in your skills response
○ 50+ voices in 25+ languages
● Dependant on locale what tags you use.
● <lang> can be used with it.
Speechcons
● Speech Emoji’s
● Part of the <say-as> tag using the interpret as “interjection” attribute.
○ <say-as interpret-as="interjection">Wow.</say-as>
● Supported in the languages Alexa is available for
https://developer.amazon.com/docs/custom-skills/speechcon-reference-interjections-english-uk.html
SSML Top Tips...from the front line
● Use the voice simulator to test every bit of speech
○ At the very least the key parts.
● Design for multi-modal (whether you are using it or not)
● Sound each word out aloud so you can hear what each element sounds like
● Trial and error with phonemes
● Go back to school
● Utilise other voices
● Use audio files
● When creating other types of media e.g. webpages, run them through the
voice simulator before publishing. It's a great way to pick up mistakes!
<phoneme alphabet="ipa" ph="θ'ænk
ju">Thank You</phoneme>
@RichMerrett815
@VeniLoqui
#alexadevscamb

More Related Content

Similar to SSML - Why Bother?

Malayalam computing
Malayalam computingMalayalam computing
Malayalam computingCelsius A T
 
AWS re:Invent 2016: NEW LAUNCH! Introducing Amazon Polly (MAC204)
AWS re:Invent 2016: NEW LAUNCH! Introducing Amazon Polly (MAC204)AWS re:Invent 2016: NEW LAUNCH! Introducing Amazon Polly (MAC204)
AWS re:Invent 2016: NEW LAUNCH! Introducing Amazon Polly (MAC204)Amazon Web Services
 
HREFLANG for International SEO: Lessons from 3,000 Implementations
HREFLANG for International SEO: Lessons from 3,000 ImplementationsHREFLANG for International SEO: Lessons from 3,000 Implementations
HREFLANG for International SEO: Lessons from 3,000 ImplementationsErudite
 
Amazon Polly for Drupal
Amazon Polly for DrupalAmazon Polly for Drupal
Amazon Polly for DrupalSwapnil Pawar
 
Ry pyconjp2015 karaoke
Ry pyconjp2015 karaokeRy pyconjp2015 karaoke
Ry pyconjp2015 karaokeRenyuan Lyu
 
LocJam 2014 Barcelona Workshop - Carme Mangiron
LocJam 2014 Barcelona Workshop - Carme MangironLocJam 2014 Barcelona Workshop - Carme Mangiron
LocJam 2014 Barcelona Workshop - Carme MangironCarme Mangiron Hevia
 
Narrate Your Way To Success
Narrate Your Way To SuccessNarrate Your Way To Success
Narrate Your Way To SuccessTCUK
 
International sites hreflang
International sites hreflangInternational sites hreflang
International sites hreflangDmitry Sergiets
 
Building Speech Enabled Products with Amazon Polly & Amazon Lex
Building Speech Enabled Products with Amazon Polly & Amazon LexBuilding Speech Enabled Products with Amazon Polly & Amazon Lex
Building Speech Enabled Products with Amazon Polly & Amazon LexAmazon Web Services
 
Building Speech Enabled Products with Amazon Polly & Amazon Lex
Building Speech Enabled Products with Amazon Polly & Amazon LexBuilding Speech Enabled Products with Amazon Polly & Amazon Lex
Building Speech Enabled Products with Amazon Polly & Amazon LexAmazon Web Services
 
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...nehachhh
 
머신러닝으로 유창해지는 AWS의 언어실력 - 트랙1, Community Day 2018 re:Invent 특집
머신러닝으로 유창해지는 AWS의 언어실력 - 트랙1, Community Day 2018 re:Invent 특집머신러닝으로 유창해지는 AWS의 언어실력 - 트랙1, Community Day 2018 re:Invent 특집
머신러닝으로 유창해지는 AWS의 언어실력 - 트랙1, Community Day 2018 re:Invent 특집AWSKRUG - AWS한국사용자모임
 
International sites hreflang video slides
International sites hreflang video slidesInternational sites hreflang video slides
International sites hreflang video slidesDmitry Sergiets
 
SynPhony2014
SynPhony2014SynPhony2014
SynPhony2014langtech
 

Similar to SSML - Why Bother? (20)

Malayalam computing
Malayalam computingMalayalam computing
Malayalam computing
 
Build your own ASR engine
Build your own ASR engineBuild your own ASR engine
Build your own ASR engine
 
AWS re:Invent 2016: NEW LAUNCH! Introducing Amazon Polly (MAC204)
AWS re:Invent 2016: NEW LAUNCH! Introducing Amazon Polly (MAC204)AWS re:Invent 2016: NEW LAUNCH! Introducing Amazon Polly (MAC204)
AWS re:Invent 2016: NEW LAUNCH! Introducing Amazon Polly (MAC204)
 
HREFLANG for International SEO: Lessons from 3,000 Implementations
HREFLANG for International SEO: Lessons from 3,000 ImplementationsHREFLANG for International SEO: Lessons from 3,000 Implementations
HREFLANG for International SEO: Lessons from 3,000 Implementations
 
Amazon Polly for Drupal
Amazon Polly for DrupalAmazon Polly for Drupal
Amazon Polly for Drupal
 
Ry pyconjp2015 karaoke
Ry pyconjp2015 karaokeRy pyconjp2015 karaoke
Ry pyconjp2015 karaoke
 
Sslis
SslisSslis
Sslis
 
LocJam 2014 Barcelona Workshop - Carme Mangiron
LocJam 2014 Barcelona Workshop - Carme MangironLocJam 2014 Barcelona Workshop - Carme Mangiron
LocJam 2014 Barcelona Workshop - Carme Mangiron
 
Add more Speech API to your bot
Add more Speech API to your botAdd more Speech API to your bot
Add more Speech API to your bot
 
Narrate Your Way To Success
Narrate Your Way To SuccessNarrate Your Way To Success
Narrate Your Way To Success
 
International sites hreflang
International sites hreflangInternational sites hreflang
International sites hreflang
 
Aman Writer TRAINING MODULE
Aman Writer TRAINING MODULEAman Writer TRAINING MODULE
Aman Writer TRAINING MODULE
 
Voicepal review
Voicepal reviewVoicepal review
Voicepal review
 
Building Speech Enabled Products with Amazon Polly & Amazon Lex
Building Speech Enabled Products with Amazon Polly & Amazon LexBuilding Speech Enabled Products with Amazon Polly & Amazon Lex
Building Speech Enabled Products with Amazon Polly & Amazon Lex
 
Building Speech Enabled Products with Amazon Polly & Amazon Lex
Building Speech Enabled Products with Amazon Polly & Amazon LexBuilding Speech Enabled Products with Amazon Polly & Amazon Lex
Building Speech Enabled Products with Amazon Polly & Amazon Lex
 
Python overview
Python overviewPython overview
Python overview
 
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
 
머신러닝으로 유창해지는 AWS의 언어실력 - 트랙1, Community Day 2018 re:Invent 특집
머신러닝으로 유창해지는 AWS의 언어실력 - 트랙1, Community Day 2018 re:Invent 특집머신러닝으로 유창해지는 AWS의 언어실력 - 트랙1, Community Day 2018 re:Invent 특집
머신러닝으로 유창해지는 AWS의 언어실력 - 트랙1, Community Day 2018 re:Invent 특집
 
International sites hreflang video slides
International sites hreflang video slidesInternational sites hreflang video slides
International sites hreflang video slides
 
SynPhony2014
SynPhony2014SynPhony2014
SynPhony2014
 

Recently uploaded

Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)Paige Cruz
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptxFIDO Alliance
 
Introduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptxIntroduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptxFIDO Alliance
 
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...ScyllaDB
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuidePixlogix Infotech
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxFIDO Alliance
 
Intro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxIntro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxFIDO Alliance
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data SciencePaolo Missier
 
CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)Wonjun Hwang
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard37
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMKumar Satyam
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdfMuhammad Subhan
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsLeah Henrickson
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentationyogeshlabana357357
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!Memoori
 
ERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage IntacctERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage IntacctBrainSell Technologies
 
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...Skynet Technologies
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe中 央社
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfdanishmna97
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...marcuskenyatta275
 

Recently uploaded (20)

Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
 
Introduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptxIntroduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptx
 
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 
Intro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxIntro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptx
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
ERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage IntacctERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage Intacct
 
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
 

SSML - Why Bother?

  • 1. SSML - Making your skills sound right @RichMerrett815 @VeniLoqui #alexadevscamb
  • 2. What is it? ● Speech Synthesis Markup Language ● XML Based ● W3C ● Gives you extra control over the speech in your skills ● Uses tags like HTML ● Ever heard anything pronounced wrong by Alexa? They haven’t used SSML! ● One of our brands requires it ○ <phoneme alphabet="ipa" ph="vɒks'ɛl'əɹeɪt">VoxLR8</phoneme> ● Disclaimer: Amazon have a great site guiding you through SSML https://developer.amazon.com/docs/custom-skills/speech-synthesis-markup-language-ssml-reference.html
  • 3. What can it be used for? ● Pauses ● Emphasis ● Phonemes ● Prosody ● Saying as ● Tense/Word Type ● Amazon Effects ● Adding short audio files ● Language - New ● Amazon Polly - New ● Speechcons
  • 4. <speak> ● Need to include all SSML within the <speak></speak> tags ● Within your index.js file you need to set the output speech type to SSML (default is plain text) ○ Workaround for having to add the <speak> tags to every bit of speech.
  • 5. <break> ● Break tag adds in pauses. ● Good to use in place of commas, full stops and <p> if your skill is multi modal (including cards) ● Attributes ○ Strength ■ Medium (equivalent of a comma) ■ Strong (equivalent of a full stop or <s>) ■ X-strong (equivalent of a <p>) This pause <break strength= "medium"/> is a medium pause ○ Time ■ Seconds (s) ■ Milliseconds (ms) This pause <break time= "2s"/> is a two second pause This pause <break time= "300ms"/> is a three hundred millisecond pause
  • 6. <emphasis> ● Changes the rate and volume of the speech ● The more emphasis, the more like you are trying to order a cheeseburger in France (Louder and Slower) ● Attributes ○ Strong - increase volume and slow down speaking rate ○ Moderate - increase volume and slow down speaking rate (but not as much as strong) ○ Reduced - Decrease volume and speed up speaking rate I said I would <emphasis level="strong">really like a cheeseburger please </emphasis> I said I would <emphasis level="reduced">really like a cheeseburger please </emphasis>
  • 7. <phoneme> ● Allows you to program Alexa how to pronounce something. ● Attributes ○ Alphabet (phonetic alphabet to use) ■ Ipa (International Phonetic Alphabet) ■ X-sampa (Extended Speech Assessment Methods Phonetic Alphabet) ○ ph (the phonetic pronunciation to speak - symbols) ● Symbols have slight variations across countries.
  • 9. <prosody> ● Changes the volume, pitch and rate of speech. ● Attributes ○ Rate - change the rate of speech ■ X-slow, slow, medium, fast, x-fast ■ 100% - no change ■ > 100% increase rate ■ < 100% decrease rate (min 20%) ○ Pitch - raise or lower the tone of the speech ■ X-low, low, medium, high, x-high ■ 1% - 51% increase pitch ■ -1% - -33.3% decrease the pitch ○ Volume - change the volume of the speech relative to the current volume level. ■ Silent, x-soft, soft, medium, loud, x-loud ■ +0.01db - 4.08db increase volume ■ -0.01db - -6db decrease volume
  • 10. <say-as> ● Determines how text should be interpreted. ● Attribute: ○ Interpret-as: ■ Characters, spell-out ■ Cardinal, number ■ Ordinal ■ Digits ■ Fraction ■ Unit ■ Date - can specify the format, using the ‘format’ attribute ■ Time ■ Telephone ■ Address ■ Interjection ■ Expletive
  • 11. <w> ● Customises the pronunciation of words by specifying the word’s part of speech. ● Attribute ○ Role ■ amazon:VB - Verb ■ amazon:VBD - past participle ■ amazon:NN - noun ■ amazon:SENSE_1 - where there are different meanings (Homographs) i.e. Bass and Bass, Bow and Bow, Wind and Wind.
  • 12. <amazon:effect> ● Specialist Amazon effects. Cannot be used elsewhere. ● Attribute: ○ Name ■ Whispered
  • 13. <audio> ● Allows you to insert MP3 files into the speech ○ HTTPS endpoint ○ No customer specific information ○ Valid MP3 file (MPEG v2) ○ No longer than 90 seconds ○ Bit rate = 48kbps ○ Sample rate = 16000Hz ● Attribute ○ Src ● Alexa Sound Library - NEW!
  • 14. <lang> NEW! ● The language tag allows you to make the text spoken as it should be in the language it is written in ● Attribute ○ xml:lang ■ Supports all Amazon Polly languages
  • 15. <voice> ● Can use an Amazon Polly voice in your skills response ○ 50+ voices in 25+ languages ● Dependant on locale what tags you use. ● <lang> can be used with it.
  • 16. Speechcons ● Speech Emoji’s ● Part of the <say-as> tag using the interpret as “interjection” attribute. ○ <say-as interpret-as="interjection">Wow.</say-as> ● Supported in the languages Alexa is available for https://developer.amazon.com/docs/custom-skills/speechcon-reference-interjections-english-uk.html
  • 17. SSML Top Tips...from the front line ● Use the voice simulator to test every bit of speech ○ At the very least the key parts. ● Design for multi-modal (whether you are using it or not) ● Sound each word out aloud so you can hear what each element sounds like ● Trial and error with phonemes ● Go back to school ● Utilise other voices ● Use audio files ● When creating other types of media e.g. webpages, run them through the voice simulator before publishing. It's a great way to pick up mistakes!
  • 18. <phoneme alphabet="ipa" ph="θ'ænk ju">Thank You</phoneme> @RichMerrett815 @VeniLoqui #alexadevscamb