Designing complex interactions for experiences that target XR headsets (MR/VR/AR) can be challenging due to the limited input schemes. While voice commands can be used to augment XR input peripherals, adhering to a rigid keyword-based system can break immersion and hinder user adoption. Advances in Machine Learning (ML) now allow developers to easily leverage Natural Language Understanding through reusable techniques. The combination of XR and AI opens new possibilities for gaming, entertainment, and enterprise scenarios. This session is an exploration of how speech and language understanding can be used to augment Mixed Reality and VR experiences. We'll explore the use of speech recognition and Natural Language Understanding to build advanced voice commands, translate languages from within XR environments, and also look at the creation of intelligent conversation assistants to be used as interactive entities in Mixed Reality and VR apps and games. In a world where speech is the primary form of input, using Machine Learning to process language input and understand the user's intent is of paramount importance.
9. Cognitive Services
• Computer Vision + Holographic/AR
• Language services for MR
Custom AI & Data Services
• Access to cloud data (SQL, Cosmos, etc.) from MR
• Calling Azure ML APIs from MR
Immersive Agents
• Smart assistants powered by Bots
• Learning agents powered by ML
Local AI Services
• Offline AI access via Windows ML
• Access to Deep Learning frameworks (CNTK, TF, etc.)
10. Microsoft Cognitive Services (https://www.microsoft.com/cognitive-services/)
• Speech: Speech Synthesis & Recognition, Custom Speech Recognition (CRIS), Speaker Recognition, Translator Speech, Custom Voice
• Vision: Computer Vision, Face, Emotion, Video, Custom Vision
• Language: Language Understanding, Linguistic Analysis, Text Analytics, WebLM, Bing Spell Check
• Knowledge: Entity Linking, Knowledge Exploration, Academic Knowledge, Recommendations
• Search: Bing Image Search, Bing Video Search, Bing Web Search, Bing News Search, Bing Autosuggest
12. Why Speech Matters in XR Experiences
• Speech is the most convenient input method
• No keyboard, mouse, or touch screen (MR, VR)
• Limited input options with gestures and/or motion controllers (MR, VR)
• Limited ability to interact with the screen (phone-based AR)
• Voice recognition vs. speech recognition
• Speech recognition vs. intent recognition
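To make the last distinction concrete, here is a minimal, illustrative sketch (not the LUIS API or any real speech SDK): a rigid keyword system only fires on exact phrases, while a simple intent recognizer, here scored by word overlap against example utterances, also handles paraphrases. All names and thresholds are invented for illustration.

```python
from typing import Optional

# Keyword-based: the user must say an exact phrase, or nothing matches.
KEYWORD_COMMANDS = {
    "open menu": "OPEN_MENU",
    "take screenshot": "TAKE_SCREENSHOT",
}

def keyword_recognize(utterance: str) -> Optional[str]:
    """Exact-match lookup: fails on any rewording."""
    return KEYWORD_COMMANDS.get(utterance.lower().strip())

# Intent-based: score each intent's example phrases by Jaccard word
# overlap, so paraphrases like "could you show the menu" still resolve.
INTENT_EXAMPLES = {
    "OPEN_MENU": ["open the menu", "show me the menu", "bring up the menu"],
    "TAKE_SCREENSHOT": ["take a screenshot", "capture the screen"],
}

def intent_recognize(utterance: str) -> Optional[str]:
    """Return the best-matching intent, or None below a small threshold."""
    words = set(utterance.lower().split())
    best_intent, best_score = None, 0.0
    for intent, examples in INTENT_EXAMPLES.items():
        for example in examples:
            ex_words = set(example.split())
            score = len(words & ex_words) / len(words | ex_words)
            if score > best_score:
                best_intent, best_score = intent, score
    return best_intent if best_score > 0.2 else None
```

A real XR app would delegate this scoring to a cloud NLU service, but the contrast is the same: the keyword map rejects "could you show the menu" outright, while the intent recognizer still maps it to OPEN_MENU.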
13. Not just for XR
Many games can benefit from voice input, speech recognition & synthesis:
• FPS squad orders
• RPG dialogue
• RTS commands
• Space sim controls
• Interactive fiction & trivia choices
• Hands-free gaming
• Voiced computers, robots, etc.
20. Bot Framework architecture (diagram)
• { Your Code }: conversational and business logic; canvas-aware and context-sensitive
• SDK: Bot Builder SDK
• Platform: platform services, REST endpoints, Direct Line protocol over HTTP
• AI: intelligent tools
21. Goals
• Start simple. Add complexity. No dead-ends.
• Bot adapts to the user, based on context
• Composable and intelligent controls to manage complexity
Bot Controls: LUIS, query over a database via Azure Search, form filling, QnA
(Diagram: customer's business logic & data in C#, connected through the Bot Connector)
What?
• Tools for building REST web services
• Services to enrich them
• Mechanisms for receiving events
• Data to debug and analyze
Why?
• Implements standard protocols
• Modeling conversations is hard. Tools help!
• UI across multiple canvases is hard. Cards rock!
• Language understanding is hard
• Common and well-understood patterns
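"Modeling conversations is hard" is easiest to see with form filling, one of the bot controls listed above. The sketch below is not the Bot Builder SDK; it is a hypothetical, minimal dialog control showing the pattern such tools encapsulate: the control tracks which fields are still missing and prompts for them in turn, so the business logic never manages conversation state by hand.

```python
class FormFillingDialog:
    """Toy form-filling control: prompts for each missing field in order."""

    def __init__(self, fields):
        self.fields = fields   # ordered list of field names to collect
        self.answers = {}      # field name -> user's reply

    def next_prompt(self):
        """Return the next question, or None when the form is complete."""
        for field in self.fields:
            if field not in self.answers:
                return f"What is your {field}?"
        return None

    def handle_reply(self, reply):
        """Store the reply against the first unanswered field."""
        for field in self.fields:
            if field not in self.answers:
                self.answers[field] = reply
                return

# Example: a pizza-ordering dialog collecting two fields turn by turn.
dialog = FormFillingDialog(["size", "topping"])
prompts = []
for reply in ["large", "mushrooms"]:
    prompts.append(dialog.next_prompt())
    dialog.handle_reply(reply)
```

Even this toy version shows why a reusable control beats ad hoc state flags: adding a field to the form is one list entry, not a new branch in every message handler.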
38.
1. Simple reusable solution that easily demonstrates the potential of Mixed Reality combined with AI services and a cloud backend in Azure
2. The HoloBot model can easily be replaced to match any company-branded asset using custom textures or full 3D models
3. HoloBot can be integrated as a virtual assistant for any immersive/VR or holographic Mixed Reality experience, powered by Bot Framework
4. Beyond LUIS, bots can connect to more advanced Machine Learning models or data sources, allowing voice-activated, touch-free access