SlideShare a Scribd company logo
1 of 81
Download to read offline
Microsoft Cognitive Services at
a Glance
Marvin Heng | @hmheng
www.techconnect.io
Microsoft AI
Azure
Cognitive
Services
From faces to feelings, allow your
apps to understand images and video
Hear and speak to your users by filtering noise, identifying
speakers, and understanding intent
Process text and learn how to recognize what
users want
Tap into rich knowledge amassed from
the web, academia, or your own data
Access billions of web pages, images, videos, and news with
the power of Bing APIs
Why Azure Cognitive Services ?
Cognitive Services
Emotion
Computer Vision
Face
Video Indexer
Form Recognizer
Speech To Text
Text To Speech
Speech Translation
Speaker Recognition
Immersive Reader
Language
Understanding
QnA Maker
Text Analytics
Translator
Anomaly Detector
Content Moderator
Metrics Advisor
Personalizer
Bing Autosuggest
Bing Custom Search
Bing Entity Search
Bing Image Search
Bing News Search
Bing Spell Check
Bing Video Search
Bing Visual Search
Bing Web Search
Custom Vision
Bing Search
Bing Search
• Allow developers to integrate a search function to their apps that
allows users to find webpages, images, news, locations, and more
without advertisements
• For knowledge mining
Bing Search
Autosuggest
Entity Search
Custom Search
Image Search
News Search
Video Search
Visual Search
Spell Check
Local Business
Speech
Speech-to-Text
• Speech-to-text service
• Improves meeting efficiency by transcribing conversations in real-time
• Help safeguard data with industry-leading security and compliance
certifications.
• Integrates with a variety of meeting conference solutions including
Microsoft Teams and other third-party meeting software.
• SDK is available.
Speech-to-Text
Speaker Recognition
“who is speaking?”
Speaker Verification
• Text-dependent verification means
speakers need to choose the same
passphrase to use during both enrollment
and verification phases.
• Text-independent verification means
speakers can speak in everyday language in
the enrollment and verification phrases.
Text-to-Speech
• Convert text into human-like synthesized speech.
• Offer 75+ standard in more than 45 languages and locales, and 5
neural voices
• Tune voice output by easily adjusting rate, pitch, pronunciation,
pauses, and more.
• Speech synthesis
• Asynchronous synthesis of long audio
• Speech Synthesis Markup Language (SSML)
Speech Translation
Microsoft’s
Translation
Engine
Statistical
machine
translation
(SMT)
Neural
machine
translation
(NMT)
Speech Translation
• Speech-to-text translation with recognition results.
• Speech-to-speech translation.
• Support for translation to multiple target languages.
• Interim recognition and translation results.
Use case of Speech Service
Language
Language Understanding
• Applies custom machine-learning intelligence to a user's
conversational, natural language text to predict overall meaning, and
pull out relevant, detailed information.
• Often used in Chatbots or conversational bots.
Language Understanding
Use Cases of Language Understanding
• Automate capturing order for your application
• Automate social media feedback & response
• Conversation bot for your HR service, IT service, or other customer
services.
• Integrate with speech services for enabling your app to responding
voice request from users.
QnA Maker
• Natural Language Processing (NLP) service
• Create a natural conversational layer over your data
• Find the most appropriate answer for any input from your custom
knowledge base (KB) of information
Development Lifecycle for QnA Maker
Immersive Reader
• Embed text reading and comprehension capabilities into applications
• Features:
• Reading aloud,
• translating languages, and
• focusing attention through highlighting
• No machine learning expertise is required.
Shows content in a minimal reading view
Displays pictures of commonly used words
Highlights nouns, verbs, adjectives, and adverbs
Reads your content out loud to you
Translates your content into another language
Breaks down words into syllables
What proves that Immersive Reader helps people
with reading?
• A 2017 study by RTI International showed that reading comprehension
among groups of fourth-grade students improved an average of 10 percent.
Read the research.
Translator
• Support over 70+ languages & customizable.
• Easy integration via REST API
Text Analytics
• For text mining and text analysis
• Understand the context in a conversation better
• Sentiment analysis, opinion mining, key phrase extraction, language
detection, and named entity recognition
• More than 20 languages being supported (reference)
Text Analytics for Health
• Extract information from unstructured English-language text in clinical
documents such as: patient intake forms, doctor's notes, research
papers and discharge summaries
Decision
Personalizer
• Provide information about your users and content and receive the top
action to show your users.
• No need to clean and label data before using Personalizer.
• Provide feedback to Personalizer when it is convenient to you.
• View real-time analytics.
• Use Personalizer as part of a larger data science effort to validate
existing experiments.
Personalizer
Personalizer
Where can I use Personalizer?
• Personalize what article is highlighted on a news website.
• Display a personalized "recommended item" on a shopping website.
• Suggest user interface elements such as filters to apply to a specific
photo.
Where can I use Personalizer?
• Send information (features) about your users and the content
(actions) to personalize. Personalizer responds with the top action.
• Send feedback to Personalizer about how well the ranking worked as
a number typically between 0 and 1.
Vision
Computer Vision
• Computer vision is an area of artificial intelligence (AI) in which
software systems are designed to perceive the world visually, though
cameras, images, and video.
• Computer vision is one of the core areas of artificial intelligence (AI),
and focuses on creating solutions that enable AI-enabled applications
to "see" the world and make sense of it.
Use Cases of Computer Vision
• Analyze an image and suggest an appropriate caption.
• Suggest relevant tags that could be used to index an image.
• Categorize an image.
• Identify objects in an image.
• Detect faces and people in an image.
• Recognize celebrities and landmarks in an image.
• Read text in an image.
What can CV tell us?
• A black and white photo of a city
• A black and white photo of a large city
• A large white building in a city
Not only that! It tags too!
• Tagging
• Type of identified object
• Bounding Box
• Set of coordinates (Top, left, width and height)
Image Categorization
Categorization in 86-category taxonomy
abstract_ animal_horse building_street food_grilled others_ outdoor_road people_hand plant_tree text_menu
abstract_net animal_panda dark_ food_pizza outdoor_
outdoor_sportsf
ield people_many object_screen text_sign
abstract_nonph
oto building_ drink_ indoor_ outdoor_city
outdoor_stoner
ock people_portrait
object_sculptur
e trans_bicycle
abstract_rect building_arch drink_can
indoor_churchw
indow outdoor_field outdoor_street people_show sky_cloud trans_bus
abstract_shape
building_brickw
all dark_fire indoor_court outdoor_grass outdoor_water people_tattoo sky_sun trans_car
abstract_texture building_church dark_fireworks
indoor_doorwin
dows outdoor_house
outdoor_watersi
de people_young
people_swimmi
ng
trans_trainstatio
n
animal_ building_corner sky_object
indoor_markets
tore
outdoor_mount
ain people_ plant_
outdoor_pool
animal_bird
building_doorwi
ndows food_ indoor_room
outdoor_oceanb
each people_baby plant_branch text_
animal_cat building_pillar food_bread indoor_venue
outdoor_playgro
und people_crowd plant_flower text_mag
animal_dog building_stair food_fastfood dark_light outdoor_railway people_group plant_leaves text_map
Domain-specific content
Optical character recognition
Faith
CAN MOVE
MOUNTAINS
Some Additional Capabilities
• Detect image
• Detect image color schemes
• Generate thumbnails
• Moderate content
Custom Vision
• Azure Custom Vision is an image recognition service that lets you
build, deploy, and improve your own image identifiers.
• An image identifier applies labels (which represent classes or objects)
to images, according to their visual characteristics.
• The Custom Vision service uses a machine learning algorithm to
analyze images.
What can Custom Vision do?
• Classification
• Object Detection
• Export as standalone offline
model for your app
development.
Face Detection
Face Verification
Verification result: The two faces belong to the same
person. Confidence is 0.93468.
Perceived emotion recognition
Video Indexer
• Video Indexer provides ability to extract deep
insights (with no need for data analysis or coding
skills) using machine learning models based on
multiple channels (voice, vocals, visual).
• The service enables deep search, reduces
operational costs, enables new monetization
opportunities, and creates new user experiences on
large archives of videos (with low entry barriers).
Video Indexer
• Keywords extraction
• Named entities extraction
• Topic inference
• Artifacts Sentiment analysis: Identifies positive, negative, and neutral
sentiments from speech and visual text.
Video Indexer
Use Cases of Video Indexer
• Deep search
• Content creation
• Accessibility.
• Monetization
• Content moderation
• Recommendations
Video Indexer
Face detection
Celebrity identification
Account-based face identification
Visual text recognition
Visual content moderation
Labels identification
Scene segmentation
Shot detection
Black frame detection
Keyframe extraction
Rolling credits
Animated characters detection
Editorial shot type detection
Audio transcription
Automatic language detection
Multi-language speech identification and transcription
Two channel processing
Closed captioning
Noise reduction
Transcript customization (CRIS)
Speaker enumeration
Speaker statistics
Textual content moderation
Audio effects
Emotion detection
Translation
Form Recognizer
• Extract text and data from business’s forms and documents.
• Easily extract text and structure, with simple REST API
• Pre-trained model:
• Receipt
• Business Card
• Layouts
• Custom Trained Model
• Supports printed and handwritten forms, PDFs and images.
• Container support
What can you do with Form Recognizer?
• Automate written text > digital text conversion
• Automate capturing receipt data
• Automate converting business card into digital contacts
Sample Form Recognizer tool
• Client library / REST API quickstart (all languages, multiple scenarios)
• Web UI quickstarts
• Train with labels - sample labeling tool
• REST samples (GitHub)
• Extract text, selection marks and table structure from documents
• Extract layout data - Python
Sample Form Recognizer tool
• Train custom models and extract form data
• Train without labels - Python
• Train with labels - Python
• Extract data from invoices
• Extract invoice data - Python
• Extract data from sales receipts
• Extract receipt data - Python
• Extract data from business cards
• Extract business card data - Python
Things you may be interested in.
•AI Labs@hmheng (GitHub):
https://github.com/hmheng/AILabs
Extras Add-on:
• Cognitive Service
• https://github.com/Azure-Samples/cognitive-services-python-sdk-
samples/tree/deb4483ec1db33138e32ce472a775f3ed874fb80?WT.mc_id=AI-MVP-5003227
• Personalizer
• https://azure.microsoft.com/en-us/services/cognitive-services/personalizer?WT.mc_id=AI-MVP-5003227
• https://personalizationdemo.azurewebsites.net
• Speech-To-Text
• https://azure.microsoft.com/en-us/services/cognitive-services/speech-to-text?WT.mc_id=AI-MVP-5003227
• Speech
• https://azure.microsoft.com/en-us/services/cognitive-services/speech-to-text?WT.mc_id=AI-MVP-5003227
• https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/speech-synthesis-
markup?WT.mc_id=AI-MVP-5003227
Extras Add-on:
• Bing Search
• https://docs.microsoft.com/en-us/azure/cognitive-services/bing-web-search?WT.mc_id=AI-MVP-5003227
• Personalizer
• https://azure.microsoft.com/en-us/services/cognitive-services/personalizer?WT.mc_id=AI-MVP-5003227
• https://personalizationdemo.azurewebsites.net
• Speech-To-Text
• https://azure.microsoft.com/en-us/services/cognitive-services/speech-to-text?WT.mc_id=AI-MVP-5003227
• Speech
• https://azure.microsoft.com/en-us/services/cognitive-services/speech-to-text?WT.mc_id=AI-MVP-5003227
• https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/speech-synthesis-
markup?WT.mc_id=AI-MVP-5003227
Extras Add-on:
• Form Recognizer:
• https://docs.microsoft.com/en-us/azure/cognitive-services/form-
recognizer/language-support?WT.mc_id=AI-MVP-5003227
• https://docs.microsoft.com/en-us/azure/cognitive-services/form-
recognizer/quickstarts/label-tool?WT.mc_id=AI-MVP-5003227
• Form OCR Testing Tool
• On Azure @ https://fott.azurewebsites.net/
• GitHub @ https://github.com/microsoft/OCR-Form-
Tools/?ocid=AID2463683&WT.mc_id=AI-MVP-5003227
Thank you!
Marvin Heng | @hmheng
www.techconnect.io

More Related Content

What's hot

Microsoft Azure beyond IaaS
Microsoft Azure  beyond IaaSMicrosoft Azure  beyond IaaS
Microsoft Azure beyond IaaSBipeen Sinha
 
Discover AI with Microsoft Azure
Discover AI with Microsoft AzureDiscover AI with Microsoft Azure
Discover AI with Microsoft AzureJürgen Ambrosi
 
Understanding Pre-Built AI: AI for Every Developer
Understanding Pre-Built AI: AI for Every DeveloperUnderstanding Pre-Built AI: AI for Every Developer
Understanding Pre-Built AI: AI for Every DeveloperAI Leadership Institute
 
Artificial Intelligence API Services Compared
Artificial Intelligence API Services ComparedArtificial Intelligence API Services Compared
Artificial Intelligence API Services ComparedCraig Milroy
 
Microsoft AI Overview: Cognitive Services
Microsoft AI Overview: Cognitive ServicesMicrosoft AI Overview: Cognitive Services
Microsoft AI Overview: Cognitive ServicesAI Leadership Institute
 
Global Azure2021 Verona.pptx
Global Azure2021 Verona.pptxGlobal Azure2021 Verona.pptx
Global Azure2021 Verona.pptxLuis Beltran
 
Compared: IBM Watson Services / Microsoft Azure Services
Compared: IBM Watson Services / Microsoft Azure ServicesCompared: IBM Watson Services / Microsoft Azure Services
Compared: IBM Watson Services / Microsoft Azure ServicesCraig Milroy
 
Overview of Microsoft Azure AI Services
Overview of Microsoft Azure AI ServicesOverview of Microsoft Azure AI Services
Overview of Microsoft Azure AI ServicesCraig Milroy
 
Intro to Microsoft Cognitive Services
Intro to Microsoft Cognitive ServicesIntro to Microsoft Cognitive Services
Intro to Microsoft Cognitive ServicesAmanda Lange
 
Custom vision app step by step and cognitive service quick view
Custom vision app step by step and cognitive service quick viewCustom vision app step by step and cognitive service quick view
Custom vision app step by step and cognitive service quick viewR Ladies Taipei
 
Cognitive Services en acción
Cognitive Services en acciónCognitive Services en acción
Cognitive Services en acciónPlain Concepts
 
Build your model using azure custom vision and deploy it in a webapp
Build your model using azure custom vision and deploy it in a webappBuild your model using azure custom vision and deploy it in a webapp
Build your model using azure custom vision and deploy it in a webappCodeOps Technologies LLP
 
Microsoft Cognitive Services
Microsoft Cognitive ServicesMicrosoft Cognitive Services
Microsoft Cognitive ServicesJoshua Drew
 
Cloud Skills Challenge.pptx
Cloud Skills Challenge.pptxCloud Skills Challenge.pptx
Cloud Skills Challenge.pptxLuis Beltran
 
IBM Cloud Artificial Intelligence : A Comprehensive Overview
IBM Cloud Artificial Intelligence : A Comprehensive OverviewIBM Cloud Artificial Intelligence : A Comprehensive Overview
IBM Cloud Artificial Intelligence : A Comprehensive OverviewSatyajit Panda
 
AI-900: Microsoft Azure AI Fundamentals 2021
AI-900: Microsoft Azure AI Fundamentals 2021AI-900: Microsoft Azure AI Fundamentals 2021
AI-900: Microsoft Azure AI Fundamentals 2021Sean Xie
 
Microsoft Cognitive Services - pratibha sharma
Microsoft Cognitive Services - pratibha sharmaMicrosoft Cognitive Services - pratibha sharma
Microsoft Cognitive Services - pratibha sharmaPratibha Sharma
 
DLD TLV Cognitive Services: The Brains Behind Your Bot
DLD TLV Cognitive Services:The Brains Behind Your BotDLD TLV Cognitive Services:The Brains Behind Your Bot
DLD TLV Cognitive Services: The Brains Behind Your BotAaron (Ari) Bornstein
 

What's hot (20)

Microsoft Azure beyond IaaS
Microsoft Azure  beyond IaaSMicrosoft Azure  beyond IaaS
Microsoft Azure beyond IaaS
 
Azure beyond IaaS
Azure  beyond IaaSAzure  beyond IaaS
Azure beyond IaaS
 
Discover AI with Microsoft Azure
Discover AI with Microsoft AzureDiscover AI with Microsoft Azure
Discover AI with Microsoft Azure
 
Understanding Pre-Built AI: AI for Every Developer
Understanding Pre-Built AI: AI for Every DeveloperUnderstanding Pre-Built AI: AI for Every Developer
Understanding Pre-Built AI: AI for Every Developer
 
Artificial Intelligence API Services Compared
Artificial Intelligence API Services ComparedArtificial Intelligence API Services Compared
Artificial Intelligence API Services Compared
 
Microsoft AI Overview: Cognitive Services
Microsoft AI Overview: Cognitive ServicesMicrosoft AI Overview: Cognitive Services
Microsoft AI Overview: Cognitive Services
 
Intelligent ChatBot
Intelligent ChatBotIntelligent ChatBot
Intelligent ChatBot
 
Global Azure2021 Verona.pptx
Global Azure2021 Verona.pptxGlobal Azure2021 Verona.pptx
Global Azure2021 Verona.pptx
 
Compared: IBM Watson Services / Microsoft Azure Services
Compared: IBM Watson Services / Microsoft Azure ServicesCompared: IBM Watson Services / Microsoft Azure Services
Compared: IBM Watson Services / Microsoft Azure Services
 
Overview of Microsoft Azure AI Services
Overview of Microsoft Azure AI ServicesOverview of Microsoft Azure AI Services
Overview of Microsoft Azure AI Services
 
Intro to Microsoft Cognitive Services
Intro to Microsoft Cognitive ServicesIntro to Microsoft Cognitive Services
Intro to Microsoft Cognitive Services
 
Custom vision app step by step and cognitive service quick view
Custom vision app step by step and cognitive service quick viewCustom vision app step by step and cognitive service quick view
Custom vision app step by step and cognitive service quick view
 
Cognitive Services en acción
Cognitive Services en acciónCognitive Services en acción
Cognitive Services en acción
 
Build your model using azure custom vision and deploy it in a webapp
Build your model using azure custom vision and deploy it in a webappBuild your model using azure custom vision and deploy it in a webapp
Build your model using azure custom vision and deploy it in a webapp
 
Microsoft Cognitive Services
Microsoft Cognitive ServicesMicrosoft Cognitive Services
Microsoft Cognitive Services
 
Cloud Skills Challenge.pptx
Cloud Skills Challenge.pptxCloud Skills Challenge.pptx
Cloud Skills Challenge.pptx
 
IBM Cloud Artificial Intelligence : A Comprehensive Overview
IBM Cloud Artificial Intelligence : A Comprehensive OverviewIBM Cloud Artificial Intelligence : A Comprehensive Overview
IBM Cloud Artificial Intelligence : A Comprehensive Overview
 
AI-900: Microsoft Azure AI Fundamentals 2021
AI-900: Microsoft Azure AI Fundamentals 2021AI-900: Microsoft Azure AI Fundamentals 2021
AI-900: Microsoft Azure AI Fundamentals 2021
 
Microsoft Cognitive Services - pratibha sharma
Microsoft Cognitive Services - pratibha sharmaMicrosoft Cognitive Services - pratibha sharma
Microsoft Cognitive Services - pratibha sharma
 
DLD TLV Cognitive Services: The Brains Behind Your Bot
DLD TLV Cognitive Services:The Brains Behind Your BotDLD TLV Cognitive Services:The Brains Behind Your Bot
DLD TLV Cognitive Services: The Brains Behind Your Bot
 

Similar to Microsoft Cognitive Services at a Glance

Mobile & Cognitive Services | Harnessing the Power of IoT – Xamarin Experienc...
Mobile & Cognitive Services | Harnessing the Power of IoT – Xamarin Experienc...Mobile & Cognitive Services | Harnessing the Power of IoT – Xamarin Experienc...
Mobile & Cognitive Services | Harnessing the Power of IoT – Xamarin Experienc...Xpand IT
 
AI Days LATAM Intelligent Apps and Agents
AI Days LATAM Intelligent Apps and AgentsAI Days LATAM Intelligent Apps and Agents
AI Days LATAM Intelligent Apps and AgentsAI Leadership Institute
 
Big Data LDN 2017: Applied AI using Cognitive Services
Big Data LDN 2017: Applied AI using Cognitive ServicesBig Data LDN 2017: Applied AI using Cognitive Services
Big Data LDN 2017: Applied AI using Cognitive ServicesMatt Stubbs
 
ABTO Software computer vision 2016
ABTO Software computer vision 2016ABTO Software computer vision 2016
ABTO Software computer vision 2016ABTO Software
 
ai_pptIntroduction to AI tools , NLP.pptx
ai_pptIntroduction to AI tools , NLP.pptxai_pptIntroduction to AI tools , NLP.pptx
ai_pptIntroduction to AI tools , NLP.pptxanushya2915
 
Microsoft cognitive services
Microsoft cognitive servicesMicrosoft cognitive services
Microsoft cognitive servicesAli Zoljodi
 
SG_UserGroup_Oct20_2022_NLP_AzureLangStudio.pptx
SG_UserGroup_Oct20_2022_NLP_AzureLangStudio.pptxSG_UserGroup_Oct20_2022_NLP_AzureLangStudio.pptx
SG_UserGroup_Oct20_2022_NLP_AzureLangStudio.pptxPriyankaShah668821
 
Cognitive Services: Building Smart Apps with Speech, NLP & Vision
Cognitive Services: Building Smart Apps with Speech, NLP & VisionCognitive Services: Building Smart Apps with Speech, NLP & Vision
Cognitive Services: Building Smart Apps with Speech, NLP & VisionNick Landry
 
Computers have feelings too
Computers have feelings tooComputers have feelings too
Computers have feelings tooPaul Glavich
 
Computer Vision Software Development.pdf
Computer Vision Software Development.pdfComputer Vision Software Development.pdf
Computer Vision Software Development.pdfJohnAdams514191
 
Automatic multi-modal metadata annotation based on trained cognitive solution...
Automatic multi-modal metadata annotation based on trained cognitive solution...Automatic multi-modal metadata annotation based on trained cognitive solution...
Automatic multi-modal metadata annotation based on trained cognitive solution...FIAT/IFTA
 
Intelligent Apps - Amplifying Human Ingenuity
Intelligent Apps - Amplifying Human IngenuityIntelligent Apps - Amplifying Human Ingenuity
Intelligent Apps - Amplifying Human IngenuityDavid J Rosenthal
 
Computer Vision and Text Analytics Using Azure Cognitive Services
Computer Vision and Text Analytics Using Azure Cognitive ServicesComputer Vision and Text Analytics Using Azure Cognitive Services
Computer Vision and Text Analytics Using Azure Cognitive ServicesUmme Rubaiyat Chowdhury
 
Rita Arrigo, Microsoft
Rita Arrigo, Microsoft Rita Arrigo, Microsoft
Rita Arrigo, Microsoft Hilary Ip
 
Using Cognitive Services
Using Cognitive ServicesUsing Cognitive Services
Using Cognitive ServicesEng Teong Cheah
 
Introduction to AI & ML
Introduction to AI & MLIntroduction to AI & ML
Introduction to AI & MLMandy Sidana
 
Yurii Pashchenko: Tips and tricks for building your own automated visual data...
Yurii Pashchenko: Tips and tricks for building your own automated visual data...Yurii Pashchenko: Tips and tricks for building your own automated visual data...
Yurii Pashchenko: Tips and tricks for building your own automated visual data...Lviv Startup Club
 
Machine Learning with Azure Cognitive Services - Face Recognition and Deep Fa...
Machine Learning with Azure Cognitive Services - Face Recognition and Deep Fa...Machine Learning with Azure Cognitive Services - Face Recognition and Deep Fa...
Machine Learning with Azure Cognitive Services - Face Recognition and Deep Fa...CodeOps Technologies LLP
 

Similar to Microsoft Cognitive Services at a Glance (20)

Mobile & Cognitive Services | Harnessing the Power of IoT – Xamarin Experienc...
Mobile & Cognitive Services | Harnessing the Power of IoT – Xamarin Experienc...Mobile & Cognitive Services | Harnessing the Power of IoT – Xamarin Experienc...
Mobile & Cognitive Services | Harnessing the Power of IoT – Xamarin Experienc...
 
AI Days LATAM Intelligent Apps and Agents
AI Days LATAM Intelligent Apps and AgentsAI Days LATAM Intelligent Apps and Agents
AI Days LATAM Intelligent Apps and Agents
 
Big Data LDN 2017: Applied AI using Cognitive Services
Big Data LDN 2017: Applied AI using Cognitive ServicesBig Data LDN 2017: Applied AI using Cognitive Services
Big Data LDN 2017: Applied AI using Cognitive Services
 
ABTO Software computer vision 2016
ABTO Software computer vision 2016ABTO Software computer vision 2016
ABTO Software computer vision 2016
 
ai_pptIntroduction to AI tools , NLP.pptx
ai_pptIntroduction to AI tools , NLP.pptxai_pptIntroduction to AI tools , NLP.pptx
ai_pptIntroduction to AI tools , NLP.pptx
 
Microsoft cognitive services
Microsoft cognitive servicesMicrosoft cognitive services
Microsoft cognitive services
 
SG_UserGroup_Oct20_2022_NLP_AzureLangStudio.pptx
SG_UserGroup_Oct20_2022_NLP_AzureLangStudio.pptxSG_UserGroup_Oct20_2022_NLP_AzureLangStudio.pptx
SG_UserGroup_Oct20_2022_NLP_AzureLangStudio.pptx
 
Cognitive Services: Building Smart Apps with Speech, NLP & Vision
Cognitive Services: Building Smart Apps with Speech, NLP & VisionCognitive Services: Building Smart Apps with Speech, NLP & Vision
Cognitive Services: Building Smart Apps with Speech, NLP & Vision
 
Computers have feelings too
Computers have feelings tooComputers have feelings too
Computers have feelings too
 
Computer Vision Software Development.pdf
Computer Vision Software Development.pdfComputer Vision Software Development.pdf
Computer Vision Software Development.pdf
 
Automatic multi-modal metadata annotation based on trained cognitive solution...
Automatic multi-modal metadata annotation based on trained cognitive solution...Automatic multi-modal metadata annotation based on trained cognitive solution...
Automatic multi-modal metadata annotation based on trained cognitive solution...
 
Intelligent Apps - Amplifying Human Ingenuity
Intelligent Apps - Amplifying Human IngenuityIntelligent Apps - Amplifying Human Ingenuity
Intelligent Apps - Amplifying Human Ingenuity
 
Computer Vision and Text Analytics Using Azure Cognitive Services
Computer Vision and Text Analytics Using Azure Cognitive ServicesComputer Vision and Text Analytics Using Azure Cognitive Services
Computer Vision and Text Analytics Using Azure Cognitive Services
 
Rita Arrigo, Microsoft
Rita Arrigo, Microsoft Rita Arrigo, Microsoft
Rita Arrigo, Microsoft
 
Using Cognitive Services
Using Cognitive ServicesUsing Cognitive Services
Using Cognitive Services
 
Introduction to AI & ML
Introduction to AI & MLIntroduction to AI & ML
Introduction to AI & ML
 
Yurii Pashchenko: Tips and tricks for building your own automated visual data...
Yurii Pashchenko: Tips and tricks for building your own automated visual data...Yurii Pashchenko: Tips and tricks for building your own automated visual data...
Yurii Pashchenko: Tips and tricks for building your own automated visual data...
 
Machine Learning with Azure Cognitive Services - Face Recognition and Deep Fa...
Machine Learning with Azure Cognitive Services - Face Recognition and Deep Fa...Machine Learning with Azure Cognitive Services - Face Recognition and Deep Fa...
Machine Learning with Azure Cognitive Services - Face Recognition and Deep Fa...
 
Machine Learning in the AWS Cloud
Machine Learning in the AWS CloudMachine Learning in the AWS Cloud
Machine Learning in the AWS Cloud
 
AI NOTES.docx
AI NOTES.docxAI NOTES.docx
AI NOTES.docx
 

More from Marvin Heng

Accelerating Personal Development through Microsoft Certifications
Accelerating Personal Development through Microsoft CertificationsAccelerating Personal Development through Microsoft Certifications
Accelerating Personal Development through Microsoft CertificationsMarvin Heng
 
Microsoft BotFramework - Global AI Bootcamp Nepal 2022
Microsoft BotFramework - Global AI Bootcamp Nepal 2022Microsoft BotFramework - Global AI Bootcamp Nepal 2022
Microsoft BotFramework - Global AI Bootcamp Nepal 2022Marvin Heng
 
What's New With Azure AI
What's New With Azure AIWhat's New With Azure AI
What's New With Azure AIMarvin Heng
 
Intelligent Mobile App with Azure Custom Vision
Intelligent Mobile App with Azure Custom VisionIntelligent Mobile App with Azure Custom Vision
Intelligent Mobile App with Azure Custom VisionMarvin Heng
 
Bot & AI - A Bot for Productivity
Bot & AI - A Bot for ProductivityBot & AI - A Bot for Productivity
Bot & AI - A Bot for ProductivityMarvin Heng
 
Artificial Intelligence - Tell You What I See
Artificial Intelligence - Tell You What I SeeArtificial Intelligence - Tell You What I See
Artificial Intelligence - Tell You What I SeeMarvin Heng
 
Handwriting Detection with Microsoft Cognitive Services
Handwriting Detection with Microsoft Cognitive ServicesHandwriting Detection with Microsoft Cognitive Services
Handwriting Detection with Microsoft Cognitive ServicesMarvin Heng
 
Create a Q&A Bot to Serve Your Customers
Create a Q&A Bot to Serve Your CustomersCreate a Q&A Bot to Serve Your Customers
Create a Q&A Bot to Serve Your CustomersMarvin Heng
 
Facial Analysis with Angular Web App & ASP.NET Core
Facial Analysis with Angular Web App & ASP.NET CoreFacial Analysis with Angular Web App & ASP.NET Core
Facial Analysis with Angular Web App & ASP.NET CoreMarvin Heng
 
AI/ML/DL: Introduction to Deep Learning with Cognitive ToolKit
AI/ML/DL: Introduction to Deep Learning with Cognitive ToolKitAI/ML/DL: Introduction to Deep Learning with Cognitive ToolKit
AI/ML/DL: Introduction to Deep Learning with Cognitive ToolKitMarvin Heng
 
AI/ML/DL: Getting Started with Machine Learning on Azure
AI/ML/DL: Getting Started with Machine Learning on AzureAI/ML/DL: Getting Started with Machine Learning on Azure
AI/ML/DL: Getting Started with Machine Learning on AzureMarvin Heng
 
AI: Integrate Search Function into Your App Using Bing Search API.
AI: Integrate Search Function into Your App Using Bing Search API.AI: Integrate Search Function into Your App Using Bing Search API.
AI: Integrate Search Function into Your App Using Bing Search API.Marvin Heng
 
Artificial Intelligent: Intelligent Bot With Microsoft Bot Framework & Azure
Artificial Intelligent: Intelligent Bot With Microsoft Bot Framework & AzureArtificial Intelligent: Intelligent Bot With Microsoft Bot Framework & Azure
Artificial Intelligent: Intelligent Bot With Microsoft Bot Framework & AzureMarvin Heng
 
AI: Mobile Apps That Understands Your Intention When You Typed
AI: Mobile Apps That Understands Your Intention When You TypedAI: Mobile Apps That Understands Your Intention When You Typed
AI: Mobile Apps That Understands Your Intention When You TypedMarvin Heng
 
App Development: Create Cross Platform Mobile App with .NETStandard 2.0
App Development: Create Cross Platform Mobile App with .NETStandard 2.0App Development: Create Cross Platform Mobile App with .NETStandard 2.0
App Development: Create Cross Platform Mobile App with .NETStandard 2.0Marvin Heng
 
App Development: Create Cross Platform Mobile App with Just 6 Steps
App Development: Create Cross Platform Mobile App with Just 6 StepsApp Development: Create Cross Platform Mobile App with Just 6 Steps
App Development: Create Cross Platform Mobile App with Just 6 StepsMarvin Heng
 
AI: Together We Make World Smarter With LUIS
AI: Together We Make World Smarter With LUISAI: Together We Make World Smarter With LUIS
AI: Together We Make World Smarter With LUISMarvin Heng
 
Cloud: Publish First Web Application to Azure Using Visual Studio
Cloud: Publish First Web Application to Azure Using Visual StudioCloud: Publish First Web Application to Azure Using Visual Studio
Cloud: Publish First Web Application to Azure Using Visual StudioMarvin Heng
 

More from Marvin Heng (18)

Accelerating Personal Development through Microsoft Certifications
Accelerating Personal Development through Microsoft CertificationsAccelerating Personal Development through Microsoft Certifications
Accelerating Personal Development through Microsoft Certifications
 
Microsoft BotFramework - Global AI Bootcamp Nepal 2022
Microsoft BotFramework - Global AI Bootcamp Nepal 2022Microsoft BotFramework - Global AI Bootcamp Nepal 2022
Microsoft BotFramework - Global AI Bootcamp Nepal 2022
 
What's New With Azure AI
What's New With Azure AIWhat's New With Azure AI
What's New With Azure AI
 
Intelligent Mobile App with Azure Custom Vision
Intelligent Mobile App with Azure Custom VisionIntelligent Mobile App with Azure Custom Vision
Intelligent Mobile App with Azure Custom Vision
 
Bot & AI - A Bot for Productivity
Bot & AI - A Bot for ProductivityBot & AI - A Bot for Productivity
Bot & AI - A Bot for Productivity
 
Artificial Intelligence - Tell You What I See
Artificial Intelligence - Tell You What I SeeArtificial Intelligence - Tell You What I See
Artificial Intelligence - Tell You What I See
 
Handwriting Detection with Microsoft Cognitive Services
Handwriting Detection with Microsoft Cognitive ServicesHandwriting Detection with Microsoft Cognitive Services
Handwriting Detection with Microsoft Cognitive Services
 
Create a Q&A Bot to Serve Your Customers
Create a Q&A Bot to Serve Your CustomersCreate a Q&A Bot to Serve Your Customers
Create a Q&A Bot to Serve Your Customers
 
Facial Analysis with Angular Web App & ASP.NET Core
Facial Analysis with Angular Web App & ASP.NET CoreFacial Analysis with Angular Web App & ASP.NET Core
Facial Analysis with Angular Web App & ASP.NET Core
 
AI/ML/DL: Introduction to Deep Learning with Cognitive ToolKit
AI/ML/DL: Introduction to Deep Learning with Cognitive ToolKitAI/ML/DL: Introduction to Deep Learning with Cognitive ToolKit
AI/ML/DL: Introduction to Deep Learning with Cognitive ToolKit
 
AI/ML/DL: Getting Started with Machine Learning on Azure
AI/ML/DL: Getting Started with Machine Learning on AzureAI/ML/DL: Getting Started with Machine Learning on Azure
AI/ML/DL: Getting Started with Machine Learning on Azure
 
AI: Integrate Search Function into Your App Using Bing Search API.
AI: Integrate Search Function into Your App Using Bing Search API.AI: Integrate Search Function into Your App Using Bing Search API.
AI: Integrate Search Function into Your App Using Bing Search API.
 
Artificial Intelligent: Intelligent Bot With Microsoft Bot Framework & Azure
Artificial Intelligent: Intelligent Bot With Microsoft Bot Framework & AzureArtificial Intelligent: Intelligent Bot With Microsoft Bot Framework & Azure
Artificial Intelligent: Intelligent Bot With Microsoft Bot Framework & Azure
 
AI: Mobile Apps That Understands Your Intention When You Typed
AI: Mobile Apps That Understands Your Intention When You TypedAI: Mobile Apps That Understands Your Intention When You Typed
AI: Mobile Apps That Understands Your Intention When You Typed
 
App Development: Create Cross Platform Mobile App with .NETStandard 2.0
App Development: Create Cross Platform Mobile App with .NETStandard 2.0App Development: Create Cross Platform Mobile App with .NETStandard 2.0
App Development: Create Cross Platform Mobile App with .NETStandard 2.0
 
App Development: Create Cross Platform Mobile App with Just 6 Steps
App Development: Create Cross Platform Mobile App with Just 6 StepsApp Development: Create Cross Platform Mobile App with Just 6 Steps
App Development: Create Cross Platform Mobile App with Just 6 Steps
 
AI: Together We Make World Smarter With LUIS
AI: Together We Make World Smarter With LUISAI: Together We Make World Smarter With LUIS
AI: Together We Make World Smarter With LUIS
 
Cloud: Publish First Web Application to Azure Using Visual Studio
Cloud: Publish First Web Application to Azure Using Visual StudioCloud: Publish First Web Application to Azure Using Visual Studio
Cloud: Publish First Web Application to Azure Using Visual Studio
 

Recently uploaded

"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 

Recently uploaded (20)

"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 

Microsoft Cognitive Services at a Glance

  • 1. Microsoft Cognitive Services at a Glance Marvin Heng | @hmheng www.techconnect.io
  • 2.
  • 3.
  • 5. Azure Cognitive Services From faces to feelings, allow your apps to understand images and video Hear and speak to your users by filtering noise, identifying speakers, and understanding intent Process text and learn how to recognize what users want Tap into rich knowledge amassed from the web, academia, or your own data Access billions of web pages, images, videos, and news with the power of Bing APIs
  • 6. Why Azure Cognitive Services ?
  • 7. Cognitive Services Emotion Computer Vision Face Video Indexer Form Recognizer Speech To Text Text To Speech Speech Translation Speaker Recognition Immersive Reader Language Understanding QnA Maker Text Analytics Translator Anomaly Detector Content Moderator Metrics Advisor Personalizer Bing Autosuggest Bing Custom Search Bing Entity Search Bing Image Search Bing News Search Bing Spell Check Bing Video Search Bing Visual Search Bing Web Search Custom Vision
  • 9. Bing Search • Allow developers to integrate a search function to their apps that allows users to find webpages, images, news, locations, and more without advertisements • For knowledge mining
  • 10. Bing Search Autosuggest Entity Search Custom Search Image Search News Search Video Search Visual Search Spell Check Local Business
  • 12. Speech-to-Text • Speech-to-text service • Improves meeting efficiency by transcribing conversations in real-time • Help safeguard data with industry-leading security and compliance certifications. • Integrates with a variety of meeting conference solutions including Microsoft Teams and other third-party meeting software. • SDK is available.
  • 15. Speaker Verification • Text-dependent verification means speakers need to choose the same passphrase to use during both enrollment and verification phases. • Text-independent verification means speakers can speak in everyday language in the enrollment and verification phrases.
  • 16. Text-to-Speech • Convert text into human-like synthesized speech. • Offer 75+ standard in more than 45 languages and locales, and 5 neural voices • Tune voice output by easily adjusting rate, pitch, pronunciation, pauses, and more. • Speech synthesis • Asynchronous synthesis of long audio • Speech Synthesis Markup Language (SSML)
  • 17.
  • 19. Speech Translation • Speech-to-text translation with recognition results. • Speech-to-speech translation. • Support for translation to multiple target languages. • Interim recognition and translation results.
  • 20.
  • 21. Use case of Speech Service
  • 23. Language Understanding • Applies custom machine-learning intelligence to a user's conversational, natural language text to predict overall meaning, and pull out relevant, detailed information. • Often used in Chatbots or conversational bots.
  • 25. Use Cases of Language Understanding • Automate capturing order for your application • Automate social media feedback & response • Conversation bot for your HR service, IT service, or other customer services. • Integrate with speech services for enabling your app to responding voice request from users.
  • 26.
  • 27. QnA Maker • Natural Language Processing (NLP) service • Create a natural conversational layer over your data • Find the most appropriate answer for any input from your custom knowledge base (KB) of information
  • 29. Immersive Reader • Embed text reading and comprehension capabilities into applications • Features: • Reading aloud, • translating languages, and • focusing attention through highlighting • No machine learning expertise is required.
  • 30. Shows content in a minimal reading view
  • 31. Displays pictures of commonly used words
  • 32. Highlights nouns, verbs, adjectives, and adverbs
  • 33. Reads your content out loud to you
  • 34. Translates your content into another language
  • 35. Breaks down words into syllables
  • 36. What proves that Immersive Reader helps people with reading? • A 2017 study by RTI International showed that reading comprehension among groups of fourth-grade students improved an average of 10 percent. Read the research.
  • 37.
  • 38. Translator • Support over 70+ languages & customizable. • Easy integration via REST API
  • 39. Text Analytics • For text mining and text analysis • Understand the context in a conversation better • Sentiment analysis, opinion mining, key phrase extraction, language detection, and named entity recognition • More than 20 languages being supported (reference)
  • 40.
  • 41. Text Analytics for Health • Extract information from unstructured English-language text in clinical documents such as: patient intake forms, doctor's notes, research papers and discharge summaries
  • 43. Personalizer • Provide information about your users and content and receive the top action to show your users. • No need to clean and label data before using Personalizer. • Provide feedback to Personalizer when it is convenient to you. • View real-time analytics. • Use Personalizer as part of a larger data science effort to validate existing experiments.
  • 46. Where can I use Personalizer? • Personalize what article is highlighted on a news website. • Display a personalized "recommended item" on a shopping website. • Suggest user interface elements such as filters to apply to a specific photo.
  • 47. Where can I use Personalizer? • Send information (features) about your users and the content (actions) to personalize. Personalizer responds with the top action. • Send feedback to Personalizer about how well the ranking worked as a number typically between 0 and 1.
  • 49. Computer Vision • Computer vision is an area of artificial intelligence (AI) in which software systems are designed to perceive the world visually, though cameras, images, and video. • Computer vision is one of the core areas of artificial intelligence (AI), and focuses on creating solutions that enable AI-enabled applications to "see" the world and make sense of it.
  • 50. Use Cases of Computer Vision • Analyze an image and suggest an appropriate caption. • Suggest relevant tags that could be used to index an image. • Categorize an image. • Identify objects in an image. • Detect faces and people in an image. • Recognize celebrities and landmarks in an image. • Read text in an image.
  • 51. What can CV tell us? • A black and white photo of a city • A black and white photo of a large city • A large white building in a city
  • 52. Not only that! It tags too! • Tagging • Type of identified object • Bounding Box • Set of coordinates (Top, left, width and height)
  • 54. Categorization in 86-category taxonomy abstract_ animal_horse building_street food_grilled others_ outdoor_road people_hand plant_tree text_menu abstract_net animal_panda dark_ food_pizza outdoor_ outdoor_sportsf ield people_many object_screen text_sign abstract_nonph oto building_ drink_ indoor_ outdoor_city outdoor_stoner ock people_portrait object_sculptur e trans_bicycle abstract_rect building_arch drink_can indoor_churchw indow outdoor_field outdoor_street people_show sky_cloud trans_bus abstract_shape building_brickw all dark_fire indoor_court outdoor_grass outdoor_water people_tattoo sky_sun trans_car abstract_texture building_church dark_fireworks indoor_doorwin dows outdoor_house outdoor_watersi de people_young people_swimmi ng trans_trainstatio n animal_ building_corner sky_object indoor_markets tore outdoor_mount ain people_ plant_ outdoor_pool animal_bird building_doorwi ndows food_ indoor_room outdoor_oceanb each people_baby plant_branch text_ animal_cat building_pillar food_bread indoor_venue outdoor_playgro und people_crowd plant_flower text_mag animal_dog building_stair food_fastfood dark_light outdoor_railway people_group plant_leaves text_map
  • 57. Some Additional Capabilities • Detect image • Detect image color schemes • Generate thumbnails • Moderate content
  • 58.
  • 59. Custom Vision • Azure Custom Vision is an image recognition service that lets you build, deploy, and improve your own image identifiers. • An image identifier applies labels (which represent classes or objects) to images, according to their visual characteristics. • The Custom Vision service uses a machine learning algorithm to analyze images.
  • 60. What can Custom Vision do? • Classification • Object Detection • Export as standalone offline model for your app development.
  • 61.
  • 63. Face Verification Verification result: The two faces belong to the same person. Confidence is 0.93468.
  • 65.
  • 66. Video Indexer • Video Indexer provides ability to extract deep insights (with no need for data analysis or coding skills) using machine learning models based on multiple channels (voice, vocals, visual). • The service enables deep search, reduces operational costs, enables new monetization opportunities, and creates new user experiences on large archives of videos (with low entry barriers).
  • 67. Video Indexer • Keywords extraction • Named entities extraction • Topic inference • Artifacts Sentiment analysis: Identifies positive, negative, and neutral sentiments from speech and visual text.
  • 69. Use Cases of Video Indexer • Deep search • Content creation • Accessibility. • Monetization • Content moderation • Recommendations
  • 70. Video Indexer Face detection Celebrity identification Account-based face identification Visual text recognition Visual content moderation Labels identification Scene segmentation Shot detection Black frame detection Keyframe extraction Rolling credits Animated characters detection Editorial shot type detection Audio transcription Automatic language detection Multi-language speech identification and transcription Two channel processing Closed captioning Noise reduction Transcript customization (CRIS) Speaker enumeration Speaker statistics Textual content moderation Audio effects Emotion detection Translation
  • 71. Form Recognizer • Extract text and data from business’s forms and documents. • Easily extract text and structure, with simple REST API • Pre-trained model: • Receipt • Business Card • Layouts • Custom Trained Model • Supports printed and handwritten forms, PDFs and images. • Container support
  • 72. What can you do with Form Recognizer? • Automate written text > digital text conversion • Automate capturing receipt data • Automate converting business card into digital contacts
  • 73.
  • 74. Sample Form Recognizer tool • Client library / REST API quickstart (all languages, multiple scenarios) • Web UI quickstarts • Train with labels - sample labeling tool • REST samples (GitHub) • Extract text, selection marks and table structure from documents • Extract layout data - Python
  • 75. Sample Form Recognizer tool • Train custom models and extract form data • Train without labels - Python • Train with labels - Python • Extract data from invoices • Extract invoice data - Python • Extract data from sales receipts • Extract receipt data - Python • Extract data from business cards • Extract business card data - Python
  • 76.
  • 77. Things you may be interested in. •AI Labs@hmheng (GitHub): https://github.com/hmheng/AILabs
  • 78. Extras Add-on: • Cognitive Service • https://github.com/Azure-Samples/cognitive-services-python-sdk- samples/tree/deb4483ec1db33138e32ce472a775f3ed874fb80?WT.mc_id=AI-MVP-5003227 • Personalizer • https://azure.microsoft.com/en-us/services/cognitive-services/personalizer?WT.mc_id=AI-MVP-5003227 • https://personalizationdemo.azurewebsites.net • Speech-To-Text • https://azure.microsoft.com/en-us/services/cognitive-services/speech-to-text?WT.mc_id=AI-MVP-5003227 • Speech • https://azure.microsoft.com/en-us/services/cognitive-services/speech-to-text?WT.mc_id=AI-MVP-5003227 • https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/speech-synthesis- markup?WT.mc_id=AI-MVP-5003227
  • 79. Extras Add-on: • Bing Search • https://docs.microsoft.com/en-us/azure/cognitive-services/bing-web-search?WT.mc_id=AI-MVP-5003227 • Personalizer • https://azure.microsoft.com/en-us/services/cognitive-services/personalizer?WT.mc_id=AI-MVP-5003227 • https://personalizationdemo.azurewebsites.net • Speech-To-Text • https://azure.microsoft.com/en-us/services/cognitive-services/speech-to-text?WT.mc_id=AI-MVP-5003227 • Speech • https://azure.microsoft.com/en-us/services/cognitive-services/speech-to-text?WT.mc_id=AI-MVP-5003227 • https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/speech-synthesis- markup?WT.mc_id=AI-MVP-5003227
  • 80. Extras Add-on: • Form Recognizer: • https://docs.microsoft.com/en-us/azure/cognitive-services/form- recognizer/language-support?WT.mc_id=AI-MVP-5003227 • https://docs.microsoft.com/en-us/azure/cognitive-services/form- recognizer/quickstarts/label-tool?WT.mc_id=AI-MVP-5003227 • Form OCR Testing Tool • On Azure @ https://fott.azurewebsites.net/ • GitHub @ https://github.com/microsoft/OCR-Form- Tools/?ocid=AID2463683&WT.mc_id=AI-MVP-5003227
  • 81. Thank you! Marvin Heng | @hmheng www.techconnect.io