http://devrain.com
https://www.linkedin.com/in/sashaeve/
https://www.facebook.com/alex.krakovetskiy
“Microsoft AI Inner Circle Program” member
Microsoft Silver Partner (Data Analytics)
Ukrainian IT Awards
80%
of business relevant information is
unstructured
management free
keyword search
faceting language analyzers
geospatial support
suggestions/auto-complete
customizable scoring
proximity search
synonyms, etc.
TIFF HTML
…
JPG
JPG
Scott Guthrie
Title:
Executive Vice President,
C+E
Company: Microsoft
accent color: blue?
Cognitive Services capabilities
Infuse your apps, websites, and bots with human-like intelligence
Тема доклада
Тема доклада
Тема доклада
Extract information from text
Text Analytics
Extract information from text
Extract information from text
Description Values
LANGUAGES English
KEY PHRASES money laundering, criminal charges, prime
minister, Exchange, country, development,
Malaysia's capital, Kuala Lumpur, abuse of power,
MDB financial controversy, showpiece skyscraper
SENTIMENT Positive
NAMED ENTITIES Malaysia [Location]
Kuala Lumpur [Location]
106 [Quantity-Number]
first [Quantity-Ordinal]
December [DateTime-DateRange]
skyscraper [Location]
1Malaysia Development Berhad [Organization]
Extract information from text
Extract information from images
Extract information from text
Generate captions from images
"captions": [ {
"text": "a view of Sagrada Família",
"confidence": 0.9284265
} ]
Extract information from text
Optical character recognition (OCR) in images
Extract information from text
Recognize digital ink and handwriting
Extract information from text
Celebrity recognition in images
Extract information from text
Emotion recognition in images
Extract information from text
Landmark recognition in images
Extract information from text
Form recognizer
Extract information from text
Form recognizer
INGEST
Data in any
format, any
Azure store
ENRICH EXPLORE
Annotations
Cognitive skills
Search
Annotated
Documents
Customer
Data
Search
Index
INGEST ENRICH EXPLORE
Annotated
Documents
Customer
Data
Search
Index
INGEST ENRICH EXPLORE
Annotated
Documents
Customer
Data
Search
Index
INGEST ENRICH EXPLORE
Built-in skills
Azure
Databricks
Machine Learning
VMs
Azure Machine
Learning
Custom skills
Your custom
skill goes here!
OCR (text
recognition)
OCR (text
recognition)
redaction
classifier
cryptonym
extraction
handwritten
text recognition
face
detection
face
detection
https
Custom
translation
skill
{
"values": [
{
"recordId": "7cad2",
"data":
{
"value1":
"I owe you 5 grand"
}
},
{
"recordId": "7cad3",
"data":
{
"value1":
"Just my 2 cents",
}
},
…
]
}
{
"values": [
{
"recordId": "7cad2",
"data":
{
"myOuput1":
"Te debo cinco mil"
}
},
{
"recordId": "7cad3",
"data":
{
"myOutput1":
"Solo mis 2 centavos"
}
},
…
]
}
/document
/content
/normalized_images
/1
/2
/…
/n
"skills": [
{
"@odata.type": "#Microsoft.Skills.Text.LanguageDetectionSkill",
"inputs":
[
{ "name": "text", "source": "/document/content" }
],
"outputs":
[
{ "name": "languageCode", "targetName": "myLanguageCode" },
{ "name": "languageName", "targetName": "myLanguageName" }
]
},
…,
{
"@odata.type": "#Microsoft.Skills.Text.NamedEntityRecognitionSkill",
"categories": [ "Organization" ],
"defaultLanguageCode": "en",
"inputs":
[
{ "name": "text", "source": "/document/content" },
"name" "languageCode" "source" "/document/myLanguageCode"
],
"outputs":
[
{ "name": "organizations", "targetName": "organizations" }
]
},
/document
/content
/normalized_images
/1
/2
/…
/n
/myLanguageCode
/organizations
/1
/2
/n
/…
/document
/content
/normalized_images
/1
/2
/…
/n
/mylanguagecode
Annotated
Documents
Customer
Data
Search
Index
INGEST ENRICH EXPLORE
/document
/keyPhrases
/0
/1
/…
/n
/organizations
/0
/1
/…
/n
organizationId
organizationId
organizationId
organizationId
/images
/0
/1
/…
/n
tags
tags
tags
tags
Option 1: Flatten the data
{
…
"outputFieldMappings":
[
{
"sourceFieldName":
"/document/organizations/*/organizationId",
"targetFieldName":
"myClients"
} ,
…
]
}
https://azure.microsoft.com/en-us/services/search/
https://github.com/Azure-Samples/azure-search-
knowledge-mining
https://github.com/Azure/LearnAI-
KnowledgeMiningBootcamp

Oleksander Krakovetskyi "Webinar: Azure Cognitive Search"

Editor's Notes