SlideShare a Scribd company logo
It was the best of times
Rutger
It was the worst of times
Albert-Jan
Using AI to classify your SharePoint Data
A love story
Cortex.
The next step in Knowledge in SharePoint through:
• Image and text recognition
• Form processing
• Machine teaching
To create knowledge entities through AI and experts
in the MMS. Providing a knowledge card within
content and a people card with knowledge. As well as
a topic page available through the knowledge center.
Form Recognizer.
Example
Our case.
• Our customer had a large set of unstructured
data (mainly Office and PDF files).
• All data was stored on a File share.
• The data was not searchable, no full text
search was configured.
• No metadata was captured and most of the
files where scans of typewriter created data.
• Legal requirements prevented us of putting
the full data-set in the cloud.
Unstructured data.
Unstructured data.
Unstructured data.
So how does it work.
Crawl
SharePoint 2019
Users
Content Processing Index
Document Parsing Content Enrichment Word breaking Custom Entity Extraction
Site Collection
Network share
App Server Cloud
Webservice Metadata extraction Data Cleansing
LUIS Training Model
LUIS intents & entities
So how does it work.
Crawl
SharePoint 2019
Users
Content Processing Index
Document Parsing Content Enrichment Word breaking Custom Entity Extraction
Site Collection
Network share
App Server Cloud
Webservice Metadata extraction Data Cleansing
LUIS Training Model
LUIS intents & entities
SharePoint provides us
with Full Text Search
Custom Content
Processing allows us to
inject intelligence in the
search process
Our ‘logic’ can
determine what can be
send to the cloud.
LUIS provides intents
and entities that can be
written back to our
search results
LUIS.
Utterance
What a user says
Intent
What a user means
Entities
What you want to extract
What would we do differently.
1. We scored between 80% to 95% of auto tagging with
our data set after just 10 days. We are not data
scientists so we could do better.
2. We would love to extract more data, not only from the
files you are processing but also from other back-end
systems. When we retrieve a project number from a file,
we can use that to gather the PM from ERP.
3. Train in the cloud, run on-premises (locally run Docker
with LUIS or Text-Analyser trained in the cloud would
allow us to analyze even more text!
Thank you.
Reach out for questions!
• Rutger Knijnenburg • Albert-Jan Schot
Technical Cloud Consultant
@knijnOps
knijnops.com
CTO & MVP
@appieschot
cloudappie.nl

More Related Content

What's hot

Identi.ca and RDF for Linking Data
Identi.ca and RDF for Linking DataIdenti.ca and RDF for Linking Data
Identi.ca and RDF for Linking Data
Saeed Moaddeli
 
Business Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search EngineBusiness Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search Engine
ankur881120
 

What's hot (15)

Overview of Microsoft Azure AI Services
Overview of Microsoft Azure AI ServicesOverview of Microsoft Azure AI Services
Overview of Microsoft Azure AI Services
 
Real World Challenges in Enterprise Search
Real World Challenges in Enterprise SearchReal World Challenges in Enterprise Search
Real World Challenges in Enterprise Search
 
Identi.ca and RDF for Linking Data
Identi.ca and RDF for Linking DataIdenti.ca and RDF for Linking Data
Identi.ca and RDF for Linking Data
 
Is BCS Dead?
Is BCS Dead?Is BCS Dead?
Is BCS Dead?
 
week13
week13week13
week13
 
Archonnex at ICPSR
Archonnex at ICPSRArchonnex at ICPSR
Archonnex at ICPSR
 
Share point metadata
Share point metadataShare point metadata
Share point metadata
 
How goole search engine work
How goole search engine workHow goole search engine work
How goole search engine work
 
Business Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search EngineBusiness Intelligence Solution Using Search Engine
Business Intelligence Solution Using Search Engine
 
Office 365 Emails & Archiving
Office 365 Emails & ArchivingOffice 365 Emails & Archiving
Office 365 Emails & Archiving
 
Sarath_Kumar_Prabhakaran_Graduate_Resume
Sarath_Kumar_Prabhakaran_Graduate_ResumeSarath_Kumar_Prabhakaran_Graduate_Resume
Sarath_Kumar_Prabhakaran_Graduate_Resume
 
Rishubh Agrawal Resume
Rishubh Agrawal ResumeRishubh Agrawal Resume
Rishubh Agrawal Resume
 
Rishubh Agrawal Resume
Rishubh Agrawal ResumeRishubh Agrawal Resume
Rishubh Agrawal Resume
 
Jeremy cabral search marketing summit - scraping data-driven content (1)
Jeremy cabral   search marketing summit - scraping data-driven content (1)Jeremy cabral   search marketing summit - scraping data-driven content (1)
Jeremy cabral search marketing summit - scraping data-driven content (1)
 
Training Project Report on Search Engines
Training Project Report on Search EnginesTraining Project Report on Search Engines
Training Project Report on Search Engines
 

Similar to Using AI to classify your SharePoint Data

MS Azure with IoT - Final Version
MS Azure with IoT - Final VersionMS Azure with IoT - Final Version
MS Azure with IoT - Final Version
Janani Eshwaran
 
MS Azure with IoT - Final Version
MS Azure with IoT - Final VersionMS Azure with IoT - Final Version
MS Azure with IoT - Final Version
Janani Eshwaran
 

Similar to Using AI to classify your SharePoint Data (20)

2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v12017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
 
Evolution of Content Services
Evolution of Content ServicesEvolution of Content Services
Evolution of Content Services
 
Pingar App for SharePoint
Pingar App for SharePointPingar App for SharePoint
Pingar App for SharePoint
 
Microsoft ignite 2019 highlights
Microsoft ignite 2019 highlightsMicrosoft ignite 2019 highlights
Microsoft ignite 2019 highlights
 
presentation
presentationpresentation
presentation
 
Running SharePoint on Small Business Server: Pros and Cons - 100 Level
Running SharePoint on Small Business Server: Pros and Cons - 100 LevelRunning SharePoint on Small Business Server: Pros and Cons - 100 Level
Running SharePoint on Small Business Server: Pros and Cons - 100 Level
 
MS Azure with IoT - Final Version
MS Azure with IoT - Final VersionMS Azure with IoT - Final Version
MS Azure with IoT - Final Version
 
MS Azure with IoT - Final Version
MS Azure with IoT - Final VersionMS Azure with IoT - Final Version
MS Azure with IoT - Final Version
 
Use O365 and Azure Cognitive Services for intelligent search
Use O365 and Azure Cognitive Services for intelligent searchUse O365 and Azure Cognitive Services for intelligent search
Use O365 and Azure Cognitive Services for intelligent search
 
Steering Away from Bolted-On Analytics
Steering Away from Bolted-On AnalyticsSteering Away from Bolted-On Analytics
Steering Away from Bolted-On Analytics
 
Data science and Artificial Intelligence
Data science and Artificial IntelligenceData science and Artificial Intelligence
Data science and Artificial Intelligence
 
Introduction to Microsoft Syntex
Introduction to Microsoft SyntexIntroduction to Microsoft Syntex
Introduction to Microsoft Syntex
 
Getting Ready for Project Cortex and SharePoint Syntex
Getting Ready for Project Cortex and SharePoint SyntexGetting Ready for Project Cortex and SharePoint Syntex
Getting Ready for Project Cortex and SharePoint Syntex
 
Getting Ready for Project Cortex and SharePoint Syntex
Getting Ready for Project Cortex and SharePoint SyntexGetting Ready for Project Cortex and SharePoint Syntex
Getting Ready for Project Cortex and SharePoint Syntex
 
Getting Ready for Project Cortex
Getting Ready for Project CortexGetting Ready for Project Cortex
Getting Ready for Project Cortex
 
Intelligent Data Extraction, Turning Content into Data, A Look at Advanced Ca...
Intelligent Data Extraction, Turning Content into Data, A Look at Advanced Ca...Intelligent Data Extraction, Turning Content into Data, A Look at Advanced Ca...
Intelligent Data Extraction, Turning Content into Data, A Look at Advanced Ca...
 
7 Innovative Ways Project Cortex Delivers Business Value
7 Innovative Ways Project Cortex Delivers Business Value7 Innovative Ways Project Cortex Delivers Business Value
7 Innovative Ways Project Cortex Delivers Business Value
 
Getting Started With Dato - August 2015
Getting Started With Dato - August 2015Getting Started With Dato - August 2015
Getting Started With Dato - August 2015
 
From Traditional ECM to Content Services: Modernizing Content Management with...
From Traditional ECM to Content Services: Modernizing Content Management with...From Traditional ECM to Content Services: Modernizing Content Management with...
From Traditional ECM to Content Services: Modernizing Content Management with...
 
Getting started with with SharePoint Syntex
Getting started with with SharePoint SyntexGetting started with with SharePoint Syntex
Getting started with with SharePoint Syntex
 

More from Albert-Jan Schot

Office 365 Truths Delve and Office Graph
Office 365 Truths Delve and Office GraphOffice 365 Truths Delve and Office Graph
Office 365 Truths Delve and Office Graph
Albert-Jan Schot
 
TechDays 2015 - Building Solutions with the Office Graph
TechDays 2015 - Building Solutions with the Office GraphTechDays 2015 - Building Solutions with the Office Graph
TechDays 2015 - Building Solutions with the Office Graph
Albert-Jan Schot
 
TechDays 2015 - SharePoint van traditie naar verandering
TechDays 2015 - SharePoint van traditie naar veranderingTechDays 2015 - SharePoint van traditie naar verandering
TechDays 2015 - SharePoint van traditie naar verandering
Albert-Jan Schot
 
Haagse Hogeschool - IDMConferentie - Office Graph
Haagse Hogeschool - IDMConferentie - Office GraphHaagse Hogeschool - IDMConferentie - Office Graph
Haagse Hogeschool - IDMConferentie - Office Graph
Albert-Jan Schot
 

More from Albert-Jan Schot (20)

Site templates, site life cycle management and Modern SharePoint
Site templates, site life cycle management and Modern SharePointSite templates, site life cycle management and Modern SharePoint
Site templates, site life cycle management and Modern SharePoint
 
Diwug 2019 Teams Development
Diwug 2019 Teams DevelopmentDiwug 2019 Teams Development
Diwug 2019 Teams Development
 
Developing for Office365 groups
Developing for Office365 groupsDeveloping for Office365 groups
Developing for Office365 groups
 
DIWUG - Groups & Teams and Planner
DIWUG - Groups & Teams and Planner DIWUG - Groups & Teams and Planner
DIWUG - Groups & Teams and Planner
 
SPSBE2016 - Leverage the power of groups
SPSBE2016 - Leverage the power of groupsSPSBE2016 - Leverage the power of groups
SPSBE2016 - Leverage the power of groups
 
SharePoint Framework Ignite 2016 recap @ Sparked
SharePoint Framework Ignite 2016 recap @ SparkedSharePoint Framework Ignite 2016 recap @ Sparked
SharePoint Framework Ignite 2016 recap @ Sparked
 
Working with the Groups
Working with the GroupsWorking with the Groups
Working with the Groups
 
Microsoft graph api deep dive
Microsoft graph api deep diveMicrosoft graph api deep dive
Microsoft graph api deep dive
 
Office add ins 0-60
Office add ins 0-60Office add ins 0-60
Office add ins 0-60
 
DIWUG - Groups for Developers
DIWUG - Groups for DevelopersDIWUG - Groups for Developers
DIWUG - Groups for Developers
 
Office 365 Truths Delve and Office Graph
Office 365 Truths Delve and Office GraphOffice 365 Truths Delve and Office Graph
Office 365 Truths Delve and Office Graph
 
TechDays 2015 - Building Solutions with the Office Graph
TechDays 2015 - Building Solutions with the Office GraphTechDays 2015 - Building Solutions with the Office Graph
TechDays 2015 - Building Solutions with the Office Graph
 
TechDays 2015 - SharePoint van traditie naar verandering
TechDays 2015 - SharePoint van traditie naar veranderingTechDays 2015 - SharePoint van traditie naar verandering
TechDays 2015 - SharePoint van traditie naar verandering
 
Vinden Of Zoeken
Vinden Of ZoekenVinden Of Zoeken
Vinden Of Zoeken
 
Haagse Hogeschool - IDMConferentie - Office Graph
Haagse Hogeschool - IDMConferentie - Office GraphHaagse Hogeschool - IDMConferentie - Office Graph
Haagse Hogeschool - IDMConferentie - Office Graph
 
SPStockHolm 2015 - Search Intents with Apps
SPStockHolm 2015 - Search Intents with Apps SPStockHolm 2015 - Search Intents with Apps
SPStockHolm 2015 - Search Intents with Apps
 
DIWUG Developing Apps with Office Graph
DIWUG Developing Apps with Office GraphDIWUG Developing Apps with Office Graph
DIWUG Developing Apps with Office Graph
 
SPConnect 2014 - Search Intents with Apps
SPConnect 2014 - Search Intents with AppsSPConnect 2014 - Search Intents with Apps
SPConnect 2014 - Search Intents with Apps
 
European O365 Connect SharePoint Online Applification
European O365 Connect SharePoint Online ApplificationEuropean O365 Connect SharePoint Online Applification
European O365 Connect SharePoint Online Applification
 
SharePoint Connections 2013 - Navigation in a Multi Site Collection environment
SharePoint Connections 2013 - Navigation in a Multi Site Collection environmentSharePoint Connections 2013 - Navigation in a Multi Site Collection environment
SharePoint Connections 2013 - Navigation in a Multi Site Collection environment
 

Recently uploaded

AI/ML Infra Meetup | Improve Speed and GPU Utilization for Model Training & S...
AI/ML Infra Meetup | Improve Speed and GPU Utilization for Model Training & S...AI/ML Infra Meetup | Improve Speed and GPU Utilization for Model Training & S...
AI/ML Infra Meetup | Improve Speed and GPU Utilization for Model Training & S...
Alluxio, Inc.
 

Recently uploaded (20)

Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
 
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
 
Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus Compute wth IRI Workflows - GlobusWorld 2024
 
Breaking the Code : A Guide to WhatsApp Business API.pdf
Breaking the Code : A Guide to WhatsApp Business API.pdfBreaking the Code : A Guide to WhatsApp Business API.pdf
Breaking the Code : A Guide to WhatsApp Business API.pdf
 
Into the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdfInto the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdf
 
AI/ML Infra Meetup | Improve Speed and GPU Utilization for Model Training & S...
AI/ML Infra Meetup | Improve Speed and GPU Utilization for Model Training & S...AI/ML Infra Meetup | Improve Speed and GPU Utilization for Model Training & S...
AI/ML Infra Meetup | Improve Speed and GPU Utilization for Model Training & S...
 
Studiovity film pre-production and screenwriting software
Studiovity film pre-production and screenwriting softwareStudiovity film pre-production and screenwriting software
Studiovity film pre-production and screenwriting software
 
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
 
AI/ML Infra Meetup | Perspective on Deep Learning Framework
AI/ML Infra Meetup | Perspective on Deep Learning FrameworkAI/ML Infra Meetup | Perspective on Deep Learning Framework
AI/ML Infra Meetup | Perspective on Deep Learning Framework
 
Corporate Management | Session 3 of 3 | Tendenci AMS
Corporate Management | Session 3 of 3 | Tendenci AMSCorporate Management | Session 3 of 3 | Tendenci AMS
Corporate Management | Session 3 of 3 | Tendenci AMS
 
How Does XfilesPro Ensure Security While Sharing Documents in Salesforce?
How Does XfilesPro Ensure Security While Sharing Documents in Salesforce?How Does XfilesPro Ensure Security While Sharing Documents in Salesforce?
How Does XfilesPro Ensure Security While Sharing Documents in Salesforce?
 
A Comprehensive Appium Guide for Hybrid App Automation Testing.pdf
A Comprehensive Appium Guide for Hybrid App Automation Testing.pdfA Comprehensive Appium Guide for Hybrid App Automation Testing.pdf
A Comprehensive Appium Guide for Hybrid App Automation Testing.pdf
 
Using IESVE for Room Loads Analysis - Australia & New Zealand
Using IESVE for Room Loads Analysis - Australia & New ZealandUsing IESVE for Room Loads Analysis - Australia & New Zealand
Using IESVE for Room Loads Analysis - Australia & New Zealand
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
 
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdfDominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
 
De mooiste recreatieve routes ontdekken met RouteYou en FME
De mooiste recreatieve routes ontdekken met RouteYou en FMEDe mooiste recreatieve routes ontdekken met RouteYou en FME
De mooiste recreatieve routes ontdekken met RouteYou en FME
 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
 
2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx
 
Advanced Flow Concepts Every Developer Should Know
Advanced Flow Concepts Every Developer Should KnowAdvanced Flow Concepts Every Developer Should Know
Advanced Flow Concepts Every Developer Should Know
 
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.ILBeyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
 

Using AI to classify your SharePoint Data

  • 1. It was the best of times Rutger
  • 2. It was the worst of times Albert-Jan
  • 3.
  • 4. Using AI to classify your SharePoint Data A love story
  • 5. Cortex. The next step in Knowledge in SharePoint through: • Image and text recognition • Form processing • Machine teaching To create knowledge entities through AI and experts in the MMS. Providing a knowledge card within content and a people card with knowledge. As well as a topic page available through the knowledge center.
  • 6.
  • 7.
  • 9. Our case. • Our customer had a large set of unstructured data (mainly Office and PDF files). • All data was stored on a File share. • The data was not searchable, no full text search was configured. • No metadata was captured and most of the files where scans of typewriter created data. • Legal requirements prevented us of putting the full data-set in the cloud.
  • 13. So how does it work. Crawl SharePoint 2019 Users Content Processing Index Document Parsing Content Enrichment Word breaking Custom Entity Extraction Site Collection Network share App Server Cloud Webservice Metadata extraction Data Cleansing LUIS Training Model LUIS intents & entities
  • 14. So how does it work. Crawl SharePoint 2019 Users Content Processing Index Document Parsing Content Enrichment Word breaking Custom Entity Extraction Site Collection Network share App Server Cloud Webservice Metadata extraction Data Cleansing LUIS Training Model LUIS intents & entities SharePoint provides us with Full Text Search Custom Content Processing allows us to inject intelligence in the search process Our ‘logic’ can determine what can be send to the cloud. LUIS provides intents and entities that can be written back to our search results
  • 15. LUIS. Utterance What a user says Intent What a user means Entities What you want to extract
  • 16.
  • 17.
  • 18. What would we do differently. 1. We scored between 80% to 95% of auto tagging with our data set after just 10 days. We are not data scientists so we could do better. 2. We would love to extract more data, not only from the files you are processing but also from other back-end systems. When we retrieve a project number from a file, we can use that to gather the PM from ERP. 3. Train in the cloud, run on-premises (locally run Docker with LUIS or Text-Analyser trained in the cloud would allow us to analyze even more text!
  • 19. Thank you. Reach out for questions! • Rutger Knijnenburg • Albert-Jan Schot Technical Cloud Consultant @knijnOps knijnops.com CTO & MVP @appieschot cloudappie.nl