SlideShare a Scribd company logo
1 of 28
© Concept Searching 2017
The Nuts and Bolts of Metadata Tagging
and Taxonomies Made Easy
Michael Paye
Chief Technology Officer
Concept Searching
mikep@conceptsearching.com
www.conceptsearching.com
marketing@conceptsearching.com
Twitter @conceptsearch
Paul Billingham
Director of Sales, Europe
Concept Searching
paulb@conceptsearching.com
© Concept Searching 2017
Michael Paye – Chief Technology Officer at Concept Searching
has been the driving force behind many of the company's recent
innovations, including the SharePoint Add-in and hybrid search
products. He has a wealth of experience across the Microsoft
platform and related technologies, and oversees all product
development.
Paul Billingham – Director of Sales, Europe at Concept Searching
is one of the company founders, and has over 20 years’ sales
experience, working primarily within the document management and
workflow industry. He has a technical background, which is a major
benefit when selling complex technology through a partner channel.
© Concept Searching 2017
Agenda
• Who we are and what we do
• What’s the problem?
• What does it impact?
• How do you measure performance?
• Metadata generation
• Auto-classification – What does it do?
• Taxonomies – What kinds are there?
• SharePoint Term Store
• Calculating return on investment
© Concept Searching 2017
• Company founded in 2002
• Product launched in 2003
• Focus on management of structured and unstructured information
• Profitable, debt free
• Technology Platform
• Delivered as a web service
• Automatic concept identification, content tagging, auto-classification,
taxonomy management
• Only statistical vendor that can extract conceptual metadata
• 8 years KMWorld ‘100 Companies that Matter in Knowledge Management’
8 years KMWorld ‘Trend Setting Product’
• Authority to Operate enterprise wide US Air Force, NETCON US Army,
and Canadian SLSA
• Client base: Fortune 500/1000 organizations in Healthcare,
Financial Services, Manufacturing, Energy, Professional Services,
Pharmaceutical, Public sector and DoD
• Microsoft Gold Certification in Application Development
• Member of SharePoint PAC and TAP programs
• Suitable for all versions of SharePoint on-premises and SharePoint Online,
including the latest vNext dedicated platform and the government cloud
The Global Leader in
Managed Metadata Solutions
© Concept Searching 2017
Concept Searching’s technology platforms deliver
semantic metadata generation, auto-classification and
taxonomy/Term Store management, and are fully
integrated with all versions of SharePoint on-premises,
Microsoft Online/Office 365, and OneDrive for Business
What Do We Do?
These infrastructure platforms integrate not only with
SharePoint but also other content repositories, search
engines and file shares, enabling our clients to add
structure and manage their enterprise content,
regardless of environment
The resulting classification metadata is used by clients
to deliver ‘intelligent metadata solutions’ in areas such
as enhanced search, migration, data privacy, records
management, policy enforcement, compliance, text
analytics, and business and social collaboration
© Concept Searching 2017
Definition
• Metadata describes other data, it
provides information about a certain
item's content
• For example, an image may include
metadata that describes how large
the picture is, the color depth, the
image resolution, when the image
was created, and other data
• A text document's metadata may
contain information about how long
the document is, who the author is,
when the document was written, and
a short summary of the document
TechTerms.com
Metadata
© Concept Searching 2017
Types of Metadata
Intrinsic
• Information that can be extracted directly
from an object (file name, size)
Administrative/Management
• Information used to manage the document
(author, date created, date to be reviewed)
Descriptive
• Information that describes the object (title,
subject, audience)
Semantic
• Ability to extract concepts from within
content and generate the metadata
(intelligent metadata)
© Concept Searching 2017
“Over 80% of business decisions are made using unstructured data.”
IDC
What’s the Problem?
© Concept Searching 2017
• 91% use manual metadata tagging
• Free-for-all mode
• Drop down lists
• 15% maintain a homegrown manual taxonomy
• 77% have no rhyme or reason for managing content
Information Chaos
• Unstructured data is growing at the rate of 62% per year IDG
• By 2022, 93% of all data in the digital universe will be unstructured IDG
• Data volume is set to grow 800% over the next five years and 80% of it
will reside as unstructured data Gartner
What’s the Problem?
© Concept Searching 2017
It’s not just about search
What Does it Impact?
© Concept Searching 2017
How do you measure performance?
© Concept Searching 2017
Precision Versus Recall
• Usually used by academics
• Precision
• Positive predictive value
• Fraction of retrieved instances that are relevant
• Recall
• Sensitivity
• Correct number of documents that are relevant
• Fraction of relevant instances that are retrieved
• In a perfect world, they should be balanced
• Commercial evaluation criteria also take into account
• Order of the returned results
• Overall ability of a user to find an answer rather
than relying on a search being submitted only once
© Concept Searching 2017
• Automated metadata generation is
difficult to achieve consistently with
high precision and recall
• Many products on the market today
require complex rules to be generated,
often involving search syntax and
complicated Boolean expressions
• Some require a document training set
for every term to be processed
• Some of these products employ
linguistic techniques that will not
perform consistently across different
vertical markets
Result is very high initial cost in terms of
time and level of qualified staff
Precision Versus Recall
© Concept Searching 2017
A manual metadata approach will fail 95% of the time
Why is it So Hard to Get Metadata Right?
Issue Organizational Impact
Inconsistent Less than 50% of content is correctly indexed, meta-tagged or
efficiently searchable rendering it unusable to the organization. (IDC)
Subjective Highly trained Information Specialists will agree on meta tags between
33% - 50% of the time. (C. Cleverdon)
Cumbersome – expensive Average cost of manually tagging one item runs from $4-$7 per
document and does not factor in the accuracy of the meta tags nor the
repercussions from mis-tagged content. (Hoovers)
Malicious compliance End users select first value in list.
(Perspectives on Metadata, Sarah Courier)
No perceived value for end user What’s in it for me? End user creates document, does not see value for
organization nor risks associated with litigation and non-conformance to
policies.
What have you seen Metadata will continue to be a problem due to inconsistent human
behavior.
© Concept Searching 2017
• A feature found in some content management
systems or records management applications
that will scan the contents of a document and
automatically assign metadata, categories,
and keywords based on the document
contents
• Content-based assignment of one or more
pre-defined categories to documents
(records), usually machine learning, statistical
pattern recognition, or neural network
approaches that are used to construct
classifiers automatically
What is Auto-classification?
© Concept Searching 2017
Automatic generation of compound term metadata
Set up a taxonomy node, suggest clues for class, document feedback
© Concept Searching 2017
Auto-classification Systems – What Do They Do?
Document
Preparation
• Split into language
blocks (paragraphs,
headings),
formatting, layout
Parsing
• Entity extraction
• NLP: parts of speech,
phrases
• Terms, variants
Weighting
• Frequency
• Location in text,
phrase
• Proximity
• Combination
• Format of text
Classification
• If threshold reached
• Can influence search
results
This is where rules
vs statistics come
into play…
Not all classification solutions are created equal
© Concept Searching 2017
Auto-classification Systems
Keyword
• Boolean operators add a degree of sophistication,
but also tend to improve precision at the expense
of recall, because any document that does not
match the Boolean expression is ignored
• The majority of search users are unable to
formulate even basic Boolean expressions
Linguistic
• No commitment to a taxonomic tree
• Related to parts of speech, syntactic parses,
or semantic interpretations
• Typically not scalable
• Usually delivered as pre-configured for an
industry, hard to integrate your unique
organizational vocabulary
© Concept Searching 2017
Semantic Networks
• Refers to a set of relationships between
concepts and words, including parts of
speech and real-world relationships
• These can include rules of various types,
not just Boolean
Machine Learning
• Subfield of computer science (CS)
and artificial intelligence (AI) that deals with
the construction and study of systems that
can learn from data, rather than follow only
explicitly programmed instructions
Auto-classification Systems
© Concept Searching 2017
Auto-classification in action
© Concept Searching 2017
Taxonomies
Taxonomy
• A taxonomy is an organized set of
concepts or definitions, usually labeled
keywords
• For search engines, a taxonomy can
also be a set of organized searches
• Taxonomies are typically nested in a
hierarchical manner, often called a ‘tree’
• Subject-based taxonomy – created by
domain experts
• Content-based taxonomy – organizing
the data you already have
• Behavior-based taxonomy – driven by
search analytics, user tagging, or
vocabulary analysis
© Concept Searching 2017
Types of Taxonomies
List, Picklist, Controlled Vocabulary, Authority Files
List of lead or preferred terms, selected by the end
user, may or may not have relationships among the
terms, can include a synonym ring
Synonym Lists
The use of synonyms allows one concept to be
instantiated as the same as the other, but still
allows a term to be preferred over another
Hierarchical
Each content item resides in only one category,
referred to as a ‘tree’
• Piano
• Musical instrument
© Concept Searching 2017
Types of Taxonomies
Polyhierarchical, Faceted, Thesauri
Content items can exist in more than one category,
more structured controlled vocabulary, provides
information about each term and its relationship to
other terms, features of a hierarchical taxonomy
plus associative relationships
• Piano
• Musical instrument
• Stringed instrument
• Percussion instrument
Ontology
Multiple taxonomies with additional relationships
added to specify concepts within a domain
Marlene Rockmore – The Taxonomy Blog
Heather Hedden – The Accidental Taxonomist
© Concept Searching 2017
SharePoint Term Store
• Introduced in 2010
• Provides infrastructure for
taxonomy management
• Managed metadata properties
designed for hierarchical
metadata
• Integrated with search via the
refinement panel
• Utilizes GUIDs for term/tag
identification
SharePoint has no automatic generation of metadata
SharePoint has no auto-classification capability
SharePoint has no facility to generate concepts
© Concept Searching 2017
Automatic, real-time update of the SharePoint Term Store
© Concept Searching 2017
Return On Investment
© Concept Searching 2017
Return On Investment – Real World Savings
Pique Solutions
The Business Solutions
• Search
• Records Management
• Intelligent Migration
• Data Security/Confidentiality
• eDiscovery/Litigation
Support, FOIA
• Information Governance
• Text Analytics
• Business Social Networking
• Collaboration
• Content Lifecycle
Management
• Metadata Management
• Research
• Knowledge Management
© Concept Searching 2017
Thank You
Michael Paye
Chief Technology Officer
Concept Searching
mikep@conceptsearching.com
www.conceptsearching.com
marketing@conceptsearching.com
Twitter @conceptsearch
Paul Billingham
Director of Sales, Europe
Concept Searching
paulb@conceptsearching.com

More Related Content

What's hot

Reduce Your Taxonomy Deployment Time from Months to Weeks Webinar
Reduce Your Taxonomy Deployment Time from Months to Weeks WebinarReduce Your Taxonomy Deployment Time from Months to Weeks Webinar
Reduce Your Taxonomy Deployment Time from Months to Weeks WebinarConcept Searching, Inc
 
Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...
Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...
Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...Concept Searching, Inc
 
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...Concept Searching, Inc
 
How to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right WebinarHow to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right WebinarConcept Searching, Inc
 
Why Most Migration Projects Fail – Don’t Be a Statistic Webinar
Why Most Migration Projects Fail – Don’t Be a Statistic WebinarWhy Most Migration Projects Fail – Don’t Be a Statistic Webinar
Why Most Migration Projects Fail – Don’t Be a Statistic WebinarConcept Searching, Inc
 
Content marketing for human action
Content marketing for human action Content marketing for human action
Content marketing for human action Econsultancy
 
Getting Knowledge Transfer Right Enterprise Wide Webinar
Getting Knowledge Transfer Right Enterprise Wide WebinarGetting Knowledge Transfer Right Enterprise Wide Webinar
Getting Knowledge Transfer Right Enterprise Wide WebinarConcept Searching, Inc
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DATAVERSITY
 
What You Don’t Know May Hurt You – Achieving Insight and Knowledge Discovery
What You Don’t Know May Hurt You – Achieving Insight and Knowledge DiscoveryWhat You Don’t Know May Hurt You – Achieving Insight and Knowledge Discovery
What You Don’t Know May Hurt You – Achieving Insight and Knowledge DiscoveryConcept Searching, Inc
 
Metadata Matters – Collaboration, Search, and Information Governance at Brail...
Metadata Matters – Collaboration, Search, and Information Governance at Brail...Metadata Matters – Collaboration, Search, and Information Governance at Brail...
Metadata Matters – Collaboration, Search, and Information Governance at Brail...Concept Searching, Inc
 
Transform Your Downstream Cloud Analytics with Data Quality 
Transform Your Downstream Cloud Analytics with Data Quality Transform Your Downstream Cloud Analytics with Data Quality 
Transform Your Downstream Cloud Analytics with Data Quality Precisely
 
Intelligent Compliance to Optimize Energy Sector Enterprise Content Managemen...
Intelligent Compliance to Optimize Energy Sector Enterprise Content Managemen...Intelligent Compliance to Optimize Energy Sector Enterprise Content Managemen...
Intelligent Compliance to Optimize Energy Sector Enterprise Content Managemen...Concept Searching, Inc
 
Data Architecture Strategies: Artificial Intelligence - Real-World Applicatio...
Data Architecture Strategies: Artificial Intelligence - Real-World Applicatio...Data Architecture Strategies: Artificial Intelligence - Real-World Applicatio...
Data Architecture Strategies: Artificial Intelligence - Real-World Applicatio...DATAVERSITY
 
Going Meta – How to use Metadata in SharePoint
Going Meta – How to use Metadata in SharePointGoing Meta – How to use Metadata in SharePoint
Going Meta – How to use Metadata in SharePointConcept Searching, Inc
 
Business Rules For Metadata Governance & Stewardship
Business Rules For Metadata Governance & StewardshipBusiness Rules For Metadata Governance & Stewardship
Business Rules For Metadata Governance & StewardshipRobert J. Abate, CBIP, CDMP
 
Why You Need Metadata-Driven Records Management Webinar
Why You Need Metadata-Driven Records Management WebinarWhy You Need Metadata-Driven Records Management Webinar
Why You Need Metadata-Driven Records Management WebinarConcept Searching, Inc
 
Data-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance StrategiesData-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance StrategiesDATAVERSITY
 
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...DATAVERSITY
 

What's hot (20)

Reduce Your Taxonomy Deployment Time from Months to Weeks Webinar
Reduce Your Taxonomy Deployment Time from Months to Weeks WebinarReduce Your Taxonomy Deployment Time from Months to Weeks Webinar
Reduce Your Taxonomy Deployment Time from Months to Weeks Webinar
 
Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...
Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...
Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...
 
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
 
How to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right WebinarHow to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right Webinar
 
Why Most Migration Projects Fail – Don’t Be a Statistic Webinar
Why Most Migration Projects Fail – Don’t Be a Statistic WebinarWhy Most Migration Projects Fail – Don’t Be a Statistic Webinar
Why Most Migration Projects Fail – Don’t Be a Statistic Webinar
 
Content marketing for human action
Content marketing for human action Content marketing for human action
Content marketing for human action
 
SharePoint Fest Chicago Presentation
SharePoint Fest Chicago PresentationSharePoint Fest Chicago Presentation
SharePoint Fest Chicago Presentation
 
Getting Knowledge Transfer Right Enterprise Wide Webinar
Getting Knowledge Transfer Right Enterprise Wide WebinarGetting Knowledge Transfer Right Enterprise Wide Webinar
Getting Knowledge Transfer Right Enterprise Wide Webinar
 
DQ Book Review
DQ Book ReviewDQ Book Review
DQ Book Review
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
 
What You Don’t Know May Hurt You – Achieving Insight and Knowledge Discovery
What You Don’t Know May Hurt You – Achieving Insight and Knowledge DiscoveryWhat You Don’t Know May Hurt You – Achieving Insight and Knowledge Discovery
What You Don’t Know May Hurt You – Achieving Insight and Knowledge Discovery
 
Metadata Matters – Collaboration, Search, and Information Governance at Brail...
Metadata Matters – Collaboration, Search, and Information Governance at Brail...Metadata Matters – Collaboration, Search, and Information Governance at Brail...
Metadata Matters – Collaboration, Search, and Information Governance at Brail...
 
Transform Your Downstream Cloud Analytics with Data Quality 
Transform Your Downstream Cloud Analytics with Data Quality Transform Your Downstream Cloud Analytics with Data Quality 
Transform Your Downstream Cloud Analytics with Data Quality 
 
Intelligent Compliance to Optimize Energy Sector Enterprise Content Managemen...
Intelligent Compliance to Optimize Energy Sector Enterprise Content Managemen...Intelligent Compliance to Optimize Energy Sector Enterprise Content Managemen...
Intelligent Compliance to Optimize Energy Sector Enterprise Content Managemen...
 
Data Architecture Strategies: Artificial Intelligence - Real-World Applicatio...
Data Architecture Strategies: Artificial Intelligence - Real-World Applicatio...Data Architecture Strategies: Artificial Intelligence - Real-World Applicatio...
Data Architecture Strategies: Artificial Intelligence - Real-World Applicatio...
 
Going Meta – How to use Metadata in SharePoint
Going Meta – How to use Metadata in SharePointGoing Meta – How to use Metadata in SharePoint
Going Meta – How to use Metadata in SharePoint
 
Business Rules For Metadata Governance & Stewardship
Business Rules For Metadata Governance & StewardshipBusiness Rules For Metadata Governance & Stewardship
Business Rules For Metadata Governance & Stewardship
 
Why You Need Metadata-Driven Records Management Webinar
Why You Need Metadata-Driven Records Management WebinarWhy You Need Metadata-Driven Records Management Webinar
Why You Need Metadata-Driven Records Management Webinar
 
Data-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance StrategiesData-Ed Webinar: Data Governance Strategies
Data-Ed Webinar: Data Governance Strategies
 
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
LDM Slides: Conceptual Data Models - How to Get the Attention of Business Use...
 

Similar to SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy

Collaboration Can Be Dangerous Webinar
Collaboration Can Be Dangerous WebinarCollaboration Can Be Dangerous Webinar
Collaboration Can Be Dangerous WebinarConcept Searching, Inc
 
SharePoint and Office 365 State of the Market Survey Results Webinar
SharePoint and Office 365 State of the Market Survey Results WebinarSharePoint and Office 365 State of the Market Survey Results Webinar
SharePoint and Office 365 State of the Market Survey Results WebinarConcept Searching, Inc
 
Taxonomy and tagging – manual tagging does not work!
Taxonomy and tagging – manual tagging does not work!Taxonomy and tagging – manual tagging does not work!
Taxonomy and tagging – manual tagging does not work!Concept Searching, Inc
 
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...Concept Searching, Inc
 
Why Use Add ins with SharePoint and SharePoint Online? Webinar
Why Use Add ins with SharePoint and SharePoint Online? WebinarWhy Use Add ins with SharePoint and SharePoint Online? Webinar
Why Use Add ins with SharePoint and SharePoint Online? WebinarConcept Searching, Inc
 
Exploring Automatic Metadata Generation Based on SharePoint Term Sets
Exploring Automatic Metadata Generation Based on SharePoint Term SetsExploring Automatic Metadata Generation Based on SharePoint Term Sets
Exploring Automatic Metadata Generation Based on SharePoint Term SetsConcept Searching, Inc
 
84% of Migration Projects Fail – Getting it Right in SharePoint Webinar
84% of Migration Projects Fail – Getting it Right in SharePoint Webinar84% of Migration Projects Fail – Getting it Right in SharePoint Webinar
84% of Migration Projects Fail – Getting it Right in SharePoint WebinarConcept Searching, Inc
 
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...Concept Searching, Inc
 
Getting started with with SharePoint Syntex
Getting started with with SharePoint SyntexGetting started with with SharePoint Syntex
Getting started with with SharePoint SyntexDrew Madelung
 
Going Meta – How to Use Metadata in SharePoint and Office 365
Going Meta – How to Use Metadata in SharePoint and Office 365Going Meta – How to Use Metadata in SharePoint and Office 365
Going Meta – How to Use Metadata in SharePoint and Office 365Concept Searching, Inc
 
SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...
SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...
SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...Concept Searching, Inc
 
Is Your Content Migration Strategy Garbage In, Garbage Out? Webinar
Is Your Content Migration Strategy Garbage In, Garbage Out? WebinarIs Your Content Migration Strategy Garbage In, Garbage Out? Webinar
Is Your Content Migration Strategy Garbage In, Garbage Out? WebinarConcept Searching, Inc
 
Drowning in Data and Starving for Information
Drowning in Dataand Starving for InformationDrowning in Dataand Starving for Information
Drowning in Data and Starving for InformationConcept Searching, Inc
 
ECM or CLM? A Fight to the Finish Webinar
ECM or CLM? A Fight to the Finish WebinarECM or CLM? A Fight to the Finish Webinar
ECM or CLM? A Fight to the Finish WebinarConcept Searching, Inc
 
eDiscovery at Nottinghamshire County Council
eDiscovery at Nottinghamshire County Council eDiscovery at Nottinghamshire County Council
eDiscovery at Nottinghamshire County Council Concept Searching, Inc
 
Compliance, Security, Migration, Systems Management – All Fixed by Microsoft?
Compliance, Security, Migration, Systems Management – All Fixed by Microsoft?Compliance, Security, Migration, Systems Management – All Fixed by Microsoft?
Compliance, Security, Migration, Systems Management – All Fixed by Microsoft?Concept Searching, Inc
 
Coexist or Integrate? How Add-ins Deliver an Integrated Environment to Manage...
Coexist or Integrate? How Add-ins Deliver an Integrated Environment to Manage...Coexist or Integrate? How Add-ins Deliver an Integrated Environment to Manage...
Coexist or Integrate? How Add-ins Deliver an Integrated Environment to Manage...Concept Searching, Inc
 
Why Metadata Matters in SharePoint Search and Information Governance Webinar
Why Metadata Matters in SharePoint Search and Information Governance WebinarWhy Metadata Matters in SharePoint Search and Information Governance Webinar
Why Metadata Matters in SharePoint Search and Information Governance WebinarConcept Searching, Inc
 
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...Concept Searching, Inc
 

Similar to SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy (19)

Collaboration Can Be Dangerous Webinar
Collaboration Can Be Dangerous WebinarCollaboration Can Be Dangerous Webinar
Collaboration Can Be Dangerous Webinar
 
SharePoint and Office 365 State of the Market Survey Results Webinar
SharePoint and Office 365 State of the Market Survey Results WebinarSharePoint and Office 365 State of the Market Survey Results Webinar
SharePoint and Office 365 State of the Market Survey Results Webinar
 
Taxonomy and tagging – manual tagging does not work!
Taxonomy and tagging – manual tagging does not work!Taxonomy and tagging – manual tagging does not work!
Taxonomy and tagging – manual tagging does not work!
 
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
conceptTermStoreManager – The Native SharePoint Utility to Manage Term Sets W...
 
Why Use Add ins with SharePoint and SharePoint Online? Webinar
Why Use Add ins with SharePoint and SharePoint Online? WebinarWhy Use Add ins with SharePoint and SharePoint Online? Webinar
Why Use Add ins with SharePoint and SharePoint Online? Webinar
 
Exploring Automatic Metadata Generation Based on SharePoint Term Sets
Exploring Automatic Metadata Generation Based on SharePoint Term SetsExploring Automatic Metadata Generation Based on SharePoint Term Sets
Exploring Automatic Metadata Generation Based on SharePoint Term Sets
 
84% of Migration Projects Fail – Getting it Right in SharePoint Webinar
84% of Migration Projects Fail – Getting it Right in SharePoint Webinar84% of Migration Projects Fail – Getting it Right in SharePoint Webinar
84% of Migration Projects Fail – Getting it Right in SharePoint Webinar
 
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
 
Getting started with with SharePoint Syntex
Getting started with with SharePoint SyntexGetting started with with SharePoint Syntex
Getting started with with SharePoint Syntex
 
Going Meta – How to Use Metadata in SharePoint and Office 365
Going Meta – How to Use Metadata in SharePoint and Office 365Going Meta – How to Use Metadata in SharePoint and Office 365
Going Meta – How to Use Metadata in SharePoint and Office 365
 
SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...
SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...
SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...
 
Is Your Content Migration Strategy Garbage In, Garbage Out? Webinar
Is Your Content Migration Strategy Garbage In, Garbage Out? WebinarIs Your Content Migration Strategy Garbage In, Garbage Out? Webinar
Is Your Content Migration Strategy Garbage In, Garbage Out? Webinar
 
Drowning in Data and Starving for Information
Drowning in Dataand Starving for InformationDrowning in Dataand Starving for Information
Drowning in Data and Starving for Information
 
ECM or CLM? A Fight to the Finish Webinar
ECM or CLM? A Fight to the Finish WebinarECM or CLM? A Fight to the Finish Webinar
ECM or CLM? A Fight to the Finish Webinar
 
eDiscovery at Nottinghamshire County Council
eDiscovery at Nottinghamshire County Council eDiscovery at Nottinghamshire County Council
eDiscovery at Nottinghamshire County Council
 
Compliance, Security, Migration, Systems Management – All Fixed by Microsoft?
Compliance, Security, Migration, Systems Management – All Fixed by Microsoft?Compliance, Security, Migration, Systems Management – All Fixed by Microsoft?
Compliance, Security, Migration, Systems Management – All Fixed by Microsoft?
 
Coexist or Integrate? How Add-ins Deliver an Integrated Environment to Manage...
Coexist or Integrate? How Add-ins Deliver an Integrated Environment to Manage...Coexist or Integrate? How Add-ins Deliver an Integrated Environment to Manage...
Coexist or Integrate? How Add-ins Deliver an Integrated Environment to Manage...
 
Why Metadata Matters in SharePoint Search and Information Governance Webinar
Why Metadata Matters in SharePoint Search and Information Governance WebinarWhy Metadata Matters in SharePoint Search and Information Governance Webinar
Why Metadata Matters in SharePoint Search and Information Governance Webinar
 
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
 

More from Concept Searching, Inc

ARMA NOVA’s Auto-Categorization Showcase
ARMA NOVA’s Auto-Categorization Showcase ARMA NOVA’s Auto-Categorization Showcase
ARMA NOVA’s Auto-Categorization Showcase Concept Searching, Inc
 
Using Metadata and Classification in Records Management
Using Metadata and Classification in Records ManagementUsing Metadata and Classification in Records Management
Using Metadata and Classification in Records ManagementConcept Searching, Inc
 
Discovery, Risk, and Insight in a Metadata-Driven World Webinar
Discovery, Risk, and Insight in a Metadata-Driven World WebinarDiscovery, Risk, and Insight in a Metadata-Driven World Webinar
Discovery, Risk, and Insight in a Metadata-Driven World WebinarConcept Searching, Inc
 
Metadata-Driven Cleanup of Files, Content, and Email Webinar
Metadata-Driven Cleanup of Files, Content, and Email WebinarMetadata-Driven Cleanup of Files, Content, and Email Webinar
Metadata-Driven Cleanup of Files, Content, and Email WebinarConcept Searching, Inc
 
Why You Need Intelligent Metadata and Auto-classification in Records Management
Why You Need Intelligent Metadata and Auto-classification in Records ManagementWhy You Need Intelligent Metadata and Auto-classification in Records Management
Why You Need Intelligent Metadata and Auto-classification in Records ManagementConcept Searching, Inc
 
Enough Talk – Solving GDPR Problems Through Metadata-Driven Compliance Webinar
Enough Talk – Solving GDPR Problems Through Metadata-Driven Compliance WebinarEnough Talk – Solving GDPR Problems Through Metadata-Driven Compliance Webinar
Enough Talk – Solving GDPR Problems Through Metadata-Driven Compliance WebinarConcept Searching, Inc
 
Eliminate the 49% of Documents that Contain Data Breaches Webinar
Eliminate the 49% of Documents that Contain Data Breaches WebinarEliminate the 49% of Documents that Contain Data Breaches Webinar
Eliminate the 49% of Documents that Contain Data Breaches WebinarConcept Searching, Inc
 
The Value of Adding Managed Metadata to Microsoft Online Search
The Value of Adding Managed Metadata to Microsoft Online SearchThe Value of Adding Managed Metadata to Microsoft Online Search
The Value of Adding Managed Metadata to Microsoft Online SearchConcept Searching, Inc
 
How To Implement Engineering Search Within Your Organization Webinar
How To Implement Engineering Search Within Your Organization WebinarHow To Implement Engineering Search Within Your Organization Webinar
How To Implement Engineering Search Within Your Organization WebinarConcept Searching, Inc
 
conceptTermStoreManager Demo On Demand
conceptTermStoreManager Demo On DemandconceptTermStoreManager Demo On Demand
conceptTermStoreManager Demo On DemandConcept Searching, Inc
 
Optimize and Organize Your Content with conceptClassifier for File Shares
Optimize and Organize Your Content with conceptClassifier for File Shares Optimize and Organize Your Content with conceptClassifier for File Shares
Optimize and Organize Your Content with conceptClassifier for File Shares Concept Searching, Inc
 

More from Concept Searching, Inc (11)

ARMA NOVA’s Auto-Categorization Showcase
ARMA NOVA’s Auto-Categorization Showcase ARMA NOVA’s Auto-Categorization Showcase
ARMA NOVA’s Auto-Categorization Showcase
 
Using Metadata and Classification in Records Management
Using Metadata and Classification in Records ManagementUsing Metadata and Classification in Records Management
Using Metadata and Classification in Records Management
 
Discovery, Risk, and Insight in a Metadata-Driven World Webinar
Discovery, Risk, and Insight in a Metadata-Driven World WebinarDiscovery, Risk, and Insight in a Metadata-Driven World Webinar
Discovery, Risk, and Insight in a Metadata-Driven World Webinar
 
Metadata-Driven Cleanup of Files, Content, and Email Webinar
Metadata-Driven Cleanup of Files, Content, and Email WebinarMetadata-Driven Cleanup of Files, Content, and Email Webinar
Metadata-Driven Cleanup of Files, Content, and Email Webinar
 
Why You Need Intelligent Metadata and Auto-classification in Records Management
Why You Need Intelligent Metadata and Auto-classification in Records ManagementWhy You Need Intelligent Metadata and Auto-classification in Records Management
Why You Need Intelligent Metadata and Auto-classification in Records Management
 
Enough Talk – Solving GDPR Problems Through Metadata-Driven Compliance Webinar
Enough Talk – Solving GDPR Problems Through Metadata-Driven Compliance WebinarEnough Talk – Solving GDPR Problems Through Metadata-Driven Compliance Webinar
Enough Talk – Solving GDPR Problems Through Metadata-Driven Compliance Webinar
 
Eliminate the 49% of Documents that Contain Data Breaches Webinar
Eliminate the 49% of Documents that Contain Data Breaches WebinarEliminate the 49% of Documents that Contain Data Breaches Webinar
Eliminate the 49% of Documents that Contain Data Breaches Webinar
 
The Value of Adding Managed Metadata to Microsoft Online Search
The Value of Adding Managed Metadata to Microsoft Online SearchThe Value of Adding Managed Metadata to Microsoft Online Search
The Value of Adding Managed Metadata to Microsoft Online Search
 
How To Implement Engineering Search Within Your Organization Webinar
How To Implement Engineering Search Within Your Organization WebinarHow To Implement Engineering Search Within Your Organization Webinar
How To Implement Engineering Search Within Your Organization Webinar
 
conceptTermStoreManager Demo On Demand
conceptTermStoreManager Demo On DemandconceptTermStoreManager Demo On Demand
conceptTermStoreManager Demo On Demand
 
Optimize and Organize Your Content with conceptClassifier for File Shares
Optimize and Organize Your Content with conceptClassifier for File Shares Optimize and Organize Your Content with conceptClassifier for File Shares
Optimize and Organize Your Content with conceptClassifier for File Shares
 

Recently uploaded

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 

Recently uploaded (20)

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 

SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy

  • 1. © Concept Searching 2017 The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Michael Paye Chief Technology Officer Concept Searching mikep@conceptsearching.com www.conceptsearching.com marketing@conceptsearching.com Twitter @conceptsearch Paul Billingham Director of Sales, Europe Concept Searching paulb@conceptsearching.com
  • 2. © Concept Searching 2017 Michael Paye – Chief Technology Officer at Concept Searching has been the driving force behind many of the company's recent innovations, including the SharePoint Add-in and hybrid search products. He has a wealth of experience across the Microsoft platform and related technologies, and oversees all product development. Paul Billingham – Director of Sales, Europe at Concept Searching is one of the company founders, and has over 20 years’ sales experience, working primarily within the document management and workflow industry. He has a technical background, which is a major benefit when selling complex technology through a partner channel.
  • 3. © Concept Searching 2017 Agenda • Who we are and what we do • What’s the problem? • What does it impact? • How do you measure performance? • Metadata generation • Auto-classification – What does it do? • Taxonomies – What kinds are there? • SharePoint Term Store • Calculating return on investment
  • 4. © Concept Searching 2017 • Company founded in 2002 • Product launched in 2003 • Focus on management of structured and unstructured information • Profitable, debt free • Technology Platform • Delivered as a web service • Automatic concept identification, content tagging, auto-classification, taxonomy management • Only statistical vendor that can extract conceptual metadata • 8 years KMWorld ‘100 Companies that Matter in Knowledge Management’ 8 years KMWorld ‘Trend Setting Product’ • Authority to Operate enterprise wide US Air Force, NETCON US Army, and Canadian SLSA • Client base: Fortune 500/1000 organizations in Healthcare, Financial Services, Manufacturing, Energy, Professional Services, Pharmaceutical, Public sector and DoD • Microsoft Gold Certification in Application Development • Member of SharePoint PAC and TAP programs • Suitable for all versions of SharePoint on-premises and SharePoint Online, including the latest vNext dedicated platform and the government cloud The Global Leader in Managed Metadata Solutions
  • 5. © Concept Searching 2017 Concept Searching’s technology platforms deliver semantic metadata generation, auto-classification and taxonomy/Term Store management, and are fully integrated with all versions of SharePoint on-premises, Microsoft Online/Office 365, and OneDrive for Business What Do We Do? These infrastructure platforms integrate not only with SharePoint but also other content repositories, search engines and file shares, enabling our clients to add structure and manage their enterprise content, regardless of environment The resulting classification metadata is used by clients to deliver ‘intelligent metadata solutions’ in areas such as enhanced search, migration, data privacy, records management, policy enforcement, compliance, text analytics, and business and social collaboration
  • 6. © Concept Searching 2017 Definition • Metadata describes other data, it provides information about a certain item's content • For example, an image may include metadata that describes how large the picture is, the color depth, the image resolution, when the image was created, and other data • A text document's metadata may contain information about how long the document is, who the author is, when the document was written, and a short summary of the document TechTerms.com Metadata
  • 7. © Concept Searching 2017 Types of Metadata Intrinsic • Information that can be extracted directly from an object (file name, size) Administrative/Management • Information used to manage the document (author, date created, date to be reviewed) Descriptive • Information that describes the object (title, subject, audience) Semantic • Ability to extract concepts from within content and generate the metadata (intelligent metadata)
  • 8. © Concept Searching 2017 “Over 80% of business decisions are made using unstructured data.” IDC What’s the Problem?
  • 9. © Concept Searching 2017 • 91% use manual metadata tagging • Free-for-all mode • Drop down lists • 15% maintain a homegrown manual taxonomy • 77% have no rhyme or reason for managing content Information Chaos • Unstructured data is growing at the rate of 62% per year IDG • By 2022, 93% of all data in the digital universe will be unstructured IDG • Data volume is set to grow 800% over the next five years and 80% of it will reside as unstructured data Gartner What’s the Problem?
  • 10. © Concept Searching 2017 It’s not just about search What Does it Impact?
  • 11. © Concept Searching 2017 How do you measure performance?
  • 12. © Concept Searching 2017 Precision Versus Recall • Usually used by academics • Precision • Positive predictive value • Fraction of retrieved instances that are relevant • Recall • Sensitivity • Correct number of documents that are relevant • Fraction of relevant instances that are retrieved • In a perfect world, they should be balanced • Commercial evaluation criteria also take into account • Order of the returned results • Overall ability of a user to find an answer rather than relying on a search being submitted only once
  • 13. © Concept Searching 2017 • Automated metadata generation is difficult to achieve consistently with high precision and recall • Many products on the market today require complex rules to be generated, often involving search syntax and complicated Boolean expressions • Some require a document training set for every term to be processed • Some of these products employ linguistic techniques that will not perform consistently across different vertical markets Result is very high initial cost in terms of time and level of qualified staff Precision Versus Recall
  • 14. © Concept Searching 2017 A manual metadata approach will fail 95% of the time Why is it So Hard to Get Metadata Right? Issue Organizational Impact Inconsistent Less than 50% of content is correctly indexed, meta-tagged or efficiently searchable rendering it unusable to the organization. (IDC) Subjective Highly trained Information Specialists will agree on meta tags between 33% - 50% of the time. (C. Cleverdon) Cumbersome – expensive Average cost of manually tagging one item runs from $4-$7 per document and does not factor in the accuracy of the meta tags nor the repercussions from mis-tagged content. (Hoovers) Malicious compliance End users select first value in list. (Perspectives on Metadata, Sarah Courier) No perceived value for end user What’s in it for me? End user creates document, does not see value for organization nor risks associated with litigation and non-conformance to policies. What have you seen Metadata will continue to be a problem due to inconsistent human behavior.
  • 15. © Concept Searching 2017 • A feature found in some content management systems or records management applications that will scan the contents of a document and automatically assign metadata, categories, and keywords based on the document contents • Content-based assignment of one or more pre-defined categories to documents (records), usually machine learning, statistical pattern recognition, or neural network approaches that are used to construct classifiers automatically What is Auto-classification?
  • 16. © Concept Searching 2017 Automatic generation of compound term metadata Set up a taxonomy node, suggest clues for class, document feedback
  • 17. © Concept Searching 2017 Auto-classification Systems – What Do They Do? Document Preparation • Split into language blocks (paragraphs, headings), formatting, layout Parsing • Entity extraction • NLP: parts of speech, phrases • Terms, variants Weighting • Frequency • Location in text, phrase • Proximity • Combination • Format of text Classification • If threshold reached • Can influence search results This is where rules vs statistics come into play… Not all classification solutions are created equal
  • 18. © Concept Searching 2017 Auto-classification Systems Keyword • Boolean operators add a degree of sophistication, but also tend to improve precision at the expense of recall, because any document that does not match the Boolean expression is ignored • The majority of search users are unable to formulate even basic Boolean expressions Linguistic • No commitment to a taxonomic tree • Related to parts of speech, syntactic parses, or semantic interpretations • Typically not scalable • Usually delivered as pre-configured for an industry, hard to integrate your unique organizational vocabulary
  • 19. © Concept Searching 2017 Semantic Networks • Refers to a set of relationships between concepts and words, including parts of speech and real-world relationships • These can include rules of various types, not just Boolean Machine Learning • Subfield of computer science (CS) and artificial intelligence (AI) that deals with the construction and study of systems that can learn from data, rather than follow only explicitly programmed instructions Auto-classification Systems
  • 20. © Concept Searching 2017 Auto-classification in action
  • 21. © Concept Searching 2017 Taxonomies Taxonomy • A taxonomy is an organized set of concepts or definitions, usually labeled keywords • For search engines, a taxonomy can also be a set of organized searches • Taxonomies are typically nested in a hierarchical manner, often called a ‘tree’ • Subject-based taxonomy – created by domain experts • Content-based taxonomy – organizing the data you already have • Behavior-based taxonomy – driven by search analytics, user tagging, or vocabulary analysis
  • 22. © Concept Searching 2017 Types of Taxonomies List, Picklist, Controlled Vocabulary, Authority Files List of lead or preferred terms, selected by the end user, may or may not have relationships among the terms, can include a synonym ring Synonym Lists The use of synonyms allows one concept to be instantiated as the same as the other, but still allows a term to be preferred over another Hierarchical Each content item resides in only one category, referred to as a ‘tree’ • Piano • Musical instrument
  • 23. © Concept Searching 2017 Types of Taxonomies Polyhierarchical, Faceted, Thesauri Content items can exist in more than one category, more structured controlled vocabulary, provides information about each term and its relationship to other terms, features of a hierarchical taxonomy plus associative relationships • Piano • Musical instrument • Stringed instrument • Percussion instrument Ontology Multiple taxonomies with additional relationships added to specify concepts within a domain Marlene Rockmore – The Taxonomy Blog Heather Hedden – The Accidental Taxonomist
  • 24. © Concept Searching 2017 SharePoint Term Store • Introduced in 2010 • Provides infrastructure for taxonomy management • Managed metadata properties designed for hierarchical metadata • Integrated with search via the refinement panel • Utilizes GUIDs for term/tag identification SharePoint has no automatic generation of metadata SharePoint has no auto-classification capability SharePoint has no facility to generate concepts
  • 25. © Concept Searching 2017 Automatic, real-time update of the SharePoint Term Store
  • 26. © Concept Searching 2017 Return On Investment
  • 27. © Concept Searching 2017 Return On Investment – Real World Savings Pique Solutions The Business Solutions • Search • Records Management • Intelligent Migration • Data Security/Confidentiality • eDiscovery/Litigation Support, FOIA • Information Governance • Text Analytics • Business Social Networking • Collaboration • Content Lifecycle Management • Metadata Management • Research • Knowledge Management
  • 28. © Concept Searching 2017 Thank You Michael Paye Chief Technology Officer Concept Searching mikep@conceptsearching.com www.conceptsearching.com marketing@conceptsearching.com Twitter @conceptsearch Paul Billingham Director of Sales, Europe Concept Searching paulb@conceptsearching.com