SlideShare a Scribd company logo
1 of 59
Download to read offline
INTRODUCTION TO ENTERPRISE
          SEARCH
          Kristian Norling
Introduction
• Who is here?
• Your expectations?
• Kristian?
• 2 hours, one break
• Lifetime answer Guarantee on this class
Hikingartist
Agenda
• Problem
• History of (web) search
• How we search and !nd?
• Current state of Enterprise Search + stats
• Technical concept
• Information quality
• Feedback cycle
• Five dimensions of Findability
•List




        mrflip
nathansnider
erikref
The Problems
• Growing amounts of Information
• Changing patterns of information
  consumption
• Information silos
• Web like behaviour > Information !lters
• Internal information use is still in the
  Digital Stone Age
History of Search
In Academia search is called Information
Retrieval.
It is an old discipline, dating back
thousands of years...
Basic concepts in Information Retrieval:
Recall and Precision, more later...
Directories vs. Search Engines
• Directories are manually compiled taxonomies of
  websites
• Directories are far more costly and time intensive to
  maintain
• Directories lack coverage, although it provides an
  important alternative, especially for novice surfers
• Search engines rely mainly on automated search
  algorithms
• Search engines rank pages by popularity on the web,
  the more referrals (links) the more relevant
Early days of Web Search
Yahoo – searchable directory (1994, ~10000 websites)
    • Integrates	
  search	
  over	
  its	
  directory.	
  Organized	
  by	
  subject	
  
      ma8ers.	
  Sites	
  can	
  be	
  suggested,	
  but	
  human	
  editors	
  control	
  
      quality	
  of	
  directory	
  (~100	
  dedicated	
  editors)
Ask – natural language search engine (1998)
    • used	
  human	
  editors	
  to	
  match	
  popular	
  queries.	
  Tried	
  
      different	
  algorithms	
  to	
  rank	
  pages	
  by	
  popularity
Google – searchable index (1998)
    • Developed	
  Pagerank,	
  popularity	
  algorithm	
  that	
  hides	
  bad	
  
      content.	
  Set	
  standards	
  (spellchecking,	
  query	
  suggesIon,	
  
      search	
  results	
  page	
  design)
Web Search - evolution
First generation (1995-97) – AltaVista, Excite, WebCrawler
Uses mostly on-page data (text and formatting).
Informational queries.
Second generation (1998-2010) – Google, Yahoo
Use o"-page, web-speci!c data: link analysis, anchor-text, click-
through data. Informational and navigational queries.
Third generation (2010-present) – Google, Wolfram-Alpha,
Bing
Blend data from many sources, tries to answer ‘‘the need
behind the query’’: semantic analysis, context determination,
dynamic database selection etc. Informational, navigational, and
transactional queries.
Seeking information modes:
Informational
Find information assumed to be available
on the web in a static form.
Seeking information modes:
Navigational
Reach a particular site that the user has in
mind, either because they visited it in the
past or because they assume that such a
site exists. Have usually only one "right"
result.
Seeking information modes:
Transactional
Reach a site where further interaction will happen. This
interaction constitutes the transaction de!ning these
queries. The main categories for such queries are
shopping, !nding various web-mediated services,
downloading various type of !le (images, songs, etc),
accessing certain data-bases (e.g. Yellow Pages type data),
!nding servers (e.g.for gaming) etc.
Four modes of seeking information


 Finding something when I
 know what I want and have
 words to describe it.
Four modes of seeking information


 Exploring when I only have
 some idea of what I want and
 may lack the words to
 articulate it.
Four modes of seeking information


 Finding relevant items when I
 don’t know what I need.
Four modes of seeking information


 Finding something I have seen
 before, but can’t remember
 where.
The State of Enterprise Search
• Amount of information is growing
  everyday
• What to Search for?
• Where to Search?
• How to Search?
• Search is simple, complex and powerful
• Findability Dimensions
STATS FROM THE
“ENTERPRISE SEARCH AND
FINDABILITY SURVEY 2012”
        SIGN-UP
HOW CRITICAL IS FINDING
THE RIGHT INFORMATION
 TO BUSINESS GOALS AND
        SUCCESS?
EUROPE
       76.5%
IMPERATIVE/SIGNIFICANT
Zoom Zoom
IS IT EASY TO FIND THE
  RIGHT INFORMATION
      WITHIN YOUR
ORGANISATION TODAY?
EUROPE
       77%
MODERATELY/VERY HARD
LEVEL OF SATISFACTION?
proimos
EUROPE
      18.5%
MOSTLY/VERY SATISFIED
WHAT ARE THE OBSTACLES
 TO FINDING THE RIGHT
     INFORMATION?
Globally
63.4% POOR SEARCH FUNCTIONALITY
52.1% DON'T KNOW WHERE TO LOOK
51.4% INCONSISTENCY IN HOW WE TAG
     CONTENT
50.0% LACK OF ADEQUATE TAGS
33.1% DON’T KNOW WHAT TO LOOK FOR
Wikipedia De!nition
“Enterprise search is the practice of
making content from multiple
enterprise-type sources, such as
databases and intranets, searchable to a
de!ned audience.”
http://en.wikipedia.org/wiki/Enterprise_search
The Concept of Enterprise
 Search: Precision
 In the !eld of information retrieval, precision is the
 fraction of retrieved documents that are relevant to the
 search.


 Precision takes all retrieved documents into account,
 but it can also be evaluated at a given cut-o" rank,
 considering only the topmost results returned by the
 system. This measure is called precision at n or P@n.
                                             Source: Wikipedia
The Concept of Enterprise
 Search: Recall
 Recall in information retrieval is the fraction of the
 documents that are relevant to the query that are
 successfully retrieved.


 For example for text search on a set of documents recall
 is the number of correct results divided by the number
 of results that should have been returned.
                                                Source: Wikipedia
Precision and Recall


                        R number of
       M number of                               N number of
                        retrieved documents
   relevant documents                            retrieved documents
                        that are also relevant
Precision and Recall
Recall = R / M =
Number of retrieved documents that are
also relevant / Total number of relevant
documents.
Precision = R / N =
Number of retrieved documents that are
also relevant / Total number of retrieved
documents.
Relevance
...enterprises typically have to use other query-
independent factors, such as a document's recency or
popularity, along with query-dependent factors
traditionally associated with information retrieval
algorithms. Also, the rich functionality of enterprise
search UIs, such as clustering and faceting, diminish
reliance on ranking as the means to direct the user's
attention.
                                         Source: Wikipedia
PageRank
Relevance
We do not have PageRank...
...but we have social!
Social Reconnects Enterprise Search
Emails, People Catalogues, Connections,
Tagging, Sharing etc.
The Concept of Enterprise Search
Search based Solutions
Examples of implementations:
- People Search
- Product Search
- Document Search
- Intranet and Website Search
- E-commerce
- Dashboard / Search as a Service
Information / Content
• Good Data/Information hygiene
• Crap in = Crap out
• Metadata is very important!
• Taxonomy and Metadata demysti!ed
• TetraPak example (video)
• SimCorp example
• VGR example (video)
•List




        yeraze
svenwerk
HCE (SWEDEN)
DEWEY DECIMAL CLASSIFICATION
KristianNorling
Author: Douglas Coupland
Title: Hej Nostradamus!
Publisher: Norstedts
Year: 2003
Printed by: Smedjebacken
Printed: 2004



                     KristianNorling
Metadata
Semantic




           KristianNorling
ESEO: Actionable activities
Example: Ernst & Young
• Metadata
• Titles


• Content Quality
• Information Life Cycle Management
Show me the Money
But, an average Search budget is 100K Euro
• TCO
• ROI
• KPI


Search Analytics is key
Search Analytics
Important, delivers actionable to-dos quickly
• 0-results
• Top Terms Searched for

Video: Search Analytics in Practice
User Satisfaction
• Feedback form
• KPI from Search Analytics
• Session time x n:o sessions = Time spent
  on search x hourly price = Cost per
  “answer”
• Add search re!nements + exit page (=is
  the right answer)
Findability by Findwise

                       1. BUSINESS
Build solutions to support your business processes and goals
                     2. INFORMATION
          Prepare information to make it !ndable
                         3. USERS
        Build usable solutions based on user needs
                    4. ORGANISATION
        Govern and improve your solution over time
                 5. SEARCH TECHNOLOGY
Build solutions based on state-of-the-art search technology
Business
• Analyze how your business goals and
strategies can be met by improved
information access
• Set Findability goals. Examples; increase the
revenue on sales, raise productivity, improve
knowledge sharing, better collaboration
• Specify your requirements
• De!ne KPI’s and measure the success of your
investments
Information
• Clean up and archive or delete outdated/
unrelevant information
• Ensure good quality of information by
adding structured and suitable metadata
• Create and use information models and
taxonomies
• Tagging?
Users
• Get to know your users and their needs
• Make sure your solution is easy to use
• Perform continuous usability evaluations,
like usage tests and expert evaluations
• Make sure users !nd what they are looking
for
• Enable feedback loops for complaints,
feedback and praise
Organisation
• Resources!
• De!ne processes, roles and routines to
govern the solution
• Perform Search Analytics
• Create easy to use administration
interfaces
• Perform training, technical and editorial
• Help publishers get started with processes
for better !ndability
Search Technology
• Select a suitable search platform or make
the most of your current solution
• Design your architecture with search-as-a-
service in mind
• Utilise the full potential of the selected
technology
Kristian Norling
  Kristian Norling
     LinkedIn
  @kristiannorling
    @!ndwise
   !ndwise.com
  Findability Blog
     Slideshare
      Vimeo
    Newsroom

More Related Content

What's hot

Taxonomies And Search Aiim Mn
Taxonomies And Search Aiim MnTaxonomies And Search Aiim Mn
Taxonomies And Search Aiim MnAIIM Minnesota
 
How to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right WebinarHow to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right WebinarConcept Searching, Inc
 
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy WebinarThe Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy WebinarConcept Searching, Inc
 
How to be successful with search in your organisation
How to be successful with search in your organisationHow to be successful with search in your organisation
How to be successful with search in your organisationvoginip
 
Enterprise analytics: Strategies and partnerships
Enterprise analytics: Strategies and partnershipsEnterprise analytics: Strategies and partnerships
Enterprise analytics: Strategies and partnershipsWilliam O'Shea
 
Change Your Search to Find – SharePoint and Office 365 Webinar
Change Your Search to Find – SharePoint and Office 365 WebinarChange Your Search to Find – SharePoint and Office 365 Webinar
Change Your Search to Find – SharePoint and Office 365 WebinarConcept Searching, Inc
 
OK So Enterprise Search is "Janky" - Now What?
OK So Enterprise Search is "Janky" - Now What?OK So Enterprise Search is "Janky" - Now What?
OK So Enterprise Search is "Janky" - Now What?Earley Information Science
 
Km World Taxonomy Boot Camp 2011
Km World Taxonomy Boot Camp  2011Km World Taxonomy Boot Camp  2011
Km World Taxonomy Boot Camp 2011ajrhem
 
Taxonomy 101: Presented at Taxonomy Boot Camp 2019
Taxonomy 101: Presented at Taxonomy Boot Camp 2019Taxonomy 101: Presented at Taxonomy Boot Camp 2019
Taxonomy 101: Presented at Taxonomy Boot Camp 2019Enterprise Knowledge
 
Information Architecture Exposing the Secret Sauce for Success
Information Architecture Exposing the Secret Sauce for Success Information Architecture Exposing the Secret Sauce for Success
Information Architecture Exposing the Secret Sauce for Success Baltimore SharePoint (BSPUG)
 
Data-Ed Online: Let's Talk Metadata: Strategies and Successes
Data-Ed Online: Let's Talk Metadata: Strategies and Successes Data-Ed Online: Let's Talk Metadata: Strategies and Successes
Data-Ed Online: Let's Talk Metadata: Strategies and Successes Data Blueprint
 
Going Meta in SharePoint – Tricks of the Trade
Going Meta in SharePoint – Tricks of the TradeGoing Meta in SharePoint – Tricks of the Trade
Going Meta in SharePoint – Tricks of the TradeConcept Searching, Inc
 
11 Strategic Considerations & Davinci Demo
11 Strategic Considerations & Davinci Demo11 Strategic Considerations & Davinci Demo
11 Strategic Considerations & Davinci DemoChristian Buckley
 
Groundbreaking and Game-changing Enterprise Search Webinar
Groundbreaking and Game-changing Enterprise Search WebinarGroundbreaking and Game-changing Enterprise Search Webinar
Groundbreaking and Game-changing Enterprise Search WebinarConcept Searching, Inc
 
Transform Your Downstream Cloud Analytics with Data Quality 
Transform Your Downstream Cloud Analytics with Data Quality Transform Your Downstream Cloud Analytics with Data Quality 
Transform Your Downstream Cloud Analytics with Data Quality Precisely
 
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...Concept Searching, Inc
 
LDM Webinar: Data Modeling & Metadata Management
LDM Webinar: Data Modeling & Metadata ManagementLDM Webinar: Data Modeling & Metadata Management
LDM Webinar: Data Modeling & Metadata ManagementDATAVERSITY
 

What's hot (20)

Taxonomies And Search Aiim Mn
Taxonomies And Search Aiim MnTaxonomies And Search Aiim Mn
Taxonomies And Search Aiim Mn
 
How to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right WebinarHow to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right Webinar
 
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy WebinarThe Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
 
How to be successful with search in your organisation
How to be successful with search in your organisationHow to be successful with search in your organisation
How to be successful with search in your organisation
 
Enterprise analytics: Strategies and partnerships
Enterprise analytics: Strategies and partnershipsEnterprise analytics: Strategies and partnerships
Enterprise analytics: Strategies and partnerships
 
Change Your Search to Find – SharePoint and Office 365 Webinar
Change Your Search to Find – SharePoint and Office 365 WebinarChange Your Search to Find – SharePoint and Office 365 Webinar
Change Your Search to Find – SharePoint and Office 365 Webinar
 
OK So Enterprise Search is "Janky" - Now What?
OK So Enterprise Search is "Janky" - Now What?OK So Enterprise Search is "Janky" - Now What?
OK So Enterprise Search is "Janky" - Now What?
 
Km World Taxonomy Boot Camp 2011
Km World Taxonomy Boot Camp  2011Km World Taxonomy Boot Camp  2011
Km World Taxonomy Boot Camp 2011
 
Taxonomy 101: Presented at Taxonomy Boot Camp 2019
Taxonomy 101: Presented at Taxonomy Boot Camp 2019Taxonomy 101: Presented at Taxonomy Boot Camp 2019
Taxonomy 101: Presented at Taxonomy Boot Camp 2019
 
Information Architecture Exposing the Secret Sauce for Success
Information Architecture Exposing the Secret Sauce for Success Information Architecture Exposing the Secret Sauce for Success
Information Architecture Exposing the Secret Sauce for Success
 
Data-Ed Online: Let's Talk Metadata: Strategies and Successes
Data-Ed Online: Let's Talk Metadata: Strategies and Successes Data-Ed Online: Let's Talk Metadata: Strategies and Successes
Data-Ed Online: Let's Talk Metadata: Strategies and Successes
 
Going Meta in SharePoint – Tricks of the Trade
Going Meta in SharePoint – Tricks of the TradeGoing Meta in SharePoint – Tricks of the Trade
Going Meta in SharePoint – Tricks of the Trade
 
11 Strategic Considerations & Davinci Demo
11 Strategic Considerations & Davinci Demo11 Strategic Considerations & Davinci Demo
11 Strategic Considerations & Davinci Demo
 
Groundbreaking and Game-changing Enterprise Search Webinar
Groundbreaking and Game-changing Enterprise Search WebinarGroundbreaking and Game-changing Enterprise Search Webinar
Groundbreaking and Game-changing Enterprise Search Webinar
 
Km share point and semantik-va
Km   share point and semantik-vaKm   share point and semantik-va
Km share point and semantik-va
 
Transform Your Downstream Cloud Analytics with Data Quality 
Transform Your Downstream Cloud Analytics with Data Quality Transform Your Downstream Cloud Analytics with Data Quality 
Transform Your Downstream Cloud Analytics with Data Quality 
 
SharePoint Fest Chicago Presentation
SharePoint Fest Chicago PresentationSharePoint Fest Chicago Presentation
SharePoint Fest Chicago Presentation
 
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
 
LDM Webinar: Data Modeling & Metadata Management
LDM Webinar: Data Modeling & Metadata ManagementLDM Webinar: Data Modeling & Metadata Management
LDM Webinar: Data Modeling & Metadata Management
 
Taxonomy 101
Taxonomy 101Taxonomy 101
Taxonomy 101
 

Viewers also liked

Linked Data and Citizen Participation - Next Gen of Muncipality Service
Linked Data and Citizen Participation - Next Gen of Muncipality ServiceLinked Data and Citizen Participation - Next Gen of Muncipality Service
Linked Data and Citizen Participation - Next Gen of Muncipality ServiceFredric Landqvist
 
Enterprise Search: How do we get there from here?
Enterprise Search: How do we get there from here?Enterprise Search: How do we get there from here?
Enterprise Search: How do we get there from here?Daniel Tunkelang
 
Science of the Interwebs
Science of the InterwebsScience of the Interwebs
Science of the Interwebsnitchmarketing
 
Ranking algorithms
Ranking algorithmsRanking algorithms
Ranking algorithmsAnkit Raj
 
Introduction to Search Engines
Introduction to Search EnginesIntroduction to Search Engines
Introduction to Search EnginesNitin Pande
 
An Introduction to Elastic Search.
An Introduction to Elastic Search.An Introduction to Elastic Search.
An Introduction to Elastic Search.Jurriaan Persyn
 

Viewers also liked (7)

Linked Data and Citizen Participation - Next Gen of Muncipality Service
Linked Data and Citizen Participation - Next Gen of Muncipality ServiceLinked Data and Citizen Participation - Next Gen of Muncipality Service
Linked Data and Citizen Participation - Next Gen of Muncipality Service
 
Enterprise Search: How do we get there from here?
Enterprise Search: How do we get there from here?Enterprise Search: How do we get there from here?
Enterprise Search: How do we get there from here?
 
Science of the Interwebs
Science of the InterwebsScience of the Interwebs
Science of the Interwebs
 
Ranking algorithms
Ranking algorithmsRanking algorithms
Ranking algorithms
 
Introduction to Search Engines
Introduction to Search EnginesIntroduction to Search Engines
Introduction to Search Engines
 
Search engines
Search enginesSearch engines
Search engines
 
An Introduction to Elastic Search.
An Introduction to Elastic Search.An Introduction to Elastic Search.
An Introduction to Elastic Search.
 

Similar to Introduction to Enterprise Search

Optimising Your Content for findability
Optimising Your Content for findabilityOptimising Your Content for findability
Optimising Your Content for findabilityKristian Norling
 
Relevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search TechnologiesRelevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search Technologiesenterprisesearchmeetup
 
How to be Successful with Search in YOUR Organization
How to be Successful with Search in YOUR OrganizationHow to be Successful with Search in YOUR Organization
How to be Successful with Search in YOUR OrganizationAgnes Molnar
 
SharePoint Saturday Belgium 2013 Intranet search fail
SharePoint Saturday Belgium 2013 Intranet search failSharePoint Saturday Belgium 2013 Intranet search fail
SharePoint Saturday Belgium 2013 Intranet search failBIWUG
 
SPSBE14 Intranet Search #fail
SPSBE14 Intranet Search #failSPSBE14 Intranet Search #fail
SPSBE14 Intranet Search #failBen van Mol
 
Enterprise search Information
Enterprise search Information Enterprise search Information
Enterprise search Information Netwoven Inc.
 
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.comEnhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.comSimon Hughes
 
Keyword research tools for Search Engine Optimisation (SEO)
Keyword research tools for Search Engine Optimisation (SEO)Keyword research tools for Search Engine Optimisation (SEO)
Keyword research tools for Search Engine Optimisation (SEO)Duncan MacGruer
 
Search Quality Management
Search Quality ManagementSearch Quality Management
Search Quality ManagementAgnes Molnar
 
Predictive Analytics, AI and the Promise of Personalization
Predictive Analytics, AI and the Promise of PersonalizationPredictive Analytics, AI and the Promise of Personalization
Predictive Analytics, AI and the Promise of PersonalizationEarley Information Science
 
Taxonomy and tagging – manual tagging does not work!
Taxonomy and tagging – manual tagging does not work!Taxonomy and tagging – manual tagging does not work!
Taxonomy and tagging – manual tagging does not work!Concept Searching, Inc
 
Approaching Big Data: Lesson Plan
Approaching Big Data: Lesson Plan Approaching Big Data: Lesson Plan
Approaching Big Data: Lesson Plan Bessie Chu
 
Business Analytics and Data mining.pdf
Business Analytics and Data mining.pdfBusiness Analytics and Data mining.pdf
Business Analytics and Data mining.pdfssuser0413ec
 
Marketing AI - How to Build a Keyword Ontology
Marketing AI - How to Build a Keyword OntologyMarketing AI - How to Build a Keyword Ontology
Marketing AI - How to Build a Keyword OntologyDan Segal
 
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...SEAD
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolutionitnewsafrica
 
Webinar: Increase Conversion With Better Search
Webinar: Increase Conversion With Better SearchWebinar: Increase Conversion With Better Search
Webinar: Increase Conversion With Better SearchLucidworks
 

Similar to Introduction to Enterprise Search (20)

Optimising Your Content for findability
Optimising Your Content for findabilityOptimising Your Content for findability
Optimising Your Content for findability
 
Relevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search TechnologiesRelevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search Technologies
 
How to be Successful with Search in YOUR Organization
How to be Successful with Search in YOUR OrganizationHow to be Successful with Search in YOUR Organization
How to be Successful with Search in YOUR Organization
 
Digital Economics
Digital EconomicsDigital Economics
Digital Economics
 
SharePoint Saturday Belgium 2013 Intranet search fail
SharePoint Saturday Belgium 2013 Intranet search failSharePoint Saturday Belgium 2013 Intranet search fail
SharePoint Saturday Belgium 2013 Intranet search fail
 
SPSBE14 Intranet Search #fail
SPSBE14 Intranet Search #failSPSBE14 Intranet Search #fail
SPSBE14 Intranet Search #fail
 
Enterprise search Information
Enterprise search Information Enterprise search Information
Enterprise search Information
 
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.comEnhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
Enhancing Enterprise Search with Machine Learning - Simon Hughes, Dice.com
 
Keyword research tools for Search Engine Optimisation (SEO)
Keyword research tools for Search Engine Optimisation (SEO)Keyword research tools for Search Engine Optimisation (SEO)
Keyword research tools for Search Engine Optimisation (SEO)
 
Search Quality Management
Search Quality ManagementSearch Quality Management
Search Quality Management
 
Predictive Analytics, AI and the Promise of Personalization
Predictive Analytics, AI and the Promise of PersonalizationPredictive Analytics, AI and the Promise of Personalization
Predictive Analytics, AI and the Promise of Personalization
 
Taxonomy and tagging – manual tagging does not work!
Taxonomy and tagging – manual tagging does not work!Taxonomy and tagging – manual tagging does not work!
Taxonomy and tagging – manual tagging does not work!
 
Approaching Big Data: Lesson Plan
Approaching Big Data: Lesson Plan Approaching Big Data: Lesson Plan
Approaching Big Data: Lesson Plan
 
Business Analytics and Data mining.pdf
Business Analytics and Data mining.pdfBusiness Analytics and Data mining.pdf
Business Analytics and Data mining.pdf
 
Marketing AI - How to Build a Keyword Ontology
Marketing AI - How to Build a Keyword OntologyMarketing AI - How to Build a Keyword Ontology
Marketing AI - How to Build a Keyword Ontology
 
Search Analytics - Comperio
Search Analytics - ComperioSearch Analytics - Comperio
Search Analytics - Comperio
 
FAST Search-webinar-06-29-2010
FAST Search-webinar-06-29-2010FAST Search-webinar-06-29-2010
FAST Search-webinar-06-29-2010
 
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
Webinar: Increase Conversion With Better Search
Webinar: Increase Conversion With Better SearchWebinar: Increase Conversion With Better Search
Webinar: Increase Conversion With Better Search
 

More from Findwise

White Arkitekter - Findability Day Roadshow 2017
White Arkitekter - Findability Day Roadshow 2017White Arkitekter - Findability Day Roadshow 2017
White Arkitekter - Findability Day Roadshow 2017Findwise
 
AI och maskininlärning - Findability Day Roadshow 2017
AI och maskininlärning - Findability Day Roadshow 2017AI och maskininlärning - Findability Day Roadshow 2017
AI och maskininlärning - Findability Day Roadshow 2017Findwise
 
De kognitiva eran med IBM Watson - Findability Day Roadshow 2017
De kognitiva eran med IBM Watson - Findability Day Roadshow 2017De kognitiva eran med IBM Watson - Findability Day Roadshow 2017
De kognitiva eran med IBM Watson - Findability Day Roadshow 2017Findwise
 
Findwise and IBM Watson
Findwise and IBM WatsonFindwise and IBM Watson
Findwise and IBM WatsonFindwise
 
Findability Day 2016 - Enterprise Search and Findability Survey 2016
Findability Day 2016 - Enterprise Search and Findability Survey 2016Findability Day 2016 - Enterprise Search and Findability Survey 2016
Findability Day 2016 - Enterprise Search and Findability Survey 2016Findwise
 
Findability Day 2016 - Enterprise Search and Findability Survey 2016
Findability Day 2016 - Enterprise Search and Findability Survey 2016Findability Day 2016 - Enterprise Search and Findability Survey 2016
Findability Day 2016 - Enterprise Search and Findability Survey 2016Findwise
 
Findability Day 2016 - Big data analytics and machine learning
Findability Day 2016 - Big data analytics and machine learningFindability Day 2016 - Big data analytics and machine learning
Findability Day 2016 - Big data analytics and machine learningFindwise
 
Findability Day 2016 - Enterprise social collaboration
Findability Day 2016 - Enterprise social collaborationFindability Day 2016 - Enterprise social collaboration
Findability Day 2016 - Enterprise social collaborationFindwise
 
Findability Day 2016 - SKF case study
Findability Day 2016 - SKF case studyFindability Day 2016 - SKF case study
Findability Day 2016 - SKF case studyFindwise
 
Findability Day 2016 - Structuring content for user experience
Findability Day 2016 - Structuring content for user experienceFindability Day 2016 - Structuring content for user experience
Findability Day 2016 - Structuring content for user experienceFindwise
 
Findability Day 2016 - Augmented intelligence
Findability Day 2016 - Augmented intelligenceFindability Day 2016 - Augmented intelligence
Findability Day 2016 - Augmented intelligenceFindwise
 
Findability Day 2016 - What is GDPR?
Findability Day 2016 - What is GDPR?Findability Day 2016 - What is GDPR?
Findability Day 2016 - What is GDPR?Findwise
 
Findability Day 2016 - Get started with GDPR
Findability Day 2016 - Get started with GDPRFindability Day 2016 - Get started with GDPR
Findability Day 2016 - Get started with GDPRFindwise
 
Digital workplace och informationshantering i office 365
Digital workplace och informationshantering i office 365Digital workplace och informationshantering i office 365
Digital workplace och informationshantering i office 365Findwise
 
Findability Day 2015 - Mickel Grönroos - Findwise - How to increase safety on...
Findability Day 2015 - Mickel Grönroos - Findwise - How to increase safety on...Findability Day 2015 - Mickel Grönroos - Findwise - How to increase safety on...
Findability Day 2015 - Mickel Grönroos - Findwise - How to increase safety on...Findwise
 
Findability Day 2015 - Abby Covert - Keynote - How to make sense of any mess
Findability Day 2015 - Abby Covert - Keynote - How to make sense of any messFindability Day 2015 - Abby Covert - Keynote - How to make sense of any mess
Findability Day 2015 - Abby Covert - Keynote - How to make sense of any messFindwise
 
Findability Day 2015 - Noel Garry - IBM - Information governance and a 360 de...
Findability Day 2015 - Noel Garry - IBM - Information governance and a 360 de...Findability Day 2015 - Noel Garry - IBM - Information governance and a 360 de...
Findability Day 2015 - Noel Garry - IBM - Information governance and a 360 de...Findwise
 
Findability Day 2015 Mattias Ellison - Findwise - Enterprise Search and fin...
Findability Day 2015   Mattias Ellison - Findwise - Enterprise Search and fin...Findability Day 2015   Mattias Ellison - Findwise - Enterprise Search and fin...
Findability Day 2015 Mattias Ellison - Findwise - Enterprise Search and fin...Findwise
 
Findability Day 2015 - Martin White - The future is search!
Findability Day 2015 - Martin White - The future is search!Findability Day 2015 - Martin White - The future is search!
Findability Day 2015 - Martin White - The future is search!Findwise
 
Findability Day 2015 Liam Holley - Dassault systems - Insight and discovery...
Findability Day 2015   Liam Holley - Dassault systems - Insight and discovery...Findability Day 2015   Liam Holley - Dassault systems - Insight and discovery...
Findability Day 2015 Liam Holley - Dassault systems - Insight and discovery...Findwise
 

More from Findwise (20)

White Arkitekter - Findability Day Roadshow 2017
White Arkitekter - Findability Day Roadshow 2017White Arkitekter - Findability Day Roadshow 2017
White Arkitekter - Findability Day Roadshow 2017
 
AI och maskininlärning - Findability Day Roadshow 2017
AI och maskininlärning - Findability Day Roadshow 2017AI och maskininlärning - Findability Day Roadshow 2017
AI och maskininlärning - Findability Day Roadshow 2017
 
De kognitiva eran med IBM Watson - Findability Day Roadshow 2017
De kognitiva eran med IBM Watson - Findability Day Roadshow 2017De kognitiva eran med IBM Watson - Findability Day Roadshow 2017
De kognitiva eran med IBM Watson - Findability Day Roadshow 2017
 
Findwise and IBM Watson
Findwise and IBM WatsonFindwise and IBM Watson
Findwise and IBM Watson
 
Findability Day 2016 - Enterprise Search and Findability Survey 2016
Findability Day 2016 - Enterprise Search and Findability Survey 2016Findability Day 2016 - Enterprise Search and Findability Survey 2016
Findability Day 2016 - Enterprise Search and Findability Survey 2016
 
Findability Day 2016 - Enterprise Search and Findability Survey 2016
Findability Day 2016 - Enterprise Search and Findability Survey 2016Findability Day 2016 - Enterprise Search and Findability Survey 2016
Findability Day 2016 - Enterprise Search and Findability Survey 2016
 
Findability Day 2016 - Big data analytics and machine learning
Findability Day 2016 - Big data analytics and machine learningFindability Day 2016 - Big data analytics and machine learning
Findability Day 2016 - Big data analytics and machine learning
 
Findability Day 2016 - Enterprise social collaboration
Findability Day 2016 - Enterprise social collaborationFindability Day 2016 - Enterprise social collaboration
Findability Day 2016 - Enterprise social collaboration
 
Findability Day 2016 - SKF case study
Findability Day 2016 - SKF case studyFindability Day 2016 - SKF case study
Findability Day 2016 - SKF case study
 
Findability Day 2016 - Structuring content for user experience
Findability Day 2016 - Structuring content for user experienceFindability Day 2016 - Structuring content for user experience
Findability Day 2016 - Structuring content for user experience
 
Findability Day 2016 - Augmented intelligence
Findability Day 2016 - Augmented intelligenceFindability Day 2016 - Augmented intelligence
Findability Day 2016 - Augmented intelligence
 
Findability Day 2016 - What is GDPR?
Findability Day 2016 - What is GDPR?Findability Day 2016 - What is GDPR?
Findability Day 2016 - What is GDPR?
 
Findability Day 2016 - Get started with GDPR
Findability Day 2016 - Get started with GDPRFindability Day 2016 - Get started with GDPR
Findability Day 2016 - Get started with GDPR
 
Digital workplace och informationshantering i office 365
Digital workplace och informationshantering i office 365Digital workplace och informationshantering i office 365
Digital workplace och informationshantering i office 365
 
Findability Day 2015 - Mickel Grönroos - Findwise - How to increase safety on...
Findability Day 2015 - Mickel Grönroos - Findwise - How to increase safety on...Findability Day 2015 - Mickel Grönroos - Findwise - How to increase safety on...
Findability Day 2015 - Mickel Grönroos - Findwise - How to increase safety on...
 
Findability Day 2015 - Abby Covert - Keynote - How to make sense of any mess
Findability Day 2015 - Abby Covert - Keynote - How to make sense of any messFindability Day 2015 - Abby Covert - Keynote - How to make sense of any mess
Findability Day 2015 - Abby Covert - Keynote - How to make sense of any mess
 
Findability Day 2015 - Noel Garry - IBM - Information governance and a 360 de...
Findability Day 2015 - Noel Garry - IBM - Information governance and a 360 de...Findability Day 2015 - Noel Garry - IBM - Information governance and a 360 de...
Findability Day 2015 - Noel Garry - IBM - Information governance and a 360 de...
 
Findability Day 2015 Mattias Ellison - Findwise - Enterprise Search and fin...
Findability Day 2015   Mattias Ellison - Findwise - Enterprise Search and fin...Findability Day 2015   Mattias Ellison - Findwise - Enterprise Search and fin...
Findability Day 2015 Mattias Ellison - Findwise - Enterprise Search and fin...
 
Findability Day 2015 - Martin White - The future is search!
Findability Day 2015 - Martin White - The future is search!Findability Day 2015 - Martin White - The future is search!
Findability Day 2015 - Martin White - The future is search!
 
Findability Day 2015 Liam Holley - Dassault systems - Insight and discovery...
Findability Day 2015   Liam Holley - Dassault systems - Insight and discovery...Findability Day 2015   Liam Holley - Dassault systems - Insight and discovery...
Findability Day 2015 Liam Holley - Dassault systems - Insight and discovery...
 

Recently uploaded

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 

Recently uploaded (20)

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 

Introduction to Enterprise Search

  • 1. INTRODUCTION TO ENTERPRISE SEARCH Kristian Norling
  • 2. Introduction • Who is here? • Your expectations? • Kristian? • 2 hours, one break • Lifetime answer Guarantee on this class
  • 4. Agenda • Problem • History of (web) search • How we search and !nd? • Current state of Enterprise Search + stats • Technical concept • Information quality • Feedback cycle • Five dimensions of Findability
  • 5. •List mrflip
  • 8. The Problems • Growing amounts of Information • Changing patterns of information consumption • Information silos • Web like behaviour > Information !lters • Internal information use is still in the Digital Stone Age
  • 9. History of Search In Academia search is called Information Retrieval. It is an old discipline, dating back thousands of years... Basic concepts in Information Retrieval: Recall and Precision, more later...
  • 10. Directories vs. Search Engines • Directories are manually compiled taxonomies of websites • Directories are far more costly and time intensive to maintain • Directories lack coverage, although it provides an important alternative, especially for novice surfers • Search engines rely mainly on automated search algorithms • Search engines rank pages by popularity on the web, the more referrals (links) the more relevant
  • 11. Early days of Web Search Yahoo – searchable directory (1994, ~10000 websites) • Integrates  search  over  its  directory.  Organized  by  subject   ma8ers.  Sites  can  be  suggested,  but  human  editors  control   quality  of  directory  (~100  dedicated  editors) Ask – natural language search engine (1998) • used  human  editors  to  match  popular  queries.  Tried   different  algorithms  to  rank  pages  by  popularity Google – searchable index (1998) • Developed  Pagerank,  popularity  algorithm  that  hides  bad   content.  Set  standards  (spellchecking,  query  suggesIon,   search  results  page  design)
  • 12. Web Search - evolution First generation (1995-97) – AltaVista, Excite, WebCrawler Uses mostly on-page data (text and formatting). Informational queries. Second generation (1998-2010) – Google, Yahoo Use o"-page, web-speci!c data: link analysis, anchor-text, click- through data. Informational and navigational queries. Third generation (2010-present) – Google, Wolfram-Alpha, Bing Blend data from many sources, tries to answer ‘‘the need behind the query’’: semantic analysis, context determination, dynamic database selection etc. Informational, navigational, and transactional queries.
  • 13. Seeking information modes: Informational Find information assumed to be available on the web in a static form.
  • 14. Seeking information modes: Navigational Reach a particular site that the user has in mind, either because they visited it in the past or because they assume that such a site exists. Have usually only one "right" result.
  • 15. Seeking information modes: Transactional Reach a site where further interaction will happen. This interaction constitutes the transaction de!ning these queries. The main categories for such queries are shopping, !nding various web-mediated services, downloading various type of !le (images, songs, etc), accessing certain data-bases (e.g. Yellow Pages type data), !nding servers (e.g.for gaming) etc.
  • 16. Four modes of seeking information Finding something when I know what I want and have words to describe it.
  • 17. Four modes of seeking information Exploring when I only have some idea of what I want and may lack the words to articulate it.
  • 18. Four modes of seeking information Finding relevant items when I don’t know what I need.
  • 19. Four modes of seeking information Finding something I have seen before, but can’t remember where.
  • 20. The State of Enterprise Search • Amount of information is growing everyday • What to Search for? • Where to Search? • How to Search? • Search is simple, complex and powerful • Findability Dimensions
  • 21. STATS FROM THE “ENTERPRISE SEARCH AND FINDABILITY SURVEY 2012” SIGN-UP
  • 22. HOW CRITICAL IS FINDING THE RIGHT INFORMATION TO BUSINESS GOALS AND SUCCESS?
  • 23. EUROPE 76.5% IMPERATIVE/SIGNIFICANT
  • 25. IS IT EASY TO FIND THE RIGHT INFORMATION WITHIN YOUR ORGANISATION TODAY?
  • 26. EUROPE 77% MODERATELY/VERY HARD
  • 29. EUROPE 18.5% MOSTLY/VERY SATISFIED
  • 30. WHAT ARE THE OBSTACLES TO FINDING THE RIGHT INFORMATION?
  • 31. Globally 63.4% POOR SEARCH FUNCTIONALITY 52.1% DON'T KNOW WHERE TO LOOK 51.4% INCONSISTENCY IN HOW WE TAG CONTENT 50.0% LACK OF ADEQUATE TAGS 33.1% DON’T KNOW WHAT TO LOOK FOR
  • 32. Wikipedia De!nition “Enterprise search is the practice of making content from multiple enterprise-type sources, such as databases and intranets, searchable to a de!ned audience.” http://en.wikipedia.org/wiki/Enterprise_search
  • 33. The Concept of Enterprise Search: Precision In the !eld of information retrieval, precision is the fraction of retrieved documents that are relevant to the search. Precision takes all retrieved documents into account, but it can also be evaluated at a given cut-o" rank, considering only the topmost results returned by the system. This measure is called precision at n or P@n. Source: Wikipedia
  • 34. The Concept of Enterprise Search: Recall Recall in information retrieval is the fraction of the documents that are relevant to the query that are successfully retrieved. For example for text search on a set of documents recall is the number of correct results divided by the number of results that should have been returned. Source: Wikipedia
  • 35. Precision and Recall R number of M number of N number of retrieved documents relevant documents retrieved documents that are also relevant
  • 36. Precision and Recall Recall = R / M = Number of retrieved documents that are also relevant / Total number of relevant documents. Precision = R / N = Number of retrieved documents that are also relevant / Total number of retrieved documents.
  • 37. Relevance ...enterprises typically have to use other query- independent factors, such as a document's recency or popularity, along with query-dependent factors traditionally associated with information retrieval algorithms. Also, the rich functionality of enterprise search UIs, such as clustering and faceting, diminish reliance on ranking as the means to direct the user's attention. Source: Wikipedia
  • 39. Relevance We do not have PageRank... ...but we have social! Social Reconnects Enterprise Search Emails, People Catalogues, Connections, Tagging, Sharing etc.
  • 40. The Concept of Enterprise Search
  • 41. Search based Solutions Examples of implementations: - People Search - Product Search - Document Search - Intranet and Website Search - E-commerce - Dashboard / Search as a Service
  • 42. Information / Content • Good Data/Information hygiene • Crap in = Crap out • Metadata is very important! • Taxonomy and Metadata demysti!ed • TetraPak example (video) • SimCorp example • VGR example (video)
  • 43. •List yeraze
  • 45. HCE (SWEDEN) DEWEY DECIMAL CLASSIFICATION
  • 47. Author: Douglas Coupland Title: Hej Nostradamus! Publisher: Norstedts Year: 2003 Printed by: Smedjebacken Printed: 2004 KristianNorling
  • 48. Metadata Semantic KristianNorling
  • 49. ESEO: Actionable activities Example: Ernst & Young • Metadata • Titles • Content Quality • Information Life Cycle Management
  • 50. Show me the Money But, an average Search budget is 100K Euro • TCO • ROI • KPI Search Analytics is key
  • 51. Search Analytics Important, delivers actionable to-dos quickly • 0-results • Top Terms Searched for Video: Search Analytics in Practice
  • 52. User Satisfaction • Feedback form • KPI from Search Analytics • Session time x n:o sessions = Time spent on search x hourly price = Cost per “answer” • Add search re!nements + exit page (=is the right answer)
  • 53. Findability by Findwise 1. BUSINESS Build solutions to support your business processes and goals 2. INFORMATION Prepare information to make it !ndable 3. USERS Build usable solutions based on user needs 4. ORGANISATION Govern and improve your solution over time 5. SEARCH TECHNOLOGY Build solutions based on state-of-the-art search technology
  • 54. Business • Analyze how your business goals and strategies can be met by improved information access • Set Findability goals. Examples; increase the revenue on sales, raise productivity, improve knowledge sharing, better collaboration • Specify your requirements • De!ne KPI’s and measure the success of your investments
  • 55. Information • Clean up and archive or delete outdated/ unrelevant information • Ensure good quality of information by adding structured and suitable metadata • Create and use information models and taxonomies • Tagging?
  • 56. Users • Get to know your users and their needs • Make sure your solution is easy to use • Perform continuous usability evaluations, like usage tests and expert evaluations • Make sure users !nd what they are looking for • Enable feedback loops for complaints, feedback and praise
  • 57. Organisation • Resources! • De!ne processes, roles and routines to govern the solution • Perform Search Analytics • Create easy to use administration interfaces • Perform training, technical and editorial • Help publishers get started with processes for better !ndability
  • 58. Search Technology • Select a suitable search platform or make the most of your current solution • Design your architecture with search-as-a- service in mind • Utilise the full potential of the selected technology
  • 59. Kristian Norling Kristian Norling LinkedIn @kristiannorling @!ndwise !ndwise.com Findability Blog Slideshare Vimeo Newsroom

Editor's Notes

  1. \n
  2. \n
  3. \n
  4. What do you want to know?\n
  5. \n
  6. We humans love to collect information, we have a harder time deleting/archiving.\nWhen we start valuing information correctly we can also motivate investments in search and put processes in place to keep information updated AND with high quality. \nInformation hygiene. Structure, metadata.\nInfonomics = information as an asset in the balance sheet. \n
  7. Is this how you feel information is organised and structured in your organsation?\n
  8. Is the information you need stored in a silo somewhere?\n
  9. \n
  10. \n
  11. \n
  12. \n
  13. Yahoo: The directory is organized by subject matter, the top level containing categories such as Arts and Humanities, Business and Economy, Computers and the Internet, Education, Government, Health, News and Media, Recreation and Sports, Science, Society and Culture, and so on.\nThe natural hierarchical structure of the directory allows users easy navigation through and across its categories.\nThe directory is not strictly hierarchical, as it has many cross-references between categories from different parts of the hierarchy. For example, the subcategory Musicals under the Theater category, has a reference to the Movies and Film subcategory, which comes under Entertainment.\nWeb directories provide an important alternative to search engines, especially for novice surfers, as the directory structure makes it is easy to find relevant information provided when an appropriate category for the search query can be found. The fundamental drawback of directories is their lack of coverage.\nKnowing the category of a web page that a user clicked on is very indicative of the user's interests, and may be used to recommend to the user similar pages from the same or a related category. To solve the problem of how to automatically associate a web page with a category we need to make use of machine learning techniques for automatic categorization of web pages\n
  14. Yahoo: The directory is organized by subject matter, the top level containing categories such as Arts and Humanities, Business and Economy, Computers and the Internet, Education, Government, Health, News and Media, Recreation and Sports, Science, Society and Culture, and so on.\nThe natural hierarchical structure of the directory allows users easy navigation through and across its categories.\nThe directory is not strictly hierarchical, as it has many cross-references between categories from different parts of the hierarchy. For example, the subcategory Musicals under the Theater category, has a reference to the Movies and Film subcategory, which comes under Entertainment.\nWeb directories provide an important alternative to search engines, especially for novice surfers, as the directory structure makes it is easy to find relevant information provided when an appropriate category for the search query can be found. The fundamental drawback of directories is their lack of coverage.\nKnowing the category of a web page that a user clicked on is very indicative of the user's interests, and may be used to recommend to the user similar pages from the same or a related category. To solve the problem of how to automatically associate a web page with a category we need to make use of machine learning techniques for automatic categorization of web pages\n
  15. Yahoo: The directory is organized by subject matter, the top level containing categories such as Arts and Humanities, Business and Economy, Computers and the Internet, Education, Government, Health, News and Media, Recreation and Sports, Science, Society and Culture, and so on.\nThe natural hierarchical structure of the directory allows users easy navigation through and across its categories.\nThe directory is not strictly hierarchical, as it has many cross-references between categories from different parts of the hierarchy. For example, the subcategory Musicals under the Theater category, has a reference to the Movies and Film subcategory, which comes under Entertainment.\nWeb directories provide an important alternative to search engines, especially for novice surfers, as the directory structure makes it is easy to find relevant information provided when an appropriate category for the search query can be found. The fundamental drawback of directories is their lack of coverage.\nKnowing the category of a web page that a user clicked on is very indicative of the user's interests, and may be used to recommend to the user similar pages from the same or a related category. To solve the problem of how to automatically associate a web page with a category we need to make use of machine learning techniques for automatic categorization of web pages\n
  16. Yahoo: The directory is organized by subject matter, the top level containing categories such as Arts and Humanities, Business and Economy, Computers and the Internet, Education, Government, Health, News and Media, Recreation and Sports, Science, Society and Culture, and so on.\nThe natural hierarchical structure of the directory allows users easy navigation through and across its categories.\nThe directory is not strictly hierarchical, as it has many cross-references between categories from different parts of the hierarchy. For example, the subcategory Musicals under the Theater category, has a reference to the Movies and Film subcategory, which comes under Entertainment.\nWeb directories provide an important alternative to search engines, especially for novice surfers, as the directory structure makes it is easy to find relevant information provided when an appropriate category for the search query can be found. The fundamental drawback of directories is their lack of coverage.\nKnowing the category of a web page that a user clicked on is very indicative of the user's interests, and may be used to recommend to the user similar pages from the same or a related category. To solve the problem of how to automatically associate a web page with a category we need to make use of machine learning techniques for automatic categorization of web pages\n
  17. Yahoo: The directory is organized by subject matter, the top level containing categories such as Arts and Humanities, Business and Economy, Computers and the Internet, Education, Government, Health, News and Media, Recreation and Sports, Science, Society and Culture, and so on.\nThe natural hierarchical structure of the directory allows users easy navigation through and across its categories.\nThe directory is not strictly hierarchical, as it has many cross-references between categories from different parts of the hierarchy. For example, the subcategory Musicals under the Theater category, has a reference to the Movies and Film subcategory, which comes under Entertainment.\nWeb directories provide an important alternative to search engines, especially for novice surfers, as the directory structure makes it is easy to find relevant information provided when an appropriate category for the search query can be found. The fundamental drawback of directories is their lack of coverage.\nKnowing the category of a web page that a user clicked on is very indicative of the user's interests, and may be used to recommend to the user similar pages from the same or a related category. To solve the problem of how to automatically associate a web page with a category we need to make use of machine learning techniques for automatic categorization of web pages\n
  18. Yahoo: The directory is organized by subject matter, the top level containing categories such as Arts and Humanities, Business and Economy, Computers and the Internet, Education, Government, Health, News and Media, Recreation and Sports, Science, Society and Culture, and so on.\nThe natural hierarchical structure of the directory allows users easy navigation through and across its categories.\nThe directory is not strictly hierarchical, as it has many cross-references between categories from different parts of the hierarchy. For example, the subcategory Musicals under the Theater category, has a reference to the Movies and Film subcategory, which comes under Entertainment.\nWeb directories provide an important alternative to search engines, especially for novice surfers, as the directory structure makes it is easy to find relevant information provided when an appropriate category for the search query can be found. The fundamental drawback of directories is their lack of coverage.\nKnowing the category of a web page that a user clicked on is very indicative of the user's interests, and may be used to recommend to the user similar pages from the same or a related category. To solve the problem of how to automatically associate a web page with a category we need to make use of machine learning techniques for automatic categorization of web pages\n
  19. Yahoo: The directory is organized by subject matter, the top level containing categories such as Arts and Humanities, Business and Economy, Computers and the Internet, Education, Government, Health, News and Media, Recreation and Sports, Science, Society and Culture, and so on.\nThe natural hierarchical structure of the directory allows users easy navigation through and across its categories.\nThe directory is not strictly hierarchical, as it has many cross-references between categories from different parts of the hierarchy. For example, the subcategory Musicals under the Theater category, has a reference to the Movies and Film subcategory, which comes under Entertainment.\nWeb directories provide an important alternative to search engines, especially for novice surfers, as the directory structure makes it is easy to find relevant information provided when an appropriate category for the search query can be found. The fundamental drawback of directories is their lack of coverage.\nKnowing the category of a web page that a user clicked on is very indicative of the user's interests, and may be used to recommend to the user similar pages from the same or a related category. To solve the problem of how to automatically associate a web page with a category we need to make use of machine learning techniques for automatic categorization of web pages\n
  20. Navigational queries. reach a particular site that the user has in mind, either because they visited it in the past or because they assume that such a site exists. have usually only one "right" result.\nInformational queries. find information assumed to be available on the web in a static form. \nTransactional queries. reach a site where further interaction will happen. This interaction constitutes the transaction defining these queries. The main categories for such queries are shopping, finding various web-mediated services, downloading various type of file (images, songs, etc), accessing certain data-bases (e.g. Yellow Pages type data), finding servers (e.g.for gaming) etc.\n \n 2nd gen. - Google, first engine to use link analysis as a primary ranking factor and DirectHit concentrated on click-through data. By now, all major engines use all these types of data. Link analysis and anchortext seems crucial for navigational queries.\n3rd gen. - For instance on a query like San Francisco the engine might present direct links to a hotel reservation page for San Francisco, a map server, a weather server, etc.\nRapidly changing landscape\n
  21. Navigational queries. reach a particular site that the user has in mind, either because they visited it in the past or because they assume that such a site exists. have usually only one "right" result.\nInformational queries. find information assumed to be available on the web in a static form. \nTransactional queries. reach a site where further interaction will happen. This interaction constitutes the transaction defining these queries. The main categories for such queries are shopping, finding various web-mediated services, downloading various type of file (images, songs, etc), accessing certain data-bases (e.g. Yellow Pages type data), finding servers (e.g.for gaming) etc.\n \n 2nd gen. - Google, first engine to use link analysis as a primary ranking factor and DirectHit concentrated on click-through data. By now, all major engines use all these types of data. Link analysis and anchortext seems crucial for navigational queries.\n3rd gen. - For instance on a query like San Francisco the engine might present direct links to a hotel reservation page for San Francisco, a map server, a weather server, etc.\nRapidly changing landscape\n
  22. Navigational queries. reach a particular site that the user has in mind, either because they visited it in the past or because they assume that such a site exists. have usually only one "right" result.\nInformational queries. find information assumed to be available on the web in a static form. \nTransactional queries. reach a site where further interaction will happen. This interaction constitutes the transaction defining these queries. The main categories for such queries are shopping, finding various web-mediated services, downloading various type of file (images, songs, etc), accessing certain data-bases (e.g. Yellow Pages type data), finding servers (e.g.for gaming) etc.\n \n 2nd gen. - Google, first engine to use link analysis as a primary ranking factor and DirectHit concentrated on click-through data. By now, all major engines use all these types of data. Link analysis and anchortext seems crucial for navigational queries.\n3rd gen. - For instance on a query like San Francisco the engine might present direct links to a hotel reservation page for San Francisco, a map server, a weather server, etc.\nRapidly changing landscape\n
  23. Navigational queries. reach a particular site that the user has in mind, either because they visited it in the past or because they assume that such a site exists. have usually only one "right" result.\nInformational queries. find information assumed to be available on the web in a static form. \nTransactional queries. reach a site where further interaction will happen. This interaction constitutes the transaction defining these queries. The main categories for such queries are shopping, finding various web-mediated services, downloading various type of file (images, songs, etc), accessing certain data-bases (e.g. Yellow Pages type data), finding servers (e.g.for gaming) etc.\n \n 2nd gen. - Google, first engine to use link analysis as a primary ranking factor and DirectHit concentrated on click-through data. By now, all major engines use all these types of data. Link analysis and anchortext seems crucial for navigational queries.\n3rd gen. - For instance on a query like San Francisco the engine might present direct links to a hotel reservation page for San Francisco, a map server, a weather server, etc.\nRapidly changing landscape\n
  24. Navigational queries. reach a particular site that the user has in mind, either because they visited it in the past or because they assume that such a site exists. have usually only one "right" result.\nInformational queries. find information assumed to be available on the web in a static form. \nTransactional queries. reach a site where further interaction will happen. This interaction constitutes the transaction defining these queries. The main categories for such queries are shopping, finding various web-mediated services, downloading various type of file (images, songs, etc), accessing certain data-bases (e.g. Yellow Pages type data), finding servers (e.g.for gaming) etc.\n \n 2nd gen. - Google, first engine to use link analysis as a primary ranking factor and DirectHit concentrated on click-through data. By now, all major engines use all these types of data. Link analysis and anchortext seems crucial for navigational queries.\n3rd gen. - For instance on a query like San Francisco the engine might present direct links to a hotel reservation page for San Francisco, a map server, a weather server, etc.\nRapidly changing landscape\n
  25. Navigational queries. reach a particular site that the user has in mind, either because they visited it in the past or because they assume that such a site exists. have usually only one "right" result.\nInformational queries. find information assumed to be available on the web in a static form. \nTransactional queries. reach a site where further interaction will happen. This interaction constitutes the transaction defining these queries. The main categories for such queries are shopping, finding various web-mediated services, downloading various type of file (images, songs, etc), accessing certain data-bases (e.g. Yellow Pages type data), finding servers (e.g.for gaming) etc.\n \n 2nd gen. - Google, first engine to use link analysis as a primary ranking factor and DirectHit concentrated on click-through data. By now, all major engines use all these types of data. Link analysis and anchortext seems crucial for navigational queries.\n3rd gen. - For instance on a query like San Francisco the engine might present direct links to a hotel reservation page for San Francisco, a map server, a weather server, etc.\nRapidly changing landscape\n
  26. Navigational queries. reach a particular site that the user has in mind, either because they visited it in the past or because they assume that such a site exists. have usually only one "right" result.\nInformational queries. find information assumed to be available on the web in a static form. \nTransactional queries. reach a site where further interaction will happen. This interaction constitutes the transaction defining these queries. The main categories for such queries are shopping, finding various web-mediated services, downloading various type of file (images, songs, etc), accessing certain data-bases (e.g. Yellow Pages type data), finding servers (e.g.for gaming) etc.\n \n 2nd gen. - Google, first engine to use link analysis as a primary ranking factor and DirectHit concentrated on click-through data. By now, all major engines use all these types of data. Link analysis and anchortext seems crucial for navigational queries.\n3rd gen. - For instance on a query like San Francisco the engine might present direct links to a hotel reservation page for San Francisco, a map server, a weather server, etc.\nRapidly changing landscape\n
  27. Navigational queries. reach a particular site that the user has in mind, either because they visited it in the past or because they assume that such a site exists. have usually only one "right" result.\nInformational queries. find information assumed to be available on the web in a static form. \nTransactional queries. reach a site where further interaction will happen. This interaction constitutes the transaction defining these queries. The main categories for such queries are shopping, finding various web-mediated services, downloading various type of file (images, songs, etc), accessing certain data-bases (e.g. Yellow Pages type data), finding servers (e.g.for gaming) etc.\n \n 2nd gen. - Google, first engine to use link analysis as a primary ranking factor and DirectHit concentrated on click-through data. By now, all major engines use all these types of data. Link analysis and anchortext seems crucial for navigational queries.\n3rd gen. - For instance on a query like San Francisco the engine might present direct links to a hotel reservation page for San Francisco, a map server, a weather server, etc.\nRapidly changing landscape\n
  28. Navigational queries. reach a particular site that the user has in mind, either because they visited it in the past or because they assume that such a site exists. have usually only one "right" result.\nInformational queries. find information assumed to be available on the web in a static form. \nTransactional queries. reach a site where further interaction will happen. This interaction constitutes the transaction defining these queries. The main categories for such queries are shopping, finding various web-mediated services, downloading various type of file (images, songs, etc), accessing certain data-bases (e.g. Yellow Pages type data), finding servers (e.g.for gaming) etc.\n \n 2nd gen. - Google, first engine to use link analysis as a primary ranking factor and DirectHit concentrated on click-through data. By now, all major engines use all these types of data. Link analysis and anchortext seems crucial for navigational queries.\n3rd gen. - For instance on a query like San Francisco the engine might present direct links to a hotel reservation page for San Francisco, a map server, a weather server, etc.\nRapidly changing landscape\n
  29. Navigational queries. reach a particular site that the user has in mind, either because they visited it in the past or because they assume that such a site exists. have usually only one "right" result.\nInformational queries. find information assumed to be available on the web in a static form. \nTransactional queries. reach a site where further interaction will happen. This interaction constitutes the transaction defining these queries. The main categories for such queries are shopping, finding various web-mediated services, downloading various type of file (images, songs, etc), accessing certain data-bases (e.g. Yellow Pages type data), finding servers (e.g.for gaming) etc.\n \n 2nd gen. - Google, first engine to use link analysis as a primary ranking factor and DirectHit concentrated on click-through data. By now, all major engines use all these types of data. Link analysis and anchortext seems crucial for navigational queries.\n3rd gen. - For instance on a query like San Francisco the engine might present direct links to a hotel reservation page for San Francisco, a map server, a weather server, etc.\nRapidly changing landscape\n
  30. \n
  31. \n
  32. \n
  33. \n
  34. \n
  35. \n
  36. \n
  37. \n
  38. Information silos. They are everywhere. \nEnterprise Search can “integrate” them.\n
  39. \n
  40. \n
  41. \n
  42. \n
  43. \n
  44. \n
  45. \n
  46. \n
  47. \n
  48. \n
  49. \n
  50. \n
  51. \n
  52. On intranets or our web site search we do not have the equivalent of PageRank.\nWe can’t use the amount of inbound link as a factor for relevancy. \nWe have to find other ways...\n
  53. \n
  54. \n
  55. \n
  56. \n
  57. \n
  58. \n
  59. \n
  60. \n
  61. \n
  62. \n
  63. \n
  64. \n
  65. \n
  66. \n
  67. \n
  68. \n
  69. \n
  70. \n
  71. \n
  72. \n
  73. \n
  74. \n
  75. \n
  76. \n
  77. \n
  78. \n
  79. \n
  80. \n
  81. \n
  82. \n
  83. \n
  84. \n
  85. \n
  86. \n
  87. \n
  88. \n
  89. \n
  90. \n
  91. \n
  92. \n
  93. \n
  94. \n
  95. \n
  96. \n
  97. \n
  98. \n
  99. \n
  100. \n
  101. \n
  102. \n