SlideShare a Scribd company logo
SharePoint 2010                SharePoint 2013
Managed Property               (Multiple) Search Schemas
Best Bets                      Promoted Results (Query Rule)
Scope and Federated Location   Result Source
Content By Query               Content By Search
Incremental Crawl              Continuous Crawl
MCM                            MCSM
Continous Crawl Benefits        Continus Crawl Facts

• No more waiting for index   • Runs every 15 minutes by
  merge                         default
• Does not wait for other     • Default interval can be
  crawls to complete            changed with PowerShell
• Can have multiple           • Should be used instead of
  continuous crawls running     incremental crawls for
  simultaneously                SharePoint content sources
• Continuous crawls ignores
  errors
HTTP
  Other
                File Share                                              End User Query
User Profile
                                                                              Or
               Content                                                  Process Initiated
 SharePoint    Sources                                                       Query



                                 Content                   Query
            Crawl                              Index
                                Processing               Processing
          Component                          Component
                                Component                Component


                                 Analytics
                                Processing     Link        Index
             Crawl                                       Partition(s)
                                Component    Database
          Database(s)

                                                                          Event Store

                    Analytics
                    Database
What it Does                       Important Facts

• Crawls content sources to           • We can have multiple crawl
  populate index                        components
• Delivers crawl items (binary) and   • MS Recommends: 2 Crawl
  metadata to content processor         Components per Search Service
• Invokes connectors or protocol        Application
  handlers to interact with content   • MS Recommends: 8(4vm) CPU /
  sources to retrieve data              8GB RAM per Crawl Component
• Uses one or more crawl
  databases to store info about
  crawl items and crawl history
What it Does                                  Important Facts

• Processes crawl items and feeds to index     • We must only have one (1) crawl
  component                                      processing component per server – more
• Transforms crawl items into artifacts that     will hurt, not help crawl performance
  can be included in search index              • Max of 2 per search service application
  (Performs document parsing and               • Feeding Sessions are scaled based on
  property mapping)                              CPU cores using a default coefficient of 3
• Writes information about links and urls               8 (cores) * 3 = 24 feeding sessions
  in link database (which are analyzed by               4 (cores) * 3 = 12 feeding sessions
  analytics to calculate relevance and         • MS Recommends: 8(4vm) CPU / 8GB
  currency - Results written back to search      RAM per Content Processing Component
  index by content processing component        • Feeding sessions require RAM – More
• Generates phonetic name variations to          RAM is necessary when more cores are
  improve people search                          present – monitoring required
What it Does                                   Important Facts

• Runs analytics jobs that analyze crawl items   • Maximum of 6 per search service
  and user interaction with search results to      application
  perform both search analytics and usage        • Add more Analytics Processing Components
  analytics                                        to improve analytics performance
• Analyzes Link & Anchor text analysis, Clear    • MS Recommends: 8(4vm) CPU / 8GB RAM /
  distance, Search Clicks, Deep Links, Social      300GB disk space per Analytics Processing
  Tags, Social Distance, Search Reports,           Component.
  Recommendations, Usage Counts, Activity        • Interacts with Analytics Reporting to store
  Ranking                                          statistical information
• Improves search relevance and create           • Interacts with Link database to store
  search results                                   information about searches and crawled
• Output included in search index by content       documents
  processor
What it Does                              Important Facts

• Receives processed items from content   • Maximum of 60 index partitions (20
  processing component and writes the       index partitions X 3 index replicas) per
  items to the index file                   search service application
• Receives queries from the query         • Must provision one Index Component
  processing component and returns          for each index replica.
  result sets                             • MS Recommends: 8(4vm) CPU / 16GB
• Redistributes content among index         RAM / 500GB disk space per Index
  partitions when index architecture is     Component.
  changed by Search Administration
  Component
• Index partition is logical portion of entire search index (same
                 as before)
               • Index partition is served by one or more index components
               • Index components can be primary "replica" or secondary

   Index         "replica"
               • Primary Replica is contacted by content processing
                 component to write new data in the index
Architecture   • Secondary Replica is read only copy that get updated with
                 the data.
               • Adding replicas improves query performance under load
               • Add partitions to handle increased content corpus
               • Can't remove partition after it has been added.
What it Does                              Important Facts

• Analyzes and processes queries and         • Maximum of 1 per server
  results                                    • MS Recommends: 8(4vm) CPU / 8GB
• After receiving a query, it analyzes and     RAM per Query Processing
  processes the query to optimize              Component.
  precision, recall and relevance
• Submits processed queries to the index
  component
• Processes the result set returned by
  the index component before returning
  to the querying entity.
Get-SPEnterpriseSearchService
                                             Get-SPEnterpriseSearchServiceApplication
Get-SPEnterpriseSearchStatus                                                                     Get-SPEnterpriseSearchQueryAndSiteSettingsService     Get-SPEnterpriseSearchLanguageResourcePhrase
                                             Get-SPEnterpriseSearchServiceApplicationProxy
New-SPEnterpriseSearchAdminComponent                                                             Get-                                                  Get-SPEnterpriseSearchSiteHitRule
                                             Get-SPEnterpriseSearchServiceInstance               SPEnterpriseSearchQueryAndSiteSettingsServiceInstan   New-SPEnterpriseSearchLanguageResourcePhrase
                                             New-SPEnterpriseSearchServiceApplication            ce
Get-SPEnterpriseSearchCrawlContentSource                                                                                                               New-SPEnterpriseSearchSiteHitRule
                                             New-SPEnterpriseSearchServiceApplicationProxy       Get-
Get-SPEnterpriseSearchCrawlCustomConnector                                                                                                             Remove-SPEnterpriseSearchLanguageResourcePhrase
                                             Remove-SPEnterpriseSearchServiceApplication         SPEnterpriseSearchQueryAndSiteSettingsServiceProxy
Get-SPEnterpriseSearchCrawlDatabase                                                                                                                    Remove-SPEnterpriseSearchSiteHitRule
                                             Remove-SPEnterpriseSearchServiceApplicationProxy    Get-SPEnterpriseSearchQueryAuthority
Get-SPEnterpriseSearchCrawlExtension                                                                                                                   Get-SPEnterpriseSearchVssDataPath
                                             Restore-SPEnterpriseSearchServiceApplication        Get-SPEnterpriseSearchQueryDemoted
Get-SPEnterpriseSearchCrawlMapping                                                                                                                     Get-
                                             Resume-SPEnterpriseSearchServiceApplication         Get-SPEnterpriseSearchQueryKeyword                    SPEnterpriseSearchContentEnrichmentConfiguration
Get-SPEnterpriseSearchCrawlRule
                                             Set-SPEnterpriseSearchService                       Get-SPEnterpriseSearchQueryScope                      Set-SPEnterpriseSearchPrimaryHostController
New-SPEnterpriseSearchCrawlComponent
                                             Set-SPEnterpriseSearchServiceApplication            Get-SPEnterpriseSearchQueryScopeRule                  Set-SPEnterpriseSearchLinguisticComponentsStatus
New-SPEnterpriseSearchCrawlContentSource
                                             Set-SPEnterpriseSearchServiceApplicationProxy       Get-SPEnterpriseSearchQuerySuggestionCandidates       Set-
New-SPEnterpriseSearchCrawlCustomConnector
                                             Start-SPEnterpriseSearchServiceInstance             Get-SPEnterpriseSearchRankingModel                    SPEnterpriseSearchContentEnrichmentConfiguration
New-SPEnterpriseSearchCrawlDatabase
                                             Stop-SPEnterpriseSearchServiceInstance              Get-SPEnterpriseSearchSecurityTrimmer                 Remove-
New-SPEnterpriseSearchCrawlExtension
                                             Suspend-SPEnterpriseSearchServiceApplication        New-SPEnterpriseSearchQueryAuthority                  SPEnterpriseSearchContentEnrichmentConfiguration
New-SPEnterpriseSearchCrawlMapping
                                             Upgrade-SPEnterpriseSearchServiceApplication        New-SPEnterpriseSearchQueryDemoted                    New-
New-SPEnterpriseSearchCrawlRule                                                                                                                        SPEnterpriseSearchContentEnrichmentConfiguration
                                             Backup-SPEnterpriseSearchServiceApplicationIndex    New-SPEnterpriseSearchQueryKeyword
Remove-SPEnterpriseSearchCrawlContentSource                                                                                                            Get-SPEnterpriseSearchLinguisticComponentsStatus
                                             Upgrade-                                            New-SPEnterpriseSearchQueryScope
Remove-                                                                                                                                                Get-SPEnterpriseSearchHostController
                                             SPEnterpriseSearchServiceApplicationSiteSettings    New-SPEnterpriseSearchQueryScopeRule
SPEnterpriseSearchCrawlCustomConnector
                                             Restore-SPEnterpriseSearchServiceApplicationIndex   New-SPEnterpriseSearchRankingModel                    Set-SPEnterpriseSearchLinksDatabase
Remove-SPEnterpriseSearchCrawlDatabase
                                             Remove-                                             New-SPEnterpriseSearchSecurityTrimmer                 Repartition-SPEnterpriseSearchLinksDatabases
Remove-SPEnterpriseSearchCrawlExtension
                                             SPEnterpriseSearchServiceApplicationSiteSettings    Remove-SPEnterpriseSearchQueryAuthority               Move-SPEnterpriseSearchLinksDatabases
Remove-SPEnterpriseSearchCrawlMapping
                                             Get-SPEnterpriseSearchOwner                         Remove-SPEnterpriseSearchQueryDemoted                 Remove-SPEnterpriseSearchTenantSchema
Remove-SPEnterpriseSearchCrawlRule
                                             Suspend-SPEnterpriseSearchServiceApplication        Remove-SPEnterpriseSearchQueryKeyword                 Remove-SPEnterpriseSearchTenantConfiguration
Set-SPEnterpriseSearchCrawlContentSource
                                             Set-SPEnterpriseSearchServiceInstance               Remove-SPEnterpriseSearchQueryScope                   Remove-SPEnterpriseSearchLinksDatabase
Set-SPEnterpriseSearchCrawlDatabase
                                                                                                 Remove-SPEnterpriseSearchQueryScopeRule               Remove-SPEnterpriseSearchFileFormat
Set-SPEnterpriseSearchCrawlRule
                                             Get-SPEnterpriseSearchMetadataCategory              Remove-SPEnterpriseSearchRankingModel                 New-SPEnterpriseSearchLinksDatabase
Set-SPEnterpriseSearchCrawlLogReadPermission
                                             Get-SPEnterpriseSearchMetadataCrawledProperty       Remove-SPEnterpriseSearchSecurityTrimmer              New-SPEnterpriseSearchFileFormat
Remove-
                                             Get-SPEnterpriseSearchMetadataManagedProperty       Set-SPEnterpriseSearchQueryAuthority                  New-
SPEnterpriseSearchCrawlLogReadPermission
                                             Get-SPEnterpriseSearchMetadataMapping               Set-SPEnterpriseSearchQueryKeyword                    SPEnterpriseSearchAnalyticsProcessingComponent
Remove-
SPEnterpriseSearchCrawlLogReadPermission     New-SPEnterpriseSearchMetadataCategory              Set-SPEnterpriseSearchQueryScope                      Import-SPEnterpriseSearchCustomExtractionDictionary
                                             New-SPEnterpriseSearchMetadataCrawledProperty       Set-SPEnterpriseSearchQueryScopeRule                  Get-SPEnterpriseSearchLinksDatabase
Import-SPEnterpriseSearchTopology            New-SPEnterpriseSearchMetadataManagedProperty       Set-SPEnterpriseSearchRankingModel                    Get-SPEnterpriseSearchFileFormat
Export-SPEnterpriseSearchTopology            New-SPEnterpriseSearchMetadataMapping               Start-                                                Set-SPEnterpriseSearchFileFormatState
Set-SPEnterpriseSearchTopology               Remove-SPEnterpriseSearchMetadataCategory           SPEnterpriseSearchQueryAndSiteSettingsServiceInstan   Get-SPEnterpriseSearchComponent
Remove-SPEnterpriseSearchTopology            Remove-                                             ce                                                    Get-
                                             SPEnterpriseSearchMetadataManagedProperty           Stop-                                                 SPEnterpriseSearchServiceApplicationBackupStore
Remove-SPEnterpriseSearchComponent
                                             Remove-SPEnterpriseSearchMetadataMapping            SPEnterpriseSearchQueryAndSiteSettingsServiceInstan
New-SPEnterpriseSearchTopology                                                                   ce
                                             Set-SPEnterpriseSearchMetadataCategory
New-                                                                                             Import-SPEnterpriseSearchPopularQueries
SPEnterpriseSearchQueryProcessingComponent Set-SPEnterpriseSearchMetadataCrawledProperty
                                             Set-SPEnterpriseSearchMetadataManagedProperty       Set-SPEnterpriseSearchResultItemType
New-SPEnterpriseSearchIndexComponent
                                             Set-SPEnterpriseSearchMetadataMapping               Set-SPEnterpriseSearchQuerySpellingCorrection
Host 1                                              Host 2                                    Host 5                                   Host 6

  Web server               Web server                   Web server               Web server
                                                                                                    All SharePoint databases                  All SharePoint databases


  Application              Office                       Application              Office        Search admin db            Link db
  Server                   Web Apps                     Server                   Web Apps
                           Server                                                Server           Crawl db              Analytics db    Redundant copies of all databases using
                                                                                                                                        SQL clustering, mirroring, or SQL Server
                                                                                                        SharePoint Config db                        2012 AlwaysOn

                                                                                                   All other SharePoint databases
Host 3                                              Host 4

  Application Server                                    Application Server

       Query Processing                                     Query Processing

                   Replica         Index part ition 0       Replica




  Application Server                                    Application Server

      Crawl                                                 Crawl

       Admin                                                Admin

       Analytics                                            Analytics

      Content processing                                    Content processing
Host A                                            Host B                     Host E                                   Host F

  Application Server                                    Application Server

               Query Processing

                       Replica     Index part ition 0        Replica
                                                                                Application Server                        Application Server
                                                                                                         Analytics                                  Analytics
  Application Server                                    Application Server                      Content processing                        Content processing


                                                                                Application Server                        Application Server
                       Replica     Index part ition 1        Replica
                                                                                                           Admin                                      Admin

                                                                                   Crawl        Content processing            Crawl       Content processing


Host C                                            Host D
                                                                             Host G                                   Host H
  Application Server                                    Application Server

                Query Processing                                                  SharePoint databases                      SharePoint databases

                       Replica     Index part ition 2         Replica                                   Crawl db

                                                                              Search admin db           Crawl db
                                                                                                                       Redundant copies of all databases using
  Application Server                                    Application Server        Link db              Analytics db    SQL clustering, mirroring, or SQL Server
                                                                                                                                   2012 AlwaysOn


                       Replica     Index part ition 3         Replica
Host A                                              Host B                     Host C                                              Host D                     Host K                                   Host L                                      Host M                                 Host N



    Application Server                                    Application Server       Application Server                                    Application Server


                  Query Processing                                                               Query Processing

                                                                                                        Replica     Index part ition 2        Replica
                         Replica     Index part ition 0        Replica
                                                                                                                                                                 Application Server                        Application Server                         Application Server                       Application Server
                                                                                                                                                                                           Analytics                                 Analytics                                Analytics                                  Analytics

    Application Server                                    Application Server       Application Server                                    Application Server                                                                                                         Content processing                         Content processing
                                                                                                                                                                                  Content processing                       Content processing


                                                                                                                                                                  Application Server                       Application Server                         Application Server                       Application Server
                                     Index part ition 1                                                 Replica     Index part ition 3        Replica
                         Replica                               Replica
                                                                                                                                                                                                                                                                              Analytics                                  Analytics

                                                                                                                                                                                 Crawl       Admin                        Crawl        Admin                        Content processing                         Content processing



Host E                                              Host F                     Host G                                              Host H
                                                                                                                                                              Host O                                   Host P                                      Host Q                                 Host R

    Application Server                                    Application Server       Application Server                                    Application Server
                                                                                                                                                                   SharePoint databases                      SharePoint databases                      SharePoint databases                     SharePoint databases
                 Query Processing                                                               Query Processing

                                     Index part ition 4        Replica                                  Replica     Index part ition 6        Replica
                         Replica
                                                                                                                                                               Search admin db                                                                          Link db
                                                                                                                                                                                                        Redundant copies of all databases using         Crawl db                           Redundant copies of all databases using
                                                                                                                                                                 Analytics db                                                                                                              SQL clustering, mirroring, or SQL Server
    Application Server                                    Application Server       Application Server                                    Application Server                                             SQL clustering, mirroring, or SQL Server
                                                                                                                                                                                                                    2012 AlwaysOn                       Crawl db                                       2012 AlwaysOn
                                                                                                                                                                 Analytics db

                                                                                                                                                                  Crawl db                                                                              Crawl db

                         Replica     Index part ition 5        Replica                                  Replica     Index part ition 7        Replica
                                                                                                                                                                  Crawl db




Host I                                              Host J



    Application Server                                    Application Server



                         Replica     Index part ition 8         Replica




    Application Server                                    Application Server



                         Replica     Index part ition 9         Replica
Schema can be managed by site admins, reducing the load on search administrator


     Schema can be configured to allow more granularity (query, retrieve, refine, sort, etc) - Affects
     content index size

       Remote result sources can be crawled locally and then queried by remote farms. Huge impact
       on geo-distributed search… KL may be able to help!


       Individual items can be re-crawled easily


     Automatic URL balancing in crawl databases minimizes host name restrictions for large archive
     repositories

Scalability limit changes will have a big impact on farm design for large archive content repositories in
the near future.
SharePoint 2013  Search Architecture with Russ Houberg

More Related Content

What's hot

(ATS6-PLAT02) Accelrys Catalog and Protocol Validation
(ATS6-PLAT02) Accelrys Catalog and Protocol Validation(ATS6-PLAT02) Accelrys Catalog and Protocol Validation
(ATS6-PLAT02) Accelrys Catalog and Protocol Validation
BIOVIA
 
MongoDB and Hadoop: Driving Business Insights
MongoDB and Hadoop: Driving Business InsightsMongoDB and Hadoop: Driving Business Insights
MongoDB and Hadoop: Driving Business Insights
MongoDB
 
ITPro's taking the SharePoint 2013 Red Pill
ITPro's taking the SharePoint 2013 Red PillITPro's taking the SharePoint 2013 Red Pill
ITPro's taking the SharePoint 2013 Red Pill
Sentri
 
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
Lucidworks
 
Search On Hadoop
Search On HadoopSearch On Hadoop
Search On Hadoop
bigdatagurus_meetup
 
Integrate Solr with real-time stream processing applications
Integrate Solr with real-time stream processing applicationsIntegrate Solr with real-time stream processing applications
Integrate Solr with real-time stream processing applications
thelabdude
 
MongoDB et Hadoop
MongoDB et HadoopMongoDB et Hadoop
MongoDB et HadoopMongoDB
 
In search of: A meetup about Liferay and Search 2016-04-20
In search of: A meetup about Liferay and Search   2016-04-20In search of: A meetup about Liferay and Search   2016-04-20
In search of: A meetup about Liferay and Search 2016-04-20
Tibor Lipusz
 
Customizing Ranking Models for Enterprise Search: Presented by Ammar Haris & ...
Customizing Ranking Models for Enterprise Search: Presented by Ammar Haris & ...Customizing Ranking Models for Enterprise Search: Presented by Ammar Haris & ...
Customizing Ranking Models for Enterprise Search: Presented by Ammar Haris & ...
Lucidworks
 
Deduplication Using Solr: Presented by Neeraj Jain, Stubhub
Deduplication Using Solr: Presented by Neeraj Jain, StubhubDeduplication Using Solr: Presented by Neeraj Jain, Stubhub
Deduplication Using Solr: Presented by Neeraj Jain, Stubhub
Lucidworks
 
Adding Search to the Hadoop Ecosystem
Adding Search to the Hadoop EcosystemAdding Search to the Hadoop Ecosystem
Adding Search to the Hadoop EcosystemCloudera, Inc.
 
Cost-based Query Optimization
Cost-based Query Optimization Cost-based Query Optimization
Cost-based Query Optimization
DataWorks Summit/Hadoop Summit
 
Optimizing Your Search Experience
Optimizing Your Search ExperienceOptimizing Your Search Experience
Optimizing Your Search Experience
Sumo Logic
 
Search On Hadoop Frontier Meetup
Search On Hadoop Frontier MeetupSearch On Hadoop Frontier Meetup
Search On Hadoop Frontier Meetupgregchanan
 
Solr+Hadoop = Big Data Search
Solr+Hadoop = Big Data SearchSolr+Hadoop = Big Data Search
Solr+Hadoop = Big Data Search
Cloudera, Inc.
 
Introduction to Lucidworks Fusion - Alexander Kanarsky, Lucidworks
Introduction to Lucidworks Fusion - Alexander Kanarsky, LucidworksIntroduction to Lucidworks Fusion - Alexander Kanarsky, Lucidworks
Introduction to Lucidworks Fusion - Alexander Kanarsky, Lucidworks
Lucidworks
 
Practical SharePoint 2010 Architecture Planning
Practical SharePoint 2010 Architecture PlanningPractical SharePoint 2010 Architecture Planning
Practical SharePoint 2010 Architecture Planning
Chakkaradeep Chandran
 
Introduction to Cloudera Search Training
Introduction to Cloudera Search TrainingIntroduction to Cloudera Search Training
Introduction to Cloudera Search Training
Cloudera, Inc.
 
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
Lucidworks
 
Python Awareness for Exploration and Production Students and Professionals
Python Awareness for Exploration and Production Students and ProfessionalsPython Awareness for Exploration and Production Students and Professionals
Python Awareness for Exploration and Production Students and Professionals
Yohanes Nuwara
 

What's hot (20)

(ATS6-PLAT02) Accelrys Catalog and Protocol Validation
(ATS6-PLAT02) Accelrys Catalog and Protocol Validation(ATS6-PLAT02) Accelrys Catalog and Protocol Validation
(ATS6-PLAT02) Accelrys Catalog and Protocol Validation
 
MongoDB and Hadoop: Driving Business Insights
MongoDB and Hadoop: Driving Business InsightsMongoDB and Hadoop: Driving Business Insights
MongoDB and Hadoop: Driving Business Insights
 
ITPro's taking the SharePoint 2013 Red Pill
ITPro's taking the SharePoint 2013 Red PillITPro's taking the SharePoint 2013 Red Pill
ITPro's taking the SharePoint 2013 Red Pill
 
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
Evolving The Optimal Relevancy Scoring Model at Dice.com: Presented by Simon ...
 
Search On Hadoop
Search On HadoopSearch On Hadoop
Search On Hadoop
 
Integrate Solr with real-time stream processing applications
Integrate Solr with real-time stream processing applicationsIntegrate Solr with real-time stream processing applications
Integrate Solr with real-time stream processing applications
 
MongoDB et Hadoop
MongoDB et HadoopMongoDB et Hadoop
MongoDB et Hadoop
 
In search of: A meetup about Liferay and Search 2016-04-20
In search of: A meetup about Liferay and Search   2016-04-20In search of: A meetup about Liferay and Search   2016-04-20
In search of: A meetup about Liferay and Search 2016-04-20
 
Customizing Ranking Models for Enterprise Search: Presented by Ammar Haris & ...
Customizing Ranking Models for Enterprise Search: Presented by Ammar Haris & ...Customizing Ranking Models for Enterprise Search: Presented by Ammar Haris & ...
Customizing Ranking Models for Enterprise Search: Presented by Ammar Haris & ...
 
Deduplication Using Solr: Presented by Neeraj Jain, Stubhub
Deduplication Using Solr: Presented by Neeraj Jain, StubhubDeduplication Using Solr: Presented by Neeraj Jain, Stubhub
Deduplication Using Solr: Presented by Neeraj Jain, Stubhub
 
Adding Search to the Hadoop Ecosystem
Adding Search to the Hadoop EcosystemAdding Search to the Hadoop Ecosystem
Adding Search to the Hadoop Ecosystem
 
Cost-based Query Optimization
Cost-based Query Optimization Cost-based Query Optimization
Cost-based Query Optimization
 
Optimizing Your Search Experience
Optimizing Your Search ExperienceOptimizing Your Search Experience
Optimizing Your Search Experience
 
Search On Hadoop Frontier Meetup
Search On Hadoop Frontier MeetupSearch On Hadoop Frontier Meetup
Search On Hadoop Frontier Meetup
 
Solr+Hadoop = Big Data Search
Solr+Hadoop = Big Data SearchSolr+Hadoop = Big Data Search
Solr+Hadoop = Big Data Search
 
Introduction to Lucidworks Fusion - Alexander Kanarsky, Lucidworks
Introduction to Lucidworks Fusion - Alexander Kanarsky, LucidworksIntroduction to Lucidworks Fusion - Alexander Kanarsky, Lucidworks
Introduction to Lucidworks Fusion - Alexander Kanarsky, Lucidworks
 
Practical SharePoint 2010 Architecture Planning
Practical SharePoint 2010 Architecture PlanningPractical SharePoint 2010 Architecture Planning
Practical SharePoint 2010 Architecture Planning
 
Introduction to Cloudera Search Training
Introduction to Cloudera Search TrainingIntroduction to Cloudera Search Training
Introduction to Cloudera Search Training
 
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
 
Python Awareness for Exploration and Production Students and Professionals
Python Awareness for Exploration and Production Students and ProfessionalsPython Awareness for Exploration and Production Students and Professionals
Python Awareness for Exploration and Production Students and Professionals
 

Similar to SharePoint 2013 Search Architecture with Russ Houberg

MetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
MetaVis Webinar - 10 Things I Like in SharePoint 2013 SearchMetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
MetaVis Webinar - 10 Things I Like in SharePoint 2013 SearchAgnes Molnar
 
ESPC13 - 10 Things I Like in SharePoint 2013 Search
ESPC13 - 10 Things I Like in SharePoint 2013 SearchESPC13 - 10 Things I Like in SharePoint 2013 Search
ESPC13 - 10 Things I Like in SharePoint 2013 SearchAgnes Molnar
 
Building a scalable search architecture in share point 2013
Building a scalable search architecture in share point 2013Building a scalable search architecture in share point 2013
Building a scalable search architecture in share point 2013
Terrence Nguyen
 
10 Things I Like in SharePoint 2013 Search
10 Things I Like in SharePoint 2013 Search10 Things I Like in SharePoint 2013 Search
10 Things I Like in SharePoint 2013 Search
SPC Adriatics
 
SPCAdriatics - 10 Things I Like In SharePoint 2013 Search
SPCAdriatics - 10 Things I Like In SharePoint 2013 SearchSPCAdriatics - 10 Things I Like In SharePoint 2013 Search
SPCAdriatics - 10 Things I Like In SharePoint 2013 SearchAgnes Molnar
 
SharePoint 2013 Search Operations
SharePoint 2013 Search OperationsSharePoint 2013 Search Operations
SharePoint 2013 Search Operations
SPC Adriatics
 
Installing and tweaking FASTSearch
Installing and tweaking FASTSearchInstalling and tweaking FASTSearch
Installing and tweaking FASTSearch
Arno Flapper
 
Fishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar PresentationFishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar PresentationKim Negaard
 
Rev Your Engines - SharePoint Performance Best Practices
Rev Your Engines - SharePoint Performance Best PracticesRev Your Engines - SharePoint Performance Best Practices
Rev Your Engines - SharePoint Performance Best Practices
Eric Shupps
 
Rev Your Engines: SharePoint Performance Best Practices
Rev Your Engines: SharePoint Performance Best PracticesRev Your Engines: SharePoint Performance Best Practices
Rev Your Engines: SharePoint Performance Best Practices
SPC Adriatics
 
Open source enterprise search and retrieval platform
Open source enterprise search and retrieval platformOpen source enterprise search and retrieval platform
Open source enterprise search and retrieval platformmteutelink
 
I2 - SharePoint Hybrid Search Start to Finish - Thomas Vochten
I2 - SharePoint Hybrid Search Start to Finish - Thomas VochtenI2 - SharePoint Hybrid Search Start to Finish - Thomas Vochten
I2 - SharePoint Hybrid Search Start to Finish - Thomas Vochten
SPS Paris
 
#SPSPhilly search topology & optimization
#SPSPhilly search topology & optimization#SPSPhilly search topology & optimization
#SPSPhilly search topology & optimization
Mike Maadarani
 
Ektron 8.5 RC - Search
Ektron 8.5 RC - SearchEktron 8.5 RC - Search
Ektron 8.5 RC - Search
BillCavaUs
 
SharePoint Search Topology and Optimization
SharePoint Search Topology and OptimizationSharePoint Search Topology and Optimization
SharePoint Search Topology and Optimization
Mike Maadarani
 
Overview of Search in SharePoint Server 2013 - Australian SharePoint Conferen...
Overview of Search in SharePoint Server 2013 - Australian SharePoint Conferen...Overview of Search in SharePoint Server 2013 - Australian SharePoint Conferen...
Overview of Search in SharePoint Server 2013 - Australian SharePoint Conferen...
Sezai Komur
 
Search
SearchSearch
Best Practices for SharePoint 2010 Search
Best Practices for SharePoint 2010 SearchBest Practices for SharePoint 2010 Search
Best Practices for SharePoint 2010 SearchAgnes Molnar
 
SharePoint 2013 Search Topology and Optimization
SharePoint 2013 Search Topology and OptimizationSharePoint 2013 Search Topology and Optimization
SharePoint 2013 Search Topology and Optimization
Mike Maadarani
 
Building Search Driven Applications in SharePoint 2010 - SharePoint Fest 2012
Building Search Driven Applications in SharePoint 2010 - SharePoint Fest 2012Building Search Driven Applications in SharePoint 2010 - SharePoint Fest 2012
Building Search Driven Applications in SharePoint 2010 - SharePoint Fest 2012Nik Patel
 

Similar to SharePoint 2013 Search Architecture with Russ Houberg (20)

MetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
MetaVis Webinar - 10 Things I Like in SharePoint 2013 SearchMetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
MetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
 
ESPC13 - 10 Things I Like in SharePoint 2013 Search
ESPC13 - 10 Things I Like in SharePoint 2013 SearchESPC13 - 10 Things I Like in SharePoint 2013 Search
ESPC13 - 10 Things I Like in SharePoint 2013 Search
 
Building a scalable search architecture in share point 2013
Building a scalable search architecture in share point 2013Building a scalable search architecture in share point 2013
Building a scalable search architecture in share point 2013
 
10 Things I Like in SharePoint 2013 Search
10 Things I Like in SharePoint 2013 Search10 Things I Like in SharePoint 2013 Search
10 Things I Like in SharePoint 2013 Search
 
SPCAdriatics - 10 Things I Like In SharePoint 2013 Search
SPCAdriatics - 10 Things I Like In SharePoint 2013 SearchSPCAdriatics - 10 Things I Like In SharePoint 2013 Search
SPCAdriatics - 10 Things I Like In SharePoint 2013 Search
 
SharePoint 2013 Search Operations
SharePoint 2013 Search OperationsSharePoint 2013 Search Operations
SharePoint 2013 Search Operations
 
Installing and tweaking FASTSearch
Installing and tweaking FASTSearchInstalling and tweaking FASTSearch
Installing and tweaking FASTSearch
 
Fishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar PresentationFishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar Presentation
 
Rev Your Engines - SharePoint Performance Best Practices
Rev Your Engines - SharePoint Performance Best PracticesRev Your Engines - SharePoint Performance Best Practices
Rev Your Engines - SharePoint Performance Best Practices
 
Rev Your Engines: SharePoint Performance Best Practices
Rev Your Engines: SharePoint Performance Best PracticesRev Your Engines: SharePoint Performance Best Practices
Rev Your Engines: SharePoint Performance Best Practices
 
Open source enterprise search and retrieval platform
Open source enterprise search and retrieval platformOpen source enterprise search and retrieval platform
Open source enterprise search and retrieval platform
 
I2 - SharePoint Hybrid Search Start to Finish - Thomas Vochten
I2 - SharePoint Hybrid Search Start to Finish - Thomas VochtenI2 - SharePoint Hybrid Search Start to Finish - Thomas Vochten
I2 - SharePoint Hybrid Search Start to Finish - Thomas Vochten
 
#SPSPhilly search topology & optimization
#SPSPhilly search topology & optimization#SPSPhilly search topology & optimization
#SPSPhilly search topology & optimization
 
Ektron 8.5 RC - Search
Ektron 8.5 RC - SearchEktron 8.5 RC - Search
Ektron 8.5 RC - Search
 
SharePoint Search Topology and Optimization
SharePoint Search Topology and OptimizationSharePoint Search Topology and Optimization
SharePoint Search Topology and Optimization
 
Overview of Search in SharePoint Server 2013 - Australian SharePoint Conferen...
Overview of Search in SharePoint Server 2013 - Australian SharePoint Conferen...Overview of Search in SharePoint Server 2013 - Australian SharePoint Conferen...
Overview of Search in SharePoint Server 2013 - Australian SharePoint Conferen...
 
Search
SearchSearch
Search
 
Best Practices for SharePoint 2010 Search
Best Practices for SharePoint 2010 SearchBest Practices for SharePoint 2010 Search
Best Practices for SharePoint 2010 Search
 
SharePoint 2013 Search Topology and Optimization
SharePoint 2013 Search Topology and OptimizationSharePoint 2013 Search Topology and Optimization
SharePoint 2013 Search Topology and Optimization
 
Building Search Driven Applications in SharePoint 2010 - SharePoint Fest 2012
Building Search Driven Applications in SharePoint 2010 - SharePoint Fest 2012Building Search Driven Applications in SharePoint 2010 - SharePoint Fest 2012
Building Search Driven Applications in SharePoint 2010 - SharePoint Fest 2012
 

More from knowledgelakemarketing

Customer Success Story with Jeff Borghoff
Customer Success Story with Jeff BorghoffCustomer Success Story with Jeff Borghoff
Customer Success Story with Jeff Borghoffknowledgelakemarketing
 
Integrate SharePoint Throughout Your Business with Derek Watson
Integrate SharePoint Throughout Your Business with Derek WatsonIntegrate SharePoint Throughout Your Business with Derek Watson
Integrate SharePoint Throughout Your Business with Derek Watsonknowledgelakemarketing
 
ECM Trends & Best Practices with Joe Shepley
ECM Trends & Best Practices with Joe ShepleyECM Trends & Best Practices with Joe Shepley
ECM Trends & Best Practices with Joe Shepleyknowledgelakemarketing
 
Customer Success Story with Andrew Hutson
Customer Success Story with Andrew HutsonCustomer Success Story with Andrew Hutson
Customer Success Story with Andrew Hutsonknowledgelakemarketing
 

More from knowledgelakemarketing (7)

Customer Success Story with Jeff Borghoff
Customer Success Story with Jeff BorghoffCustomer Success Story with Jeff Borghoff
Customer Success Story with Jeff Borghoff
 
Integrate SharePoint Throughout Your Business with Derek Watson
Integrate SharePoint Throughout Your Business with Derek WatsonIntegrate SharePoint Throughout Your Business with Derek Watson
Integrate SharePoint Throughout Your Business with Derek Watson
 
ECM Trends & Best Practices with Joe Shepley
ECM Trends & Best Practices with Joe ShepleyECM Trends & Best Practices with Joe Shepley
ECM Trends & Best Practices with Joe Shepley
 
Next Gen Products with Floyd Bell
Next Gen Products with Floyd BellNext Gen Products with Floyd Bell
Next Gen Products with Floyd Bell
 
Customer Success Story with Andrew Hutson
Customer Success Story with Andrew HutsonCustomer Success Story with Andrew Hutson
Customer Success Story with Andrew Hutson
 
KL News with Brett Grego
KL News with Brett GregoKL News with Brett Grego
KL News with Brett Grego
 
Closing Remarks
Closing Remarks Closing Remarks
Closing Remarks
 

Recently uploaded

Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
g2nightmarescribd
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 

Recently uploaded (20)

Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 

SharePoint 2013 Search Architecture with Russ Houberg

  • 1.
  • 2.
  • 3.
  • 4.
  • 5. SharePoint 2010 SharePoint 2013 Managed Property (Multiple) Search Schemas Best Bets Promoted Results (Query Rule) Scope and Federated Location Result Source Content By Query Content By Search Incremental Crawl Continuous Crawl MCM MCSM
  • 6.
  • 7.
  • 8. Continous Crawl Benefits Continus Crawl Facts • No more waiting for index • Runs every 15 minutes by merge default • Does not wait for other • Default interval can be crawls to complete changed with PowerShell • Can have multiple • Should be used instead of continuous crawls running incremental crawls for simultaneously SharePoint content sources • Continuous crawls ignores errors
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14. HTTP Other File Share End User Query User Profile Or Content Process Initiated SharePoint Sources Query Content Query Crawl Index Processing Processing Component Component Component Component Analytics Processing Link Index Crawl Partition(s) Component Database Database(s) Event Store Analytics Database
  • 15. What it Does Important Facts • Crawls content sources to • We can have multiple crawl populate index components • Delivers crawl items (binary) and • MS Recommends: 2 Crawl metadata to content processor Components per Search Service • Invokes connectors or protocol Application handlers to interact with content • MS Recommends: 8(4vm) CPU / sources to retrieve data 8GB RAM per Crawl Component • Uses one or more crawl databases to store info about crawl items and crawl history
  • 16. What it Does Important Facts • Processes crawl items and feeds to index • We must only have one (1) crawl component processing component per server – more • Transforms crawl items into artifacts that will hurt, not help crawl performance can be included in search index • Max of 2 per search service application (Performs document parsing and • Feeding Sessions are scaled based on property mapping) CPU cores using a default coefficient of 3 • Writes information about links and urls 8 (cores) * 3 = 24 feeding sessions in link database (which are analyzed by 4 (cores) * 3 = 12 feeding sessions analytics to calculate relevance and • MS Recommends: 8(4vm) CPU / 8GB currency - Results written back to search RAM per Content Processing Component index by content processing component • Feeding sessions require RAM – More • Generates phonetic name variations to RAM is necessary when more cores are improve people search present – monitoring required
  • 17. What it Does Important Facts • Runs analytics jobs that analyze crawl items • Maximum of 6 per search service and user interaction with search results to application perform both search analytics and usage • Add more Analytics Processing Components analytics to improve analytics performance • Analyzes Link & Anchor text analysis, Clear • MS Recommends: 8(4vm) CPU / 8GB RAM / distance, Search Clicks, Deep Links, Social 300GB disk space per Analytics Processing Tags, Social Distance, Search Reports, Component. Recommendations, Usage Counts, Activity • Interacts with Analytics Reporting to store Ranking statistical information • Improves search relevance and create • Interacts with Link database to store search results information about searches and crawled • Output included in search index by content documents processor
  • 18. What it Does Important Facts • Receives processed items from content • Maximum of 60 index partitions (20 processing component and writes the index partitions X 3 index replicas) per items to the index file search service application • Receives queries from the query • Must provision one Index Component processing component and returns for each index replica. result sets • MS Recommends: 8(4vm) CPU / 16GB • Redistributes content among index RAM / 500GB disk space per Index partitions when index architecture is Component. changed by Search Administration Component
  • 19. • Index partition is logical portion of entire search index (same as before) • Index partition is served by one or more index components • Index components can be primary "replica" or secondary Index "replica" • Primary Replica is contacted by content processing component to write new data in the index Architecture • Secondary Replica is read only copy that get updated with the data. • Adding replicas improves query performance under load • Add partitions to handle increased content corpus • Can't remove partition after it has been added.
  • 20. What it Does Important Facts • Analyzes and processes queries and • Maximum of 1 per server results • MS Recommends: 8(4vm) CPU / 8GB • After receiving a query, it analyzes and RAM per Query Processing processes the query to optimize Component. precision, recall and relevance • Submits processed queries to the index component • Processes the result set returned by the index component before returning to the querying entity.
  • 21.
  • 22.
  • 23. Get-SPEnterpriseSearchService Get-SPEnterpriseSearchServiceApplication Get-SPEnterpriseSearchStatus Get-SPEnterpriseSearchQueryAndSiteSettingsService Get-SPEnterpriseSearchLanguageResourcePhrase Get-SPEnterpriseSearchServiceApplicationProxy New-SPEnterpriseSearchAdminComponent Get- Get-SPEnterpriseSearchSiteHitRule Get-SPEnterpriseSearchServiceInstance SPEnterpriseSearchQueryAndSiteSettingsServiceInstan New-SPEnterpriseSearchLanguageResourcePhrase New-SPEnterpriseSearchServiceApplication ce Get-SPEnterpriseSearchCrawlContentSource New-SPEnterpriseSearchSiteHitRule New-SPEnterpriseSearchServiceApplicationProxy Get- Get-SPEnterpriseSearchCrawlCustomConnector Remove-SPEnterpriseSearchLanguageResourcePhrase Remove-SPEnterpriseSearchServiceApplication SPEnterpriseSearchQueryAndSiteSettingsServiceProxy Get-SPEnterpriseSearchCrawlDatabase Remove-SPEnterpriseSearchSiteHitRule Remove-SPEnterpriseSearchServiceApplicationProxy Get-SPEnterpriseSearchQueryAuthority Get-SPEnterpriseSearchCrawlExtension Get-SPEnterpriseSearchVssDataPath Restore-SPEnterpriseSearchServiceApplication Get-SPEnterpriseSearchQueryDemoted Get-SPEnterpriseSearchCrawlMapping Get- Resume-SPEnterpriseSearchServiceApplication Get-SPEnterpriseSearchQueryKeyword SPEnterpriseSearchContentEnrichmentConfiguration Get-SPEnterpriseSearchCrawlRule Set-SPEnterpriseSearchService Get-SPEnterpriseSearchQueryScope Set-SPEnterpriseSearchPrimaryHostController New-SPEnterpriseSearchCrawlComponent Set-SPEnterpriseSearchServiceApplication Get-SPEnterpriseSearchQueryScopeRule Set-SPEnterpriseSearchLinguisticComponentsStatus New-SPEnterpriseSearchCrawlContentSource Set-SPEnterpriseSearchServiceApplicationProxy Get-SPEnterpriseSearchQuerySuggestionCandidates Set- New-SPEnterpriseSearchCrawlCustomConnector Start-SPEnterpriseSearchServiceInstance Get-SPEnterpriseSearchRankingModel SPEnterpriseSearchContentEnrichmentConfiguration New-SPEnterpriseSearchCrawlDatabase Stop-SPEnterpriseSearchServiceInstance Get-SPEnterpriseSearchSecurityTrimmer Remove- New-SPEnterpriseSearchCrawlExtension Suspend-SPEnterpriseSearchServiceApplication New-SPEnterpriseSearchQueryAuthority SPEnterpriseSearchContentEnrichmentConfiguration New-SPEnterpriseSearchCrawlMapping Upgrade-SPEnterpriseSearchServiceApplication New-SPEnterpriseSearchQueryDemoted New- New-SPEnterpriseSearchCrawlRule SPEnterpriseSearchContentEnrichmentConfiguration Backup-SPEnterpriseSearchServiceApplicationIndex New-SPEnterpriseSearchQueryKeyword Remove-SPEnterpriseSearchCrawlContentSource Get-SPEnterpriseSearchLinguisticComponentsStatus Upgrade- New-SPEnterpriseSearchQueryScope Remove- Get-SPEnterpriseSearchHostController SPEnterpriseSearchServiceApplicationSiteSettings New-SPEnterpriseSearchQueryScopeRule SPEnterpriseSearchCrawlCustomConnector Restore-SPEnterpriseSearchServiceApplicationIndex New-SPEnterpriseSearchRankingModel Set-SPEnterpriseSearchLinksDatabase Remove-SPEnterpriseSearchCrawlDatabase Remove- New-SPEnterpriseSearchSecurityTrimmer Repartition-SPEnterpriseSearchLinksDatabases Remove-SPEnterpriseSearchCrawlExtension SPEnterpriseSearchServiceApplicationSiteSettings Remove-SPEnterpriseSearchQueryAuthority Move-SPEnterpriseSearchLinksDatabases Remove-SPEnterpriseSearchCrawlMapping Get-SPEnterpriseSearchOwner Remove-SPEnterpriseSearchQueryDemoted Remove-SPEnterpriseSearchTenantSchema Remove-SPEnterpriseSearchCrawlRule Suspend-SPEnterpriseSearchServiceApplication Remove-SPEnterpriseSearchQueryKeyword Remove-SPEnterpriseSearchTenantConfiguration Set-SPEnterpriseSearchCrawlContentSource Set-SPEnterpriseSearchServiceInstance Remove-SPEnterpriseSearchQueryScope Remove-SPEnterpriseSearchLinksDatabase Set-SPEnterpriseSearchCrawlDatabase Remove-SPEnterpriseSearchQueryScopeRule Remove-SPEnterpriseSearchFileFormat Set-SPEnterpriseSearchCrawlRule Get-SPEnterpriseSearchMetadataCategory Remove-SPEnterpriseSearchRankingModel New-SPEnterpriseSearchLinksDatabase Set-SPEnterpriseSearchCrawlLogReadPermission Get-SPEnterpriseSearchMetadataCrawledProperty Remove-SPEnterpriseSearchSecurityTrimmer New-SPEnterpriseSearchFileFormat Remove- Get-SPEnterpriseSearchMetadataManagedProperty Set-SPEnterpriseSearchQueryAuthority New- SPEnterpriseSearchCrawlLogReadPermission Get-SPEnterpriseSearchMetadataMapping Set-SPEnterpriseSearchQueryKeyword SPEnterpriseSearchAnalyticsProcessingComponent Remove- SPEnterpriseSearchCrawlLogReadPermission New-SPEnterpriseSearchMetadataCategory Set-SPEnterpriseSearchQueryScope Import-SPEnterpriseSearchCustomExtractionDictionary New-SPEnterpriseSearchMetadataCrawledProperty Set-SPEnterpriseSearchQueryScopeRule Get-SPEnterpriseSearchLinksDatabase Import-SPEnterpriseSearchTopology New-SPEnterpriseSearchMetadataManagedProperty Set-SPEnterpriseSearchRankingModel Get-SPEnterpriseSearchFileFormat Export-SPEnterpriseSearchTopology New-SPEnterpriseSearchMetadataMapping Start- Set-SPEnterpriseSearchFileFormatState Set-SPEnterpriseSearchTopology Remove-SPEnterpriseSearchMetadataCategory SPEnterpriseSearchQueryAndSiteSettingsServiceInstan Get-SPEnterpriseSearchComponent Remove-SPEnterpriseSearchTopology Remove- ce Get- SPEnterpriseSearchMetadataManagedProperty Stop- SPEnterpriseSearchServiceApplicationBackupStore Remove-SPEnterpriseSearchComponent Remove-SPEnterpriseSearchMetadataMapping SPEnterpriseSearchQueryAndSiteSettingsServiceInstan New-SPEnterpriseSearchTopology ce Set-SPEnterpriseSearchMetadataCategory New- Import-SPEnterpriseSearchPopularQueries SPEnterpriseSearchQueryProcessingComponent Set-SPEnterpriseSearchMetadataCrawledProperty Set-SPEnterpriseSearchMetadataManagedProperty Set-SPEnterpriseSearchResultItemType New-SPEnterpriseSearchIndexComponent Set-SPEnterpriseSearchMetadataMapping Set-SPEnterpriseSearchQuerySpellingCorrection
  • 24. Host 1 Host 2 Host 5 Host 6 Web server Web server Web server Web server All SharePoint databases All SharePoint databases Application Office Application Office Search admin db Link db Server Web Apps Server Web Apps Server Server Crawl db Analytics db Redundant copies of all databases using SQL clustering, mirroring, or SQL Server SharePoint Config db 2012 AlwaysOn All other SharePoint databases Host 3 Host 4 Application Server Application Server Query Processing Query Processing Replica Index part ition 0 Replica Application Server Application Server Crawl Crawl Admin Admin Analytics Analytics Content processing Content processing
  • 25. Host A Host B Host E Host F Application Server Application Server Query Processing Replica Index part ition 0 Replica Application Server Application Server Analytics Analytics Application Server Application Server Content processing Content processing Application Server Application Server Replica Index part ition 1 Replica Admin Admin Crawl Content processing Crawl Content processing Host C Host D Host G Host H Application Server Application Server Query Processing SharePoint databases SharePoint databases Replica Index part ition 2 Replica Crawl db Search admin db Crawl db Redundant copies of all databases using Application Server Application Server Link db Analytics db SQL clustering, mirroring, or SQL Server 2012 AlwaysOn Replica Index part ition 3 Replica
  • 26. Host A Host B Host C Host D Host K Host L Host M Host N Application Server Application Server Application Server Application Server Query Processing Query Processing Replica Index part ition 2 Replica Replica Index part ition 0 Replica Application Server Application Server Application Server Application Server Analytics Analytics Analytics Analytics Application Server Application Server Application Server Application Server Content processing Content processing Content processing Content processing Application Server Application Server Application Server Application Server Index part ition 1 Replica Index part ition 3 Replica Replica Replica Analytics Analytics Crawl Admin Crawl Admin Content processing Content processing Host E Host F Host G Host H Host O Host P Host Q Host R Application Server Application Server Application Server Application Server SharePoint databases SharePoint databases SharePoint databases SharePoint databases Query Processing Query Processing Index part ition 4 Replica Replica Index part ition 6 Replica Replica Search admin db Link db Redundant copies of all databases using Crawl db Redundant copies of all databases using Analytics db SQL clustering, mirroring, or SQL Server Application Server Application Server Application Server Application Server SQL clustering, mirroring, or SQL Server 2012 AlwaysOn Crawl db 2012 AlwaysOn Analytics db Crawl db Crawl db Replica Index part ition 5 Replica Replica Index part ition 7 Replica Crawl db Host I Host J Application Server Application Server Replica Index part ition 8 Replica Application Server Application Server Replica Index part ition 9 Replica
  • 27.
  • 28. Schema can be managed by site admins, reducing the load on search administrator Schema can be configured to allow more granularity (query, retrieve, refine, sort, etc) - Affects content index size Remote result sources can be crawled locally and then queried by remote farms. Huge impact on geo-distributed search… KL may be able to help! Individual items can be re-crawled easily Automatic URL balancing in crawl databases minimizes host name restrictions for large archive repositories Scalability limit changes will have a big impact on farm design for large archive content repositories in the near future.