Improve Performance in Fast Search for SharePoint - Comperio
Comperio Webinar - Improve Performance in FAST Search for SharePoint 2010, presented by Job Maelane (Comperio UK)

Presentation Transcript

  • Improve Performance in FAST Search for SharePoint 2010 (FS4SP)
    Enda Flynn, Marketing Manager
    Job Maelane, Senior Consultant
    Oslo, Stockholm, London, Boston
  • Comperio | Search Matters.
      • Financials
          – Financially stable and secure
          – Profitable growth when expanding
          – AAA Rating (D&B)
      • Employees
          – 50+ specialists and growing
          – Ability to attract highly qualified employees
      • Customers
          – 90+
          – 3 continents
      • Search Projects
          – 200+ FAST projects completed
  • Agenda
      • Comperio Search Introduction
      • FS4SP Architecture Overview
      • Factors influencing performance
      • Analyzing Feeding & Indexing
      • Questions
  • Common Issues
      • “Crawler is slow. Crawler takes too long”
      • “Queries are slow”
      • “Incremental indexing takes a long time”
      • “Indexing fails”
  • FAST Search Server 2010
    Summary of architectural components
    [Diagram label: Content]
  • [Architecture diagram labels: Search Center; Content; User Profiles…; Property Extraction…; Content retrieval; Content processing; Query processing & matching; Administration]
  • Hardware – Best Practices
    Server roles shown on the slide: Storage Server; Admin / Processing Server
      • First configuration
          – CPU: 2 x 2 GHz+ (quad/six core)
          – Memory: 24-48 GB
          – Disk: 2 x 300 GB, SAS, 10K RPM (RAID 1)
      • Second configuration
          – CPU: 2 x 2 GHz+ (quad/six core)
          – Memory: 24-48 GB
          – Disk alternatives:
              1.0 TB: 8 x 300 GB, SAS, 10K RPM (RAID 10)
              1.8 TB: 8 x 300 GB, SAS, 10K RPM (RAID 5)
              3.6 TB: 16 x 300 GB, SAS, 10K RPM (RAID 5+0)
              New: 7.2 TB: 16 x 600 GB, SAS, 10K RPM (RAID 5+0)
              SAN: Configured for “database performance”
  • How is search performance measured?
      • Index latency
          – How long, on average, a document takes to index
      • Documents per second
          – How many documents are processed and indexed per second
      • Query latency
          – How quickly results are generated from a query
      • Queries per second (QPS)
          – How many queries are processed per second
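The four measures above reduce to two calculations: an average duration per operation (latency) and a count of operations per unit of time (throughput). The sketch below illustrates that arithmetic only. The `Sample` record, the function names, and the timings are hypothetical and do not come from FS4SP or from the webinar.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Sample:
    """One timed operation (a query or an indexed document), in seconds.

    Hypothetical record type used only for this illustration.
    """
    start: float
    end: float


def average_latency(samples):
    """Mean duration per operation, e.g. index latency or query latency."""
    return mean(s.end - s.start for s in samples)


def throughput(samples):
    """Operations per second over the observed window (docs/s or QPS)."""
    window = max(s.end for s in samples) - min(s.start for s in samples)
    return len(samples) / window


# Made-up query timings, not measurements from a real farm.
queries = [Sample(0.0, 0.2), Sample(0.5, 0.9), Sample(1.0, 1.3), Sample(1.5, 2.0)]
print(f"query latency: {average_latency(queries):.2f} s")
print(f"QPS: {throughput(queries):.1f}")
```

The same two functions apply to feeding: pass one `Sample` per document, from submission to searchable, to get index latency and documents per second.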
  • FS4SP - Search Administration Reports
      • Search Administration Reports
          – Crawl Rate Per Content Source
          – Crawl Rate Per Type
          – Crawl Processing Per Activity
          – Crawl Processing Per Component
          – Crawl Queue
          – Query Latency
          – and more
      • Web Analytics Reports
          – Total Number of Search Queries
          – Top queries
          – Failed queries
          – Best Bet usage
          – Keywords usage
          – and more
      • Analyze Ribbon
          – More date range options
          – Filtering search scope
          – Search query text
          – Export to Spreadsheet etc.
      • Web Analytics Web Part
          – Display popular items on a site (such as popular content, popular search queries, or search results)
  • What influences performance? Lots of things!
      • No. of documents
      • Content types
      • Deep or shallow refiners
      • Entity extraction
      • Complex queries (many terms)
      • Substring search
      • Lemmatisation
      • Spell check
      • Maximum and average document size
      • And many more…
  • Resource Consumption
    Columns: CPU; RAM; Disk trans.; Disk space; Netw. B/W
    Components: Content Distributor; Document Processor; Index Dispatcher; Indexer; Search Engine; QR Server; Admin Services; Web Link Analysis
    [Table: per-component ratings not recoverable from the transcript]
  • FAST Search for SharePoint Scale out
    Back-end with extreme and flexible scale out options
      • Scale-out multiple “dimensions”
          – Query Volume
          – Content Volume
          – Indexing freshness
          – Redundancy options
      • Performance targets*
          – 15M Docs/node
          – 25 QPS/node
    *Depends on content and hardware specifics
    [Diagram labels: Content Volume; Query Volume; Search; Indexing; Search and Indexing; Crawling and Content Processing; Query and Result Processing]
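As a rough illustration of how the two performance targets might feed a first sizing estimate, the sketch below divides the expected content volume and peak query load by the per-node targets from the slide and rounds up. The ceiling-division rule and the function name `estimate_nodes` are assumptions made for this example. The slide itself notes that the targets depend on content and hardware, so a result like this is a starting point rather than a sizing rule.

```python
import math

DOCS_PER_NODE = 15_000_000  # slide target: 15M docs/node
QPS_PER_NODE = 25           # slide target: 25 QPS/node


def estimate_nodes(total_docs, peak_qps):
    """Return (content_nodes, query_nodes) for a first rough estimate.

    Assumption: each dimension is sized independently by ceiling
    division against the per-node target, with a minimum of one node.
    """
    content_nodes = max(1, math.ceil(total_docs / DOCS_PER_NODE))
    query_nodes = max(1, math.ceil(peak_qps / QPS_PER_NODE))
    return content_nodes, query_nodes


# Hypothetical workload: 40 million items, 60 queries per second at peak.
print(estimate_nodes(40_000_000, 60))
```

The two numbers are kept separate on purpose, since content volume and query volume are scaled along different dimensions.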
  • FS4SP – Medium Deployment
      • FAST Search for SharePoint 2010 Farm
          – FAST-ADM-1: Admin, Content Distributor 1, Web Analyzer, 12 Docprocs+
          – FAST-FSTIDX-11: Index (Search), 12 Docprocs+
          – FAST-FSTIDX-12: Index (Search), 12 Docprocs+
          – FAST-FSTIDX-21: (Index) Search, QR Server
          – FAST-FSTIDX-22: (Index) Search, QR Server
          – FAST-ADM-2: Content Distributor 2, Web Analyzer, 12 Docprocs+ (Enterprise Crawler)
          – FAST-FSTIDX-13: Index (Search), 12 Docprocs+
          – FAST-FSTIDX-23: (Index) Search, QR Server
      • SP2010 Farm
          – WFE, Query SSA (x2)
          – SP Crawl, People Crawl (x2)
      • SQL 2008 Cluster
          – Crawl DB
          – Search Admin DB
  • FS4SP – Large Deployment
      • SP2010 Farm
          – WFE, Query SSA (x2)
          – SP Crawl, People Crawl (x2)
          – SP Crawl
      • SQL 2008 Cluster
          – Crawl DB
          – Search Admin DB
      • FAST Search for SharePoint 2010 Farm
          – FAST-ADM-1: Admin, ConfigServer, Spelltuner, SamAdmin, Content Distributor 1, Web Analyzer, 12 Docprocs+
          – FAST-FSTIDX-11: Index (Search), 12 Docprocs+
          – FAST-FSTIDX-12: Index (Search), 12 Docprocs+
          – FAST-FSTIDX-21: (Index) Search, QR Server
          – FAST-FSTIDX-22: (Index) Search, QR Server
          – FAST-ADM-2: Content Distributor 2, Web Analyzer, 12 Docprocs+
          – FAST-FSTIDX-13: Index (Search), 12 Docprocs+
          – FAST-FSTIDX-23: (Index) Search, QR Server
          – FAST-FSTIDX-14: Index (Search), 12 Docprocs+
          – FAST-FSTIDX-15: Index (Search), 12 Docprocs+
          – FAST-FSTIDX-24: (Index) Search, QR Server
          – FAST-FSTIDX-25: (Index) Search, QR Server
          – FAST-FSTIDX-16: Index (Search), 12 Docprocs+
          – FAST-FSTIDX-26: (Index) Search, QR Server
          – FAST-ADM-3: Web Analyzer, 12 Docprocs+
  • FS4SP – Server Calculation Matrix

    Max item count  Adm + WA  Indexers  SharePoint  Crawl DB  Redundancy  Total
    (in millions)             (1 row)   Crawlers    Server
    1               0         1         0           0         1           2
    10              1         1         1           1         2           6
    40              2         3         2           1         3           11
    100             3         6         3           1         6           19
    150             5         10        5           1         10          31
    200             6         14        6           2         14          42
    500             10        34        16          2         34          96
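One way to use the server matrix is as a lookup table: find the first tier whose maximum item count covers the planned index size. The sketch below copies the rows from the slide. Rounding up to the next tier, and the names `SERVER_MATRIX` and `servers_for`, are assumptions made for this example rather than guidance from the presentation.

```python
# Rows copied from the slide, in this column order:
# (max items in millions, Adm + WA, indexers (1 row),
#  SharePoint crawlers, crawl DB servers, redundancy, total)
SERVER_MATRIX = [
    (1, 0, 1, 0, 0, 1, 2),
    (10, 1, 1, 1, 1, 2, 6),
    (40, 2, 3, 2, 1, 3, 11),
    (100, 3, 6, 3, 1, 6, 19),
    (150, 5, 10, 5, 1, 10, 31),
    (200, 6, 14, 6, 2, 14, 42),
    (500, 10, 34, 16, 2, 34, 96),
]


def servers_for(items_millions):
    """Return the first matrix row whose max item count covers the input.

    Assumption: a size between two tiers is rounded up to the larger tier.
    """
    for row in SERVER_MATRIX:
        if items_millions <= row[0]:
            return row
    raise ValueError("item count exceeds the largest tier in the matrix")


# Consistency check: Total equals the sum of the five role columns.
assert all(sum(row[1:6]) == row[6] for row in SERVER_MATRIX)

# Hypothetical index of 75 million items falls into the 100M tier.
print(servers_for(75))
```

The consistency check confirms that in every row the Total column is the sum of the Adm + WA, Indexers, SharePoint Crawlers, Crawl DB Server, and Redundancy columns.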
  • FS4SP – Disk Calculation Matrix

    Disclaimer: This table is based on early testing and results from an internal dogfood project. The numbers might not be representative for the customer environment and data. Please use caution when using these numbers for sizing.

    Max item count  Adm        Web Analyzer  Crawl DB Server  Indexer
    (in millions)
    1               1 x 72 GB  1 x 5 GB      1 x 10 GB        1 x 120 GB
    10              1 x 72 GB  1 x 50 GB     1 x 40 GB        1 x 1.2 TB
    40              1 x 72 GB  1 x 60 GB     1 x 150 GB       3 x 2.0 TB
    100             1 x 72 GB  2 x 75 GB     1 x 350 GB       6 x 2.0 TB
    150             1 x 72 GB  4 x 75 GB     1 x 500 GB       10 x 2.0 TB
    200             1 x 72 GB  5 x 75 GB     2 x 350 GB       14 x 2.0 TB
    500             1 x 72 GB  9 x 75 GB     2 x 500 GB       34 x 2.0 TB
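To turn a row of the disk matrix into a single figure, each entry can be read as a count multiplied by a size. The sketch below does that for the 40M-item row. It assumes that "N x S" means N servers with S of disk each and that 1 TB equals 1000 GB; neither is stated on the slide, and the helper name `total_gb` is made up for this example.

```python
import re


def total_gb(spec):
    """Convert a spec such as '3 x 2.0 TB' into total gigabytes.

    Assumptions: 'N x S' means N units of size S, and 1 TB = 1000 GB.
    """
    match = re.fullmatch(r"\s*(\d+)\s*x\s*([\d.]+)\s*(GB|TB)\s*", spec)
    if match is None:
        raise ValueError(f"unrecognised disk spec: {spec!r}")
    count = int(match.group(1))
    size = float(match.group(2))
    factor = 1000 if match.group(3) == "TB" else 1
    return count * size * factor


# The 40M-item row, copied from the slide.
row_40m = {
    "Adm": "1 x 72 GB",
    "Web Analyzer": "1 x 60 GB",
    "Crawl DB Server": "1 x 150 GB",
    "Indexer": "3 x 2.0 TB",
}
total = sum(total_gb(spec) for spec in row_40m.values())
print(f"total disk for the 40M tier: {total:.0f} GB")
```

Given the slide's own disclaimer, a figure computed this way is best treated as an order-of-magnitude check.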
  • Feeding and Indexing Performance
      • The feeding and indexing processing chain in FS4SP has the following components:
          – Crawler
          – Content Distributors
          – Item Processing
          – Indexing Dispatcher
          – Primary Indexer
          – Backup Indexer
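Because the components above form a serial chain, end-to-end feeding throughput cannot exceed the rate of the slowest stage. The sketch below shows that idea with made-up numbers. The stage names follow the slide, while the rates and the `bottleneck` helper are hypothetical and are not FS4SP measurements or tooling.

```python
def bottleneck(rates):
    """Return (stage, rate) for the slowest stage of a serial chain.

    `rates` maps a stage name to its sustained documents per second.
    """
    stage = min(rates, key=rates.get)
    return stage, rates[stage]


# Hypothetical documents-per-second figures for each stage.
example_rates = {
    "Crawler": 120.0,
    "Content Distributors": 300.0,
    "Item Processing": 40.0,
    "Indexing Dispatcher": 250.0,
    "Primary Indexer": 90.0,
    "Backup Indexer": 90.0,
}

stage, rate = bottleneck(example_rates)
print(f"slowest stage: {stage} at {rate} docs/s")
```

In this made-up case item processing limits the chain, so adding crawler capacity would not raise overall throughput.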
  • Feeding and Indexing Performance
    [Diagram: Crawlers; Content Distributors; Document Processors; Indexing Dispatchers; Primary Indexers; Secondary Indexers; numbered callouts 1-9]
  • Feeding and Indexing Performance
    [Diagram: Crawlers; Content Distributors; Document Processors; Indexing Dispatchers; Primary Indexers; Secondary Indexers; callout 1]
  • Questions???