Your SlideShare is downloading. ×
OSLO STOCKHOLM LONDON BOSTONImprove Performance inFAST Search for SharePoint2010 (FS4SP)Enda Flynn – Marketing ManagerJob ...
FinancialsFinancially stabile and secureProfitable growth when expandingAAA Rating (D&B)Employees50 + specialists and grow...
Agenda• Comperio Search Introduction• FS4SP Architecture Overview• Factors influencing performance• Analyzing Feeding & In...
Common Issues• “Crawler is slow. Crawler takes too long”• “Queries are slow”• “Incremental indexing takes a long time”• “I...
FAST Search Server 2010Summary of architectural componentsContent
Search CenterContentUser Profiles…PropertyExtraction…ContentretrievalContentprocessingQueryprocessing& matchingAdministrat...
Hardware – Best PracticesCPU: 2 x 2GHz+ (Quad/six core)Memory: 24-48 GBDisk:2 x 300 GB, SAS, 10K RPM (RAID 1)CPU: 2 x 2GHz...
• Index latency– How long, on average, a document takes to index• Documents per second– How many documents processed and i...
• Search Administration Reports• Crawl Rate Per Content Source• Crawl Rate Per Type• Crawl Processing Per Activity• Crawl ...
• No of documents• Content types• Deep or shallow refiners• Entity extraction• Complex queries (many terms)• Substring sea...
Resource ConsumptionsCPU RAM DISK TRANS DISK SPACE NETW B/WContent Distributor     Document Processor     Index ...
FAST Search for SharePoint Scale outContentVolumeQueryVolumeScale-out multiple“dimensions”Query VolumeContent VolumeIndexi...
FS4SP – Medium DeploymentFAST Search for SharePoint 2010 FarmFAST-ADM-1AdminContent Distributor 1Web Analyzer12 Docprocs+F...
FS4SP – Large DeploymentSP2010 FarmFAST Search for SharePoint 2010 FarmSQL 2008 ClusterWFEQuery SSAWFEQuery SSASP CrawlPeo...
FS4SP – Server Calculation MatrixMax item count(in Millions) Adm + WAIndexers(1 row)SharePointCrawlersCrawl DBServer Redun...
FS4SP – Disk Calculation MatrixDisclaimer:This table is based on early testing and results from an internal dogfood projec...
Feeding and Indexing Performance• Feed and indexing processing chain in FS4SPhas the components below:– Crawler– Content D...
Feeding and Indexing PerformanceCrawlersContentDistributorsDocumentProcessorsIndexingDispatchersPrimaryIndexersSecondaryin...
Feeding and Indexing PerformanceCrawlersContentDistributorsDocumentProcessorsIndexingDispatchersPrimaryIndexersSecondaryin...
Questions???
Upcoming SlideShare
Loading in...5
×

Improve Performance in Fast Search for SharePoint - Comperio

449

Published on

Comperio Webinar - Improve Performance in Fast Search for SharePoint 2010, presented by Job Maelane (Comperio UK)

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
449
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
0
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Transcript of "Improve Performance in Fast Search for SharePoint - Comperio"

  1. 1. OSLO STOCKHOLM LONDON BOSTONImprove Performance inFAST Search for SharePoint2010 (FS4SP)Enda Flynn – Marketing ManagerJob Maelane - Senior Consultant
  2. 2. FinancialsFinancially stabile and secureProfitable growth when expandingAAA Rating (D&B)Employees50 + specialists and growingAbility to attract highly qualified employeesCustomers90 +3 continentsSearch Projects200 + FAST projects completedComperio | Search Matters.
  3. 3. Agenda• Comperio Search Introduction• FS4SP Architecture Overview• Factors influencing performance• Analyzing Feeding & Indexing• Questions
  4. 4. Common Issues• “Crawler is slow. Crawler takes too long”• “Queries are slow”• “Incremental indexing takes a long time”• “Indexing fails”
  5. 5. FAST Search Server 2010Summary of architectural componentsContent
  6. 6. Search CenterContentUser Profiles…PropertyExtraction…ContentretrievalContentprocessingQueryprocessing& matchingAdministration
  7. 7. Hardware – Best PracticesCPU: 2 x 2GHz+ (Quad/six core)Memory: 24-48 GBDisk:2 x 300 GB, SAS, 10K RPM (RAID 1)CPU: 2 x 2GHz+ (Quad/six core)Memory: 24-48 GBDisk alternatives:1.0 TB: 8 x 300 GB, SAS, 10K RPM (RAID10)1.8 TB: 8 x 300 GB, SAS, 10K RPM (RAID 5)3.6 TB: 16 x 300 GB, SAS, 10K RPM (RAID 5+0)New: 7.2 TB: 16 x 600 GB, SAS, 10K RPM (RAID5+0)SAN: Configured for “database performance”Storage ServerAdmin / ProcessingServer
  8. 8. • Index latency– How long, on average, a document takes to index• Documents per second– How many documents processed and indexed persecond• Query latency– How quickly are the results generated from a query• Queries per second (QPS)– How many queries are processed per secondHow is search performance measured?
  9. 9. • Search Administration Reports• Crawl Rate Per Content Source• Crawl Rate Per Type• Crawl Processing Per Activity• Crawl Processing Per Component• Crawl Queue• Query Latencyand more• Web Analytics Reports• Total Number of Search Queries• Top queries• Failed queries• Best Bet usage• Keywords usageand moreFS4SP- Search Administration ReportsAnalyze Ribbon- More date range options- Filtering search scope- Search query text- Export to Spreadsheet etc.Web Analytics Web Part- Display popular items on a site(such as popular content, popularsearch queries, or search results)
  10. 10. • No of documents• Content types• Deep or shallow refiners• Entity extraction• Complex queries (many terms)• Substring search• Lemmatisation• Spell check• Maximum and average document size• And many more….What influences performance?Lots of things!
  11. 11. Resource ConsumptionsCPU RAM DISK TRANS DISK SPACE NETW B/WContent Distributor     Document Processor     Index Dispatcher     Indexer   Search Engine     QR Server     Admin Services     Web Link Analysis    
  12. 12. FAST Search for SharePoint Scale outContentVolumeQueryVolumeScale-out multiple“dimensions”Query VolumeContent VolumeIndexing freshnessRedundancy optionsSearchIndexingPerformance targets*15M Docs/node25 QPS/node*Depends on content and hardware specificsSearch and IndexingCrawling and ContentProcessingQuery and ResultProcessingBack-end with extreme and flexible scale out options
  13. 13. FS4SP – Medium DeploymentFAST Search for SharePoint 2010 FarmFAST-ADM-1AdminContent Distributor 1Web Analyzer12 Docprocs+FAST-FSTIDX-11Index (Search)12 Docprocs+FAST-FSTIDX-12Index (Search)12 Docprocs+FAST-FSTIDX-21(Index) SearchQR ServerFAST-FSTIDX-22(Index) SearchQR ServerFAST-ADM-2Content Distributor 2Web Analyzer12 Docprocs+(Enterprise Crawler)FAST-FSTIDX-13Index (Search)12 Docprocs+FAST-FSTIDX-23(Index) SearchQR ServerSP2010 FarmSQL 2008 ClusterWFEQuery SSAWFEQuery SSASP CrawlPeople CrawlSP CrawlPeople CrawlCrawl DBSearch Admin DB
  14. 14. FS4SP – Large DeploymentSP2010 FarmFAST Search for SharePoint 2010 FarmSQL 2008 ClusterWFEQuery SSAWFEQuery SSASP CrawlPeople CrawlSP CrawlPeople CrawlCrawl DBSearch Admin DBSP CrawlFAST-ADM-1AdminConfigServerSpelltunerSamAdminContent Distributor 1Web Analyzer12 Docprocs+FAST-FSTIDX-11Index (Search)12 Docprocs+FAST-FSTIDX-12Index (Search)12 Docprocs+FAST-FSTIDX-21(Index) SearchQR ServerFAST-FSTIDX-22(Index) SearchQR ServerFAST-ADM-2Content Distributor 2Web Analyzer12 Docprocs+FAST-FSTIDX-13Index (Search)12 Docprocs+FAST-FSTIDX-23(Index) SearchQR ServerFAST-FSTIDX-14Index (Search)12 Docprocs+FAST-FSTIDX-15Index (Search)12 Docprocs+FAST-FSTIDX-24(Index) SearchQR ServerFAST-FSTIDX-25(Index) SearchQR ServerFAST-FSTIDX-16Index (Search)12 Docprocs+FAST-FSTIDX-26(Index) SearchQR ServerFAST-ADM-3Web Analyzer12 Docprocs+
  15. 15. FS4SP – Server Calculation MatrixMax item count(in Millions) Adm + WAIndexers(1 row)SharePointCrawlersCrawl DBServer Redundancy Total1 0 1 0 0 1 210 1 1 1 1 2 640 2 3 2 1 3 11100 3 6 3 1 6 19150 5 10 5 1 10 31200 6 14 6 2 14 42500 10 34 16 2 34 96
  16. 16. FS4SP – Disk Calculation MatrixDisclaimer:This table is based on early testing and results from an internal dogfood project. The numbers might not berepresentative for the customer environment and data. Please use caution when using these numbers for sizing.Max item count(in Millions) Adm Web Analyzer Crawl DB Server Indexer1 1 x 72 GB 1 x 5 GB 1 x 10 GB 1 x 120 GB10 1 x 72 GB 1 x 50 GB 1 x 40 GB 1 x 1.2 TB40 1 x 72 GB 1 x 60 GB 1 x 150 GB 3 x 2.0 TB100 1 x 72 GB 2 x 75 GB 1 x 350 GB 6 x 2.0 TB150 1 x 72 GB 4 x 75 GB 1 x 500 GB 10 x 2.0 TB200 1 x 72 GB 5 x 75 GB 2 x 350 GB 14 x 2.0 TB500 1 x 72 GB 9 x 75 GB 2 x 500 GB 34 x 2.0 TB
  17. 17. Feeding and Indexing Performance• Feed and indexing processing chain in FS4SPhas the components below:– Crawler– Content Distributors– Item Processing– Indexing Dispatcher– Primary Indexer– Backup Indexer
  18. 18. Feeding and Indexing PerformanceCrawlersContentDistributorsDocumentProcessorsIndexingDispatchersPrimaryIndexersSecondaryindexers1 2 3 4 56789
  19. 19. Feeding and Indexing PerformanceCrawlersContentDistributorsDocumentProcessorsIndexingDispatchersPrimaryIndexersSecondaryindexers1
  20. 20. Questions???

×