The Lifecycle of a FAST
Search Implementation
    Rem Purushothaman
        Search Practice Lead

                Contact:
                rem.purushothaman@perficient.com
                312.589.3371
                @RemSearchPro


www.perficient.com
Prepare

You’ve decided to implement FAST. Now what?
      Build your team. What kind of skill sets do you need?

            Solution Architect

            Project Manager

            Business Analyst

            SharePoint Developers

            QA and Testers

            Operations Personnel

      Get your team trained on FAST Search

      Understand at a high level what needs to be accomplished in each of the project phases

            Envision, Plan, Build, Stabilize, Deploy

            Operations
Envision

Determine what needs to be accomplished.
      High level search requirements

            Custom UI for Search?

            Identify content that needs to be crawled and indexed. Do you need custom connectors?

            How do you measure relevancy?

            Security Considerations

            Linguistics (Synonyms, Spell Check Exceptions)

            Integration to other systems

      Initial Architecture and Environment Sizing

            Number of servers (Dev, QA, Staging/Performance, Production)

            Failover, Index Redundancy, High Availability

            Initial specifications for crawling and indexing performance

            Initial specifications for query and search performance
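One common way to answer "How do you measure relevancy?" is to have business users judge which documents are relevant for a small set of test queries, then compute precision at k for each query. The sketch below is a minimal illustration in Python, not part of the original deck; the document IDs and judgments are hypothetical.

```python
def precision_at_k(ranked_ids, relevant_ids, k):
    """Fraction of the top-k results that were judged relevant."""
    if k <= 0:
        raise ValueError("k must be positive")
    top_k = ranked_ids[:k]
    hits = sum(1 for doc_id in top_k if doc_id in relevant_ids)
    return hits / k


# Hypothetical judged query: the IDs below are illustrative only.
ranked = ["doc7", "doc2", "doc9", "doc4", "doc1"]   # order returned by search
relevant = {"doc2", "doc4", "doc5"}                 # judged relevant by users

print(precision_at_k(ranked, relevant, k=5))  # 0.4
```

Tracking this number for the same query set after each relevancy change gives a simple before/after comparison.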
Plan

This is the most important phase of any search project.
      Gather detailed search requirements for the UI

            Identify metadata that needs to be displayed in the results. Are multiple search result
             pages needed?

            Identify Refiners for each search result page

            Is there a need for a custom UI and web parts?

            If necessary, design the services layer and integration to external systems

      Content Sources

            Identify content sources (crawl rules) and access restrictions (security considerations)

            Identify content metadata (properties, fields, elements) to be crawled

            Estimate work required to crawl content using custom connectors

            Identify special security requirements. If necessary, map the custom security model to
             Active Directory for security trimming

            Determine if any of the content sources have to be cleaned up before crawling

   Crawling and Indexing

         Map crawl properties (content metadata) to managed properties (indexed items)

         Identify custom relevancy models

         Identify managed properties to be indexed

         Identify full text index priority

         Identify Linguistic Components - keywords, synonyms, best bets, type ahead

   Testing

         Create content set for testing

         Create plan for performance testing

   Operations

         Identify plans for incremental updates and deletes

         Create plans for identifying and managing the incremental growth of the index

         Identify how to monitor and manage search issues
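The mapping of crawled properties to managed properties can be captured as a simple table during planning and checked against the UI requirements before any crawl is run. The sketch below assumes hypothetical property names, result fields, and refiners; none of them come from the original deck.

```python
# Hypothetical mapping of crawled properties to managed properties.
CRAWLED_TO_MANAGED = {
    "ows_Title": "Title",
    "ows_Author": "Author",
    "ows_DocumentType": "DocType",
    "ows_Modified": "LastModified",
}

# Hypothetical UI requirements gathered in the Plan phase.
RESULT_FIELDS = ["Title", "Author", "LastModified"]
REFINERS = ["DocType", "Department"]


def unmapped_properties(required, mapping):
    """Return required managed properties that no crawled property feeds."""
    managed = set(mapping.values())
    return [name for name in required if name not in managed]


missing = unmapped_properties(RESULT_FIELDS + REFINERS, CRAWLED_TO_MANAGED)
print(missing)  # ['Department']
```

Catching a gap like the unmapped refiner here on paper is far cheaper than discovering it after a full crawl, since a mapping change typically means re-crawling and re-indexing.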
Build

Put it all together. Good planning will have big payoffs.
       Build the UI and Services Layer

             Search and Search Results Pages including Advanced Search

             If necessary, build the Services Layer and integrate to external systems

             Custom Web Parts (refiners, federation)

       Crawling and Indexing

             Set up Content Sources (Custom Connectors) and Crawl Rules

                 Optional: Custom pipeline development

                 Optional: External process to pre-process & scrub the content

                 Optional: Map custom security to AD

             Set up Crawled and Managed Properties

             Set up Type Ahead, Keywords, Best Bets, Synonyms, Refiners

             Crawl and Index the Content (rinse and repeat)
Stabilize

Getting results back from search doesn’t mean the results are valid.

        Application and Search Related Testing

              It's critical that the QA team understands how FAST works in order to test it properly

              Use external tools to test queries outside of the SharePoint Search Center

              Validate search result relevancy, document counts, and refiner counts

              Validate Security Trimming

              Validate Linguistics (spell check, synonyms, stemming, stop words, etc.)

        Performance Testing

              Ideally, the performance test environment should be approximately half the size of the
               production environment

              Performance test early in the project life cycle and then on a regular basis

              Compile a list of expensive queries that will really stress the system

              Determine the maximum QPS (queries per second) the system can handle

              Determine the maximum number of documents that can be supported in an index column
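Throughput and latency figures from a performance test run can be summarized with a few lines of code. The sketch below is a minimal illustration; the latency samples and the test window are hypothetical, and a real run would read them from the load-test tool's output.

```python
def queries_per_second(num_queries, elapsed_seconds):
    """Average throughput over the measured window."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return num_queries / elapsed_seconds


def percentile(values, pct):
    """Nearest-rank percentile for an integer pct between 1 and 100."""
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    rank = max(1, (pct * len(ordered) + 99) // 100)  # integer ceiling
    return ordered[rank - 1]


# Hypothetical per-query latencies (ms) captured during a 4-second window.
latencies_ms = [120, 95, 310, 150, 480, 130, 105, 220, 900, 140]
window_seconds = 4.0

print(queries_per_second(len(latencies_ms), window_seconds))  # 2.5
print(percentile(latencies_ms, 90))                           # 480
```

Repeating the run at increasing load until latency at the chosen percentile exceeds the agreed target gives a practical estimate of the maximum QPS.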
Deploy

For the most part, deploying a FAST Search solution is just like deploying a SharePoint solution.

   Create PowerShell scripts for all the FAST related deployment items

         Content Source, Crawl Rules

         Keywords (with Best Bets, Synonyms, etc…), Type Ahead, Spell Check Exceptions

         Make the scripts generic (try not to hard-code environment-specific values)

   Test the FAST deployment scripts through each of the environments (dev, test, staging)

   Validate crawler access to all content source systems

   In production, pre-populate (crawl and index) the content ahead of the application deployment
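The advice to keep deployment scripts generic comes down to separating environment-specific values from the script logic. The deck recommends PowerShell for the scripts themselves; the sketch below uses Python only to illustrate the parameterization pattern. All environment names, URLs, and account names are hypothetical.

```python
# Hypothetical per-environment values. In practice these would live in a
# separate config file per environment rather than inside the script.
ENVIRONMENTS = {
    "dev": {
        "content_source_url": "http://dev-portal.example.com",
        "crawl_account": "svc_crawl_dev",
    },
    "test": {
        "content_source_url": "http://test-portal.example.com",
        "crawl_account": "svc_crawl_test",
    },
    "staging": {
        "content_source_url": "http://staging-portal.example.com",
        "crawl_account": "svc_crawl_staging",
    },
}

REQUIRED_KEYS = ("content_source_url", "crawl_account")


def load_settings(env_name, environments=ENVIRONMENTS):
    """Return a copy of the settings for one environment, failing fast on gaps."""
    if env_name not in environments:
        raise ValueError(f"unknown environment: {env_name!r}")
    settings = environments[env_name]
    missing = [key for key in REQUIRED_KEYS if key not in settings]
    if missing:
        raise ValueError(f"{env_name}: missing settings {missing}")
    return dict(settings)


def describe_deployment(env_name):
    """The same logic runs everywhere; only the looked-up values differ."""
    s = load_settings(env_name)
    return (f"Create content source for {s['content_source_url']} "
            f"using account {s['crawl_account']}")


print(describe_deployment("dev"))
```

Because the script fails fast on an unknown environment or a missing value, a misconfigured run stops before it touches the farm.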
Best Practices

   Get key people trained in FAST Search

   There is no substitute for good planning and design. This pays huge dividends in terms of saving
    time from having to re-crawl and re-index your content

   Do your best to prepare the content so that it has consistent and clean metadata.

   Introduce search organically within your organization. Start with a pilot group and grow from
    there

   Use the search engine for search

         Don’t use it as your content repository

         Don’t use it as a way to populate lists in the UI

   Keep the number of queries from a UI page to a minimum. Bigger search results are sometimes
    better than multiple search queries from a single page.

   Don’t index the world. Start with a manageable content set and add content in phases

   Index only a small portion in your dev and test environments. This will make you agile and save
    you a lot of time during the build and stabilize phases.
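The point about preferring one bigger result set over several queries per page can be illustrated as follows: issue a single query, then partition the results into page sections in the UI layer. This is a minimal sketch; the result items and the `doctype` field are hypothetical stand-ins for whatever the search API returns.

```python
from collections import defaultdict


def group_results(results, field, limit_per_group=3):
    """Partition one result set into page sections keyed by a metadata field."""
    groups = defaultdict(list)
    for item in results:
        key = item.get(field, "Other")
        if len(groups[key]) < limit_per_group:
            groups[key].append(item)
    return dict(groups)


# Hypothetical results from ONE larger query (titles and types are made up).
results = [
    {"title": "Travel policy", "doctype": "Policy"},
    {"title": "Q3 forecast", "doctype": "Report"},
    {"title": "Expense policy", "doctype": "Policy"},
    {"title": "Onboarding deck", "doctype": "Presentation"},
    {"title": "Q2 forecast", "doctype": "Report"},
]

sections = group_results(results, "doctype", limit_per_group=2)
for name, items in sections.items():
    print(name, [item["title"] for item in items])
```

Results keep their original ranking order within each section, so the page shows the most relevant items per section from a single round trip.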

   Be prepared to handle growth. Have a plan in place. Indexes can get large quickly

   Do a thorough job with performance testing and sizing. This will pay off in terms of performance
    and scalability in production

   Understand your performance requirements (number of users, QPS) and build your environment
    with excess capacity. It gives you breathing room for spikes and unexpected scenarios

   A failover/redundant farm is highly recommended for the production environment. Besides the
    obvious benefits, a redundant farm helps with deployments

   Initial environment architecture and sizing will always be a guess until performance testing
    validates environment architecture and size. Be prepared to add more servers

   Have measurable metrics that operations personnel can use for monitoring the health of FAST
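Building with excess capacity can be turned into a rough calculation once performance testing has produced a measured per-server throughput. The sketch below is a simplified back-of-envelope model that assumes throughput scales linearly with query servers, which a real farm may not do; all numbers are hypothetical.

```python
import math


def servers_needed(peak_qps, qps_per_server, headroom=0.3):
    """Query servers required so that peak load uses at most (1 - headroom) of capacity."""
    if qps_per_server <= 0:
        raise ValueError("qps_per_server must be positive")
    if not 0 <= headroom < 1:
        raise ValueError("headroom must be in [0, 1)")
    usable_per_server = qps_per_server * (1 - headroom)
    return math.ceil(peak_qps / usable_per_server)


# Hypothetical inputs: 40 QPS expected at peak, 15 QPS measured per server,
# and 30% of capacity kept in reserve for spikes.
print(servers_needed(peak_qps=40, qps_per_server=15, headroom=0.3))  # 4
```

The result is only a starting estimate and should be revisited whenever performance testing yields new per-server figures.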
Q&A

Lifecycle of a FAST Search Implementation

  • 1. The Lifecycle of a FAST Search Implementation Rem Purushothaman Search Practice Lead
  • 2. Rem Purushothaman Search Practice Lead Contact: rem.purushothaman@perficient.com 312.589.3371 @RemSearchPro www.perficient.com
  • 4. Prepare
    You’ve decided to implement FAST. Now what?
      • Build your team. What kind of skill sets do you need?
          • Solution Architect
          • Project Manager
          • Business Analyst
          • SharePoint Developers
          • QA and Testers
          • Operations Personnel
      • Get your team trained on FAST Search
      • Understand at a high level what needs to be accomplished in each of the project phases
          • Envision, Plan, Build, Stabilize, Deploy
          • Operations
  • 6. Envision
    Determine what needs to be accomplished.
      • High level search requirements
          • Custom UI for Search?
          • Identify content that needs to be crawled and indexed. Do you need custom connectors?
          • How do you measure relevancy?
          • Security Considerations
          • Linguistics (Synonyms, Spell Check Exceptions)
          • Integration to other systems
      • Initial Architecture and Environment Sizing
          • Number of servers (Dev, QA, Staging/Performance, Production)
          • Failover, Index Redundancy, High Availability
          • Initial specifications for crawling and indexing performance
          • Initial specifications for query and search performance
  • 8. Plan
    This is the most important phase of any search project.
      • Gather detailed search requirements for the UI
          • Identify metadata that needs to be displayed in the results. Are multiple search result pages needed?
          • Identify Refiners for each search result page
          • Is there a need for a custom UI and web parts?
          • If necessary, design the services layer and integration to external systems
      • Content Sources
          • Identify content sources (crawl rules) and access restrictions (security considerations)
          • Identify content metadata (properties, fields, elements) to be crawled
          • Estimate work required to crawl content using custom connectors
          • Identify special security requirements. If necessary, map the custom security model to Active Directory for security trimming
          • Determine if any of the content sources have to be cleaned up before crawling
  • 9. Plan
      • Crawling and Indexing
          • Map crawl properties (content metadata) to managed properties (indexed items)
          • Identify custom relevancy models
          • Identify managed properties to be indexed
          • Identify full text index priority
          • Identify Linguistic Components: keywords, synonyms, best bets, type ahead
      • Testing
          • Create content set for testing
          • Create plan for performance testing
      • Operations
          • Identify plans for incremental updates and deletes
          • Create plans for identifying and managing the incremental growth of the index
          • Identify how to monitor and manage search issues
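
The crawled-to-managed property mapping from the Plan slide can be kept as a simple planning sheet and checked before any crawl is run. The sketch below is a minimal illustration in Python, not FAST's own tooling; every property name in it is hypothetical and would be replaced by the metadata identified for your content sources.

```python
# Hypothetical planning sheet: crawled property -> managed property.
# All property names are illustrative and do not come from the slides.
CRAWLED_TO_MANAGED = {
    "ows_Title": "Title",
    "ows_DocumentType": "DocType",
    "ows_Department": "Department",
    "ows_Dept": "Department",  # two crawled properties feeding one managed property
}

# Managed properties the UI plan needs for result display or refiners.
REQUIRED_MANAGED = {"Title", "DocType", "Department", "Author"}


def unmapped_managed_properties(mapping, required):
    """Return required managed properties that no crawled property feeds."""
    mapped = set(mapping.values())
    return sorted(required - mapped)


def crawled_sources(mapping):
    """Group crawled properties by the managed property they map to."""
    grouped = {}
    for crawled, managed in mapping.items():
        grouped.setdefault(managed, []).append(crawled)
    return grouped


if __name__ == "__main__":
    print("Missing mappings:", unmapped_managed_properties(CRAWLED_TO_MANAGED, REQUIRED_MANAGED))
    print("Sources per managed property:", crawled_sources(CRAWLED_TO_MANAGED))
```

Catching a missing mapping at this stage is cheaper than discovering it after a full crawl, since a mapping change typically means re-crawling and re-indexing the content.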
  • 10. Build
  • 11. Build
    Put it all together. Good planning will have big payoffs.
      • Build the UI and Services Layer
          • Search and Search Results Pages including Advanced Search
          • If necessary, build the Services Layer and integrate to external systems
          • Custom Web Parts (refiners, federation)
      • Crawling and Indexing
          • Set up Content Sources (Custom Connectors) and Crawl Rules
          • Optional: Custom pipeline development
          • Optional: External process to pre-process & scrub the content
          • Optional: Map custom security to AD
          • Set up Crawled and Managed Properties
          • Set up Type Ahead, Keywords, Best Bets, Synonyms, Refiners
          • Crawl and Index the Content (rinse and repeat)
  • 13. Stabilize
    Getting results back from search doesn’t mean the results are valid.
      • Application and Search Related Testing
          • It’s critical that QA understands how FAST works in order to test it properly
          • Use external tools to test queries outside of the SharePoint Search Center
          • Validate search result relevancy, document counts, and refiner counts
          • Validate Security Trimming
          • Validate Linguistics (spell check, synonyms, stemming, stop words, etc.)
      • Performance Testing
          • Ideally, the performance test environment should be approximately half the size of the production environment
          • Performance test early in the project life cycle and then on a regular basis
          • Compile a list of expensive queries that will really stress the system
          • Determine the max QPS the system can handle
          • Determine the max number of documents that can be supported in a column
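
One way to turn "determine the max QPS the system can handle" into a number is to run a stepped load test and keep the highest step that still meets a latency target. The Python sketch below assumes the load tool already recorded each step; the `LoadStep` fields, the 95% keep-up tolerance, the 500 ms budget, and the sample figures are all illustrative assumptions rather than values from the slides.

```python
from dataclasses import dataclass


@dataclass
class LoadStep:
    offered_qps: float     # query rate the load tool tried to sustain
    completed: int         # queries that returned successfully
    duration_s: float      # length of the step in seconds
    p95_latency_ms: float  # 95th percentile latency observed in the step

    @property
    def achieved_qps(self) -> float:
        return self.completed / self.duration_s


def max_sustainable_qps(steps, latency_budget_ms, tolerance=0.95):
    """Highest achieved QPS among steps that kept up with the offered
    load and stayed within the latency budget."""
    best = 0.0
    for step in steps:
        kept_up = step.achieved_qps >= tolerance * step.offered_qps
        if kept_up and step.p95_latency_ms <= latency_budget_ms:
            best = max(best, step.achieved_qps)
    return best


if __name__ == "__main__":
    # Made-up results from a four-step test, one minute per step.
    steps = [
        LoadStep(offered_qps=10, completed=600, duration_s=60, p95_latency_ms=120),
        LoadStep(offered_qps=20, completed=1200, duration_s=60, p95_latency_ms=180),
        LoadStep(offered_qps=40, completed=2340, duration_s=60, p95_latency_ms=450),
        LoadStep(offered_qps=80, completed=3000, duration_s=60, p95_latency_ms=2100),
    ]
    print(max_sustainable_qps(steps, latency_budget_ms=500))
```

Feeding the steps with the list of expensive queries, rather than only typical ones, gives a more conservative figure.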
  • 15. Deploy
    For the most part, deploying a FAST search solution is just like deploying a SharePoint solution.
      • Create PowerShell scripts for all the FAST related deployment items
          • Content Source, Crawl Rules
          • Keywords (with Best Bets, Synonyms, etc.), Type Ahead, Spell Check Exceptions
          • Make the scripts generic (try not to hard code environment specific values)
      • Test the FAST deployment scripts through each of the environments (dev, test, staging)
      • Validate crawler access to all content source systems
      • In production, pre-populate (crawl and index) the content ahead of the application deployment
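
The advice to keep deployment scripts generic comes down to separating the list of items to deploy from the values that differ per environment. The slides call for PowerShell; the sketch below shows the same idea in Python purely as an illustration. The environment names, URLs, account names, and item fields are all hypothetical.

```python
# Hypothetical per-environment values. In practice these could live in one
# settings file per environment that the deployment script loads at run time.
ENVIRONMENTS = {
    "dev": {
        "content_source_url": "http://dev-intranet.example.com",
        "crawl_account": "svc-crawl-dev",
    },
    "staging": {
        "content_source_url": "http://stage-intranet.example.com",
        "crawl_account": "svc-crawl-stage",
    },
}

# Environment-neutral deployment items with placeholders instead of
# hard-coded values.
DEPLOYMENT_ITEMS = [
    {"type": "content_source", "name": "Intranet",
     "start_address": "{content_source_url}", "account": "{crawl_account}"},
    {"type": "crawl_rule", "action": "exclude",
     "path": "{content_source_url}/archive/*"},
]


def render_items(env_name):
    """Fill the placeholders of every deployment item for one environment."""
    env = ENVIRONMENTS[env_name]
    return [
        {key: value.format(**env) for key, value in item.items()}
        for item in DEPLOYMENT_ITEMS
    ]


if __name__ == "__main__":
    for item in render_items("dev"):
        print(item)
```

Because the item list never changes between environments, the run that was tested in dev and staging is the same one that reaches production; only the settings differ.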
  • 17. Best Practices
      • Get key people trained in FAST Search
      • There is no substitute for good planning and design. This pays huge dividends by saving you from having to re-crawl and re-index your content
      • Do your best to prepare the content to have consistent and clean metadata
      • Introduce search organically within your organization. Start with a pilot group and grow from there
      • Use the search engine for search
          • Don’t use it as your content repository
          • Don’t use it as a way to populate lists in the UI
      • Keep the number of queries from a UI page to a minimum. Bigger search results are sometimes better than multiple search queries from a single page
      • Don’t index the world. Start with a manageable content set and add content in phases
      • Index only a small portion in your dev and test environments. This will make you agile and save you a lot of time during the build and stabilize phases
  • 18. Best Practices
      • Be prepared to handle growth. Have a plan in place. Indexes can get large quickly
      • Do a thorough job with performance testing and sizing. This will pay off in terms of performance and scalability in production
      • Understand your performance requirements (number of users, QPS) and build your environment with excess capacity. It gives you breathing room for spikes and unexpected scenarios
      • A failover/redundant farm is highly recommended for the production environment. Besides the obvious benefits, a redundant farm helps with deployments
      • Initial environment architecture and sizing will always be a guess until performance testing validates them. Be prepared to add more servers
      • Have measurable metrics that operations personnel can use for monitoring the health of FAST
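
As a rough illustration of building in excess capacity, the arithmetic below estimates how many query servers a farm needs once headroom is added to the expected peak. It assumes throughput scales linearly with server count, which performance testing has to confirm; the function name, the 50% headroom default, and the sample numbers are hypothetical.

```python
import math


def query_servers_needed(peak_qps, qps_per_server, headroom=0.5):
    """Estimate query servers for one farm, assuming linear scaling.

    peak_qps:       expected peak query rate from the requirements
    qps_per_server: sustainable rate of one server, measured in testing
    headroom:       fraction of extra capacity kept for spikes (0.5 = 50%)
    """
    if qps_per_server <= 0:
        raise ValueError("qps_per_server must be positive")
    target_qps = peak_qps * (1 + headroom)
    return math.ceil(target_qps / qps_per_server)


if __name__ == "__main__":
    # Example: 30 QPS expected at peak, 12 QPS measured per server.
    print(query_servers_needed(peak_qps=30, qps_per_server=12))
```

A redundant farm, as recommended above, would need the same count again; the estimate should be revisited each time performance testing produces a new per-server figure.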
  • 19. Q&A