Enterprise Search
• Best Practices
• Connector Framework
• Relevancy overview



SharePoint User Group
March 26, 2013, Minsk
                 Confidential   1
EPAM has more than 100 systems
knowledgebase.epam.com                      trainings.epam.co
                 ???????
                                           HR file shares
Information.epam.com

XXX.epam.com
                                           Bla.bla.bla.epam.co

  YYY.epam.com           Jira.epam.com

Little homework




We started POC in September 2012




Available as search.epam.com in November 2012




• SharePoint 2010
• FAST Search for SharePoint
• Branded Search Center
• Custom connectors
• Fine-tuned relevance to reflect the EPAM landscape




We become stronger every day…

• 550 000 searchable items
• 30+ content sources
• 400+ daily searches
• Exposed to the internet



… to help you search




What we’ve learned

1. Deploy “painkiller” project as soon as possible
2. Connect as many systems as possible (Cap O. speaking)
3. Analyze
   • Watch search logs
   • Connect external analytics
   • Speak with users
      • Feedback forms suck
4. Tune relevancy
   • Hot-fix relevancy bugs using best bets
5. Work with departments to adapt their content
   • Basic SEO



Search Connectors in SP2010/2013

Search Connectors
• Protocol Handlers: File Share, SharePoint, WebSite, People
• BCS: Lotus Notes, Exchange, Custom BCS, Database, WebService, .NET

BCS Connectors in SP 2010/2013




    Stereotyped Operations
•    Get IDs
•    Get By ID
•    Describe Security
•    Read Stream
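
The four operations above can be pictured as one small interface that a crawler drives. The sketch below is a language-neutral illustration in Python, not the BCS API: the names `Connector`, `InMemoryConnector`, `Item` and `full_crawl` are hypothetical, and the in-memory store only stands in for a real external system.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Iterator, List, Protocol, Tuple


@dataclass
class Item:
    item_id: str
    title: str


class Connector(Protocol):
    """Hypothetical interface mirroring the four operations on the slide."""

    def get_ids(self) -> Iterable[str]: ...              # Get IDs
    def get_by_id(self, item_id: str) -> Item: ...       # Get By ID
    def describe_security(self, item_id: str) -> List[str]: ...  # Describe Security
    def read_stream(self, item_id: str) -> bytes: ...    # Read Stream


class InMemoryConnector:
    """Toy source backed by a dict: id -> (title, allowed users, content)."""

    def __init__(self, docs: Dict[str, Tuple[str, List[str], bytes]]):
        self._docs = docs

    def get_ids(self) -> Iterable[str]:
        return list(self._docs)

    def get_by_id(self, item_id: str) -> Item:
        return Item(item_id, self._docs[item_id][0])

    def describe_security(self, item_id: str) -> List[str]:
        return self._docs[item_id][1]

    def read_stream(self, item_id: str) -> bytes:
        return self._docs[item_id][2]


def full_crawl(connector: Connector) -> Iterator[dict]:
    """Enumerate IDs, then fetch metadata, ACL and content for each item."""
    for item_id in connector.get_ids():
        item = connector.get_by_id(item_id)
        yield {
            "id": item.item_id,
            "title": item.title,
            "acl": connector.describe_security(item_id),
            "body": connector.read_stream(item_id),
        }
```

For example, `list(full_crawl(InMemoryConnector({"1": ("Spec", ["alice"], b"hello")})))` yields one record carrying the title, the allowed users and the raw content.
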
EPAM Data Import Framework

ISource
• Tree DescribeTree()
• Node DownloadData(Node)
• Implementations: Atlassian Confluence, SVN, PMC

IImporter (Timer Job)
Workflow
1. Source builds its tree
2. Destination builds its tree
3. Diff the trees
4. Destination imports the diff (add, remove)

IDestination
• Tree DescribeTree()
• void Import(Tree)
• Implementations: SharePoint Library, File System

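
The import workflow (describe both trees, diff them, apply the diff) can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the framework's code: the real `Tree` and `Node` types are not shown in the deck, so a tree is flattened here to a `{path: version}` dictionary, and `DictSource`, `DictDestination`, `Diff`, `diff_trees` and `run_import` are hypothetical names. Unlike `void Import(Tree)` on the slide, the toy `import_diff` also receives the source so it can download content.

```python
from dataclasses import dataclass, field
from typing import Dict, List

# Assumed shape: a tree flattened to {path: version}.
Tree = Dict[str, str]


@dataclass
class Diff:
    add: List[str] = field(default_factory=list)     # new or changed paths
    remove: List[str] = field(default_factory=list)  # paths gone from the source


def diff_trees(source: Tree, destination: Tree) -> Diff:
    """Step 3: compare the source tree with the destination tree."""
    add = sorted(p for p, v in source.items() if destination.get(p) != v)
    remove = sorted(p for p in destination if p not in source)
    return Diff(add, remove)


class DictSource:
    """Toy ISource: describes its tree and hands out content by path."""

    def __init__(self, files: Dict[str, bytes], versions: Tree):
        self._files, self._versions = files, versions

    def describe_tree(self) -> Tree:
        return dict(self._versions)

    def download_data(self, path: str) -> bytes:
        return self._files[path]


class DictDestination:
    """Toy IDestination: keeps imported content and versions in memory."""

    def __init__(self) -> None:
        self.files: Dict[str, bytes] = {}
        self.versions: Tree = {}

    def describe_tree(self) -> Tree:
        return dict(self.versions)

    def import_diff(self, diff: Diff, source: DictSource) -> None:
        src_tree = source.describe_tree()
        for path in diff.add:
            self.files[path] = source.download_data(path)
            self.versions[path] = src_tree[path]
        for path in diff.remove:
            self.files.pop(path, None)
            self.versions.pop(path, None)


def run_import(source: DictSource, destination: DictDestination) -> Diff:
    """Steps 1-4: build both trees, diff them, apply the diff."""
    diff = diff_trees(source.describe_tree(), destination.describe_tree())
    destination.import_diff(diff, source)
    return diff
```

Running `run_import` twice in a row returns an empty diff the second time, which is the property that makes a scheduled job cheap when nothing has changed.
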
BCS vs DataImport Comparison

                           Data Import   BCS
Effort to build the same       +          +
Document Previews              +          -
Indexing Speed                 +         +/-
Customizable                   +          -
Storage Space                  -          +
Unit Testing                   +         +/-
Incremental crawl              +         +/-

RELEVANCY
Search is a two-step process
0. User submits query
1. Get candidates: all docs that match the query
2. Predict relevancy
     • Query terms importance
     • Proximity of query terms
     • Hit location (managed property) importance
     • Freshness
     • Clicks
     • User rating
     • …
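
The two steps can be sketched in a few lines of Python. This is a toy illustration, not how either search engine works internally: the document fields (`title`, `body`, `clicks`), the all-terms matching rule and the weights `2.0` and `0.1` are assumptions chosen only to show candidate selection followed by scoring.

```python
from typing import Any, Dict, List

Doc = Dict[str, Any]


def get_candidates(query: str, docs: List[Doc]) -> List[Doc]:
    """Step 1: keep every document that contains all query terms."""
    terms = query.lower().split()
    candidates = []
    for doc in docs:
        words = set((doc["title"] + " " + doc["body"]).lower().split())
        if all(term in words for term in terms):
            candidates.append(doc)
    return candidates


def score(query: str, doc: Doc) -> float:
    """Step 2: toy relevancy estimate from hit location (title) and clicks."""
    terms = query.lower().split()
    title_words = doc["title"].lower().split()
    title_hits = sum(term in title_words for term in terms)
    return 2.0 * title_hits + 0.1 * doc["clicks"]


def search(query: str, docs: List[Doc]) -> List[Doc]:
    """Select candidates, then order them by predicted relevancy."""
    candidates = get_candidates(query, docs)
    return sorted(candidates, key=lambda d: score(query, d), reverse=True)
```

With this scoring, a document that matches the query in its title outranks a more-clicked document that matches only in its body, and a document that matches nothing never reaches the ranking step.
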

Relevancy in FAST Search

•   Linear combination of features
•   RankProfile
•   Weights are configured via PowerShell
•   Easy to understand via RankLog

• Easy tuning
    – Content Source
    – Managed Property
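
A linear combination of features is simply a weighted sum, which is also why a per-feature breakdown is easy to read. The sketch below is a generic Python illustration and not the FAST rank profile format: the feature names and weights are hypothetical.

```python
from typing import Dict, List, Tuple


def rank_contributions(
    features: Dict[str, float], weights: Dict[str, float]
) -> List[Tuple[str, float]]:
    """Per-feature contribution (weight * value), largest first."""
    rows = [(name, weights.get(name, 0.0) * value) for name, value in features.items()]
    return sorted(rows, key=lambda row: row[1], reverse=True)


def linear_rank(features: Dict[str, float], weights: Dict[str, float]) -> float:
    """Final score: the sum of all per-feature contributions."""
    return sum(contribution for _, contribution in rank_contributions(features, weights))
```

Raising one weight changes the score by a predictable amount, so tuning by content source or managed property amounts to adjusting the corresponding term of the sum.
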

RankLog example (QueryLogger @codeplex)




Relevancy in SharePoint
• Nonlinear combination of features. Two neural networks.
• The ranking model schema is described in:
    •   http://www.google.com/patents/US8296292
    •   http://www.google.com/patents/US7840569
• Cmdlets to import/export ranking models
• Default ranking model features:
  Type                Instance
  BM25                BM25
  Static              UrlDepth
  BucketedStatic      InternalFileType
  BucketedStatic      Language
  Static              ClickDistance
  Static              QueryLogClicks
  Static              QueryLogSkips
  Static              LastClicks
  Static              EventRate
  MinSpan - soft      Title
  MinSpan - soft      Title
  MinSpan - soft      Title
  MinSpan - soft      Content
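
To show what "nonlinear combination" means in contrast to a weighted sum, here is a minimal Python sketch of a network with one hidden layer. It is not the SharePoint ranking model: the layer size, the `tanh` activation and all weights are assumptions made for illustration, and the function name `neural_score` is hypothetical.

```python
import math
from typing import Sequence


def neural_score(
    features: Sequence[float],
    hidden_weights: Sequence[Sequence[float]],  # one weight row per hidden node
    hidden_biases: Sequence[float],             # one bias per hidden node
    output_weights: Sequence[float],            # one output weight per hidden node
) -> float:
    """Score = sum over hidden nodes of w_out * tanh(w_row . features + bias)."""
    total = 0.0
    for w_row, bias, w_out in zip(hidden_weights, hidden_biases, output_weights):
        activation = math.tanh(sum(w * x for w, x in zip(w_row, features)) + bias)
        total += w_out * activation
    return total
```

Because `tanh` saturates, doubling a feature value does not double the score, so the effect of a single feature cannot be read off from one weight the way it can in a linear model.
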




ExplainRank page
• Google for “explain rank sharepoint”
• Parses the RankDetail managed property




Ranking Model Tuning

Approach described by Microsoft
   – http://msdn.microsoft.com/en-us/library/bb499682(v=office.12).aspx
1. Collect query judgments
2. Use machine learning to train the neural network




• namespace Microsoft.Office.Server.Search.RankerTuning
• Wait for tuning tool




Query Judgment framework




Manual relevancy tuning in SharePoint

• Authoritative Pages
• Query Rules
    – Best Bets
    – Understanding User Intent
• Synonyms (cmdlets)
• Entity Extractors
• Spelling Corrections
• Query Suggestions
• Managed Metadata
• (!) Query Builder


SP 2013 REST Query tool

• http://sp2013searchtool.codeplex.com/
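
A tool like this issues requests against the SharePoint 2013 Search REST endpoint, `/_api/search/query`. The Python sketch below only builds such a request URL and sends nothing. The helper name `build_search_url`, the host `search.example.com` and the chosen defaults are hypothetical; the sketch assumes the `querytext`, `rowlimit` and `selectproperties` parameters and does not handle single quotes embedded in the query text.

```python
from typing import Sequence
from urllib.parse import quote


def build_search_url(
    site_url: str,
    query_text: str,
    row_limit: int = 10,
    select_properties: Sequence[str] = ("Title", "Path"),
) -> str:
    """Build a GET URL for the search REST endpoint (string values in single quotes)."""
    params = [
        "querytext='" + quote(query_text, safe="") + "'",
        "rowlimit=" + str(row_limit),
        "selectproperties='" + quote(",".join(select_properties), safe=",") + "'",
    ]
    return site_url.rstrip("/") + "/_api/search/query?" + "&".join(params)
```

Calling `build_search_url("https://search.example.com/", "vacation policy")` produces a URL that can be pasted into a browser or a REST client while signed in to the site.
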




Alexey Kozhemiakin
Solution Architect, Enterprise Search




