How did it go?
The first large enterprise search project in Europe using SharePoint 2013
Smarta Sök :: Stockholm :: January 29th 2014
Petter Skodvin-Hvammen, Solutions Architect
Who am I
Petter Skodvin-Hvammen

• Senior Consultant, AD-Gruppen
• Advisor, Architect, Developer
• 15 years in consulting
• 7 years with search (FAST + SharePoint)

Live in Tønsberg, Norway
Oldest city in Scandinavia
visittonsberg.com/en

Search clients
Nokas (no), Aker Solutions (no), L’Oréal (fr), Gyldendal (no), Media Saturn (de), Chalmers (se), Atlas Copco (be), Marktplaats (nl), Metro (de), Eniro (no), Genpact (in), News International (uk), Aller Group (dk), …
Background
• Global oil services company
• 20,000 employees
• 85 locations in 30 countries
• Grow revenue by 15%
• Grow head count by 13,000 in 4 years

• Company acquisitions
• Global knowledge sharing
• Senior engineers retiring
• Employee onboarding
• Employee turnover
Knowledge Initiative
• Knowledge management organization
• Culture for sharing
• IT investments for supporting discovery and sharing
  • Enterprise Search
  • People profile sites
  • Communities of practice
  • Knowledge sharing portal
  • Tools for validated information

• Avanade:
  • Project Manager
• AD-Gruppen:
  • Solutions Architect
• Comperio:
  • Search Consultants
  • Designers
  • Developers
Key pains
• It’s hard to find your own documents
• Only closest colleagues are asked for help
• People don’t search for answers, but for others who may know them
• Sharing across projects, business areas and locations is limited
• It’s hard to find and reuse knowledge assets from the past
• Core systems and applications are not consolidated
  • 60 core systems exist, and more than 2,000 applications are in use
Content Sources
• SharePoint 2007 Intranet Sites: 700,000 items
• SharePoint 2007 Team Sites: 7 M items | 1,000 sites | 2.5 TB
• SharePoint 2013 Communities: 20 sites (new)
• SharePoint 2013 My Sites: 25,000 sites (new)
• SharePoint 2013 People Profiles: 25,000 profiles (new)
• Windows Server File Shares: > 100 M items | 3,000 shares
• SQL Server Database: > 5,000 items; > 4,000 items
Search Strategy
• Create an internal «Google»
• «One search to find it all»
• Find answers by
  • Finding people
  • Finding more than you expect
  • Finding people by documents
  • Finding documents by people
  • Finding people by projects

• Users were looking for
  • People
  • Projects
  • Products
  • Documents
  • Communities
User Experience
• Extended Search Box
• Verticals from Information Model
• Topic Pages
• Result Groups and Display Templates
• Custom Refiners
• People Hits in Search Box
• Custom Feedback
Customizations
Searching
Comperio FRONT, JavaScript hooks, Search REST API, SharePoint Apps, Topic Pages, Custom Search Box, Branding and Styling

Indexing
BCS Connector, Content Enrichment, Content Extractors, Symbolic Links, 3,000 File Shares, Spelling and Synonyms synchronised with Corporate Taxonomy

Administration
PowerShell Deployment, Custom Cmdlets, Crawl config Automation, Custom Timer Jobs, Custom Relevancy Models, Custom Promoted Results
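
To illustrate the Search REST API item listed under Searching, here is a minimal sketch of how a client could build a query URL for the SharePoint 2013 search endpoint (`/_api/search/query`). The helper name `build_search_url` is hypothetical, the host `http://intranet` is taken from the screenshots in this deck, and the ranking model ID is a placeholder. The slides do not show the project's actual queries, so treat this as an assumption about usage rather than the project's code.

```python
from urllib.parse import quote


def build_search_url(site_url, query_text, row_limit=10, ranking_model_id=None):
    """Build a GET URL for the SharePoint 2013 Search REST endpoint.

    Hypothetical helper for illustration only.
    """
    # String values are wrapped in single quotes; a quote inside the
    # value is escaped by doubling it before URL-encoding.
    escaped = query_text.replace("'", "''")
    params = [
        "querytext='" + quote(escaped, safe="") + "'",
        "rowlimit=" + str(int(row_limit)),
    ]
    if ranking_model_id is not None:
        # Optional: ask the engine to rank with a specific (custom) model.
        params.append("rankingmodelid='" + quote(ranking_model_id, safe="") + "'")
    return site_url.rstrip("/") + "/_api/search/query?" + "&".join(params)


if __name__ == "__main__":
    print(build_search_url("http://intranet", "pump maintenance", row_limit=5))
```

Passing a different `ranking_model_id` for the same query text is one way a test page could compare result orderings side by side.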
Infrastructure Needs
• 40 million documents
• 10 queries / second

[Figure: server topology diagram. Server roles shown: WFE, FRONT, Query, Admin, Caching, Crawling, Analytics, Doc Proc, Enrichment, Central Admin. Index partitions: Index-0, Index-1, Index-2, Index-3. SQL Server: Admin DB, Analytics DB, Crawl DB, Link DB, other SP DBs.]
Server Infrastructure
• 14 virtual machines, each with:
  • 8 CPU cores (112 in total)
  • 16 GB memory (224 GB in total)
    • Increased to 32 GB after go-live
  • 150 GB system disk (2.1 TB in total)
  • 450 GB data disk (6.3 TB in total)
  • 200 IOPS (x 8 = 1,600 IOPS)

• 4 physical hosts (VMware ESX)
• 2 datacenters
• UAT, QA, DEV environments

• Server Software
  • SharePoint Server 2013
  • Windows Server 2008 R2
  • SQL Server 2012 Enterprise
  • Netscaler Load Balancer

• Development Software
  • Visual Studio Premium 2012
  • Team Foundation Server
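
The farm-wide totals in parentheses on this slide follow from multiplying the per-VM figures by the 14 virtual machines. A minimal sketch of that arithmetic is shown below; the names `PER_VM` and `farm_totals` are illustrative, and the 448 GB figure for the post-go-live memory upgrade is derived here, not stated on the slide.

```python
VM_COUNT = 14

# Per-VM sizing as listed on the slide.
PER_VM = {
    "cpu_cores": 8,
    "memory_gb": 16,
    "system_disk_gb": 150,
    "data_disk_gb": 450,
}


def farm_totals(vm_count, per_vm):
    """Multiply each per-VM resource by the number of VMs."""
    return {name: vm_count * value for name, value in per_vm.items()}


totals = farm_totals(VM_COUNT, PER_VM)

if __name__ == "__main__":
    print(totals["cpu_cores"])              # 112 cores
    print(totals["memory_gb"])              # 224 GB
    print(totals["system_disk_gb"] / 1000)  # 2.1 TB
    print(totals["data_disk_gb"] / 1000)    # 6.3 TB
    # Derived, not on the slide: memory after the increase to 32 GB per VM.
    print(farm_totals(VM_COUNT, {"memory_gb": 32})["memory_gb"])  # 448 GB
```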
Infrastructure Tuning
Capacity test findings
• Crawl rate declined 1% per million items indexed
• Query latency increased exponentially from 12 million items per partition
• Disk usage significantly lower than expected
  • 850 GB -> 450 GB data disk per VM = 5.6 TB saved

Resource monitoring revealed
• VMware:
  • Don’t allocate more CPUs than your virtual machines need
  • Don’t dedicate more memory than is available when failover occurs
• Optimizations are planned
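
The two capacity findings above can be turned into a rough planning aid. The sketch below is an illustration under stated assumptions: the slide does not say whether the 1% crawl-rate decline is linear or compounding, so both readings are modelled; the 4 index partitions are read from the Index-0 to Index-3 labels in the topology diagram; and all function names are hypothetical.

```python
LATENCY_KNEE_MILLIONS = 12  # latency rose sharply from this many items per partition


def crawl_rate_factor(items_millions, decline_per_million=0.01, compounding=False):
    """Fraction of the initial crawl rate left after indexing N million items."""
    if compounding:
        # Reading 1: each additional million items costs 1% of the current rate.
        return (1.0 - decline_per_million) ** items_millions
    # Reading 2: each additional million items costs 1% of the initial rate.
    return max(0.0, 1.0 - decline_per_million * items_millions)


def items_per_partition(total_items_millions, partitions):
    """Average number of items (in millions) held by each index partition."""
    return total_items_millions / partitions


def below_latency_knee(total_items_millions, partitions, knee=LATENCY_KNEE_MILLIONS):
    """True while partitions stay under the size where latency started to climb."""
    return items_per_partition(total_items_millions, partitions) < knee


if __name__ == "__main__":
    # Target size from the Infrastructure Needs slide: 40 million documents.
    print(items_per_partition(40, 4))   # 10.0 million per partition
    print(below_latency_knee(40, 4))    # True
    print(round(crawl_rate_factor(40), 2))
    print(round(crawl_rate_factor(40, compounding=True), 2))
```

Under these assumptions the 40 million document target stays below the 12 million per partition mark with 4 partitions, while the crawl rate at that size would be noticeably lower than at the start under either reading.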
Performance Considerations
• Add enough hardware resources (CPU, Memory, IO)
• Give SQL Server enough memory
• Exclude index location from virus scanning
• Know your virtual environment
• Use separate servers for distributed caching

• Design for performance and go easy on the query rules
Improving Relevancy
• Relevancy workshops with knowledge managers
• Analyzed search reports
  • Top queries
  • Abandoned queries
  • Zero result queries

• Testing and tuning by business
  • Defined test cases
  • Managed promoted results
  • Managed synonyms
  • Managed query suggestions

• Created custom ranking models
  • Increase weight for hits in title
  • Add weight for freshness
  • Improve weight for proximity
  • Boost intranet content

• Custom app for rank model testing by business users
  • Only admin has access
  • Testers need to select model for comparing results
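
The effect of the ranking adjustments listed above can be illustrated with a toy weighted-sum scorer. This is not SharePoint's rank model format or the project's actual models: the `Hit` fields, the weight values and the sample documents are all hypothetical, chosen only to show how changing weights for title, freshness, proximity and an intranet boost can reorder the same hits.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    base_score: float   # engine score before any tuning (hypothetical)
    title_match: float  # 0..1, how well the query matches the title
    freshness: float    # 0..1, 1 means very recently modified
    proximity: float    # 0..1, how close the query terms are in the text
    is_intranet: bool   # True if the hit comes from intranet content


# Two illustrative weight sets; the numbers are made up.
BASELINE = {"title": 1.0, "freshness": 0.0, "proximity": 1.0, "intranet_boost": 0.0}
TUNED = {"title": 2.0, "freshness": 0.5, "proximity": 1.5, "intranet_boost": 1.0}


def score(hit, weights):
    """Weighted sum of the features, plus a flat boost for intranet hits."""
    total = hit.base_score
    total += weights["title"] * hit.title_match
    total += weights["freshness"] * hit.freshness
    total += weights["proximity"] * hit.proximity
    if hit.is_intranet:
        total += weights["intranet_boost"]
    return total


def rank(hits, weights):
    """Return hit names ordered from highest to lowest score."""
    return sorted(hits, key=lambda name: score(hits[name], weights), reverse=True)


HITS = {
    "old_file_share_doc": Hit(1.0, 0.2, 0.1, 0.9, False),
    "fresh_intranet_page": Hit(0.5, 0.8, 0.9, 0.3, True),
}

if __name__ == "__main__":
    print(rank(HITS, BASELINE))
    print(rank(HITS, TUNED))
```

Running both weight sets over the same hits mirrors the idea of letting testers pick a model and compare the resulting order.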
Current Status
• 25% of index capacity in use
• Performance can be improved
• Infrastructure to be optimized
• Relevancy can be improved
• Governance established but not adopted by business
• Many users still don’t like to see people when searching for documents, and vice versa

• Unstable solution
• Information architecture
• People still don’t know how to search effectively
• People don’t think about findability when creating content
• Important content sources not added due to high complexity and costs (SAP and ProArc)
How did it go?
• 12,000 searches per day
• Top 3: Best SharePoint Search Solution (European SharePoint Community Awards 2013)
• Microsoft SharePoint Case Study

Questions?

Petter Skodvin-Hvammen
psh@adgruppen.no
no.linkedin.com/in/petterskodvin
@pettersh
