• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Which SharePoint Search is Right for You?
 

Which SharePoint Search is Right for You?

on

  • 1,569 views

 

Statistics

Views

Total Views
1,569
Views on SlideShare
1,569
Embed Views
0

Actions

Likes
0
Downloads
33
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • Opportunity to fix up bad/missing metadataFS4SP: Custom stage just before mapper; C# default.. Safe, has timeout..ESP: any CLI code – sdtin/stdout; even Bat file (newsletter example)Why a pipeline: Mark up documents w/ look-aside data (stock quotes); extend with sentiment analysis, auto-classification, etc. Very powerful capability for fixing content problems
  • Format: Filters, OutsideInVector: Similarity searching

Which SharePoint Search is Right for You? Which SharePoint Search is Right for You? Presentation Transcript

  • SharePoint 101:Which SharePoint Search is Right for You?
    Miles Kehoe
    New Idea Engineering Inc.
    miles.kehoe@ideaeng.com
  • SharePoint and FAST Search
    Introduction
    The Products
    Grand Tour
    Tech Details
    What Really counts
    Mapping Business Requirements to Technology
    Data
    Capacities
    Price
    Search Resources
  • New Idea Engineering Inc.
    Company Background
    Founded in 1996
    Headquarters: Santa Clara, California
    Customers in Europe and North America
    Vendor neutral approach to search
    Focus is on what is best for our clients
    Products and Consulting Services:
    Evaluation/Selection/Implementing Enterprise Search
    Search Best Practices
    SearchTrack Reporting & Analytics
    Search Data Quality Toolkit
    Enterprise Search Newsletter
    3
  • The Book
    Published Fall 2010
    Covers Search:
    Business
    SharePoint
    FS4SP
    ESP 5.3
    Trends
  • SharePoint Search Products:
    The Grand Tour
  • Naming Conventions
    Business Productivity
    (server/(e)CAL licensing)
    Internet Business
    (Server licensing)
    FAST Search Server 2010 for SharePoint
    SharePoint Server 2010
    for Internet Sites, Enterprise
    High End
    SharePoint Server 2010
    SharePoint Server 2010
    for Internet Sites,
    Standard
    Integrated with SharePoint
    Infrastructure
    SharePoint Foundation 2010
    Entry Level
    FAST Search Server 2010 for Internal Applications
    FAST Search Server 2010for Internet Sites
    High End
    Microsoft Search Server 2010
    Stand-Alone
    Infrastructure
    Microsoft Search Server 2010, Express
    Entry Level
  • The ‘SharePoint’ Codebase
    Business Productivity
    (server/(e)CAL licensing)
    Internet Business
    (Server licensing)
    FAST Search Server 2010 for SharePoint
    SharePoint Server 2010
    for Internet Sites, Enterprise
    High End
    SharePoint Server 2010
    SharePoint Server 2010
    for Internet Sites,
    Standard
    Integrated with SharePoint
    Infrastructure
    SharePoint Foundation 2010
    Entry Level
    FAST Search Server 2010 for Internal Applications
    FAST Search Server 2010for Internet Sites
    High End
    Microsoft Search Server 2010
    Stand-Alone
    Infrastructure
    Microsoft Search Server 2010, Express
    Entry Level
  • Product Names
  • Solutions for Internet Business
    Solutions for Business Productivity
    The Marketing Fantasy
    FAST Search
    for SharePoint Internet Sites
    FAST Search
    for SharePoint
    Integrated
    with
    SharePoint
    SharePoint Server for Internet Sites
    SharePoint Server
    FAST Search
    For Internet Sites
    FAST Search
    for Internal Applications
    Stand-alone
    Search Server
    Search Server
    Express
    Entry-Level
    Solutions
    SharePoint
    Foundation
  • In Summary
    • Two entry level (SPF, MSS-X), three infrastructure-tier (SP, SP_FIS, MSS), four high end (FS4SP, FS4SP-IS, FSIA, FSIS)
    • Four stand-alone (MSS-X, MSS, FSIA, FSIS), five integrated with SharePoint) SPF, SP, SP-FIS, FS4SP, FS4SP-IS)
    • Three intended/licensed for internally facing applications (SP, FSIA, FS4SP), three intended/licensed for externally facing applications (SP-FIS, FS4SP-IS, FSIS)
    • Six different images/media sets (SPF, MSS/MSS-X, SP/SP-FIS/SP/FIS-E, FS4SP/FS4SP/FS4SP-IS. FSIA, FSIS)
  • Confused Yet?
  • The Products
    SP and FS4SP Platforms
  • High Level Overview: SP vs FS4SP
  • SharePoint Server and FS4SP
    FS/FS4SP:
    • Two different codebases
    united by (mostly) common features
    • SharePoint environment
    • PowerShell
    • iFilters
    • OOB experience
    FS4SP adds
    • Index Pipeline
    • Deep facets
    • Geo/Location Search
    • FAST XRANK operator
    • Deep facets
    • Predictable scalability …
  • SP and FS4SP: End users
  • SP and FS4SP: IT
  • SP and FS4SP: Developers
  • Powerful FS4SP Features
    IT
    Content Processing pipeline
    Entity extraction
    Tunable relevance ranking
    Developers
    Add custom property extractors
    Extend content processing
    Include external data in relevance
    Build multiple relevance profiles
    USER
    Deep refiners with counts
    Context based user profile
    Multiple relevance profiles
    Sorting on any property
    Similarity Search
    Broader, better language support
    Richer query language
  • Indexing Pipeline

    FS4SP
    Stages coded in .Net
    Configure via UI or PowerShell
    Custom stages before ‘Mapper’
    Runs in sandbox w/ timeout
    Mapper
    FS4SP Index
    Language
    Detection
    Format
    Conversion
    EntityExtraction
    Lemmatization
    FS4SP
    Stages coded in Python
    (any CLI language OK)
    Configure via XML config file
    Custom stages allowed anywhere
    Runs in-line
  • Typical Content Pipeline Stages
    Default
    Optional
    • Format Conversion
    • Language and encoding detection
    • Lemmatizer
    • Tokenizer
    • Entity Extraction
    • DateTimeNormalizer
    • Vectorizer
    • WebAnalyzer
    • PropertiesMapper
    • PropertiesReporter
    • XML Properties mapper
    • Offensive Content Filter
    • Verbatim (whole word) extractor
    Loads dictionary for custom extraction, e.g. product names
    • Field Collapsing
    • Entity Extraction
    ‘Persons1’
  • The Role of the Pipeline
    The Content Pipeline
    Processing & refinement
    Date
    Lookaside
    Location
    Company
    REDMOND, Wash., andOSLO, Norway — Jan. 8, 2008
    Microsoft Corp. (Nasdaq “MSFT”) today announced that it will make an offer to acquire Fast Search & Transfer ASA (OSE: “FAST”), a leading provider of enterprise search solutions, through a cash tender offer for 19.00 Norwegian kroner (NOK) per share. This offer represents a 42 percent premium to the closing share price on Jan. 4, 2008 (the last trading day prior to this announcement), …
    Mapper

    Configurable
    Stages
    EntityExtraction
    Language
    Detection
    Format
    Conversion
  • The ‘Virtual Document’
    The initial document text content plus:
    • Explicit metadata (Title/Author/etc.)
    • Implicit metadata (Path/repository/filename)
    • Look-aside content from pipeline
    (synonyms/taxonomies/other mark-up)
    • Anything you can add to ‘make the needle bigger’
  • Entity Extraction
    FS4SP
    • Create/Edit Config Files
    • Update files (Persons/Places/Things)
    • Wait for system to update (5 minutes)
    • Voila!!
    ESP
    • Create/Edit dictionary file
    • Compile dictionary w/ ‘dictupdate’
    • Names, Companies, Job titles
    • Samples provided in many languages
    • Whitelist and blacklist
    • Bazinga!
  • Deep Refiners
    SP
    • Shallow refiners only
    • No count provided
    • Only managed properties/metadata
    • Uses top 50 results to populate refiners
    FS4SP
    • Provides document counts with refiners
    • All refiners shown (based on config)
  • FS4SP Result List
  • ESP Platforms
  • FAST ESP
    FAST ESP 5.3
    • Multiple Platform Support
    • Extreme Scalability
    • Petabytes of Content
    • >10K QPS
    FSIA = ESP
    • Identical to FAST ESP 5.3
    • ‘Internal Applications’
    • License based on ECAL + Server
    FSIS = FAST ESP 5.3 + IMS / CTS
    • IMS/CTS - Windows only (Visual Studio)
    • ‘Internet Sites’
    • License based on servers
  • ESP Platform
    Search development environment
    • ‘Some assembly required’
    • Not based on SharePoint
    • Connectors and WebParts available
    • No OOB end-user UI
    • Separate user and IT consoles
    • Fully customizable pipeline
    Development Tools
    • Java/Python/PostgreSQL
    • GUI and CLI (no PowerShell support)
    • Petabytes of Content
    • 10K+ QPS
  • ESP Admin Console
  • ESP Search Business Console
  • Search View: Results
  • Search View: Debugging
  • Search View: Debug Output
  • New FSIS Products
    Content Transformation Services: CTS
    • Design workflows for indexing from multiple sources
    Interaction Management Services: IMS
    • Manage query/results processing
    CTS and IMS:
    • Based on Visual Studio: Windows Only!
    • Licensed/Intended for FSIS ONLY (for now)
  • Content Transformation Services
  • Interaction Management Services
  • ESP Customer: Best Buy
  • ESP Customer: Financial Times
  • But…. What’s really important..
    What does your customer expect?
  • Mapping Business Requirements
    to the Technology
  • Identify business rules for facets/refiners
    If refiners are a business need:
    • Choose to index the appropriate metadata fields; or
    • Upgrade your data
    If refiners are not a business need:
    • Push the business rules into the 21st century
    • Use what you have; or
    • Update your content
  • The Data Audit
    Understand the data
    Repositories
    Where does the content live
    Is there security involved
    Documents & document structure
    Do documents have good metadata?
    Do you need to extract data
    Are there recognizable blocks of content?
    Lightweight publishing content
    Look at email/wikis/blogs/support calls…
  • Great search doesn’t just happen..
    Do search owners understand?
    Staffing Expectations
    Is there a search manager? An SCOE?
    How many people will be involved day to day?
    How to manage search?
    Review activity logs
    Update and manage best bets, new content
    Evangelize
    User Skills
    Are users knowledge workers or casual searchers?
    Is search business critical?
  • So what is the right answer?
    It depends
    Internal or external facing search?
    SharePoint or stand-alone?
    Casual users or Knowledge workers?
    Resources for managing search?
    Search box or search derived application?
    What’s the risk of missing content?
    That’s why your customer needs you!
  • Resources
    Search Dev Newsgroup:
    www.SearchDev.org
    Newsletter & Whitepapers:
    www.ideaeng.com/current
    www.ideaeng.com/wp
    Blog:
    EnterpriseSearchBlog.com
  • Questions/Follow-Up
    Miles Kehoe
    mbk@ideaeng.com
    www.ideaeng.com