SlideShare a Scribd company logo
1 of 54
or Transactional Content

Optimizing SharePoint for

Transactional
Content
Management
» Principal at Hershey Technologies
˃Twitter: @tomcastiglia
˃Email: tcastiglia@hersheytech.com

Joined Hershey Tech in 1998
This is my 7th SharePoint Saturday
Director of Hershey’s professional services team since 2001
Founding member of San Diego SharePoint User Group
(@sanspug)
» Founding member of San Diego .NET User Group
»
»
»
»
» Founded in 1991
» Microsoft Partner
» Specialists in
˃ End to End SharePoint Consulting Services
˃ Document Imaging / Scanning
˃ OCR (data and document capture)
˃ ECM / Document Management
˃ BPM / workflow

» SharePoint ISV
˃ XenDocs ECM for SharePoint

» Follow us on Twitter: @HersheyTech
Intuitive Document Query Builder
Middleware component
(Windows Service) to in
integrate content from
multi-function
scanners, fax servers
and reporting apps with
SharePoint
» Explanation of “Transactional Content Management”
(TCM)
» Overview of SharePoint features that are relevant to
TCM
» How to make SharePoint support TCM
» Demo of solutions that fill the feature gaps to ensure
SharePoint is successful for your transactional content
management project
˃
˃
˃
˃

Ad-hoc scanning / document capture into SharePoint
Optimizing SharePoint search for large scale TCM deployments
Enable collaboration of static, transactional documents
Make scanned images and PDF documents a 1st class citizen within SharePoint
» Assumptions - I presume that you understand:
˃ Columns (document metadata)
˃ Content Types
˃ Document Libraries

» Other topics not covered (just not enough time to
include):
˃
˃
˃
˃

Automated Data Capture/OCR
Records Management
Workflow
RBS
Web Content

SharePoint Rocks at this!

Document Collaboration
Transactional Documents

SharePoint Rocks at this!

SharePoint needs a little help here
“high-volume throughput of
relatively static documents”
“content which typically originates outside and
organization from external parties – customers
or partners-and relies on workflow or business
process management (BPM) to drive
transactional, back-office business processes.”
-Forrester Research
» Capturing content from MFPs & Fax servers
» Indexing scanned documents is clumsy
» Configuring Metadata Taxonomy for Search requires
unique expertise
» Lacks intuitive metadata query driven document search
» Treats scanned images and PDF files as a “2nd class
citizen” (compared to MS Office documents)
Transactional Documents

»
»
»
»
»
»
»

Purchase Orders
Vendor Invoices
Application Forms
Insurance claims
Student Records
Enrollment Forms
(Not project
based)

Collaborative documents

» Proposals, reports, spreadsheets,
presentations and other
documents created and edited by
knowledge worker users
˃ Office docs (Word, Excel, PowerPoint)
˃ PDF files

» Created and uploaded on an adhoc basis to support day to day
operations
» (Often project based)
Transactional Documents
Invoices@mycompany.com
Orders@mycompany.com

External
Systems
(AP, claims,
etc.)

OCR
Page Rotation
Barcode Rec.
Doc Sep.
Form Processing

Fax Server
Transactional Content

Collaborative content

» Centralized
» Often isolated to just
one or a few site
collections

» Decentralized
» Documents are often
spread throughout many
site collections, subsites, libraries and content
types
» Typically under 5K
documents per library.

˃ Document Center or Record
Center

» Thousands to millions
of documents per
library
Collaboration scenarios

Transactional Documents

» Navigation

» Navigation doesn’t work - too
many documents per library
» Search via metadata queries
only

˃ SiteSubSiteLibraryFolderDocument

» Keyword Search
˃ Searches both metadata and
document content
˃ Use of social algorithms improve
search results (e.g. highly rated
documents are returned above
other documents)

˃ Ignore document content
˃ Ignore social based algorithms like ratings

» Users expect intuitive, graphical
query builders to specify precise
search conditions against one or
more metadata fields.
Typical SharePoint search

Transactional Document search
» This native SharePoint feature does provide a
limited query builder …
˃ Allows users to query against specific SharePoint columns and choose
various search operators (Equals, At Most, At Least, On, Before, etc.)
˃ Filters document library providing results in a sortable, tabular display.
» Doesn’t support text columns
» Transactional documents
generally need text based
columns for fields like
InvoiceNumber, PONumber, Ve
ndorId, ClaimNumber, etc.
» Doesn’t scale well for libraries
that exceed the list view
threshold (5,000 documents by
default)
Metadata
Columns

Crawled
Properties

Managed
Properties

Search
Results
» Configuring Managed Properties in SharePoint
Search is more complex than it needs to be.
» SharePoint does not provide a robust query builder
for users to intuitively query documents (other
ECM solutions offer this OOB)
» SharePoint formats Search results like a search
engine, not like a document management product.
» SharePoint treats PDF documents and scanned
images as a 2nd class citizen.
» Crawled properties are metadata (such as
author, title, or subject) that are extracted from
SharePoint columns during crawls.
» However, this is the internal representation of
the metadata. To enable users to search on this
metadata, we need to use managed properties
that are mapped to the crawled properties.
» A new crawled property is created for each new custom
column, after…
˃The column is added to at least one list or library
˃The column is populated with a value in at least one item
˃A Full Crawl is performed
» All Crawled properties are grouped into
various categories.
» For Transactional Content Management
solutions, we generally care about the
“SharePoint” Category, which contains
crawled properties that are tied to list
columns in SharePoint.
» Accessible from Search Service
Application: Metadata
Properties>Categories
» The Naming convention is fully controlled by
SharePoint, using this convention:
˃ ows_[internal name of column]

» However, spaces or other symbols (.-!@#$%^, etc.)
within the internal column name are escaped, such
as:
Column Internal Name
Crawled Property Name
InvoiceNumber

ows_InvoiceNumber

Invoice Number

ows_Invoice_x0020_Number

Invoice.Number

ows_Invoice_x002e_Number

Invoice-Number

ows_Invoice_x002d_Number
» In SP2010, most SharePoint columns gets one crawled
property
˃ Managed Metadata Columns get a 2nd crawled property, with a
prefix of “ows_taxid”

» This extra crawled property is used to store the internal
GUID value that is associated with the managed metadata
term. For example:
Column Name: CostCenter

Normal Crawled Property: ows_CostCenter
MM Id Crawled Property: ows_taxid_CostCenter
» …Allow you to enable standardization in the
terms used for searching SharePoint.
» …Represent the end-user’s vision of the SP
taxonomy (at least with regards to Search)
˃So the name of your managed properties should
normally be something intuitive to your end-users
» One managed property may be mapped to one or more crawled
properties.
˃ Useful in low governance situations where multiple site owners or site
collection admins have duplicated site columns using different names
(e.g. InvoiceNumber vs ‘Invoice Number’)

» One crawled property may be mapped to one or more managed
properties
˃ Useful if different applications create their own managed
properties, and need to reference the same crawled property.
Without Managed Properties
Returns 16 items, only 6
of which are related to
what I wanted.
Included other
documents that happen
to contain the StudentId
value either as text in
the document or in
some other field (like an
Invoice Number, or
something else)

With managed properties
Returns only
the 6 correct
items
Provides an OOB
search interface that
allows users to select a
Managed Property
from a drop down
list, rather than having
to type out the
managed property
name (e.g.
“StudentID:” or
“StudentID=“)
Use your favorite XML editor (VS 2012)
Unlike Crawled Properties (which are
always auto-generated by SharePoint…
Managed properties can be created in one
of three ways…
SP 2010

Managed Properties can be created
manually by a SharePoint Administrator
from the Search Service Application
configuration.

» SP2010: “Metadata
Properties” link
» SP2013: “Search Schema”
link

SP 2013
» Click “New Managed
Property” link from Metadata
Property Mappings
˃ Property Name can contain most characters, except for
spaces (but please don’t use special characters)
˃ Based on the selected type, this managed property can only
be mapped to crawled properties with the same type.
˃ Add Mapping – Select 1 or more crawled properties to map
to this managed property.
+

˃

If multiple are selected decide whether to include all
values or just the first one found
Scopes – preset filter on content – like a global where clause

˃ Reduce storage requirements (“hash”) –
option actually works in reverse to what is
stated.
»
»
»

Property Name - Same as SP2010
Add Mapping - same as in SP2010
Reduce storage requirements (“hash”) option - No longer
exists in SP2013

» Many additional settings
˃ Searchable – Enables querying against the content
of the managed property
˃ Queryable – Enables querying against the specific
managed property
˃ Retrievable – Enable this setting for managed
properties that are relevant to present in search
results.
˃ Refinable – Can be used as a search refiner
˃ Sortable –
˃ Token Normalization
˃ Complete Matching
Automatically generated by custom
code or a 3rd party application
» For example, Hershey’s XenDocs
ECM for SharePoint will validate
that a managed property is
properly configured or
automatically create crawled and
managed properties for each
column when our web part is
configured.
Set References to…
• Microsoft.Office.Server.dll
• Microsoft.Office.Server.Search.dll
DLLs Located in:
C:Program FilesCommon FilesMicrosoft SharedWeb Server Extensions15ISAPI

Initialize the Search Schema…
using Microsoft.Office.Server.Search.Administration;

public void InitSearchSchema(string url)
{
SPSite site = new SPSite(url);
Schema _searchSchema = new Schema(SearchContext.GetContext(site));
}
public void CreateCrawledProperty(PropertySet propSet)
{
var propSetId = GetPropertySetId(propSet);
var category = _searchSchema.AllCategories[propSetId];
category.CreateCrawledProperty(crawledPropertyName, false, propSetId);
category.Update();
_crawledProps = _searchSchema.QueryCrawledProperties(string.Empty,
MAX_PROPS,
Guid.NewGuid(),
string.Empty,
true).Cast<CrawledProperty>();
}
public void CreateManagedProperty()
{
// Create new Managed Property to existing Crawled Property
ManagedPropertyCollection allProperties = _searchSchema.AllManagedProperties;
ManagedProperty managedProperty = allProperties.Create(propertyName, dataType);
// Map new Managed Property to existing Crawled Property
MappingCollection mappings = managedProperty.GetMappings();
Mapping mapping = new Mapping(crawledProperty.Propset,
crawledProperty.Name,
crawledProperty.VariantType,
property.PID);
}
public static void VerifySettings(ManagedDataType dataType, ManagedProperty
property, string fieldDataType, bool enabledForScoping, bool respectPriority)
{
property.Searchable = true;
property.Retrievable = true;
property.Sortable = true;
property.Queryable = true;
property.SortableType = SortableType.Enabled;
if (fieldDataType == "SPFieldLookupValueCollection" ||
fieldDataType == "TaxonomyFieldValueCollection")
{
property.HasMultipleValues = true;
}

Continued…
Continued…
if (dataType == ManagedDataType.Text &&
property.MaxCharactersInPropertyStoreIndex != 64)
{
// Reduce Storage requirements by using Hash..
// Yes for Text fields
// No for all others
property.MaxCharactersInPropertyStoreIndex = 64;
}
property.RespectPriority = respectPriority;
property.EnabledForScoping = enabledForScoping;
property.SafeForAnonymous = true;
property.TokenNormalization = true;
property.Update();
}
public static void VerifySettings(ManagedDataType dataType, ManagedProperty
property, string fieldDataType, bool enabledForScoping, bool respectPriority)
{
property.Searchable = true;
property.Retrievable = true;
property.Sortable = true;
property.Queryable = true;
property.SortableType = SortableType.Enabled;
if (fieldDataType == "SPFieldLookupValueCollection" ||
fieldDataType == "TaxonomyFieldValueCollection")
{
property.HasMultipleValues = true;
}
}
Let SharePoint Auto-Generate new
managed properties when it crawls
» In SharePoint 2010…
˃This feature is off by default, but it can be enabled in
your Search Service Application
From the Categories list, hover over the SharePoint
category, click the drop down arrow and then select the Edit
Category option.

Select the option to “automatically generate a new managed
property for each crawled property…”
» In SharePoint 2013…
˃All site columns that contain data will have a managed property
auto-generated upon a full crawl
˃This does not happen for list columns
˃This feature cannot be turned off and is not configurable (as far
as I can tell)
http://technet.microsoft.com/en-us/library/jj613136.aspx
Column

SharePoint 2010

SharePoint 2013

Name

Crawled Property

Managed Property

Crawled Properties

Managed Property

FooBar

ows_FooBar

owsFooBar1

ows_FooBar
ows_q_TEXT_FooBar

Not mapped
FooBarOWSTEXT

Foo Bar

ows_Foo_x0020_Bar

owsFoox0020Bar

ows_Foo_x0020_Bar

FooBarOWSTEXT

Foo_Bar

ows_Foo_Bar

owsFooBar

ows_Foo_Bar
ows_q_TEXT_Foo_Bar

Not mapped
FooBarOWSTEXT

Foo-Bar

ows_Foo_x002d_Bar

owsFoox002dBar ows_Foo-Bar
ows_q_TEXT_Foo-Bar

Foo.Bar

ows_Foo_x002e_Bar

owsFoox002eBar ows_Foo.Bar
ows_q_TEXT_Foo.Bar

Not mapped
Foo-BarOWSTEXT
Not mapped
Foo.BarOWSTEXT

The autogenerated
names for
managed
properties are
not “end-user
friendly” !
A vast improvement compared to the native Advanced Search Web Part
» MS Office Documents are first 1st class citizens in
SharePoint
˃ When office files are opened in Office 2007, 2010 or 2013, users can
perform many SharePoint functions on those documents:
+ Edit document content
+ Check in/out/discard
+ See version history
+ Edit metadata
˃ Preview Thumbnails in SP 2013

» Most other file types, especially PDF files and
scanned images are 2nd class citizens
˃ Read only view of document
Files typically open in native
apps such as Windows Photo
Gallery or Adobe Reader

» Users cannot edit
metadata
» If user rotates, reorders or deletes a
page, the changes
cannot be saved to SP
» User cannot annotate
pages (e.g. sticky
notes, redactions, etc.)
A powerful, low
cost PDF and
imaging viewer
for SharePoint
Visually
search
documents
with
thumbnails
and quick
previews
Search for text
within a PDF file
(just like Adobe
Reader/Acrobat)
Edit SharePoint
metadata within
the viewer for
PDF documents
and scanned
images
Adds robust
image editing
features –
annotations, reorder, rotate or
delete
pages, image
cleanup
» To leverage SharePoint’s native features for
transactional document management…
˃ Extensive upfront planning
˃ Complex configuration (many more steps to configure SP compared to
most dedicated document management products)

» To make the overall user experience in
SharePoint comparable with dedicated
Document Management products, plan on:
˃ Lots of custom code ... OR …
˃ 3rd party solutions

More Related Content

What's hot

Drilling Down to the Challenges of SharePoint Taxonomy Implementation
Drilling Down to the Challenges of SharePoint Taxonomy ImplementationDrilling Down to the Challenges of SharePoint Taxonomy Implementation
Drilling Down to the Challenges of SharePoint Taxonomy ImplementationTSoholt
 
Metadata management in SharePoint
Metadata management in SharePointMetadata management in SharePoint
Metadata management in SharePointMetataxis
 
Benjamin Guinebertière - Microsoft Azure: Document DB and other noSQL databas...
Benjamin Guinebertière - Microsoft Azure: Document DB and other noSQL databas...Benjamin Guinebertière - Microsoft Azure: Document DB and other noSQL databas...
Benjamin Guinebertière - Microsoft Azure: Document DB and other noSQL databas...NoSQLmatters
 
Webinar: Does the SharePoint 2010 Term Store Seem Like Alphabet Soup? Find ...
Webinar:  Does the SharePoint 2010 Term Store Seem Like Alphabet Soup?  Find ...Webinar:  Does the SharePoint 2010 Term Store Seem Like Alphabet Soup?  Find ...
Webinar: Does the SharePoint 2010 Term Store Seem Like Alphabet Soup? Find ...martingarland
 
Getting Started With Elasticsearch In .NET
Getting Started With Elasticsearch In .NETGetting Started With Elasticsearch In .NET
Getting Started With Elasticsearch In .NETAhmed Abd Ellatif
 
Introduction à DocumentDB
Introduction à DocumentDBIntroduction à DocumentDB
Introduction à DocumentDBMSDEVMTL
 
NHSPUG June 2015 - Must Love Term Sets: The New and Improved Managed Metadat...
NHSPUG June 2015  - Must Love Term Sets: The New and Improved Managed Metadat...NHSPUG June 2015  - Must Love Term Sets: The New and Improved Managed Metadat...
NHSPUG June 2015 - Must Love Term Sets: The New and Improved Managed Metadat...Jonathan Ralton
 
Extracting and Reducing the Semantic Information Content of Web Documents to ...
Extracting and Reducing the Semantic Information Content of Web Documents to ...Extracting and Reducing the Semantic Information Content of Web Documents to ...
Extracting and Reducing the Semantic Information Content of Web Documents to ...ijsrd.com
 
Simplifying & accelerating application development with MongoDB's intelligent...
Simplifying & accelerating application development with MongoDB's intelligent...Simplifying & accelerating application development with MongoDB's intelligent...
Simplifying & accelerating application development with MongoDB's intelligent...Maxime Beugnet
 
Fishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar PresentationFishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar PresentationKim Negaard
 
Spsvb Developer Intro to SharePoint Search
Spsvb   Developer Intro to SharePoint SearchSpsvb   Developer Intro to SharePoint Search
Spsvb Developer Intro to SharePoint SearchMichael Oryszak
 
Cool NoSQL on Azure with DocumentDB
Cool NoSQL on Azure with DocumentDBCool NoSQL on Azure with DocumentDB
Cool NoSQL on Azure with DocumentDBJan Hentschel
 
IOOF IT System Modernisation
IOOF IT System ModernisationIOOF IT System Modernisation
IOOF IT System ModernisationMongoDB
 
Content Management with MongoDB by Mark Helmstetter
 Content Management with MongoDB by Mark Helmstetter Content Management with MongoDB by Mark Helmstetter
Content Management with MongoDB by Mark HelmstetterMongoDB
 
Tuning for Performance: indexes & Queries
Tuning for Performance: indexes & QueriesTuning for Performance: indexes & Queries
Tuning for Performance: indexes & QueriesKeshav Murthy
 
Azure doc db (slideshare)
Azure doc db (slideshare)Azure doc db (slideshare)
Azure doc db (slideshare)David Green
 
Houston tech fest dev intro to sharepoint search
Houston tech fest   dev intro to sharepoint searchHouston tech fest   dev intro to sharepoint search
Houston tech fest dev intro to sharepoint searchMichael Oryszak
 
How to be Successful with Search in YOUR Organization
How to be Successful with Search in YOUR OrganizationHow to be Successful with Search in YOUR Organization
How to be Successful with Search in YOUR OrganizationAgnes Molnar
 

What's hot (20)

Drilling Down to the Challenges of SharePoint Taxonomy Implementation
Drilling Down to the Challenges of SharePoint Taxonomy ImplementationDrilling Down to the Challenges of SharePoint Taxonomy Implementation
Drilling Down to the Challenges of SharePoint Taxonomy Implementation
 
Metadata management in SharePoint
Metadata management in SharePointMetadata management in SharePoint
Metadata management in SharePoint
 
Benjamin Guinebertière - Microsoft Azure: Document DB and other noSQL databas...
Benjamin Guinebertière - Microsoft Azure: Document DB and other noSQL databas...Benjamin Guinebertière - Microsoft Azure: Document DB and other noSQL databas...
Benjamin Guinebertière - Microsoft Azure: Document DB and other noSQL databas...
 
Webinar: Does the SharePoint 2010 Term Store Seem Like Alphabet Soup? Find ...
Webinar:  Does the SharePoint 2010 Term Store Seem Like Alphabet Soup?  Find ...Webinar:  Does the SharePoint 2010 Term Store Seem Like Alphabet Soup?  Find ...
Webinar: Does the SharePoint 2010 Term Store Seem Like Alphabet Soup? Find ...
 
Getting Started With Elasticsearch In .NET
Getting Started With Elasticsearch In .NETGetting Started With Elasticsearch In .NET
Getting Started With Elasticsearch In .NET
 
Introduction à DocumentDB
Introduction à DocumentDBIntroduction à DocumentDB
Introduction à DocumentDB
 
NHSPUG June 2015 - Must Love Term Sets: The New and Improved Managed Metadat...
NHSPUG June 2015  - Must Love Term Sets: The New and Improved Managed Metadat...NHSPUG June 2015  - Must Love Term Sets: The New and Improved Managed Metadat...
NHSPUG June 2015 - Must Love Term Sets: The New and Improved Managed Metadat...
 
Spsl v unit - final
Spsl v unit - finalSpsl v unit - final
Spsl v unit - final
 
Extracting and Reducing the Semantic Information Content of Web Documents to ...
Extracting and Reducing the Semantic Information Content of Web Documents to ...Extracting and Reducing the Semantic Information Content of Web Documents to ...
Extracting and Reducing the Semantic Information Content of Web Documents to ...
 
Simplifying & accelerating application development with MongoDB's intelligent...
Simplifying & accelerating application development with MongoDB's intelligent...Simplifying & accelerating application development with MongoDB's intelligent...
Simplifying & accelerating application development with MongoDB's intelligent...
 
Fishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar PresentationFishbowl Solutions WebCenter Search Webinar Presentation
Fishbowl Solutions WebCenter Search Webinar Presentation
 
Spsvb Developer Intro to SharePoint Search
Spsvb   Developer Intro to SharePoint SearchSpsvb   Developer Intro to SharePoint Search
Spsvb Developer Intro to SharePoint Search
 
Cool NoSQL on Azure with DocumentDB
Cool NoSQL on Azure with DocumentDBCool NoSQL on Azure with DocumentDB
Cool NoSQL on Azure with DocumentDB
 
IOOF IT System Modernisation
IOOF IT System ModernisationIOOF IT System Modernisation
IOOF IT System Modernisation
 
Content Management with MongoDB by Mark Helmstetter
 Content Management with MongoDB by Mark Helmstetter Content Management with MongoDB by Mark Helmstetter
Content Management with MongoDB by Mark Helmstetter
 
Tuning for Performance: indexes & Queries
Tuning for Performance: indexes & QueriesTuning for Performance: indexes & Queries
Tuning for Performance: indexes & Queries
 
Azure doc db (slideshare)
Azure doc db (slideshare)Azure doc db (slideshare)
Azure doc db (slideshare)
 
Houston tech fest dev intro to sharepoint search
Houston tech fest   dev intro to sharepoint searchHouston tech fest   dev intro to sharepoint search
Houston tech fest dev intro to sharepoint search
 
Nosql part3
Nosql part3Nosql part3
Nosql part3
 
How to be Successful with Search in YOUR Organization
How to be Successful with Search in YOUR OrganizationHow to be Successful with Search in YOUR Organization
How to be Successful with Search in YOUR Organization
 

Similar to SPS Phoenix Optimizing SharePoint for Transactional Content Management

Building a scalable search architecture in share point 2013
Building a scalable search architecture in share point 2013Building a scalable search architecture in share point 2013
Building a scalable search architecture in share point 2013Terrence Nguyen
 
2018 09-03 aOS Aachen - SharePoint demystified - Thomas Vochten
2018 09-03 aOS Aachen - SharePoint demystified - Thomas Vochten2018 09-03 aOS Aachen - SharePoint demystified - Thomas Vochten
2018 09-03 aOS Aachen - SharePoint demystified - Thomas VochtenaOS Community
 
ESPC13 - 10 Things I Like in SharePoint 2013 Search
ESPC13 - 10 Things I Like in SharePoint 2013 SearchESPC13 - 10 Things I Like in SharePoint 2013 Search
ESPC13 - 10 Things I Like in SharePoint 2013 SearchAgnes Molnar
 
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v12017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1Don Miller
 
MetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
MetaVis Webinar - 10 Things I Like in SharePoint 2013 SearchMetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
MetaVis Webinar - 10 Things I Like in SharePoint 2013 SearchAgnes Molnar
 
EPC Group - Comprehensive Overview of SharePoint 2010's Enterprise Search Cap...
EPC Group - Comprehensive Overview of SharePoint 2010's Enterprise Search Cap...EPC Group - Comprehensive Overview of SharePoint 2010's Enterprise Search Cap...
EPC Group - Comprehensive Overview of SharePoint 2010's Enterprise Search Cap...EPC Group
 
SharePoint 2013 Search Operations
SharePoint 2013 Search OperationsSharePoint 2013 Search Operations
SharePoint 2013 Search OperationsSPC Adriatics
 
Chris McNulty - Managed Metadata and Taxonomies
Chris McNulty - Managed Metadata and TaxonomiesChris McNulty - Managed Metadata and Taxonomies
Chris McNulty - Managed Metadata and TaxonomiesSharePoint Saturday NY
 
Enterprise Search in SharePoint 2010
Enterprise Search in SharePoint 2010Enterprise Search in SharePoint 2010
Enterprise Search in SharePoint 2010bgerman
 
Sharepoint Document Management System (DMS) Features
Sharepoint Document Management System (DMS) Features Sharepoint Document Management System (DMS) Features
Sharepoint Document Management System (DMS) Features Nitin Gupta
 
Share point 2013 enterprise search (public)
Share point 2013 enterprise search (public)Share point 2013 enterprise search (public)
Share point 2013 enterprise search (public)Petter Skodvin-Hvammen
 
Share Point2007 Best Practices Final
Share Point2007 Best Practices FinalShare Point2007 Best Practices Final
Share Point2007 Best Practices FinalMarianne Sweeny
 
B365 saturday practical guide to building a scalable search architecture in s...
B365 saturday practical guide to building a scalable search architecture in s...B365 saturday practical guide to building a scalable search architecture in s...
B365 saturday practical guide to building a scalable search architecture in s...Thuan Ng
 
Introduction To Enterprise Search - OKCSUG 2010
Introduction To Enterprise Search - OKCSUG 2010Introduction To Enterprise Search - OKCSUG 2010
Introduction To Enterprise Search - OKCSUG 2010Corey Roth
 

Similar to SPS Phoenix Optimizing SharePoint for Transactional Content Management (20)

Building a scalable search architecture in share point 2013
Building a scalable search architecture in share point 2013Building a scalable search architecture in share point 2013
Building a scalable search architecture in share point 2013
 
Microsoft Enterprise Seach using SharePoint
Microsoft Enterprise Seach using SharePointMicrosoft Enterprise Seach using SharePoint
Microsoft Enterprise Seach using SharePoint
 
2018 09-03 aOS Aachen - SharePoint demystified - Thomas Vochten
2018 09-03 aOS Aachen - SharePoint demystified - Thomas Vochten2018 09-03 aOS Aachen - SharePoint demystified - Thomas Vochten
2018 09-03 aOS Aachen - SharePoint demystified - Thomas Vochten
 
ESPC13 - 10 Things I Like in SharePoint 2013 Search
ESPC13 - 10 Things I Like in SharePoint 2013 SearchESPC13 - 10 Things I Like in SharePoint 2013 Search
ESPC13 - 10 Things I Like in SharePoint 2013 Search
 
(27.05) MOSSCA Invita - Búsqueda empresarial 2
(27.05) MOSSCA Invita - Búsqueda empresarial 2(27.05) MOSSCA Invita - Búsqueda empresarial 2
(27.05) MOSSCA Invita - Búsqueda empresarial 2
 
(28/05) MOSSCA Invita - Administración de Contenido Empresarial
(28/05) MOSSCA Invita - Administración de Contenido Empresarial(28/05) MOSSCA Invita - Administración de Contenido Empresarial
(28/05) MOSSCA Invita - Administración de Contenido Empresarial
 
Search
SearchSearch
Search
 
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v12017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
 
MetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
MetaVis Webinar - 10 Things I Like in SharePoint 2013 SearchMetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
MetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
 
EPC Group - Comprehensive Overview of SharePoint 2010's Enterprise Search Cap...
EPC Group - Comprehensive Overview of SharePoint 2010's Enterprise Search Cap...EPC Group - Comprehensive Overview of SharePoint 2010's Enterprise Search Cap...
EPC Group - Comprehensive Overview of SharePoint 2010's Enterprise Search Cap...
 
SharePoint 2013 Search Operations
SharePoint 2013 Search OperationsSharePoint 2013 Search Operations
SharePoint 2013 Search Operations
 
KMA on Mms2010 nyc
KMA on Mms2010 nycKMA on Mms2010 nyc
KMA on Mms2010 nyc
 
Chris McNulty - Managed Metadata and Taxonomies
Chris McNulty - Managed Metadata and TaxonomiesChris McNulty - Managed Metadata and Taxonomies
Chris McNulty - Managed Metadata and Taxonomies
 
KMA's mms2010nyc
KMA's mms2010nycKMA's mms2010nyc
KMA's mms2010nyc
 
Enterprise Search in SharePoint 2010
Enterprise Search in SharePoint 2010Enterprise Search in SharePoint 2010
Enterprise Search in SharePoint 2010
 
Sharepoint Document Management System (DMS) Features
Sharepoint Document Management System (DMS) Features Sharepoint Document Management System (DMS) Features
Sharepoint Document Management System (DMS) Features
 
Share point 2013 enterprise search (public)
Share point 2013 enterprise search (public)Share point 2013 enterprise search (public)
Share point 2013 enterprise search (public)
 
Share Point2007 Best Practices Final
Share Point2007 Best Practices FinalShare Point2007 Best Practices Final
Share Point2007 Best Practices Final
 
B365 saturday practical guide to building a scalable search architecture in s...
B365 saturday practical guide to building a scalable search architecture in s...B365 saturday practical guide to building a scalable search architecture in s...
B365 saturday practical guide to building a scalable search architecture in s...
 
Introduction To Enterprise Search - OKCSUG 2010
Introduction To Enterprise Search - OKCSUG 2010Introduction To Enterprise Search - OKCSUG 2010
Introduction To Enterprise Search - OKCSUG 2010
 

More from DocFluix, LLC

Modern Automated Site Provisioning for SharePoint Online
Modern Automated Site Provisioning for SharePoint OnlineModern Automated Site Provisioning for SharePoint Online
Modern Automated Site Provisioning for SharePoint OnlineDocFluix, LLC
 
IA& Taxonomy Planning for SharePoint Online & Office 365
IA& Taxonomy Planning for SharePoint Online & Office 365IA& Taxonomy Planning for SharePoint Online & Office 365
IA& Taxonomy Planning for SharePoint Online & Office 365DocFluix, LLC
 
Developing Azure Functions for Flow and Nintex SPS SD 2018
Developing Azure Functions for Flow and Nintex SPS SD 2018Developing Azure Functions for Flow and Nintex SPS SD 2018
Developing Azure Functions for Flow and Nintex SPS SD 2018DocFluix, LLC
 
Developing Azure Functions as custom connectors for Flow and Nintex
Developing Azure Functions as custom connectors for Flow and NintexDeveloping Azure Functions as custom connectors for Flow and Nintex
Developing Azure Functions as custom connectors for Flow and NintexDocFluix, LLC
 
Microsoft flow how, when &amp; why
Microsoft flow   how, when &amp; whyMicrosoft flow   how, when &amp; why
Microsoft flow how, when &amp; whyDocFluix, LLC
 
Using Nintex Workflow Cloud with Salesforce for customer surveys
Using Nintex Workflow Cloud with Salesforce for customer surveysUsing Nintex Workflow Cloud with Salesforce for customer surveys
Using Nintex Workflow Cloud with Salesforce for customer surveysDocFluix, LLC
 
Using SharePoint, Nintex & DocuSign for Employee Onboarding and Open Enrollme...
Using SharePoint, Nintex & DocuSign for Employee Onboarding and Open Enrollme...Using SharePoint, Nintex & DocuSign for Employee Onboarding and Open Enrollme...
Using SharePoint, Nintex & DocuSign for Employee Onboarding and Open Enrollme...DocFluix, LLC
 
14 tips for planning a ecm content migration to share point
14 tips for planning a ecm content migration to share point14 tips for planning a ecm content migration to share point
14 tips for planning a ecm content migration to share pointDocFluix, LLC
 
15 tips for bullet proof requirements analysis on SharePoint projects
15 tips for bullet proof requirements analysis on SharePoint projects15 tips for bullet proof requirements analysis on SharePoint projects
15 tips for bullet proof requirements analysis on SharePoint projectsDocFluix, LLC
 

More from DocFluix, LLC (9)

Modern Automated Site Provisioning for SharePoint Online
Modern Automated Site Provisioning for SharePoint OnlineModern Automated Site Provisioning for SharePoint Online
Modern Automated Site Provisioning for SharePoint Online
 
IA& Taxonomy Planning for SharePoint Online & Office 365
IA& Taxonomy Planning for SharePoint Online & Office 365IA& Taxonomy Planning for SharePoint Online & Office 365
IA& Taxonomy Planning for SharePoint Online & Office 365
 
Developing Azure Functions for Flow and Nintex SPS SD 2018
Developing Azure Functions for Flow and Nintex SPS SD 2018Developing Azure Functions for Flow and Nintex SPS SD 2018
Developing Azure Functions for Flow and Nintex SPS SD 2018
 
Developing Azure Functions as custom connectors for Flow and Nintex
Developing Azure Functions as custom connectors for Flow and NintexDeveloping Azure Functions as custom connectors for Flow and Nintex
Developing Azure Functions as custom connectors for Flow and Nintex
 
Microsoft flow how, when &amp; why
Microsoft flow   how, when &amp; whyMicrosoft flow   how, when &amp; why
Microsoft flow how, when &amp; why
 
Using Nintex Workflow Cloud with Salesforce for customer surveys
Using Nintex Workflow Cloud with Salesforce for customer surveysUsing Nintex Workflow Cloud with Salesforce for customer surveys
Using Nintex Workflow Cloud with Salesforce for customer surveys
 
Using SharePoint, Nintex & DocuSign for Employee Onboarding and Open Enrollme...
Using SharePoint, Nintex & DocuSign for Employee Onboarding and Open Enrollme...Using SharePoint, Nintex & DocuSign for Employee Onboarding and Open Enrollme...
Using SharePoint, Nintex & DocuSign for Employee Onboarding and Open Enrollme...
 
14 tips for planning a ecm content migration to share point
14 tips for planning a ecm content migration to share point14 tips for planning a ecm content migration to share point
14 tips for planning a ecm content migration to share point
 
15 tips for bullet proof requirements analysis on SharePoint projects
15 tips for bullet proof requirements analysis on SharePoint projects15 tips for bullet proof requirements analysis on SharePoint projects
15 tips for bullet proof requirements analysis on SharePoint projects
 

Recently uploaded

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 

Recently uploaded (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 

SPS Phoenix Optimizing SharePoint for Transactional Content Management

  • 1. or Transactional Content Optimizing SharePoint for Transactional Content Management
  • 2. » Principal at Hershey Technologies ˃Twitter: @tomcastiglia ˃Email: tcastiglia@hersheytech.com Joined Hershey Tech in 1998 This is my 7th SharePoint Saturday Director of Hershey’s professional services team since 2001 Founding member of San Diego SharePoint User Group (@sanspug) » Founding member of San Diego .NET User Group » » » »
  • 3. » Founded in 1991 » Microsoft Partner » Specialists in ˃ End to End SharePoint Consulting Services ˃ Document Imaging / Scanning ˃ OCR (data and document capture) ˃ ECM / Document Management ˃ BPM / workflow » SharePoint ISV ˃ XenDocs ECM for SharePoint » Follow us on Twitter: @HersheyTech
  • 5. Middleware component (Windows Service) to in integrate content from multi-function scanners, fax servers and reporting apps with SharePoint
  • 6. » Explanation of “Transactional Content Management” (TCM) » Overview of SharePoint features that are relevant to TCM » How to make SharePoint support TCM » Demo of solutions that fill the feature gaps to ensure SharePoint is successful for your transactional content management project ˃ ˃ ˃ ˃ Ad-hoc scanning / document capture into SharePoint Optimizing SharePoint search for large scale TCM deployments Enable collaboration of static, transactional documents Make scanned images and PDF documents a 1st class citizen within SharePoint
  • 7. » Assumptions - I presume that you understand: ˃ Columns (document metadata) ˃ Content Types ˃ Document Libraries » Other topics not covered (just not enough time to include): ˃ ˃ ˃ ˃ Automated Data Capture/OCR Records Management Workflow RBS
  • 8. Web Content SharePoint Rocks at this! Document Collaboration Transactional Documents SharePoint Rocks at this! SharePoint needs a little help here
  • 9. “high-volume throughput of relatively static documents” “content which typically originates outside and organization from external parties – customers or partners-and relies on workflow or business process management (BPM) to drive transactional, back-office business processes.” -Forrester Research
  • 10. » Capturing content from MFPs & Fax servers » Indexing scanned documents is clumsy » Configuring Metadata Taxonomy for Search requires unique expertise » Lacks intuitive metadata query driven document search » Treats scanned images and PDF files as a “2nd class citizen” (compared to MS Office documents)
  • 11. Transactional Documents » » » » » » » Purchase Orders Vendor Invoices Application Forms Insurance claims Student Records Enrollment Forms (Not project based) Collaborative documents » Proposals, reports, spreadsheets, presentations and other documents created and edited by knowledge worker users ˃ Office docs (Word, Excel, PowerPoint) ˃ PDF files » Created and uploaded on an adhoc basis to support day to day operations » (Often project based)
  • 13. Transactional Content Collaborative content » Centralized » Often isolated to just one or a few site collections » Decentralized » Documents are often spread throughout many site collections, subsites, libraries and content types » Typically under 5K documents per library. ˃ Document Center or Record Center » Thousands to millions of documents per library
  • 14. Collaboration scenarios Transactional Documents » Navigation » Navigation doesn’t work - too many documents per library » Search via metadata queries only ˃ SiteSubSiteLibraryFolderDocument » Keyword Search ˃ Searches both metadata and document content ˃ Use of social algorithms improve search results (e.g. highly rated documents are returned above other documents) ˃ Ignore document content ˃ Ignore social based algorithms like ratings » Users expect intuitive, graphical query builders to specify precise search conditions against one or more metadata fields.
  • 16. » This native SharePoint feature does provide a limited query builder … ˃ Allows users to query against specific SharePoint columns and choose various search operators (Equals, At Most, At Least, On, Before, etc.) ˃ Filters document library providing results in a sortable, tabular display.
  • 17. » Doesn’t support text columns » Transactional documents generally need text based columns for fields like InvoiceNumber, PONumber, Ve ndorId, ClaimNumber, etc. » Doesn’t scale well for libraries that exceed the list view threshold (5,000 documents by default)
  • 19. » Configuring Managed Properties in SharePoint Search is more complex than it needs to be. » SharePoint does not provide a robust query builder for users to intuitively query documents (other ECM solutions offer this OOB) » SharePoint formats Search results like a search engine, not like a document management product. » SharePoint treats PDF documents and scanned images as a 2nd class citizen.
  • 20. » Crawled properties are metadata (such as author, title, or subject) that are extracted from SharePoint columns during crawls. » However, this is the internal representation of the metadata. To enable users to search on this metadata, we need to use managed properties that are mapped to the crawled properties.
  • 21. » A new crawled property is created for each new custom column, after… ˃The column is added to at least one list or library ˃The column is populated with a value in at least one item ˃A Full Crawl is performed
  • 22. » All Crawled properties are grouped into various categories. » For Transactional Content Management solutions, we generally care about the “SharePoint” Category, which contains crawled properties that are tied to list columns in SharePoint. » Accessible from Search Service Application: Metadata Properties>Categories
  • 23. » The Naming convention is fully controlled by SharePoint, using this convention: ˃ ows_[internal name of column] » However, spaces or other symbols (.-!@#$%^, etc.) within the internal column name are escaped, such as: Column Internal Name Crawled Property Name InvoiceNumber ows_InvoiceNumber Invoice Number ows_Invoice_x0020_Number Invoice.Number ows_Invoice_x002e_Number Invoice-Number ows_Invoice_x002d_Number
  • 24. » In SP2010, most SharePoint columns gets one crawled property ˃ Managed Metadata Columns get a 2nd crawled property, with a prefix of “ows_taxid” » This extra crawled property is used to store the internal GUID value that is associated with the managed metadata term. For example: Column Name: CostCenter Normal Crawled Property: ows_CostCenter MM Id Crawled Property: ows_taxid_CostCenter
  • 25. » …Allow you to enable standardization in the terms used for searching SharePoint. » …Represent the end-user’s vision of the SP taxonomy (at least with regards to Search) ˃So the name of your managed properties should normally be something intuitive to your end-users
  • 26. » One managed property may be mapped to one or more crawled properties. ˃ Useful in low governance situations where multiple site owners or site collection admins have duplicated site columns using different names (e.g. InvoiceNumber vs ‘Invoice Number’) » One crawled property may be mapped to one or more managed properties ˃ Useful if different applications create their own managed properties, and need to reference the same crawled property.
  • 27. Without Managed Properties Returns 16 items, only 6 of which are related to what I wanted. Included other documents that happen to contain the StudentId value either as text in the document or in some other field (like an Invoice Number, or something else) With managed properties Returns only the 6 correct items
  • 28. Provides an OOB search interface that allows users to select a Managed Property from a drop down list, rather than having to type out the managed property name (e.g. “StudentID:” or “StudentID=“)
  • 29. Use your favorite XML editor (VS 2012)
  • 30.
  • 31. Unlike Crawled Properties (which are always auto-generated by SharePoint… Managed properties can be created in one of three ways…
  • 32. SP 2010 Managed Properties can be created manually by a SharePoint Administrator from the Search Service Application configuration. » SP2010: “Metadata Properties” link » SP2013: “Search Schema” link SP 2013
  • 33. » Click “New Managed Property” link from Metadata Property Mappings ˃ Property Name can contain most characters, except for spaces (but please don’t use special characters) ˃ Based on the selected type, this managed property can only be mapped to crawled properties with the same type. ˃ Add Mapping – Select 1 or more crawled properties to map to this managed property. + ˃ If multiple are selected decide whether to include all values or just the first one found Scopes – preset filter on content – like a global where clause ˃ Reduce storage requirements (“hash”) – option actually works in reverse to what is stated.
  • 34. » » » Property Name - Same as SP2010 Add Mapping - same as in SP2010 Reduce storage requirements (“hash”) option - No longer exists in SP2013 » Many additional settings ˃ Searchable – Enables querying against the content of the managed property ˃ Queryable – Enables querying against the specific managed property ˃ Retrievable – Enable this setting for managed properties that are relevant to present in search results. ˃ Refinable – Can be used as a search refiner ˃ Sortable – ˃ Token Normalization ˃ Complete Matching
  • 35. Automatically generated by custom code or a 3rd party application » For example, Hershey’s XenDocs ECM for SharePoint will validate that a managed property is properly configured or automatically create crawled and managed properties for each column when our web part is configured.
  • 36. Set References to… • Microsoft.Office.Server.dll • Microsoft.Office.Server.Search.dll DLLs Located in: C:Program FilesCommon FilesMicrosoft SharedWeb Server Extensions15ISAPI Initialize the Search Schema… using Microsoft.Office.Server.Search.Administration; public void InitSearchSchema(string url) { SPSite site = new SPSite(url); Schema _searchSchema = new Schema(SearchContext.GetContext(site)); }
  • 37. public void CreateCrawledProperty(PropertySet propSet) { var propSetId = GetPropertySetId(propSet); var category = _searchSchema.AllCategories[propSetId]; category.CreateCrawledProperty(crawledPropertyName, false, propSetId); category.Update(); _crawledProps = _searchSchema.QueryCrawledProperties(string.Empty, MAX_PROPS, Guid.NewGuid(), string.Empty, true).Cast<CrawledProperty>(); }
  • 38. public void CreateManagedProperty() { // Create new Managed Property to existing Crawled Property ManagedPropertyCollection allProperties = _searchSchema.AllManagedProperties; ManagedProperty managedProperty = allProperties.Create(propertyName, dataType); // Map new Managed Property to existing Crawled Property MappingCollection mappings = managedProperty.GetMappings(); Mapping mapping = new Mapping(crawledProperty.Propset, crawledProperty.Name, crawledProperty.VariantType, property.PID); }
  • 39. public static void VerifySettings(ManagedDataType dataType, ManagedProperty property, string fieldDataType, bool enabledForScoping, bool respectPriority) { property.Searchable = true; property.Retrievable = true; property.Sortable = true; property.Queryable = true; property.SortableType = SortableType.Enabled; if (fieldDataType == "SPFieldLookupValueCollection" || fieldDataType == "TaxonomyFieldValueCollection") { property.HasMultipleValues = true; } Continued…
  • 40. Continued… if (dataType == ManagedDataType.Text && property.MaxCharactersInPropertyStoreIndex != 64) { // Reduce Storage requirements by using Hash.. // Yes for Text fields // No for all others property.MaxCharactersInPropertyStoreIndex = 64; } property.RespectPriority = respectPriority; property.EnabledForScoping = enabledForScoping; property.SafeForAnonymous = true; property.TokenNormalization = true; property.Update(); }
  • 41. public static void VerifySettings(ManagedDataType dataType, ManagedProperty property, string fieldDataType, bool enabledForScoping, bool respectPriority) { property.Searchable = true; property.Retrievable = true; property.Sortable = true; property.Queryable = true; property.SortableType = SortableType.Enabled; if (fieldDataType == "SPFieldLookupValueCollection" || fieldDataType == "TaxonomyFieldValueCollection") { property.HasMultipleValues = true; } }
  • 42. Let SharePoint Auto-Generate new managed properties when it crawls
  • 43. » In SharePoint 2010… ˃This feature is off by default, but it can be enabled in your Search Service Application From the Categories list, hover over the SharePoint category, click the drop down arrow and then select the Edit Category option. Select the option to “automatically generate a new managed property for each crawled property…”
  • 44. » In SharePoint 2013… ˃All site columns that contain data will have a managed property auto-generated upon a full crawl ˃This does not happen for list columns ˃This feature cannot be turned off and is not configurable (as far as I can tell) http://technet.microsoft.com/en-us/library/jj613136.aspx
  • 45. Column SharePoint 2010 SharePoint 2013 Name Crawled Property Managed Property Crawled Properties Managed Property FooBar ows_FooBar owsFooBar1 ows_FooBar ows_q_TEXT_FooBar Not mapped FooBarOWSTEXT Foo Bar ows_Foo_x0020_Bar owsFoox0020Bar ows_Foo_x0020_Bar FooBarOWSTEXT Foo_Bar ows_Foo_Bar owsFooBar ows_Foo_Bar ows_q_TEXT_Foo_Bar Not mapped FooBarOWSTEXT Foo-Bar ows_Foo_x002d_Bar owsFoox002dBar ows_Foo-Bar ows_q_TEXT_Foo-Bar Foo.Bar ows_Foo_x002e_Bar owsFoox002eBar ows_Foo.Bar ows_q_TEXT_Foo.Bar Not mapped Foo-BarOWSTEXT Not mapped Foo.BarOWSTEXT The autogenerated names for managed properties are not “end-user friendly” !
  • 46. A vast improvement compared to the native Advanced Search Web Part
  • 47. » MS Office Documents are first 1st class citizens in SharePoint ˃ When office files are opened in Office 2007, 2010 or 2013, users can perform many SharePoint functions on those documents: + Edit document content + Check in/out/discard + See version history + Edit metadata ˃ Preview Thumbnails in SP 2013 » Most other file types, especially PDF files and scanned images are 2nd class citizens ˃ Read only view of document
  • 48. Files typically open in native apps such as Windows Photo Gallery or Adobe Reader » Users cannot edit metadata » If user rotates, reorders or deletes a page, the changes cannot be saved to SP » User cannot annotate pages (e.g. sticky notes, redactions, etc.)
  • 49. A powerful, low cost PDF and imaging viewer for SharePoint
  • 51. Search for text within a PDF file (just like Adobe Reader/Acrobat)
  • 52. Edit SharePoint metadata within the viewer for PDF documents and scanned images
  • 53. Adds robust image editing features – annotations, reorder, rotate or delete pages, image cleanup
  • 54. » To leverage SharePoint’s native features for transactional document management… ˃ Extensive upfront planning ˃ Complex configuration (many more steps to configure SP compared to most dedicated document management products) » To make the overall user experience in SharePoint comparable with dedicated Document Management products, plan on: ˃ Lots of custom code ... OR … ˃ 3rd party solutions

Editor's Notes

  1. Introduction slide
  2. Transactional DocsUsers need graphical &quot;query builders&quot; that allow them to combine multiple search conditions. Queries should search metadata only, not keywords or content in the document. Queries should return the exact results specified by the user. System should not attempt to &quot;figure out&quot; what the user really wanted, based on ratings or other social algorithms, or removing duplicates, etc. Results displayed in tabular format, with sort default order determined by the user or admin, and allowing ad-hoc sorting by any column.SharePoint DocsUsers find documents using Search. Query Builders is not OOB, but available through 3rd party vendors. Search looks at both metadata and document content. Search tries to be &quot;intelligent&quot; figure out what you really want. Search results are formatted like a Google or Bing search. Search engine decides how to order the results. Results cannot be re-sorted by user. Metadata Navigation supports tablular results with sorting, but is limited in terms of what column types are supported (e.g. single line of text is not supported). However, with large libraries, it only searches through the most recent 1,000 rows or so (fallback queries)
  3. Transactional DocsUsers need graphical &quot;query builders&quot; that allow them to combine multiple search conditions. Queries should search metadata only, not keywords or content in the document. Queries should return the exact results specified by the user. System should not attempt to &quot;figure out&quot; what the user really wanted, based on ratings or other social algorithms, or removing duplicates, etc. Results displayed in tabular format, with sort default order determined by the user or admin, and allowing ad-hoc sorting by any column.SharePoint DocsUsers find documents using Search. Query Builders is not OOB, but available through 3rd party vendors. Search looks at both metadata and document content. Search tries to be &quot;intelligent&quot; figure out what you really want. Search results are formatted like a Google or Bing search. Search engine decides how to order the results. Results cannot be re-sorted by user. Metadata Navigation supports tablular results with sorting, but is limited in terms of what column types are supported (e.g. single line of text is not supported). However, with large libraries, it only searches through the most recent 1,000 rows or so (fallback queries)