Drilling Down to the Challenges of SharePoint Taxonomy Implementation
By Joe Shepley and Marjorie M.K. Hlava
What You’ll Learn
• At the end of this webinar, you’ll better understand
  – The problems caused by having a poor taxonomy for SharePoint
  – The benefits of having an effective taxonomy for SharePoint
  – How to create a taxonomy in SharePoint
  – How “partner” technologies can improve the taxonomy creation and management process in SharePoint
SharePoint 2010 Capabilities
• Collaboration
• Enterprise content management
• Business process and forms
• Portal
• Taxonomy
• Business intelligence
• Enterprise search
SharePoint 2010 Capabilities (second view of the same diagram)
• Collaboration, Enterprise content management, Business process and forms, Portal, Business intelligence, Enterprise search
• TAXONOMY, set apart from the other capabilities
SharePoint Has Many Potential Benefits
• User Experience
  – Simple, familiar, and consistent user experience
  – Single, integrated platform for intranet, extranet, and Internet across the enterprise
• Process Management
  – Boosts employee productivity by simplifying everyday business activities
  – Accelerates shared business processes across organizational boundaries
• Content Management
  – Manage content to gain increased business value
  – Organization-wide access to structured and unstructured data across disparate systems
• Information Lifecycle Management
  – Manage content retention across user groups
  – Share business data securely
  – Enforce regulatory requirements
• Knowledge Management
  – Connects people with information and expertise
  – Enables people to make better-informed decisions by presenting business-critical information in one central location
If You Can Implement It Correctly
• Ease of deployment
• Ability to grow organically
• Lack of IA experience
• Focus on installation
• Thousands of sites, most unknown to SharePoint administrators
• Terabytes of unnecessary content
• No rhyme or reason to site and site collection structure
• No consistent use of metadata…if used at all
The Results of Poor SharePoint Taxonomy
The result is a tangle of SharePoint sites, with poorly organized content at every level, which renders the SharePoint environment little better than traditional shared drives.
The Results of Poor SharePoint Taxonomy
• In fact, in many ways a SharePoint environment without an IA (or with a poorly designed one) is worse than shared drives
  – Higher storage volumes (multiple copies of a document, each under version control)
  – Higher per-user costs (licenses are needed to use SharePoint)
  – Higher maintenance (DBAs, SharePoint developers, and admins are needed to care for and feed SharePoint)
Challenges to Building a SharePoint Taxonomy
• Even when a SharePoint implementation is planned, taxonomy typically gets eclipsed by the “nuts and bolts” activities required to stand up the environment, like network architecture
  – Tight schedule, budget constraints
• Lack of experience with taxonomy at most organizations means it gets low (or no) priority during SharePoint design and implementation
  – Often no one owns taxonomy at the organization
  – Few people outside of web design have heard of it
  – Fewer have ever had any direct experience with it
  – Taxonomy work may never have been done at any time, in any part of the organization
How Does a Taxonomy Help SharePoint?
• Search
  – Consistent retrieval
  – Precision – exactly what the user wants
  – Recall – all of the relevant material
• Collaboration
  – Discover colleagues working in same areas
  – Stimulate collaboration
  – Productive interactions
• User Engagement
  – Encourages return visits
  – Saves people time
  – Promotes sales and use of site
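Precision and recall, as named on the slide above, can be made concrete with a small calculation. The following is a minimal sketch using the standard definitions; the document IDs are hypothetical and only illustrate the arithmetic.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one search.

    precision = relevant results returned / all results returned
    recall    = relevant results returned / all relevant documents
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = retrieved & relevant
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical example: the search returned four documents,
# while three documents in the library were actually relevant.
retrieved = ["doc1", "doc2", "doc3", "doc4"]
relevant = ["doc2", "doc3", "doc5"]
p, r = precision_recall(retrieved, relevant)
print(f"precision={p:.2f} recall={r:.2f}")
```

In this example two of the four returned documents are relevant (precision 0.50), and two of the three relevant documents were found (recall about 0.67).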
SharePoint 2010 Metadata Management
• Create taxonomy lists in the Term Store
• Use the taxonomy for assisted indexing
  – Type-ahead suggestion for indexing content
  – Use synonyms to represent multiple ways to express a single subject
  – Improves precision and recall for indexing
• Import preexisting taxonomies from a CSV
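The interplay of type-ahead and synonyms can be sketched outside SharePoint. This is an illustrative model only, not SharePoint's implementation: the term set, its labels, and the function name `suggest` are all hypothetical. The point it shows is that typing a synonym still leads the indexer to the single preferred term.

```python
# Hypothetical term set: preferred label -> synonyms (non-preferred labels).
TERM_SET = {
    "Human resources": ["HR", "Personnel"],
    "Health insurance": ["Medical coverage"],
    "Purchasing": ["Procurement", "Buying"],
}


def suggest(prefix, term_set=TERM_SET):
    """Return preferred terms whose label, or any synonym, starts with prefix."""
    needle = prefix.strip().lower()
    if not needle:
        return []
    matches = set()
    for preferred, synonyms in term_set.items():
        for label in [preferred, *synonyms]:
            if label.lower().startswith(needle):
                # Always offer the preferred term, even if a synonym matched.
                matches.add(preferred)
                break
    return sorted(matches)


print(suggest("proc"))  # a synonym prefix resolves to the preferred term
print(suggest("h"))
```

Because every synonym resolves to one preferred label, documents tagged by different people end up under the same term, which is where the precision and recall gain comes from.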
Term Sets
• Select Term store management, located under Site Administration
• Edit Term Sets to accurately reflect your document libraries and content types
• Term sets can be individual taxonomies or flat controlled vocabulary lists
SharePoint Server 2010 Capabilities
• Some of the features of Windows SharePoint Services are used directly by Office SharePoint Server 2010
  – List management
  – Storage capabilities
  – Web Part framework
Features of SharePoint Server 2010
• Features highlighted in Microsoft Office SharePoint Server (MOSS) 2010
  – Search (FAST ESP)
  – Document management
  – Enterprise content management
  – Business process automation and workflows
  – Taxonomy and metadata management
Managing Site Content
• Create document libraries to reflect the different content types used in all departments
• Add metadata
  – Author
  – Locations
  – File extension
  – Date added
  – Subject and indexing terms
  – Company code
  – Other metadata (Dublin Core)
• Add retention: choose when the server deletes the content, or updates it
SharePoint Needs
• Metadata on every document
• Relevant search
• Related content alerts
• Automatically aggregated content
• Many use cases
• Simple tagging
  – Authors
  – Staff
  – As uploaded
• Automatic security and retention for content types
Taxonomy in SharePoint Allows
• Browse by terms
• Search documents
• Limit search by facets
• Update terms
• Reindex documents
• Automatic and assisted indexing methods
• Facilitate document retention
• Document security by user and document type
• Allow for the ability to use tagging – view and select
• Integrate seamlessly with SharePoint 2010
• Integrate with other CMS (Ektron, Drupal, etc.)
Why Add a Partner to SharePoint?
• Use the taxonomy in multiple systems
• Manage, audit, and govern the taxonomy
• Identify and extract information from documents
• Tag legacy data automatically
• Bulk-add metadata by populating site columns with reference to taxonomies
Taxonomy Fully Integrated with MOSS
[Architecture diagram. Recoverable labels: Client Data (HTML, PDF, Data Feeds, etc.); Machine Aided Indexer (M.A.I.™), 90% accuracy; Automatic Summarization; Metadata and Entity Extractor; Client Taxonomy; Thesaurus Master; SharePoint Server Repository; Search Software; Presentation: Full Text Search, Inline Tagging, Browse by Subject, Auto-completion, Broader Terms, Narrower Terms, Related Terms]
Adding Terms to the Taxonomy
• Suggest new (unused) terms for content after bulk import
• Use the folksonomy features of SharePoint
• Use the search logs
• Could also use Novelty Detection
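Mining the search logs for candidate terms can be sketched as a simple frequency count. This is one possible approach under stated assumptions, not a feature of SharePoint or Data Harmony: the log is assumed to be a plain list of query strings, and `candidate_terms` and `min_count` are hypothetical names.

```python
from collections import Counter


def candidate_terms(search_log, known_labels, min_count=2):
    """Return (query, count) pairs for frequent queries missing from the taxonomy.

    known_labels should hold every preferred term and synonym already in use.
    """
    known = {label.lower() for label in known_labels}
    counts = Counter(q.strip().lower() for q in search_log if q.strip())
    return [
        (query, n)
        for query, n in counts.most_common()
        if n >= min_count and query not in known
    ]


# Hypothetical search log and existing vocabulary.
log = ["expense report", "HR", "expense report", "travel policy",
       "Travel Policy", "hr", "onboarding"]
known = ["HR", "Human resources"]
suggestions = candidate_terms(log, known)
print(suggestions)
```

Queries that users repeat but that match no existing label are reasonable candidates for a taxonomist to review; the threshold keeps one-off searches out of the queue.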
Taxonomy in Functions
• Equivalent terms / synonyms / non-preferred terms
• Associative relationships / related terms
• Easy updating and modification of terms
Screenshot callouts: Associative relationships (Related terms). Equivalent relationships (Synonyms/Preferred and non-preferred terms).
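The two relationship types on this slide can be modeled with a small data structure. This is a minimal sketch with hypothetical class and method names; it is not the Data Harmony or SharePoint data model. Equivalence maps synonyms to one preferred term, and associative links are stored symmetrically.

```python
from dataclasses import dataclass, field


@dataclass
class Term:
    label: str                                   # preferred term
    synonyms: set = field(default_factory=set)   # equivalent, non-preferred terms
    related: set = field(default_factory=set)    # associative relationships


class Thesaurus:
    def __init__(self):
        self.terms = {}

    def add(self, label, synonyms=()):
        """Create the term if needed and attach any synonyms to it."""
        term = self.terms.setdefault(label, Term(label))
        term.synonyms.update(synonyms)
        return term

    def relate(self, a, b):
        """Record an associative relationship; it holds in both directions."""
        self.add(a).related.add(b)
        self.add(b).related.add(a)

    def preferred(self, label):
        """Resolve a preferred or non-preferred label to the preferred term."""
        wanted = label.lower()
        for term in self.terms.values():
            if wanted == term.label.lower() or wanted in {s.lower() for s in term.synonyms}:
                return term.label
        return None


thesaurus = Thesaurus()
thesaurus.add("Automobiles", synonyms=["Cars", "Autos"])
thesaurus.relate("Automobiles", "Highways")
print(thesaurus.preferred("cars"))
print(thesaurus.terms["Highways"].related)
```

Updating a term then means changing one record, with every synonym and related-term link following along.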
Adding the Taxonomy
[Diagram: Core Architectural Components. Recoverable labels: content sources (Web Content; Files, Documents; Databases; Email, Groupware; Custom Applications); connectors (Web Crawler, File Traverser, Database Connector, Email Connector, Custom Connector, Content Push); Content API; Document Processor pipeline; Index DB; Search Server; Query API; Query Processor and query pipeline; Results; Filter Server; Alerts; Agent DB; front ends (Vertical Applications, Portals, Custom Front-Ends, Mobile Devices); FAST Management API; Administrator’s Dashboard; Data Harmony additions (Search Harmony, MAIstro, Data Harmony Governance API)]
Role of Staff
• Project coordination
  – Sample data
  – Copy of thesaurus
• Update and maintain thesaurus
• Take training
• Decide who will do the indexing
  – Only staff
  – Everyone
• SharePoint Server admin will install
Incorporating Taxonomy into SharePoint
• Add an EventHandler to the document library
• After a user uploads a file, the EventHandler sends the file content to the Data Harmony server
• The Data Harmony server creates metadata by adding suggested terms from M.A.I.
• SharePoint updates the metadata fields
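The four steps above can be simulated in a few lines to show the shape of the flow. This is a toy model only: a real SharePoint event handler is .NET code, and the slides do not show the Data Harmony API, so the class `DocumentLibrary`, the functions `on_item_added` and `suggest_terms`, the `Subject` field name, and the rule base are all hypothetical stand-ins.

```python
# Hypothetical rule base standing in for the indexing server:
# trigger phrase -> taxonomy term.
RULES = {
    "invoice": "Accounts payable",
    "purchase order": "Purchasing",
    "benefits": "Human resources",
}


def suggest_terms(text, rules=RULES):
    """Stand-in for the indexing server: terms whose trigger phrase occurs in text."""
    lowered = text.lower()
    return sorted({term for phrase, term in rules.items() if phrase in lowered})


class DocumentLibrary:
    """Toy document library that calls registered handlers after each upload."""

    def __init__(self):
        self.items = {}
        self.handlers = []

    def upload(self, name, text):
        self.items[name] = {"text": text, "metadata": {}}
        for handler in self.handlers:
            handler(self, name)


def on_item_added(library, name):
    """Event handler: send the content for indexing, then write the terms back."""
    item = library.items[name]
    item["metadata"]["Subject"] = suggest_terms(item["text"])


library = DocumentLibrary()
library.handlers.append(on_item_added)
library.upload("po-1042.txt", "Purchase order and invoice for office chairs")
print(library.items["po-1042.txt"]["metadata"])
```

The design point is that tagging happens as a side effect of the upload, so users get subject metadata without an extra step.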
Machine-Aided Indexing (M.A.I.)
• Automatically populate keywords, descriptors, indexing terms, etc.
• Allow for manual review of auto-tagging for quality assurance
Automated Indexing for SharePoint
• User adds a document to the SharePoint space and attaches indexing terms to the document
• A new version is saved on the SharePoint 2010 server with the edited properties
• Batch upload documentation to SharePoint
Taxonomy Management
• Export an existing taxonomy into a CSV
• Import the new taxonomy as a Term set into SharePoint Term store management
• Use the taxonomy for assisted searches and indexing
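The export step can be sketched as flattening a hierarchy into rows, one row per term with its full path. The column layout below is an assumption based on the sample import file that SharePoint 2010 Term Store Management offers for download (term set columns followed by Level 1 through Level 7 term columns); verify it against that sample before importing. The taxonomy, the term set name, and the function names are hypothetical.

```python
import csv
import io

# Assumed header, modeled on the SharePoint 2010 sample import file.
HEADER = ["Term Set Name", "Term Set Description", "LCID",
          "Available for Tagging", "Term Description"] + [
    f"Level {i} Term" for i in range(1, 8)]


def walk(tree, path=()):
    """Yield the full path to every term in a nested dict, parents first."""
    for label, children in tree.items():
        current = path + (label,)
        yield current
        yield from walk(children, current)


def term_set_rows(name, description, tree):
    """Build header plus one row per term; the set name appears on the first row only."""
    rows = [HEADER]
    for i, path in enumerate(walk(tree)):
        if len(path) > 7:
            raise ValueError(f"Term is nested deeper than 7 levels: {path}")
        lead = [name, description] if i == 0 else ["", ""]
        levels = list(path) + [""] * (7 - len(path))
        rows.append(lead + ["", "TRUE", ""] + levels)
    return rows


def to_csv(rows):
    """Serialize rows as quoted CSV text."""
    buffer = io.StringIO()
    csv.writer(buffer, quoting=csv.QUOTE_ALL, lineterminator="\n").writerows(rows)
    return buffer.getvalue()


# Hypothetical taxonomy: nested dicts, empty dict = no narrower terms.
taxonomy = {"Finance": {"Accounts payable": {}, "Payroll": {}}, "Legal": {}}
rows = term_set_rows("Departments", "Hypothetical sample term set", taxonomy)
csv_text = to_csv(rows)
print(csv_text)
```

Each child row repeats its parent labels in the earlier level columns, which is how a flat file carries the hierarchy.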
Data Harmony Sample
[Screenshot: a sample taxonomy exported directly from Data Harmony]
Data Harmony Sample
• Create and name a Column for adding metadata
• Select the Managed Metadata radio button to add a Term set or taxonomy
Managed Metadata
• Importing a taxonomy enhances the way users can manually add indexing terms
  – Inclusion of synonyms
  – Type-ahead for searching and adding metadata
  – Browsing the hierarchy for indexing terms
Data Harmony & MOSS
• User uploads a document to the SharePoint space
• Before uploading to the SharePoint server, the EventHandler sends the document to Data Harmony
• Data Harmony automatically attaches indexing terms before uploading to MOSS
[Diagram labels: Data Harmony Server (M.A.I.); Returns subject metadata; Microsoft SharePoint Server 2010]
About Doculabs
Doculabs consultants are experts in enterprise social collaboration and content management. We deliver highly actionable and comprehensive strategic plans and road maps that help our clients achieve their business goals, create competitive advantage, and reduce risk. Our services help organizations govern information for the benefit of internal and external constituents through enhanced customer communications, e-discovery, and collaboration processes.
Quick Facts
• Founded in 1993
• Headquartered in Chicago
• Privately held
• Delivered more than 1000 engagements to more than 500 customers
About Access Innovations
Access Innovations are experts in content creation, enrichment, and conversion services. We provide services to semantically enrich and tag raw text into highly structured data. We deliver clean, well-formed, metadata-enriched data so our clients can reuse, repurpose, store, and find their knowledge assets. We go beyond the standards to build taxonomies and other data control structures as a solid foundation for data. Our services and software allow organizations to use and present their information to both internal and external constituents by leveraging search, presentation, and e-commerce. We change search to found!
Quick Facts
• Founded in 1978
• Headquartered in Albuquerque
• Privately held
• Delivered more than 2000 engagements
Questions?
• Marjorie M.K. Hlava, Access Innovations, Inc., firstname.lastname@example.org, (505) 998-0800
• Joe Shepley, Doculabs, Inc., email@example.com, (773) 827-2945, http://flavors.me/jshepley