Best Practices for Architecting Taxonomy and Metadata in an Open Source Environment Webinar


In this webinar, Best Practices for Architecting Taxonomy and Metadata in an Open Source Environment, Enterprise Knowledge and Concept Searching will discuss factors and best practices for integrating third-party solutions into open source environments.

Open source solutions are now mainstream and are being evaluated by organizations of all sizes in all industries. Some of the advantages include removing vendor lock-in, achieving lower total cost of ownership, and leveraging open source’s greater flexibility and commitment to interoperability.

Regardless of environment, organizations need to evaluate the integration of open source solutions and the ability to simultaneously develop taxonomies and an enterprise metadata environment that work seamlessly together.

What you will take away from this session:
• How Concept Searching has been deployed in open source environments to provide the taxonomy platform component to improve search outcomes and the end user experience
• How to integrate taxonomies and comprehensive metadata schemas into your open source environment
• How to provide a user interface for selecting taxonomies and setting up content tags
• How to design, import, and implement taxonomies in open source environments
• How to improve your search results using taxonomy and automatic semantic metadata generation

Zach Wahl, President and Chief Executive Officer at Enterprise Knowledge
Don Miller, Vice President of Sales at Concept Searching

Published in: Technology, Education

Best Practices for Architecting Taxonomy and Metadata in an Open Source Environment Webinar

  1. Best Practices for Architecting Taxonomy and Metadata in an Open Source Environment
     • Zach Wahl, President and Chief Executive Officer, Enterprise Knowledge (Twitter: @EKConsulting)
     • Don Miller, Vice President of Sales, Concept Searching (Twitter: @conceptsearch)
  2. Expert Speakers
     • Zach Wahl, President and Chief Executive Officer at Enterprise Knowledge, has over 15 years’ experience leading programs in knowledge and information management, working with more than 200 public and private organizations to successfully design and implement information management systems. He has developed his own taxonomy design methodology, has authored courses on knowledge management, and is a frequent speaker and trainer.
     • Don Miller, Vice President of Sales at Concept Searching, has over 20 years’ experience in knowledge management. He is a frequent speaker on records management and information architecture challenges and solutions, and has been a guest speaker at Taxonomy Boot Camp and at numerous SharePoint events about information organization and records management.
  3. Agenda
     • Enterprise Knowledge
         • Introduction to Business Taxonomy for Open Source
         • Open Source Challenges and Considerations
         • Design Best Practices
         • Taxonomy in Action
     • Concept Searching
         • Unique Approach
         • Considerations
         • Use Case
         • Demonstration
     • Next Steps
  4. The Global Leader in Managed Metadata Solutions
     • Company founded in 2002
     • Product launched in 2003
     • Focus on management of structured and unstructured information
     • Technology Platform
         • Delivered as a web service
         • Automatic concept identification, content tagging, auto-classification, taxonomy management
         • Only statistical vendor that can extract conceptual metadata
     • 2009, 2010, 2011, 2012, 2013 ‘100 Companies that Matter in KM’ (KMWorld) and Trend Setting product of 2009, 2010, 2011, 2012, 2013
     • Authority to Operate enterprise wide US Air Force and enterprise wide NETCON US Army
     • Locations: US, UK, and South Africa
     • Client base: Fortune 500/1000 organizations
     • Managed Partner under Microsoft global ISV Program - ‘go to partner’ for Microsoft for auto-classification and taxonomy management
     • Smart Content Framework for Information Governance comprising
         • Six Building Blocks for success
     • Product Suite: conceptSearch, conceptTaxonomyManager, conceptClassifier, conceptClassifier for SharePoint, conceptTaxonomyWorkflow, conceptContentTypeUpdater for SharePoint
  5. Enterprise Knowledge: Dedicated to Making Your Information Work for You
     • Principals bring over 15 years of taxonomy design consulting with support for over 200 organizations globally.
     • Twitter: @EKConsulting
     • Blog:
     • Core services include:
         • Knowledge Management and Taxonomy
         • Enterprise Search
         • Application Development
         • Agile Consulting and Project Management
  6. Taxonomy Definitions
     tax·on·o·my (tāk-sōn-mē) n. pl. tax·on·o·mies
     1. The classification of organisms in an ordered system that indicates natural relationships.
     2. The science, laws, or principles of classification; systematics.
     3. Division into ordered groups or categories: "Scholars have been laboring to develop a taxonomy of young killers" (Aric Press).
     Zach’s Definition: Controlled vocabularies used to describe or characterize explicit concepts of information, for purposes of capture, management, and presentation.
  7. Taxonomy and Metadata
     • Provide structure to unstructured information
     • Join or relate multiple disparate sources of information
     • Provide multiple avenues to find and discover information
     • Enable findability
  8. Taxonomy and Metadata (diagram labels)
     • Free Text Entry
     • Metadata “Card” fields: Title, Author, Doc Type, Topic, Department
     • Values shown: Brochures & Manuals, Memos, News, Policies & Procedures, Presentations, Reports
     • Values shown: Employee Services, Compensation, Retirement, Insurance, Education & Training
     • Values shown: Manufacturing, Safety, Quality
  9. Taxonomy and Metadata (diagram labels)
     • Content ~ Information ~ Data ~ Files
     • Metadata Fields
     • Metadata Values/Tags
     • Taxonomies (Flat or Hierarchical) ~ Controlled Vocabularies
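The layering on this slide (content, metadata fields, metadata values, and the controlled vocabularies those values come from) can be sketched in a few lines of Python. This is only an illustration, not something shown in the webinar: the `ContentItem` class and `VOCABULARIES` mapping are hypothetical names, and the assignment of example values to the "Doc Type" and "Topic" fields is an assumption based on the preceding slide.

```python
from dataclasses import dataclass, field

# Controlled vocabularies keyed by metadata field name.
# Assumption: which value belongs to which field is inferred, not stated in the slides.
VOCABULARIES = {
    "Doc Type": {"Memos", "News", "Presentations", "Reports"},
    "Topic": {"Compensation", "Retirement", "Insurance", "Safety", "Quality"},
}


@dataclass
class ContentItem:
    title: str   # free-text metadata field
    author: str  # free-text metadata field
    tags: dict = field(default_factory=dict)  # field name -> set of controlled values

    def tag(self, field_name: str, value: str) -> None:
        """Attach a value to a metadata field, rejecting values outside the vocabulary."""
        vocabulary = VOCABULARIES.get(field_name)
        if vocabulary is None:
            raise KeyError(f"unknown metadata field: {field_name}")
        if value not in vocabulary:
            raise ValueError(f"{value!r} is not in the {field_name} vocabulary")
        self.tags.setdefault(field_name, set()).add(value)


item = ContentItem(title="Q3 safety review", author="J. Doe")
item.tag("Doc Type", "Reports")
item.tag("Topic", "Safety")
```

Free-text fields accept anything, while tagged fields only accept values from their vocabulary, which is what keeps tagging consistent enough to drive search and navigation.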
  10. Traditional v. Business Taxonomies
     • Purpose: Traditional = Categorization; Business = Findability
     • Designed By: Traditional = Scientists/Librarians; Business = The Business
     • Managed By: Traditional = Scientists/Librarians; Business = The Business
     • Used By: Traditional = Scientists/Librarians; Business = Everyone
     • Complexity: Traditional = Deep, Wide, Detailed; Business = Flat, Simple, Deconstructed
     • Key Characteristics: Traditional = Mutually Exclusive, Collectively Exhaustive; Business = Usable, Intuitive, Natural
  11. The Business Taxonomy
     • Usable: Easy to adopt and utilize for any skill level
         • Relatively flat (2-3 levels)
         • “Easy” to navigate
     • Intuitive: Does not require training and reflects the way the user thinks
     • Natural: Uses the organization, vocabulary, and logic of the user
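The "relatively flat (2-3 levels)" guideline is easy to check automatically. The following is a minimal sketch, assuming the taxonomy is held as nested dictionaries; the `max_depth` helper, the `MAX_LEVELS` constant, and the grouping of terms under "Employee Services" and "Manufacturing" are illustrative assumptions rather than anything specified in the slides.

```python
def max_depth(node: dict) -> int:
    """Return the number of term levels in a nested-dict taxonomy."""
    if not node:
        return 0
    return 1 + max(max_depth(child) for child in node.values())


# Hypothetical two-level topic taxonomy (the hierarchy is an assumption).
taxonomy = {
    "Employee Services": {"Compensation": {}, "Retirement": {}, "Insurance": {}},
    "Manufacturing": {"Safety": {}, "Quality": {}},
}

MAX_LEVELS = 3  # upper bound of the 2-3 level guideline
depth = max_depth(taxonomy)
is_flat_enough = depth <= MAX_LEVELS
```

A check like this could run whenever the taxonomy is edited, so that a deep branch is flagged for review before it reaches publishers and end users.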
  12. The Business Taxonomy
     • Tend to be less rigid and constrained
     • Influenced by “traditional” usability design
     • Driven by the content and needs you have today
     • Leverages multiple categorization approaches (via multiple metadata fields and multiple taxonomies)
     • Accepts imperfect categorization
  13. Open Source Challenges and Considerations
     • Open Source is “free” and “easy”
         • But taxonomy isn’t…
     • There are multiple ways to use taxonomy
         • Menus, Search, Tag Clouds, Page Tags
     • Taxonomy design is not enough; you need to plan for taxonomy implementation and exposure
     • Open Source tools like Drupal favor “flat” taxonomies
     • Faceting is easy to enable but requires diligent tagging and oversight
  14. Taxonomy Design for Open Source: Best Practices
     • Define taxonomy purpose, audience, and use cases up front. Design before you build.
     • Follow usability design best practices (limit depth and breadth, use plain language, etc.). Flat lists work best in Open Source content management tools.
     • Leverage a primary category/topic taxonomy with supporting metadata fields. For instance, in Drupal, use multiple Lists with Views to enable faceting.
     • Design for your end users and publishers.
     • Employ analytics and support iterative design.
     • Plan for the long term: ensure governance plans are in place before content migration and rollout.
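To make the idea of a primary topic taxonomy plus supporting metadata fields concrete, here is a rough sketch of faceted filtering in plain Python. It does not use Drupal's Views or any Drupal API; the sample items and the `filter_items` and `facet_counts` helpers are hypothetical and exist only to show why consistent tagging matters for faceting.

```python
from collections import Counter

# Hypothetical content items tagged with a primary Topic and a supporting Doc Type.
items = [
    {"title": "Safety manual", "Topic": "Safety", "Doc Type": "Brochures & Manuals"},
    {"title": "Retirement plan memo", "Topic": "Retirement", "Doc Type": "Memos"},
    {"title": "Quality report", "Topic": "Quality", "Doc Type": "Reports"},
    {"title": "Safety memo", "Topic": "Safety", "Doc Type": "Memos"},
]


def filter_items(items, selected):
    """Keep items whose metadata matches every selected facet value."""
    return [
        item for item in items
        if all(item.get(name) == value for name, value in selected.items())
    ]


def facet_counts(items, field_name):
    """Count how many items carry each value of one metadata field."""
    return Counter(item[field_name] for item in items if field_name in item)


safety_items = filter_items(items, {"Topic": "Safety"})
remaining_doc_types = facet_counts(safety_items, "Doc Type")
```

An item that is missing a tag simply drops out of the filtered results and the counts, which is the practical reason faceting needs diligent tagging and oversight.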
  15. Taxonomy in Action (Drupal)
  16. Creating a Taxonomy
  17. Associating a Taxonomy to a Content Type
  18. Filtering Using Taxonomy
  19. Unique Approach
     Concept Searching has a unique approach to ensure success
     • Concept Searching’s unique statistical concept identification underpins all technologies
     • Multi-word suggestion is explicitly more valuable than single term suggestion algorithms
     Concept Searching provides Automatic Concept Term Extraction
     (Diagram labels: Triple Heart Bypass; Baseball, Three, Organ, Center, Highway, Avoid)
     • conceptClassifier will generate conceptual metadata by extracting multi-word terms that identify ‘triple heart bypass’ as a concept as opposed to single keywords
     • Metadata can be used by any search engine index or any application/process that uses metadata.
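The difference between multi-word terms and single keywords can be illustrated with a toy example. The sketch below is not Concept Searching's algorithm, which the slides do not describe; it is a simple frequency count of word sequences that are not interrupted by stopwords, and the `STOPWORDS` list, the `multiword_terms` function, and the sample text are all assumptions made for illustration.

```python
import re
from collections import Counter

# Small illustrative stopword list (an assumption, not a standard list).
STOPWORDS = {"a", "an", "the", "of", "and", "after", "had", "is", "was", "to", "in", "on"}


def multiword_terms(text, min_count=2, max_len=3):
    """Return word sequences of 2..max_len words that recur at least min_count times."""
    counts = Counter()
    for sentence in re.split(r"[.!?]", text.lower()):
        run = []  # consecutive non-stopword tokens
        # The trailing None flushes the last run of each sentence.
        for token in re.findall(r"[a-z]+", sentence) + [None]:
            if token is None or token in STOPWORDS:
                for n in range(2, max_len + 1):
                    for i in range(len(run) - n + 1):
                        counts[" ".join(run[i:i + n])] += 1
                run = []
            else:
                run.append(token)
    return [term for term, count in counts.most_common() if count >= min_count]


text = (
    "The patient had a triple heart bypass. "
    "Recovery after a triple heart bypass is slow. "
    "The highway bypass was closed."
)
terms = multiword_terms(text)
```

A single-keyword index would treat "bypass" in the medical and highway sentences as the same thing, whereas the recurring phrase "triple heart bypass" survives as one candidate term that can be stored as metadata.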
  20. Smart Content Framework™: Sum of parts is greater than whole
     • Metadata driven application and enforcement of policies: conceptClassifier has been deployed since 2010 to automatically generate metadata and use that metadata to apply and enforce policies. Many clients are using the platform to support their information governance strategy.
     • Proven, mature functionality out of the box: The platform has been deployed in numerous sites and applications across the enterprise, including MOSS and SharePoint 2010 and 2013, Solr, Stellent, Documentum, SQL, Oracle, File Shares, and Exchange via SharePoint.
  21. Open Source Considerations
     “Given enough eyeballs, all bugs are shallow.” Linus Torvalds, Creator of Linux
     • Security
     • Quality
     • Customizability
     • Freedom (avoid vendor lock-in)
     • Interoperability
     • Auditability
     • Support
     • Cost
     • Try Before You Buy
     Any difference if you are purchasing ‘proprietary’ software? Not much!
  22. Open Source or Proprietary: OK By Us
     • Concept Searching Technology Platform
         • conceptSearch
         • conceptClassifier
         • conceptTaxonomyManager
         • conceptSQL
         • conceptTaxonomyWorkflow
     • conceptClassifier Technology Platform
         • Compound Term Processing Engine
         • Licensed for concept extraction only
         • conceptClassifier
         • conceptTaxonomyManager
         • conceptTaxonomyWorkflow
  23. Use Case: Smart Content Framework™ Building Blocks - Metadata, Insight
     Situation
     • Company is the premier global provider of fee-based market intelligence, advisory services, and events for the information technology, telecommunications, and consumer technology markets
     • Seeking a solution to enhance site visitors’ search experience
     • Potential loss of revenues
     Challenge
     • Complex taxonomy requirements
     • Inability for clients to identify the relevant information they were seeking
     Solution
     • conceptTaxonomyManager and conceptClassifier
     • Solr
     • Integrated in-house
     Benefits
     • Improved search results
     • Increased accuracy and relevant retrieval of information for external clients and site visitors
     “Automation is great, but still needs a human eye to gain that last bit of ground. Anyway, it's a great story and I'm still very happy with Concept Searching and the flexibility it gives us.” Director, Enterprise Solutions Planning
  24. Concept Searching Demonstration
  25. What’s the End Result?
     • Technology from Concept Searching complements Enterprise Knowledge’s strategic and tactical planning experience and expertise in architecting solutions that improve business processes.
     • Utilizing Concept Searching’s Smart Content Framework™ and intelligent metadata-enabled solutions, this partnership addresses key challenges in enterprise search, records management, data privacy, migration, and content management in secure and complex environments.
     For a comprehensive demo of the combined solution and discussion of expected ROI, please contact Don Miller at Concept Searching or Zach Wahl at Enterprise Knowledge.
  26. Thank You
     • Zach Wahl, President and Chief Executive Officer, Enterprise Knowledge (Twitter: @EKConsulting)
     • Don Miller, Vice President of Sales, Concept Searching (Twitter: @conceptsearch)