Technology Showcase – FAST Search June 29, 2010 Creating Innovative Enterprise Search Applications to enable Business Productivity
Technology Showcase FAST Search Webinar Schedule This Webinar will be presented two times to maximize coverage over time zones worldwide Session 1: June 29, 2010 6:00am Pacific Daylight Time9:00am Eastern Daylight Time2:00pm British Summer Time3:00pm Central European Time Session 2: June 29, 2010 11:00am Pacific Daylight Time1:00pm Eastern Daylight Time6:00pm British Summer Time7:00pm Central European Time
Communities of Practice SharePoint IA Group: http://tech.groups.yahoo.com/group/SharePointIACoP/ Taxonomy Group: http://finance.groups.yahoo.com/group/TaxoCoP Search Group: http://tech.groups.yahoo.com/group/SearchCoP Upcoming Taxonomy Community of Practice Webinars July 7, 2010 Rights Management Taxonomy – A Key to Digital Content Value August 4, 2010 Getting Greater Business Value from Taxonomy Projects
AIIM Information Organization and Access (IOA) Certificate Program by Earley & Associates July 26 - July 29 Chicago, IL Aug 9 - Aug 12 Toronto, Canada Sep 13 - Sep 16 Washington, DC Sep 20 - Sep 23 San Francisco, CA Oct 4 - Oct 7 Philadelphia, Oct 18 - Oct 21 Houston, TX Course details at www.earley.com/training/aiim-courses
Housekeeping Calls last approximately 60 minutes Questions will be taken at the end of the sessionemail your questions to firstname.lastname@example.org During the session send chats with questions to Earley & Associates If you have problems with your audio please contact Adam at 1-719-785-5674
Speakers & Sponsors Nate Treloar Principal Search Evangelist, Microsoft Toby Conrad Senior Consultant, Smartlogic Seth Earley President/CEO, Earley & Associates, Inc.
Earley & Associates Highlights Founded 1994 Focus Areas Holistic approach to specific business contexts and goals for: • Retail • Manufacturing • Public Sector • Pharmaceuticals & Life Sciences • Media & Entertainment Personnel Core team of 25 consultants Locations Concord, MA headquarters, consultants in US, UK & Canada, global projects Services • Taxonomy & Information Architecture • Search Strategy for Enterprise & Web • ECM, DAM & Information Lifecycle • Program Management & Governance
About the Series Earley & Associates has been sponsoring call series for the past 5 years on topics around content management, search, taxonomy, information architecture The Technology Showcase Series arose from requests by customers to explore tools and technologies that help bridge the gap between strategy and tactics Today’s call is sponsored by: Smartlogic, our partner in taxonomy management and integration technologies Microsoft FAST, provider of search application infrastructure
Session Themes We need to redefine our understanding and approach to search Search is an application and needs to be integrated into the user experience, not just bolted on or considered an afterthought Search can be made much more effective by leveraging metadata and taxonomies
The Challenge of Search Five basics truths about search
Search as Utility “search as a utility has become deeply ingrained into people's everyday lives.“ – Study by Nielsen/Net Ratings “search software, hardware, and support bundle or search appliance has become very popular since being introduced in early 2002" – Goebel Group These are misleading concepts. Search is used as a utility, but contexts vary so widely that “plugging search in” does not always produce satisfactory results.
Search is Heterogeneous Data Sources Business Intelligence Document repositories Intranets/web pages Customer Relationship Mgt Custom databases and applications Search/Classification/Taxonomy Integration Framework Auto categorization/ Clustering Entity Extraction Faceted Search Semantic Search Appliances Federated Search Search Mechanisms
Truth #1: We have to change our definition of search. Search is no longer just a white box. Search is an experience. Search is about information access & capabilities.
Truth #2: Search algorithms are getting better, but they cannot infer human context and intent. A search engine doesn’t know if I’m an engineer, an attorney, or a high school student. Perspective has an impact on whether a set of search results are useful and appropriate.
Truth #3: Taxonomy, metadata and information architecture are key aspects of search. Search is fundamentally about metadata Some content is structured, some isn’t and needs help Advanced search functionalities require taxonomy
Truth #4: Search is increasingly looking like navigation. What happens when you click on a link? Guided navigation and faceted search are really the same thing.
Truth #5: Search is messy. Knowledge is messy, information is messy. People find answers through haphazard and chaotic processes.
“…search terms are short, ambiguous and an approximation of the searcher’s real information need…” Source: http://research.microsoft.com/~ryenw/papers/WhiteCONTEXT2002.pdf Ryen W. White, Joemon M. Jose and Ian Ruthven
Enterprise Search Matters Create experiences that combine the magic of software with the power of Internet services across a world of devices Search is the key to engaginginformation experiences DESKTOP ENTERPRISE ONLINE DEVICES
Enterprise Search is Transforming Business Connecting people to information, driving better outcomes Search helps your customers get what they want Search helps your employees get their jobs done increasing revenue cutting costs
…and a common platform Solutions for Business Productivity Solutions for Internet Sites
SharePoint 2010Next Generation Platform Sites Connect and empower people Communities Composites Cut costs with a common infrastructure Content Insights Rapidly respond to business needs Search
Today’s Choices Force Compromise Out-of-the-Box Products High-end Products OR easy to manage highly capable but but limited in capability hard to manage, expensive
Enterprise Search from Microsoft Best of SharePoint Best of High-end Best of Microsoft Our Vision: A New Choice in Search
FAST Search for SharePoint User Interface Out-of-the-box features and controls Deep Refinement People Search Thumbnails Sorting Similar Results Federation Previews
FAST Content Enrichment “Pipeline" Sequential stages perform specific tasks while ingesting content Breaks down content to the smallest addressable chunks to build meaning Understands file encoding, data formats, and written languages Supports 400+ file formats, 80+ languages Process your content to make it searchable Normalizes content so that a consistent relevancy model can be applied Identifies structured and unstructured metadata in your content Maps document metadata to SharePoint Crawled Properties A systematic approach to interpreting your content
Search Driven Applications Meet all the search application needs you have across your business Sales: 360o Customer Insight “How do I support theunique search needs of teams and work that impact our business?” Analytics: Risk Management Marketing: Competitive Intelligence To do so, you need a search platform that has
How do you catalogue your content? Subject Creation Date Modified Date Location Author Project Function (IT,HR,Finance) Format (PDF,DOC,XLS) Site Expert Retention Expiry Protective Marker Publisher
How do I structure it? Information Subject Creation Date Modified Date Location Author Project Function (IT,HR,Finance) Format (PDF,DOC,XLS) Site Expert Retention Expiry Protective Marker Publisher Structural Process
Information Seeking Behaviour Subject Creation Date Modified Date Location Author Project Function (IT,HR,Finance) Format (PDF,DOC,XLS) Expert Protective Marker Retention/ Expiry Publisher Department
Doing it with Fast Ontology Lifecycle Management Search and Navigation Enhancement Classification and Text Mining User Requests Corpus Portal FAST ESP Server Search Application Framework Sample Interface Code Indexing Pipeline Document Processor Query Pipeline Index
Doing it with SharePoint Ontology Lifecycle Management Search and Navigation Enhancement Classification and Text Mining Microsoft Office SharePoint Server Crawler MOSS Search Web Application Mapped Property Search Web Part Semaphore Solution (WSP) User Requests Document Library Publishing Site
...and as part of the SharePoint load Migration Engine Open Semantic Platform
In summary Classifying your content tells you exactly what you have, where to locate it and what to use it for It is valuable in managing, re-purposing and finding content – and most of all in supporting key processes With text mining, content classification, off-the-shelf integration and UI frameworks it is now easy enough to be main stream
Seth Earley, Earley & AssociatesMobilizing a Semantic Classification Project
Mobilizing for Semantic Classification Define the information management problem you are attempting to solve Not enough to say “make the information easier to use” Focus on a specific process, user type, audience, context, information source For example – Proposal development process for sales organization in North America Enroll the appropriate business stakeholders, executive sponsor, content or repository owners Quantify business impact of the problem plan
Mobilizing for Semantic Classification Ask where and how taxonomy and information architecture can improve the user experience Consider overall site organizing principles as well as search functionality Develop both active and passive personalization to surface content to users in the context of their task plan
Leveraging Taxonomy to Improve Findability Faceted Search Navigational Search via Topic Taxonomy Scoped Search
Audience Analysis & IM Lifecycle Automation User Roles & Groups Personalization Design User Scenarios Workflow Design
Possible Approaches Option 1 - Small scale technology proof of concept Option 2 – More detailed current state assessment of content, processes and organizing principles Option 3 – Full information strategy and architecture development Determine the correct overall approach through a Semaphore Planning Engagement
Semaphore Planning Engagement Review your findability objectives and the overall context in which Semaphore will be deployed (technology, organization, processes, and representative assets) Plan and configure a proof of concept that helps you assess the potential of Semaphore Demonstrating how it will improve findability Evaluating the potential of automated taxonomy creation and tagging given a selection of representative documents or other digital assets Document results and identify opportunities and challenges to succeeding with a Semaphore deployment in your organization Collaborate to develop a roadmap for meeting your findability objectives Planning out technology integration, taxonomy development, knowledge transfer, sustainable governance and other aspects of a successful initiative
Answering Key Questions for Initiative Success
Is our current taxonomy sufficient to meet our findability objectives?
If not, where do we need to expand to more facets for search and navigation? Do we need to custom craft high-value areas of taxonomy to support our core business operations?
Does our taxonomy have the rich synonyms and relationships required to ensure automated tagging?
If not, will automated tagging do a good job with the metadata present in our unstructured content? Can we “extract” semantic metadata using Semaphore’s ability to text mine and generate an ontology? plan
Answering Key Questions for Initiative Success
Do we need to improve our governance model to maintain a deep and broad taxonomy over time?
Can we leverage our existing governance model for data management or do we need a different approach?
What are the skills and resources needed to successfully maintain a Semaphore solution over time?
How much training will our organization need? Given our environment and culture what is the best way to develop the skills we need? How will we leverage semantic classification to further improve findability and reuse in our IT initiatives? plan
In Summary - Creating Innovative Enterprise Search Applications Design the user experience of information access, not simply ‘white box’ search Think of search throughout the design of your application, not as an afterthought Allow the user to make choices that will “disambiguate” their searches – help them think more precisely Carefully evaluate your environment – people, process, content, tools – in order to improve the search experience