Technology Showcase – FAST Search<br />June 29, 2010<br />Creating Innovative Enterprise Search Applications to enable Bus...
Technology Showcase FAST Search Webinar Schedule<br />This Webinar will be presented two times to maximize coverage over t...
Communities of Practice<br />SharePoint IA Group: http://tech.groups.yahoo.com/group/SharePointIACoP/<br />Taxonomy Group:...
AIIM Information Organization and Access (IOA) Certificate Program by Earley & Associates <br />July 26 - July 29 		Chicag...
Housekeeping<br />Calls last approximately 60 minutes<br />Questions will be taken at the end of the sessionemail your que...
Speakers & Sponsors<br />Nate Treloar<br />Principal Search Evangelist, Microsoft<br />Toby Conrad <br />Senior Consultant...
Earley & Associates Highlights<br />Founded	1994  <br />Focus Areas	Holistic approach to specific 	business contexts and g...
About the Series<br />Earley & Associates has been sponsoring call series for the past 5 years on topics around content ma...
Session Themes<br />We need to redefine our understanding and approach to search<br />Search is an application and needs t...
The Challenge of Search<br />Five basics truths about search<br />
Search as Utility<br />“search as a utility has become deeply ingrained into people's everyday lives.“ – Study by Nielsen/...
Search is Heterogeneous<br />Data Sources<br />Business Intelligence<br />Document repositories<br />Intranets/web pages<b...
Truth #1:<br />We have to change our definition of search.<br />Search is no longer just a white box.<br />Search is an ex...
Truth #2:<br />Search algorithms are getting better, but they cannot infer human context and intent.<br />A search engine ...
Truth #3:<br />Taxonomy, metadata and information architecture are key aspects of search.<br />Search is fundamentally abo...
Truth #4:<br />Search is increasingly looking like navigation.<br />What happens when you click on a link?<br />Guided nav...
Truth #5:<br />Search is messy.<br />Knowledge is messy, information is messy.<br />People find answers through haphazard ...
“…search terms are short, ambiguous and an approximation of the searcher’s real information need…” <br />Source: http://re...
Nate Treloar, Microsoft The Search Experience <br />
Enterprise Search Matters<br />Create experiences that combine the magic of software with the power of Internet services a...
Enterprise Search is Transforming Business<br />Connecting people to information, driving better outcomes<br />Search help...
…and a common platform<br />Solutions for<br />Business Productivity<br />Solutions for<br />Internet Sites<br />
SharePoint 2010Next Generation Platform<br />Sites<br />Connect and empower <br />people<br />Communities<br />Composites<...
Today’s Choices Force Compromise<br />Out-of-the-Box Products<br />High-end Products<br />OR<br />easy to manage<br />high...
Enterprise Search from Microsoft<br />Best of SharePoint<br />Best of High-end<br />Best of Microsoft<br />Our Vision: A N...
FAST Search for SharePoint User Interface<br />Out-of-the-box features and controls<br />Deep Refinement<br />People Searc...
FAST Content Enrichment “Pipeline"<br />Sequential stages perform specific tasks while ingesting content<br />Breaks down ...
Search Driven Applications<br />Meet all the search application needs you have across your business<br />Sales: <br />360o...
Flexible relevance to meet diverse needs
A customizable UX to increase user efficiency</li></ul>Research & Development:<br />Innovation Portal<br />Support:<br />C...
Enterprise Search from Microsoft<br />www.microsoft.com/enterprisesearch/<br />UX<br />IT<br />DX<br />Eliminate compromis...
Building powerful applications using semantic classification<br />Toby Conrad<br />toby.conrad@smartlogic.com<br />+1 773 ...
Introduction<br />Is it useful to know what content you have?<br />Will it help me improve search?<br />Will it deliver po...
Knowing what you have<br />
What about your content?<br />
How do you catalogue your content?<br />Subject<br />Creation Date<br />Modified Date<br />Location<br />Author<br />Proje...
How do I structure it?<br />Information<br />Subject<br />Creation Date<br />Modified Date<br />Location<br />Author<br />...
Information Seeking Behaviour<br />Subject<br />Creation Date<br />Modified Date<br />Location<br />Author<br />Project<br...
SharePoint 2010 Search:Metadata Refinement Panel<br />
What is the benefit?<br />Discovery  <br />Precision <br />Recall      <br />Confidence  <br />Find      <br />Re-Use...
NASA screenshot<br />Explore relationships<br />Perfectly formed filters <br />organised by facet<br />
NASA screenshot<br />Topic overviews<br />
NASA screenshot<br />Concept Mapping<br />
NASA screenshot<br />Explore broader and<br />narrower topics<br />
NASA Screenshot<br />Hierarchical faceted<br />search<br />
NHS screenshot<br />Summer Health – Live Well<br />Over-50s can stay healthy and independent by keeping active and ... - L...
NHS screenshot<br />
Metadata enrichment process 1<br />Americas<br />USA<br />Earth<br />Florida<br />Tampa <br />North America<br />
Automating the process with Semantics<br />
Doing it with Fast<br />Ontology<br />Lifecycle <br />Management<br />Search and Navigation Enhancement<br />Classificatio...
Doing it with SharePoint<br />Ontology<br />Lifecycle <br />Management<br />Search and Navigation Enhancement<br />Classif...
...and as part of the SharePoint load<br />Migration Engine<br />Open Semantic Platform<br />
In summary<br />Classifying your content tells you exactly what you have, where to locate it and what to use it for<br />I...
Seth Earley, Earley & AssociatesMobilizing a Semantic Classification Project<br />
Mobilizing for Semantic Classification<br />Define the information management problem you are attempting to solve<br />Not...
Mobilizing for Semantic Classification<br />Ask where and how taxonomy and information architecture can improve the user e...
Leveraging Taxonomy to Improve Findability<br />Faceted Search<br />Navigational Search via <br />Topic Taxonomy<br />Scop...
Organization & Access<br />Site Map<br />& Navigation<br />Wireframes & Template Design<br />
Audience Analysis & IM Lifecycle Automation<br />User Roles & Groups<br />Personalization Design<br />User Scenarios<br />...
Possible Approaches<br />Option 1 - Small scale technology proof of concept<br />Option 2 – More detailed current state as...
Semaphore Planning Engagement<br />Review your findability objectives and the overall context in which Semaphore will be d...
Answering Key Questions for Initiative Success<br /><ul><li>Is our current taxonomy sufficient to meet our findability obj...
Answering Key Questions for Initiative Success<br /><ul><li>Do we need to improve our governance model to maintain a deep ...
In Summary - Creating Innovative Enterprise Search Applications<br />Design the user experience of information access, not...
Upcoming SlideShare
Loading in...5
×

FAST Search-webinar-06-29-2010

2,045

Published on

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
2,045
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
63
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide
  • Time: 1 minuteNotes:You may know by now that we segment the search market into two class of applications: those that are focused on the customer (or the citizen in the case of Gov’t applications) and customer service or revenue generation, and those that are focused on the employee and business productivity. Poll: How many in the audience are planning to deploy, are in the process of deploying, or have already deployed a customer-facing “Internet Business” search application?How many for “inside the firewall” business productivity applications?How many have both??
  • The way we think about search is we wanted to strongly move the needle for all three roles that interact with search, the end user, IT Pro, and Developer. And we have…For the End User…we’ve gone beyond the search box. From static 10 blue links experience to one where users can find, explore and connect with the information and people in the organization
  • FAST Search-webinar-06-29-2010

    1. 1. Technology Showcase – FAST Search<br />June 29, 2010<br />Creating Innovative Enterprise Search Applications to enable Business Productivity<br />
    2. 2. Technology Showcase FAST Search Webinar Schedule<br />This Webinar will be presented two times to maximize coverage over time zones worldwide<br />Session 1: <br />June 29, 2010 <br /> 6:00am Pacific Daylight Time9:00am Eastern Daylight Time2:00pm British Summer Time3:00pm Central European Time<br />Session 2:<br />June 29, 2010 <br /> 11:00am Pacific Daylight Time1:00pm Eastern Daylight Time6:00pm British Summer Time7:00pm Central European Time<br />
    3. 3. Communities of Practice<br />SharePoint IA Group: http://tech.groups.yahoo.com/group/SharePointIACoP/<br />Taxonomy Group: http://finance.groups.yahoo.com/group/TaxoCoP<br />Search Group: http://tech.groups.yahoo.com/group/SearchCoP<br />Upcoming Taxonomy Community of Practice Webinars<br />July 7, 2010 Rights Management Taxonomy – A Key to Digital Content Value <br />August 4, 2010 Getting Greater Business Value from Taxonomy Projects<br />
    4. 4. AIIM Information Organization and Access (IOA) Certificate Program by Earley & Associates <br />July 26 - July 29  Chicago, IL         <br />Aug 9 - Aug 12 Toronto, Canada       <br />Sep 13 - Sep 16 Washington, DC                   <br />Sep 20 - Sep 23 San Francisco, CA                <br />Oct 4 - Oct 7 Philadelphia,        <br />Oct 18 - Oct 21 Houston, TX  <br /> Course details at www.earley.com/training/aiim-courses<br />
    5. 5. Housekeeping<br />Calls last approximately 60 minutes<br />Questions will be taken at the end of the sessionemail your questions to sachie@earley.com<br />During the session send chats with questions to Earley & Associates<br /> If you have problems with your audio please contact Adam at 1-719-785-5674 <br />
    6. 6. Speakers & Sponsors<br />Nate Treloar<br />Principal Search Evangelist, Microsoft<br />Toby Conrad <br />Senior Consultant, Smartlogic<br />Seth Earley<br />President/CEO, Earley & Associates, Inc.<br />
    7. 7. Earley & Associates Highlights<br />Founded 1994 <br />Focus Areas Holistic approach to specific business contexts and goals for:<br /> • Retail<br /> • Manufacturing<br /> • Public Sector<br /> • Pharmaceuticals & Life Sciences<br /> • Media & Entertainment <br />Personnel Core team of 25 consultants<br />Locations Concord, MA headquarters, consultants in US, UK & Canada, global projects<br />Services • Taxonomy & Information Architecture<br /> • Search Strategy for Enterprise & Web<br /> • ECM, DAM & Information Lifecycle<br /> • Program Management & Governance<br />
    8. 8. About the Series<br />Earley & Associates has been sponsoring call series for the past 5 years on topics around content management, search, taxonomy, information architecture<br />The Technology Showcase Series arose from requests by customers to explore tools and technologies that help bridge the gap between strategy and tactics<br />Today’s call is sponsored by:<br />Smartlogic, our partner in taxonomy management and integration technologies<br />Microsoft FAST, provider of search application infrastructure<br />
    9. 9. Session Themes<br />We need to redefine our understanding and approach to search<br />Search is an application and needs to be integrated into the user experience, not just bolted on or considered an afterthought<br />Search can be made much more effective by leveraging metadata and taxonomies<br />
    10. 10. The Challenge of Search<br />Five basics truths about search<br />
    11. 11. Search as Utility<br />“search as a utility has become deeply ingrained into people's everyday lives.“ – Study by Nielsen/Net Ratings<br />“search software, hardware, and support bundle or search appliance has become very popular since being introduced in early 2002" – Goebel Group<br />These are misleading concepts.<br />Search is used as a utility, but contexts vary so widely that “plugging search in” does not always produce satisfactory results.<br />
    12. 12. Search is Heterogeneous<br />Data Sources<br />Business Intelligence<br />Document repositories<br />Intranets/web pages<br />Customer Relationship Mgt<br />Custom databases and applications<br />Search/Classification/Taxonomy Integration Framework<br />Auto categorization/<br />Clustering<br />Entity <br />Extraction<br />Faceted <br />Search<br />Semantic <br />Search<br />Appliances<br />Federated <br />Search<br />Search Mechanisms<br />
    13. 13. Truth #1:<br />We have to change our definition of search.<br />Search is no longer just a white box.<br />Search is an experience.<br />Search is about information access & capabilities.<br />
    14. 14. Truth #2:<br />Search algorithms are getting better, but they cannot infer human context and intent.<br />A search engine doesn’t know if I’m an engineer, an attorney, or a high school student.<br />Perspective has an impact on whether a set of search results are useful and appropriate.<br />
    15. 15. Truth #3:<br />Taxonomy, metadata and information architecture are key aspects of search.<br />Search is fundamentally about metadata<br />Some content is structured, some isn’t and needs help<br />Advanced search functionalities require taxonomy<br />
    16. 16. Truth #4:<br />Search is increasingly looking like navigation.<br />What happens when you click on a link?<br />Guided navigation and faceted search are really the same thing.<br />
    17. 17. Truth #5:<br />Search is messy.<br />Knowledge is messy, information is messy.<br />People find answers through haphazard and chaotic processes.<br />
    18. 18. “…search terms are short, ambiguous and an approximation of the searcher’s real information need…” <br />Source: http://research.microsoft.com/~ryenw/papers/WhiteCONTEXT2002.pdf<br />Ryen W. White, Joemon M. Jose and Ian Ruthven<br />
    19. 19. Nate Treloar, Microsoft The Search Experience <br />
    20. 20. Enterprise Search Matters<br />Create experiences that combine the magic of software with the power of Internet services across a world of devices<br />Search is the key to engaginginformation experiences<br />DESKTOP<br />ENTERPRISE<br />ONLINE<br />DEVICES<br />
    21. 21. Enterprise Search is Transforming Business<br />Connecting people to information, driving better outcomes<br />Search helps your customers get what they want<br />Search helps your employees get their jobs done <br />increasing revenue<br />cutting costs<br />
    22. 22. …and a common platform<br />Solutions for<br />Business Productivity<br />Solutions for<br />Internet Sites<br />
    23. 23. SharePoint 2010Next Generation Platform<br />Sites<br />Connect and empower <br />people<br />Communities<br />Composites<br />Cut costs with a common infrastructure<br />Content<br />Insights<br />Rapidly respond to business needs<br />Search<br />
    24. 24. Today’s Choices Force Compromise<br />Out-of-the-Box Products<br />High-end Products<br />OR<br />easy to manage<br />highly capable<br />but<br />but<br />limited in capability<br />hard to manage, expensive<br />
    25. 25. Enterprise Search from Microsoft<br />Best of SharePoint<br />Best of High-end<br />Best of Microsoft<br />Our Vision: A New Choice in Search<br />
    26. 26. FAST Search for SharePoint User Interface<br />Out-of-the-box features and controls<br />Deep Refinement<br />People Search<br />Thumbnails<br />Sorting<br />Similar Results<br />Federation<br />Previews<br />
    27. 27. FAST Content Enrichment “Pipeline"<br />Sequential stages perform specific tasks while ingesting content<br />Breaks down content to the smallest addressable chunks to build meaning<br />Understands file encoding, data formats, and written languages <br />Supports 400+ file formats, 80+ languages <br />Process your content to make it searchable<br />Normalizes content so that a consistent relevancy model can be applied<br />Identifies structured and unstructured metadata in your content<br />Maps document metadata to SharePoint Crawled Properties <br />A systematic approach to interpreting your content<br />
    28. 28. Search Driven Applications<br />Meet all the search application needs you have across your business<br />Sales: <br />360o Customer Insight<br />“How do I support theunique search needs of teams and work that impact our business?”<br />Analytics: <br />Risk Management<br />Marketing: <br />Competitive Intelligence<br />To do so, you need a search platform that has<br /><ul><li>A deep understanding of your information
    29. 29. Flexible relevance to meet diverse needs
    30. 30. A customizable UX to increase user efficiency</li></ul>Research & Development:<br />Innovation Portal<br />Support:<br />Call Center Advisor<br />Operations:<br />Systems/Logistics Portal<br />Legal, HR, IT, Finance, ……<br />
    31. 31.
    32. 32.
    33. 33.
    34. 34. Enterprise Search from Microsoft<br />www.microsoft.com/enterprisesearch/<br />UX<br />IT<br />DX<br />Eliminate compromise<br />Do more with search<br />Go beyond the search box<br />
    35. 35.
    36. 36. Building powerful applications using semantic classification<br />Toby Conrad<br />toby.conrad@smartlogic.com<br />+1 773 251 0824<br />
    37. 37. Introduction<br />Is it useful to know what content you have?<br />Will it help me improve search?<br />Will it deliver powerful search applications?<br />Examples<br />Conclusion<br />
    38. 38. Knowing what you have<br />
    39. 39. What about your content?<br />
    40. 40. How do you catalogue your content?<br />Subject<br />Creation Date<br />Modified Date<br />Location<br />Author<br />Project<br />Function<br />(IT,HR,Finance)<br />Format<br />(PDF,DOC,XLS)<br />Site<br />Expert<br />Retention<br />Expiry<br />Protective<br />Marker<br />Publisher<br />
    41. 41. How do I structure it?<br />Information<br />Subject<br />Creation Date<br />Modified Date<br />Location<br />Author<br />Project<br />Function<br />(IT,HR,Finance)<br />Format<br />(PDF,DOC,XLS)<br />Site<br />Expert<br />Retention<br />Expiry<br />Protective<br />Marker<br />Publisher<br />Structural<br />Process<br />
    42. 42. Information Seeking Behaviour<br />Subject<br />Creation Date<br />Modified Date<br />Location<br />Author<br />Project<br />Function<br />(IT,HR,Finance)<br />Format<br />(PDF,DOC,XLS)<br />Expert<br />Protective<br />Marker<br />Retention/<br />Expiry<br />Publisher<br />Department<br />
    43. 43. SharePoint 2010 Search:Metadata Refinement Panel<br />
    44. 44. What is the benefit?<br />Discovery <br />Precision <br />Recall <br />Confidence <br />Find <br />Re-Use <br />
    45. 45. NASA screenshot<br />Explore relationships<br />Perfectly formed filters <br />organised by facet<br />
    46. 46. NASA screenshot<br />Topic overviews<br />
    47. 47. NASA screenshot<br />Concept Mapping<br />
    48. 48. NASA screenshot<br />Explore broader and<br />narrower topics<br />
    49. 49. NASA Screenshot<br />Hierarchical faceted<br />search<br />
    50. 50. NHS screenshot<br />Summer Health – Live Well<br />Over-50s can stay healthy and independent by keeping active and ... - Live Well<br /><ul><li>More results</li></ul>Visual content<br />navigation<br />
    51. 51. NHS screenshot<br />
    52. 52. Metadata enrichment process 1<br />Americas<br />USA<br />Earth<br />Florida<br />Tampa <br />North America<br />
    53. 53. Automating the process with Semantics<br />
    54. 54. Doing it with Fast<br />Ontology<br />Lifecycle <br />Management<br />Search and Navigation Enhancement<br />Classification and Text Mining<br />User Requests<br />Corpus<br />Portal<br />FAST ESP Server<br />Search Application Framework<br />Sample Interface Code<br />Indexing Pipeline<br />Document Processor<br />Query Pipeline<br />Index<br />
    55. 55. Doing it with SharePoint<br />Ontology<br />Lifecycle <br />Management<br />Search and Navigation Enhancement<br />Classification and Text Mining<br />Microsoft Office SharePoint Server<br />Crawler<br />MOSS Search<br />Web Application<br />Mapped Property<br />Search Web Part<br />Semaphore Solution (WSP)<br />User Requests<br />Document Library<br />Publishing <br />Site<br />
    56. 56. ...and as part of the SharePoint load<br />Migration Engine<br />Open Semantic Platform<br />
    57. 57. In summary<br />Classifying your content tells you exactly what you have, where to locate it and what to use it for<br />It is valuable in managing, re-purposing and finding content – and most of all in supporting key processes<br />With text mining, content classification, off-the-shelf integration and UI frameworks it is now easy enough to be main stream<br />
    58. 58. Seth Earley, Earley & AssociatesMobilizing a Semantic Classification Project<br />
    59. 59. Mobilizing for Semantic Classification<br />Define the information management problem you are attempting to solve<br />Not enough to say “make the information easier to use”<br />Focus on a specific process, user type, audience, context, information source<br />For example – Proposal development process for sales organization in North America<br />Enroll the appropriate business stakeholders, executive sponsor, content or repository owners <br />Quantify business impact of the problem<br />plan<br />
    60. 60. Mobilizing for Semantic Classification<br />Ask where and how taxonomy and information architecture can improve the user experience<br />Consider overall site organizing principles as well as search functionality <br />Develop both active and passive personalization to surface content to users in the context of their task<br />plan<br />
    61. 61. Leveraging Taxonomy to Improve Findability<br />Faceted Search<br />Navigational Search via <br />Topic Taxonomy<br />Scoped Search<br />
    62. 62. Organization & Access<br />Site Map<br />& Navigation<br />Wireframes & Template Design<br />
    63. 63. Audience Analysis & IM Lifecycle Automation<br />User Roles & Groups<br />Personalization Design<br />User Scenarios<br />Workflow Design<br />
    64. 64. Possible Approaches<br />Option 1 - Small scale technology proof of concept<br />Option 2 – More detailed current state assessment of content, processes and organizing principles <br />Option 3 – Full information strategy and architecture development<br /> Determine the correct overall approach through a Semaphore Planning Engagement<br />
    65. 65. Semaphore Planning Engagement<br />Review your findability objectives and the overall context in which Semaphore will be deployed (technology, organization, processes, and representative assets)<br />Plan and configure a proof of concept that helps you assess the potential of Semaphore<br />Demonstrating how it will improve findability<br />Evaluating the potential of automated taxonomy creation and tagging given a selection of representative documents or other digital assets<br />Document results and identify opportunities and challenges to succeeding with a Semaphore deployment in your organization<br />Collaborate to develop a roadmap for meeting your findability objectives<br />Planning out technology integration, taxonomy development, knowledge transfer, sustainable governance and other aspects of a successful initiative<br />
    66. 66. Answering Key Questions for Initiative Success<br /><ul><li>Is our current taxonomy sufficient to meet our findability objectives?</li></ul>If not, where do we need to expand to more facets for search and navigation?<br />Do we need to custom craft high-value areas of taxonomy to support our core business operations? <br /><ul><li>Does our taxonomy have the rich synonyms and relationships required to ensure automated tagging?</li></ul> If not, will automated tagging do a good job with the metadata present in our unstructured content? <br />Can we “extract” semantic metadata using Semaphore’s ability to text mine and generate an ontology? <br />plan<br />
    67. 67. Answering Key Questions for Initiative Success<br /><ul><li>Do we need to improve our governance model to maintain a deep and broad taxonomy over time?</li></ul>Can we leverage our existing governance model for data management or do we need a different approach? <br /><ul><li>What are the skills and resources needed to successfully maintain a Semaphore solution over time?</li></ul>How much training will our organization need? Given our environment and culture what is the best way to develop the skills we need? <br />How will we leverage semantic classification to further improve findability and reuse in our IT initiatives?<br />plan<br />
    68. 68. In Summary - Creating Innovative Enterprise Search Applications<br />Design the user experience of information access, not simply ‘white box’ search<br />Think of search throughout the design of your application, not as an afterthought<br />Allow the user to make choices that will “disambiguate” their searches – help them think more precisely <br />Carefully evaluate your environment – people, process, content, tools – in order to improve the search experience<br />
    69. 69. Questions?<br />
    70. 70. Please fill out the survey that should be in your inbox. <br />Let us know what topics you are interested in and how we can improve the series.<br />Seth Earley<br />seth@earley.com<br />www.earley.com<br />781-820-8080<br />
    1. A particular slide catching your eye?

      Clipping is a handy way to collect important slides you want to go back to later.

    ×