This document provides an overview of enterprise search capabilities in Microsoft Office SharePoint Server (MOSS) 2007. It discusses features like search scopes, best bets, federated search, people search, and business data catalog for integrating line-of-business applications. It also covers search configuration topics like defining a search roadmap, assigning relevance weighting, developing best bets and editorial guidelines. The document is intended to help configure and optimize MOSS 2007 search for an enterprise.
Optimizing SharePoint for Transactional Content ManagementDocFluix, LLC
While SharePoint 2010 and 2013 has a wide range of great document management features, organizations that need "transactional content management" (such as invoices, purchase orders, claims, registration forms or other high volume documents related to a business process or transaction) find numerous challenges in optimizing SharePoint for this purpose. This presentation will cover how best to configure and optimize SharePoint for this type of document management.
Developing Search-driven application in SharePoint 2013 SPC Adriatics
Search-driven solutions are applications that use a search engine to drive the data access and present results. Microsoft SharePoint 2013 offers developers new ways to extend search to create search-based solutions and Apps. Using Search applications, developers can unite and control data from different site collections and external locations. In this session, I will cover all different ways of querying SharePoint 2013 Search including Client-Side Object Model (CSOM) and REST API. The main goal of the session is to provide strong understanding of search-driven solutions for the attendees and encourage many new ideas for using search to deliver end-user productivity.
Darko Milevski
Optimizing SharePoint for Transactional Content ManagementDocFluix, LLC
While SharePoint 2010 and 2013 has a wide range of great document management features, organizations that need "transactional content management" (such as invoices, purchase orders, claims, registration forms or other high volume documents related to a business process or transaction) find numerous challenges in optimizing SharePoint for this purpose. This presentation will cover how best to configure and optimize SharePoint for this type of document management.
Developing Search-driven application in SharePoint 2013 SPC Adriatics
Search-driven solutions are applications that use a search engine to drive the data access and present results. Microsoft SharePoint 2013 offers developers new ways to extend search to create search-based solutions and Apps. Using Search applications, developers can unite and control data from different site collections and external locations. In this session, I will cover all different ways of querying SharePoint 2013 Search including Client-Side Object Model (CSOM) and REST API. The main goal of the session is to provide strong understanding of search-driven solutions for the attendees and encourage many new ideas for using search to deliver end-user productivity.
Darko Milevski
SEF2013 - Create a Business Solution, Step by Step, with No Managed CodeMarc D Anderson
with Christian Ståhl
In this session, you will learn how you can devise powerful solutions from beginning to end without deploying any managed code with two of the biggest proponents of this approach. We’ll take a business problem and go through the actual solution in SharePoint 2013, but we’ll dip into SharePoint 2010 as well to see how the solution might work there and discuss how we might approach things differently. You’ll get the solution in a WSP as well as the underlying code.
Presentation around several tips & trick to improve SharePoint (on premise) performance, mainly by tweaking the SQL databases.
Download the ppt for fun animations !
Building an unstructured data management solution with elastic search and ama...mobiusservices
Learn how to manage unstructured data by building a document database with document, page indexing and retrieval solutions using Elasticsearch and Amazon Web Services
Content Management with MongoDB by Mark HelmstetterMongoDB
MongoDB is great for content management and delivery across a multitude of apps such as e-commerce websites, online publications, web content management systems (CMS), document management, archives and others. MongoDB's flexible schema and data model make it easy to catalog multiple content types with diverse meta data.
-Schema design for content management
-Using GridFS for storing binary files
-How you can leverage MongoDB's auto-sharding to partition your content across multiple servers
If you've been meaning to learn jQuery but haven't found the time, come to this introductory session where we'll cover all of the important basics of jQuery in a SharePoint context. By the end of the workshop, you'll be ready to start adding jQuery customizations to your SharePoint pages. We'll cover Selectors, Traversing, Manipulation, Events and Effects as I cover in my article series at SharePoint Magazine.
With SharePoint 2013, Microsoft has combined the best features of SharePoint search and FAST into a single engine with better relevance, faster performance, and easier configuration.
This session will introduce SharePoint 2013’s new search capabilities and provide tips for deploying an enterprise-wide search platform. We’ll walk through strategies for managing content sources, optimizing filters, and providing a clean interface with display templates.
SharePoint 2016 Platform Adoption Lessons Learned and Advanced TroubleshootingJohn Calvert
Lessons learned from designing and building a modern SharePoint Server 2016 platform architecture for a Government of Canada agency and some advanced troubleshooting scenarios that arose. We will look in particular at web and service applications, host-named site collections, search, and security.
SEF2013 - Create a Business Solution, Step by Step, with No Managed CodeMarc D Anderson
with Christian Ståhl
In this session, you will learn how you can devise powerful solutions from beginning to end without deploying any managed code with two of the biggest proponents of this approach. We’ll take a business problem and go through the actual solution in SharePoint 2013, but we’ll dip into SharePoint 2010 as well to see how the solution might work there and discuss how we might approach things differently. You’ll get the solution in a WSP as well as the underlying code.
Presentation around several tips & trick to improve SharePoint (on premise) performance, mainly by tweaking the SQL databases.
Download the ppt for fun animations !
Building an unstructured data management solution with elastic search and ama...mobiusservices
Learn how to manage unstructured data by building a document database with document, page indexing and retrieval solutions using Elasticsearch and Amazon Web Services
Content Management with MongoDB by Mark HelmstetterMongoDB
MongoDB is great for content management and delivery across a multitude of apps such as e-commerce websites, online publications, web content management systems (CMS), document management, archives and others. MongoDB's flexible schema and data model make it easy to catalog multiple content types with diverse meta data.
-Schema design for content management
-Using GridFS for storing binary files
-How you can leverage MongoDB's auto-sharding to partition your content across multiple servers
If you've been meaning to learn jQuery but haven't found the time, come to this introductory session where we'll cover all of the important basics of jQuery in a SharePoint context. By the end of the workshop, you'll be ready to start adding jQuery customizations to your SharePoint pages. We'll cover Selectors, Traversing, Manipulation, Events and Effects as I cover in my article series at SharePoint Magazine.
With SharePoint 2013, Microsoft has combined the best features of SharePoint search and FAST into a single engine with better relevance, faster performance, and easier configuration.
This session will introduce SharePoint 2013’s new search capabilities and provide tips for deploying an enterprise-wide search platform. We’ll walk through strategies for managing content sources, optimizing filters, and providing a clean interface with display templates.
SharePoint 2016 Platform Adoption Lessons Learned and Advanced TroubleshootingJohn Calvert
Lessons learned from designing and building a modern SharePoint Server 2016 platform architecture for a Government of Canada agency and some advanced troubleshooting scenarios that arose. We will look in particular at web and service applications, host-named site collections, search, and security.
Workshop - Ways of Working Within the M365 Workspace.pptxSimon Rawson
This is an in-depth walkthrough of Microsoft 365 information architecture. It includes:
* Ways into M365 - Teams, SharePoint and Viva Connections
* Elements of information architecture in M365
* M365 governance
* Classification, security and retention models and the role they play in enterprise search
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010Henry Ong
This slide deck was presented by Henry Ong at SharePoint Saturday Los Angeles on April 14, 2012. The original content was contributed by Chris McNulty, Strategic Product Manager for Quest Software. There are notes in many of the slides so you may want to download this presentation to get all the content.
SPS Phoenix Optimizing SharePoint for Transactional Content ManagementDocFluix, LLC
Transactional content is comprised of documents that organization receives from external parties, such as vendors, customers and partners. Common examples of transactional documents are vendor invoices, purchase orders from clients, claim forms, application and enrollment forms and most any documents that a company receives on paper and needs to scan in.
While SharePoint is a an excellent platform for Enterprise Content Management, there are a few challenges in using SharePoint to manage transactional content.
This presentation addresses some of the more obscure (yet complex) out of the box features that can be configured in SharePoint to improve management of transactional content along with some software products from Hershey Technologies that simplify and enhance the user experience.
How to SEO a Terrific - and Profitable - User ExperienceBrightEdge
Tune in for Portent SEO Marianne Sweeny’s January webinar: “How to SEO a Terrific – and Profitable – User Experience.” Learn how search engine algorithms are now incorporating IA, UX and content strategy, as well as methods for directing Google, Bing & Co. to perform better for your users.
This presentation is introduction to search world and deep-dive to Azure Search. Building well functioning search for web site is never easy. It needs planning of search indexes, analysis for technical solution and good knowledge about how users interact with search. Azure Search is simple search service to search-enable web sites and other systems. This presentation uses simple example site to make deep-dive to Azure Search and show audience how to build scalable and powerful search solution on it.
10 Things I Like in SharePoint 2013 SearchSPC Adriatics
Speaker: Agnes Molnar;
Based on my SharePoint and FAST Search experience, I’ll demonstrate my “Research Path” on SharePoint 2013 Search. What’s new, what improvements we can find there as well as how to use our existing Search knowledge and experience in SharePoint 2013 Search.
You will learn:
Config options in SharePoint 2013 Search – Central Admin vs. PowerShell
Crawled and Managed Properties across Content Sources
Ranking and Relevancy
Site search is one of the core functionality of any website. This talk provides an overview of internal workings of CQ5 search, its limitations for implementing site search functionality and discusses design patterns & challenges for integrating various 3rd party search providers with CQ5/AEM.
Solving Real World Challenges with Enterprise SearchSPC Adriatics
Enterprise Search is complex, even in theory. But when you implement your search solution and everything turns to reality, you’ll find some new, never-seen challenges. In this session, I’ll collect the best, biggest and most exciting challenges from my experience, including real world customer scenarios and solutions. Regardless of the SharePoint version you use (SharePoint 2010, FAST Search for SharePoint, SharePoint 2013), this session is for you if you want to prepare for these “unexpected” scenarios.
Connection and Context: ROI of AI for Digital MarketingMarianne Sweeny
This presentation explores the intersection of emerging AI technology with SEO, UX, content strategy and digital marketing with prescriptive guidance on how to influence machine learning for the right outcomes.
Delivered at Enterprise Search and Discovery 2015, this presentation takes a look at the search landscape users enjoy outside the firewall and the expectations it fosters inside. It presents contemporary user research on enterprise search behavior and uses these findings to make recommendations to enhance enterprise search effectiveness.
Team of Rivals: UX, SEO, Content & Dev UXDC 2015Marianne Sweeny
The search engine landscape has changed dramatically and now relies heavily on user experience signals to influence rank in search results. In this presentation, I explore search engine methods for evaluating UX in a machine readable fashion and present a framework for successful cross-discipline collaboration.
Cross discipline collaboration benefits from group think, a consolidation of soft system methodology and user focused design that all starts with design thinking that sees clients, designers, developers and information architects working together to address user problems and needs. As with any great adventure, design thinking starts with exploration and discovery.This presentation examines the high level tenants of system thinking, expands the scope of user thinking to include tools and devices that users employ to find out designs and delve into the specifics of design thinking, its methods and outcomes.
I was invited to speak at OMCap Berlin 2014 about the close relationship between search engines and user experience with prescriptive guidance to gain higher rankings and more conversions.
While we have been busy trying to "define the damn thing" IA or answering the age old question of who rules, UX, IxDA or IA, the search engines have been busily transitioning to a machine mediated experience model for ranking. This means that SEO is now the responsibility of UX/IA whether we like it or not. This presentation lays out how search engines evaluate user experience and how we can influence this evaluation with an optimized design.
This presentation looks at new methodologies of keyword research to meet the linguistic and semantic sophistication that is Web search today. Search engines are changing and SEO must change with them to meet the challenge of getting the right visitors to the site.
Birds Bears and Bs:Optimal SEO for Today's Search EnginesMarianne Sweeny
In February of 2012, Google began launching the Panda Update (bears), the first of many steps away from a link-based model of relevance to a user experience model of relevance. This bearish focus on relevance use algorithms to determine a positive user experience focused on click-through (does the user select the result), bounce rate (does the user take action once they arrive at the landing page) and conversion (does the landing page satisfy the user’s information need). Content and information design became the foundation for relevance. Sadly, no one at Google told the content strategists, user experience professionals and information architects about their new influence on search engine performance. In April of 2012, Google followed up with the Penguin update (birds), a direct assault on link building, a mainstay of traditional search engine optimization (SEO). The Penguin algorithm evaluates the context and quality of links pointing to a site. Website found to be “over optimized” with low quality links are removed from Google’s index. Matt Cutts, GOogle Webmaster and the public face of Google, summed this up best: “And so that’s the sort of thing where we try to make the web site, uh Google Bot smarter, we try to make our relevance more adaptive so that people don’t do SEO, we handle that...” Sadly, Google is short on detail about how they are handling SEO, what constitutes adaptive relevance and how user experience professionals, information architects and content strategists can contribute thought-processing biped wisdom to computational algorithmic adaptive relevance so that searchers find what they are looking for even when they do not know that that is. This presentation will provide a brief introduction to the inner workings of information retrieval, the foundation of all search engines, even Google. On this foundation, I will dive deep into the Bs of how to optimize Web sites for today’s search technology: Be focused, Be authoritative, Be contextual and Be engaging. Birds (Penguin), Bears (Panda) & Bees: Optimal SEO will provide insight into recent search engine changes, proscriptive optimization guidance for usability and content strategy and foresight into the future direction of search.
Search Solutions 2011: Successful Enterprise Search By DesignMarianne Sweeny
When your colleagues say they want Google, they don’t mean the Google Search Appliance. They mean the Google Search user experience: pervasive, expedient and delivering the information that they need. Successful enterprise search does not start with the application features, is not part of the information architecture, does not come from a controlled vocabulary and does not emerge on its own from the developers. It requires enterprise-specific data mining, enterprise-specific user-centered design and fine tuning to turn “search sucks” into search success within the firewall. This presentation looks at action items, tools and deliverables for Discovery, Planning, Design and Post Launch phases of an enterprise search deployment.
Bearish SEO: Defining the User Experience for Google’s Panda Search LandscapeMarianne Sweeny
The search sun shifted in March 2011 when Google started rolling out the beginning of the Panda update. Instead of using the famous PageRank, a link-based relevance calculation, Panda rests on a machine interpretation of user experience to decide which sites are most relevant to a searchers quest for knowledge. This means that IA and UX practitioners need to start thinking about the machine implications of the way they structure information on the web, and think ahead about the human implications for how search engines present their sites in response to searcher queries. Bearish SEO will present real, actionable methods for content providers, information architects and user experience designers to directly influence search engine discoverability. Need is an experience. It is a state of being. The goal for this presentation is to ensure that user experience professionals become an integral part of designing search experience.
Finding, or not finding, information is consistently the most called out issue in the enterprise. Technology companies spend millions developing features that remain idle because, while everyone is concerned about optimizing enterprise search, no one is doing anything about it. The PM cuts the budget because "the devs will do it." The IA/UX architects do not have the specific expertise. The developers want to do it but do not have appropriate guidance.
This is a call-to-action for developers and ITpros to make sure that they get what they need to make search in the enterprise work. Because, after the interactive marketing agency has left the building, they are the ones that will be hearing "search sucks" directed at them.
At the 2011 Polish IA Summit, I examine big changes in optimizing for search engines.
We now know that Google is not infallible (seems that companies are easily able to game the PR system) or t all knowing (seems it takes a competitor with a friend at the New York Times to reveal said PR gaming). We also found out that Google can be capricious with blanket suppression of content from certain sites regardless of whether users find it relevant.
This presentation looks at search optimization tools ant tactics that work regardless of these changes and how to keep the site optimized.
Search engines have changed a lot over the last 15 years and optimizing Websites for them must keep up. This presentation looks at the search landscape and present strategies and tactics for optimizing for today's search.
Uw Digital Communications Social Media Is Not SearchMarianne Sweeny
I had the pleasure of speaking to one of the Digital Communication classes at the University of Washington on my favorite topic, why social media will never replace search as an information finding medium. Those students were wicked smart and I walked away learning a lot myself.
Enterprise Search Share Point2009 Best Practices FinalMarianne Sweeny
This presentation examines features and benefits in Microsoft Office SharePoint Server (MOSS) 2007 enteprise search. It contains configuration guidance, code snippets, tips and tricks.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
Essentials of Automations: Optimizing FME Workflows with ParametersSafe Software
Are you looking to streamline your workflows and boost your projects’ efficiency? Do you find yourself searching for ways to add flexibility and control over your FME workflows? If so, you’re in the right place.
Join us for an insightful dive into the world of FME parameters, a critical element in optimizing workflow efficiency. This webinar marks the beginning of our three-part “Essentials of Automation” series. This first webinar is designed to equip you with the knowledge and skills to utilize parameters effectively: enhancing the flexibility, maintainability, and user control of your FME projects.
Here’s what you’ll gain:
- Essentials of FME Parameters: Understand the pivotal role of parameters, including Reader/Writer, Transformer, User, and FME Flow categories. Discover how they are the key to unlocking automation and optimization within your workflows.
- Practical Applications in FME Form: Delve into key user parameter types including choice, connections, and file URLs. Allow users to control how a workflow runs, making your workflows more reusable. Learn to import values and deliver the best user experience for your workflows while enhancing accuracy.
- Optimization Strategies in FME Flow: Explore the creation and strategic deployment of parameters in FME Flow, including the use of deployment and geometry parameters, to maximize workflow efficiency.
- Pro Tips for Success: Gain insights on parameterizing connections and leveraging new features like Conditional Visibility for clarity and simplicity.
We’ll wrap up with a glimpse into future webinars, followed by a Q&A session to address your specific questions surrounding this topic.
Don’t miss this opportunity to elevate your FME expertise and drive your projects to new heights of efficiency.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on the notifications, alerts, and approval requests using Slack for Bonterra Impact Management. The solutions covered in this webinar can also be deployed for Microsoft Teams.
Interested in deploying notification automations for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
In this insightful webinar, Inflectra explores how artificial intelligence (AI) is transforming software development and testing. Discover how AI-powered tools are revolutionizing every stage of the software development lifecycle (SDLC), from design and prototyping to testing, deployment, and monitoring.
Learn about:
• The Future of Testing: How AI is shifting testing towards verification, analysis, and higher-level skills, while reducing repetitive tasks.
• Test Automation: How AI-powered test case generation, optimization, and self-healing tests are making testing more efficient and effective.
• Visual Testing: Explore the emerging capabilities of AI in visual testing and how it's set to revolutionize UI verification.
• Inflectra's AI Solutions: See demonstrations of Inflectra's cutting-edge AI tools like the ChatGPT plugin and Azure Open AI platform, designed to streamline your testing process.
Whether you're a developer, tester, or QA professional, this webinar will give you valuable insights into how AI is shaping the future of software delivery.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
2. Marianne Sweeny
Ascentium
www.ascentium.com
Marianne.sweeny@ascentium.com
Director of Search Services, Web producer
at Microsoft for 7+ years, pointy-head not
propeller-head
3. Agenda
Introduction
MOSS 2007 Search
Configuring MOSS Search
Here There Be Dragons
Resources
Appendix
5. There is No Magic Bullet
Susan Feldman (IDC) Enterprise Search Summit West 2008
– Employees average 3.5 hours/week searching
– Cost = $5000 per employee per year
There can be no “silver bullet” solution for finding information
– Customers don‟t know what they don‟t know
– “Google experience” is finding what they want/need in the first
few pages and not necessarily Google itself
– Enterprises have different lines of business and different
information types
Search of tomorrow: is here today
– Personalized to the device and user
– Contextual
– Flexible
– Secure
– Adaptable
6. Search Index: A Different Kind of Database
Search Engine Index SQL Server Index
7. Web Search and Enterprise Search
Web Search Enterprise Search
Publishers want their content to Publishers do not think about
be found document discoverability
Anarchistic publishing model = Controlled corpus of documents
“anyone, anywhere, any time” Standards and practices in place
Unlimited document set No spam
No real standards or code, more Users and authors generally
like guidelines share contextual understanding
No central authority Customized tagging or metadata
Spam Can customize search
Commercialization technology to enterprise themes
Technology is agnostic and concepts
Has to work the same for
everyone worldwide
No shared understanding Successful enterprise search efforts target corpuses of information
and set search scopes appropriately. I&KM pros are wise to study
information worker context before trying to “Google-ize” their
enterprises. Forrester Search Wave Q2 2008
8. Advanced Search
Few customers use it and those that do are
disappointed
Boolean or SQL operators work sporadically
Confusing message
What is “regular” search…not as effective
Searchhas progressed beyond the stages
of Advanced
Filters
Facets
Context
9. MOSS 2007 Search
Query engine
breaks the search
terms down
Index engine stores
the properties
Content index
stores the text
10. Better Than Ever
MOSS 2007 SharePoint 2003
Relevance customizable to the Relevance keyed on numeric values
enterprise content derived solely from document text
Automated metadata extraction Collection frequency
Enhanced text analysis Term frequency
Fully integrated admin experience Document length
between Windows Term position
SharePoint Services v3 and MOSS Different systems between Windows
2007 SharePoint Systems and SharePoint
Single search system and index Portal Server
per server farm Multiple indexes
Custom content groups, Best Custom Content groups, Best
Bets, scheduling are now shared Bets, scheduling configurations
services are portal-based
Scopes can be tied to document Scopes tied to content sources
properties Index propagated at completion of
Improved control over indexing master crawl only
11. Simplified Administration UI
Search settings page at the SSP level
Managing crawls
• Content sources
• Explicit SharePoint Content Source Type
• Content source for Business Data (Enterprise CAL)
Crawl logs
• Snapshot of crawled content in your index – lists all documents found
in the content source and their status
• Filters by date, site, and etc.
• Summary by host name (#of successes, errors, and warnings)
Crawl rules
• Included and excluded rules
• Ability to pre-test crawl rules
• Easy to change order of crawl rules
Managing scopes
• Scopes decoupled from content sources
• Scopes can span multiple content sources
• Scope by Property, Site, Content Source, and URL
12. Indexing Performance Improvements
Search is a shared service
– Unified WSS and MOSS search for 1 index per SSP
– Crawls, content sources, crawl rules schema, shared scopes etc are
administered centrally at the shared service level
– Scopes and best bets can also be administered at the consuming sites
Crawl to small indexes that are then consolidated at scheduled times
into a “master merge”
Content index that holds text of pages with Property store that holds
other document values
Propagate data incrementally as it is being indexed to the query
servers
– Propagation starts within 30 seconds of the first shadow index written
– No need to wait till the end of the crawl for information to be available in
queries
– No propagation of properties
Single item add /removal without re-indexing entire corpus with
continuous propagation
– Change Log Crawl: detects what items have changed with in a WSS or a
MOSS 2007 site and crawl only those items
– Security Change Only Crawl: no need to fully index all the content of a site
when permissions on this site have changed
13. Relevance: Types
Dynamic ranking = relevance impacted by query term
– Frequency
– Location in document
– Appearance in link text
– Appearance in URL
Static ranking = relevance independent of customer query
– URL Depth
– Click Distance
– Authority/Demoted site
– Change property weights
– Language of customer (browser setting)
– Document type: HTML files, PPT, Word docs, emails
, XML files, Excel spreadsheets, Plain text, List
items
14. Relevance: Enhancements
Manually assign synonyms and editorialized results to keywords
– Use search logs to detect popular searches, low click-
through from results or 0 result queries
Search Alerts
– User can subscribe to receive email when results change
File type filtering
– Some file types are deemed more relevant (i.e. HTML,
DOC) than others (XML, txt)
– Supports 220 files types, MS and non-MS application
Property weights *
– Assign different weights to properties so that important
properties such as „Title‟ have a bigger influence on
ranking
– Change default property weights through the Schema
Object Model
– Note: The weights used in the product were carefully
tested. Changes to the weights may also have a negative
effect on relevance
* Marcy Tobin wants me to tell you that this is not a trivial undertaking
15. MOSS 2007 Faceted Search
Facets are predetermined
content categories presented
to the customer to narrow
search results
•Can be presented pre- or
post- query
•Used for Advanced search
Empowers customer to most
effectively refine their search
Filters results by
predetermined categories
16. Federated Search
Import or export federated locations using Federated
Location Definition (.FLD) files
Incorporates results from outside content sources that
subscribe to OpenSearch 1.1
Passes the query into the subscribed resource and
returns results into single interface
Relevance calculation done according to originating
resource criteria, not MOSS 2007 criteria
Pre-defined FLD files found at
http://www.microsoft.com/enterprisesearch/connector
s/federated.aspx#fscp
Can develop own FLD files if destination subscribes to
OpenSearch 1.1
– Day Software has developed a standard connector for LiveLink
ECM
17. People Search
Build and publish rich personal profiles
Customize personal profile attributes
Populate personal profiles using information from Active Directory, other
LDAP directories, or Line-of Business systems
Control access to information using security and privacy controls
Generate and display organizational charts based on directory
information
Publish personal profiles using MOSS My Sites
Identify people who can help
Find people based on keyword matches with MOSS personal profiles
Find people in line-of-business systems
Filter results by common attributes such as Job Title or Department
Find “in-common” connections, including managers, site memberships,
distribution lists, and colleagues
Group results by social distance
Subscribe to People Alerts
18. People Search Results Page
Find people by project, expertise or…
Filter by
relevant
attributes
Contact information & online availability
19. LOB Applications with BDC
Extracts data from line-of-
business, CRM, and other
3rd Party data stores
Caches for indexing by search
service
Searches any data source
accessible through ADO.net or
Web Services
Uses Live Communication
Server for connectivity options
Aggregated into a single
application
20. FAST ESP Technology
FAST is a sophisticated search engine tailor-made for ecommerce and help
desk
Uses sophisticated linguistic processing
Searches structured and unstructured content
Indexing Process: Conversion-language detection-synonyms-spell check-
external call outs-entity extraction-categorization-vectorization-custom
navigation-normalizer-alerting-indexing
Why is it Unique
Auto Classification
Advanced Linguistics: text mining for
concept and relationship mapping
Recall: Lemmatization, synonym
expansion, wildcards, anti-phrasing,
phonetic search
Precision: Exact word matching,
exact phrase matching, proximity,
tokenization
Location aware results (retail and
news) – excellent for mobile search
Recommendation engine
Increased capacity:100-200 million
documents on 1 server and 150
million q/second
21. Custom Results
Search Scopes
Allow users to refine search through filtering
Define content resources and map to business rules/key concepts
Focused content = shared understanding = more precise results
Duplicate results filtering
Collapsing duplicates from same directory or site to leave more room for
other relevant results
Less favoritism, more results on desired page 1
Definitions
Automatically extract “definitions” from indexed content and display them
as matches directly on the results page
A web property on the Search Best Bets web part (can turn on/off display
of definition)
Returned in the Query Object Model
Can not be edited
Best Bets
Editorially assigned results based on these key concepts assigned to
selected query terms
Can be many-to-many
22. Scalability
No physical limit for the maximum number of
documents in one index
Recommended document limit is 50 Millions of
documents per indexer
A document is anything from a Word or PowerPoint
file, to a web page, an individual SharePoint list
item, one people entry, or an SAP customer record
Large/small documents count the same
The „average document size‟ depends on the
corpus mix
– i.e., heavy use of WSS 3.0 lists versus limited use
Dependent on supporting hardware
23. Security
Query time stripping – customer only sees those results
that they have permission to view
Support for pluggable authentication for content in
SharePoint Server and WSS 3.0 Sites
Implements ASP.NET 2.0 authentication model
Minimum crawler permission is “Full Read”
Still provides the same security trimming functionality
Automatically configured for new sites
Search visibility options
Prevent sites/lists appearing in search results at a
site/list level
“Security only” crawl for single item add/removal
24. Search Analytics
Export search logs to Excel
Query terms
Page views
Number of results returned
Volume trends
Query success: can define success for
certain query terms
Report Center
Access to MOSS 2007 BI features
Filters data for permissions and relevance
Key Performance Indicators [KPI]
Create a KPI list or other measures of
success
Default KPIs exist in OOB deployment
KPI information can be drawn from MOSS
2007 data sources: SharePoint lists, Excel
workbooks, SQL Server 2005 Analysis
Services, manually entered information
26. Search Roadmap
Useful participants
Content creators
Information Architect/User Experience Architect
Taxonomist
Define key enterprise themes in content
Map existing content to these themes
Create filters and scopes to map for themes
Get as much customer data as possible to find search pain points
Review search logs and customer feedback mechanisms
What are they trying to find
What terms are they using
Assemble a cross functional team to:
Assign relevance weighting that makes sense to the customer behavior and the
corpus
Develop Best Bets for searches with 0 results
Create editorial guidelines and tools that enforce strong meta data standards across
the enterprise
Develop controlled vocabulary that best describes enterprise key concepts and
themes and Is used as a foundation for meaningful metadata and facets
Design a structure that leverages the structural elements like URL depth and click
distance
27. Pareto‟s Principle
Known as the 80/20 rule
Named after late 19th
century economist
20% of your content is
answering 80% of your
searches
Not an excuse to stop
optimizing at the top 20%
Don‟t forget the Long Tail
28. Define Content
Define content scopes
Segment content into logical groups
Create scope rule based on
– Address
– Property query
– Content source
At the SSP level or individual level
SSP level scopes are shared among all sites that use the SSP
Select Authority resources
Define special terms if needed
Terms or language proprietary to the enterprise
– i.e. “goat rodeo”
Provides additional clarification for searcher
Use synonym mapping for term variants
– C# and Csharp
Two information points can be displayed for a special term
– Definition of the term
– Best Bet
29. Designate Authority Sites
Hilltop Algorithm
Quality of links more important
than quantity of links
Segmentation of corpus into broad
topics
Selection of authority sources
within these topic areas
Pre-query calculation applied at
query time
Topic Sensitive Page Rank
Consolidation of Hypertext
Induced Topic Selection [HITS]
and PageRank
Pre-query calculation of factors
based on subset of corpus
– Context of term use in document
– Context of term use in history of
queries
– Context of term use by user
submitting query
30. Educate: Structural Influences
File Type Bias
In order of relevancy (highest to lowest )
– HTML Web pages
– PowerPoint presentations
– Word documents
– Emails
– XML files
– Excel spreadsheets
– Plain text files
– List items
Auto Language Detect
Foreign language results are less relevant than results in user‟s
language
English language is always considered as relevant as user‟s language
URL Depth and Click Distance
Short URLs are like prime real estate.
Items with shorter URLs are considered more relevant than items
placed in longer URLs
– The level is determined by reviewing the number of slash (“/”) characters in
the URL
Keywords separated by hyphens in the URL are good
31. Educate: Content Influences
Anchor Link Text
Search indexes the anchor text from the following elements:
– HTML anchor elements
– SharePoint Services link lists
– SharePoint Portal Server 2003 listings
– Word 2007, Excel 2007, and PowerPoint 2007 hyperlinks
Any file types handled by installed 3rd party iFilter components
which emit hyperlinks
Metadata extraction
Shadow title detection is provided within the body of the item
– Primarily based on text formatting features
– Shadow title is added automatically to the document
– Weighted the same as the original title
– Only for Microsoft Office file types
Auto Description text
Optimized URLs
Enterprise Search checks URL matching at query time:
If query matches to the host name of a page in the index it will
display as the first result
32. Enhanced Search Results
Site Actions >> Site Settings >> Modify All Site Settings >> Site Collection Administration
(Select Keywords) >> Manage Keywords >> quot;Add Keyword“ >>
Synonym Mapping Best Bets
33. Hardware Considerations
Dedicated crawl-target servers for large
sites
Separate SQL Server instance for Search
Fast disk for SQL, fast CPU for Indexer,
more memory
Dedicated Web Front End Server for
crawling
Separate indexer machine
In most cases, your search index is on its own
server
34. Indexing Configuration
Use dedicated web front ends for crawling large
farms/sites
Upgrade WSS 2003 sites to WSS 2007 sites to
index them faster
Define Crawler Impact Rules to avoid site overload
Schedule for off-hours crawling where appropriate
Balance results freshness with load on servers
Consider using single content access account per
region
Regularly cleanup and Review
Crawl rules
Property and schema
Best Bets / keywords
35. Customizing Results Display
To access the XSL property of the Search Core Results Web Part
1. In your browser, navigate to the results page URL:Copy Code
http://<ServerName>/SearchCenter/Pages/results.aspx
2. Click the Site Actions link, and then click Edit Page.
3. In the Search Core Results Web Part, click the edit down arrow to
display the Web Part menu, and then click Modify Shared Web
Part. This opens the Search Core Results Web Part tool pane.
4. Click Data Form Web Part to display the XSL Editornode.
5. Click the Source Editor button.
6. This opens the Text Entry window for the Web Part's XSL
property. You can modify the XSLT directly in this window;
however, you may find it easier to copy the code to a file. You
can then edit that file using an application such as Visual Studio
2005.
7. After you have finished editing the file, you can copy the modified
code back into the Text Entry window and save your changes to
the Search Core Results Web Part.
37. Dragons 1
Note the infrastructure update where Microsoft rolled
the features of Search Server 2008 into MOSS 2007
that includes federated search ability, and a unified
administration dashboard.
Read more here:
http://blogs.msdn.com/sharepoint/archive/2008/07/15/announci
ng-availability-of-infrastructure-updates.aspx
Also please note that it is *not* an easy installation,
and that users *must* read the entire documentation
for it before upgrading their portal.
More people destroy their portal than upgrade it due to not
reading the documentation and installing the prerequisite
patches
Must ensure a schedule for the incremental crawl to
catch additions to the document set
Must turn on PDF indexer and stemming
38. Dragons 2
Use the Web part to accommodates wildcard
search
Found here:
http://www.sharepointblogs.com/mirror/archive/2008/06
/09/new-web-part-for-wildcard-search-in-enterprise-
search.aspx
Use of special characters in the thesaurus can lead to
highly irrelevant results and impact “did you mean”
capabilities
The Expert search capacity is predicated on the My
Sites profile
Employee participation critical to optimal
functionality
Benefits of click-distance are missed if Authority sites
are not configured
39. Dragons 3
The value of statistical ranking can vary from the
partial indexes to the master merge index
Without authoritative sites configured in the relevance
settings, the benefits of click-distance are missed
Results delayed from servers without Internet
connections
Backward compatibility
Custom applications using SharePoint 2003
administrative object model must be rewritten to
use MOSS 2007 object model
Index files, scopes, search alerts, filters, word
breakers, thesaurus files not upgraded
Custom applications using SharePoint 2003
administrative object model must be rewritten to
use MOSS 2007 object model
40. Resources
Microsoft Enterprise Search website
http://www.microsoft.com/enterprisesearch/
Webcast: Installing and Configuring Search in MOSS
2007http://msevents.microsoft.com/cui/WebCastEventDetails.aspx?culture=en
US&EventID=1032325467&CountryCode=US
Tune Search server 2008
http://www.nonlinear.ca/blog/index.php/2008/02/27/how-to-tune-microsoft-
search-server-express-2008-etc/
Configuring MOSS 2007 Search (Cale Hoopes)
http://calehoopes.blogspot.com/2007/11/configuring-moss-as-search-
appliance.html
MOSS Developer Center on MSDN
http://msdn.microsoft.com/office/server/moss/default.aspx
MOSS 2007 Software Developers Kit http://msdn2.microsoft.com/en-
us/library/ms550992.aspx
MOSS 2007 on TechNet http://technet2.microsoft.com/Office/en-
us/library/3e3b8737-c6a3-4e2c-a35f-f0095d952b781033.mspx
Search Optimization for a MOSS 2007 Content Management site:
http://msdn.microsoft.com/en-us/library/cc721591.aspx
Faceted Search from the Microsoft SharePoint Team Blog
http://blogs.msdn.com/sharepoint/archive/2008/03/17/open
41. More Resources
Enterprise search bloghttp://blogs.msdn.com/enterprisesearch/
MOSS BDC Search
http://blogs.msdn.com/gunterstaes/archive/2007/01/16/putting-it-all-together-
moss-2007-business-data-catalog-search-excel-services-sql-analysis-
services.aspx
Find it All with SharePoint Enterprise Search http://technet.microsoft.com/en-
us/magazine/cc162512.aspx
Google Enterprise Connector for MOSS 2007
http://code.google.com/apis/searchappliance/documentation/50/connector_ad
min/sharepoint_connector.html
Ontologica Search for MOSS 2007
http://www.ontolica.com/upload/pdf/factsheets/ontolicasearch_featurelist.pdf
Michael Gannotti on SharePoint
http://sharepoint.microsoft.com/blogs/mikeg/Lists/Categories/Category.aspx?N
ame=Search%20Technologies
Sitemap.xml Generator:
http://www.thesug.org/blogs/lsuslinky/Lists/Posts/Post.aspx?ID=14
SEO Advice from a Propellerhead for … : http://www.mossseo.com/
42. Even More Resources
MOSS 2007 Administrator Documentation
http://jamorgan.wordpress.com/2006/09/07/administrator-documentation-for-
moss-2007-wss-v3/
SharePoint Search linkshttp://www.virtual-
generations.com/2007/01/29/sharepoint-moss-2007-search-links/
All About SharePoint : S.S. Ahmed
http://www.sharepointblogs.com/ssa/archive/2007/01/19/working-with-
sharepoint-search-part-1.aspx
Working with MOSS search - creating scopes
http://www.sharepointblogs.com/ssa/archive/2007/01/19/working-with-
sharepoint-search-part-2.aspx
MOSS 2007 search customization
http://blogs.technet.com/pavelka/archive/2007/05/24/moss-2007-search-
customization.aspx
MOSS 2007 Search & Indexing
http://www.sharepointblogs.com/zimmer/archive/2006/11/16/moss-2007-
search-and-indexing.aspx
Create a custom Search Page
http://www.sharepointblogs.com/zimmer/archive/2007/08/25/moss-2007-
connect-a-custom-search-page-to-a-custom-search-scope.aspx
44. Auto Classification Products
Concept Searching
Auto-classifies documents for MOSS 2007
Uses established probabilistic methods to distinguish
multiword concepts and weight by importance (relevance)
Extracts concepts and weights their relevance to searcher
query
– Presents for search refinement
http://www.conceptsearching.com/conceptHMSO/ (insider
trading)
Integration with MOSS
Extracts metadata and compound terms
Incorporates with existing taxonomy if one exists
Appends metadata and stores as MOSS property
Part of the main MOSS index
Uses standard MOSS administration features
45. Adjusting Relevance Property weights
Assign different weights to properties so that certain
properties such as „Title‟ have a bigger influence on
ranking
Change default property weights through the Schema
Object Model
using Microsoft.Office.Server.Search.Administration;());
Ranking ranking = new Ranking(SearchContext.GetContext( appGuid ));
//dump parameters
foreach (RankingParameter param in ranking.RankingParameters)
{
RankingParameter lookedup = ranking.RankingParameters[param.Name];
Console.WriteLine(lookedup.Name + quot;: quot; + lookedup.Value);
}
//Lookup by index
for (int i = 0; i < ranking.RankingParameters.Count; i++){
RankingParameter param = ranking.RankingParameters[i];
Console.WriteLine(param.Name + quot;: quot; + param.Value); }
//Setting the weight of property ‘prop’ to ‘weight’
ranking.RankingParameters[property].Value = float.Parse(weight);
ranking.StartRankingUpdate(RankingUpdateType.ClickDistanceUpdate);
Console.Write(quot;Updating ..quot;);
while (ranking.Status != RankingUpdateStatus.Idle)
{ Console.Write('.');
System.Threading.Thread.Sleep(1000);
} Console.WriteLine(quot;Done.quot;);
Remember that Marcy Tobin wants me to let you know that this is not a trivial matter and she knows of what she speaks.
46. Push/Pull Data to Users
Alerts
Same alerting infrastructure for WSS and MOSS
– Timer service is used to handle all alerts notifications
Frequency can be set to Daily/Weekly
– Notifications for search alerts will be sent according to the creation time
„Alert Me‟ link can be added/removed using a web part property
on the Search Action Links web part and on the Search Core
Results web part
A rollup of all user‟s alerts for a site collection
– http://<sitecollection>/_layouts/MySubs.aspx
Alert “gotchas”
– No “My Alerts Summary” web part
– No upgrade path from SPS2003 alerts to MOSS 2007 alerts except for
WSS alert types
RSS Feeds
Ability to subscribe for an RSS feed on the search results
„RSS‟ link can be added/removed using a web part property on the
Search Action Links web part and on the Search Core Results web part
47. Protocol Handlers
Connects to a content source and
enumerates the documents
Ships with support for
Web Content, NTFS File Shares, Exchange
Public Folders, Lotus Notes Databases,
SharePoint Content, SharePoint profiles, and
Business Data Catalog
Partners providing support for
Documentum, Hummingbird, OpenText, FileNet,
Interwoven, and others
http://msdn.microsoft.com/library/en-
us/spssdk/html/_introduction_to_a_protocol_handl
er.asp?frame=true
48. The Query object model
KeywordQuery request = new KeywordQuery(site);
request.QueryText = strQuery;
request.ResultTypes |= ResultType.RelevantResults;
//if we want to get more than one result table
//request.ResultTypes |= ResultType.SpecialTermResults;
//Setting optional parameters on the Query object
request.RowLimit = 10;
request.StartRow = 0;
request.KeywordInclusion = KeywordInclusion.AllKeywords;
//Executing the query
ResultTableCollection results = request.Execute();
49. Metadata Property Mapping
Crawled properties
Emitted by iFilters and Protocol Handlers
Identified by a property set (GUID) and property
ID (name or numeric ID)
Managed properties
Mapping target for crawled properties (many-to-
many)
Identified by internal ID
Friendly name used in queries
– Can be used in the query with property:
Value