This document discusses searching with Apache Solr, including what it is, when to use it, and how to implement it. Solr is a powerful and feature-rich search platform that can be used when basic search systems are no longer sufficient, as it supports advanced search capabilities and large datasets. The document outlines how to set up Solr, including choosing a container, configuring the Solr configuration files, defining fields, indexing data, and implementing different types of searches.
44. Default Search: Consider useful analyzers. Potentially match on more fields. Enrich or refine results with personal data. More in-depth results.
45. Advanced Search: Offer user control. Consider search storage: data size vs. additional queries. To return more / fewer results: “Search entire document”, “Filter by Colour”.
48. We’re Hiring: NL (Vlissingen, Utrecht); UK (London, Sheffield, Liverpool). Speak to me at the end… pmatthews@ibuildings.com
49. Thank you. Resources: Links: http://www.delicious.com/paulm86/solr This talk: http://joind.in/3221 Contact me: @paulmatthews86, http://about.me/paul.matthews
Editor's Notes
Twitter: @paulmatthews86. Personal blog: 86p (technical and non-tech). Software Engineer at Ibuildings. Techportal: MongoDB, Solr (May 2011). Solr projects: travel company, media company.
This talk: What is Solr? When is the right time? Why search? How: start the journey (investigate, explain to the business, integrate). Who is this talk aimed at? Developers: toying with search, using DB search, starting with search.
This talk: When: identifying the right time. Why: search benefits; the dark horse. How: start the journey (investigate, explain to the business, integrate). Who is this talk aimed at? Developers: toying with search, using DB search, starting with search.
What is search? Text-based navigation to content / products. Customers describing something. Capture queries. Sorting. Organising content. Examples: quick search, category listing, advanced search.
The Power of Search: From LIKE to SOLR
First up: DB LIKE.
Pros: Little effort to use or understand. Cons: Not good. User data: copes badly with input of more than 1 word.
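The one-word limitation of LIKE noted above can be seen in a minimal sketch. It uses Python's built-in sqlite3 module as a stand-in for whatever database the site runs on; the `articles` table and its rows are hypothetical.

```python
import sqlite3

# In-memory database with a hypothetical "articles" table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE articles (id INTEGER PRIMARY KEY, title TEXT)")
conn.executemany(
    "INSERT INTO articles (title) VALUES (?)",
    [("Searching with Solr",), ("Solr for fast searching",), ("Cooking with gas",)],
)


def like_search(term):
    """Substring match with LIKE: the user's input is treated as one literal string."""
    rows = conn.execute(
        "SELECT title FROM articles WHERE title LIKE ? ORDER BY id",
        (f"%{term}%",),
    )
    return [title for (title,) in rows]


single_word = like_search("solr")            # one word: both Solr titles match
multi_word = like_search("solr searching")   # two words: no title contains this exact substring
```

Both Solr titles contain the words "solr" and "searching", yet the two-word search returns nothing because LIKE only sees one contiguous string.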
Full text: lots of people use it.
Pros: Some power. Convenient. In the DB. Cons: Feature poor. Slow.
Basic / easy-to-use proper search.
Pros: Can be very fast. Often simple to set up. Cons: Feature poor. Less accurate. More application code? Google Custom Search Engine: crawls the site. Xapian: simple search solution.
Pros: Powerful. Feature rich. Relatively simple. Lots of plugins. Cons: Could be overkill. Different language.
Solr: runs on Java, standalone. Requires a servlet container: Tomcat, or Jetty standalone. Lucene: search library; offers full text; high performance; Java, with other implementations available.
Who? Traffic: not for Facebook; works for the average site. Features: it has many; no need to use them all. When? Designed in from the beginning. Easily used to enrich site navigation. Implementation as a post-live project. Implementation into existing open source software: Drupal, Magento.
Spending time / effort / money on the search box: fixing bugs, endless tuning, adding functionality. Customers complaining: not finding content, high bounce rates, site is slow, not finding the *right* content.
Large data sets: 10,000 records. Speed: LIKE queries, MySQL full-text. Site performance: slow log? Results: inaccurate, missing. Graceful degradation: important for quality, low cost.
Is Solr right for me? Before answering. Terms: find materials; communicate with people. Functionality: know the most-used functionality; don't re-invent the wheel.
Main 2: Database tables: Data Import Handler; easy, just config. API: anything can publish to the API; hooked into content. Also: CSV & XML. Solr Cell: rich docs (PDF, MS Office).
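As a rough illustration of the API route, the sketch below builds a Solr XML update message (`<add><doc><field name="...">`) from plain dictionaries. The document fields are hypothetical, and actually sending the message (an HTTP POST to the core's update handler) is left out.

```python
import xml.etree.ElementTree as ET


def build_add_message(docs):
    """Build a Solr XML <add> message from a list of dicts (field name -> value)."""
    add = ET.Element("add")
    for doc in docs:
        doc_el = ET.SubElement(add, "doc")
        for name, value in doc.items():
            field = ET.SubElement(doc_el, "field", name=name)
            field.text = str(value)
    return ET.tostring(add, encoding="unicode")


# Hypothetical document; field names must match what the schema defines.
message = build_add_message([{"id": 1, "title": "Searching with Solr"}])
```

Hooking a function like this into the code that saves content is one way to keep the index in step with the site, assuming a commit is issued afterwards.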
Parse: text, generate index. Removes junk. Improves matches. Half now, half later: reduces time spent searching.
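The "half now, half later" idea can be sketched with a toy inverted index: the text is parsed once at index time, so a search becomes a lookup. This is an illustration of the concept only, not how Lucene stores its index; the documents are hypothetical.

```python
from collections import defaultdict


def build_index(docs):
    """Index time: parse each document once and record which docs contain each token."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in text.lower().split():
            index[token].add(doc_id)
    return index


def search(index, word):
    """Query time: a single dictionary lookup instead of scanning every document."""
    return sorted(index.get(word.lower(), set()))


docs = {1: "Searching with Solr", 2: "Solr is fast", 3: "Slow LIKE queries"}
index = build_index(docs)
hits = search(index, "solr")
```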
Analyzer: groups the actions of parsing. Important to do the same / similar when searching.
Tokenizer: strings to tokens. Example ones: Whitespace: splits on whitespace. Keyword: treats the whole input as a single token. Standard: general purpose, adds context.
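A minimal sketch of what the tokenizers do to a string. These are plain Python stand-ins, not Solr's implementations: the keyword version assumes the behaviour of Solr's keyword tokenizer (the whole input becomes one token), and the "standard" version is only a crude approximation that drops punctuation.

```python
import re


def whitespace_tokenize(text):
    """Split on runs of whitespace; punctuation stays attached to the tokens."""
    return text.split()


def keyword_tokenize(text):
    """Emit the whole input as a single token (useful for IDs or exact-match fields)."""
    return [text]


def rough_standard_tokenize(text):
    """Crude stand-in for a general-purpose tokenizer: keep word characters, drop punctuation."""
    return re.findall(r"[A-Za-z0-9']+", text)


sample = "Paul's talk: From LIKE to Solr!"
by_whitespace = whitespace_tokenize(sample)
by_keyword = keyword_tokenize(sample)
by_standard = rough_standard_tokenize(sample)
```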
Filters: transform tokens. Lower case. Stop: filters out stop words (a, if, to, and). Standard: removes dots and ’s (context only). Synonym.
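The filters above can be pictured as a chain applied to the token stream. The sketch below is an assumption-laden illustration in plain Python: the stop words come from the notes, while the synonym table is hypothetical.

```python
STOP_WORDS = {"a", "if", "to", "and"}          # stop words listed in the notes
SYNONYMS = {"tv": ["tv", "television"]}        # hypothetical synonym table


def lowercase_filter(tokens):
    return [t.lower() for t in tokens]


def stop_filter(tokens):
    return [t for t in tokens if t not in STOP_WORDS]


def synonym_filter(tokens):
    expanded = []
    for t in tokens:
        expanded.extend(SYNONYMS.get(t, [t]))
    return expanded


def analyze(text):
    """Tokenize on whitespace, then run each filter in order."""
    tokens = text.split()
    for token_filter in (lowercase_filter, stop_filter, synonym_filter):
        tokens = token_filter(tokens)
    return tokens


result = analyze("A TV and a Radio")
```

Order matters here: lower-casing runs first so that the stop and synonym lookups see normalised tokens.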
Hit highlighting. Note: remember to set the delimiter; not everything is a web page.
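A sketch of the request parameters involved, assuming Solr's standard highlighting parameters (`hl`, `hl.fl`, `hl.simple.pre`, `hl.simple.post`). The field name `title` and the square-bracket delimiters are hypothetical choices for a non-HTML client.

```python
from urllib.parse import urlencode


def highlight_params(query, field, pre, post):
    """Build query parameters that switch highlighting on with custom delimiters."""
    return {
        "q": query,
        "hl": "true",
        "hl.fl": field,           # field(s) to highlight
        "hl.simple.pre": pre,     # text inserted before each hit
        "hl.simple.post": post,   # text inserted after each hit
    }


params = highlight_params("solr", "title", "[", "]")
query_string = urlencode(params)
```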
Phrase queries: "search for a phrase". Wildcard queries: match with wildcards, ? for a single character, * for multiple. Fuzzy queries: Levenshtein distance, matches terms similar to a word: word~. Proximity queries: words close together: "two words"~12. Range queries: between two values: started:[20110101 TO 20120101] is inclusive; name:{Paul TO Jeff} is exclusive.
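The query types above are all expressed in the `q` parameter of a select request. The sketch below only builds URLs; the host, port and core name (`articles`) are assumptions, and the wildcard strings are illustrative rather than taken from the talk.

```python
from urllib.parse import urlencode

# Hypothetical Solr location and core name.
SOLR_SELECT = "http://localhost:8983/solr/articles/select"

EXAMPLE_QUERIES = {
    "phrase": '"search for a phrase"',
    "wildcard_single": "te?t",                           # ? matches one character
    "wildcard_multiple": "test*",                        # * matches many characters
    "fuzzy": "word~",                                    # terms similar to "word"
    "proximity": '"two words"~12',                       # words within 12 positions
    "range_inclusive": "started:[20110101 TO 20120101]",
    "range_exclusive": "name:{Paul TO Jeff}",            # as written in the notes
}


def select_url(query):
    """URL-encode a query string into a select request URL."""
    return SOLR_SELECT + "?" + urlencode({"q": query})


range_url = select_url(EXAMPLE_QUERIES["range_inclusive"])
```

Encoding the query rather than concatenating it by hand matters because the syntax relies on characters such as quotes, brackets and colons.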
Fields: Single field: target search. Multiple fields: build queries.
Faceted: set counts; filter data; multiple classifications.
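What facet counts give you can be sketched without Solr at all: counts per value of a field over the current result set, which then change as the user filters. The product data is hypothetical; in Solr itself this is requested with parameters such as `facet=true` and `facet.field`.

```python
from collections import Counter


def facet_counts(docs, field):
    """Count how many documents carry each value of the given field."""
    return Counter(doc[field] for doc in docs if field in doc)


def filter_by(docs, field, value):
    """Narrow the result set to documents matching one facet value."""
    return [doc for doc in docs if doc.get(field) == value]


products = [
    {"name": "Mug", "colour": "red", "category": "kitchen"},
    {"name": "Plate", "colour": "white", "category": "kitchen"},
    {"name": "Scarf", "colour": "red", "category": "clothing"},
]

colour_counts = facet_counts(products, "colour")
red_products = filter_by(products, "colour", "red")
category_counts_for_red = facet_counts(red_products, "category")
```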
Ordered results based on best match, or order by any field.
Simultaneous update and search
Blog post to explain. Configure: container, Solr. Index: documents, from any source. Search: default search, advanced search.
Container setup: choose, configure, make accessible.
Define the data: define what is indexed; define what is stored. Integral to returning relevant search responses. Requires tweaking to get right. Be conscious of space: the size of the index affects speed.
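The indexed / stored distinction is set per field in the schema. As a sketch, the helper below generates `<field>` entries of the kind found in schema.xml; the field names and the `text` type name are hypothetical and depend on the field types your own schema defines.

```python
import xml.etree.ElementTree as ET


def field_xml(name, field_type, indexed, stored):
    """Render one schema <field> definition with indexed/stored flags."""
    element = ET.Element(
        "field",
        {
            "name": name,
            "type": field_type,
            "indexed": str(indexed).lower(),   # searchable?
            "stored": str(stored).lower(),     # returned in responses?
        },
    )
    return ET.tostring(element, encoding="unicode")


# Title: searchable and returned. Body: searchable only, which keeps responses and storage smaller.
title_field = field_xml("title", "text", indexed=True, stored=True)
body_field = field_xml("body", "text", indexed=True, stored=False)
```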
Docs to schema spec. Indexing by database or API.
Partial words: analyzing? Search all fields, or possibly just the main ones. Response: less data; stay clear of additional queries; consider caching.
Consider using stemming analyzers to return more results. Increase matching columns. Use session data to affect results. Consider caching effects. More response data required.
Users modify their search: specify fields. For enriching the results: consider bloated storage; trade-off with additional queries; tweak later? Advanced, for returning more / fewer results: search more of the document; filter on a property.
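One way an advanced search form might map onto a request, assuming Solr's common query parameters (`q`, `fq` for filters, `fl` for returned fields, `rows` for result count). The function name and the field names are hypothetical.

```python
from urllib.parse import urlencode


def advanced_search_params(text, filters=None, fields=None, rows=10):
    """Translate user choices into a list of (parameter, value) pairs."""
    params = [("q", text), ("rows", str(rows))]
    for field, value in (filters or {}).items():
        # Each filter narrows the results, e.g. "Filter by Colour".
        params.append(("fq", f'{field}:"{value}"'))
    if fields:
        # Limit the response to the fields the page actually needs.
        params.append(("fl", ",".join(fields)))
    return params


params = advanced_search_params(
    "mug", filters={"colour": "red"}, fields=["id", "name"], rows=20
)
query_string = urlencode(params)
```

Keeping filters in `fq` rather than folding them into `q` leaves the main query text untouched when a user adds or removes a filter.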