How Google Works
Lisa Holmberg
Bibliographical Center for Research
lholmber@bcr.org
What happens when you
Google?
Google Search Results
•URL, size, date last crawled
•Cached link
•Pages like this one
Database Google Used
Approximate #
of hits
Ads selected by Google
based on your search
terms
Search terms are in bold
Google Cache
Google Cached
 Cached reveals the page as Google
found it
 may differ from the current page
 Cached exists if a page is full-text indexed
 About 1 billion pages in Google are not
cached
 Not fully searchable
 no Cached if a page owner requests not to be
cached
Boolean Searching
 And
Default AND between terms
The Fuzzy And
 only some of the words if a page is
“important”
 words may occur only in link to the
page
 words occur somewhere on the site a
page belongs to
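
The default AND can be pictured as a set intersection over an index of pages. The sketch below is a minimal illustration with a hypothetical three-page "web"; it implements only a strict AND, and the function names `build_index` and `and_search` are made up for this example. Google's actual "fuzzy" AND relaxes the rule in the ways listed above.

```python
from collections import defaultdict


def build_index(pages):
    """Map each lowercase word to the set of page ids that contain it."""
    index = defaultdict(set)
    for page_id, text in pages.items():
        for word in text.lower().split():
            index[word].add(page_id)
    return index


def and_search(index, query):
    """Return ids of pages that contain every query term (strict AND)."""
    terms = query.lower().split()
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


# Hypothetical pages, for illustration only.
pages = {
    "a": "school librarian job listings",
    "b": "public librarian salary survey",
    "c": "school lunch menu",
}
index = build_index(pages)
print(and_search(index, "school librarian"))  # {'a'}
```

Only page "a" contains both words, so it is the only result; a fuzzy AND might also return "b" or "c" if they were judged important enough.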
Stemming
 Google stems “when appropriate”
 Includes plural, singular, past, present
tense of words in search
Search: school librarian
Result: library, librarian, library’s, librarian’s
 Single word searches aren’t stemmed
What Google doesn’t search
(unless you ask nicely)
 Common or Stop words are ignored
 No official list from Google
 Auto-phrasing
 Searches containing only stop
words
Google Search Results
 More than 100 factors in the
metrics
 On-the-page metrics
 Word order matters
 Word frequency
 Automatic-phrasing
 In the title
 In unique fonts
 In prominent areas (like lists)
PageRank
 Off-the-page metrics
 Words describing the link
 Links on one site to another are like
votes-- PageRank
 Stuffing the ballot box
 Reputation of the ‘voting’ page
 Can’t buy a better PageRank
 PageRank independent of search terms
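
The "links as votes" idea can be sketched with the simplified textbook form of PageRank, computed by repeated iteration. This is an illustration under stated assumptions, not Google's production ranking: the link graph is hypothetical, the damping factor of 0.85 is the commonly cited textbook value, and the function name `pagerank` is made up for this example. Note that the function takes no search terms, which matches the point that PageRank is independent of the query.

```python
def pagerank(links, damping=0.85, iterations=50):
    """links maps each page to the list of pages it links to."""
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page starts each round with a small baseline score.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page splits its vote evenly among the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its vote over all pages.
                share = damping * rank[page] / n
                for target in pages:
                    new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical link graph, for illustration only.
links = {
    "home": ["about", "news"],
    "about": ["home"],
    "news": ["home"],
    "blog": ["home"],
}
ranks = pagerank(links)
for page, score in sorted(ranks.items(), key=lambda item: -item[1]):
    print(page, round(score, 3))
```

In this graph "home" collects votes from three pages and ranks first, while "blog", which nothing links to, keeps only the baseline score. A vote from "home" also counts for more than a vote from "blog", which is the "reputation of the voting page" point.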
But how do I make my
searches better?
Improving Google’s AND
+ Inclusion operator
 Force searches on stop words
 Turns off stemming
Use quotation marks for phrases
 “public librarian”: 234,000 hits, 0.4% of
the 58,600,000 hits for public librarian
 Forces searches on stop words
 Turns off stemming
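
The 0.4% figure follows directly from the two hit counts on this slide. A quick check, using only the numbers quoted above (the variable names are illustrative):

```python
phrase_hits = 234_000       # "public librarian" in quotation marks
unquoted_hits = 58_600_000  # public librarian without quotation marks

share = phrase_hits / unquoted_hits
print(f"{share:.1%}")  # 0.4%
```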
Improving Google’s AND
 Hyphen makes phrases and searches
with and without hyphens
 bite-sized retrieves:
bite-sized, bite sized, bitesized
Other examples?
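
The three spellings that a hyphenated term is said to retrieve can be enumerated mechanically. The helper below is a minimal sketch (the name `hyphen_variants` is made up for this example); it shows which forms to expect and says nothing about how Google implements the matching.

```python
def hyphen_variants(term):
    """Return the hyphenated, spaced, and closed-up forms of a hyphenated term."""
    parts = term.split("-")
    return ["-".join(parts), " ".join(parts), "".join(parts)]


print(hyphen_variants("bite-sized"))  # ['bite-sized', 'bite sized', 'bitesized']
```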
Boolean Searching
 Or
 Not
Search Operators
OR search
 Search for two terms at once
- exclusion operator
 Use with care;
Search:
twins Minnesota 2,750,000
Eliminate undesired words
twins Minnesota -sports 1,300,000
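
One way to picture these operators is as set operations on the pages that match each term: AND is an intersection, OR is a union, and the exclusion operator is a set difference. The page ids below are hypothetical and chosen only for illustration.

```python
# Hypothetical sets of page ids matching each term, for illustration only.
matches = {
    "twins": {1, 2, 3, 4},
    "minnesota": {2, 3, 4, 5},
    "sports": {3, 4, 6},
    "siblings": {1, 7},
}

both = matches["twins"] & matches["minnesota"]   # twins Minnesota
no_sports = both - matches["sports"]             # twins Minnesota -sports
either = matches["twins"] | matches["siblings"]  # twins OR siblings

print(sorted(both))       # [2, 3, 4]
print(sorted(no_sports))  # [2]
print(sorted(either))     # [1, 2, 3, 4, 7]
```

The exclusion step drops every page that mentions the unwanted word, including pages that might otherwise have been relevant, which is why the slide advises using it with care.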
Search Operators
* full-word wild card, word substitution
 Ideal for partly remembered quotes
 Searching for answers to questions
 Proximity searches
~ synonym operator
 ~guide searches for: tutorial, manual, help,
map, tips
Limitless Options for Limits
 Intitle: terms are searched for in title only
 Pages concentrate on term
Hybrid cars intitle:mileage
 Combine with OR
intitle:"new urbanism" OR intitle:"sustainable
communities"
 allintitle:
 Combine with site:
allintitle: hybrid cars mileage -site:.com
Using URLs
 Limit to a domain (edu, com, etc)
site:edu OR site:gov OR site:lib.co.us
 Search within a site
site:memory.loc.gov “dust bowl”
 Use Google as a search engine for a site
 Can ONLY use first part of URL
 Omit http: & final /
inurl:dustbowl
 searches for term anywhere in URL
Finding that file
 Filetype:
 Search for a particular type of document
tax return filetype:pdf
 Exclude a filetype
-filetype:xls
 Can use view as HTML
 Avoid viruses
 Allows you to read it even if you don’t have the
software
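
The limit operators from the last few slides (intitle:, site:, filetype:, and exclusion) can be combined in one search string. The helper below is a minimal sketch for composing such strings and turning them into a search URL; the function name `build_query` and its parameters are made up for this example, and the `https://www.google.com/search?q=` URL pattern is an assumption about the public search page rather than a documented API.

```python
from urllib.parse import urlencode


def build_query(terms, site=None, intitle=None, filetype=None, exclude=()):
    """Compose a search string from plain terms plus optional operators."""
    parts = list(terms)
    if intitle:
        parts.append(f"intitle:{intitle}")
    if site:
        parts.append(f"site:{site}")
    if filetype:
        parts.append(f"filetype:{filetype}")
    # Excluded words get a leading hyphen, with no space after it.
    parts.extend(f"-{word}" for word in exclude)
    return " ".join(parts)


query = build_query(["tax", "return"], filetype="pdf")
print(query)  # tax return filetype:pdf

# urlencode escapes spaces and colons so the string is safe inside a URL.
url = "https://www.google.com/search?" + urlencode({"q": query})
print(url)  # https://www.google.com/search?q=tax+return+filetype%3Apdf
```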
More about Google
 Google Guide
http://www.googleguide.com/
 Google Librarian Center
http://www.google.com/librariancenter/index.html

Editor's Notes

  1. REC
  2. Google doesn’t actually search the web. It searches its index of the web… a copy. The doc server assembles the results that the index server produces. This is where Google’s PageRank software comes in to determine what order the results are in.
  3. Stress that Google is searching its database of copies of the web, spread out over 500 computers
  4. Proof that Google is searching a database and not the real web.
  5. Google’s default, but it’s fuzzy Problems? words can occur anywhere in results pages may have different meanings or contexts some pages may not contain all of your words some may not have any of your words
  6. And Talk briefly about Boolean searches (how many know what this is)
  7. Stemming: The word is automatically searched as the stem or root, with many endings allowed. kite flying retrieves words with kite, kites, flying, fly, flyer’s, flyers’, flyers. Side note: not case sensitive. Write in answers for how to turn it off: operator, quotes, single-word searches, or searches using only ‘stop words’
  8. Write in Turn off answers Operator Quotes Single word
  9. Google Metrics: Over 100 different factors in each search; the algorithm is always changing and the spider is continually updating the database (thus results change). Proprietary software. Search words can appear in the title of the page, a link to the page, the URL of the page, and the page itself. Pages are weighted by prominence and frequency of words. Searches for all your terms on a page; even better, your terms near each other; best of all, pages where your search terms appear in the order you typed them. Weights links pointing to the page (a popularity contest doesn’t return the most credible resource). Links from more popular sites are weighted more.
  10. Reputation: Some receive a high reputation by default: government agencies, well-known or prominent companies, university faculty (Smithsonian, NASA, JAMA…). Good reputation by association with the above.
  11. Use quotes or the inclusion operator to turn off stemming and force a search on stop words.
  12. Always use the hyphen on words that might be hyphenated, since it searches both. Words are treated as a phrase, similar to using quotes. Other examples: Asian-American, African-American, mother-in-law, ex-wife, e-mail
  13. OR: Useful when stemming doesn’t cover the variation you’re looking for; to cover a common misspelling; for synonyms (parent/guardian); to address apostrophe variations. Can also use | instead of OR. NOT: Not isn’t supported by Google; use (-) instead.
  14. Wildcard: Recently ‘softened’; no need to use more than one asterisk per word. Examples: The parachute was invented by *; Vitamin * is good for eyes. Ask class for examples: ~college ~zoo ~library
  15. Other uses?
  16. How would you use this? Google toolbar feature
  17. How would you use this?