SlideShare a Scribd company logo
1 of 36
Web-scale Discovery Tools
and the Backgrounding of
Government Information
CHRISTOPHER C. BROWN, UNIVERSITY OF DENVER, MAIN LIBRARY
ELECTRONIC RESOURCES & LIBRARIES, AUSTIN, TX, FEB. 23, 2015
The Days of Broadcast Search
Often referred to as “federated search”, but not
It was disintegrated search, and maybe you could say federated results
Characterized by long waits (connection wait times, handshakes, mapping of search terms,
merging and “deduping” of results, messy results)
Searching across metadata. No FT searching.
Metasearch or broadcast search was a fail
Metasearch or Broadcast Search
Innovative technology, but never lived up to expectations
Web Metasearch Examples: Metasearch, Dogpile
Library Metasearch Examples: MetaLib, Research Pro, ENCompass, Central Search/360 Search,
WebFeat, AGent, iBistro
Metasearch or Broadcast Search
Was often referred to as “federated search” but it
was not; at best it was federated results.
Web-scale discovery tools are the real “federated
search”. But since vendors already used that term,
they had to invent “discovery”.
Query
Merging and de-duping on-the-fly
Federated Results
Metadata and sometimes FT
Query
Federated Search and Federated Results
Broadcast Search Webscale Discovery
Metadata only
Information Silos
Breadth and Depth
Breadth refers to the number and kinds
of resources covered. Google Scholar is
rather narrow in breadth, covering
scholarly articles, selected technical
reports, and Google Books
“bleadthrough”. Discovery tools are
much broader in scope, covering books
and ebooks, magazine articles,
scholarly articles, trade publications,
newsletters, newspapers, dissertations,
technical reports, maps, audio-visual
materials, institutional repositories,
and many other sources.
Depth refers to how deeply a search
tool goes down into the resource.
Google (Google Web, Google Scholar,
and Google Books) usually indexes
full text of nearly everything.
Discovery tools vary greatly in their
full text search reach. Sometimes
they don’t have access to the full
text, only metadata. Other times they
have access but choose not to
provide full text search access.
Metasearch tools were weak, but they
searched rather evenly
Magazines
Scholarly
Journals
Dissertations
Surface
Searching
Newspapers
eBooks
Print Books
Gov Info
Indexing: title, author, keywords, abstract
Gov InfoMagazines Scholarly
Journals
Dissertations
Discovery Tools
Surface
Searching
Deep
Searching
Newspapers
eBooks
Print
Books
Full text of content – Every last word
Google Scholar
Full text of content – Every last word
Scholarly
Journals
Google
Books
“bleedthrough”
Misc:
tech rpts,
other
Let’s Face It: Google Scholar Does It Better: There is Room for Improvement
Google Scholar: Setting the Bar
Discovery Tools are not very good at
exposing primary sources
Not all primary sources are created equal
Archival resources: unique and interesting
Government information: arguably one of most important primary sources – public policy
affects us all.
Primary Sources Include:
• Vendor primary sources (archival collections)
• Institutional repositories or Digital repositories
• Government primary sources
Why do Discovery Tools succeed more than metasearch tools?
The Information Access Anomaly
Book (average)
Journal Article
(average)
Google
(Scholar/Books)
Typical Length - full
text (FT)
200 pages x 400 = 80,000
words
15 pages x 400 = 6,000
words
Surrogate Record
(SR)
50-100 words (75 ave.) 300-500 words (400 ave. 1)
SR to FT ratio 1 to 10,666 1 to 15 1 to 1
1 http://www.writersservices.com/wps/p_word_count.htm
What this chart means: Very often at the Reference Desk, student will say
“Why doesn’t the library have any books on my topic?” Actually, we do; it’s
just that our discovery tools are weak.
Relevance vs. Discovery
Metadata
FullText
Higher Relevance Higher Discovery
Full text indexing within Discovery Tools is uneven, maybe even erratic. Efforts
should be made to increase full text presence within their indexing.
Unique Library of Congress Trials
A unique opportunity in 2012
Testing Discovery Systems
Select item to be tested (scholarly article, ebook, government publication etc.)
Test each of the four discovery tools to ensure metadata is present
Test each of the four to see if they can retrieve full text (test both with and without quotes around
text)
Text should not contain gremlin characters or cross lines (line breaks)
Use Google as a control (Google Web, Google Scholar, Google Books)
Test G. Scholar Summon EDS Primo WC Disc.
Citation Yes Yes Yes Yes Yes
Text1 Yes Yes No No No
Text2 Yes Yes No No No
Wood, Phillip K., Kenneth J. Sher, and Patricia C. Rutledge. "College student alcohol consumption,
day of the week, and class schedule." Alcoholism: Clinical and Experimental Research 31, no. 7
(2007): 1195-1207.
Text 1: drinking but based instead on the sum of the number of
Text 2: night of drinking. In this case, it is not the time spent
Test 1
Test 2
Citation: Garcia, Sònia, Teresa Garnatje, and Aleš Kovařík. "Plant rDNA database: ribosomal DNA loci information
goes online." Chromosoma 121, no. 4 (2012): 389-394.
Text 1: "FISH data is stored prior to its publication, without being"
Text 2: "outline of how to work with the database. The Simple"
Test G. Scholar Summon EDS Primo WC Disc.
Citation Yes Yes Yes Yes Yes
Text1 Yes Yes No No No
Text2 Yes Yes No No No
Test 3
Pound, Pandora, Shah Ebrahim, Peter Sandercock, Michael B. Bracken, and Ian Roberts. "Where is the evidence
that animal research benefits humans?." BMJ 328, no. 7438 (2004): 514-517.
Text 1: An unpublished study by Ciccone and Candelise
Text 2: Moreover, if animal experiments fail to inform medical
Text 3: single consultations. PP and SE applied (unsuccessfully) to the
Test G. Scholar Summon EDS Primo WC Disc.
Citation Yes Yes Yes Yes Yes
Text1 Yes Yes No Yes No
Text2 Yes Yes No Yes No
Test 4
Tartter, Molly A., and Lara A. Ray. "A prospective study of stress and alcohol craving in heavy
drinkers." Pharmacology Biochemistry and Behavior 101, no. 4 (2012): 625-631.
Text 1: "Participants were contacted for an on-line follow-up at 6 and 12 months after evaluation
in the laboratory"
Text 2: "The ACQ was modified to encompass the four quantitative indices of alcohol use
recommended"
Test G. Scholar Summon EDS Primo WC Disc.
Citation Yes Yes Yes Yes Yes
Text1 Yes Yes No Yes No
Text2 Yes Yes No Yes No
Test 5
Muggah, Robert, and Keith Krause. "Closing the gap between peace operations and post-conflict
insecurity: towards a violence reduction agenda." International Peacekeeping 16, no. 1 (2009):
136-150.
Text 1: "armed violence prevention and reduction programmes that draw upon"
Text 2: "before, during, and after wars come to a close. Armed violence does"
Test G. Scholar Summon EDS Primo WC Disc.
Citation Yes Yes Yes Yes Yes
Text1 Yes Yes Yes Yes No
Text2 Yes Yes No Yes No
Testing for Government Information
Choice of benchmark: Fdsys - http://www.gpo.gov/fdsys/
Sources for Government Information
The Government Publishing Office has these freely available tools:
Catalog of Government Publications (CGP) – library catalog, metadata only -
http://catalog.gpo.gov/
Metalib – metasearch tools, searches metadata only - http://metalib.gpo.gov/
FDsys – Repository, searches metadata and full text - http://www.gpo.gov/fdsys/
◦ FDsys searches full text by default.
◦ In 2011 FDsys officially replaced GPO Access as the official repository for Legislative, Executive, and
Judicial Branch documents that GPO hosts.
◦ FDsys was engineered from the ground up to bring quick, faceted searching for discovery of government
information.
◦ FDsys content is available for anyone to download and for third-party vendors to utilize.
Catalog of Government Publications:
GPO’s Online Catalog
http://catalog.gpo.gov/
MetaLib: the GPO Metasearch Platform
Metalib is a broadcast search for
government information. No FT
searching, only metadata.
http://metalib.gpo.gov/
Access to Archival
Databases (AAD)
System - NARA
DONE 99 30
AGRICOLA Books DONE 1 1
AUL Index to
Military Periodicals
DONE 0
Catalog of U.S.
Government
Publications (CGP)
DONE 8 8
Education
Resources
Information Center
(ERIC)
FETCHING 10000
EPA Publications
and Newsletters
DONE 10 10
Federal Digital
System (FDsys)
DONE 1352 30
Library of Congress
(LOC)
DONE 128 30
PubMed DONE 1 1
FDsys – Default FT Searching of Official
Content
http://www.gpo.gov/fdsys/
Although some of this content does not lend itself to
inclusion in Discovery Tools, some of these subsets are
essential for students to work with public policy,
legislative research, and history.
Priorities of what Full text Government
Information to Include in Discovery Tools
Congressional Reports – legislative intent (1995-present)
Congressional Hearings – social history and policy making (1995-present)
Congressional Documents – budget, treaties, special reports, and legislation-related (1995-
present)
Public and Private Laws (1995-present)
Compilation of Presidential Documents (1993-present)
Public Papers of the Presidents (1991-2010)
Economic Report of the President (1995-present)
Summon has govdocs – but their own PQ
Congressional content, not FDsys
Congressional Hearings and
Congressional Reports
Hearings are important because they can provide
valuable background information including statistics,
scientific research, social implications, and
stakeholder perspectives.
Reports are even more important that hearings in
that they can show legislative intent – why a bill is
needed. They can provide majority and minority
perspectives and section-by-section explication on
bills.
Test cases – Congressional Hearings
Hearings are important because they can provide
valuable background information including statistics,
scientific research, social implications, and
stakeholder perspectives.
Gov Test 1 – Congressional Hearings
Test Google FDsys Summon EDS Primo WC Disc.
Citation Yes Yes Cat rec only Cat rec only Cat rec only Cat rec only
Text1 Yes Yes No No No No
Text2 Yes Yes No No No No
Citation: United States. 2004. Exotic bird species and the Migratory Bird Treaty Act: oversight field
hearing before the Subcommittee on Fisheries Conservation, Wildlife and Oceans of the
Committee on Resources, U.S. House of Representatives, One Hundred Eighth Congress, first
session, Tuesday, December 16, 2003, in Annapolis, Maryland. Washington: U.S. G.P.O.
Text1: "change to the MBTA that would make clear that invasive birds are not protected"
Text2: "it is going into the Everglades system, in and around the"
Test cases – Congressional Reports
Reports are even more important that hearings in
that they can show legislative intent – why a bill is
needed. They can provide majority and minority
perspectives and section-by-section explication on
bills.
Gov Test 2 – Congressional Report
United States. 2009. Authorizing the designation of national environmental research parks by the Secretary of Energy, and for other purposes
report (to accompany H.R. 2729) (including cost estimate of the Congressional Budget Office). Washington, D.C.: U.S. G.P.O.
http://purl.access.gpo.gov/GPO/LPS115815.
Text1: "Biggert for her co-sponsorship and Ranking Member Hall for his"
Text2: "Chair GORDON. Let me again in closing say that just because we"
Test Google FDsys Summon EDS Primo WC Disc.
Citation Yes Yes Cat rec only Cat rec only Cat rec only Cat rec only
Text1 Yes Yes No No No No
Text2 Yes Yes No No No No
Gov Test 3: Compilation of Presidential
Documents
Remarks on the Patient Protection and Affordable Care Act, April 1, 2014. Compilation of
Presidential Documents. http://www.gpo.gov/fdsys/pkg/DCPD-201400224/pdf/DCPD-
201400224.pdf.
Text 1: "health insurance who didn't just a few years ago, and that's something to be proud of"
Text 2: "understand. I've got to admit, I don't get it. Why are folks working so hard for people
not"
Test Google FDsys Summon EDS Primo WC Disc.
Citation Yes Yes Cat rec only Cat rec only No No
Text1 Yes Yes Yes* No No No
Text2 Yes Yes Yes* No No No
*Not the freely available FDsys content.
Conclusions
Government information, when it is available through a Discovery
Tool, is available through licensed content, not freely available
sources.
Government information should not be backgrounded, buried with
the many other content types.
Since GPO will hand over FDsys content to any vendor that wants it,
vendors need to figure out how to acquire it, even if it means
developing a new ingest method.
What can be done?
Ask your vendor
Ask GPO
A Charge to Vendors
Keep working on relevance ranking
All more full text into your index (include all types of content, scholarly articles, magazine
articles, books and ebooks)
Work with GPO to acquire FDsys metadata and full text
The vendor that figures this out first will have a
competitive edge.
Questions?
Christopher C. Brown, Reference Technology Integration Librarian; Government Documents
Librarian
University of Denver, Main Library - http://library.du.edu/
cbrown@du.edu
(303) 871-3404

More Related Content

What's hot

Howe et al. - 2015 - BioAssay Research Database (BARD) chemical biolog
Howe et al. - 2015 - BioAssay Research Database (BARD) chemical biologHowe et al. - 2015 - BioAssay Research Database (BARD) chemical biolog
Howe et al. - 2015 - BioAssay Research Database (BARD) chemical biologEleanor Howe
 
NESCent visit: Measuring progress toward a cultural norm of shared (and reus...
NESCent visit:  Measuring progress toward a cultural norm of shared (and reus...NESCent visit:  Measuring progress toward a cultural norm of shared (and reus...
NESCent visit: Measuring progress toward a cultural norm of shared (and reus...Heather Piwowar
 
Rapid biomedical search
Rapid biomedical search Rapid biomedical search
Rapid biomedical search petermurrayrust
 
On Incentive-based Tagging
On Incentive-based TaggingOn Incentive-based Tagging
On Incentive-based TaggingFrancesco Rizzo
 
Internet searching
Internet searchingInternet searching
Internet searchingBadheeb
 
Scott Edmunds: Revolutionizing Data Dissemination: GigaScience
Scott Edmunds: Revolutionizing Data Dissemination: GigaScienceScott Edmunds: Revolutionizing Data Dissemination: GigaScience
Scott Edmunds: Revolutionizing Data Dissemination: GigaScienceGigaScience, BGI Hong Kong
 
Big Data and AI for Covid-19
Big Data and AI for Covid-19Big Data and AI for Covid-19
Big Data and AI for Covid-19Andrew Zhang
 
Opportunities and challenges presented by Wikidata in the context of biocuration
Opportunities and challenges presented by Wikidata in the context of biocurationOpportunities and challenges presented by Wikidata in the context of biocuration
Opportunities and challenges presented by Wikidata in the context of biocurationBenjamin Good
 
ContentMine: Mining the Scientific Literature
ContentMine: Mining the Scientific LiteratureContentMine: Mining the Scientific Literature
ContentMine: Mining the Scientific Literaturepetermurrayrust
 
Cochrane workshop 2016
Cochrane workshop 2016Cochrane workshop 2016
Cochrane workshop 2016TheContentMine
 
Automatic Extraction of Knowledge from Biomedical literature
Automatic Extraction of Knowledge from Biomedical literature Automatic Extraction of Knowledge from Biomedical literature
Automatic Extraction of Knowledge from Biomedical literature TheContentMine
 
Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data Managementcunera
 
Computational Research day 2015
Computational Research day 2015Computational Research day 2015
Computational Research day 2015cunera
 
Scott Edmunds, ReCon 2015: Beyond Dead Trees, Publishing Digital Research Obj...
Scott Edmunds, ReCon 2015: Beyond Dead Trees, Publishing Digital Research Obj...Scott Edmunds, ReCon 2015: Beyond Dead Trees, Publishing Digital Research Obj...
Scott Edmunds, ReCon 2015: Beyond Dead Trees, Publishing Digital Research Obj...GigaScience, BGI Hong Kong
 
Automatic Extraction of Knowledge from Biomedical literature
Automatic Extraction of Knowledge from Biomedical literatureAutomatic Extraction of Knowledge from Biomedical literature
Automatic Extraction of Knowledge from Biomedical literaturepetermurrayrust
 

What's hot (20)

Howe et al. - 2015 - BioAssay Research Database (BARD) chemical biolog
Howe et al. - 2015 - BioAssay Research Database (BARD) chemical biologHowe et al. - 2015 - BioAssay Research Database (BARD) chemical biolog
Howe et al. - 2015 - BioAssay Research Database (BARD) chemical biolog
 
NESCent visit: Measuring progress toward a cultural norm of shared (and reus...
NESCent visit:  Measuring progress toward a cultural norm of shared (and reus...NESCent visit:  Measuring progress toward a cultural norm of shared (and reus...
NESCent visit: Measuring progress toward a cultural norm of shared (and reus...
 
Rapid biomedical search
Rapid biomedical search Rapid biomedical search
Rapid biomedical search
 
On Incentive-based Tagging
On Incentive-based TaggingOn Incentive-based Tagging
On Incentive-based Tagging
 
Internet searching
Internet searchingInternet searching
Internet searching
 
Scott Edmunds: Revolutionizing Data Dissemination: GigaScience
Scott Edmunds: Revolutionizing Data Dissemination: GigaScienceScott Edmunds: Revolutionizing Data Dissemination: GigaScience
Scott Edmunds: Revolutionizing Data Dissemination: GigaScience
 
Dt35682686
Dt35682686Dt35682686
Dt35682686
 
Big Data and AI for Covid-19
Big Data and AI for Covid-19Big Data and AI for Covid-19
Big Data and AI for Covid-19
 
Opportunities and challenges presented by Wikidata in the context of biocuration
Opportunities and challenges presented by Wikidata in the context of biocurationOpportunities and challenges presented by Wikidata in the context of biocuration
Opportunities and challenges presented by Wikidata in the context of biocuration
 
ContentMine: Mining the Scientific Literature
ContentMine: Mining the Scientific LiteratureContentMine: Mining the Scientific Literature
ContentMine: Mining the Scientific Literature
 
Cochrane workshop 2016
Cochrane workshop 2016Cochrane workshop 2016
Cochrane workshop 2016
 
Cartegena051811
Cartegena051811Cartegena051811
Cartegena051811
 
Automatic Extraction of Knowledge from Biomedical literature
Automatic Extraction of Knowledge from Biomedical literature Automatic Extraction of Knowledge from Biomedical literature
Automatic Extraction of Knowledge from Biomedical literature
 
Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data Management
 
ChemSpider – A Community Platform for Chemistry and Resources Supporting the ...
ChemSpider – A Community Platform for Chemistry and Resources Supporting the ...ChemSpider – A Community Platform for Chemistry and Resources Supporting the ...
ChemSpider – A Community Platform for Chemistry and Resources Supporting the ...
 
Computational Research day 2015
Computational Research day 2015Computational Research day 2015
Computational Research day 2015
 
Searching PubMed
Searching PubMedSearching PubMed
Searching PubMed
 
Scott Edmunds, ReCon 2015: Beyond Dead Trees, Publishing Digital Research Obj...
Scott Edmunds, ReCon 2015: Beyond Dead Trees, Publishing Digital Research Obj...Scott Edmunds, ReCon 2015: Beyond Dead Trees, Publishing Digital Research Obj...
Scott Edmunds, ReCon 2015: Beyond Dead Trees, Publishing Digital Research Obj...
 
Automatic Extraction of Knowledge from Biomedical literature
Automatic Extraction of Knowledge from Biomedical literatureAutomatic Extraction of Knowledge from Biomedical literature
Automatic Extraction of Knowledge from Biomedical literature
 
Research Data Management: How will Northwestern address new sharing requireme...
Research Data Management: How will Northwestern address new sharing requireme...Research Data Management: How will Northwestern address new sharing requireme...
Research Data Management: How will Northwestern address new sharing requireme...
 

Similar to Web-scale Discovery Tools and the Backgrounding of Government Information

National latina researchers network supercharge your search 2015 webinar
National latina researchers network supercharge your search 2015 webinarNational latina researchers network supercharge your search 2015 webinar
National latina researchers network supercharge your search 2015 webinarMatthew Von Hendy
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...Carole Goble
 
googlization of information
googlization of informationgooglization of information
googlization of informationrajat00001in
 
Washington evaluators brown bag presentation ppt 2010
Washington evaluators brown bag presentation ppt 2010Washington evaluators brown bag presentation ppt 2010
Washington evaluators brown bag presentation ppt 2010Matthew Von Hendy
 
Locating scientific government information on the web
Locating scientific government information on the webLocating scientific government information on the web
Locating scientific government information on the webShannon Lynch
 
DOI Library Training Session Presentation - Locating Scientific Government In...
DOI Library Training Session Presentation - Locating Scientific Government In...DOI Library Training Session Presentation - Locating Scientific Government In...
DOI Library Training Session Presentation - Locating Scientific Government In...DOILibrary1151
 
Research Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and HumanitiesResearch Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and HumanitiesRebekah Cummings
 
Presentation from Code Camp 2017
Presentation from Code Camp 2017Presentation from Code Camp 2017
Presentation from Code Camp 2017Mitch Miller
 
WEEK4 RESPONSE 6052.docx
WEEK4 RESPONSE 6052.docxWEEK4 RESPONSE 6052.docx
WEEK4 RESPONSE 6052.docxwrite5
 
The Darkening of Government Information
The Darkening of Government InformationThe Darkening of Government Information
The Darkening of Government InformationChristopher Brown
 
Chemical information instruction in the age of Google(TM)
Chemical information instruction in the age of Google(TM)Chemical information instruction in the age of Google(TM)
Chemical information instruction in the age of Google(TM)Charles Huber
 
Force11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscapeForce11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscapemhaendel
 
Masters of Health Informatics Library Intro, 2010
Masters of Health Informatics Library Intro, 2010Masters of Health Informatics Library Intro, 2010
Masters of Health Informatics Library Intro, 2010bellalli
 

Similar to Web-scale Discovery Tools and the Backgrounding of Government Information (20)

National latina researchers network supercharge your search 2015 webinar
National latina researchers network supercharge your search 2015 webinarNational latina researchers network supercharge your search 2015 webinar
National latina researchers network supercharge your search 2015 webinar
 
Martone grethe
Martone gretheMartone grethe
Martone grethe
 
Nordic health data metadata
Nordic health data   metadataNordic health data   metadata
Nordic health data metadata
 
Reproducibility
ReproducibilityReproducibility
Reproducibility
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
 
Online Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery SystemsOnline Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery Systems
 
Nov1 webinar intro_slides v
Nov1 webinar intro_slides vNov1 webinar intro_slides v
Nov1 webinar intro_slides v
 
googlization of information
googlization of informationgooglization of information
googlization of information
 
Washington evaluators brown bag presentation ppt 2010
Washington evaluators brown bag presentation ppt 2010Washington evaluators brown bag presentation ppt 2010
Washington evaluators brown bag presentation ppt 2010
 
Locating scientific government information on the web
Locating scientific government information on the webLocating scientific government information on the web
Locating scientific government information on the web
 
DOI Library Training Session Presentation - Locating Scientific Government In...
DOI Library Training Session Presentation - Locating Scientific Government In...DOI Library Training Session Presentation - Locating Scientific Government In...
DOI Library Training Session Presentation - Locating Scientific Government In...
 
Gov Docs Overview
Gov Docs Overview Gov Docs Overview
Gov Docs Overview
 
Research Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and HumanitiesResearch Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and Humanities
 
Presentation from Code Camp 2017
Presentation from Code Camp 2017Presentation from Code Camp 2017
Presentation from Code Camp 2017
 
WEEK4 RESPONSE 6052.docx
WEEK4 RESPONSE 6052.docxWEEK4 RESPONSE 6052.docx
WEEK4 RESPONSE 6052.docx
 
The Darkening of Government Information
The Darkening of Government InformationThe Darkening of Government Information
The Darkening of Government Information
 
Chemical information instruction in the age of Google(TM)
Chemical information instruction in the age of Google(TM)Chemical information instruction in the age of Google(TM)
Chemical information instruction in the age of Google(TM)
 
Force11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscapeForce11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscape
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Masters of Health Informatics Library Intro, 2010
Masters of Health Informatics Library Intro, 2010Masters of Health Informatics Library Intro, 2010
Masters of Health Informatics Library Intro, 2010
 

More from Christopher Brown

Migrating Government Publications without Going South: Our Alma/Primo Experience
Migrating Government Publications without Going South: Our Alma/Primo ExperienceMigrating Government Publications without Going South: Our Alma/Primo Experience
Migrating Government Publications without Going South: Our Alma/Primo ExperienceChristopher Brown
 
Downsizing Your Depository: Dealing with Mandates from Your Administration
Downsizing Your Depository: Dealing with Mandates from Your AdministrationDownsizing Your Depository: Dealing with Mandates from Your Administration
Downsizing Your Depository: Dealing with Mandates from Your AdministrationChristopher Brown
 
Downsizing your Depository: Tools and Ideas
Downsizing your Depository: Tools and IdeasDownsizing your Depository: Tools and Ideas
Downsizing your Depository: Tools and IdeasChristopher Brown
 
Collecting Usage Statistics for E-Government Resources
Collecting Usage Statistics for E-Government ResourcesCollecting Usage Statistics for E-Government Resources
Collecting Usage Statistics for E-Government ResourcesChristopher Brown
 
Outbound Harvesting with Encore as a Library Space-Saving Strategy : The Cas...
Outbound Harvesting with Encore as a Library Space-Saving  Strategy : The Cas...Outbound Harvesting with Encore as a Library Space-Saving  Strategy : The Cas...
Outbound Harvesting with Encore as a Library Space-Saving Strategy : The Cas...Christopher Brown
 
Item Deselection on the Fast Track
Item Deselection on the Fast TrackItem Deselection on the Fast Track
Item Deselection on the Fast TrackChristopher Brown
 
Going All-Electronic and Keeping Track of It: Clickthrough Statistics for On...
Going All-Electronic and Keeping Track of It: Clickthrough  Statistics for On...Going All-Electronic and Keeping Track of It: Clickthrough  Statistics for On...
Going All-Electronic and Keeping Track of It: Clickthrough Statistics for On...Christopher Brown
 
Harvesting HathiTrust Documents: A New Model for Online Access
Harvesting HathiTrust Documents: A New Model for Online  AccessHarvesting HathiTrust Documents: A New Model for Online  Access
Harvesting HathiTrust Documents: A New Model for Online AccessChristopher Brown
 
The Three Googles: How I Teach Google in an Academic Setting
The Three Googles: How I Teach Google in an Academic SettingThe Three Googles: How I Teach Google in an Academic Setting
The Three Googles: How I Teach Google in an Academic SettingChristopher Brown
 
Planning the Six-State Virtual Government Information Conference
Planning the Six-State Virtual Government Information ConferencePlanning the Six-State Virtual Government Information Conference
Planning the Six-State Virtual Government Information ConferenceChristopher Brown
 
Fiche Online: A Vision for Digitizing All Documents Fiche
Fiche Online: A Vision for Digitizing All Documents FicheFiche Online: A Vision for Digitizing All Documents Fiche
Fiche Online: A Vision for Digitizing All Documents FicheChristopher Brown
 
Summon and the Art of Discovery
Summon and the Art of DiscoverySummon and the Art of Discovery
Summon and the Art of DiscoveryChristopher Brown
 
When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...
When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...
When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...Christopher Brown
 

More from Christopher Brown (14)

Migrating Government Publications without Going South: Our Alma/Primo Experience
Migrating Government Publications without Going South: Our Alma/Primo ExperienceMigrating Government Publications without Going South: Our Alma/Primo Experience
Migrating Government Publications without Going South: Our Alma/Primo Experience
 
Downsizing Your Depository: Dealing with Mandates from Your Administration
Downsizing Your Depository: Dealing with Mandates from Your AdministrationDownsizing Your Depository: Dealing with Mandates from Your Administration
Downsizing Your Depository: Dealing with Mandates from Your Administration
 
Downsizing your Depository: Tools and Ideas
Downsizing your Depository: Tools and IdeasDownsizing your Depository: Tools and Ideas
Downsizing your Depository: Tools and Ideas
 
Collecting Usage Statistics for E-Government Resources
Collecting Usage Statistics for E-Government ResourcesCollecting Usage Statistics for E-Government Resources
Collecting Usage Statistics for E-Government Resources
 
Outbound Harvesting with Encore as a Library Space-Saving Strategy : The Cas...
Outbound Harvesting with Encore as a Library Space-Saving  Strategy : The Cas...Outbound Harvesting with Encore as a Library Space-Saving  Strategy : The Cas...
Outbound Harvesting with Encore as a Library Space-Saving Strategy : The Cas...
 
Item Deselection on the Fast Track
Item Deselection on the Fast TrackItem Deselection on the Fast Track
Item Deselection on the Fast Track
 
Going All-Electronic and Keeping Track of It: Clickthrough Statistics for On...
Going All-Electronic and Keeping Track of It: Clickthrough  Statistics for On...Going All-Electronic and Keeping Track of It: Clickthrough  Statistics for On...
Going All-Electronic and Keeping Track of It: Clickthrough Statistics for On...
 
Harvesting HathiTrust Documents: A New Model for Online Access
Harvesting HathiTrust Documents: A New Model for Online  AccessHarvesting HathiTrust Documents: A New Model for Online  Access
Harvesting HathiTrust Documents: A New Model for Online Access
 
The Three Googles: How I Teach Google in an Academic Setting
The Three Googles: How I Teach Google in an Academic SettingThe Three Googles: How I Teach Google in an Academic Setting
The Three Googles: How I Teach Google in an Academic Setting
 
The Front Face of the ERM
The Front Face of the ERMThe Front Face of the ERM
The Front Face of the ERM
 
Planning the Six-State Virtual Government Information Conference
Planning the Six-State Virtual Government Information ConferencePlanning the Six-State Virtual Government Information Conference
Planning the Six-State Virtual Government Information Conference
 
Fiche Online: A Vision for Digitizing All Documents Fiche
Fiche Online: A Vision for Digitizing All Documents FicheFiche Online: A Vision for Digitizing All Documents Fiche
Fiche Online: A Vision for Digitizing All Documents Fiche
 
Summon and the Art of Discovery
Summon and the Art of DiscoverySummon and the Art of Discovery
Summon and the Art of Discovery
 
When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...
When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...
When there is no Vendor: Statistics for Free Clickthroughs via the Online Cat...
 

Recently uploaded

TRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptxTRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptxAndrieCagasanAkio
 
IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119APNIC
 
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书rnrncn29
 
SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predieusebiomeyer
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxDyna Gilbert
 
ETHICAL HACKING dddddddddddddddfnandni.pptx
ETHICAL HACKING dddddddddddddddfnandni.pptxETHICAL HACKING dddddddddddddddfnandni.pptx
ETHICAL HACKING dddddddddddddddfnandni.pptxNIMMANAGANTI RAMAKRISHNA
 
Company Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptxCompany Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptxMario
 
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书rnrncn29
 
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书zdzoqco
 
Unidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptxUnidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptxmibuzondetrabajo
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa494f574xmv
 

Recently uploaded (11)

TRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptxTRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptx
 
IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119
 
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
 
SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predi
 
Top 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptxTop 10 Interactive Website Design Trends in 2024.pptx
Top 10 Interactive Website Design Trends in 2024.pptx
 
ETHICAL HACKING dddddddddddddddfnandni.pptx
ETHICAL HACKING dddddddddddddddfnandni.pptxETHICAL HACKING dddddddddddddddfnandni.pptx
ETHICAL HACKING dddddddddddddddfnandni.pptx
 
Company Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptxCompany Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptx
 
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
 
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
办理多伦多大学毕业证成绩单|购买加拿大UTSG文凭证书
 
Unidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptxUnidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptx
 
Film cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasaFilm cover research (1).pptxsdasdasdasdasdasa
Film cover research (1).pptxsdasdasdasdasdasa
 

Web-scale Discovery Tools and the Backgrounding of Government Information

  • 1. Web-scale Discovery Tools and the Backgrounding of Government Information CHRISTOPHER C. BROWN, UNIVERSITY OF DENVER, MAIN LIBRARY ELECTRONIC RESOURCES & LIBRARIES, AUSTIN, TX, FEB. 23, 2015
  • 2. The Days of Broadcast Search Often referred to as “federated search”, but not It was disintegrated search, and maybe you could say federated results Characterized by long waits (connection wait times, handshakes, mapping of search terms, merging and “deduping” of results, messy results) Searching across metadata. No FT searching.
  • 3. Metasearch or broadcast search was a fail
  • 4. Metasearch or Broadcast Search Innovative technology, but never lived up to expectations Web Metasearch Examples: Metasearch, Dogpile Library Metasearch Examples: MetaLib, Research Pro, ENCompass, Central Search/360 Search, WebFeat, AGent, iBistro
  • 5. Metasearch or Broadcast Search Was often referred to as “federated search” but it was not; at best it was federated results. Web-scale discovery tools are the real “federated search”. But since vendors already used that term, they had to invent “discovery”. Query Merging and de-duping on-the-fly Federated Results Metadata and sometimes FT Query Federated Search and Federated Results Broadcast Search Webscale Discovery Metadata only Information Silos
  • 6. Breadth and Depth Breadth refers to the number and kinds of resources covered. Google Scholar is rather narrow in breadth, covering scholarly articles, selected technical reports, and Google Books “bleadthrough”. Discovery tools are much broader in scope, covering books and ebooks, magazine articles, scholarly articles, trade publications, newsletters, newspapers, dissertations, technical reports, maps, audio-visual materials, institutional repositories, and many other sources. Depth refers to how deeply a search tool goes down into the resource. Google (Google Web, Google Scholar, and Google Books) usually indexes full text of nearly everything. Discovery tools vary greatly in their full text search reach. Sometimes they don’t have access to the full text, only metadata. Other times they have access but choose not to provide full text search access.
  • 7. Metasearch tools were weak, but they searched rather evenly Magazines Scholarly Journals Dissertations Surface Searching Newspapers eBooks Print Books Gov Info
  • 8. Indexing: title, author, keywords, abstract Gov InfoMagazines Scholarly Journals Dissertations Discovery Tools Surface Searching Deep Searching Newspapers eBooks Print Books Full text of content – Every last word Google Scholar Full text of content – Every last word Scholarly Journals Google Books “bleedthrough” Misc: tech rpts, other Let’s Face It: Google Scholar Does It Better: There is Room for Improvement
  • 10. Discovery Tools are not very good at exposing primary sources Not all primary sources are created equal Archival resources: unique and interesting Government information: arguably one of most important primary sources – public policy affects us all. Primary Sources Include: • Vendor primary sources (archival collections) • Institutional repositories or Digital repositories • Government primary sources
  • 11. Why do Discovery Tools succeed more than metasearch tools? The Information Access Anomaly Book (average) Journal Article (average) Google (Scholar/Books) Typical Length - full text (FT) 200 pages x 400 = 80,000 words 15 pages x 400 = 6,000 words Surrogate Record (SR) 50-100 words (75 ave.) 300-500 words (400 ave. 1) SR to FT ratio 1 to 10,666 1 to 15 1 to 1 1 http://www.writersservices.com/wps/p_word_count.htm What this chart means: Very often at the Reference Desk, student will say “Why doesn’t the library have any books on my topic?” Actually, we do; it’s just that our discovery tools are weak.
  • 12. Relevance vs. Discovery Metadata FullText Higher Relevance Higher Discovery Full text indexing within Discovery Tools is uneven, maybe even erratic. Efforts should be made to increase full text presence within their indexing.
  • 13. Unique Library of Congress Trials A unique opportunity in 2012
  • 14. Testing Discovery Systems Select item to be tested (scholarly article, ebook, government publication etc.) Test each of the four discovery tools to ensure metadata is present Test each of the four to see if they can retrieve full text (test both with and without quotes around text) Text should not contain gremlin characters or cross lines (line breaks) Use Google as a control (Google Web, Google Scholar, Google Books)
  • 15. Test G. Scholar Summon EDS Primo WC Disc. Citation Yes Yes Yes Yes Yes Text1 Yes Yes No No No Text2 Yes Yes No No No Wood, Phillip K., Kenneth J. Sher, and Patricia C. Rutledge. "College student alcohol consumption, day of the week, and class schedule." Alcoholism: Clinical and Experimental Research 31, no. 7 (2007): 1195-1207. Text 1: drinking but based instead on the sum of the number of Text 2: night of drinking. In this case, it is not the time spent Test 1
  • 16. Test 2 Citation: Garcia, Sònia, Teresa Garnatje, and Aleš Kovařík. "Plant rDNA database: ribosomal DNA loci information goes online." Chromosoma 121, no. 4 (2012): 389-394. Text 1: "FISH data is stored prior to its publication, without being" Text 2: "outline of how to work with the database. The Simple" Test G. Scholar Summon EDS Primo WC Disc. Citation Yes Yes Yes Yes Yes Text1 Yes Yes No No No Text2 Yes Yes No No No
  • 17. Test 3 Pound, Pandora, Shah Ebrahim, Peter Sandercock, Michael B. Bracken, and Ian Roberts. "Where is the evidence that animal research benefits humans?." BMJ 328, no. 7438 (2004): 514-517. Text 1: An unpublished study by Ciccone and Candelise Text 2: Moreover, if animal experiments fail to inform medical Text 3: single consultations. PP and SE applied (unsuccessfully) to the Test G. Scholar Summon EDS Primo WC Disc. Citation Yes Yes Yes Yes Yes Text1 Yes Yes No Yes No Text2 Yes Yes No Yes No
  • 18. Test 4 Tartter, Molly A., and Lara A. Ray. "A prospective study of stress and alcohol craving in heavy drinkers." Pharmacology Biochemistry and Behavior 101, no. 4 (2012): 625-631. Text 1: "Participants were contacted for an on-line follow-up at 6 and 12 months after evaluation in the laboratory" Text 2: "The ACQ was modified to encompass the four quantitative indices of alcohol use recommended" Test G. Scholar Summon EDS Primo WC Disc. Citation Yes Yes Yes Yes Yes Text1 Yes Yes No Yes No Text2 Yes Yes No Yes No
  • 19. Test 5 Muggah, Robert, and Keith Krause. "Closing the gap between peace operations and post-conflict insecurity: towards a violence reduction agenda." International Peacekeeping 16, no. 1 (2009): 136-150. Text 1: "armed violence prevention and reduction programmes that draw upon" Text 2: "before, during, and after wars come to a close. Armed violence does" Test G. Scholar Summon EDS Primo WC Disc. Citation Yes Yes Yes Yes Yes Text1 Yes Yes Yes Yes No Text2 Yes Yes No Yes No
  • 20. Testing for Government Information Choice of benchmark: Fdsys - http://www.gpo.gov/fdsys/
  • 21. Sources for Government Information The Government Publishing Office has these freely available tools: Catalog of Government Publications (CGP) – library catalog, metadata only - http://catalog.gpo.gov/ Metalib – metasearch tools, searches metadata only - http://metalib.gpo.gov/ FDsys – Repository, searches metadata and full text - http://www.gpo.gov/fdsys/ ◦ FDsys searches full text by default. ◦ In 2011 FDsys officially replaced GPO Access as the official repository for Legislative, Executive, and Judicial Branch documents that GPO hosts. ◦ FDsys was engineered from the ground up to bring quick, faceted searching for discovery of government information. ◦ FDsys content is available for anyone to download and for third-party vendors to utilize.
  • 22. Catalog of Government Publications: GPO’s Online Catalog http://catalog.gpo.gov/
  • 23. MetaLib: the GPO Metasearch Platform Metalib is a broadcast search for government information. No FT searching, only metadata. http://metalib.gpo.gov/ Access to Archival Databases (AAD) System - NARA DONE 99 30 AGRICOLA Books DONE 1 1 AUL Index to Military Periodicals DONE 0 Catalog of U.S. Government Publications (CGP) DONE 8 8 Education Resources Information Center (ERIC) FETCHING 10000 EPA Publications and Newsletters DONE 10 10 Federal Digital System (FDsys) DONE 1352 30 Library of Congress (LOC) DONE 128 30 PubMed DONE 1 1
  • 24. FDsys – Default FT Searching of Official Content http://www.gpo.gov/fdsys/ Although some of this content does not lend itself to inclusion in Discovery Tools, some of these subsets are essential for students to work with public policy, legislative research, and history.
  • 25. Priorities of what Full text Government Information to Include in Discovery Tools Congressional Reports – legislative intent (1995-present) Congressional Hearings – social history and policy making (1995-present) Congressional Documents – budget, treaties, special reports, and legislation-related (1995- present) Public and Private Laws (1995-present) Compilation of Presidential Documents (1993-present) Public Papers of the Presidents (1991-2010) Economic Report of the President (1995-present)
  • 26. Summon has govdocs – but their own PQ Congressional content, not FDsys
  • 27. Congressional Hearings and Congressional Reports Hearings are important because they can provide valuable background information including statistics, scientific research, social implications, and stakeholder perspectives. Reports are even more important that hearings in that they can show legislative intent – why a bill is needed. They can provide majority and minority perspectives and section-by-section explication on bills.
  • 28. Test cases – Congressional Hearings Hearings are important because they can provide valuable background information including statistics, scientific research, social implications, and stakeholder perspectives.
  • 29. Gov Test 1 – Congressional Hearings Test Google FDsys Summon EDS Primo WC Disc. Citation Yes Yes Cat rec only Cat rec only Cat rec only Cat rec only Text1 Yes Yes No No No No Text2 Yes Yes No No No No Citation: United States. 2004. Exotic bird species and the Migratory Bird Treaty Act: oversight field hearing before the Subcommittee on Fisheries Conservation, Wildlife and Oceans of the Committee on Resources, U.S. House of Representatives, One Hundred Eighth Congress, first session, Tuesday, December 16, 2003, in Annapolis, Maryland. Washington: U.S. G.P.O. Text1: "change to the MBTA that would make clear that invasive birds are not protected" Text2: "it is going into the Everglades system, in and around the"
  • 30. Test cases – Congressional Reports Reports are even more important that hearings in that they can show legislative intent – why a bill is needed. They can provide majority and minority perspectives and section-by-section explication on bills.
  • 31. Gov Test 2 – Congressional Report United States. 2009. Authorizing the designation of national environmental research parks by the Secretary of Energy, and for other purposes report (to accompany H.R. 2729) (including cost estimate of the Congressional Budget Office). Washington, D.C.: U.S. G.P.O. http://purl.access.gpo.gov/GPO/LPS115815. Text1: "Biggert for her co-sponsorship and Ranking Member Hall for his" Text2: "Chair GORDON. Let me again in closing say that just because we" Test Google FDsys Summon EDS Primo WC Disc. Citation Yes Yes Cat rec only Cat rec only Cat rec only Cat rec only Text1 Yes Yes No No No No Text2 Yes Yes No No No No
  • 32. Gov Test 3: Compilation of Presidential Documents Remarks on the Patient Protection and Affordable Care Act, April 1, 2014. Compilation of Presidential Documents. http://www.gpo.gov/fdsys/pkg/DCPD-201400224/pdf/DCPD- 201400224.pdf. Text 1: "health insurance who didn't just a few years ago, and that's something to be proud of" Text 2: "understand. I've got to admit, I don't get it. Why are folks working so hard for people not" Test Google FDsys Summon EDS Primo WC Disc. Citation Yes Yes Cat rec only Cat rec only No No Text1 Yes Yes Yes* No No No Text2 Yes Yes Yes* No No No *Not the freely available FDsys content.
  • 33. Conclusions Government information, when it is available through a Discovery Tool, is available through licensed content, not freely available sources. Government information should not be backgrounded, buried with the many other content types. Since GPO will hand over FDsys content to any vendor that wants it, vendors need to figure out how to acquire it, even if it means developing a new ingest method.
  • 34. What can be done? Ask your vendor Ask GPO
  • 35. A Charge to Vendors Keep working on relevance ranking All more full text into your index (include all types of content, scholarly articles, magazine articles, books and ebooks) Work with GPO to acquire FDsys metadata and full text The vendor that figures this out first will have a competitive edge.
  • 36. Questions? Christopher C. Brown, Reference Technology Integration Librarian; Government Documents Librarian University of Denver, Main Library - http://library.du.edu/ cbrown@du.edu (303) 871-3404