The document discusses challenges facing government information libraries in the digital era. Specifically, it notes that while digital distribution allows for more convenient access, it often lacks standards and policies for long-term preservation. It also argues that calls for digitizing print collections could damage long-term preservation efforts and that the privatization of government information has paradoxically decreased public access despite being digital. The talk will explore strategies for digital collection and collaboration that uphold principles of public access and preservation.
Blind Spots and Broken Links: Access to Government InformationJames Jacobs
Panel presentation given by James R. Jacobs as part of a program at American Library Association's 2015 annual conference set up by the Federal & Armed Forces Libraries Round Table (FAFLRT). The program, "Open Government: Current Trends and Practices Concerning FOIA, Open Access, and Other Post-Wiki-Leaks Issues" also featured Anneliese Taylor, Assistant Director of Scholarly Communications & Collections at UCSF, who gave an in-depth and very interesting presentation on open access and the OSTP directive on "Expanding Public Access to the Results of Federally Funded Research"
Preservation for all: the future of government documents and the “digital FDL...James Jacobs
Preservation for all: the future of government documents and the “digital FDLP” puzzle. A presentation at the Ohio GODORT spring 2011 meeting (by invitation). Friday, June 3, 2011 at the State Library of Ohio.
Agenda:
library principles and best practices
case studies:
--Everyday Electronic Materials (EEMs) “Water droplets”
--Archive-it “Oceans”
--lockss-usdocs “Waterfalls”
--Collaboration: delicious, state agency databases “Reservoirs”
--reflection of projects based on principles
Blind Spots and Broken Links: Access to Government InformationJames Jacobs
Panel presentation given by James R. Jacobs as part of a program at American Library Association's 2015 annual conference set up by the Federal & Armed Forces Libraries Round Table (FAFLRT). The program, "Open Government: Current Trends and Practices Concerning FOIA, Open Access, and Other Post-Wiki-Leaks Issues" also featured Anneliese Taylor, Assistant Director of Scholarly Communications & Collections at UCSF, who gave an in-depth and very interesting presentation on open access and the OSTP directive on "Expanding Public Access to the Results of Federally Funded Research"
Preservation for all: the future of government documents and the “digital FDL...James Jacobs
Preservation for all: the future of government documents and the “digital FDLP” puzzle. A presentation at the Ohio GODORT spring 2011 meeting (by invitation). Friday, June 3, 2011 at the State Library of Ohio.
Agenda:
library principles and best practices
case studies:
--Everyday Electronic Materials (EEMs) “Water droplets”
--Archive-it “Oceans”
--lockss-usdocs “Waterfalls”
--Collaboration: delicious, state agency databases “Reservoirs”
--reflection of projects based on principles
Technology Trends in Libraries - Today & TomorrowRachel Vacek
This presentation discusses the basic concepts of Web 2.0 and how they are being used in libraries. It provides examples of these concepts, and emphasizes that over the next several years, the concepts of Web 2.0 (collaboration, participation, tagging, community, etc.) will only grow, but the actual technologies themselves will change.
Libraries: technology as artifact and technology in practicelisld
Research and learning workflows are increasingly enacted in data-rich network environments. New behaviors are emerging which are shaped by and in turn shape workflow and data tools and services. This means that library attention is shifting from not only providing support systems and services but to supporting those behaviors more directly as they emerge. This support may take the form of particular system or services, but will also involve consulting and advising about such things as publication venues, reputation management, profiles, research networking.
A keynote presentation given at the Association of Jesuit Colleges and Universities CITM and Library Deans meeting. Loyola University, Maryland.
The IFLA Trend Report identifies five top level trends which will play a key role in shaping our future information ecosystem:
TREND 1 New Technologies will Both Expand and Limit Who has Access to Information.
TREND 2 Online Education will Democratise and Disrupt Global Learning.
TREND 3 The Boundaries of Privacy and Data Protection will be Redefined.
TREND 4 Hyper-Connected Societies will Listen to and Empower New Voices and Groups.
TREND 5 The Global Information Economy will be Transformed by New Technologies.
This presentation was provided by Peter Murray of IndexData during the NISO virtual conference, Information Freedom, Ethics and Integrity, held on Wednesday, April 18, 2018.
his presentation was provided by Tom Bruno of the University of Pennsylvania, during the NISO Hot Topic Virtual Conference "Ebooks and Collections." The event was held on August 25, 2021
This presentation was provided by Brewster Kahle of The Internet Archive, during the NISO event "Owing, Licensing, and Sharing Digital Content." The virtual conference was held on Thursday, January 21, 2021.
Digital FDLP Louisiana GODORT 2012 slides+notesJames Jacobs
Keynote talk at the Spring 2012 meeting of the Louisiana Government Documents Round Table (LA GODORT) in Shreveport, LA Friday March 23, 2012.
The last slide includes a list of citations for further reading.
Will the Digital library sustain as a Social Capital for dissemination of Inf...Saptarshi Ghosh
Abstract
This paper deals with the relationship between digital library and social development. The core of digital library which rests with strong social bonding and participatory approach, has been reflected in this write-up. Today, global prosperity and individual productivity depend upon the ability to learn constantly, adapt to change readily, and to evaluate information critically. Right now in this information rich world, we must remain ways to transform information into knowledge. So, how can we ensure that our communities can access the resources and services that we have available? How can we ensure that we are responsive to, and representative of, our communities' actual, as opposed to perceived, needs? We will look at various ways that library services can partner with their communities to bring about better outcomes for all. The digital library can bridge these gaps and it may be turned as a people’s access to the information repository and can be a motivator to sustainable development.
Technology Trends in Libraries - Today & TomorrowRachel Vacek
This presentation discusses the basic concepts of Web 2.0 and how they are being used in libraries. It provides examples of these concepts, and emphasizes that over the next several years, the concepts of Web 2.0 (collaboration, participation, tagging, community, etc.) will only grow, but the actual technologies themselves will change.
Libraries: technology as artifact and technology in practicelisld
Research and learning workflows are increasingly enacted in data-rich network environments. New behaviors are emerging which are shaped by and in turn shape workflow and data tools and services. This means that library attention is shifting from not only providing support systems and services but to supporting those behaviors more directly as they emerge. This support may take the form of particular system or services, but will also involve consulting and advising about such things as publication venues, reputation management, profiles, research networking.
A keynote presentation given at the Association of Jesuit Colleges and Universities CITM and Library Deans meeting. Loyola University, Maryland.
The IFLA Trend Report identifies five top level trends which will play a key role in shaping our future information ecosystem:
TREND 1 New Technologies will Both Expand and Limit Who has Access to Information.
TREND 2 Online Education will Democratise and Disrupt Global Learning.
TREND 3 The Boundaries of Privacy and Data Protection will be Redefined.
TREND 4 Hyper-Connected Societies will Listen to and Empower New Voices and Groups.
TREND 5 The Global Information Economy will be Transformed by New Technologies.
This presentation was provided by Peter Murray of IndexData during the NISO virtual conference, Information Freedom, Ethics and Integrity, held on Wednesday, April 18, 2018.
his presentation was provided by Tom Bruno of the University of Pennsylvania, during the NISO Hot Topic Virtual Conference "Ebooks and Collections." The event was held on August 25, 2021
This presentation was provided by Brewster Kahle of The Internet Archive, during the NISO event "Owing, Licensing, and Sharing Digital Content." The virtual conference was held on Thursday, January 21, 2021.
Digital FDLP Louisiana GODORT 2012 slides+notesJames Jacobs
Keynote talk at the Spring 2012 meeting of the Louisiana Government Documents Round Table (LA GODORT) in Shreveport, LA Friday March 23, 2012.
The last slide includes a list of citations for further reading.
Will the Digital library sustain as a Social Capital for dissemination of Inf...Saptarshi Ghosh
Abstract
This paper deals with the relationship between digital library and social development. The core of digital library which rests with strong social bonding and participatory approach, has been reflected in this write-up. Today, global prosperity and individual productivity depend upon the ability to learn constantly, adapt to change readily, and to evaluate information critically. Right now in this information rich world, we must remain ways to transform information into knowledge. So, how can we ensure that our communities can access the resources and services that we have available? How can we ensure that we are responsive to, and representative of, our communities' actual, as opposed to perceived, needs? We will look at various ways that library services can partner with their communities to bring about better outcomes for all. The digital library can bridge these gaps and it may be turned as a people’s access to the information repository and can be a motivator to sustainable development.
for getting the library resources fro the libraries entire world, the important tool is Library catalogues. every can browse all most all the world literature through WorldCat fro the INTERNET.
Research into Practice case study 2: Library linked data implementations an...Hazel Hall
The research underlying this presentation explored the role that libraries play in the linked data context. Focusing on European national libraries and Scottish libraries, multiple data gathering methods and constant comparative analysis were applied in the study. Amongst the findings, a general lack of awareness within the library community of the Semantic Web and the implications of linked data was identified. At the same time, there is recognition that linked data augments the discoverability and enhances the interoperability of library data. The presentation will include recommendations for the application of the findings of this research in practice.
Keynote presentation at Montana Library Association meeting, Helena, 7 February. It looks at public and academic library directions in a network environment.
101 This is Digital Scholarship Staff TrainingNora McGregor
Slides for our internal staff introduction to Digital Scholarship course. When does scholarship become ‘digital scholarship’? This course takes a thought-provoking look at how information technology has transformed research today. Touching on the growing application of computing in various research disciplines we’ll familiarize ourselves with the concepts, methods and tools that define digital scholarship and explore how we can best support digital scholars at British Library.
Dr Joel Weintraub: Unique Aspects of the United States CensusJames Jacobs
Dr Weintraub gave a public talk at Stanford University 2/4/2013. His talk centered around the United States Decennial Census Manuscripts aka Enumerators' Notebooks, the history of the Census Questions, including controversial questions, undercounts, and truthfulness. For more on Dr Weintraub's census work see his 1940 census site and his collaborative work with Steve Morse at http://stevemorse.org.
Zoe Lofgren letter to ALA nominating Aaron Swartz for the Madison awardJames Jacobs
Congresswoman Zoe Lofgren's letter to ALA Washington Office in support of the nomination of Aaron Swartz for the 2013 James Madison award. The award is "named for President James Madison was established in 1986 and is presented annually on the anniversary of his birth to honor individuals or groups who have championed, protected and promoted public access to government information and the public’s right to know at the national level."
Opportunities and challenges for the 21st century FDLP (CNI Spring 2012)James Jacobs
James Jacobs, Government Information Librarian, Stanford University Libraries
Suzanne Sears, Assistant Dean for Public Services, University of North Texas Libraries
David Walls, Preservation Librarian, US Government Printing Office
Date: Monday, April 2, 2012
Abstract:
The vast majority of all US Government documents published today are “born digital,” published electronically and available through the Internet, and will never be printed by the federal government. The lack of a systematic process for capturing, preserving, and disseminating born-digital government information challenges the ability of the Federal Depository Library Program (FDLP) in being able to provide permanent and equal access to online-only government information to all citizens. However, GPO and the FDLP community have begun to make strides on this most critical issue. This project briefing will describe several exciting initiatives currently underway to capture, preserve and provide access to born-digital government information – including GPO’s Federal Digital System (FDsys) and web harvesting initiatives, and the agency’s partnerships with Federal agencies; the CyberCemetery, CRS Report archive and robust digitization program and digital repository of the University of North Texas; and the LOCKSS-USDOCS collaborative program. These projects offer examples of how the FDLP community, in partnership and under formal agreements with GPO can work collaboratively to assure the long-term preservation of born-digital government information to “keep America informed.”
Handout for Susan Kendall's panel presentation on the electronic government publications statistics program at SJSU. Part of the "Electronic Collection Management" Council Session at the Federal Depository Library Conference and Depository Library Council Meeting, October 17-20, 2011.
Tom Bruce iConference 2010 presentationJames Jacobs
iConference Panel, Thursday, February 4, 2010, University of IL at Urbana-Champaign
http://www.ischools.org/iConference10/2010index/
This was the presentation for Tom Bruce, Director of Cornell's Legal information institute.
For slides and notes, see:
http://freegovinfo.info/node/2897
Roundtable 4 : : Technology Room
• Gone today, Here tomorrow: assuring access to government information in the digital age. ShinJoung Yeo (moderator), University of Illinois; and James R. Jacobs, Stanford University
Panelists:
*James Jacobs
*Thomas Bruce (Legal Information Institute, Cornell University)
*Daniel Schuman (Sunlight Foundation policy director)
*Cindy Etkin (Govt Printing Office)
iConference Panel, Thursday, February 4, 2010, University of IL at Urbana-Champaign
Roundtable 4 : : Technology Room
• Gone today, Here tomorrow: assuring access to government information in the digital age. ShinJoung Yeo (moderator), University of Illinois; and James R. Jacobs, Stanford University
Panelists:
*James Jacobs
*Thomas Bruce (Legal Information Institute, Cornell University)
*Daniel Schuman (Sunlight Foundation policy director)
*Cindy Etkin (Govt Printing Office)
NGOs In Libraries: Why Bother? -- Jim ChurchJames Jacobs
Jim Church (UC Berkeley) presentation about Non-Governmental Organizations and libraries at the GODORT Update, American Library Association Midwinter conference in Denver CO, Saturday, January 24, 2009.
Speakers:
--Jim Church, University of California-Berkeley
--Nelson Fabian, National Environmental Health Association
--Romulo Rivera, Project Manager, Policy Archive
--Introduction and Moderation by Kris Kasianovitz, University of California Los Angeles
Description. The GODORT update featured 3 speakers who described the role and impact of Non-governmental organizations (NGOs). NGOs play an important role in policy decisions and frequently interact with government entities at all levels. Speakers explored this issue from a librarian, NGO, and governmental point of view. Discussion about GODORT's role in promoting the collection and use of NGO information in libraries will also take place.
James Jacobs' presentation of a panel entitled, "Gone Today, Here Tomorrow: Archiving and Preserving Born Digital Government Documents." The presentation is about Archive-it and preserving born-digital govt information. This part of the panel describes how Stanford University Libraries are using Archive-it.
Presentation about Del.icio.us and its uses for organization, collaboration, integration, and collections. The presentation was given as part of the Docs2.0 preconference at American Library Association's Govt Documents Roundtable (GODORT) in Anaheim, CA June 27, 2008
The French Revolution, which began in 1789, was a period of radical social and political upheaval in France. It marked the decline of absolute monarchies, the rise of secular and democratic republics, and the eventual rise of Napoleon Bonaparte. This revolutionary period is crucial in understanding the transition from feudalism to modernity in Europe.
For more information, visit-www.vavaclasses.com
A Strategic Approach: GenAI in EducationPeter Windle
Artificial Intelligence (AI) technologies such as Generative AI, Image Generators and Large Language Models have had a dramatic impact on teaching, learning and assessment over the past 18 months. The most immediate threat AI posed was to Academic Integrity with Higher Education Institutes (HEIs) focusing their efforts on combating the use of GenAI in assessment. Guidelines were developed for staff and students, policies put in place too. Innovative educators have forged paths in the use of Generative AI for teaching, learning and assessments leading to pockets of transformation springing up across HEIs, often with little or no top-down guidance, support or direction.
This Gasta posits a strategic approach to integrating AI into HEIs to prepare staff, students and the curriculum for an evolving world and workplace. We will highlight the advantages of working with these technologies beyond the realm of teaching, learning and assessment by considering prompt engineering skills, industry impact, curriculum changes, and the need for staff upskilling. In contrast, not engaging strategically with Generative AI poses risks, including falling behind peers, missed opportunities and failing to ensure our graduates remain employable. The rapid evolution of AI technologies necessitates a proactive and strategic approach if we are to remain relevant.
Introduction to AI for Nonprofits with Tapp NetworkTechSoup
Dive into the world of AI! Experts Jon Hill and Tareq Monaur will guide you through AI's role in enhancing nonprofit websites and basic marketing strategies, making it easy to understand and apply.
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
This slides describes the basic concepts of ICT, basics of Email, Emerging Technology and Digital Initiatives in Education. This presentations aligns with the UGC Paper I syllabus.
How to Make a Field invisible in Odoo 17Celine George
It is possible to hide or invisible some fields in odoo. Commonly using “invisible” attribute in the field definition to invisible the fields. This slide will show how to make a field invisible in odoo 17.
Acetabularia Information For Class 9 .docxvaibhavrinwa19
Acetabularia acetabulum is a single-celled green alga that in its vegetative state is morphologically differentiated into a basal rhizoid and an axially elongated stalk, which bears whorls of branching hairs. The single diploid nucleus resides in the rhizoid.
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
Synthetic fiber production is a fascinating and complex field that blends chemistry, engineering, and environmental science. By understanding these aspects, students can gain a comprehensive view of synthetic fiber production, its impact on society and the environment, and the potential for future innovations. Synthetic fibers play a crucial role in modern society, impacting various aspects of daily life, industry, and the environment. ynthetic fibers are integral to modern life, offering a range of benefits from cost-effectiveness and versatility to innovative applications and performance characteristics. While they pose environmental challenges, ongoing research and development aim to create more sustainable and eco-friendly alternatives. Understanding the importance of synthetic fibers helps in appreciating their role in the economy, industry, and daily life, while also emphasizing the need for sustainable practices and innovation.
2024.06.01 Introducing a competency framework for languag learning materials ...Sandy Millin
http://sandymillin.wordpress.com/iateflwebinar2024
Published classroom materials form the basis of syllabuses, drive teacher professional development, and have a potentially huge influence on learners, teachers and education systems. All teachers also create their own materials, whether a few sentences on a blackboard, a highly-structured fully-realised online course, or anything in between. Despite this, the knowledge and skills needed to create effective language learning materials are rarely part of teacher training, and are mostly learnt by trial and error.
Knowledge and skills frameworks, generally called competency frameworks, for ELT teachers, trainers and managers have existed for a few years now. However, until I created one for my MA dissertation, there wasn’t one drawing together what we need to know and do to be able to effectively produce language learning materials.
This webinar will introduce you to my framework, highlighting the key competencies I identified from my research. It will also show how anybody involved in language teaching (any language, not just English!), teacher training, managing schools or developing language learning materials can benefit from using the framework.
Gone today, here tomorrow: the future of government information and the digital FDLP
1. Gone today, here tomorrow:
the future of government
information and the digital
FDLP
James R. Jacobs
jrjacobs@stanford.edu
lockss-usdocs.stanford.edu
UW i-school
Thursday January 24, 2013
Wednesday, January 23, 2013
Iʼd like to thank Cass Hartnett, the Northwest Government Information Network, the UW Information School, the UW
Association of Library and Information Science Students (ALISS), and the University of Washington Libraries for inviting me to
talk with you today. I hope itʼll be worth your while :-)
2. Aaron Swartz 11.8.86 - 1.11.13
PD haiku notice: do what you feel like /
since the work is abandoned /
the law doesn’t care
http://www.aaronsw.com/weblog/000360
http://www.rememberaaronsw.com/
Wednesday, January 23, 2013
I dedicate today’s talk to my friend and internet activist Aaron Swartz for his progressive
ideals and dedication to free and open information access.
PD: do what you feel like / since the work is abandoned / the law doesn’t care
3. Agenda
• Historical ideals of the FDLP
• Collection strategies:
• Everyday Electronic Materials (EEMs) “Water droplets”
• Archive-it “Oceans”
• LOCKSS-USDOCS “Waterfalls”
• Collaboration “Reservoirs”
• Reflection
• [[slides available at slideshare.net/freegovinfo]]
Wednesday, January 23, 2013
introduce and agenda
Weʼre at the very beginning of the digital era where tools, policies, best practices, etc are all in flux. In many ways, weʼre at the
age of new metaphors needed to describe what it is that we as librarians do on a daily basis.
I'd like to talk about the underlying historical ideals of the FDLP, discuss how those ideals have been under fire from both within
and without the library community and argue that those ideals applied to today's new information metaphors give us the best
chance at access to and long-term preservation and assurance of govt information.
Then Iʼll talk about some of the digital collection strategies that Iʼve found to be successful and then conclude with a bit about
collaboration and to-dos.
4. Librarians ...
... Explore
... Collect
... Describe
... Share
... Preserve
Wednesday, January 23, 2013
... But basically, we explore, collect, describe, share and preserve the world of information. In my humble estimation, format
does not change what it is that we do as librarians! Today I aim to show that the shift to digital does not preclude us from
exploring, collecting, describing, sharing, and preserving government information.
Right up front, I'm a librarian and a collaborator in the LOCKSS-USDOCS distributed digital preservation project (Lots of
Copies Keep Stuff Safe). I've been in academia/education my whole life as a student, teacher, librarian and technologist. I've
worked in libraries since high school and been a government information/FDLP librarian since 2002 and served a 3-year term
on the Depository Library Council, the body which informs and advises the Govt Printing Office regarding issues of the Federal
Depository Library Program. So my mindset/perspective/bias is from one who assists in the scholarly communication process,
one who believes that libraries have a place in the digital information landscape, and one who believes strongly in the idea that
public access to govt information is a fundamental right.
In the print era (which is not over!) we had rules and processes in place to do the things that we do as librarians.
In the digital realm (which is just beginning and will continue to overlap with the print era for the foreseeable future) we are just
beginning to figure out the rules and processes. But the concepts remain the same.
Government documents are the DNA of the democratic process – Carl Malamud would call it the “source code.” And so we
must find ways to continue to give access and preserve this content for the long-term.
5. FDLP principles
• Forward democratic ideals
• Serve public interest / public access / public control / public
preservation
• Serve the information needs of your community
• Forward the long-term institutional viability of libraries
• Promote and leverage collective action
Wednesday, January 23, 2013
Are you:
--forwarding democratic ideals?
--serving public interest / public access / public control / public preservation?
--serving the information needs of your community?
--forwarding the long-term institutional life of libraries?
--promoting and leveraging collective action?
These are the principles that we as govt information librarians (and librarians in general!) hold dear. Best practices =
practices in which these principles are embedded – and the principles embedded in the FDLP.
If you too believe in these ideals (I hope!), then you already do take actions in support of these values – and probably one
of the main reasons you all have stayed in the field of librarianship is because you believe the following:
--libraries are critical as memory organizations
--local control of collections is imperative (e.g., a large network of libraries resists accident and natural disasters and are
self-healing. A large network of FDLP libraries can help alleviate and ameliorate the damage and rebuild collections when
those accidents invariably occur. Just ask my friend Rebecca Blakeley who had a wonderful presentation at the 2008 fall
Depository Library Conference about the steps that McNeese State University Library took in rebuilding their documents
collection after heavy damage suffered from Hurricane Rita.
--distributed system is crucial to meet local needs (spread responsibility for content among various locations and
administrations)
--public interest (affirms FDLP libraries’ role in ensuring permanent public access!)
--value of library community
--shared preservation responsibilities
While I talk about the following projects, please keep these principles and ideals in mind. So let’s get to the case study
part of the discussion.
6. “There seems to be an inverse relationship between
convenience of dissemination and preservation standards.”
-- Chuck Humphrey, data librarian, U of Alberta
Wednesday, January 23, 2013
Over the last 20-30 years, developments in publishing and Internet technologies have affected the way government
information is produced, disseminated, controlled, and preserved. These changes have affected the policies and procedures
of the GPO and, in turn, have affected the depository library program. Despite the often-heard promises that Web
technologies will bring more information to more people more quickly and easily, the actual effects have been decidedly
mixed. The highly visible, short-term successes of rapid dissemination of single titles directly to citizens (e.g., the large
number of downloads of the 9/11 report) mask the loss of a secure infrastructure (GPO's Federal Digital System (FDsys)
notwithstanding) for long-term preservation of and access to government information as more and more agencies publish
content on their own Web sites rather than using the GPO conduit (which we in the govt info world call "fugitive documents")
and very few agencies publish to any standards or have policies in place that deal with archiving and preservation. As Chuck
Humphrey, a data librarian friend of mine, once said, “there seems to be an inverse relationship between convenience of
dissemination and preservation standards.”
In addition to this lack of a secure infrastructure, the growing din of the call for digitization of historic govt publications – I
refuse to use the term “legacy”! – from some of the large library associations like ARL, ASERL and CIC, while no doubt a boon
for access today – though with their own unique issues in terms of metadata, provenance, findability, usability etc – is
somewhat of a red herring that makes library administrators believe that they will soon be able to dispose of their physical
collections – not to mention their documents staffs! – and use that space for this week’s buzz word. This call for digitization
may instead have the deleterious affect of damaging the long-term preservation of govt publications.
Lastly, the growing trend toward privatization of govt information has actually caused a decrease in public access despite it's
digital nature. This is not a new trend. Herbert Schiller noted this in 1986 in his book "Information and the Crisis Economy."
Speaking of machine-readable formats, he wrote that, "Library information capability is greatly enhanced. Yet this benefit is
accompanied by the abandonment of libraries' historical free access policy. User charges are introduced. The public character
of the library is weakening as its commercial connection deepens. No less important, the composition and character of its
holdings change as the clientele shifts from general public to the ability-to-pay user."
7. GAO/Thomson contract
Carl Malamud. Public.resource.org. 1/23/13 http://sn.im/gao-contract
Wednesday, January 23, 2013
We've seen over the last several years a disturbing rise in Federal Agencies entering into contracts with private
companies whereby public domain govt documents are digitized and then taken out of the commons via licensing
agreements. See for example, the Government Accountability Office (GAO)'s deal with Thomson-West whereby
Thomson-West digitized the GAO's 20,597 legislative histories of most public laws from 1915-1995 and in return received
exclusive license to sell access to the content. GAO received nothing in return but an account on Thomson's service while
the public received nothing at all.
Last year, NARA entered into a contract with Ancestry.com to serve out the 1940 census schedules (aka enumerators’
notebooks) that were released in 2012 after 72 years. Ancestry agreed to be NARA’s digital infrastructure, offering free
access for 1 year (until April 2013) but henceforth the public would need an Ancestry subscription in order to access the
schedules. And don’t get me started about IBM and Census’ American Factfinder.
Rapid technological change and the misplaced assumption that "it's all in google" have caused some in the FDLP
community to question the need for the FDLP and some others to drop out of the program altogether. I believe that the
inherent nature of digital information actually increases the need for a distributed network of dedicated, legislatively
authorized libraries and librarians. It would be prudent to draw upon the existing infrastructure of FDLP libraries and the
200 years of cumulative experience of these institutions in assuring preservation of and access to government
information. We must reinforce FDLP’s traditional mission of selection, collection, free access, and preservation in the
digital era in order to assure free access to this information into the foreseeable future.
8. FDLP ecosystem
Wednesday, January 23, 2013
Nobody knows for sure how to preserve digital content for the long-term. This means to me that a loosely coupled,
independently administered, distributed ecosystem is the best way to assure long-term preservation -- many
organizations with many funding models and distributed technical infrastructures have a better shot at preservation than 1
or 2 organizations -- especially if one of those organizations has a tenuous budget, or is a private corporation etc. David
Weinberger described the Web in this way in his book “Small pieces loosely joined” and I think that metaphor holds
equally true for libraries. Here’s a back of the napkin kind of sketch of how I imagine the FDLP ecosystem to look.
How would each of these scenarios deal with or react to different stress situations or threat models (directly out of the
OAIS handbook e.g., reduced budgets, increased demand for privatization, increased demand for censorship or control or
removal of information, media/hardware/software/network failure, natural disaster, organizational failure etc.)? It's easy to
see that a highly replicated, distributed FDLP model of preservation based on common open digital standards and OAIS
would deal with these situations much better than a centralized model. A web is much stronger than a silo. This holds true
for all information, not just govt info of course.
Thus ends the soapbox portion of my talk. I’m sure to get back on it later, but for now I’d like to shift gears a bit and talk
about practical matters and about my strategy for collection development and long-term preservation in the FDLP
ecosystem. I’ll run through a few examples for how to conceptualize and actually do digital collection development of govt
information. I like to use a water metaphor to describe my processes. In the digital realm, we have to collect drops of
water, waterfalls as well as the ocean.
First the droplets:
9. EEMs
• Everyday Electronic
Materials
• serendipitous collection
• Collecting the Web a
drop at a time
• Flickr photo by Elle Is Oneirataxic. Attribution-NonCommercial-
ShareAlike 2.0 Generic Creative Commons license
Wednesday, January 23, 2013
EEMs – or Everyday Electronic Materials – is a Mellon Foundation grant-funded project here at Stanford to build infrastructure
and a workflow to support the collection, description, preservation and public access of digital objects by bibliographers and
subject specialists.
EEMs are those digital materials that are serendipitously referenced in news reports, distributed by posting on Web sites, or
through email notification to scholars and bibliographers; those items that selectors come across in the course of doing their
everyday work. In the past, librarians may have downloaded documents to their desktops and perhaps print them out and have
them bound (if their administrations were amenable!). Now we’ve got a digital stacks in which to collect, preserve and give
access!
**For those interested in more, I’ve got a citation and link at the end of the presentation to my colleague Katherine Kott’s report
on the project. For those chomping at the bit now, just Google Kott, EEM, CNI.
Subject specialist workflow is pretty simple:
1. identify a document (*only pdfs and only monographs at this time)
2. drag url of doc to the EEMs browser widget
3. determine copyright status. Request permission from the copyright owner to harvest/preserve if need be (I can usually skip
this step with public domain govt documents!)
4. describe the document (title, author, rights status, notes)
5. submit to acq and cataloging workflow.
6. EEM is locally stored in our digital repository and accessible through our catalog (searchworks)
7. My EEMs workflow also includes reporting fugitive documents to GPO, but I’ll describe that momentarily.
10. Agencies tracked for EEMs
• Bureau of Land Management CA field office
• Department of Justice
• Bureau of Ocean Energy Management, Regulation and Enforcement (BOEMRE)
(including Minerals Management Service)
• NOAA
• National Cancer Institute
• National Institutes of Health
• USDA
• Office of Management and Budget
• **Harvesting with archive-it:
• EPA
• GAO
• Census current industrial reports
• Thanks lost docs blog! http://lostdocs.freegovinfo.info
Wednesday, January 23, 2013
My use of the EEMs workflow and tool grew out of 2 other projects focusing on fugitive govt documents – fugitive documents
are a particular passion of mine!
Particularly through the work of the lostdocs blog (lostdocs.freegovinfo.info) – which tracks fugitive document submissions to
the GPO in order to provide a public listing of fugitive documents – I’ve been able to target several agencies that generally are
the worst offenders in terms of fugitive documents:
We also found that 3 other agencies that were top fugitive offenders published too many documents to make the EEMs
workflow feasible. So I’m harvesting the following 3 agencies with Archive-it (which I’ll describe later):
I have an acquisitions staff person working about 3hrs per month to 1) check the agency publications pages for new
publications; 2) Check the CGP (http://catalog.gpo.gov) to see if the document has made it into the GPO catalog, and 3)
submit a fugitive document report to GPO, and upload the PDF to the EEMs tool.
Besides these federal agencies, I also scour the news – and have a google alert set – for leaked and newsworthy govt
documents like the recently debunked LoC report on Iranian intelligence written about on ProPublica. This is sort of like
reverse engineering the collection development process.
11. EEM: http://searchworks.stanford.edu/view/8707790
Wednesday, January 23, 2013
Through the EEMs workflow, to date we’ve been able to collect over 400 documents like this one (notice the Stanford PURL),
preserve them locally in the Stanford digital repository (SDR) and give access to them through our catalog, searchworks. Think
what we could do if 100 libraries – or 1000! – instituted this workflow? Collectively, we could cover all federal agencies to
assure that no born-digital document within scope of the FDLP falls through the cracks and becomes fugitive.
Next I’ll talk about the ocean:
12. Archive-it
• collecting the Web in
bulk
• Archive-it.org/home/ssrg
• Fotopedia image by Marcus Revertegat. Creative Commons
Attribution 3.0 Unported license.
Wednesday, January 23, 2013
Archive-it is a subscription service from the Internet Archive – which by the way has many digital copies of historic govt
documents and digitized microfilm available in its text collection. It’s an easy collection-building tool whereby you give the
software a list of urls (called “seeds”), schedule the crawler to harvest the seeds, and then give public access to the
content collected. It’s a good way to contextualize or make sense of the ocean of content on the open Web.
Since 2007 we’ve harvested:
Documents Crawled: 58,590,127 (anything from a spacer gif to a mp4 file is considered a “document”)
Data Archived: 4,616.5 GB (4.6 TB!)
13. SULAIR archive-it home:
http://www.archive-it.org/home/SSRG
Wednesday, January 23, 2013
What I’m collecting with Archive-It:
• CRS Reports
• FOIA documents and Agency FOIA reading rooms
• Fugitive US agencies: EPA, GAO etc (shout-out to lostdocs.freegovinfo.info)
• Bay Area governments
• Climate change and environmental policy
• G-20
• CA Dept of education curriculum and instruction
• US budget
• FRUS
14. Collection seeds
https://archive-it.org/public/collection.html?id=1078
Wednesday, January 23, 2013
Metadata: one of our catalogers has created Dublin core metadata at the collection and seed level. Archive-it allows for
metadata at the document level, but we have not done that. We are in the planning stage to index the metadata for our
catalog. We’re also planning to feed archive-it collections into our LOCKSS caches for redistribution and long-term
preservation.
15. search and discover
http://snipurl.com/crs-energyefficiency
Wednesday, January 23, 2013
We give access to the collections via full text search from the archive-it site and from our databases page. Our crawled seeds
also are brought into the wayback machine for public access.
Search can also be embedded into other Web pages (feel free to copy/paste this code!)
16. Paste this into your HTML:
<form action="http://www.archive-it.org/public/search">
<input type="hidden" name="collection"
value="***COLLECTIONID***" />
<input type="text" name="query" />
<input type="submit" name="go" value="Go" />
</form>
***COLLECTIONID*** = 1078 (CRS reports collection)
add search to other pages
</gratuitous_code>
Wednesday, January 23, 2013
<form action="http://www.archive-it.org/public/search">
<input type="hidden" name="collection" value="***COLLECTIONID***" />
<input type="text" name="query" />
<input type="submit" name="go" value="Go" />
</form>
<form action="http://www.archive-it.org/public/search">
<input type="hidden" name="collection" value="1078" />
<input type="text" name="query" />
<input type="submit" name="go" value="Go" />
</form>
Lastly, I’ll mention the waterfall that is LOCKSS-USDOCS.
17. LOCKSS-USDOCS
• Targeted Web collection
and distributed preservation
• Lots of Copies Keep Stuff
Safe
• lockss-usdocs.stanford.edu
• Flickr waterfall picture by discordia1967. That’s actually me at Hanakapi`ai
falls in Kauai :-)
Wednesday, January 23, 2013
lockss-usdocs.stanford.edu
Combines the best of targeted Web harvesting with collaboration and distributed preservation.
18. Wednesday, January 23, 2013
LOCKSS – Lots of Copies Keep Stuff Safe – began at Stanford in 1999. The LOCKSS software was built to solve the problem
of long-term preservation of digital content. It is an open-source distributed digital preservation system based on open
standards (OAIS, OpenURL, HTTP, WARC). Originally LOCKSS was focused on journal literature but over the last 10 years
has been used by other projects focusing on government information, theses and dissertations, numeric data, state records
etc.
The goals of LOCKSS are to spread out the economic cost and responsibility of digital preservation and use off the shelf
hardware and open-source software, so that libraries and content publishers can easily and affordably create, preserve, and
archive local electronic collections and readers can access archived and newly published content transparently at its original
URLs through links resolvers like SFX.
Think of a LOCKSS box as a digitally distributed depository library!
SLIDE 16: DECENTRALIZED PRESERVATION (NEED?)
How does lockss work?
There are 2 parts to the LOCKSS software: harvest and content collection; and content checking and replication.
1) any site – for example FDsys.gov – that gives LOCKSS permission to harvest can be collected by the LOCKSS Web
harvester -- the state of the art in Web harvesting!
2) and this is the cool part: lockss goes through a process of checking and polling all digital content in all of the lockss boxes
on a network. If 1 box has content that is different from all of the other boxes, the software will fix the content, assuring that all
content in the whole network is exactly the same. It is for all intents and purposes injecting stem cells into the network to
replicate and fix content that’s become corrupted over time.
That’s it. LOCKSS is elegant in its simplicity and proven effective in keeping LOCAL(!) digital content safely preserved over
time. This is as close to the unix maxim of “doing one thing, doing it well.”
19. LOCKSS-USDOCS
• LOCKSS for US Documents
• Replicates FDLP in the digital environment
• “digital deposit” (for more on “digital deposit,” see
http://freegovinfo.info/taxonomy/term/3)
• Tamper evident
• 36 libraries and GPO participating
Wednesday, January 23, 2013
So now you can see why some of us in the documents community are so excited about LOCKSS and why we decided to
implement LOCKSS-USDOCS. Portland State and Simon Frasier Universities are the closest partners but I’m always looking
for more.
Using the LOCKSS software we are re-implementing a tamper evident distributed preservation system for digital documents.
Rather than a central silo on a .gov server, digital govt documents reside on 36 servers at 36 different libraries (and counting!).
20. LOCKSS-USDOCS is ...
Federal register, code of federal regulations, congressional
record, congressional bills, congressional reports, US
Code, Public&Private laws, Public Papers of the President,
historic supreme court decisions, US Statutes at Large,
GAO Reports, US Budget ...
and more!!
http://www.gpo.gov/fdsys/browse/collectiontab.action
Wednesday, January 23, 2013
GPO has been instrumental in this process by putting LOCKSS permission statements on all 44 FDsys collections. This
includes:
Federal register, code of federal regulations, congressional record, congressional bills, congressional reports, US Code,
Public&Private laws, Public Papers of the President, historic supreme court decisions, US Statutes at Large, GAO Reports, US
Budget, etc many of these going back to the early 1990s when they first went digital.
In the 2008 Blue Ribbon Task Force on Sustainable Digital Preservation and Access, Abby Smith Rumsey wrote, “Access to
valuable digital materials tomorrow depends upon preservation actions taken today; and, over time, access depends on
ongoing and efficient allocation of resources to preservation.”
With LOCKSS-USDOCS we’re taking collective responsibility today for long-term preservation of digital depository materials.
21. Collaboration
• Farmington Plan Redux
• Summer digital FDLP Institute
• Adopt a federal agency
• Join LOCKSS-USDOCS, TRAIL and other
digitization/digital preservation projects
• Seed the cloud:
• Start blogging your Q&As and editing
Wikipedia articles
http://snipurl.com/qa-average-tariff-
levels
• Catalog, catalog, catalog!
Wednesday, January 23, 2013
Ok, here’s James getting back on his soapbox!
As you can see, the technological tools are there. But there’s a need for a “Farmington Plan redux”:
The Farmington Plan, which lasted from 1948 - 1972, was an innovative ARL program of collaborative collection development
whereby subscribing libraries would have responsibility for collecting and cataloging research materials in certain subject and/
or linguistic areas and would then distribute records (in the form of cards) to the National Union Catalog.
Moving forward, here are some things that we need to do as a community to realize this Farmington Plan Redux and build the
digital FDLP reservoir!:
--First and foremost, set up a summer digital FDLP institute modeled on the ICPSR data library workshop which has trained a
generation of data librarians. Cass and I have talked about this before and I think this is one of the most critical pieces of the
Farmington Plan Redux. The institute would train govt information librarians (and those interested in govt information) on the
ins and outs of the Open Archival Information System (OAIS) and other open digital library tools and standards – including a
proposed standard that my friend and FGI co-conspirator Jim Jacobs and I have written about in a soon to be published D-Lib
article called the "Digital-Surrogate Seal of Approval" (DSSOA), a simple way of describing and guaranteeing to end-users the
quality and accuracy of existing digital surrogates created from printed books and other non-digital originals. The institute
would teach techniques for expanding access to both digital and paper collections, give librarians a framework for updating
their understanding and have increased awareness of digital archival concepts and build and expand their digital toolboxes to
include Web harvesting, digital information collection and organization, building and utilizing Web tools and the semantic Web.
--Adopt a federal agency (or better yet, a local/regional office of a federal agency). Submit fugitive documents to GPO for
inclusion in the CGP and distribution out to other depositories.
--Join LOCKSS, the Technical Reports Archive and Information Library (TRAIL) – shout-out to Mel DeSart who’s been
instrumental in building up TRAIL! – and other digitization/digital preservation projects.
--Seed the cloud:
Start blogging your Q&As and editing Wikipedia articles w library resources. Your users are online and using Google and other
search engines to find stuff. This is an easy way to highlight your collections and your library’s resources and services.
Highlighting your collections online brings users to your library.
Shoutout to Ann Lally and Carolyn Dunford for their 2007 D-Lib article about seeding Wikipedia articles!
22. “...let us save what remains: not by vaults and locks
which fence them from the public eye and use in
consigning them to the waste of time, but by such
a multiplication of copies, as shall place them
beyond the reach of accident.”
— Thomas Jefferson, February 18, 1791
Wednesday, January 23, 2013
digital changes a lot of things about information, but it doesn't change the need to collect it, share it, preserve it, and give
access to it. As my friend, mentor and FGI co-conspirator Jim Jacobs recently stated, "lots of collections keep stuff safe!" (yes
there are 2 of us working on FGI!)
“...let us save what remains: not by vaults and locks which fence them from the public eye and use in consigning them to the
waste of time, but by such a multiplication of copies, as shall place them beyond the reach of accident.”
Or in other words:
24. Further reading
• Future of the Federal Depository Library Program. Free Government
Information. http://freegovinfo.info/taxonomy/term/1087
• “Open Government Publications” Letter to Deputy CTO Noveck.
http://freegovinfo.info/node/2970
• “Digital Deposit.” Free Government Information.
http://freegovinfo.info/taxonomy/term/3
• Preservation for all: LOCKSS-USDOCS and our digital future. James Jacobs
and Victoria Reich. Documents to the People (DttP) Volume 38:3 (Fall 2010).
http://freegovinfo.info/system/files/lockssusdocs-dttp38%283%29.pdf
• Everyday Electronic Materials in Policy and Practice. Coalition for Networked
Information (CNI) project briefing. Fall 2010. Katherine Kott.
http://sn.im/eems-report
• A Guide to Distributed Digital Preservation. K. Skinner and M. Schultz, Eds.
(Atlanta, GA: Educopia Institute, 2010). http://www.metaarchive.org/GDDP
• http://lockss-usdocs.stanford.edu
Wednesday, January 23, 2013