1. UBC LIBRARY WEB ARCHIVING
Presentation to UBC Library Community
LARISSA RINGHAM, LIBRARIAN, DIGITAL PROJECTS
CAROLINA ROMAN AMIGO, STUDENT LIBRARIAN, DIGITAL PROJECTS
2.
TODAY…
1. Why web archiving?
2. Benchmarking/research project
3. Web archiving at UBC
4. How Archive-It works
5. What’s next?
5.
WEB ARCHIVING BENCHMARKING – CANADIAN UNIVERSITIES
NUMBER OF COLLECTIONS PER INSTITUTION ON ARCHIVE-IT
• 18 University of Victoria
• 17 University of Alberta
• 10 University of Toronto
• 6 University of British Columbia
• 6 Wilfrid Laurier University
• 4 University of Winnipeg
• 4 University of Manitoba
• 3 University of Waterloo
• 3 Simon Fraser University
• 1 University of Waterloo & Toronto
• 1 University of Saskatchewan
• 1 Dalhousie University
• 1 Carleton University (MacOdrum Library)
12 Canadian universities have web archiving initiatives.
6.
COLLECTION SCOPE – TYPE OF CONTENT
• Institution-owned/affiliated websites
• Subject-specific relevant websites
• Federal/local governmental websites
• Locally relevant events
• Local organizations
• Research projects
• Local news
• Local heritage
• International events
7.
COLLECTION SCOPE – REASONS FOR ARCHIVING
• Public or scholarly interest
• Preserving institution-produced content
• Historical or geographically local significance
• At risk or to be decommissioned
• Supplements an existing collection
• Born-digital resource
8.
ACCESS TO WEB ARCHIVING COLLECTIONS
WHERE ARE THEY AVAILABLE?
• Usually under digital/archives or special collections
• Less often: under subject guides, under additional resources, or featured on the library home page
DO THEY HAVE A DEDICATED PAGE?
• A large majority has a page dedicated to the web archives initiative
HOW ARE THEY LINKED?
• Usually linked to the Archive-It institution page
• Less often: direct links to live webpages, direct links to archived webpages, or restricted-access links
9.
POLICIES ADOPTED BY THE TOP 3 WEB ARCHIVING INITIATIVES AMONG CANADIAN UNIVERSITIES
OWNERSHIP, LIABILITY, AUTHORIZATION
• Ownership remains with the website owner, and the university assumes no liability.
• Authorization is granted for educational purposes, observing copyright restrictions from website owners.
NOTIFICATION AND TAKEDOWN
• Owners are notified or asked for permission only in the case of technologically protected content.
• Takedown requests are accepted.
11.
UBC WEB ARCHIVING PROJECTS TO DATE
Federal Government Websites (pilot)
• Partnered with HSS Library
First Nations and Indigenous Communities Websites
• Partnered with Xwi7xwa Library
2015 Metro Vancouver Transportation and Transit Plebiscite
• Partnered with HSS Library
12.
UBC Asian Library Historical Websites
• Partnered with Asian Library
UBC Conferences and Events
UBC Community and Partners
• Partnered with Faculty of Education, UBC Press
22.
COMING UP NEXT…
BC Local Government Websites
• Collaborative project with UBC / UVic / SFU
UBC.ca Institutional Website
• Partnership with UBC Archives
23.
…AND ON THE HORIZON
• Metadata enhancement
• Access and discoverability
• Assessment and analytics
• Preservation with Archivematica
25.
WEB ARCHIVING: CONTENT PRIORITIES
1. Research, public, or governmental interest relevant for teaching or research
2. Historical or geographically local significance
3. Complementarity with relevant existing collections
4. Content produced by the university or affiliated organizations
27.
HOW ARE WEB ARCHIVING PROJECTS STRUCTURED?
Source: Bragg, M., & Hanna, K. (2013). The Web Archiving Life Cycle Model. Internet Archive.
28.
WEB ARCHIVING PROJECT ROLES
Stakeholder / project partner
• proposes the project
• identifies the content
• performs the final QA check
Digital Initiatives
• evaluates the project against the policy criteria
• scopes the project and assesses resource needs
• performs the archiving crawls
• performs initial QA checks
• creates and applies metadata
• makes content available
29.
SOME THINGS TO KEEP IN MIND…
Technical limitations with the Archive-It crawler, including but very much not limited to:
• JavaScript
• Silverlight
• Dynamic databases
• Password-protected content
• Streaming media
What do we mean by “web archiving”?
Collecting targeted websites and web content for preservation and access
[why it is important]
More and more content is being born digital, and not all of it is being captured
Digital preservation programs capture documents and files, but that is not all of the *web content*
Many sites are abandoned or taken down, and that content is lost forever
UBC Library, through Digital Initiatives, has been doing web archiving since 2013 using the Archive-It service from the Internet Archive (will talk more about that a bit later)
- Our archiving activities have so far been very much an off-the-side-of-the-desk activity, in response to specific, time-sensitive preservation issues. But now we want to think about the program a bit more strategically and take the initiative to the larger library community to hear what you would want to see as web archiving activity at UBC.
Now: Carolina … research
Examples of subject specific relevant websites: University of Alberta Energy/Environment Collection, and Circumpolar Collection.
Examples of Local relevant events: Alberta Floods June 2013
Public or scholarly interest: from websites relevant to research to governmental websites.
Examples of historical or geographically local significance: B.C. Teachers' Labour Dispute (2014) (UVic)
Examples of at risk or to be decommissioned webpages: Edmonton Public Library - Orphaned Collections (University of Alberta)
So, how did we get into web archiving?
In 2013, the CGI-PLN group learned that federal government websites were being replaced, and UBC took part in an initiative to archive them
It was a pilot project, and in retrospect it was not ideal for a pilot given the size of what we captured
But we were pressed for time, and had a little over a month
Almost 1 million pages archived
After that we partnered with Xwi7xwa Library to build another collection to capture First Nations and Indigenous community content
Much of it is an example of endangered community content, and a good example of what I was talking about at the beginning regarding the ephemerality of web content
These sites were largely outdated and not properly maintained, often because the community organizations were no longer in existence.
In some cases we were not successful in capturing the content, sometimes for technical reasons and in one case because the site itself was hijacked
But they are a valuable source of community content for researchers and students.
We are using Archive-It
Archive-It is a subscription web archiving service from the Internet Archive that helps organizations harvest, build, and preserve collections of digital content.
Archive-It allows us to collect and manage our collections of archived content in an openly accessible format, with full-text search available
Content is hosted and stored at the Internet Archive data centers.
[Relationship between the Internet Archive, Archive-It, Wayback Machine]
Archive-It is essentially a managed version of the Wayback Machine.
- The IA’s Wayback Machine crawls web content constantly but indiscriminately
- Archive-It allows us to build custom collections based on the needs of what we are collecting, and gives us control over what is crawled and how often
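As a side note for anyone curious: because captures end up in the Internet Archive, they can be looked up programmatically through the public Wayback Machine availability API. A minimal sketch, assuming the `archive.org/wayback/available` endpoint; the function and variable names here are our own illustration, not part of Archive-It itself:

```python
import json
from urllib.parse import urlencode

# Public Wayback Machine availability endpoint (Internet Archive).
WAYBACK_API = "https://archive.org/wayback/available"

def availability_query(url, timestamp=None):
    """Build a query URL asking whether the Wayback Machine holds a capture.

    timestamp (optional, YYYYMMDD) asks for the capture closest to that date.
    """
    params = {"url": url}
    if timestamp:
        params["timestamp"] = timestamp
    return WAYBACK_API + "?" + urlencode(params)

def closest_capture(api_response):
    """Return the URL of the closest archived snapshot from the API's JSON,
    or None if no capture is available."""
    snap = json.loads(api_response).get("archived_snapshots", {}).get("closest")
    return snap["url"] if snap and snap.get("available") else None
```

Fetching `availability_query("ubc.ca", "20130601")` with any HTTP client would return a small JSON document; `closest_capture` then pulls out the replay URL if one exists.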
I’m going to let Carolina walk us through the steps of a crawl so you can get the basic idea of the capture process.
Our collections are accessible through this institutional collection page [go to live site for demo]
Because we do not have a vast office of bodies dedicated to the task of web archiving, projects are assessed on a case-by-case basis against a set of criteria.
The web archiving program is run out of Digital Initiatives. For the most part we do not actively identify and select content ourselves; we rely on the subject-matter experts, librarians, faculty members, and others, to bring potential content to our attention.
The web archiving team consists of myself and, through to the end of August, Carolina.
Resource constraints