SlideShare a Scribd company logo
1 of 32
UBC LIBRARY WEB
ARCHIVING
Presentation to UBC Library Community
L A R I S S A R I N G H A M , L I B R A R I A N , D I G I T A L P R O J E C T S
C A R O L I N A R O M A N A M I G O , S T U D E N T L I B R A R I A N , D I G I T A L P R O J E C T S
2
TODAY …
1. Why web archiving?
2. Benchmarking/research project
3. Web archiving at UBC
4. How Archive-it works
5. What’s next?
3
ephemerality
WHY WEB ARCHIVING?
4
RESEARCH AND
BENCHMARKING
5
• 18 University of Victoria
• 17 University of Alberta
• 10 University of Toronto
• 6 University of British Columbia
• 6 Wilfrid Laurier University
• 4 University of Winnipeg
• 4 University of Manitoba
• 3 University of Waterloo
• 3 Simon Fraser University
• 1 University of Waterloo & Toronto
• 1 University of Saskatchewan
• 1 Dalhousie University
• 1 Carleton University (MacOdrum Library)
WEB ARCHIVING BENCHMARKING – CANADIAN UNIVERSITIES
NUMBER OF COLLECTIONS PER INSTITUTION ON ARCHIVE-IT
12 Canadian
Universities have
Web Archiving
Initiatives.
6
COLLECTION SCOPE – TYPE OF CONTENT
Institution
owned/
affiliated
website
Subject
specific
relevant
websites
Federal/Local
governmental
websites
Local
relevant
events
Local
organizations
Research
projects
Local
news
Local
heritageInternatio
nal
events
7
COLLECTION SCOPE – REASONS FOR ARCHIVING
Public or
scholarly
interest
Preserving
institution
produced
content
Historical or
geographically
local significance
At risk or to be
Decommissio
ned
Supplement
existing
collection
Born
digital
resource
8
ACCESS TO WEB ARCHIVING COLLECTIONS
WHERE ARE
THEY AVAILABLE?
DO THEY HAVE A
DEDICATED PAGE?
HOW ARE THEY
LINKED?
Usually under
digital/archives
or special
collections
Large majority
has a dedicated
page to Web
Archives
Initiative
Usually linked to
Archive-it
institution page
Under subject
guides
Under additional
resources
Featured on library
home page
LESS OFTEN:
Direct link for live
webpages
Restricted access
link
Direct link for
archived webpages
LESS OFTEN:
9
• Ownership remains with website owner
and university has no liability.
• Authorization is granted to educational
purposes, since observing copyright
restrictions from website owners.
OWNERSHIP, LIABILITY, AUTHORIZATION
• Only notifies owners/asks permission in
case of technological protected content.
• Accepts takedown requests.
NOTIFICATION AND TAKEDOWN
POLICIES ADOPTED BY TOP 3 WEB ARCHIVES INITIATIVES
AMONG CANADIAN UNIVERSITITES
10
WEB ARCHIVING AT UBC
11
* Pilot * Federal Government Websites
• Partnered with HSS Library
First Nations and Indigenous Communities Websites
• Partnered with Xwi7wxa Library
2015 Metro Vancouver Transportation and Transit Plebiscite
• Partnered with HSS Library
UBC WEB ARCHIVING PROJECTS TO DATE
12
UBC Asian Library Historical Websites
• Partnered with Asian Library
UBC Conferences and Events
UBC Community and Partners
• Partnered with Faculty of Education, UBC Press
13
HOW ARE WE COLLECTING THE WEB CONTENT?
14
HOW
ARCHIVE-IT
WORKS
http://mayorscouncil.ca/
15
HOW ARCHIVE-IT WORKS
1st add seeds
2nd add scope rules
16
HOW
ARCHIVE-IT
WORKS
Crawl in progress
17
HOW
ARCHIVE-IT
WORKS
Crawl Report
18
HOW
ARCHIVE-IT
WORKS
Quality
Assessment
19
HOW
ARCHIVE-IT
WORKS
Archived
Page on Wayback
Machine
20
UBC Archive-it collection
homepage
https://archive-it.org/organizations/734
HOW
ARCHIVE-IT
WORKS
21
NEXT STEPS
22
BC Local Government Websites
• Collaborative project with UBC / UVic / SFU
UBC.ca Institutional Website
• Partnership with UBC Archives
COMING UP NEXT ……
23
• Metadata enhancement
• Access and discoverability
• Assessment and analytics
• Preservation with Archivematica
…. AND ON THE HORIZON
24
HOW DO I PROPOSE A
WEB ARCHIVING
PROJECT?
25
1. Research, public or governmental interests relevant for teaching or research
2. Historically or geographically local significance
3. Complementarity to relevant existing collections
4. Content produced by the university or affiliated organizations
WEB ARCHIVING: CONTENT PRIORITIES
26
Risk of disappearance
Originality
Frequency of update
Resource constraints
Duplication
Access
WEB ARCHIVING PROJECT PROPOSALS
27
HOW ARE WEB
ARCHIVING PROJECTS
STRUCTURED?
Source: Bragg, M., & Hanna, K. (2013). The Web
Archiving Life Cycle Model (Publication). Internet Archive.
28
Stakeholder / project partner
• proposes the project
• identifies the content
• performs final QA check
WEB ARCHIVING PROJECT ROLES
Digital Initiatives
• evaluates project against the
policy criteria
• scopes project and assesses for
resources
• performs the archiving crawls
• performs initial QA checks
• creates and applies metadata
• makes content available
29
Technical limitations with the
Archive-it crawler
SOME THINGS TO KEEP IN MIND ….
including but very much not limited to ….
JavaScript
Silverlight
Dynamic databases
Password protected content
Streaming media
30
Web archives team
SOME THINGS TO KEEP IN MIND ….
[image not found]
31
FIND OUT MORE
digitize.library.ubc.ca/work/
digitization.centre@ubc.ca
questions?
larissa.ringham@ubc.ca
UBC Library Web Archiving 2016

More Related Content

What's hot

Archiving Occupy (presentation for NYC Digital Asset Managers Meetup)
Archiving Occupy (presentation for NYC Digital Asset Managers Meetup)Archiving Occupy (presentation for NYC Digital Asset Managers Meetup)
Archiving Occupy (presentation for NYC Digital Asset Managers Meetup)Anna Perricci
 
Opening Online Learning with OER
Opening Online Learning with OEROpening Online Learning with OER
Opening Online Learning with OERLorna Campbell
 
Finding, managing and using the right MediaHub content
Finding, managing and using the right MediaHub contentFinding, managing and using the right MediaHub content
Finding, managing and using the right MediaHub contentEDINA, University of Edinburgh
 
What Do You Really Want?
What Do You Really Want?What Do You Really Want?
What Do You Really Want?IWMW
 
Hard won: the challenges of obtaining scholarly communication knowledge & skills
Hard won: the challenges of obtaining scholarly communication knowledge & skillsHard won: the challenges of obtaining scholarly communication knowledge & skills
Hard won: the challenges of obtaining scholarly communication knowledge & skillsDanny Kingsley
 
Wikis: Enabling Collaboration in Libraries
Wikis: Enabling Collaboration in LibrariesWikis: Enabling Collaboration in Libraries
Wikis: Enabling Collaboration in LibrariesMeredith Farkas
 
Web 2.0 Tools: Outreach and Community Building
Web 2.0 Tools: Outreach and Community BuildingWeb 2.0 Tools: Outreach and Community Building
Web 2.0 Tools: Outreach and Community BuildingBrian Gray
 
Starting Small: Practical First Steps in Digital Preservation
Starting Small: Practical First Steps in Digital PreservationStarting Small: Practical First Steps in Digital Preservation
Starting Small: Practical First Steps in Digital PreservationHelen Bailey
 
DeRoureWillcox-BuildingOnlineArchives-IsaiahBerlinArchiveWorkshop
DeRoureWillcox-BuildingOnlineArchives-IsaiahBerlinArchiveWorkshopDeRoureWillcox-BuildingOnlineArchives-IsaiahBerlinArchiveWorkshop
DeRoureWillcox-BuildingOnlineArchives-IsaiahBerlinArchiveWorkshopPip Willcox
 
CoPILOT at the University of Surrey: An Introduction
CoPILOT at the University of Surrey: An IntroductionCoPILOT at the University of Surrey: An Introduction
CoPILOT at the University of Surrey: An IntroductionNancy Graham
 
Into the Open: Exploring the benefits of open education and OER
Into the Open: Exploring the benefits of open education and OERInto the Open: Exploring the benefits of open education and OER
Into the Open: Exploring the benefits of open education and OERLorna Campbell
 
“Agile” as Key to Collaboration on NYU Digital Collections Discovery Initiative
“Agile” as Key to Collaboration on NYU Digital Collections Discovery Initiative“Agile” as Key to Collaboration on NYU Digital Collections Discovery Initiative
“Agile” as Key to Collaboration on NYU Digital Collections Discovery InitiativeLovins, Daniel
 
Cdc editlib eng_reptic_2015
Cdc editlib eng_reptic_2015Cdc editlib eng_reptic_2015
Cdc editlib eng_reptic_2015Chantal Lalonde
 
The Brooklyn Visual Heritage Website: Brooklyn’s Museums and Libraries Collab...
The Brooklyn Visual Heritage Website: Brooklyn’s Museums and Libraries Collab...The Brooklyn Visual Heritage Website: Brooklyn’s Museums and Libraries Collab...
The Brooklyn Visual Heritage Website: Brooklyn’s Museums and Libraries Collab...Jonathan Bowen
 
Making the most of digital resources - Anthony Beal and Neil Longley
Making the most of digital resources - Anthony Beal and Neil LongleyMaking the most of digital resources - Anthony Beal and Neil Longley
Making the most of digital resources - Anthony Beal and Neil LongleyJisc
 
Digital transformations: new challenges for the arts and humanities - Andrew ...
Digital transformations: new challenges for the arts and humanities - Andrew ...Digital transformations: new challenges for the arts and humanities - Andrew ...
Digital transformations: new challenges for the arts and humanities - Andrew ...Jisc
 
Urban Archaeology - Session 12: Writing for Archaeology
Urban Archaeology - Session 12: Writing for ArchaeologyUrban Archaeology - Session 12: Writing for Archaeology
Urban Archaeology - Session 12: Writing for ArchaeologyNicole Beale
 

What's hot (20)

Archiving Occupy (presentation for NYC Digital Asset Managers Meetup)
Archiving Occupy (presentation for NYC Digital Asset Managers Meetup)Archiving Occupy (presentation for NYC Digital Asset Managers Meetup)
Archiving Occupy (presentation for NYC Digital Asset Managers Meetup)
 
Opening Online Learning with OER
Opening Online Learning with OEROpening Online Learning with OER
Opening Online Learning with OER
 
Finding, managing and using the right MediaHub content
Finding, managing and using the right MediaHub contentFinding, managing and using the right MediaHub content
Finding, managing and using the right MediaHub content
 
What Do You Really Want?
What Do You Really Want?What Do You Really Want?
What Do You Really Want?
 
Hard won: the challenges of obtaining scholarly communication knowledge & skills
Hard won: the challenges of obtaining scholarly communication knowledge & skillsHard won: the challenges of obtaining scholarly communication knowledge & skills
Hard won: the challenges of obtaining scholarly communication knowledge & skills
 
Open Education Global 2015 Conference Report
Open Education Global 2015 Conference ReportOpen Education Global 2015 Conference Report
Open Education Global 2015 Conference Report
 
To the Rescue of Scholarly Orphans
To the Rescue of Scholarly OrphansTo the Rescue of Scholarly Orphans
To the Rescue of Scholarly Orphans
 
Wikis: Enabling Collaboration in Libraries
Wikis: Enabling Collaboration in LibrariesWikis: Enabling Collaboration in Libraries
Wikis: Enabling Collaboration in Libraries
 
Web 2.0 Tools: Outreach and Community Building
Web 2.0 Tools: Outreach and Community BuildingWeb 2.0 Tools: Outreach and Community Building
Web 2.0 Tools: Outreach and Community Building
 
Starting Small: Practical First Steps in Digital Preservation
Starting Small: Practical First Steps in Digital PreservationStarting Small: Practical First Steps in Digital Preservation
Starting Small: Practical First Steps in Digital Preservation
 
Ppls mvm2
Ppls mvm2Ppls mvm2
Ppls mvm2
 
DeRoureWillcox-BuildingOnlineArchives-IsaiahBerlinArchiveWorkshop
DeRoureWillcox-BuildingOnlineArchives-IsaiahBerlinArchiveWorkshopDeRoureWillcox-BuildingOnlineArchives-IsaiahBerlinArchiveWorkshop
DeRoureWillcox-BuildingOnlineArchives-IsaiahBerlinArchiveWorkshop
 
CoPILOT at the University of Surrey: An Introduction
CoPILOT at the University of Surrey: An IntroductionCoPILOT at the University of Surrey: An Introduction
CoPILOT at the University of Surrey: An Introduction
 
Into the Open: Exploring the benefits of open education and OER
Into the Open: Exploring the benefits of open education and OERInto the Open: Exploring the benefits of open education and OER
Into the Open: Exploring the benefits of open education and OER
 
“Agile” as Key to Collaboration on NYU Digital Collections Discovery Initiative
“Agile” as Key to Collaboration on NYU Digital Collections Discovery Initiative“Agile” as Key to Collaboration on NYU Digital Collections Discovery Initiative
“Agile” as Key to Collaboration on NYU Digital Collections Discovery Initiative
 
Cdc editlib eng_reptic_2015
Cdc editlib eng_reptic_2015Cdc editlib eng_reptic_2015
Cdc editlib eng_reptic_2015
 
The Brooklyn Visual Heritage Website: Brooklyn’s Museums and Libraries Collab...
The Brooklyn Visual Heritage Website: Brooklyn’s Museums and Libraries Collab...The Brooklyn Visual Heritage Website: Brooklyn’s Museums and Libraries Collab...
The Brooklyn Visual Heritage Website: Brooklyn’s Museums and Libraries Collab...
 
Making the most of digital resources - Anthony Beal and Neil Longley
Making the most of digital resources - Anthony Beal and Neil LongleyMaking the most of digital resources - Anthony Beal and Neil Longley
Making the most of digital resources - Anthony Beal and Neil Longley
 
Digital transformations: new challenges for the arts and humanities - Andrew ...
Digital transformations: new challenges for the arts and humanities - Andrew ...Digital transformations: new challenges for the arts and humanities - Andrew ...
Digital transformations: new challenges for the arts and humanities - Andrew ...
 
Urban Archaeology - Session 12: Writing for Archaeology
Urban Archaeology - Session 12: Writing for ArchaeologyUrban Archaeology - Session 12: Writing for Archaeology
Urban Archaeology - Session 12: Writing for Archaeology
 

Viewers also liked

UBC Library Digitization Program: From Capture to Community
UBC Library Digitization Program: From Capture to CommunityUBC Library Digitization Program: From Capture to Community
UBC Library Digitization Program: From Capture to CommunityLarissa Ringham
 
Digital Humanities at the University of British Columbia Library: The Royal F...
Digital Humanities at the University of British Columbia Library: The Royal F...Digital Humanities at the University of British Columbia Library: The Royal F...
Digital Humanities at the University of British Columbia Library: The Royal F...Larissa Ringham
 
Bas herri sarea + baratzan dantzan konpromisoak
Bas herri sarea + baratzan dantzan   konpromisoakBas herri sarea + baratzan dantzan   konpromisoak
Bas herri sarea + baratzan dantzan konpromisoakJoxepo Garmendia
 
Tools to Manage Your Digital & Social Media #godigitalyuma
Tools to Manage Your Digital & Social Media #godigitalyumaTools to Manage Your Digital & Social Media #godigitalyuma
Tools to Manage Your Digital & Social Media #godigitalyumaAlan Pruitt
 
My goals ....
My goals ....My goals ....
My goals ....sofi1900
 
Schedule: English 2269
Schedule: English 2269Schedule: English 2269
Schedule: English 2269Torsa Ghosal
 
Medical Careers
Medical CareersMedical Careers
Medical CareersMaria5548
 
Learning goals 1 1
Learning goals 1 1Learning goals 1 1
Learning goals 1 1Torsa Ghosal
 
Home Remedies
Home RemediesHome Remedies
Home RemediesMaria5548
 

Viewers also liked (13)

UBC Library Digitization Program: From Capture to Community
UBC Library Digitization Program: From Capture to CommunityUBC Library Digitization Program: From Capture to Community
UBC Library Digitization Program: From Capture to Community
 
Digital Humanities at the University of British Columbia Library: The Royal F...
Digital Humanities at the University of British Columbia Library: The Royal F...Digital Humanities at the University of British Columbia Library: The Royal F...
Digital Humanities at the University of British Columbia Library: The Royal F...
 
Bas herri sarea + baratzan dantzan konpromisoak
Bas herri sarea + baratzan dantzan   konpromisoakBas herri sarea + baratzan dantzan   konpromisoak
Bas herri sarea + baratzan dantzan konpromisoak
 
Tweetdeck
TweetdeckTweetdeck
Tweetdeck
 
Schedule
ScheduleSchedule
Schedule
 
Cancer
CancerCancer
Cancer
 
Tools to Manage Your Digital & Social Media #godigitalyuma
Tools to Manage Your Digital & Social Media #godigitalyumaTools to Manage Your Digital & Social Media #godigitalyuma
Tools to Manage Your Digital & Social Media #godigitalyuma
 
My goals ....
My goals ....My goals ....
My goals ....
 
Schedule: English 2269
Schedule: English 2269Schedule: English 2269
Schedule: English 2269
 
Powerpoint
PowerpointPowerpoint
Powerpoint
 
Medical Careers
Medical CareersMedical Careers
Medical Careers
 
Learning goals 1 1
Learning goals 1 1Learning goals 1 1
Learning goals 1 1
 
Home Remedies
Home RemediesHome Remedies
Home Remedies
 

Similar to UBC Library Web Archiving 2016

Building Web Archiving Collaborations to Save [More of] the Web
Building Web Archiving Collaborations to Save [More of] the WebBuilding Web Archiving Collaborations to Save [More of] the Web
Building Web Archiving Collaborations to Save [More of] the WebAnna Perricci
 
Collaborative Web Archiving with Ivy Plus / Borrow Direct
Collaborative Web Archiving with Ivy Plus / Borrow Direct Collaborative Web Archiving with Ivy Plus / Borrow Direct
Collaborative Web Archiving with Ivy Plus / Borrow Direct Anna Perricci
 
NCompass Live: Learning Opportunities and Resources from WebJunction
NCompass Live: Learning Opportunities and Resources from WebJunctionNCompass Live: Learning Opportunities and Resources from WebJunction
NCompass Live: Learning Opportunities and Resources from WebJunctionNebraska Library Commission
 
Web Preservation, or Managing your Organisation’s Online Presence After the O...
Web Preservation, or Managing your Organisation’s Online Presence After the O...Web Preservation, or Managing your Organisation’s Online Presence After the O...
Web Preservation, or Managing your Organisation’s Online Presence After the O...lisbk
 
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...Best Practices Exchange 2013: How collaboration can save [more of] the web: r...
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...Anna Perricci
 
Library Connect Webinar | Fostering research community through library spaces...
Library Connect Webinar | Fostering research community through library spaces...Library Connect Webinar | Fostering research community through library spaces...
Library Connect Webinar | Fostering research community through library spaces...Library_Connect
 
Progress Made and Lessons Learned through Collaborative Web Archiving Proj...
Progress Made and Lessons Learned through Collaborative Web Archiving Proj...Progress Made and Lessons Learned through Collaborative Web Archiving Proj...
Progress Made and Lessons Learned through Collaborative Web Archiving Proj...Anna Perricci
 
Increase impact of_your_research_open_access_and_c_i_rcle
Increase impact of_your_research_open_access_and_c_i_rcleIncrease impact of_your_research_open_access_and_c_i_rcle
Increase impact of_your_research_open_access_and_c_i_rcleUniversity of British Columbia
 
Digital collections: Increasing awareness and use
Digital collections:  Increasing awareness and useDigital collections:  Increasing awareness and use
Digital collections: Increasing awareness and useButtes
 
Moo cs and libraries bc faculty day 2014
Moo cs and libraries bc faculty day 2014Moo cs and libraries bc faculty day 2014
Moo cs and libraries bc faculty day 2014myspacelibrarian
 
Moo cs and libraries bc faculty day 2014
Moo cs and libraries bc faculty day 2014Moo cs and libraries bc faculty day 2014
Moo cs and libraries bc faculty day 2014myspacelibrarian
 
JISC LIDP ILI2011
JISC LIDP ILI2011JISC LIDP ILI2011
JISC LIDP ILI2011daveyp
 
Strategies for LIS research
Strategies for LIS researchStrategies for LIS research
Strategies for LIS researchVaralakshmiRSR
 
Contributing to the global commons: Repositories and Wikimedia
Contributing to the global commons: Repositories and WikimediaContributing to the global commons: Repositories and Wikimedia
Contributing to the global commons: Repositories and WikimediaNick Sheppard
 
Finding Research After You Graduate
Finding Research After You GraduateFinding Research After You Graduate
Finding Research After You GraduateElaine Lasda
 
Looking at Libraries, collections & technology
Looking at Libraries, collections & technologyLooking at Libraries, collections & technology
Looking at Libraries, collections & technologylisld
 
Prospects and pitfalls in using web archives for research
Prospects and pitfalls in using web archives for researchProspects and pitfalls in using web archives for research
Prospects and pitfalls in using web archives for researchPeter Webster
 
L&P Diego Argaez - Open Access Innovations: Think Local, Act Global
L&P Diego Argaez -  Open Access Innovations: Think Local, Act GlobalL&P Diego Argaez -  Open Access Innovations: Think Local, Act Global
L&P Diego Argaez - Open Access Innovations: Think Local, Act GlobalCASRAI
 
Marc and beyond: 3 Linked Data Choices
 Marc and beyond: 3 Linked Data Choices  Marc and beyond: 3 Linked Data Choices
Marc and beyond: 3 Linked Data Choices Richard Wallis
 

Similar to UBC Library Web Archiving 2016 (20)

Building Web Archiving Collaborations to Save [More of] the Web
Building Web Archiving Collaborations to Save [More of] the WebBuilding Web Archiving Collaborations to Save [More of] the Web
Building Web Archiving Collaborations to Save [More of] the Web
 
Collaborative Web Archiving with Ivy Plus / Borrow Direct
Collaborative Web Archiving with Ivy Plus / Borrow Direct Collaborative Web Archiving with Ivy Plus / Borrow Direct
Collaborative Web Archiving with Ivy Plus / Borrow Direct
 
NCompass Live: Learning Opportunities and Resources from WebJunction
NCompass Live: Learning Opportunities and Resources from WebJunctionNCompass Live: Learning Opportunities and Resources from WebJunction
NCompass Live: Learning Opportunities and Resources from WebJunction
 
Web Preservation, or Managing your Organisation’s Online Presence After the O...
Web Preservation, or Managing your Organisation’s Online Presence After the O...Web Preservation, or Managing your Organisation’s Online Presence After the O...
Web Preservation, or Managing your Organisation’s Online Presence After the O...
 
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...Best Practices Exchange 2013: How collaboration can save [more of] the web: r...
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...
 
Library Connect Webinar | Fostering research community through library spaces...
Library Connect Webinar | Fostering research community through library spaces...Library Connect Webinar | Fostering research community through library spaces...
Library Connect Webinar | Fostering research community through library spaces...
 
Progress Made and Lessons Learned through Collaborative Web Archiving Proj...
Progress Made and Lessons Learned through Collaborative Web Archiving Proj...Progress Made and Lessons Learned through Collaborative Web Archiving Proj...
Progress Made and Lessons Learned through Collaborative Web Archiving Proj...
 
Increase impact of_your_research_open_access_and_c_i_rcle
Increase impact of_your_research_open_access_and_c_i_rcleIncrease impact of_your_research_open_access_and_c_i_rcle
Increase impact of_your_research_open_access_and_c_i_rcle
 
Digital collections: Increasing awareness and use
Digital collections:  Increasing awareness and useDigital collections:  Increasing awareness and use
Digital collections: Increasing awareness and use
 
Moo cs and libraries bc faculty day 2014
Moo cs and libraries bc faculty day 2014Moo cs and libraries bc faculty day 2014
Moo cs and libraries bc faculty day 2014
 
Moo cs and libraries bc faculty day 2014
Moo cs and libraries bc faculty day 2014Moo cs and libraries bc faculty day 2014
Moo cs and libraries bc faculty day 2014
 
JISC LIDP ILI2011
JISC LIDP ILI2011JISC LIDP ILI2011
JISC LIDP ILI2011
 
Strategies for LIS research
Strategies for LIS researchStrategies for LIS research
Strategies for LIS research
 
Contributing to the global commons: Repositories and Wikimedia
Contributing to the global commons: Repositories and WikimediaContributing to the global commons: Repositories and Wikimedia
Contributing to the global commons: Repositories and Wikimedia
 
Finding Research After You Graduate
Finding Research After You GraduateFinding Research After You Graduate
Finding Research After You Graduate
 
Looking at Libraries, collections & technology
Looking at Libraries, collections & technologyLooking at Libraries, collections & technology
Looking at Libraries, collections & technology
 
EDINA Serials UKLA SafeNet
EDINA Serials UKLA SafeNetEDINA Serials UKLA SafeNet
EDINA Serials UKLA SafeNet
 
Prospects and pitfalls in using web archives for research
Prospects and pitfalls in using web archives for researchProspects and pitfalls in using web archives for research
Prospects and pitfalls in using web archives for research
 
L&P Diego Argaez - Open Access Innovations: Think Local, Act Global
L&P Diego Argaez -  Open Access Innovations: Think Local, Act GlobalL&P Diego Argaez -  Open Access Innovations: Think Local, Act Global
L&P Diego Argaez - Open Access Innovations: Think Local, Act Global
 
Marc and beyond: 3 Linked Data Choices
 Marc and beyond: 3 Linked Data Choices  Marc and beyond: 3 Linked Data Choices
Marc and beyond: 3 Linked Data Choices
 

Recently uploaded

Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 

Recently uploaded (20)

Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 

UBC Library Web Archiving 2016

  • 1. UBC LIBRARY WEB ARCHIVING Presentation to UBC Library Community L A R I S S A R I N G H A M , L I B R A R I A N , D I G I T A L P R O J E C T S C A R O L I N A R O M A N A M I G O , S T U D E N T L I B R A R I A N , D I G I T A L P R O J E C T S
  • 2. 2 TODAY … 1. Why web archiving? 2. Benchmarking/research project 3. Web archiving at UBC 4. How Archive-it works 5. What’s next?
  • 5. 5 • 18 University of Victoria • 17 University of Alberta • 10 University of Toronto • 6 University of British Columbia • 6 Wilfrid Laurier University • 4 University of Winnipeg • 4 University of Manitoba • 3 University of Waterloo • 3 Simon Fraser University • 1 University of Waterloo & Toronto • 1 University of Saskatchewan • 1 Dalhousie University • 1 Carleton University (MacOdrum Library) WEB ARCHIVING BENCHMARKING – CANADIAN UNIVERSITIES NUMBER OF COLLECTIONS PER INSTITUTION ON ARCHIVE-IT 12 Canadian Universities have Web Archiving Initiatives.
  • 6. 6 COLLECTION SCOPE – TYPE OF CONTENT Institution owned/ affiliated website Subject specific relevant websites Federal/Local governmental websites Local relevant events Local organizations Research projects Local news Local heritageInternatio nal events
  • 7. 7 COLLECTION SCOPE – REASONS FOR ARCHIVING Public or scholarly interest Preserving institution produced content Historical or geographically local significance At risk or to be Decommissio ned Supplement existing collection Born digital resource
  • 8. 8 ACCESS TO WEB ARCHIVING COLLECTIONS WHERE ARE THEY AVAILABLE? DO THEY HAVE A DEDICATED PAGE? HOW ARE THEY LINKED? Usually under digital/archives or special collections Large majority has a dedicated page to Web Archives Initiative Usually linked to Archive-it institution page Under subject guides Under additional resources Featured on library home page LESS OFTEN: Direct link for live webpages Restricted access link Direct link for archived webpages LESS OFTEN:
  • 9. 9 • Ownership remains with website owner and university has no liability. • Authorization is granted to educational purposes, since observing copyright restrictions from website owners. OWNERSHIP, LIABILITY, AUTHORIZATION • Only notifies owners/asks permission in case of technological protected content. • Accepts takedown requests. NOTIFICATION AND TAKEDOWN POLICIES ADOPTED BY TOP 3 WEB ARCHIVES INITIATIVES AMONG CANADIAN UNIVERSITITES
  • 11. 11 * Pilot * Federal Government Websites • Partnered with HSS Library First Nations and Indigenous Communities Websites • Partnered with Xwi7wxa Library 2015 Metro Vancouver Transportation and Transit Plebiscite • Partnered with HSS Library UBC WEB ARCHIVING PROJECTS TO DATE
  • 12. 12 UBC Asian Library Historical Websites • Partnered with Asian Library UBC Conferences and Events UBC Community and Partners • Partnered with Faculty of Education, UBC Press
  • 13. 13 HOW ARE WE COLLECTING THE WEB CONTENT?
  • 15. 15 HOW ARCHIVE-IT WORKS 1st add seeds 2nd add scope rules
  • 22. 22 BC Local Government Websites • Collaborative project with UBC / UVic / SFU UBC.ca Institutional Website • Partnership with UBC Archives COMING UP NEXT ……
  • 23. 23 • Metadata enhancement • Access and discoverability • Assessment and analytics • Preservation with Archivematica …. AND ON THE HORIZON
  • 24. 24 HOW DO I PROPOSE A WEB ARCHIVING PROJECT?
  • 25. 25 1. Research, public or governmental interests relevant for teaching or research 2. Historically or geographically local significance 3. Complementarity to relevant existing collections 4. Content produced by the university or affiliated organizations WEB ARCHIVING: CONTENT PRIORITIES
  • 26. 26 Risk of disappearance Originality Frequency of update Resource constraints Duplication Access WEB ARCHIVING PROJECT PROPOSALS
  • 27. 27 HOW ARE WEB ARCHIVING PROJECTS STRUCTURED? Source: Bragg, M., & Hanna, K. (2013). The Web Archiving Life Cycle Model (Publication). Internet Archive.
  • 28. 28 Stakeholder / project partner • proposes the project • identifies the content • performs final QA check WEB ARCHIVING PROJECT ROLES Digital Initiatives • evaluates project against the policy criteria • scopes project and assesses for resources • performs the archiving crawls • performs initial QA checks • creates and applies metadata • makes content available
  • 29. 29 Technical limitations with the Archive-it crawler SOME THINGS TO KEEP IN MIND …. including but very much not limited to …. JavaScript Silverlight Dynamic databases Password protected content Streaming media
  • 30. 30 Web archives team SOME THINGS TO KEEP IN MIND …. [image not found]

Editor's Notes

  1. What do we mean by “web archiving”? Collecting targeted web sites and web content for preservation and access [why it is important] More and more content is being born digital, and not all of it is being captured digital preservation programs capture documents and files, but that is not all the *web content* Many sites are abandoned or taken down and that content is lost forever UBC Library, through digital initiatives, has been doing web archiving since 2013 using the archive-it service from the internet archive (will take more about that a bit later) - Our archiving activities have so far been very much an off-the-side of the desk activity and in response to specific, time-sensitive preservation issues. But now we wanted to think about the program a bit more strategically and take the initiative to the larger library community to hear what you would want to see as web archiving activity at UBC. Now: Carolina … research
  2. Examples of subject specific relevant websites: University of Alberta Energy/Environment Collection, and Circumpolar Collection. Examples of Local relevant events: Alberta Floods June 2013
  3. Public or scholarly interest: from websites relevant to research to governmental websites. Examples of historical or geographically local significance: B.C. Teachers' Labour Dispute (2014) (Uvic) Examples of at risk or to be decommissioned webpages: Edmonton Public Library - Orphaned Collections (University of Alberta)
  4. So, how did we get into web archiving? 2013 CGI-PLN group learned that the federal government websites were being replaced, UBC took part in an initiative to archive them Pilot project and in retrospect it was not ideal for a pilot given the size of what we captured But we were pressed for time, and had a little over a month Almost 1million pages archived After that we partnered with Xwi7xwa library to build another collection to capture FN and indigenous community content Much here is an example of endangered community content, and a good example of what I was talking about at the beginning regarding the ephemerality of web content These sites were largely outdated, not properly maintained often due to the community organizations no longer being in existence. In some cases we were not successful in capturing the content, sometimes due to technical reasons and once being the site itself was hijacked But they are a valuable source of community content for researchers and students.
  5. We are using Archive-it Archive-It is a subscription web archiving service from the Internet Archive that helps organizations to harvest, build, and preserve collections of digital content. Archive-It allows us to collect, and manage our collections of archived content in an openly accessible format and with full text search available Content is hosted and stored at the Internet Archive data centers. [Relationship between the internet archive, archive it, wayback machine] Archive-it is essentially a managed version of the wayback machine. - The IA’s wayback machine crawls web content constantly but indiscriminately - Archive-it allows us to build custom collections based on the needs of what we are collecting, and gives us control over what is crawled and how often
  6. I’m going to let Carolina walk us through the steps of a crawl so you can get the basic idea of the capture process.
  7. Our collections are accessible through this institutional collection page [go to live site for demo]
  8. Due to the fact that we do not have a vast office of bodies dedicated to the task of web archiving – projects are assessed on a case by case basis against a set of criteria.
  9. Web archiving program is run out of digital initiatives. For the most part we do not actively identify and select content for recruitment, we rely on the subject matter experts, librarians, faculty members and others to bring potential content to our attention.
  10. The web archiving team consists of myself, and through to the end of August, Carolina. Resource constraints