Information sharing about Columbia University
Libraries’ recent web archiving conference
Web Archiving Collaboration:
New Tools and Models
Anna Perricci
Columbia University Libraries
2015 Archive-It Partner Meeting
August 18, 2015
• This conference was one of a set
of projects within a Mellon-funded
project
• About 10 slide decks on associated
Mellon-supported web archiving
collaborations are available on
SlideShare:
http://www.slideshare.net/annaperricci
We had a conference
(it was pretty awesome)
Conference: why & how
• This conference was a follow-up to an incentive awards program for
improving web archiving tools
• Goals:
– Share results of funded projects and foster conversations about
application & extensibility of work done
– Provide information about other new collaborative models for web
archiving
• Space limitations - Day 1 and Day 2
– Videos:
https://www.youtube.com/playlist?list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl
• Co-organizers
– Bob Wolven, Alex Thurman and Anna Perricci
– Thanks also to many volunteers and those kindly conscripted
Keynote address
https://www.youtube.com/watch?v=kr82McjQGyQ&index=1&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl
We need radical collaborative models
to do a better job preserving & making
available born digital materials
Incentive award winners: Preserving online
law content
https://www.youtube.com/watch?v=t_qZ4hNtmyw&index=2&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl
Free Law Project
Free Law Project & developments
Free Law Project goals:
• Put entirety of US Case Law
online for the public for free
• Develop web-based legal
research tools
• Support academic research on
legal corpora
• See CourtListener:
https://www.courtlistener.com
Results from grant
• Expanded capacity of Juriscraper to
harvest opinions on appellate court
websites & involve wider community in
scraping work
– https://github.com/freelawproject/juriscraper
– Over 200 web pages checked every
weekday, harvest when changes detected
– Federal circuit courts and Supreme Court;
now covers all state courts of last resort &
most intermediate appellate state courts
• Capture oral arguments
• Collaborative work with RECAP
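The "check every weekday, harvest when changes detected" pattern in the results above can be sketched roughly as follows. This is a minimal illustration, not Juriscraper's actual code: the function names, the SHA-256 content hash as the change signal, and the in-memory dictionary of previously seen hashes are all assumptions made for the example.

```python
import hashlib


def content_fingerprint(page_bytes: bytes) -> str:
    """Return a SHA-256 hex digest of the page body (assumed change signal)."""
    return hashlib.sha256(page_bytes).hexdigest()


def needs_harvest(url: str, page_bytes: bytes, seen: dict) -> bool:
    """Return True if the page is new or differs from the previous check.

    `seen` maps URL -> last fingerprint and is updated in place.
    Hypothetical helper, for illustration only.
    """
    fingerprint = content_fingerprint(page_bytes)
    changed = seen.get(url) != fingerprint
    seen[url] = fingerprint
    return changed
```

A scheduled job could call `needs_harvest` once per monitored page and run the scraper only when it returns `True`. A real deployment would likely persist the fingerprints between runs and hash only the part of the page that lists opinions, so that unrelated changes do not trigger a harvest.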
Perma.cc
• Premise: https://perma.cc/about
– Per http://dx.doi.org/10.2139/ssrn.2329161
“approximately 70% of all links in citations
published between 1999 and 2011 no longer
point to the same material. Broken links in journal
articles undermine the citation-based system of
legal scholarship by obscuring the evidence
underlying authors’ ideas.”
• “Any author can go to the Perma.cc website and input
a URL. Perma.cc downloads the material at that URL
and gives back a new URL (a “Perma.cc link”) that can
then be inserted in a paper.”
• There are some more steps to read about on their
website, including some stats: https://perma.cc/stats
Results
Better APIs for
greater extensibility
of tools and wider
use of technology in
other settings / for
other services
New collaborative models
https://www.youtube.com/watch?v=ntAP064ZdBM&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl&index=3
New York Art Resources Consortium (NYARC)
• NYARC is a consortium of museum libraries
at the Frick Collection, the Museum of
Modern Art and the Brooklyn Museum
• With Mellon funding, they are preserving
and making available websites on themes
ranging from artists’ websites to their own
institutions’ websites to resources for
restitution-related research
• Cutting-edge work on access via a discovery
layer is in progress
• They are leaders in work on quality
assurance
• Excellent resource created by NDSR
resident: http://wiki.nyarc.org/web-archiving/quality-assurance
Collaborative collecting with Ivy Plus
International Internet Preservation Consortium
(IIPC) collaborative collections
• Access working group of the IIPC is
leading an initiative to collaboratively
collect websites on topics of
international interest
• Complications and challenges arose,
particularly in determining collecting
themes and because of variations in laws
governing intellectual property
• Archive-It chosen as tool for creating
collection
• No spoiler alert: watch the video to
see a historic moment in IIPC’s
collaborative collecting
Lunch
Image source: http://www.omgcatsinspace.com/post/38940531331
Lightning Talks
https://www.youtube.com/watch?v=jqrC0Ygdc-M&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl&index=4
Lightning talks: Michael Lissner on RECAP
Lightning talks: Jefferson Bailey
on Archive-It Researcher Services
Lightning talks: Dragan Espenschied on
Rhizome’s web archiving work
Lightning talks: Dan Chudnov
on Social Feed Manager
Lightning talks: Jack Cushman
on Tools for Time Travel
Incentive award winners: New platforms
https://www.youtube.com/watch?v=6h8MohBSEtI&index=5&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl
Warcbase: Building a scalable platform on
HBase and Hadoop
Warcbase
Warcbase is an open-source
platform for storing,
managing, and analyzing web
archives using current “big
data” infrastructure and tools
(e.g. HBase for storage,
Hadoop for data analytics)
– Further applications on
‘wimpy hardware’ (Raspberry
Pi) were also demonstrated for
personal digital archiving
For more information on Ian Milligan’s work:
http://ianmilligan.ca/
For more information on Warcbase:
https://github.com/lintool/warcbase
Archiving transactions towards an
uninterruptible web service
Webmasters can benefit from web archiving!
Goal: create new tools and leverage existing ones so that, when a web
resource is unavailable due to an interruption of a key service, an
archived copy is provided to the end user
• Web archives can serve as a value-added collection to
motivate web archiving as a tool for day-to-day IT operations
For more information see: https://www.cs.vt.edu/node/7650
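The goal stated above (serve an archived copy when the live resource is unavailable) can be illustrated with a small sketch. It is not the project's implementation: the function name and the two injected callables, `fetch_live` and `fetch_archived`, are hypothetical stand-ins for whatever web server and archive lookup a site actually uses.

```python
from typing import Callable, Optional, Tuple


def fetch_with_archive_fallback(
    url: str,
    fetch_live: Callable[[str], bytes],
    fetch_archived: Callable[[str], Optional[bytes]],
) -> Tuple[str, bytes]:
    """Try the live resource first; on failure, return an archived copy.

    Returns (source, body), where source is "live" or "archive".
    Raises LookupError if neither copy is available.
    """
    try:
        return "live", fetch_live(url)
    except OSError:
        # The live service is interrupted (connection errors are OSError
        # subclasses), so fall back to the most recent archived copy.
        archived = fetch_archived(url)
        if archived is None:
            raise LookupError(f"no live or archived copy of {url}")
        return "archive", archived
```

Passing the fetchers in as arguments keeps the fallback logic independent of any particular archive, so the same function could sit in front of a local WARC store or a remote archive service.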
Incentive award winners: New curation tools
for web archivists
https://www.youtube.com/watch?v=yeuk_vIOXcw&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl&index=6
Tools for managing seed URIs
• Results: developed a tool that enables curators to detect when
their web archives are off topic and to discover new
seed sites to include in collections
• For more information see:
https://github.com/yasmina85/offtopic-Detection
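The slide does not say how the tool decides that a capture is off topic. One common approach, assumed here purely for illustration, is to compare the text of each later capture with the first capture of the same seed and flag captures whose similarity falls below a threshold. The function names and the 0.2 threshold are hypothetical.

```python
import math
import re
from collections import Counter


def term_counts(text: str) -> Counter:
    """Count lowercase alphanumeric terms in the text."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity of two term-count vectors (0.0 if either is empty)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def flag_off_topic(baseline_text, later_texts, threshold=0.2):
    """Return indices of later captures that drifted from the baseline."""
    base = term_counts(baseline_text)
    return [
        i
        for i, text in enumerate(later_texts)
        if cosine_similarity(base, term_counts(text)) < threshold
    ]
```

For example, a seed first captured as a court opinions page and later captured as a "this domain is for sale" page would share almost no terms with its baseline and would be flagged for curator review.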
Visualizing Digital Collections of Web Archives
Results: developed a tool for showing how a single web page
changes over time
For more information see:
https://github.com/machawk1/ArchiveThumbnails,
http://thumbnails.cs.odu.edu:15421,
https://github.com/machawk1/ArchiveThumbnails/blob/master/CalendarResults.jsp
Exploring a national collaborative
model for web archiving
https://www.youtube.com/watch?v=6f1Vj5cNAIU&index=7&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl
Day 2: smaller group discussions
• Capture beyond traditional crawling
– subtopics: transactional archiving; direct ingest of publisher files; web
recording
• Collection development / Descriptive metadata & access
– subtopics: collaborative collection development; institutional archives;
metadata workflows; non-document collections
• Tools/APIs: integration into systems and standardization
– subtopics: development of tools/APIs; integration into vendor and local
systems; goal of network of standardized components
• Analysis of web archive datasets
– subtopics: tools; researcher perspectives
• Preservation of scholarly & legal record / Long-term preservation
– subtopics: preservation of scholarly record; reference rot; long-term
preservation
It was a really busy and productive day
but we only have one picture
Check out the slides and videos
and let us know what you think!
• See the slides, watch the
videos:
https://library.columbia.edu/bts/web_resources_collection/Conferences/program.html
Feedback welcome!
Attendees of the
conference are especially
encouraged to keep us
posted on resulting work
Thanks!
Anna Perricci
Columbia University Libraries
anna.perricci@gmail.com
@AnnaPerricci
Unless otherwise noted, all images in this presentation are
from the CUWARC Flickr album: https://flic.kr/s/aHskfjd54s

More Related Content

What's hot

Web archiving challenges and opportunities
Web archiving challenges and opportunitiesWeb archiving challenges and opportunities
Web archiving challenges and opportunities
Ahmed AlSum
 
SAA 2014 session 703
SAA 2014 session 703SAA 2014 session 703
SAA 2014 session 703
rosalielack
 
[[edit]] this GLAM
[[edit]] this GLAM[[edit]] this GLAM
[[edit]] this GLAM
wittylama
 
OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosOCLC
 
High and Lows of Library Linked Data
High and Lows of Library Linked DataHigh and Lows of Library Linked Data
High and Lows of Library Linked Data
Adrian Stevenson
 
November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...
November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...
November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...
National Information Standards Organization (NISO)
 
UBC Library Web Archiving 2016
UBC Library Web Archiving 2016UBC Library Web Archiving 2016
UBC Library Web Archiving 2016
Larissa Ringham
 
Digital Visitors and Residents: Project Feedback
Digital Visitors and Residents: Project FeedbackDigital Visitors and Residents: Project Feedback
Digital Visitors and Residents: Project Feedback
jisc-elearning
 
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI PresentationOpen Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
ekansa
 
Keeping the Broadcast Historic Record: An Archive of Public Media in the Making
Keeping the Broadcast Historic Record: An Archive of Public Media in the MakingKeeping the Broadcast Historic Record: An Archive of Public Media in the Making
Keeping the Broadcast Historic Record: An Archive of Public Media in the Making
WGBH Media Library and Archives
 
An A+ Plan to Transform Your Library with Linked Data
An A+ Plan to Transform Your Library with Linked DataAn A+ Plan to Transform Your Library with Linked Data
An A+ Plan to Transform Your Library with Linked Data
National Information Standards Organization (NISO)
 
Boundless Opportunity
Boundless OpportunityBoundless Opportunity
Boundless Opportunity
Rachel Frick
 
IASSIT Kansa Presentation
IASSIT Kansa PresentationIASSIT Kansa Presentation
IASSIT Kansa Presentation
ekansa
 
Report on the Rethinking Resource Sharing Initiative
Report on the Rethinking Resource Sharing InitiativeReport on the Rethinking Resource Sharing Initiative
Report on the Rethinking Resource Sharing Initiativekramsey
 
Library as Place, Place as Library: Duality and the Power of Cooperation
Library as Place, Place as Library: Duality and the Power of CooperationLibrary as Place, Place as Library: Duality and the Power of Cooperation
Library as Place, Place as Library: Duality and the Power of Cooperation
Karen S Calhoun
 
Tools for Managing the Past Web
Tools for Managing the Past WebTools for Managing the Past Web
Tools for Managing the Past Web
Michele Weigle
 
OUR space: the new world of metadata
OUR space: the new world of metadataOUR space: the new world of metadata
OUR space: the new world of metadata
Karen S Calhoun
 
6-4-13 VIVO Case Studies Presentation Slides
6-4-13 VIVO Case Studies Presentation Slides6-4-13 VIVO Case Studies Presentation Slides
6-4-13 VIVO Case Studies Presentation Slides
DuraSpace
 
To the Rescue of Scholarly Orphans
To the Rescue of Scholarly OrphansTo the Rescue of Scholarly Orphans
To the Rescue of Scholarly Orphans
Herbert Van de Sompel
 

What's hot (20)

Web archiving challenges and opportunities
Web archiving challenges and opportunitiesWeb archiving challenges and opportunities
Web archiving challenges and opportunities
 
SAA 2014 session 703
SAA 2014 session 703SAA 2014 session 703
SAA 2014 session 703
 
[[edit]] this GLAM
[[edit]] this GLAM[[edit]] this GLAM
[[edit]] this GLAM
 
OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
 
High and Lows of Library Linked Data
High and Lows of Library Linked DataHigh and Lows of Library Linked Data
High and Lows of Library Linked Data
 
November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...
November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...
November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...
 
UBC Library Web Archiving 2016
UBC Library Web Archiving 2016UBC Library Web Archiving 2016
UBC Library Web Archiving 2016
 
Digital Visitors and Residents: Project Feedback
Digital Visitors and Residents: Project FeedbackDigital Visitors and Residents: Project Feedback
Digital Visitors and Residents: Project Feedback
 
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI PresentationOpen Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
 
Keeping the Broadcast Historic Record: An Archive of Public Media in the Making
Keeping the Broadcast Historic Record: An Archive of Public Media in the MakingKeeping the Broadcast Historic Record: An Archive of Public Media in the Making
Keeping the Broadcast Historic Record: An Archive of Public Media in the Making
 
An A+ Plan to Transform Your Library with Linked Data
An A+ Plan to Transform Your Library with Linked DataAn A+ Plan to Transform Your Library with Linked Data
An A+ Plan to Transform Your Library with Linked Data
 
Boundless Opportunity
Boundless OpportunityBoundless Opportunity
Boundless Opportunity
 
IASSIT Kansa Presentation
IASSIT Kansa PresentationIASSIT Kansa Presentation
IASSIT Kansa Presentation
 
Report on the Rethinking Resource Sharing Initiative
Report on the Rethinking Resource Sharing InitiativeReport on the Rethinking Resource Sharing Initiative
Report on the Rethinking Resource Sharing Initiative
 
Library as Place, Place as Library: Duality and the Power of Cooperation
Library as Place, Place as Library: Duality and the Power of CooperationLibrary as Place, Place as Library: Duality and the Power of Cooperation
Library as Place, Place as Library: Duality and the Power of Cooperation
 
Tools for Managing the Past Web
Tools for Managing the Past WebTools for Managing the Past Web
Tools for Managing the Past Web
 
OUR space: the new world of metadata
OUR space: the new world of metadataOUR space: the new world of metadata
OUR space: the new world of metadata
 
6-4-13 VIVO Case Studies Presentation Slides
6-4-13 VIVO Case Studies Presentation Slides6-4-13 VIVO Case Studies Presentation Slides
6-4-13 VIVO Case Studies Presentation Slides
 
To the Rescue of Scholarly Orphans
To the Rescue of Scholarly OrphansTo the Rescue of Scholarly Orphans
To the Rescue of Scholarly Orphans
 

Viewers also liked

Dismantling Silos to Build Robust Shared Print Projects
Dismantling Silos to Build Robust Shared Print ProjectsDismantling Silos to Build Robust Shared Print Projects
Dismantling Silos to Build Robust Shared Print Projects
Anna Perricci
 
ACRL/NY 2013 poster: Assessment of the Effectiveness of the Human Rights Web ...
ACRL/NY 2013 poster: Assessment of the Effectiveness of the Human Rights Web ...ACRL/NY 2013 poster: Assessment of the Effectiveness of the Human Rights Web ...
ACRL/NY 2013 poster: Assessment of the Effectiveness of the Human Rights Web ...
Anna Perricci
 
Retention Modeling for the Eastern Academic Scholars' Trust (EAST)
Retention Modeling for the Eastern Academic Scholars' Trust (EAST)Retention Modeling for the Eastern Academic Scholars' Trust (EAST)
Retention Modeling for the Eastern Academic Scholars' Trust (EAST)
Anna Perricci
 
METRO Conference 2014: How collaboration can save [more of] the web: recent p...
METRO Conference 2014: How collaboration can save [more of] the web: recent p...METRO Conference 2014: How collaboration can save [more of] the web: recent p...
METRO Conference 2014: How collaboration can save [more of] the web: recent p...
Anna Perricci
 
Establishing and growing a multi-institutional web archiving collaboration f...
Establishing and growing a multi-institutional web archiving collaboration f...Establishing and growing a multi-institutional web archiving collaboration f...
Establishing and growing a multi-institutional web archiving collaboration f...
Anna Perricci
 
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...Best Practices Exchange 2013: How collaboration can save [more of] the web: r...
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...
Anna Perricci
 
SAA Web Archiving Roundtable Education Needs Assessment Survey Results
SAA Web Archiving Roundtable Education Needs Assessment Survey ResultsSAA Web Archiving Roundtable Education Needs Assessment Survey Results
SAA Web Archiving Roundtable Education Needs Assessment Survey Results
Anna Perricci
 
Building Web Archiving Collaborations to Save [More of] the Web
Building Web Archiving Collaborations to Save [More of] the WebBuilding Web Archiving Collaborations to Save [More of] the Web
Building Web Archiving Collaborations to Save [More of] the Web
Anna Perricci
 

Viewers also liked (8)

Dismantling Silos to Build Robust Shared Print Projects
Dismantling Silos to Build Robust Shared Print ProjectsDismantling Silos to Build Robust Shared Print Projects
Dismantling Silos to Build Robust Shared Print Projects
 
ACRL/NY 2013 poster: Assessment of the Effectiveness of the Human Rights Web ...
ACRL/NY 2013 poster: Assessment of the Effectiveness of the Human Rights Web ...ACRL/NY 2013 poster: Assessment of the Effectiveness of the Human Rights Web ...
ACRL/NY 2013 poster: Assessment of the Effectiveness of the Human Rights Web ...
 
Retention Modeling for the Eastern Academic Scholars' Trust (EAST)
Retention Modeling for the Eastern Academic Scholars' Trust (EAST)Retention Modeling for the Eastern Academic Scholars' Trust (EAST)
Retention Modeling for the Eastern Academic Scholars' Trust (EAST)
 
METRO Conference 2014: How collaboration can save [more of] the web: recent p...
METRO Conference 2014: How collaboration can save [more of] the web: recent p...METRO Conference 2014: How collaboration can save [more of] the web: recent p...
METRO Conference 2014: How collaboration can save [more of] the web: recent p...
 
Establishing and growing a multi-institutional web archiving collaboration f...
Establishing and growing a multi-institutional web archiving collaboration f...Establishing and growing a multi-institutional web archiving collaboration f...
Establishing and growing a multi-institutional web archiving collaboration f...
 
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...Best Practices Exchange 2013: How collaboration can save [more of] the web: r...
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...
 
SAA Web Archiving Roundtable Education Needs Assessment Survey Results
SAA Web Archiving Roundtable Education Needs Assessment Survey ResultsSAA Web Archiving Roundtable Education Needs Assessment Survey Results
SAA Web Archiving Roundtable Education Needs Assessment Survey Results
 
Building Web Archiving Collaborations to Save [More of] the Web
Building Web Archiving Collaborations to Save [More of] the WebBuilding Web Archiving Collaborations to Save [More of] the Web
Building Web Archiving Collaborations to Save [More of] the Web
 

Similar to Information sharing about Columbia University Library’s recent web archiving conference Web Archiving Collaboration: New Tools and Models

Human Scale Web Collecting for Individuals and Institutions (Webrecorder Work...
Human Scale Web Collecting for Individuals and Institutions (Webrecorder Work...Human Scale Web Collecting for Individuals and Institutions (Webrecorder Work...
Human Scale Web Collecting for Individuals and Institutions (Webrecorder Work...
Anna Perricci
 
Slides for Web Archiving in the Heritage and Archive Sectors
Slides for Web Archiving in the Heritage and Archive SectorsSlides for Web Archiving in the Heritage and Archive Sectors
Slides for Web Archiving in the Heritage and Archive Sectors
Anna Perricci
 
Farl web archiving
Farl web archivingFarl web archiving
Farl web archivingaerho
 
Introduction to Omeka
Introduction to OmekaIntroduction to Omeka
Introduction to Omeka
Shawn Day
 
Create great cncf user base from lessons learned from other open source com...
Create great cncf user base from   lessons learned from other open source com...Create great cncf user base from   lessons learned from other open source com...
Create great cncf user base from lessons learned from other open source com...
Krishna-Kumar
 
Spotlight on the Digital: increase discovery of your digital resources
Spotlight on the Digital: increase discovery of your digital resourcesSpotlight on the Digital: increase discovery of your digital resources
Spotlight on the Digital: increase discovery of your digital resources
PaolaMarchionni
 
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
Martin Klein
 
Resource sync overview and real-world use cases for discovery, harvesting, an...
Resource sync overview and real-world use cases for discovery, harvesting, an...Resource sync overview and real-world use cases for discovery, harvesting, an...
Resource sync overview and real-world use cases for discovery, harvesting, an...
openminted_eu
 
Grant apr20-8
Grant apr20-8Grant apr20-8
Open Tools for Open Publishing
Open Tools for Open PublishingOpen Tools for Open Publishing
Open Tools for Open Publishing
Clint Lalonde
 
Emtacl12, mlibraries12 conferences, 2012
Emtacl12, mlibraries12 conferences, 2012Emtacl12, mlibraries12 conferences, 2012
Emtacl12, mlibraries12 conferences, 2012
Kerryn Amery
 
Research Data Publishing
Research Data PublishingResearch Data Publishing
Research Data Publishing
Brian Hole
 
OpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation RepositoriesOpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE
 
The Commons and Digital Humanities
The Commons and Digital HumanitiesThe Commons and Digital Humanities
The Commons and Digital Humanities
christinadepaolo
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open science
Sarah Jones
 
Crossing the Streams: Trialing and Investigating New Format Resources
Crossing the Streams: Trialing and Investigating New Format ResourcesCrossing the Streams: Trialing and Investigating New Format Resources
Crossing the Streams: Trialing and Investigating New Format Resources
Lindsay Cronk
 
Charleston 2021 - Hit the ground running - Best practices for navigating cont...
Charleston 2021 - Hit the ground running - Best practices for navigating cont...Charleston 2021 - Hit the ground running - Best practices for navigating cont...
Charleston 2021 - Hit the ground running - Best practices for navigating cont...
Matthew Ragucci
 
CIL 2020 - Bringing Collections to the Screen
CIL 2020 - Bringing Collections to the ScreenCIL 2020 - Bringing Collections to the Screen
CIL 2020 - Bringing Collections to the Screen
Matthew Ragucci
 

Similar to Information sharing about Columbia University Library’s recent web archiving conference Web Archiving Collaboration: New Tools and Models (20)

Human Scale Web Collecting for Individuals and Institutions (Webrecorder Work...
Human Scale Web Collecting for Individuals and Institutions (Webrecorder Work...Human Scale Web Collecting for Individuals and Institutions (Webrecorder Work...
Human Scale Web Collecting for Individuals and Institutions (Webrecorder Work...
 
Slides for Web Archiving in the Heritage and Archive Sectors
Slides for Web Archiving in the Heritage and Archive SectorsSlides for Web Archiving in the Heritage and Archive Sectors
Slides for Web Archiving in the Heritage and Archive Sectors
 
Farl web archiving
Farl web archivingFarl web archiving
Farl web archiving
 
Introduction to Omeka
Introduction to OmekaIntroduction to Omeka
Introduction to Omeka
 
Create great cncf user base from lessons learned from other open source com...
Create great cncf user base from   lessons learned from other open source com...Create great cncf user base from   lessons learned from other open source com...
Create great cncf user base from lessons learned from other open source com...
 
Spotlight on the Digital: increase discovery of your digital resources
Spotlight on the Digital: increase discovery of your digital resourcesSpotlight on the Digital: increase discovery of your digital resources
Spotlight on the Digital: increase discovery of your digital resources
 
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
 
Resource sync overview and real-world use cases for discovery, harvesting, an...
Resource sync overview and real-world use cases for discovery, harvesting, an...Resource sync overview and real-world use cases for discovery, harvesting, an...
Resource sync overview and real-world use cases for discovery, harvesting, an...
 
Grant apr20-8
Grant apr20-8Grant apr20-8
Grant apr20-8
 
Information update march 2013.ppt
Information update march 2013.pptInformation update march 2013.ppt
Information update march 2013.ppt
 
Open Tools for Open Publishing
Open Tools for Open PublishingOpen Tools for Open Publishing
Open Tools for Open Publishing
 
Emtacl12, mlibraries12 conferences, 2012
Emtacl12, mlibraries12 conferences, 2012Emtacl12, mlibraries12 conferences, 2012
Emtacl12, mlibraries12 conferences, 2012
 
Personal learning environment
Personal learning environmentPersonal learning environment
Personal learning environment
 
Research Data Publishing
Research Data PublishingResearch Data Publishing
Research Data Publishing
 
OpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation RepositoriesOpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation Repositories
 
The Commons and Digital Humanities
The Commons and Digital HumanitiesThe Commons and Digital Humanities
The Commons and Digital Humanities
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open science
 
Crossing the Streams: Trialing and Investigating New Format Resources
Crossing the Streams: Trialing and Investigating New Format ResourcesCrossing the Streams: Trialing and Investigating New Format Resources
Crossing the Streams: Trialing and Investigating New Format Resources
 
Charleston 2021 - Hit the ground running - Best practices for navigating cont...
Charleston 2021 - Hit the ground running - Best practices for navigating cont...Charleston 2021 - Hit the ground running - Best practices for navigating cont...
Charleston 2021 - Hit the ground running - Best practices for navigating cont...
 
CIL 2020 - Bringing Collections to the Screen
CIL 2020 - Bringing Collections to the ScreenCIL 2020 - Bringing Collections to the Screen
CIL 2020 - Bringing Collections to the Screen
 

More from Anna Perricci

Introduction to Web Archiving
Introduction to Web ArchivingIntroduction to Web Archiving
Introduction to Web Archiving
Anna Perricci
 
DPC Web Archiving & Preservation Webinar #4: Outreach & Awareness Raising
DPC Web Archiving & Preservation Webinar #4: Outreach & Awareness Raising DPC Web Archiving & Preservation Webinar #4: Outreach & Awareness Raising
DPC Web Archiving & Preservation Webinar #4: Outreach & Awareness Raising
Anna Perricci
 
Ethics & Archiving the Web - presentation at ACH 2019 closing plenary
Ethics & Archiving the Web - presentation at ACH 2019 closing plenaryEthics & Archiving the Web - presentation at ACH 2019 closing plenary
Ethics & Archiving the Web - presentation at ACH 2019 closing plenary
Anna Perricci
 
No one said this would be easy: Sustaining Webrecorder as a robust web archiv...
No one said this would be easy: Sustaining Webrecorder as a robust web archiv...No one said this would be easy: Sustaining Webrecorder as a robust web archiv...
No one said this would be easy: Sustaining Webrecorder as a robust web archiv...
Anna Perricci
 
Archiving for Now and Later - workshop at Common Field Convening 2019
Archiving for Now and Later - workshop at Common Field Convening 2019Archiving for Now and Later - workshop at Common Field Convening 2019
Archiving for Now and Later - workshop at Common Field Convening 2019
Anna Perricci
 
Webrecorder: Web Archiving for All!
Webrecorder: Web Archiving for All!Webrecorder: Web Archiving for All!
Webrecorder: Web Archiving for All!
Anna Perricci
 
Archiver le web pour les artistes : Atelier Webrecorder
Archiver le web pour les artistes : Atelier WebrecorderArchiver le web pour les artistes : Atelier Webrecorder
Archiver le web pour les artistes : Atelier Webrecorder
Anna Perricci
 
Webrecorder: Building, Maintaining & Growing
Webrecorder: Building, Maintaining & GrowingWebrecorder: Building, Maintaining & Growing
Webrecorder: Building, Maintaining & Growing
Anna Perricci
 
Social Contexts of Web Archiving: Collaboration and Ethical Collection Building
Social Contexts of Web Archiving: Collaboration and  Ethical Collection BuildingSocial Contexts of Web Archiving: Collaboration and  Ethical Collection Building
Social Contexts of Web Archiving: Collaboration and Ethical Collection Building
Anna Perricci
 
Web Archiving Intro (circa 2015)
Web Archiving Intro (circa 2015)Web Archiving Intro (circa 2015)
Web Archiving Intro (circa 2015)
Anna Perricci
 
Webrecorder: Web Archiving for All!
Webrecorder: Web Archiving for All!Webrecorder: Web Archiving for All!
Webrecorder: Web Archiving for All!
Anna Perricci
 

Information sharing about Columbia University Library’s recent web archiving conference Web Archiving Collaboration: New Tools and Models

  • 1. Information sharing about Columbia University Library’s recent web archiving conference Web Archiving Collaboration: New Tools and Models Anna Perricci Columbia University Libraries 2015 Archive-It Partner Meeting August 18, 2015
  • 2. • This conference was part of a set of projects within a Mellon-funded project • About 10 slide decks on associated Mellon-supported web archiving collaborations are available on SlideShare: http://www.slideshare.net/annaperricci
  • 3. We had a conference (it was pretty awesome)
  • 4. Conference: why & how • This conference was a follow-up to the incentive awards program for improving web archiving tools • Goals: – Share results of funded projects and foster conversations about application & extensibility of work done – Provide information about other new collaborative models for web archiving • Space limitations - Day 1 and Day 2 – Videos: https://www.youtube.com/playlist?list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl • Co-organizers – Bob Wolven, Alex Thurman and Anna Perricci – Thanks also to many volunteers and those kindly conscripted
  • 6. We need radical collaborative models to do a better job preserving & making available born digital materials
  • 7. Incentive award winners: Preserving online law content https://www.youtube.com/watch?v=t_qZ4hNtmyw&index=2&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl
  • 9. Free Law Project & developments Free Law Project goals: • Put entirety of US Case Law online for the public for free • Develop web-based legal research tools • Support academic research on legal corpora • See CourtListener: https://www.courtlistener.com Results from grant • Expanded capacity of Juriscraper to harvest opinions on appellate court websites & involve wider community in scraping work – https://github.com/freelawproject/juriscraper – Over 200 web pages checked every weekday, harvest when changes detected – Federal circuit courts and Supreme Court; now covers all state courts of last resort & most intermediate appellate state courts • Capture oral arguments • Collaborative work with RECAP
  • 11. Perma.cc • Premise: https://perma.cc/about – Per http://dx.doi.org/10.2139/ssrn.2329161 “approximately 70% of all links in citations published between 1999 and 2011 no longer point to the same material. Broken links in journal articles undermine the citation-based system of legal scholarship by obscuring the evidence underlying authors’ ideas.” • “Any author can go to the Perma.cc website and input a URL. Perma.cc downloads the material at that URL and gives back a new URL (a “Perma.cc link”) that can then be inserted in a paper.” • There are some more steps to read about on their website, including some stats: https://perma.cc/stats Results: better APIs for greater extensibility of tools and wider use of technology in other settings / for other services
  • 13. New York Art Resources Consortium (NYARC) • NYARC is a consortium of museum libraries at the Frick Collection, the Museum of Modern Art and the Brooklyn Museum • With Mellon funding they are preserving and making available websites on themes ranging from artists’ websites to their own institutions’ websites to resources for restitution-related research • Cutting-edge work on access via a discovery layer is in progress • They are leaders in work on quality assurance • Excellent resource created by NDSR resident: http://wiki.nyarc.org/web-archiving/quality-assurance
  • 15. International Internet Preservation Consortium (IIPC) collaborative collections • Access working group of the IIPC is leading an initiative to collaboratively collect websites on topics of international interest • Complications and challenges arose particularly in determining collecting themes and given variations in laws governing intellectual property • Archive-It chosen as tool for creating collection • No spoiler alert: watch the video to see historic moment in IIPC’s collaborative collecting
  • 18. Lightning talks: Michael Lissner on RECAP
  • 19. Lightning talks: Jefferson Bailey on Archive-It Researcher Services
  • 20. Lightning talks: Dragan Espenschied on Rhizome’s web archiving work
  • 21. Lightning talks: Dan Chudnov on Social Feed Manager
  • 22. Lightning talks: Jack Cushman on Tools for Time Travel
  • 23. Incentive award winners: New platforms https://www.youtube.com/watch?v=6h8MohBSEtI&index=5&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl
  • 24. Warcbase: Building a scalable platform on HBase and Hadoop
  • 25. Warcbase Warcbase is an open-source platform for storing, managing, and analyzing web archives using current “big data” infrastructure and tools (e.g. HBase for storage, Hadoop for data analytics) • Further applications on ‘wimpy hardware’ (Raspberry Pi) also demonstrated for personal digital archiving • For more information on Ian’s work: http://ianmilligan.ca/ • For more information on Warcbase: https://github.com/lintool/warcbase
  • 26. Archiving transactions towards an uninterruptible web service Webmasters can benefit from web archiving!
  • 27. Archiving transactions towards an uninterruptible web service Goal: create new tools and leverage existing ones so that when a web resource is unavailable due to some interruption of a key service, an archived copy will be provided to an end user • Web archives can serve as a value-added collection to motivate web archiving as a tool for day-to-day IT operation For more information see: https://www.cs.vt.edu/node/7650
  • 28. Incentive award winners: New curation tools for web archivists https://www.youtube.com/watch?v=yeuk_vIOXcw&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl&index=6
  • 29. Tools for managing seed URIs
  • 30. Tools for managing seed URIs • Results: developed tool to enable curators to evaluate and detect when their web archives are off topic or discover new seed sites to include in collections • For more information see: https://github.com/yasmina85/offtopic-Detection
  • 32. Visualizing Digital Collections of Web Archives Results: developed tool for showing how a single web page changes over time For more information see: https://github.com/machawk1/ArchiveThumbnails, http://thumbnails.cs.odu.edu:15421, https://github.com/machawk1/ArchiveThumbnails/blob/master/CalendarResults.jsp
  • 33. Exploring a national collaborative model for web archiving https://www.youtube.com/watch?v=6f1Vj5cNAIU&index=7&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl
  • 34. Day 2: smaller group discussions • Capture beyond traditional crawling – subtopics: transactional archiving; direct ingest of publisher files; web recording • Collection development / Descriptive metadata & access – subtopics: collaborative collection development; institutional archives; metadata workflows; non-document collections • Tools/APIs: integration into systems and standardization – subtopics: development of tools/APIs; integration into vendor and local systems; goal of network of standardized components • Analysis of web archive datasets – subtopics: tools; researcher perspectives • Preservation of scholarly & legal record / Long-term preservation – subtopics: preservation of scholarly record; reference rot; long-term preservation
  • 35. It was a really busy and productive day but we only have one picture
  • 36. Check out the slides and videos and let us know what you think! • See the slides, watch the videos: https://library.columbia.edu/bts/web_resources_collection/Conferences/program.html Feedback welcome! Attendees of the conference are especially encouraged to keep us posted on resulting work
  • 37. Thanks! Anna Perricci Columbia University Libraries anna.perricci@gmail.com @AnnaPerricci Unless otherwise noted, all images in this presentation are from the CUWARC Flickr album: https://flic.kr/s/aHskfjd54s