SlideShare a Scribd company logo
1 of 37
Information sharing about Columbia University
Library’s recent web archiving conference
Web Archiving Collaboration:
New Tools and Models
Anna Perricci
Columbia University Libraries
2015 Archive-It Partner Meeting
August 18, 2015
• This conference was part of a set
of projects within a Mellon
funded project
• About 10 slide decks on associated
Mellon supported web archiving
collaborations are available on
SlideShare:
http://www.slideshare.net/annaperricci
We had a conference
(it was pretty awesome)
Conference: why & how
• This conference was a follow up to incentive awards program for
improving web archiving tools
• Goals:
– Share results of funded projects and foster conversations about
application & extensibility of work done
– Provide information about other new collaborative models for web
archiving
• Space limitations - Day 1 and Day 2
– Videos:
https://www.youtube.com/playlist?list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl
• Co-organizers
– Bob Wolven, Alex Thurman and Anna Perricci
– Thanks also to many volunteers and those kindly conscripted
Keynote address
https://www.youtube.com/watch?v=kr82McjQGyQ&index=
1&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl
We need radical collaborative models
to do a better job preserving & making
available born digital materials
Incentive award winners: Preserving online
law content
https://www.youtube.com/watch?v=t_qZ4hNtmyw&index=
2&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl
Free Law Project
Free Law Project & developments
Free Law Project goals:
• Put entirety of US Case Law
online for the public for free
• Develop web-based legal
research tools
• Support academic research on
legal corpora
• See CourtListener:
https://www.courtlistener.com
Results from grant
• Expanded capacity of Juriscraper to
harvest opinions on appellate court
websites & involve wider community in
scraping work
– https://github.com/freelawproject/juriscra
per
– Over 200 web pages checked every
weekday, harvest when changes detected
– Federal circuit courts and Supreme Court;
now covers all state courts of last resort &
most intermediate appellate state courts
• Capture oral arguments
• Collaborative work with RECAP
Perma.cc
Perma.cc
• Premise: https://perma.cc/about
– Per http://dx.doi.org/10.2139/ssrn.2329161
“approximately 70% of all links in citations
published between 1999 and 2011 no longer
point to the same material. Broken links in journal
articles undermine the citation-based system of
legal scholarship by obscuring the evidence
underlying authors’ ideas.”
• “Any author can go to the Perma.cc website and input
a URL. Perma.cc downloads the material at that URL
and gives back a new URL (a “Perma.cc link”) that can
then be inserted in a paper.”
• There are some more steps to read about on their
website, including some stats: https://perma.cc/stats
Results
Better APIs for
greater extensibility
of tools and wider
use of technology in
other settings / for
other services
New collaborative models
https://www.youtube.com/watch?v=ntAP064ZdBM&list=PL
f1Dab4lwQhBpFRB1dpUnKLglmM2iScjl&index=3
New York Art Resources Consortium (NYARC)
• NYARC is a consortium of museum libraries
at the Frick Collection, the Museum of
Modern Art and the Brooklyn Museum
• With Mellon funding they are preserving
and making available websites on themes
ranging from artists websites to their own
institutions’ websites to resources for
restitution related research
• Cutting edge work on access via a discovery
layer is in progress
• They are leaders in work on quality
assurance
• Excellent resource created by NDSR
resident: http://wiki.nyarc.org/web-
archiving/quality-assurance
Collaborative collecting with Ivy Plus
International Internet Preservation Consortium
(IIPC) collaborative collections
• Access working group of the IIPC is
leading an initiative to collaboratively
collect websites on topics of
international interest
• Complications and challenges arose
particularly in determining collecting
themes and given variations in laws
governing intellectual property
• Archive-It chosen as tool for creating
collection
• No spoiler alert: watch the video to
see historic moment in IIPC’s
collaborative collecting
Lunch
Image source: http://www.omgcatsinspace.com/post/38940531331
Lightning Talks
https://www.youtube.com/watch?v=jqrC0Ygdc-
M&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl
&index=4
Lightning talks: Michael Lissner on RECAP
Lightning talks: Jefferson Bailey
on Archive-It Researcher Services
Lightning talks: Dragan Espenschied on
Rhizome’s web archiving work
Lightning talks: Dan Chudnov
on Social Feed Manager
Lightning talks: Jack Cushman
on Tools for Time Travel
Incentive award winners: New platforms
https://www.youtube.com/watch?v=6h8MohBSEt
I&index=5&list=PLf1Dab4lwQhBpFRB1dpUnKLglm
M2iScjl
Warcbase: Building a scalable platform on
HBase and Hadoop
Warcbase
Warcbase is an open-source
platform for storing,
managing, and analyzing web
archives using current “big
data" infrastructure and tools
(e.g. HBase for storage,
Hadoop for data analytics)
-further applications on
‘wimpy hardware’ (Raspberry
Pi) also demonstrated for
personal digital archiving
For more information on Ian’s work:
http://ianmilligan.ca/
For more information on Warcbase:
https://github.com/lintool/warcbase
Archiving transactions towards an
uninterruptible web service
Webmasters can benefit from web archiving!
Archiving transactions towards an
uninterruptible web service
Goal: create and leverage existing tools so when a web resource
is unavailable due to some interruption of a key service an
archived copy will be provided to an end user
• Web archives can serve as a value-added collection to
motivate web archiving as a tool for day-to-day IT operation
For more information see: https://www.cs.vt.edu/node/7650
Incentive award winners: New curation tools
for web archivists
https://www.youtube.com/watch?v=yeuk_vIOXc
w&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl
&index=6
Tools for managing seed URIs
Tools for managing seed URIs
• Results: developed tool to enable curators to evaluate and
detect when their web archives are off topic or discover new
seed sites to include in collections
• For more information see:
https://github.com/yasmina85/offtopic-Detection
Visualizing Digital Collections of Web Archives
Visualizing Digital Collections of Web Archives
Results: developed tool for showing how a single web page
changes over time
For more information see:
https://github.com/machawk1/ArchiveThumbnails,
http://thumbnails.cs.odu.edu:15421,
https://github.com/machawk1/ArchiveThumbnails/blob/master/
CalendarResults.jsp
Exploring a national collaborative
model for web archiving
https://www.youtube.com/watch?v=6f1Vj5cNAIU&index=7&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl
Day 2: smaller group discussions
• Capture beyond traditional crawling
– subtopics: transactional archiving; direct ingest of publisher files; web
recording
• Collection development / Descriptive metadata & access
– subtopics: collaborative collection development; institutional archives;
metadata workflows; non-document collections
• Tools/APIs: integration into systems and standardization
– subtopics: development of tools/APIs; integration into vendor and local
systems; goal of network of standardized components
• Analysis of web archive datasets
– subtopics: tools; researcher perspectives
• Preservation of scholarly & legal record / Long-term preservation
– subtopics: preservation of scholarly record; reference rot; long-term
preservation
It was a really busy and productive day
but we only have one picture
Check out the slides and videos
and let us know what you think!
• See the slides, watch the
videos:
https://library.columbia.edu/bts/
web_resources_collection/Confe
rences/program.html
Feedback welcome!
Attendees of the
conference are especially
encouraged to keep us
posted on resulting work
Thanks!
Anna Perricci
Columbia University Libraries
anna.perricci@gmail.com
@AnnaPerricci
Unless otherwise noted, all images in this presentation are
from the CUWARC Flickr album: https://flic.kr/s/aHskfjd54s

More Related Content

What's hot

Web archiving challenges and opportunities
Web archiving challenges and opportunitiesWeb archiving challenges and opportunities
Web archiving challenges and opportunitiesAhmed AlSum
 
SAA 2014 session 703
SAA 2014 session 703SAA 2014 session 703
SAA 2014 session 703rosalielack
 
[[edit]] this GLAM
[[edit]] this GLAM[[edit]] this GLAM
[[edit]] this GLAMwittylama
 
OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosOCLC
 
High and Lows of Library Linked Data
High and Lows of Library Linked DataHigh and Lows of Library Linked Data
High and Lows of Library Linked DataAdrian Stevenson
 
UBC Library Web Archiving 2016
UBC Library Web Archiving 2016UBC Library Web Archiving 2016
UBC Library Web Archiving 2016Larissa Ringham
 
Digital Visitors and Residents: Project Feedback
Digital Visitors and Residents: Project FeedbackDigital Visitors and Residents: Project Feedback
Digital Visitors and Residents: Project Feedbackjisc-elearning
 
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI PresentationOpen Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentationekansa
 
Keeping the Broadcast Historic Record: An Archive of Public Media in the Making
Keeping the Broadcast Historic Record: An Archive of Public Media in the MakingKeeping the Broadcast Historic Record: An Archive of Public Media in the Making
Keeping the Broadcast Historic Record: An Archive of Public Media in the MakingWGBH Media Library and Archives
 
Boundless Opportunity
Boundless OpportunityBoundless Opportunity
Boundless OpportunityRachel Frick
 
IASSIT Kansa Presentation
IASSIT Kansa PresentationIASSIT Kansa Presentation
IASSIT Kansa Presentationekansa
 
Report on the Rethinking Resource Sharing Initiative
Report on the Rethinking Resource Sharing InitiativeReport on the Rethinking Resource Sharing Initiative
Report on the Rethinking Resource Sharing Initiativekramsey
 
Library as Place, Place as Library: Duality and the Power of Cooperation
Library as Place, Place as Library: Duality and the Power of CooperationLibrary as Place, Place as Library: Duality and the Power of Cooperation
Library as Place, Place as Library: Duality and the Power of CooperationKaren S Calhoun
 
Tools for Managing the Past Web
Tools for Managing the Past WebTools for Managing the Past Web
Tools for Managing the Past WebMichele Weigle
 
OUR space: the new world of metadata
OUR space: the new world of metadataOUR space: the new world of metadata
OUR space: the new world of metadataKaren S Calhoun
 
6-4-13 VIVO Case Studies Presentation Slides
6-4-13 VIVO Case Studies Presentation Slides6-4-13 VIVO Case Studies Presentation Slides
6-4-13 VIVO Case Studies Presentation SlidesDuraSpace
 

What's hot (20)

Web archiving challenges and opportunities
Web archiving challenges and opportunitiesWeb archiving challenges and opportunities
Web archiving challenges and opportunities
 
SAA 2014 session 703
SAA 2014 session 703SAA 2014 session 703
SAA 2014 session 703
 
[[edit]] this GLAM
[[edit]] this GLAM[[edit]] this GLAM
[[edit]] this GLAM
 
OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
 
High and Lows of Library Linked Data
High and Lows of Library Linked DataHigh and Lows of Library Linked Data
High and Lows of Library Linked Data
 
November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...
November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...
November 19, 2014 NISO Virtual Conference: Can't We All Work Together?: Inter...
 
UBC Library Web Archiving 2016
UBC Library Web Archiving 2016UBC Library Web Archiving 2016
UBC Library Web Archiving 2016
 
Digital Visitors and Residents: Project Feedback
Digital Visitors and Residents: Project FeedbackDigital Visitors and Residents: Project Feedback
Digital Visitors and Residents: Project Feedback
 
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI PresentationOpen Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
 
Keeping the Broadcast Historic Record: An Archive of Public Media in the Making
Keeping the Broadcast Historic Record: An Archive of Public Media in the MakingKeeping the Broadcast Historic Record: An Archive of Public Media in the Making
Keeping the Broadcast Historic Record: An Archive of Public Media in the Making
 
An A+ Plan to Transform Your Library with Linked Data
An A+ Plan to Transform Your Library with Linked DataAn A+ Plan to Transform Your Library with Linked Data
An A+ Plan to Transform Your Library with Linked Data
 
Boundless Opportunity
Boundless OpportunityBoundless Opportunity
Boundless Opportunity
 
IASSIT Kansa Presentation
IASSIT Kansa PresentationIASSIT Kansa Presentation
IASSIT Kansa Presentation
 
Report on the Rethinking Resource Sharing Initiative
Report on the Rethinking Resource Sharing InitiativeReport on the Rethinking Resource Sharing Initiative
Report on the Rethinking Resource Sharing Initiative
 
Library as Place, Place as Library: Duality and the Power of Cooperation
Library as Place, Place as Library: Duality and the Power of CooperationLibrary as Place, Place as Library: Duality and the Power of Cooperation
Library as Place, Place as Library: Duality and the Power of Cooperation
 
Tools for Managing the Past Web
Tools for Managing the Past WebTools for Managing the Past Web
Tools for Managing the Past Web
 
OUR space: the new world of metadata
OUR space: the new world of metadataOUR space: the new world of metadata
OUR space: the new world of metadata
 
6-4-13 VIVO Case Studies Presentation Slides
6-4-13 VIVO Case Studies Presentation Slides6-4-13 VIVO Case Studies Presentation Slides
6-4-13 VIVO Case Studies Presentation Slides
 
To the Rescue of Scholarly Orphans
To the Rescue of Scholarly OrphansTo the Rescue of Scholarly Orphans
To the Rescue of Scholarly Orphans
 

Viewers also liked

Dismantling Silos to Build Robust Shared Print Projects
Dismantling Silos to Build Robust Shared Print ProjectsDismantling Silos to Build Robust Shared Print Projects
Dismantling Silos to Build Robust Shared Print ProjectsAnna Perricci
 
ACRL/NY 2013 poster: Assessment of the Effectiveness of the Human Rights Web ...
ACRL/NY 2013 poster: Assessment of the Effectiveness of the Human Rights Web ...ACRL/NY 2013 poster: Assessment of the Effectiveness of the Human Rights Web ...
ACRL/NY 2013 poster: Assessment of the Effectiveness of the Human Rights Web ...Anna Perricci
 
Retention Modeling for the Eastern Academic Scholars' Trust (EAST)
Retention Modeling for the Eastern Academic Scholars' Trust (EAST)Retention Modeling for the Eastern Academic Scholars' Trust (EAST)
Retention Modeling for the Eastern Academic Scholars' Trust (EAST)Anna Perricci
 
METRO Conference 2014: How collaboration can save [more of] the web: recent p...
METRO Conference 2014: How collaboration can save [more of] the web: recent p...METRO Conference 2014: How collaboration can save [more of] the web: recent p...
METRO Conference 2014: How collaboration can save [more of] the web: recent p...Anna Perricci
 
Establishing and growing a multi-institutional web archiving collaboration f...
Establishing and growing a multi-institutional web archiving collaboration f...Establishing and growing a multi-institutional web archiving collaboration f...
Establishing and growing a multi-institutional web archiving collaboration f...Anna Perricci
 
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...Best Practices Exchange 2013: How collaboration can save [more of] the web: r...
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...Anna Perricci
 
SAA Web Archiving Roundtable Education Needs Assessment Survey Results
SAA Web Archiving Roundtable Education Needs Assessment Survey ResultsSAA Web Archiving Roundtable Education Needs Assessment Survey Results
SAA Web Archiving Roundtable Education Needs Assessment Survey ResultsAnna Perricci
 
Building Web Archiving Collaborations to Save [More of] the Web
Building Web Archiving Collaborations to Save [More of] the WebBuilding Web Archiving Collaborations to Save [More of] the Web
Building Web Archiving Collaborations to Save [More of] the WebAnna Perricci
 

Viewers also liked (8)

Dismantling Silos to Build Robust Shared Print Projects
Dismantling Silos to Build Robust Shared Print ProjectsDismantling Silos to Build Robust Shared Print Projects
Dismantling Silos to Build Robust Shared Print Projects
 
ACRL/NY 2013 poster: Assessment of the Effectiveness of the Human Rights Web ...
ACRL/NY 2013 poster: Assessment of the Effectiveness of the Human Rights Web ...ACRL/NY 2013 poster: Assessment of the Effectiveness of the Human Rights Web ...
ACRL/NY 2013 poster: Assessment of the Effectiveness of the Human Rights Web ...
 
Retention Modeling for the Eastern Academic Scholars' Trust (EAST)
Retention Modeling for the Eastern Academic Scholars' Trust (EAST)Retention Modeling for the Eastern Academic Scholars' Trust (EAST)
Retention Modeling for the Eastern Academic Scholars' Trust (EAST)
 
METRO Conference 2014: How collaboration can save [more of] the web: recent p...
METRO Conference 2014: How collaboration can save [more of] the web: recent p...METRO Conference 2014: How collaboration can save [more of] the web: recent p...
METRO Conference 2014: How collaboration can save [more of] the web: recent p...
 
Establishing and growing a multi-institutional web archiving collaboration f...
Establishing and growing a multi-institutional web archiving collaboration f...Establishing and growing a multi-institutional web archiving collaboration f...
Establishing and growing a multi-institutional web archiving collaboration f...
 
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...Best Practices Exchange 2013: How collaboration can save [more of] the web: r...
Best Practices Exchange 2013: How collaboration can save [more of] the web: r...
 
SAA Web Archiving Roundtable Education Needs Assessment Survey Results
SAA Web Archiving Roundtable Education Needs Assessment Survey ResultsSAA Web Archiving Roundtable Education Needs Assessment Survey Results
SAA Web Archiving Roundtable Education Needs Assessment Survey Results
 
Building Web Archiving Collaborations to Save [More of] the Web
Building Web Archiving Collaborations to Save [More of] the WebBuilding Web Archiving Collaborations to Save [More of] the Web
Building Web Archiving Collaborations to Save [More of] the Web
 

Similar to Columbia University Library Web Archiving Conference

Human Scale Web Collecting for Individuals and Institutions (Webrecorder Work...
Human Scale Web Collecting for Individuals and Institutions (Webrecorder Work...Human Scale Web Collecting for Individuals and Institutions (Webrecorder Work...
Human Scale Web Collecting for Individuals and Institutions (Webrecorder Work...Anna Perricci
 
Slides for Web Archiving in the Heritage and Archive Sectors
Slides for Web Archiving in the Heritage and Archive SectorsSlides for Web Archiving in the Heritage and Archive Sectors
Slides for Web Archiving in the Heritage and Archive SectorsAnna Perricci
 
Farl web archiving
Farl web archivingFarl web archiving
Farl web archivingaerho
 
Introduction to Omeka
Introduction to OmekaIntroduction to Omeka
Introduction to OmekaShawn Day
 
Create great cncf user base from lessons learned from other open source com...
Create great cncf user base from   lessons learned from other open source com...Create great cncf user base from   lessons learned from other open source com...
Create great cncf user base from lessons learned from other open source com...Krishna-Kumar
 
Spotlight on the Digital: increase discovery of your digital resources
Spotlight on the Digital: increase discovery of your digital resourcesSpotlight on the Digital: increase discovery of your digital resources
Spotlight on the Digital: increase discovery of your digital resourcesPaolaMarchionni
 
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...Martin Klein
 
Resource sync overview and real-world use cases for discovery, harvesting, an...
Resource sync overview and real-world use cases for discovery, harvesting, an...Resource sync overview and real-world use cases for discovery, harvesting, an...
Resource sync overview and real-world use cases for discovery, harvesting, an...openminted_eu
 
Open Tools for Open Publishing
Open Tools for Open PublishingOpen Tools for Open Publishing
Open Tools for Open PublishingClint Lalonde
 
Emtacl12, mlibraries12 conferences, 2012
Emtacl12, mlibraries12 conferences, 2012Emtacl12, mlibraries12 conferences, 2012
Emtacl12, mlibraries12 conferences, 2012Kerryn Amery
 
Research Data Publishing
Research Data PublishingResearch Data Publishing
Research Data PublishingBrian Hole
 
OpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation RepositoriesOpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation RepositoriesOpenAIRE
 
The Commons and Digital Humanities
The Commons and Digital HumanitiesThe Commons and Digital Humanities
The Commons and Digital Humanitieschristinadepaolo
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open scienceSarah Jones
 
Crossing the Streams: Trialing and Investigating New Format Resources
Crossing the Streams: Trialing and Investigating New Format ResourcesCrossing the Streams: Trialing and Investigating New Format Resources
Crossing the Streams: Trialing and Investigating New Format ResourcesLindsay Cronk
 
Charleston 2021 - Hit the ground running - Best practices for navigating cont...
Charleston 2021 - Hit the ground running - Best practices for navigating cont...Charleston 2021 - Hit the ground running - Best practices for navigating cont...
Charleston 2021 - Hit the ground running - Best practices for navigating cont...Matthew Ragucci
 
CIL 2020 - Bringing Collections to the Screen
CIL 2020 - Bringing Collections to the ScreenCIL 2020 - Bringing Collections to the Screen
CIL 2020 - Bringing Collections to the ScreenMatthew Ragucci
 

Similar to Columbia University Library Web Archiving Conference (20)

Human Scale Web Collecting for Individuals and Institutions (Webrecorder Work...
Human Scale Web Collecting for Individuals and Institutions (Webrecorder Work...Human Scale Web Collecting for Individuals and Institutions (Webrecorder Work...
Human Scale Web Collecting for Individuals and Institutions (Webrecorder Work...
 
Slides for Web Archiving in the Heritage and Archive Sectors
Slides for Web Archiving in the Heritage and Archive SectorsSlides for Web Archiving in the Heritage and Archive Sectors
Slides for Web Archiving in the Heritage and Archive Sectors
 
Farl web archiving
Farl web archivingFarl web archiving
Farl web archiving
 
Introduction to Omeka
Introduction to OmekaIntroduction to Omeka
Introduction to Omeka
 
Create great cncf user base from lessons learned from other open source com...
Create great cncf user base from   lessons learned from other open source com...Create great cncf user base from   lessons learned from other open source com...
Create great cncf user base from lessons learned from other open source com...
 
Spotlight on the Digital: increase discovery of your digital resources
Spotlight on the Digital: increase discovery of your digital resourcesSpotlight on the Digital: increase discovery of your digital resources
Spotlight on the Digital: increase discovery of your digital resources
 
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
ResourceSync - Overview and Real-World Use Cases for Discovery, Harvesting, a...
 
Resource sync overview and real-world use cases for discovery, harvesting, an...
Resource sync overview and real-world use cases for discovery, harvesting, an...Resource sync overview and real-world use cases for discovery, harvesting, an...
Resource sync overview and real-world use cases for discovery, harvesting, an...
 
Grant apr20-8
Grant apr20-8Grant apr20-8
Grant apr20-8
 
Information update march 2013.ppt
Information update march 2013.pptInformation update march 2013.ppt
Information update march 2013.ppt
 
Open Tools for Open Publishing
Open Tools for Open PublishingOpen Tools for Open Publishing
Open Tools for Open Publishing
 
Emtacl12, mlibraries12 conferences, 2012
Emtacl12, mlibraries12 conferences, 2012Emtacl12, mlibraries12 conferences, 2012
Emtacl12, mlibraries12 conferences, 2012
 
Personal learning environment
Personal learning environmentPersonal learning environment
Personal learning environment
 
Research Data Publishing
Research Data PublishingResearch Data Publishing
Research Data Publishing
 
OpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation RepositoriesOpenAIRE Open Innovation call: Next Generation Repositories
OpenAIRE Open Innovation call: Next Generation Repositories
 
The Commons and Digital Humanities
The Commons and Digital HumanitiesThe Commons and Digital Humanities
The Commons and Digital Humanities
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open science
 
Crossing the Streams: Trialing and Investigating New Format Resources
Crossing the Streams: Trialing and Investigating New Format ResourcesCrossing the Streams: Trialing and Investigating New Format Resources
Crossing the Streams: Trialing and Investigating New Format Resources
 
Charleston 2021 - Hit the ground running - Best practices for navigating cont...
Charleston 2021 - Hit the ground running - Best practices for navigating cont...Charleston 2021 - Hit the ground running - Best practices for navigating cont...
Charleston 2021 - Hit the ground running - Best practices for navigating cont...
 
CIL 2020 - Bringing Collections to the Screen
CIL 2020 - Bringing Collections to the ScreenCIL 2020 - Bringing Collections to the Screen
CIL 2020 - Bringing Collections to the Screen
 

More from Anna Perricci

Introduction to Web Archiving
Introduction to Web ArchivingIntroduction to Web Archiving
Introduction to Web ArchivingAnna Perricci
 
DPC Web Archiving & Preservation Webinar #4: Outreach & Awareness Raising
DPC Web Archiving & Preservation Webinar #4: Outreach & Awareness Raising DPC Web Archiving & Preservation Webinar #4: Outreach & Awareness Raising
DPC Web Archiving & Preservation Webinar #4: Outreach & Awareness Raising Anna Perricci
 
Ethics & Archiving the Web - presentation at ACH 2019 closing plenary
Ethics & Archiving the Web - presentation at ACH 2019 closing plenaryEthics & Archiving the Web - presentation at ACH 2019 closing plenary
Ethics & Archiving the Web - presentation at ACH 2019 closing plenaryAnna Perricci
 
No one said this would be easy: Sustaining Webrecorder as a robust web archiv...
No one said this would be easy: Sustaining Webrecorder as a robust web archiv...No one said this would be easy: Sustaining Webrecorder as a robust web archiv...
No one said this would be easy: Sustaining Webrecorder as a robust web archiv...Anna Perricci
 
Archiving for Now and Later - workshop at Common Field Convening 2019
Archiving for Now and Later - workshop at Common Field Convening 2019Archiving for Now and Later - workshop at Common Field Convening 2019
Archiving for Now and Later - workshop at Common Field Convening 2019Anna Perricci
 
Webrecorder: Web Archiving for All!
Webrecorder: Web Archiving for All!Webrecorder: Web Archiving for All!
Webrecorder: Web Archiving for All!Anna Perricci
 
Archiver le web pour les artistes : Atelier Webrecorder
Archiver le web pour les artistes : Atelier WebrecorderArchiver le web pour les artistes : Atelier Webrecorder
Archiver le web pour les artistes : Atelier WebrecorderAnna Perricci
 
Webrecorder: Building, Maintaining & Growing
Webrecorder: Building, Maintaining & GrowingWebrecorder: Building, Maintaining & Growing
Webrecorder: Building, Maintaining & GrowingAnna Perricci
 
Social Contexts of Web Archiving: Collaboration and Ethical Collection Building
Social Contexts of Web Archiving: Collaboration and  Ethical Collection BuildingSocial Contexts of Web Archiving: Collaboration and  Ethical Collection Building
Social Contexts of Web Archiving: Collaboration and Ethical Collection BuildingAnna Perricci
 
Web Archiving Intro (circa 2015)
Web Archiving Intro (circa 2015)Web Archiving Intro (circa 2015)
Web Archiving Intro (circa 2015)Anna Perricci
 
Webrecorder: Web Archiving for All!
Webrecorder: Web Archiving for All!Webrecorder: Web Archiving for All!
Webrecorder: Web Archiving for All!Anna Perricci
 

More from Anna Perricci (11)

Introduction to Web Archiving
Introduction to Web ArchivingIntroduction to Web Archiving
Introduction to Web Archiving
 
DPC Web Archiving & Preservation Webinar #4: Outreach & Awareness Raising
DPC Web Archiving & Preservation Webinar #4: Outreach & Awareness Raising DPC Web Archiving & Preservation Webinar #4: Outreach & Awareness Raising
DPC Web Archiving & Preservation Webinar #4: Outreach & Awareness Raising
 
Ethics & Archiving the Web - presentation at ACH 2019 closing plenary
Ethics & Archiving the Web - presentation at ACH 2019 closing plenaryEthics & Archiving the Web - presentation at ACH 2019 closing plenary
Ethics & Archiving the Web - presentation at ACH 2019 closing plenary
 
No one said this would be easy: Sustaining Webrecorder as a robust web archiv...
No one said this would be easy: Sustaining Webrecorder as a robust web archiv...No one said this would be easy: Sustaining Webrecorder as a robust web archiv...
No one said this would be easy: Sustaining Webrecorder as a robust web archiv...
 
Archiving for Now and Later - workshop at Common Field Convening 2019
Archiving for Now and Later - workshop at Common Field Convening 2019Archiving for Now and Later - workshop at Common Field Convening 2019
Archiving for Now and Later - workshop at Common Field Convening 2019
 
Webrecorder: Web Archiving for All!
Webrecorder: Web Archiving for All!Webrecorder: Web Archiving for All!
Webrecorder: Web Archiving for All!
 
Archiver le web pour les artistes : Atelier Webrecorder
Archiver le web pour les artistes : Atelier WebrecorderArchiver le web pour les artistes : Atelier Webrecorder
Archiver le web pour les artistes : Atelier Webrecorder
 
Webrecorder: Building, Maintaining & Growing
Webrecorder: Building, Maintaining & GrowingWebrecorder: Building, Maintaining & Growing
Webrecorder: Building, Maintaining & Growing
 
Social Contexts of Web Archiving: Collaboration and Ethical Collection Building
Social Contexts of Web Archiving: Collaboration and  Ethical Collection BuildingSocial Contexts of Web Archiving: Collaboration and  Ethical Collection Building
Social Contexts of Web Archiving: Collaboration and Ethical Collection Building
 
Web Archiving Intro (circa 2015)
Web Archiving Intro (circa 2015)Web Archiving Intro (circa 2015)
Web Archiving Intro (circa 2015)
 
Webrecorder: Web Archiving for All!
Webrecorder: Web Archiving for All!Webrecorder: Web Archiving for All!
Webrecorder: Web Archiving for All!
 

Recently uploaded

Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)cama23
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxPoojaSen20
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
FILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipinoFILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipinojohnmickonozaleda
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Recently uploaded (20)

Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
FILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipinoFILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipino
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 

Columbia University Library Web Archiving Conference

  • 1. Information sharing about Columbia University Library’s recent web archiving conference Web Archiving Collaboration: New Tools and Models Anna Perricci Columbia University Libraries 2015 Archive-It Partner Meeting August 18, 2015
  • 2. • This conference was part of a set of projects within a Mellon funded project • About 10 slide decks on associated Mellon supported web archiving collaborations are available on SlideShare: http://www.slideshare.net/annaperricci
  • 3. We had a conference (it was pretty awesome)
  • 4. Conference: why & how • This conference was a follow up to incentive awards program for improving web archiving tools • Goals: – Share results of funded projects and foster conversations about application & extensibility of work done – Provide information about other new collaborative models for web archiving • Space limitations - Day 1 and Day 2 – Videos: https://www.youtube.com/playlist?list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl • Co-organizers – Bob Wolven, Alex Thurman and Anna Perricci – Thanks also to many volunteers and those kindly conscripted
  • 6. We need radical collaborative models to do a better job preserving & making available born digital materials
  • 7. Incentive award winners: Preserving online law content https://www.youtube.com/watch?v=t_qZ4hNtmyw&index= 2&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl
  • 9. Free Law Project & developments Free Law Project goals: • Put entirety of US Case Law online for the public for free • Develop web-based legal research tools • Support academic research on legal corpora • See CourtListener: https://www.courtlistener.com Results from grant • Expanded capacity of Juriscraper to harvest opinions on appellate court websites & involve wider community in scraping work – https://github.com/freelawproject/juriscra per – Over 200 web pages checked every weekday, harvest when changes detected – Federal circuit courts and Supreme Court; now covers all state courts of last resort & most intermediate appellate state courts • Capture oral arguments • Collaborative work with RECAP
  • 11. Perma.cc • Premise: https://perma.cc/about – Per http://dx.doi.org/10.2139/ssrn.2329161 “approximately 70% of all links in citations published between 1999 and 2011 no longer point to the same material. Broken links in journal articles undermine the citation-based system of legal scholarship by obscuring the evidence underlying authors’ ideas.” • “Any author can go to the Perma.cc website and input a URL. Perma.cc downloads the material at that URL and gives back a new URL (a “Perma.cc link”) that can then be inserted in a paper.” • There are some more steps to read about on their website, including some stats: https://perma.cc/stats Results Better APIs for greater extensibility of tools and wider use of technology in other settings / for other services
  • 13. New York Art Resources Consortium (NYARC) • NYARC is a consortium of museum libraries at the Frick Collection, the Museum of Modern Art and the Brooklyn Museum • With Mellon funding they are preserving and making available websites on themes ranging from artists websites to their own institutions’ websites to resources for restitution related research • Cutting edge work on access via a discovery layer is in progress • They are leaders in work on quality assurance • Excellent resource created by NDSR resident: http://wiki.nyarc.org/web- archiving/quality-assurance
  • 15. International Internet Preservation Consortium (IIPC) collaborative collections • Access working group of the IIPC is leading an initiative to collaboratively collect websites on topics of international interest • Complications and challenges arose particularly in determining collecting themes and given variations in laws governing intellectual property • Archive-It chosen as tool for creating collection • No spoiler alert: watch the video to see historic moment in IIPC’s collaborative collecting
  • 18. Lightning talks: Michael Lissner on RECAP
  • 19. Lightning talks: Jefferson Bailey on Archive-It Researcher Services
  • 20. Lightning talks: Dragan Espenschied on Rhizome’s web archiving work
  • 21. Lightning talks: Dan Chudnov on Social Feed Manager
  • 22. Lightning talks: Jack Cushman on Tools for Time Travel
  • 23. Incentive award winners: New platforms https://www.youtube.com/watch?v=6h8MohBSEt I&index=5&list=PLf1Dab4lwQhBpFRB1dpUnKLglm M2iScjl
  • 24. Warcbase: Building a scalable platform on HBase and Hadoop
  • 25. Warcbase Warcbase is an open-source platform for storing, managing, and analyzing web archives using current “big data" infrastructure and tools (e.g. HBase for storage, Hadoop for data analytics) -further applications on ‘wimpy hardware’ (Raspberry Pi) also demonstrated for personal digital archiving For more information on Ian’s work: http://ianmilligan.ca/ For more information on Warcbase: https://github.com/lintool/warcbase
  • 26. Archiving transactions towards an uninterruptible web service Webmasters can benefit from web archiving!
  • 27. Archiving transactions towards an uninterruptible web service Goal: create and leverage existing tools so when a web resource is unavailable due to some interruption of a key service an archived copy will be provided to an end user • Web archives can serve as a value-added collection to motivate web archiving as a tool for day-to-day IT operation For more information see: https://www.cs.vt.edu/node/7650
  • 28. Incentive award winners: New curation tools for web archivists https://www.youtube.com/watch?v=yeuk_vIOXc w&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl &index=6
  • 29. Tools for managing seed URIs
  • 30. Tools for managing seed URIs • Results: developed tool to enable curators to evaluate and detect when their web archives are off topic or discover new seed sites to include in collections • For more information see: https://github.com/yasmina85/offtopic-Detection
  • 32. Visualizing Digital Collections of Web Archives Results: developed tool for showing how a single web page changes over time For more information see: https://github.com/machawk1/ArchiveThumbnails, http://thumbnails.cs.odu.edu:15421, https://github.com/machawk1/ArchiveThumbnails/blob/master/ CalendarResults.jsp
  • 33. Exploring a national collaborative model for web archiving https://www.youtube.com/watch?v=6f1Vj5cNAIU&index=7&list=PLf1Dab4lwQhBpFRB1dpUnKLglmM2iScjl
  • 34. Day 2: smaller group discussions • Capture beyond traditional crawling – subtopics: transactional archiving; direct ingest of publisher files; web recording • Collection development / Descriptive metadata & access – subtopics: collaborative collection development; institutional archives; metadata workflows; non-document collections • Tools/APIs: integration into systems and standardization – subtopics: development of tools/APIs; integration into vendor and local systems; goal of network of standardized components • Analysis of web archive datasets – subtopics: tools; researcher perspectives • Preservation of scholarly & legal record / Long-term preservation – subtopics: preservation of scholarly record; reference rot; long-term preservation
  • 35. It was a really busy and productive day but we only have one picture
  • 36. Check out the slides and videos and let us know what you think! • See the slides, watch the videos: https://library.columbia.edu/bts/ web_resources_collection/Confe rences/program.html Feedback welcome! Attendees of the conference are especially encouraged to keep us posted on resulting work
  • 37. Thanks! Anna Perricci Columbia University Libraries anna.perricci@gmail.com @AnnaPerricci Unless otherwise noted, all images in this presentation are from the CUWARC Flickr album: https://flic.kr/s/aHskfjd54s