Client-Assisted Memento Aggregation
Using the Prefer Header
Mat Kelly, Sawood Alam, Michael L. Nelson, and Michele C. Weigle
Old Dominion University
Web Science & Digital Libraries Research Group
{mkelly, salam, mln, mweigle}@cs.odu.edu
@machawk1 • @WebSciDL
Web Archiving and Digital Libraries (WADL) Workshop
June 6, 2018, Fort Worth, TX
@machawk1
Client-Assisted Memento Aggregation Using the Prefer Header
WADL 2018 • June 6, 2018 • Fort Worth, TX
Proliferation of Personal Web Archives
@machawk1
A Framework for Aggregating Private and Public Web Archives
JCDL 2018 • June 5, 2018 • Fort Worth, TX
Today’s Memento Aggregation
Archives Queried (A0)
Motivation
Archives Queried (A0)
> Include personal archives
> Include other non-aggregated archives
State of Aggregators’ Capabilities
● Mementoweb aggregator
○ Cannot customize set of archives aggregated
○ Open source? Unavailable for individuals’ deployment
● MemGator
○ Open source ✔ https://github.com/oduwsdl/MemGator
○ Requires static set of archives on launch
○ Still specified by server, clients have no say
● With each, the set of archives is determined on the “server”.
● Neither allows client to specify set of archives aggregated.
HTTP Prefer
● RFC 7240 (June 2014)
● CLIENT requests with HTTP Header:
○ Prefer: foo; bar=""
● SERVER may respond with HTTP Header:
○ Preference-Applied: foo
OUR APPROACH:
Prefer: archives="data:application/json;charset=utf-8;base64,Ww0KIC7...NCn0="
Prefer + Memento
● S. Jones, H. Van de Sompel, et al. “Mementos in the Raw” 1
○ Prefer: original-content, original-links, original-headers
○ Mitigate replay system rewriting, make “raw” information more accessible
● D.S.H. Rosenthal “Content negotiation and Memento” 2
○ none, screenshot, altered-dom, url-rewritten, banner-inserted
○ Additional focus on derived representations
1 http://ws-dl.blogspot.com/2016/08/2016-08-15-mementos-in-raw-take-two.html
2 https://blog.dshr.org/2016/08/content-negotiation-and-memento.html
A More Capable, Transparent Aggregator
Memento Meta-Aggregator (MMA) 1
● Additional responsibilities beyond aggregation
● Provide hierarchical querying model to other aggregators
● Advanced querying models like Precedence and Short-Circuiting
● Systematic interaction and aggregation with private and personal web archives
1 Kelly et al. “A Framework for Aggregating Private and Public Web Archives”, JCDL 2018
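The slide names precedence and short-circuiting without defining them. One plausible reading, sketched below as an assumption rather than as MMA's documented behavior, is that archives are queried in a caller-chosen order (precedence) and that querying stops at the first archive returning any captures (short-circuiting). The function names, the `fetch` callables, and the example URIs are all hypothetical stand-ins for real TimeMap requests.

```python
def query_with_short_circuit(archives_in_precedence_order):
    """Query archives in order and stop at the first one that has captures.

    Each item is an (archive_id, fetch) pair, where fetch() returns a list
    of memento URIs (empty if the archive holds nothing for the URI).
    Returns (archive_id, mementos), or (None, []) if every archive is empty.
    """
    for archive_id, fetch in archives_in_precedence_order:
        mementos = fetch()
        if mementos:
            return archive_id, mementos  # short-circuit: skip the rest
    return None, []


calls = []  # records which archives were actually contacted


def make_fetch(archive_id, result):
    """Build a fake fetch function that logs the call and returns result."""
    def fetch():
        calls.append(archive_id)
        return result
    return fetch


# Placeholder archives: the personal archive is asked first and is empty.
ordered = [
    ("alice", make_fetch("alice", [])),
    ("ia", make_fetch("ia", ["http://archive.example/ia/1"])),
    ("other", make_fetch("other", ["http://archive.example/other/1"])),
]
winner, found = query_with_short_circuit(ordered)
print(winner, found, calls)
```

In this reading the third archive is never contacted, which is the saving short-circuiting would buy at the cost of a less complete result.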
Bob Prefers to Exclude IA Captures
Bob Requests Supported Archives
● GET /archives
Bob Customizes the Set in the JSON
Bob Requests CNN for His Custom Set
● base64 encoded JSON transmitted
MMA Complies or Ignores Preference
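Bob's steps above can be sketched in Python. This is a minimal sketch, not the authors' client: the `supported` list stands in for whatever `GET /archives` returns (its field names follow the archives.json example in these slides), and `exclude_archives` and `build_prefer_value` are hypothetical helper names.

```python
import base64
import json


def exclude_archives(archives, unwanted_ids):
    """Return the archive entries whose "id" is not in unwanted_ids."""
    return [a for a in archives if a["id"] not in unwanted_ids]


def build_prefer_value(archives):
    """Encode an archive list as the value of a Prefer request header."""
    payload = json.dumps(archives).encode("utf-8")
    encoded = base64.b64encode(payload).decode("ascii")
    return f'archives="data:application/json;charset=utf-8;base64,{encoded}"'


# Stand-in for the JSON an aggregator might return from GET /archives.
supported = [
    {"id": "ia", "name": "Internet Archive",
     "timemap": "http://web.archive.org/web/timemap/link/",
     "timegate": "http://web.archive.org/web/"},
    {"id": "alice", "name": "Alice's Captures",
     "timemap": "http://localhost:8081/timemap/",
     "timegate": "http://localhost:8081/timegate/"},
]

custom = exclude_archives(supported, {"ia"})  # Bob drops the IA entry
prefer_value = build_prefer_value(custom)
print("Prefer:", prefer_value)
```

The resulting string is what Bob would send as the `Prefer` header on his TimeMap or TimeGate request; the aggregator remains free to ignore it.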
Client-Side Archive Specification
Respecification of archives.json
[
  {
    "id": "ia",
    "name": "Internet Archive",
    "timemap": "http://web.archive.org/web/timemap/link/",
    "timegate": "http://web.archive.org/web/"
  },
  {
    "id": "alice",
    "name": "Alice’s Captures",
    "timemap": "http://localhost:8081/timemap/",
    "timegate": "http://localhost:8081/timegate/"
  },
  …
]
Base64 encoded: Ww0KICB7...NCn0=
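On the receiving side, an aggregator would need to reverse this encoding before it can decide whether to honor the preference. The sketch below is an assumption about how that might look, not MMA's actual code: `decode_archives_data_uri` is a hypothetical name, and requiring `id`, `timemap`, and `timegate` on every entry is an illustrative validation rule based on the fields in the example above.

```python
import base64
import binascii
import json

DATA_URI_PREFIX = "data:application/json;charset=utf-8;base64,"
REQUIRED_KEYS = {"id", "timemap", "timegate"}  # assumed minimum per entry


def decode_archives_data_uri(uri):
    """Return the archive list carried in a data URI, or None if unusable."""
    if not uri.startswith(DATA_URI_PREFIX):
        return None
    try:
        raw = base64.b64decode(uri[len(DATA_URI_PREFIX):], validate=True)
        archives = json.loads(raw.decode("utf-8"))
    except (binascii.Error, UnicodeDecodeError, json.JSONDecodeError):
        return None
    if not isinstance(archives, list):
        return None
    for entry in archives:
        if not isinstance(entry, dict) or not REQUIRED_KEYS.issubset(entry):
            return None
    return archives


sample = [{"id": "alice",
           "timemap": "http://localhost:8081/timemap/",
           "timegate": "http://localhost:8081/timegate/"}]
sample_uri = DATA_URI_PREFIX + base64.b64encode(
    json.dumps(sample).encode("utf-8")).decode("ascii")

print(decode_archives_data_uri(sample_uri))
print(decode_archives_data_uri("data:text/plain,hello"))
```

Returning `None` for anything malformed matches the spirit of `Prefer`: a server that cannot use a preference can simply ignore it and answer with its default set.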
Requesting Custom Set of Archives with curl
> GET /timemap/link/http://fox.cs.vt.edu/wadl2017.html HTTP/1.1
> Host: mma.cs.odu.edu
> Prefer: archives="data:application/json;charset=utf-8;base64,Ww0KICB7...NCn0="
< HTTP/1.1 200
< content-type: application/link-format
< vary: prefer
< preference-applied: archives="data:application/json;charset=utf-8;base64,Ww0KICB7...NCn0="
< content-location: /timemap/link/5bd...8e9/http://fox.cs.vt.edu/wadl2017.html
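A client can tell whether the aggregator complied by comparing the `preference-applied` response header with what it sent. The following is a sketch under assumptions: `preference_was_applied` is a hypothetical helper, the header dictionaries are hand-written stand-ins for a real HTTP exchange, and exact string comparison is a simplification (a server could echo an equivalent value in a different form).

```python
def preference_was_applied(sent_prefer, response_headers):
    """Return True if the response echoes the preference the client sent."""
    # HTTP header names are case-insensitive, so normalize before lookup.
    normalized = {name.lower(): value for name, value in response_headers.items()}
    applied = normalized.get("preference-applied")
    return applied is not None and applied.strip() == sent_prefer.strip()


# "W10=" is the base64 encoding of an empty JSON list, used as a short example.
sent = 'archives="data:application/json;charset=utf-8;base64,W10="'

honored = {
    "Content-Type": "application/link-format",
    "Vary": "prefer",
    "Preference-Applied": sent,
}
ignored = {"Content-Type": "application/link-format"}

print(preference_was_applied(sent, honored))
print(preference_was_applied(sent, ignored))
```

When the header is absent, the client should assume the TimeMap reflects the aggregator's default archive set rather than the custom one.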
Non-Aggregated Public Web Archives
Potential Approaches Toward Archival Set Persistence for Subsequent Queries
1. Maintain state
○ content-location: /timemap/link/5bd...8e9/http://fox.cs.vt.edu/wadl2017.html
○ Not something we want to do with HTTP
2. Require re-specification with each request
○ not portable to other users
3. Server-side set caching
○ combinatorial explosion
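To put a number on the "combinatorial explosion" in option 3: if a cache were keyed on which subset of n known archives a client selected, there would be 2^n - 1 distinct non-empty subsets, and more still if the order of archives matters. The sketch below only illustrates that arithmetic. The slides do not describe a caching scheme, so the keying is an assumption for illustration, and client-supplied archives unknown to the server would make the space unbounded.

```python
import math


def unordered_sets(n):
    """Number of non-empty subsets of n archives."""
    return 2 ** n - 1


def ordered_sets(n):
    """Number of non-empty ordered selections (no repeats) of n archives."""
    return sum(math.perm(n, k) for k in range(1, n + 1))


for n in (3, 10, 20):
    print(n, unordered_sets(n), ordered_sets(n))
```

Even at 20 archives the unordered count already exceeds one million possible cache keys.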
21
Client-Assisted Memento Aggregation
Using the Prefer Header
Mat Kelly, Sawood Alam, Michael L. Nelson, and Michele C. Weigle
Old Dominion University
Web Science & Digital Libraries Research Group
{mkelly, salam, mln, mweigle}@cs.odu.edu
@machawk1 โ€ข @WebSciDL
Web Archiving and Digital Libraries (WADL) Workshop
June 6, 2018, Fort Worth, TX

More Related Content

What's hot

DBpedia Tutorial - Feb 2015, Dublin
DBpedia Tutorial - Feb 2015, DublinDBpedia Tutorial - Feb 2015, Dublin
DBpedia Tutorial - Feb 2015, Dublin
m_ackermann
ย 
Publishing the British National Bibliography as Linked Open Data / Corine Del...
Publishing the British National Bibliography as Linked Open Data / Corine Del...Publishing the British National Bibliography as Linked Open Data / Corine Del...
Publishing the British National Bibliography as Linked Open Data / Corine Del...
CIGScotland
ย 
Linked data experience at Macmillan: Building discovery services for scientif...
Linked data experience at Macmillan: Building discovery services for scientif...Linked data experience at Macmillan: Building discovery services for scientif...
Linked data experience at Macmillan: Building discovery services for scientif...
Michele Pasin
ย 
Web Driven Revolution For Library Data
Web Driven Revolution For Library DataWeb Driven Revolution For Library Data
Web Driven Revolution For Library Data
Richard Wallis
ย 
Querying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphQuerying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge Graph
Ioan Toma
ย 
The Semantic Data Web, Sรถren Auer, University of Leipzig
The Semantic Data Web, Sรถren Auer, University of LeipzigThe Semantic Data Web, Sรถren Auer, University of Leipzig
The Semantic Data Web, Sรถren Auer, University of Leipzig
LOD2 Creating Knowledge out of Interlinked Data
ย 
Maximising (Re)Usability of Library metadata using Linked Data
Maximising (Re)Usability of Library metadata using Linked Data Maximising (Re)Usability of Library metadata using Linked Data
Maximising (Re)Usability of Library metadata using Linked Data
Asuncion Gomez-Perez
ย 
The Web of Data is Our Opportunity
The Web of Data is Our OpportunityThe Web of Data is Our Opportunity
The Web of Data is Our Opportunity
Richard Wallis
ย 
DBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataDBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of Data
Sebastian Hellmann
ย 
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Sergio Fernรกndez
ย 
[Databeers] 06/05/2014 - Boris Villazon: โ€œData Integration - A Linked Data ap...
[Databeers] 06/05/2014 - Boris Villazon: โ€œData Integration - A Linked Data ap...[Databeers] 06/05/2014 - Boris Villazon: โ€œData Integration - A Linked Data ap...
[Databeers] 06/05/2014 - Boris Villazon: โ€œData Integration - A Linked Data ap...
Data Beers
ย 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of Entities
Richard Wallis
ย 
The RDF Report Card: Beyond the Triple Count
The RDF Report Card: Beyond the Triple CountThe RDF Report Card: Beyond the Triple Count
The RDF Report Card: Beyond the Triple Count
Leigh Dodds
ย 
Smart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge GraphSmart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge Graph
Peter Haase
ย 
Semantic web 101: Benefits for geologists
Semantic web 101: Benefits for geologistsSemantic web 101: Benefits for geologists
Semantic web 101: Benefits for geologists
dgarijo
ย 
A Platform for Object-Action Semantic Web Interaction
A Platform for Object-Action Semantic Web InteractionA Platform for Object-Action Semantic Web Interaction
A Platform for Object-Action Semantic Web Interaction
Roberto Garcรญa
ย 
Schema.org: Where did that come from!
Schema.org: Where did that come from!Schema.org: Where did that come from!
Schema.org: Where did that come from!
Richard Wallis
ย 
Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018
Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018
Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018
Ontotext
ย 
Linked data tooling XML
Linked data tooling XMLLinked data tooling XML
Linked data tooling XML
FREMEProjectH2020
ย 
Linked Open Data stuff
Linked Open Data stuffLinked Open Data stuff

What's hot (20)

DBpedia Tutorial - Feb 2015, Dublin
DBpedia Tutorial - Feb 2015, DublinDBpedia Tutorial - Feb 2015, Dublin
DBpedia Tutorial - Feb 2015, Dublin
ย 
Publishing the British National Bibliography as Linked Open Data / Corine Del...
Publishing the British National Bibliography as Linked Open Data / Corine Del...Publishing the British National Bibliography as Linked Open Data / Corine Del...
Publishing the British National Bibliography as Linked Open Data / Corine Del...
ย 
Linked data experience at Macmillan: Building discovery services for scientif...
Linked data experience at Macmillan: Building discovery services for scientif...Linked data experience at Macmillan: Building discovery services for scientif...
Linked data experience at Macmillan: Building discovery services for scientif...
ย 
Web Driven Revolution For Library Data
Web Driven Revolution For Library DataWeb Driven Revolution For Library Data
Web Driven Revolution For Library Data
ย 
Querying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphQuerying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge Graph
ย 
The Semantic Data Web, Sรถren Auer, University of Leipzig
The Semantic Data Web, Sรถren Auer, University of LeipzigThe Semantic Data Web, Sรถren Auer, University of Leipzig
The Semantic Data Web, Sรถren Auer, University of Leipzig
ย 
Maximising (Re)Usability of Library metadata using Linked Data
Maximising (Re)Usability of Library metadata using Linked Data Maximising (Re)Usability of Library metadata using Linked Data
Maximising (Re)Usability of Library metadata using Linked Data
ย 
The Web of Data is Our Opportunity
The Web of Data is Our OpportunityThe Web of Data is Our Opportunity
The Web of Data is Our Opportunity
ย 
DBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataDBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of Data
ย 
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
ย 
[Databeers] 06/05/2014 - Boris Villazon: โ€œData Integration - A Linked Data ap...
[Databeers] 06/05/2014 - Boris Villazon: โ€œData Integration - A Linked Data ap...[Databeers] 06/05/2014 - Boris Villazon: โ€œData Integration - A Linked Data ap...
[Databeers] 06/05/2014 - Boris Villazon: โ€œData Integration - A Linked Data ap...
ย 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of Entities
ย 
The RDF Report Card: Beyond the Triple Count
The RDF Report Card: Beyond the Triple CountThe RDF Report Card: Beyond the Triple Count
The RDF Report Card: Beyond the Triple Count
ย 
Smart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge GraphSmart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge Graph
ย 
Semantic web 101: Benefits for geologists
Semantic web 101: Benefits for geologistsSemantic web 101: Benefits for geologists
Semantic web 101: Benefits for geologists
ย 
A Platform for Object-Action Semantic Web Interaction
A Platform for Object-Action Semantic Web InteractionA Platform for Object-Action Semantic Web Interaction
A Platform for Object-Action Semantic Web Interaction
ย 
Schema.org: Where did that come from!
Schema.org: Where did that come from!Schema.org: Where did that come from!
Schema.org: Where did that come from!
ย 
Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018
Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018
Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018
ย 
Linked data tooling XML
Linked data tooling XMLLinked data tooling XML
Linked data tooling XML
ย 
Linked Open Data stuff
Linked Open Data stuffLinked Open Data stuff
Linked Open Data stuff
ย 

Similar to Client-Assisted Memento Aggregation Using the Prefer Header

A Framework for Aggregating Public and Private Web Archives
A Framework for Aggregating Public and Private Web ArchivesA Framework for Aggregating Public and Private Web Archives
A Framework for Aggregating Public and Private Web Archives
Mat Kelly
ย 
Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...
Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...
Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...
Ian Milligan
ย 
Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...
Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...
Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...
Ian Milligan
ย 
AWS Public Data Sets: How to Stage Petabytes of Data for Analysis in AWS (WPS...
AWS Public Data Sets: How to Stage Petabytes of Data for Analysis in AWS (WPS...AWS Public Data Sets: How to Stage Petabytes of Data for Analysis in AWS (WPS...
AWS Public Data Sets: How to Stage Petabytes of Data for Analysis in AWS (WPS...
Amazon Web Services
ย 
REST easy with API Platform
REST easy with API PlatformREST easy with API Platform
REST easy with API Platform
Antonio Peric-Mazar
ย 
Mi Domain Wheel Slides
Mi Domain Wheel SlidesMi Domain Wheel Slides
Mi Domain Wheel Slides
lancesfa
ย 
DHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository InteroperabilityDHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository Interoperability
Access Innovations, Inc.
ย 
Telling the World and Our Users What We Have
Telling the World and Our Users What We HaveTelling the World and Our Users What We Have
Telling the World and Our Users What We Have
Richard Wallis
ย 
Log ingestion kafka -- impala using apex
Log ingestion   kafka -- impala using apexLog ingestion   kafka -- impala using apex
Log ingestion kafka -- impala using apex
Apache Apex
ย 
Frontera ั€ะฐัะฟั€ะตะดะตะปะตะฝะฝั‹ะน ั€ะพะฑะพั‚ ะดะปั ะพะฑั…ะพะดะฐ ะฒะตะฑะฐ ะฒ ะฑะพะปัŒัˆะธั… ะพะฑัŠะตะผะฐั… / ะะปะตะบัะฐะฝะดั€ ะก...
Frontera ั€ะฐัะฟั€ะตะดะตะปะตะฝะฝั‹ะน ั€ะพะฑะพั‚ ะดะปั ะพะฑั…ะพะดะฐ ะฒะตะฑะฐ ะฒ ะฑะพะปัŒัˆะธั… ะพะฑัŠะตะผะฐั… / ะะปะตะบัะฐะฝะดั€ ะก...Frontera ั€ะฐัะฟั€ะตะดะตะปะตะฝะฝั‹ะน ั€ะพะฑะพั‚ ะดะปั ะพะฑั…ะพะดะฐ ะฒะตะฑะฐ ะฒ ะฑะพะปัŒัˆะธั… ะพะฑัŠะตะผะฐั… / ะะปะตะบัะฐะฝะดั€ ะก...
Frontera ั€ะฐัะฟั€ะตะดะตะปะตะฝะฝั‹ะน ั€ะพะฑะพั‚ ะดะปั ะพะฑั…ะพะดะฐ ะฒะตะฑะฐ ะฒ ะฑะพะปัŒัˆะธั… ะพะฑัŠะตะผะฐั… / ะะปะตะบัะฐะฝะดั€ ะก...
Ontico
ย 
Meetup070416 Presentations
Meetup070416 PresentationsMeetup070416 Presentations
Meetup070416 Presentations
Ana Rebelo
ย 
Internet content as research data
Internet content as research dataInternet content as research data
Internet content as research data
National Library of Australia
ย 
Data+Need=Hack
Data+Need=HackData+Need=Hack
Data+Need=Hack
Nikos Manolis
ย 
Facilitating the discovery of public datasets
Facilitating the discovery of public datasetsFacilitating the discovery of public datasets
Facilitating the discovery of public datasets
Nafiseh Navabpour
ย 
Analyzing Web Archives
Analyzing Web ArchivesAnalyzing Web Archives
Analyzing Web Archives
vinaygo
ย 
Hack angular wildly
Hack angular wildlyHack angular wildly
Hack angular wildly
Todd Warren
ย 
Web archiving challenges and opportunities
Web archiving challenges and opportunitiesWeb archiving challenges and opportunities
Web archiving challenges and opportunities
Ahmed AlSum
ย 
Linked Data (1st Linked Data Meetup Malmรถ)
Linked Data (1st Linked Data Meetup Malmรถ)Linked Data (1st Linked Data Meetup Malmรถ)
Linked Data (1st Linked Data Meetup Malmรถ)
Anja Jentzsch
ย 
Polyglot Persistence
Polyglot PersistencePolyglot Persistence
Polyglot Persistence
Bryan Reinero
ย 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
National Information Standards Organization (NISO)
ย 

Similar to Client-Assisted Memento Aggregation Using the Prefer Header (20)

A Framework for Aggregating Public and Private Web Archives
A Framework for Aggregating Public and Private Web ArchivesA Framework for Aggregating Public and Private Web Archives
A Framework for Aggregating Public and Private Web Archives
ย 
Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...
Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...
Warcbase: Building a Scalable Platform on HBase and Hadoop - Part Two, Histor...
ย 
Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...
Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...
Warcbase Building a Scalable Platform on HBase and Hadoop - Part Two: Histori...
ย 
AWS Public Data Sets: How to Stage Petabytes of Data for Analysis in AWS (WPS...
AWS Public Data Sets: How to Stage Petabytes of Data for Analysis in AWS (WPS...AWS Public Data Sets: How to Stage Petabytes of Data for Analysis in AWS (WPS...
AWS Public Data Sets: How to Stage Petabytes of Data for Analysis in AWS (WPS...
ย 
REST easy with API Platform
REST easy with API PlatformREST easy with API Platform
REST easy with API Platform
ย 
Mi Domain Wheel Slides
Mi Domain Wheel SlidesMi Domain Wheel Slides
Mi Domain Wheel Slides
ย 
DHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository InteroperabilityDHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository Interoperability
ย 
Telling the World and Our Users What We Have
Telling the World and Our Users What We HaveTelling the World and Our Users What We Have
Telling the World and Our Users What We Have
ย 
Log ingestion kafka -- impala using apex
Log ingestion   kafka -- impala using apexLog ingestion   kafka -- impala using apex
Log ingestion kafka -- impala using apex
ย 
Frontera ั€ะฐัะฟั€ะตะดะตะปะตะฝะฝั‹ะน ั€ะพะฑะพั‚ ะดะปั ะพะฑั…ะพะดะฐ ะฒะตะฑะฐ ะฒ ะฑะพะปัŒัˆะธั… ะพะฑัŠะตะผะฐั… / ะะปะตะบัะฐะฝะดั€ ะก...
Frontera ั€ะฐัะฟั€ะตะดะตะปะตะฝะฝั‹ะน ั€ะพะฑะพั‚ ะดะปั ะพะฑั…ะพะดะฐ ะฒะตะฑะฐ ะฒ ะฑะพะปัŒัˆะธั… ะพะฑัŠะตะผะฐั… / ะะปะตะบัะฐะฝะดั€ ะก...Frontera ั€ะฐัะฟั€ะตะดะตะปะตะฝะฝั‹ะน ั€ะพะฑะพั‚ ะดะปั ะพะฑั…ะพะดะฐ ะฒะตะฑะฐ ะฒ ะฑะพะปัŒัˆะธั… ะพะฑัŠะตะผะฐั… / ะะปะตะบัะฐะฝะดั€ ะก...
Frontera ั€ะฐัะฟั€ะตะดะตะปะตะฝะฝั‹ะน ั€ะพะฑะพั‚ ะดะปั ะพะฑั…ะพะดะฐ ะฒะตะฑะฐ ะฒ ะฑะพะปัŒัˆะธั… ะพะฑัŠะตะผะฐั… / ะะปะตะบัะฐะฝะดั€ ะก...
ย 
Meetup070416 Presentations
Meetup070416 PresentationsMeetup070416 Presentations
Meetup070416 Presentations
ย 
Internet content as research data
Internet content as research dataInternet content as research data
Internet content as research data
ย 
Data+Need=Hack
Data+Need=HackData+Need=Hack
Data+Need=Hack
ย 
Facilitating the discovery of public datasets
Facilitating the discovery of public datasetsFacilitating the discovery of public datasets
Facilitating the discovery of public datasets
ย 
Analyzing Web Archives
Analyzing Web ArchivesAnalyzing Web Archives
Analyzing Web Archives
ย 
Hack angular wildly
Hack angular wildlyHack angular wildly
Hack angular wildly
ย 
Web archiving challenges and opportunities
Web archiving challenges and opportunitiesWeb archiving challenges and opportunities
Web archiving challenges and opportunities
ย 
Linked Data (1st Linked Data Meetup Malmรถ)
Linked Data (1st Linked Data Meetup Malmรถ)Linked Data (1st Linked Data Meetup Malmรถ)
Linked Data (1st Linked Data Meetup Malmรถ)
ย 
Polyglot Persistence
Polyglot PersistencePolyglot Persistence
Polyglot Persistence
ย 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
ย 

More from Mat Kelly

Aggregating Private and Public Web Archives Using the Mementity Framework
Aggregating Private and Public Web Archives Using the Mementity FrameworkAggregating Private and Public Web Archives Using the Mementity Framework
Aggregating Private and Public Web Archives Using the Mementity Framework
Mat Kelly
ย 
Impact of URI Canonicalization on Memento Count
Impact of URI Canonicalization on Memento Count Impact of URI Canonicalization on Memento Count
Impact of URI Canonicalization on Memento Count
Mat Kelly
ย 
Exploring Aggregation of Personal, Private, and Institutional Web Archives
Exploring Aggregation of Personal, Private, and Institutional Web ArchivesExploring Aggregation of Personal, Private, and Institutional Web Archives
Exploring Aggregation of Personal, Private, and Institutional Web Archives
Mat Kelly
ย 
JCDL 2015 Doctoral Consortium - A Framework for Aggregating Private and Publi...
JCDL 2015 Doctoral Consortium - A Framework for AggregatingPrivate and Publi...JCDL 2015 Doctoral Consortium - A Framework for AggregatingPrivate and Publi...
JCDL 2015 Doctoral Consortium - A Framework for Aggregating Private and Publi...
Mat Kelly
ย 
Visualizing Digital Collections of Web Archives from Columbia Web Archiving C...
Visualizing Digital Collections of Web Archives from Columbia Web Archiving C...Visualizing Digital Collections of Web Archives from Columbia Web Archiving C...
Visualizing Digital Collections of Web Archives from Columbia Web Archiving C...
Mat Kelly
ย 
Facilitation of the A Posteriori Replication of Web Published Satellite Imagery
Facilitation of the A Posteriori Replication of Web Published Satellite ImageryFacilitation of the A Posteriori Replication of Web Published Satellite Imagery
Facilitation of the A Posteriori Replication of Web Published Satellite Imagery
Mat Kelly
ย 
Slides
SlidesSlides
Slides
Mat Kelly
ย 
Mink: Integrating the Live and Archived Web Viewing Experience Using Web Brow...
Mink: Integrating the Live and Archived Web Viewing Experience Using Web Brow...Mink: Integrating the Live and Archived Web Viewing Experience Using Web Brow...
Mink: Integrating the Live and Archived Web Viewing Experience Using Web Brow...
Mat Kelly
ย 
Efficient Thumbnail Generation for Web Archives at Digital Preservation 2014
Efficient Thumbnail Generation for Web Archives at Digital Preservation 2014Efficient Thumbnail Generation for Web Archives at Digital Preservation 2014
Efficient Thumbnail Generation for Web Archives at Digital Preservation 2014
Mat Kelly
ย 
Browser-Based Digital Preservation
Browser-Based Digital PreservationBrowser-Based Digital Preservation
Browser-Based Digital Preservation
Mat Kelly
ย 
Archive What I See Now - Archive-It Partner Meeting 2013 2013
Archive What I See Now - Archive-It Partner Meeting 2013 2013Archive What I See Now - Archive-It Partner Meeting 2013 2013
Archive What I See Now - Archive-It Partner Meeting 2013 2013
Mat Kelly
ย 
IEEE VIS 2013 Graph-Based Navigation of a Box Office Prediction System
IEEE VIS 2013 Graph-Based Navigation of a Box Office Prediction SystemIEEE VIS 2013 Graph-Based Navigation of a Box Office Prediction System
IEEE VIS 2013 Graph-Based Navigation of a Box Office Prediction System
Mat Kelly
ย 
Digital Preservation 2013
Digital Preservation 2013Digital Preservation 2013
Digital Preservation 2013
Mat Kelly
ย 
Making Enterprise-Level Archive Tools Accessible for Personal Web Archiving
Making Enterprise-Level Archive Tools Accessible for Personal Web ArchivingMaking Enterprise-Level Archive Tools Accessible for Personal Web Archiving
Making Enterprise-Level Archive Tools Accessible for Personal Web Archiving
Mat Kelly
ย 
An Extensible Framework for Creating Personal Web Archives of Content Behind ...
An Extensible Framework for Creating Personal Web Archives of Content Behind ...An Extensible Framework for Creating Personal Web Archives of Content Behind ...
An Extensible Framework for Creating Personal Web Archives of Content Behind ...
Mat Kelly
ย 
The Revolution Will Not Be Archived
The Revolution Will Not Be ArchivedThe Revolution Will Not Be Archived
The Revolution Will Not Be Archived
Mat Kelly
ย 
WARCreate - Create Wayback-Consumable WARC Files from Any Webpage
WARCreate - Create Wayback-Consumable WARC Files from Any WebpageWARCreate - Create Wayback-Consumable WARC Files from Any Webpage
WARCreate - Create Wayback-Consumable WARC Files from Any Webpage
Mat Kelly
ย 
NDIIPP/NDSA 2011 - YouTube Link Restoration
NDIIPP/NDSA 2011 - YouTube Link RestorationNDIIPP/NDSA 2011 - YouTube Link Restoration
NDIIPP/NDSA 2011 - YouTube Link Restoration
Mat Kelly
ย 
NDIIPP/NDSA 2011 - Archive Facebook
NDIIPP/NDSA 2011 - Archive FacebookNDIIPP/NDSA 2011 - Archive Facebook
NDIIPP/NDSA 2011 - Archive Facebook
Mat Kelly
ย 

More from Mat Kelly (19)

Aggregating Private and Public Web Archives Using the Mementity Framework
Aggregating Private and Public Web Archives Using the Mementity FrameworkAggregating Private and Public Web Archives Using the Mementity Framework
Aggregating Private and Public Web Archives Using the Mementity Framework
ย 
Impact of URI Canonicalization on Memento Count
Impact of URI Canonicalization on Memento Count Impact of URI Canonicalization on Memento Count
Impact of URI Canonicalization on Memento Count
ย 
Exploring Aggregation of Personal, Private, and Institutional Web Archives
Exploring Aggregation of Personal, Private, and Institutional Web ArchivesExploring Aggregation of Personal, Private, and Institutional Web Archives
Exploring Aggregation of Personal, Private, and Institutional Web Archives
ย 
JCDL 2015 Doctoral Consortium - A Framework for Aggregating Private and Publi...
JCDL 2015 Doctoral Consortium - A Framework for AggregatingPrivate and Publi...JCDL 2015 Doctoral Consortium - A Framework for AggregatingPrivate and Publi...
JCDL 2015 Doctoral Consortium - A Framework for Aggregating Private and Publi...
ย 
Visualizing Digital Collections of Web Archives from Columbia Web Archiving C...
Visualizing Digital Collections of Web Archives from Columbia Web Archiving C...Visualizing Digital Collections of Web Archives from Columbia Web Archiving C...
Visualizing Digital Collections of Web Archives from Columbia Web Archiving C...
ย 
Client-Assisted Memento Aggregation Using the Prefer Header

  • 1. Client-Assisted Memento Aggregation Using the Prefer Header Mat Kelly, Sawood Alam, Michael L. Nelson, and Michele C. Weigle Old Dominion University Web Science & Digital Libraries Research Group {mkelly, salam, mln, mweigle}@cs.odu.edu @machawk1 • @WebSciDL Web Archiving and Digital Libraries (WADL) Workshop June 6, 2018, Fort Worth, TX
  • 2. @machawk1 Client-Assisted Memento Aggregation Using the Prefer Header WADL 2018 • June 6, 2018 • Fort Worth, TX Proliferation of Personal Web Archives
  • 3. @machawk1 A Framework for Aggregating Private and Public Web Archives JCDL 2018 • June 5, 2018 • Fort Worth, TX Today’s Memento Aggregation Archives Queried (A0)
  • 4. Motivation
       Archives Queried (A0)
       > Include personal archives
       > Include other non-aggregated archives
  • 5. Motivation
       Archives Queried (A0)
       > Include personal archives
       > Include other non-aggregated archives
  • 6. State of Aggregators’ Capabilities
       ● Mementoweb aggregator
         ○ Cannot customize set of archives aggregated
         ○ Open source? Unavailable for individuals’ deployment
       ● MemGator
         ○ Open source ✔ https://github.com/oduwsdl/MemGator
         ○ Requires static set of archives at launch
         ○ Still specified by server, clients have no say
       ● With each, the set of archives is determined on the “server”.
       ● Neither allows the client to specify the set of archives aggregated.
  • 7. HTTP Prefer
       ● RFC 7240 (June 2014)
       ● CLIENT requests with HTTP header:
         ○ Prefer: foo; bar=""
       ● SERVER may respond with HTTP header:
         ○ Preference-Applied: foo
  • 8. HTTP Prefer
       ● RFC 7240 (June 2014)
       ● CLIENT requests with HTTP header:
         ○ Prefer: foo; bar=""
       ● SERVER may respond with HTTP header:
         ○ Preference-Applied: foo
       OUR APPROACH:
       Prefer: archives="data:application/json;charset=utf-8;base64,Ww0KICB7...NCn0="
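The Prefer syntax on slides 7 and 8 can be illustrated with a small parser. This is a minimal sketch in Python, not code from the MMA: the function names `split_outside_quotes`, `unquote`, and `parse_prefer` are invented for illustration, and only a subset of RFC 7240 is covered (comma-separated preferences, an optional `=value`, `;`-separated parameters, double-quoted strings). The splitting has to respect quotes because the data URI carried in `archives` itself contains `;` and `,`.

```python
def split_outside_quotes(text: str, sep: str) -> list[str]:
    """Split text on sep, ignoring separators inside double-quoted strings."""
    parts, current = [], []
    in_quotes = escaped = False
    for ch in text:
        if escaped:                      # character after a backslash
            current.append(ch)
            escaped = False
        elif ch == "\\" and in_quotes:   # start of an escape sequence
            current.append(ch)
            escaped = True
        elif ch == '"':
            in_quotes = not in_quotes
            current.append(ch)
        elif ch == sep and not in_quotes:
            parts.append("".join(current))
            current = []
        else:
            current.append(ch)
    parts.append("".join(current))
    return parts


def unquote(value: str) -> str:
    """Strip surrounding double quotes and undo backslash escapes."""
    value = value.strip()
    if len(value) < 2 or value[0] != '"' or value[-1] != '"':
        return value
    out, escaped = [], False
    for ch in value[1:-1]:
        if escaped:
            out.append(ch)
            escaped = False
        elif ch == "\\":
            escaped = True
        else:
            out.append(ch)
    return "".join(out)


def parse_prefer(header: str) -> dict[str, dict]:
    """Map each preference token to its value and parameters."""
    prefs: dict[str, dict] = {}
    for item in split_outside_quotes(header, ","):
        pieces = [p.strip() for p in split_outside_quotes(item, ";")]
        pieces = [p for p in pieces if p]
        if not pieces:
            continue
        name, _, value = pieces[0].partition("=")
        params = {}
        for piece in pieces[1:]:
            key, _, val = piece.partition("=")
            params[key.strip().lower()] = unquote(val)
        # If a token is repeated, the first occurrence is kept.
        prefs.setdefault(name.strip().lower(),
                         {"value": unquote(value), "params": params})
    return prefs
```

Calling `parse_prefer('foo; bar=""')` yields a `foo` preference with an empty value and a `bar` parameter; an `archives="data:..."` preference comes back with the whole data URI as its value.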
  • 9. Prefer + Memento
       ● S. Jones, H. Van de Sompel, et al. “Mementos in the Raw” ¹
         ○ Prefer: original-content, original-links, original-headers
         ○ Mitigate replay system rewriting, make “raw” information more accessible
       ● D.S.H. Rosenthal “Content negotiation and Memento” ²
         ○ none, screenshot, altered-dom, url-rewritten, banner-inserted
         ○ Additional focus on derived representations
       ¹ http://ws-dl.blogspot.com/2016/08/2016-08-15-mementos-in-raw-take-two.html
       ² https://blog.dshr.org/2016/08/content-negotiation-and-memento.html
  • 10. A More Capable, Transparent Aggregator
  • 11. Memento Meta-Aggregator (MMA) ¹
       ● Additional responsibilities beyond aggregation
       ● Provide hierarchical querying model to other aggregators
       ● Advanced querying models like Precedence and Short-Circuiting
       ● Systematic interaction and aggregation with private and personal Web archives
       ¹ Kelly et al. “A Framework for Aggregating Private and Public Web Archives”, JCDL 2018
  • 12. Bob Prefers to Exclude IA Captures
  • 13. Bob Requests Supported Archives (GET /archives)
  • 14. Bob Customizes the Set in the JSON
  • 15. Bob Requests CNN for His Custom Set (base64-encoded JSON transmitted)
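Bob's client-side steps in slides 12 to 15 can be sketched as follows. This is an illustrative Python sketch under stated assumptions, not the authors' client: the helper name `archives_to_prefer` is hypothetical, and the two archive entries are the ones shown on the archives.json slide (slide 18). The data URI prefix matches the one used in the deck's `Prefer` examples.

```python
import base64
import json


def archives_to_prefer(archives: list[dict]) -> str:
    """Serialize an archive list as JSON and wrap it in a base64 data URI
    suitable for the value of a Prefer request header."""
    payload = json.dumps(archives).encode("utf-8")
    encoded = base64.b64encode(payload).decode("ascii")
    return f'archives="data:application/json;charset=utf-8;base64,{encoded}"'


# What the aggregator might return for GET /archives (entries from slide 18).
supported = [
    {
        "id": "ia",
        "name": "Internet Archive",
        "timemap": "http://web.archive.org/web/timemap/link/",
        "timegate": "http://web.archive.org/web/",
    },
    {
        "id": "alice",
        "name": "Alice's Captures",
        "timemap": "http://localhost:8081/timemap/",
        "timegate": "http://localhost:8081/timegate/",
    },
]

# Bob excludes the Internet Archive and keeps everything else.
custom = [archive for archive in supported if archive["id"] != "ia"]

# Value to send as:  Prefer: archives="data:application/json;...;base64,..."
header_value = archives_to_prefer(custom)
```

The resulting `header_value` is what the client would place after `Prefer:` when requesting a TimeMap from the aggregator.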
  • 16. MMA Complies or Ignores Preference
  • 17. Client-Side Archive Specification
  • 18. Respecification of archives.json
        [
          {
            "id": "ia",
            "name": "Internet Archive",
            "timemap": "http://web.archive.org/web/timemap/link/",
            "timegate": "http://web.archive.org/web/"
          },
          {
            "id": "alice",
            "name": "Alice’s Captures",
            "timemap": "http://localhost:8081/timemap/",
            "timegate": "http://localhost:8081/timegate/"
          },
          …
        ]
        Base64 encoded: Ww0KICB7...NCn0=
  • 19. Requesting Custom Set of Archives with curl
        > GET /timemap/link/http://fox.cs.vt.edu/wadl2017.html HTTP/1.1
        > Host: mma.cs.odu.edu
        > Prefer: archives="data:application/json;charset=utf-8;base64,Ww0KICB7...NCn0="
        < HTTP/1.1 200
        < content-type: application/link-format
        < vary: prefer
        < preference-applied: archives="data:application/json;charset=utf-8;base64,Ww0KICB7...NCn0="
        < content-location: /timemap/link/5bd...8e9/http://fox.cs.vt.edu/wadl2017.html
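The aggregator's side of this exchange ("MMA complies or ignores preference") could look roughly like the sketch below. It is written in Python under explicit assumptions and is not the MMA implementation: `decode_archives`, `respond`, and `REQUIRED_KEYS` are hypothetical names, and the validation rule (a non-empty list of objects that each carry `id`, `timemap`, and `timegate`) is an assumption drawn from the fields shown in archives.json. The response headers mirror those in the curl output.

```python
import base64
import binascii
import json

PREFIX = "data:application/json;charset=utf-8;base64,"
REQUIRED_KEYS = {"id", "timemap", "timegate"}  # assumed minimum per entry


def decode_archives(pref_value: str):
    """Return the archive list carried in the data URI, or None if unusable."""
    if not pref_value.startswith(PREFIX):
        return None
    try:
        raw = base64.b64decode(pref_value[len(PREFIX):], validate=True)
        archives = json.loads(raw.decode("utf-8"))
    except (binascii.Error, UnicodeDecodeError, json.JSONDecodeError):
        return None
    if not isinstance(archives, list) or not archives:
        return None
    for entry in archives:
        if not isinstance(entry, dict) or not REQUIRED_KEYS.issubset(entry):
            return None
    return archives


def respond(pref_value, default_archives):
    """Choose the archive set to query and build the response headers."""
    headers = {"Content-Type": "application/link-format", "Vary": "Prefer"}
    archives = decode_archives(pref_value) if pref_value else None
    if archives is None:
        # Preference absent or unusable: ignore it and use the server's set.
        return default_archives, headers
    # Preference honored: echo it back so the client knows it was applied.
    headers["Preference-Applied"] = f'archives="{pref_value}"'
    return archives, headers
```

A client can tell which branch was taken by checking for `Preference-Applied` in the response; its absence means the server fell back to its own set.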
  • 20. Non-Aggregated Public Web Archives
  • 21. Potential Approaches Toward Archival Set Persistence for Subsequent Queries
        1. Maintain state
           ○ content-location: /timemap/link/5bd...8e9/http://fox.cs.vt.edu/wadl2017.html
           ○ Not something we want to do with HTTP
        2. Require re-specification with each request
           ○ not portable to other users
        3. Server-side set caching
           ○ combinatorial explosion
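The slides do not say how the identifier in the `content-location` path (`5bd...8e9`) is produced. One way such an identifier could be derived, purely as an assumption for illustration, is to hash a canonical serialization of the archive set and keep a server-side map from identifier to set. The Python sketch below uses SHA-256 and the hypothetical names `archive_set_id`, `remember`, and `timemap_location`. The list order is left untouched on the assumption that it may be significant to the aggregator.

```python
import hashlib
import json

# Hypothetical in-memory cache: set identifier -> archive list.
SET_CACHE: dict[str, list[dict]] = {}


def archive_set_id(archives: list[dict]) -> str:
    """Hash a canonical JSON form so equal sets get equal identifiers.
    Object keys are sorted; the order of the list itself is preserved."""
    canonical = json.dumps(archives, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def remember(archives: list[dict]) -> str:
    """Store the set under its identifier and return the identifier."""
    set_id = archive_set_id(archives)
    SET_CACHE[set_id] = archives
    return set_id


def timemap_location(archives: list[dict], urir: str) -> str:
    """Build a Content-Location style path for a TimeMap over this set."""
    return f"/timemap/link/{remember(archives)}/{urir}"
```

Every distinct set a client submits adds a cache entry under this scheme, which is the growth problem the slide flags as "combinatorial explosion".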
  • 22. Client-Assisted Memento Aggregation Using the Prefer Header Mat Kelly, Sawood Alam, Michael L. Nelson, and Michele C. Weigle Old Dominion University Web Science & Digital Libraries Research Group {mkelly, salam, mln, mweigle}@cs.odu.edu @machawk1 • @WebSciDL Web Archiving and Digital Libraries (WADL) Workshop June 6, 2018, Fort Worth, TX