SlideShare a Scribd company logo
JCDL 2013
July 24th Indianapolis, IN 1
Martin Klein
@mart1nkle1n
martinklein0815@gmail.com
Herbert Van de Sompel
@hvdsomp
hvdsomp@gmail.com
http://www.openarchives.org/rs/
Extending Sitemaps for ResourceSync
JCDL 2013
July 24th Indianapolis, IN 2
ResourceSync Core Team
JCDL 2013
July 24th Indianapolis, IN 3
ResourceSync Technical Group
JISC
Richard Jones
Graham Klyne
Stuart Lewis
OCLC
Jeff Young
LOCKSS
David Rosenthal
RedHat
Christian Sadilek
Ex Libris Inc.
Shlomo Sanders
Library of Congress
Kevin Ford
JCDL 2013
July 24th Indianapolis, IN 4
Synchronize
• Web resources
o things with a URI that can be dereferenced
• many/few
• big/small
• fast/slow
What
• Keep “in sync”
• Destination (client) follows changes at a Source
(server) over time
• Keep copies on different systems the same
JCDL 2013
July 24th Indianapolis, IN 5
Two ResourceSync Capabilities
Resource List
Lists resources
subject to synchronization
Change List
Lists changes to
resources
subject to synchronization
• Allow Destinations to obtain current resources
• RequiresURI
• Allow Destination to verify accuracy of sync’ed content
• Requireslastmodand fixity information
• Allow Source to include references to additional content
• Requires inclusion of links
JCDL 2013
July 24th Indianapolis, IN 6
Entrance…. Sitemaps
• Resource List is an
inventory – so is a Sitemap
• Low barrier of adoption
• Ack’ed by Google, Yahoo!,
Bing
JCDL 2013
July 24th Indianapolis, IN 7
<loc>http://example.com/res1</loc>
<lastmod>2013-07-24-T09:00:00Z</lastmod>
<url>
</url>
<url>
…
</url>
<urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9”>
</urlset>
Sitemap Format
JCDL 2013
July 24th Indianapolis, IN 8
<urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9”>
--- root level ---
document info, lastmod, links
<url>
--- resource level ---
fixity, change type, and other resource info, links
<loc>http://example.com/res1</loc>
<lastmod>2013-07-24T09:00:00Z</lastmod>
</url>
<url>
…
</url>
</urlset>
ResourceSync Sitemap Extensions
JCDL 2013
July 24th Indianapolis, IN 9
Testing ResourceSync Sitemap Extensions
Series of informal experiments
1. Enhance Sitemaps with attributes and elements
2. Submit Sitemapsto Google’s Webmaster Tool
3. Evaluate immediate feedback
4. Check Google index
Concerns:
1. Rejection ofResourceSync documents due to
a. Added elements and attributes on root level
b. Added elements and attributes on resource level
2. Unwanted indexing of URIs from links vs. <loc>
JCDL 2013
July 24th Indianapolis, IN 10
<urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9”
xmlns:rs="http://www.openarchives.org/rs/terms/”>
<rs:meta capability=”resourcelist”
modified=”2013-07-24-T11:00:00Z"/>
<url>
<loc>http://example.com/res1</loc>
<lastmod>2013-07-24T09:00:00Z</lastmod>
</url>
</urlset>
Sitemap Extensions Test #1
Inclusion of elements and attributes at root level
to convey:
• Type of capability
• Last modification date
JCDL 2013
July 24th Indianapolis, IN 11
<urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9”
xmlns:rs="http://www.openarchives.org/rs/terms/”>
<url>
<loc>http://example.com/res1</loc>
<lastmodrs:change=”updated">2013-07-24T09:00:00Z</lastmod>
<rs:fixity type=“md5”>a2f29dklfgj9823lksdf90sfkd</rs:fixity>
<rs:mimetype>text/html</rs:mimetype>
</url>
</urlset>
Sitemap Extensions Test #2
Inclusion of elements and attributes at resource level
to convey:
• Change type
• Metadata
JCDL 2013
July 24th Indianapolis, IN 12
<urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9”
xmlns:rs="http://www.openarchives.org/rs/terms/”>
<rs:linkrel=”resourcesync”
href=”http://example.com/capabilitylist.xml"/>
<rs:linkrel=”describedby”
href=”http://example.com/info-about-source.xml"/>
<url>
<loc>http://example.com/res1</loc>
<lastmod>2013-07-24T09:00:00Z</lastmod>
</url>
</urlset>
Sitemap Extensions Test #3
Inclusion of links at root level to:
• Navigate through the framework
• Point at misc documents
JCDL 2013
July 24th Indianapolis, IN 13
<urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9”
xmlns:rs="http://www.openarchives.org/rs/terms/”>
<url>
<loc>http://example.com/res1</loc>
<lastmod>2013-07-24T09:00:00Z</lastmod>
<rs:linkrel="duplicate"
href="http://mirror.example.com/res1"/>
<rs:linkrel="http://www.openarchives.org/rs/terms/patch”
href="http://example.com/res1-json-patch"
type="application/json-patch"/>
</url>
</urlset>
Sitemap Extensions Test #4
Inclusion of links at resource level to:
• Point to related resources documents
JCDL 2013
July 24th Indianapolis, IN 14
Results - Sitemap Extensions Test #4
As expected:
1. Child elements tolerated
2. Google indexes URI within <loc>
Unintended consequences:
3. Google indexes URIs within <rs:link>
2 & 3 together is not desired e.g.,
• When mirror location is provided, URI in <rs:link>
should and URI in <loc> should not be indexed
• URI in <rs:link> points at partial content
JCDL 2013
July 24th Indianapolis, IN 15
<urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9”
xmlns:rs="http://www.openarchives.org/rs/terms/”>
<url>
<loc>http://example.com/res1</loc>
<lastmod>2013-07-24T09:00:00Z</lastmod>
<rs:linkrel="duplicate"
href="http://mirror.example.com/res1"/>
<rs:linkrel="http://www.openarchives.org/rs/terms/patch”
href="http://example.com/res1-json-patch"
type="application/json-patch"/>
</url>
</urlset>
Sitemap Extensions Test #4
Inclusion of links at resource level to:
• Point to related resources documents
JCDL 2013
July 24th Indianapolis, IN 16
<urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9”
xmlns:rs="http://www.openarchives.org/rs/terms/”>
<rs:lnrel=”resourcesync”
href=”http://example.com/capabilitylist.xml"/>
<rs:mdcapability=”changelist”
modified=” 2013-07-24-T11:00:00Z"/>
<url>
<loc>http://example.com/res1</loc>
<lastmod>2013-07-24T09:00:00Z</lastmod>
<rs:mdchange=”updated”
type=”text/html”
hash=”md5:a2f94c567f9b370c43fb1188f1f46330”/>
<rs:lnrel=”duplicate”
href=”http://mirror.example.com/res1"/>
</url>
</urlset>
Summary
JCDL 2013
July 24th Indianapolis, IN 17
http://www.openarchives.org/rs/
JCDL 2013
July 24th Indianapolis, IN 18
Martin Klein
@mart1nkle1n
martinklein0815@gmail.com
Herbert Van de Sompel
@hvdsomp
hvdsomp@gmail.com
http://www.openarchives.org/rs/
Extending Sitemaps for ResourceSync
Thank you!

More Related Content

Similar to Jcdl2013 mklein

NISO ResourceSync Training Session
NISO ResourceSync Training SessionNISO ResourceSync Training Session
NISO ResourceSync Training Session
National Information Standards Organization (NISO)
 
ResourceSync in 24x7
ResourceSync in 24x7ResourceSync in 24x7
ResourceSync in 24x7
Simeon Warner
 
elasticsearch
elasticsearchelasticsearch
elasticsearch
Satish Mohan
 
Azure DocumentDB for Healthcare Integration
Azure DocumentDB for Healthcare IntegrationAzure DocumentDB for Healthcare Integration
Azure DocumentDB for Healthcare Integration
BizTalk360
 
Linked Open Data and Digital Curation (Islandora)
Linked Open Data and Digital Curation (Islandora)Linked Open Data and Digital Curation (Islandora)
Linked Open Data and Digital Curation (Islandora)
Hong (Jenny) Jing
 
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...
UKSG: connecting the knowledge community
 
ResourceSync - NISO Update Jan 2014
ResourceSync - NISO Update Jan 2014ResourceSync - NISO Update Jan 2014
ResourceSync - NISO Update Jan 2014
National Information Standards Organization (NISO)
 
Resources plugin
Resources pluginResources plugin
Resources plugin
Alidad Soleimani
 
RDAP14: Learning to Curate Panel
RDAP14: Learning to Curate Panel RDAP14: Learning to Curate Panel
RDAP14: Learning to Curate Panel
ASIS&T
 
Linked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve MeyerLinked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve Meyer
WiLS
 
Linked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and MuseumsLinked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and Museums
trevorthornton
 
Dizajn REST API-ja
Dizajn REST API-jaDizajn REST API-ja
Dizajn REST API-ja
Denis Kranjčec
 
Commodity Semantic Search: A Case Study of DiscoverEd
Commodity Semantic Search: A Case Study of DiscoverEdCommodity Semantic Search: A Case Study of DiscoverEd
Commodity Semantic Search: A Case Study of DiscoverEd
Nathan Yergler
 
Describing Theses and Dissertations Using Schema.org
Describing Theses and Dissertations Using Schema.orgDescribing Theses and Dissertations Using Schema.org
Describing Theses and Dissertations Using Schema.org
OCLC
 
Upcoming JDeveloper ADF Business Components REST support
Upcoming JDeveloper ADF Business Components REST supportUpcoming JDeveloper ADF Business Components REST support
Upcoming JDeveloper ADF Business Components REST support
Steven Davelaar
 
ResourceSync Tutorial from Open Repositories 2013
ResourceSync Tutorial from Open Repositories 2013ResourceSync Tutorial from Open Repositories 2013
ResourceSync Tutorial from Open Repositories 2013
Simeon Warner
 
RDFa: an introduction
RDFa: an introductionRDFa: an introduction
RDFa: an introduction
Kai Li
 
ResourceSync Introduction at SWIB13
ResourceSync Introduction at SWIB13ResourceSync Introduction at SWIB13
ResourceSync Introduction at SWIB13
Simeon Warner
 
Edgard Marx, Amrapali Zaveri, Diego Moussallem and Sandro Rautenberg | DBtren...
Edgard Marx, Amrapali Zaveri, Diego Moussallem and Sandro Rautenberg | DBtren...Edgard Marx, Amrapali Zaveri, Diego Moussallem and Sandro Rautenberg | DBtren...
Edgard Marx, Amrapali Zaveri, Diego Moussallem and Sandro Rautenberg | DBtren...
semanticsconference
 
DBtrends Semantics 2016
DBtrends Semantics 2016DBtrends Semantics 2016
DBtrends Semantics 2016
Edgard Marx
 

Similar to Jcdl2013 mklein (20)

NISO ResourceSync Training Session
NISO ResourceSync Training SessionNISO ResourceSync Training Session
NISO ResourceSync Training Session
 
ResourceSync in 24x7
ResourceSync in 24x7ResourceSync in 24x7
ResourceSync in 24x7
 
elasticsearch
elasticsearchelasticsearch
elasticsearch
 
Azure DocumentDB for Healthcare Integration
Azure DocumentDB for Healthcare IntegrationAzure DocumentDB for Healthcare Integration
Azure DocumentDB for Healthcare Integration
 
Linked Open Data and Digital Curation (Islandora)
Linked Open Data and Digital Curation (Islandora)Linked Open Data and Digital Curation (Islandora)
Linked Open Data and Digital Curation (Islandora)
 
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...
UKSG webinar: Making Connections - Creating Linked Open Library Data with Nei...
 
ResourceSync - NISO Update Jan 2014
ResourceSync - NISO Update Jan 2014ResourceSync - NISO Update Jan 2014
ResourceSync - NISO Update Jan 2014
 
Resources plugin
Resources pluginResources plugin
Resources plugin
 
RDAP14: Learning to Curate Panel
RDAP14: Learning to Curate Panel RDAP14: Learning to Curate Panel
RDAP14: Learning to Curate Panel
 
Linked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve MeyerLinked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve Meyer
 
Linked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and MuseumsLinked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and Museums
 
Dizajn REST API-ja
Dizajn REST API-jaDizajn REST API-ja
Dizajn REST API-ja
 
Commodity Semantic Search: A Case Study of DiscoverEd
Commodity Semantic Search: A Case Study of DiscoverEdCommodity Semantic Search: A Case Study of DiscoverEd
Commodity Semantic Search: A Case Study of DiscoverEd
 
Describing Theses and Dissertations Using Schema.org
Describing Theses and Dissertations Using Schema.orgDescribing Theses and Dissertations Using Schema.org
Describing Theses and Dissertations Using Schema.org
 
Upcoming JDeveloper ADF Business Components REST support
Upcoming JDeveloper ADF Business Components REST supportUpcoming JDeveloper ADF Business Components REST support
Upcoming JDeveloper ADF Business Components REST support
 
ResourceSync Tutorial from Open Repositories 2013
ResourceSync Tutorial from Open Repositories 2013ResourceSync Tutorial from Open Repositories 2013
ResourceSync Tutorial from Open Repositories 2013
 
RDFa: an introduction
RDFa: an introductionRDFa: an introduction
RDFa: an introduction
 
ResourceSync Introduction at SWIB13
ResourceSync Introduction at SWIB13ResourceSync Introduction at SWIB13
ResourceSync Introduction at SWIB13
 
Edgard Marx, Amrapali Zaveri, Diego Moussallem and Sandro Rautenberg | DBtren...
Edgard Marx, Amrapali Zaveri, Diego Moussallem and Sandro Rautenberg | DBtren...Edgard Marx, Amrapali Zaveri, Diego Moussallem and Sandro Rautenberg | DBtren...
Edgard Marx, Amrapali Zaveri, Diego Moussallem and Sandro Rautenberg | DBtren...
 
DBtrends Semantics 2016
DBtrends Semantics 2016DBtrends Semantics 2016
DBtrends Semantics 2016
 

More from Martin Klein

On the Persistence of Persistent Identifiers of the Scholarly Web
On the Persistence of Persistent Identifiers of the Scholarly WebOn the Persistence of Persistent Identifiers of the Scholarly Web
On the Persistence of Persistent Identifiers of the Scholarly Web
Martin Klein
 
On the Persistence of Persistent Identifiers of the Scholarly Web
 On the Persistence of Persistent Identifiers of the Scholarly Web On the Persistence of Persistent Identifiers of the Scholarly Web
On the Persistence of Persistent Identifiers of the Scholarly Web
Martin Klein
 
An Institutional Perspective to Rescue Scholarly Orphans
An Institutional Perspective to Rescue Scholarly OrphansAn Institutional Perspective to Rescue Scholarly Orphans
An Institutional Perspective to Rescue Scholarly Orphans
Martin Klein
 
Who is Asking - Humans and Machines Experience a Different Scholarly Web
Who is Asking - Humans and Machines  Experience a Different Scholarly WebWho is Asking - Humans and Machines  Experience a Different Scholarly Web
Who is Asking - Humans and Machines Experience a Different Scholarly Web
Martin Klein
 
The Memento Tracer Framework: Balancing Quality and Scalability for Web Arch...
The Memento Tracer Framework: Balancing Quality and Scalability  for Web Arch...The Memento Tracer Framework: Balancing Quality and Scalability  for Web Arch...
The Memento Tracer Framework: Balancing Quality and Scalability for Web Arch...
Martin Klein
 
Memento Tracer An Innovative Approach Towards Balancing Scale and Fidelity f...
Memento Tracer An Innovative Approach Towards Balancing  Scale and Fidelity f...Memento Tracer An Innovative Approach Towards Balancing  Scale and Fidelity f...
Memento Tracer An Innovative Approach Towards Balancing Scale and Fidelity f...
Martin Klein
 
Comparing the Performance of OAI-PMH with ResourceSync
Comparing the Performance of OAI-PMH with ResourceSyncComparing the Performance of OAI-PMH with ResourceSync
Comparing the Performance of OAI-PMH with ResourceSync
Martin Klein
 
Evaluating Memento Service Optimizations
Evaluating Memento Service OptimizationsEvaluating Memento Service Optimizations
Evaluating Memento Service Optimizations
Martin Klein
 
An Institutional Perspective to Rescue Scholarly Orphans
An Institutional Perspective to Rescue Scholarly OrphansAn Institutional Perspective to Rescue Scholarly Orphans
An Institutional Perspective to Rescue Scholarly Orphans
Martin Klein
 
A Vision of the Library’s Role in Archiving Scholarly Artifacts
A Vision of the Library’s Role  in Archiving Scholarly ArtifactsA Vision of the Library’s Role  in Archiving Scholarly Artifacts
A Vision of the Library’s Role in Archiving Scholarly Artifacts
Martin Klein
 
First Steps in Research Data Management Under Constraints of a National Secur...
First Steps in Research Data Management Under Constraints of a National Secur...First Steps in Research Data Management Under Constraints of a National Secur...
First Steps in Research Data Management Under Constraints of a National Secur...
Martin Klein
 
Smart Routing of Memento Requests
Smart Routing of Memento RequestsSmart Routing of Memento Requests
Smart Routing of Memento Requests
Martin Klein
 
Building Event Collections from Crawling Web Archives
Building Event Collections from Crawling Web ArchivesBuilding Event Collections from Crawling Web Archives
Building Event Collections from Crawling Web Archives
Martin Klein
 
A Web-Centric Pipeline for Archiving Scholarly Artifacts
A Web-Centric Pipeline for Archiving Scholarly ArtifactsA Web-Centric Pipeline for Archiving Scholarly Artifacts
A Web-Centric Pipeline for Archiving Scholarly Artifacts
Martin Klein
 
Focused Crawl of Web Archives to Build Event Collections
Focused Crawl of Web Archives to Build Event CollectionsFocused Crawl of Web Archives to Build Event Collections
Focused Crawl of Web Archives to Build Event Collections
Martin Klein
 
Creating Topical Collections: Web Archives vs. Live Web
Creating Topical Collections:Web Archives vs. Live WebCreating Topical Collections:Web Archives vs. Live Web
Creating Topical Collections: Web Archives vs. Live Web
Martin Klein
 
Robust Linking to Web Resources
Robust Linking to Web ResourcesRobust Linking to Web Resources
Robust Linking to Web Resources
Martin Klein
 
Signposting for Repositories
Signposting for RepositoriesSignposting for Repositories
Signposting for Repositories
Martin Klein
 
Discovering Scholarly Orphans Using ORCID
Discovering Scholarly Orphans Using ORCIDDiscovering Scholarly Orphans Using ORCID
Discovering Scholarly Orphans Using ORCID
Martin Klein
 
Using the Memento Framework to Assess Content Drift in Scholarly Communication
Using the Memento Framework to Assess Content Drift in Scholarly CommunicationUsing the Memento Framework to Assess Content Drift in Scholarly Communication
Using the Memento Framework to Assess Content Drift in Scholarly Communication
Martin Klein
 

More from Martin Klein (20)

On the Persistence of Persistent Identifiers of the Scholarly Web
On the Persistence of Persistent Identifiers of the Scholarly WebOn the Persistence of Persistent Identifiers of the Scholarly Web
On the Persistence of Persistent Identifiers of the Scholarly Web
 
On the Persistence of Persistent Identifiers of the Scholarly Web
 On the Persistence of Persistent Identifiers of the Scholarly Web On the Persistence of Persistent Identifiers of the Scholarly Web
On the Persistence of Persistent Identifiers of the Scholarly Web
 
An Institutional Perspective to Rescue Scholarly Orphans
An Institutional Perspective to Rescue Scholarly OrphansAn Institutional Perspective to Rescue Scholarly Orphans
An Institutional Perspective to Rescue Scholarly Orphans
 
Who is Asking - Humans and Machines Experience a Different Scholarly Web
Who is Asking - Humans and Machines  Experience a Different Scholarly WebWho is Asking - Humans and Machines  Experience a Different Scholarly Web
Who is Asking - Humans and Machines Experience a Different Scholarly Web
 
The Memento Tracer Framework: Balancing Quality and Scalability for Web Arch...
The Memento Tracer Framework: Balancing Quality and Scalability  for Web Arch...The Memento Tracer Framework: Balancing Quality and Scalability  for Web Arch...
The Memento Tracer Framework: Balancing Quality and Scalability for Web Arch...
 
Memento Tracer An Innovative Approach Towards Balancing Scale and Fidelity f...
Memento Tracer An Innovative Approach Towards Balancing  Scale and Fidelity f...Memento Tracer An Innovative Approach Towards Balancing  Scale and Fidelity f...
Memento Tracer An Innovative Approach Towards Balancing Scale and Fidelity f...
 
Comparing the Performance of OAI-PMH with ResourceSync
Comparing the Performance of OAI-PMH with ResourceSyncComparing the Performance of OAI-PMH with ResourceSync
Comparing the Performance of OAI-PMH with ResourceSync
 
Evaluating Memento Service Optimizations
Evaluating Memento Service OptimizationsEvaluating Memento Service Optimizations
Evaluating Memento Service Optimizations
 
An Institutional Perspective to Rescue Scholarly Orphans
An Institutional Perspective to Rescue Scholarly OrphansAn Institutional Perspective to Rescue Scholarly Orphans
An Institutional Perspective to Rescue Scholarly Orphans
 
A Vision of the Library’s Role in Archiving Scholarly Artifacts
A Vision of the Library’s Role  in Archiving Scholarly ArtifactsA Vision of the Library’s Role  in Archiving Scholarly Artifacts
A Vision of the Library’s Role in Archiving Scholarly Artifacts
 
First Steps in Research Data Management Under Constraints of a National Secur...
First Steps in Research Data Management Under Constraints of a National Secur...First Steps in Research Data Management Under Constraints of a National Secur...
First Steps in Research Data Management Under Constraints of a National Secur...
 
Smart Routing of Memento Requests
Smart Routing of Memento RequestsSmart Routing of Memento Requests
Smart Routing of Memento Requests
 
Building Event Collections from Crawling Web Archives
Building Event Collections from Crawling Web ArchivesBuilding Event Collections from Crawling Web Archives
Building Event Collections from Crawling Web Archives
 
A Web-Centric Pipeline for Archiving Scholarly Artifacts
A Web-Centric Pipeline for Archiving Scholarly ArtifactsA Web-Centric Pipeline for Archiving Scholarly Artifacts
A Web-Centric Pipeline for Archiving Scholarly Artifacts
 
Focused Crawl of Web Archives to Build Event Collections
Focused Crawl of Web Archives to Build Event CollectionsFocused Crawl of Web Archives to Build Event Collections
Focused Crawl of Web Archives to Build Event Collections
 
Creating Topical Collections: Web Archives vs. Live Web
Creating Topical Collections:Web Archives vs. Live WebCreating Topical Collections:Web Archives vs. Live Web
Creating Topical Collections: Web Archives vs. Live Web
 
Robust Linking to Web Resources
Robust Linking to Web ResourcesRobust Linking to Web Resources
Robust Linking to Web Resources
 
Signposting for Repositories
Signposting for RepositoriesSignposting for Repositories
Signposting for Repositories
 
Discovering Scholarly Orphans Using ORCID
Discovering Scholarly Orphans Using ORCIDDiscovering Scholarly Orphans Using ORCID
Discovering Scholarly Orphans Using ORCID
 
Using the Memento Framework to Assess Content Drift in Scholarly Communication
Using the Memento Framework to Assess Content Drift in Scholarly CommunicationUsing the Memento Framework to Assess Content Drift in Scholarly Communication
Using the Memento Framework to Assess Content Drift in Scholarly Communication
 

Recently uploaded

GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
dbms calicut university B. sc Cs 4th sem.pdf
dbms  calicut university B. sc Cs 4th sem.pdfdbms  calicut university B. sc Cs 4th sem.pdf
dbms calicut university B. sc Cs 4th sem.pdf
Shinana2
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
alexjohnson7307
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Public CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptxPublic CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptx
marufrahmanstratejm
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Precisely
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
Edge AI and Vision Alliance
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
SAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloudSAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloud
maazsz111
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
Hiike
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
Alex Pruden
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 

Recently uploaded (20)

GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
dbms calicut university B. sc Cs 4th sem.pdf
dbms  calicut university B. sc Cs 4th sem.pdfdbms  calicut university B. sc Cs 4th sem.pdf
dbms calicut university B. sc Cs 4th sem.pdf
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Public CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptxPublic CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptx
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
SAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloudSAP S/4 HANA sourcing and procurement to Public cloud
SAP S/4 HANA sourcing and procurement to Public cloud
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 

Jcdl2013 mklein

  • 1. JCDL 2013 July 24th Indianapolis, IN 1 Martin Klein @mart1nkle1n martinklein0815@gmail.com Herbert Van de Sompel @hvdsomp hvdsomp@gmail.com http://www.openarchives.org/rs/ Extending Sitemaps for ResourceSync
  • 2. JCDL 2013 July 24th Indianapolis, IN 2 ResourceSync Core Team
  • 3. JCDL 2013 July 24th Indianapolis, IN 3 ResourceSync Technical Group JISC Richard Jones Graham Klyne Stuart Lewis OCLC Jeff Young LOCKSS David Rosenthal RedHat Christian Sadilek Ex Libris Inc. Shlomo Sanders Library of Congress Kevin Ford
  • 4. JCDL 2013 July 24th Indianapolis, IN 4 Synchronize • Web resources o things with a URI that can be dereferenced • many/few • big/small • fast/slow What • Keep “in sync” • Destination (client) follows changes at a Source (server) over time • Keep copies on different systems the same
  • 5. JCDL 2013 July 24th Indianapolis, IN 5 Two ResourceSync Capabilities Resource List Lists resources subject to synchronization Change List Lists changes to resources subject to synchronization • Allow Destinations to obtain current resources • RequiresURI • Allow Destination to verify accuracy of sync’ed content • Requireslastmodand fixity information • Allow Source to include references to additional content • Requires inclusion of links
  • 6. JCDL 2013 July 24th Indianapolis, IN 6 Entrance…. Sitemaps • Resource List is an inventory – so is a Sitemap • Low barrier of adoption • Ack’ed by Google, Yahoo!, Bing
  • 7. JCDL 2013 July 24th Indianapolis, IN 7 <loc>http://example.com/res1</loc> <lastmod>2013-07-24-T09:00:00Z</lastmod> <url> </url> <url> … </url> <urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9”> </urlset> Sitemap Format
  • 8. JCDL 2013 July 24th Indianapolis, IN 8 <urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9”> --- root level --- document info, lastmod, links <url> --- resource level --- fixity, change type, and other resource info, links <loc>http://example.com/res1</loc> <lastmod>2013-07-24T09:00:00Z</lastmod> </url> <url> … </url> </urlset> ResourceSync Sitemap Extensions
  • 9. JCDL 2013 July 24th Indianapolis, IN 9 Testing ResourceSync Sitemap Extensions Series of informal experiments 1. Enhance Sitemaps with attributes and elements 2. Submit Sitemapsto Google’s Webmaster Tool 3. Evaluate immediate feedback 4. Check Google index Concerns: 1. Rejection ofResourceSync documents due to a. Added elements and attributes on root level b. Added elements and attributes on resource level 2. Unwanted indexing of URIs from links vs. <loc>
  • 10. JCDL 2013 July 24th Indianapolis, IN 10 <urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9” xmlns:rs="http://www.openarchives.org/rs/terms/”> <rs:meta capability=”resourcelist” modified=”2013-07-24-T11:00:00Z"/> <url> <loc>http://example.com/res1</loc> <lastmod>2013-07-24T09:00:00Z</lastmod> </url> </urlset> Sitemap Extensions Test #1 Inclusion of elements and attributes at root level to convey: • Type of capability • Last modification date
  • 11. JCDL 2013 July 24th Indianapolis, IN 11 <urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9” xmlns:rs="http://www.openarchives.org/rs/terms/”> <url> <loc>http://example.com/res1</loc> <lastmodrs:change=”updated">2013-07-24T09:00:00Z</lastmod> <rs:fixity type=“md5”>a2f29dklfgj9823lksdf90sfkd</rs:fixity> <rs:mimetype>text/html</rs:mimetype> </url> </urlset> Sitemap Extensions Test #2 Inclusion of elements and attributes at resource level to convey: • Change type • Metadata
  • 12. JCDL 2013 July 24th Indianapolis, IN 12 <urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9” xmlns:rs="http://www.openarchives.org/rs/terms/”> <rs:linkrel=”resourcesync” href=”http://example.com/capabilitylist.xml"/> <rs:linkrel=”describedby” href=”http://example.com/info-about-source.xml"/> <url> <loc>http://example.com/res1</loc> <lastmod>2013-07-24T09:00:00Z</lastmod> </url> </urlset> Sitemap Extensions Test #3 Inclusion of links at root level to: • Navigate through the framework • Point at misc documents
  • 13. JCDL 2013 July 24th Indianapolis, IN 13 <urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9” xmlns:rs="http://www.openarchives.org/rs/terms/”> <url> <loc>http://example.com/res1</loc> <lastmod>2013-07-24T09:00:00Z</lastmod> <rs:linkrel="duplicate" href="http://mirror.example.com/res1"/> <rs:linkrel="http://www.openarchives.org/rs/terms/patch” href="http://example.com/res1-json-patch" type="application/json-patch"/> </url> </urlset> Sitemap Extensions Test #4 Inclusion of links at resource level to: • Point to related resources documents
  • 14. JCDL 2013 July 24th Indianapolis, IN 14 Results - Sitemap Extensions Test #4 As expected: 1. Child elements tolerated 2. Google indexes URI within <loc> Unintended consequences: 3. Google indexes URIs within <rs:link> 2 & 3 together is not desired e.g., • When mirror location is provided, URI in <rs:link> should and URI in <loc> should not be indexed • URI in <rs:link> points at partial content
  • 15. JCDL 2013 July 24th Indianapolis, IN 15 <urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9” xmlns:rs="http://www.openarchives.org/rs/terms/”> <url> <loc>http://example.com/res1</loc> <lastmod>2013-07-24T09:00:00Z</lastmod> <rs:linkrel="duplicate" href="http://mirror.example.com/res1"/> <rs:linkrel="http://www.openarchives.org/rs/terms/patch” href="http://example.com/res1-json-patch" type="application/json-patch"/> </url> </urlset> Sitemap Extensions Test #4 Inclusion of links at resource level to: • Point to related resources documents
  • 16. JCDL 2013 July 24th Indianapolis, IN 16 <urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9” xmlns:rs="http://www.openarchives.org/rs/terms/”> <rs:lnrel=”resourcesync” href=”http://example.com/capabilitylist.xml"/> <rs:mdcapability=”changelist” modified=” 2013-07-24-T11:00:00Z"/> <url> <loc>http://example.com/res1</loc> <lastmod>2013-07-24T09:00:00Z</lastmod> <rs:mdchange=”updated” type=”text/html” hash=”md5:a2f94c567f9b370c43fb1188f1f46330”/> <rs:lnrel=”duplicate” href=”http://mirror.example.com/res1"/> </url> </urlset> Summary
  • 17. JCDL 2013 July 24th Indianapolis, IN 17 http://www.openarchives.org/rs/
  • 18. JCDL 2013 July 24th Indianapolis, IN 18 Martin Klein @mart1nkle1n martinklein0815@gmail.com Herbert Van de Sompel @hvdsomp hvdsomp@gmail.com http://www.openarchives.org/rs/ Extending Sitemaps for ResourceSync Thank you!