Web Observatories, Data
Analytics and Archives
Professor Dame Wendy Hall
University of Southampton, UK and Kluge Chair, Library
of Congress
16th June 2016
Library of Congress Web Archives
Over 18 million resources
available via the LOC website
Books, images, drawings,
newspapers, and websites
Over 11 thousand archived
websites supported by the
Internet Archive and counting
Library of Congress API
LOC exposes their resource data via
API to enable services to be built on top
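As a rough illustration of how a service might address such an API, the sketch below only builds a request URL and does not contact the network. It assumes that loc.gov returns JSON when the `fo=json` query parameter is supplied; the `websites` path, the search term and the helper name `loc_query_url` are illustrative choices, not taken from the slides.

```python
from urllib.parse import urlencode

# Assumption: loc.gov serves JSON when fo=json is added to a request.
LOC_BASE = "https://www.loc.gov"


def loc_query_url(path: str, query: str) -> str:
    """Build a loc.gov request URL that asks for a JSON response."""
    params = {"q": query, "fo": "json"}
    return f"{LOC_BASE}/{path.strip('/')}/?{urlencode(params)}"


# Hypothetical path and search term, for illustration only.
url = loc_query_url("websites", "elections")
print(url)
```

A service built on top would fetch this URL and parse the response body as JSON.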
Building Web Observatory Services on
the LOC website
The Web Observatory uses schema.org vocab
to describe different resources
• simple, lightweight metadata (URL, name,
description, author)
The Web Observatory lens can be added to
existing repositories and data catalogues
• helping make them discoverable across Web
Observatories
Enriching LOC Resources with Schema.org
LOC API (JSON) → WO Schema.org (JSON-LD)
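The JSON to JSON-LD step can be sketched as a small mapping function. This is a minimal sketch, not the Web Observatory's actual converter: the input field names (`url`, `title`, `description`, `contributor`), the sample record, and the choice of the schema.org types `WebSite` and `Organization` are all assumptions made for illustration. The output carries the lightweight metadata named on the previous slide (URL, name, description, author).

```python
import json


def first(value):
    """Return the first entry of a list-valued field, or the value itself."""
    if isinstance(value, list):
        return value[0] if value else None
    return value


def to_schema_org(record: dict) -> dict:
    """Map a LOC-style record (assumed field names) to schema.org JSON-LD."""
    doc = {
        "@context": "https://schema.org",
        "@type": "WebSite",  # assumption: the resource is an archived website
        "url": record.get("url"),
        "name": first(record.get("title")),
        "description": first(record.get("description")),
    }
    author = first(record.get("contributor"))
    if author:
        # Assumption: contributors are treated as organisations.
        doc["author"] = {"@type": "Organization", "name": author}
    # Drop keys that have no value so the description stays lightweight.
    return {key: value for key, value in doc.items() if value is not None}


# Hypothetical record, not real LOC data.
sample = {
    "url": "https://example.org/archive/item-1",
    "title": "Example archived website",
    "description": ["A sample description."],
    "contributor": ["Example Contributor"],
}

jsonld = to_schema_org(sample)
print(json.dumps(jsonld, indent=2))
```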
Archives Unleashed 2.0 – used datasets
found through the Observatory
Archives Unleashed 2.0 – the results can be
made accessible through the Observatory
Studying the Web across continents
• Astronomers obtain a very high
resolution picture of the sky from small
telescopes a long distance apart.
• Many web research labs, contributing
across the globe, help build an accurate
picture of human activity at planetary
scale.
– transcending parochial social, political,
economic, legal interpretations
understanding web evolution:
- observation
- experimentation
Web Observatory Infrastructure today
• An initiative supported by the Web Science Trust
– http://www.webscience.org
• A growing number of sites using common metadata for
hosted datasets and apps (schema.org)
– http://index.webobservatory.org
• Some WO sites use purpose-built software that:
– Allows their community members to list and share
public or private datasets and apps
– Provides for discovery and access to listed datasets and
apps across WO sites
– Provides APIs for app development using listed datasets:
– http://webobservatory.soton.ac.uk
Thanassis Tiropanis – University of Southampton – t.tiropanis@southampton.ac.uk
index.webobservatory.org
Follow us: @wo_team
Contact us: hello@webobservatory.org
Web Observatory Infrastructure tomorrow
• A distributed catalogue across WO sites
• WO sites use common technical standards for
– Describing locally or remotely hosted datasets and apps
– Accessing datasets and apps across sites
– APIs for developing apps and visualisations
– Meaningful terms and conditions
– Implementing ethical practice
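One way to picture a distributed catalogue is a search that runs over schema.org-style listings published by several sites. The sketch below keeps everything in memory; the site names, the entries and the `search` helper are hypothetical, and a real deployment would retrieve each site's listing over its own API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    site: str
    kind: str  # schema.org type, e.g. "Dataset" or "SoftwareApplication"
    name: str
    url: str


def search(catalogues: dict, term: str) -> list:
    """Return entries from every site whose name or description mentions term."""
    needle = term.lower()
    hits = []
    for site, entries in catalogues.items():
        for item in entries:
            text = f"{item.get('name', '')} {item.get('description', '')}".lower()
            if needle in text:
                hits.append(Entry(site, item.get("@type", "Thing"),
                                  item.get("name", ""), item.get("url", "")))
    # Sort so results are stable regardless of the order sites were listed in.
    return sorted(hits, key=lambda hit: (hit.site, hit.name))


# Hypothetical listings from two Web Observatory sites.
CATALOGUES = {
    "site-a.example": [
        {"@type": "Dataset", "name": "Wikipedia edit stream",
         "description": "Sample of page edits",
         "url": "https://site-a.example/datasets/1"},
        {"@type": "Dataset", "name": "Archived news pages",
         "description": "Crawled news sites",
         "url": "https://site-a.example/datasets/2"},
    ],
    "site-b.example": [
        {"@type": "SoftwareApplication", "name": "Edit visualiser",
         "description": "Charts built on Wikipedia edits",
         "url": "https://site-b.example/apps/7"},
    ],
}

hits = search(CATALOGUES, "wikipedia")
for hit in hits:
    print(hit.site, hit.kind, hit.name)
```

Because every site describes its datasets and apps with the same vocabulary, the search needs no per-site logic.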
The Web Observatory: A Middle Layer for Broad Data. (2014). Tiropanis, T, Hall, W, Hendler, J A, De Larinaga, C. Big Data, 2(3).
Datasets
Apps
Who is editing what in Wikipedia?
The Web of Observatories
[Diagram: five nodes, each labelled "Web Observatory"]
© Web Science Trust 2013
Observing the Web
How do we catalogue Observatories and content?
• https://www.w3.org/wiki/WebSchemas/SchemaDotOrgProposals
• https://www.w3.org/wiki/WebSchemas/WebObsSchema
• We’re building a crawler and a search engine
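A crawler of this kind would need, at minimum, to pull schema.org descriptions out of the pages it visits. The sketch below shows one such step using only the Python standard library: it collects the contents of `<script type="application/ld+json">` blocks. The page markup and the `extract_jsonld` helper are hypothetical; this is not the Web Observatory crawler itself.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Collect parsed JSON-LD blocks embedded in an HTML page."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.items = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                self.items.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # skip malformed blocks rather than abort the crawl


def extract_jsonld(html: str) -> list:
    parser = JsonLdExtractor()
    parser.feed(html)
    parser.close()
    return parser.items


# Hypothetical observatory page, for illustration only.
PAGE = """<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "Dataset",
 "name": "Example dataset", "url": "https://observatory.example/datasets/42"}
</script>
</head><body><p>Hypothetical observatory page</p></body></html>"""

items = extract_jsonld(PAGE)
print(items)
```

The extracted descriptions could then be fed to a search index keyed on fields such as `name` and `url`.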
Observing the Web
The ambition is to map the digital
universe!
webscience.org/web-observatory
Digital Vellum
The Self Archiving Web
Who is archiving what?

Professor Dame Wendy Hall - Saving the Web
