SlideShare a Scribd company logo
1 of 32
Data on the Web
By Alejandra Garcia Rojas
@aletapool
http://lnked.in/alegrm
25/05/16Women in Digital
Outline
• History of the Web
• Big Data
• Open Data
• Linked Data
25/05/16Women in Digital
Web History
25/05/16Women in Digital
Web History
25/05/16Women in Digital
Web History
25/05/16Women in Digital
Web History
25/05/16Women in Digital
Web History
25/05/16Women in Digital
Web History
25/05/16Women in Digital
Web History
25/05/16Women in Digital
Web History
25/05/16Women in Digital
Web History
25/05/16Women in Digital
Web History
25/05/16Women in Digital
Web Standards
25/05/16Women in Digital
https://www.w3.org/TR/tr-date-stds.html
Web Design and Applications HTML, CSS, SVG, Ajax, and other
technologies for Web Applications
Web Architecture URIs and HTTP
Semantic Web RDF, SPARQL, OWL, and SKOS.
XML Technology XML, XML Namespaces, XML Schema,
XSLT, Efficient XML Interchange (EXI), and
other related standards.
Web of Services HTTP, XML, SOAP, WSDL, SPARQL, and
others.
Browsers and Authoring Tools
Web Generations
25/05/16Women in Digital
250 000 sites
45 million users
1996
80 million sites
1 billion users
2006
800 million sites
3 billion users
2016
Files, Documents
Keyword Search
Social
Networks
Semantic
Search
Natural
Language
Search
2026
IoT
User
Generated
Content
Streaming
Ubiquity
Personalized
Content
http://www.quantumrun.com/future-timeline/2026
3.9 billion users
Rows of servers inside a Facebook data center in North Carolina. Photo by
Rich Miller
25/05/16Women in Digital
Internet Traffic Forecast
25/05/16Women in Digital
Cisco Visual Networking Index: Forecast and Methodology, 2014-2019
White Paper
25/05/16Women in Digital
25/05/16Women in Digital
Big Data in Use
25/05/16Women in Digital
Customer
experience
Brand
perception
Target segment
identification
Demand Forecast
Supply Chain
Product Design
Risk management
Fraud detection
Research
Real time data
Health Care
Diagnosis
Icons made by Freepik from www.flaticon.com licensed by Creative Commons BY 3.0
Open Data
Open means anyone can freely access, use,
modify, and share for any purpose (subject,
at most, to requirements that preserve
provenance and openness)
25/05/16Women in Digital
Source: http://opendefinition.org/od/2.1/en/
Open Government Data
• Transparency
• Public service
improvement
• Economic and Social
Value
• Open data != Free
data
25/05/16Women in Digital
Open Data: The Next Phase in the Technology Revolution
BY CASEY COLEMAN – AUGUST 27, 2013
POSTED IN: EMERGING TECHNOLOGY, GOVERNMENT,
INNOVATION, UNCATEGORIZED
Creating value trough
Open Data
25/05/16Women in Digital
Where is the data?
• DBpedia
• Government portals
o UK, US
o https://opendata.swiss launched last year (2015)
o Open Data Barometer 3rd edition (2016)
• The World Bank
• European Data Portal
• Google Public Data Directory
• Data Portals search, DataHub by Open Knowledge Foundation
• CKAN Instances- http://ckan.org/instances/#
25/05/16Women in Digital
Open Data Switzerland
25/05/16Women in Digital
Switzerland
25/05/16Women in Digital
https://www.wohnungsrechner.ch
Open Data Issues
• Provenance, trust and privacy
• Timeliness, relevancy,
completeness, sufficiency
• Licenses, e.g. for creative
content :
• Public domain (CC0, PDDL)
• Attribution (CC-by ODC-by)
• Attribution & share-alike (CC-by-sa,
ODnL)
• Reusability
25/05/16Women in Digital
https://research.neustar.biz/2014/09/15/riding-
with-the-stars-passenger-privacy-in-the-nyc-
taxicab-dataset/
5 Stars Data
25/05/16Women in Digital
http://5stardata.info/en/
Linked [Open] Data
The Semantic Web isn't just about putting
data on the web. It is about making links, so
that a person or machine can explore the
web of data.
25/05/16Women in Digital
Tim Berners-Lee
https://www.w3.org/DesignIssues/LinkedData.html
Linked Data Cloud
• http://lod-cloud.net/ 25/05/16Women in Digital
Querying the web
SPARQL-LD endpoint:
http://users.ics.forth.gr/~fafalios/
Recipe:
• SPARQL endpoint
o Federated query
• Annotated Website
Ref:
P. Fafalios and Y. Tzitzikas, SPARQL-LD: A SPARQL Extension for Fetching and
Querying Linked Data,14th International Semantic Web Conference (demo paper),
ISWC 2015, Bethlehem, Pennsylvania, USA, October 11-15, 2015.
25/05/16Women in Digital
Thank you!
25/05/16Women in Digital
@aletapool
http://lnked.in/alegrm
SPARQL services
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT DISTINCT ?authorName
(count(DISTINCT ?paper) AS ?numOfPapers)
(count(DISTINCT ?series) AS ?numOfDiffConfs)
WHERE {
SERVICE <http://users.ics.forth.gr/~fafalios> {
SELECT DISTINCT ?authorURI WHERE {
?p <http://purl.org/dc/terms/creator> ?authorURI } }
SERVICE <http://dblp.l3s.de/d2r/sparql> {
?p2 <http://purl.org/dc/elements/1.1/creator> ?authorURI .
?p2 <http://swrc.ontoware.org/ontology#series> ?series }
SERVICE ?authorURI {
?author foaf:name ?authorName .
?paper <http://purl.org/dc/elements/1.1/creator> ?authorURI }
} GROUP BY ?authorName ORDER BY DESC(?numOfPapers)
25/05/16Women in Digital

More Related Content

What's hot

Linked data in the German National Library at the OCLC IFLA round table 2013
Linked data in the German National Library at the OCLC IFLA round table 2013Linked data in the German National Library at the OCLC IFLA round table 2013
Linked data in the German National Library at the OCLC IFLA round table 2013Lars G. Svensson
 
A Linked Data Dataset for Madrid Transport Authority's Datasets
A Linked Data Dataset for Madrid Transport Authority's DatasetsA Linked Data Dataset for Madrid Transport Authority's Datasets
A Linked Data Dataset for Madrid Transport Authority's DatasetsOscar Corcho
 
Weatherstations - Citizen-Apps and eParticipation as sources for Datajournalism
Weatherstations - Citizen-Apps and eParticipation as sources for DatajournalismWeatherstations - Citizen-Apps and eParticipation as sources for Datajournalism
Weatherstations - Citizen-Apps and eParticipation as sources for DatajournalismLorenz Matzat
 
Zeng marcia ifla-subjectaccesssmartdatadh
Zeng marcia ifla-subjectaccesssmartdatadhZeng marcia ifla-subjectaccesssmartdatadh
Zeng marcia ifla-subjectaccesssmartdatadhMarcia Zeng
 
Persistent identification: supporting digital humanities
Persistent identification: supporting digital humanitiesPersistent identification: supporting digital humanities
Persistent identification: supporting digital humanitiesPACKED vzw
 
Soci 4385 fall2020 slideshare
Soci 4385 fall2020 slideshareSoci 4385 fall2020 slideshare
Soci 4385 fall2020 slideshareholland_uhcl
 
Aleksandar Kapisoda: The semantic approach for tracking scientific publications
Aleksandar Kapisoda: The semantic approach for tracking scientific publicationsAleksandar Kapisoda: The semantic approach for tracking scientific publications
Aleksandar Kapisoda: The semantic approach for tracking scientific publicationsSemantic Web Company
 
Linked Statistical Data 101
Linked Statistical Data 101Linked Statistical Data 101
Linked Statistical Data 101Oscar Corcho
 
GLAM Rocks! London Semantic Web Meetup
GLAM Rocks! London Semantic Web MeetupGLAM Rocks! London Semantic Web Meetup
GLAM Rocks! London Semantic Web MeetupAdrian Stevenson
 
Estermann wd glam-intro_20181204
Estermann wd glam-intro_20181204Estermann wd glam-intro_20181204
Estermann wd glam-intro_20181204Beat Estermann
 
2014 Overview of the activities of the Brussels Data Science Community
2014 Overview of the activities of the Brussels Data Science Community2014 Overview of the activities of the Brussels Data Science Community
2014 Overview of the activities of the Brussels Data Science CommunityDigitYser
 
Wikidata Introductory Workshop
Wikidata Introductory WorkshopWikidata Introductory Workshop
Wikidata Introductory WorkshopBeat Estermann
 
Wikidata Introduction, Linked Digital Future Initiative, August 2019
Wikidata Introduction, Linked Digital Future Initiative, August 2019Wikidata Introduction, Linked Digital Future Initiative, August 2019
Wikidata Introduction, Linked Digital Future Initiative, August 2019Beat Estermann
 
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...Linked Enterprise Date Services
 
Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...
Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...
Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...Robert H. McDonald
 
Three Linked Data choices for Libraries
Three Linked Data choices for LibrariesThree Linked Data choices for Libraries
Three Linked Data choices for LibrariesRichard Wallis
 
Marc and beyond: 3 Linked Data Choices
 Marc and beyond: 3 Linked Data Choices  Marc and beyond: 3 Linked Data Choices
Marc and beyond: 3 Linked Data Choices Richard Wallis
 
JCDL 2015 Tutorial Opening Slides
JCDL 2015 Tutorial Opening SlidesJCDL 2015 Tutorial Opening Slides
JCDL 2015 Tutorial Opening SlidesRobert H. McDonald
 

What's hot (19)

Linked data in the German National Library at the OCLC IFLA round table 2013
Linked data in the German National Library at the OCLC IFLA round table 2013Linked data in the German National Library at the OCLC IFLA round table 2013
Linked data in the German National Library at the OCLC IFLA round table 2013
 
A Linked Data Dataset for Madrid Transport Authority's Datasets
A Linked Data Dataset for Madrid Transport Authority's DatasetsA Linked Data Dataset for Madrid Transport Authority's Datasets
A Linked Data Dataset for Madrid Transport Authority's Datasets
 
Weatherstations - Citizen-Apps and eParticipation as sources for Datajournalism
Weatherstations - Citizen-Apps and eParticipation as sources for DatajournalismWeatherstations - Citizen-Apps and eParticipation as sources for Datajournalism
Weatherstations - Citizen-Apps and eParticipation as sources for Datajournalism
 
Zeng marcia ifla-subjectaccesssmartdatadh
Zeng marcia ifla-subjectaccesssmartdatadhZeng marcia ifla-subjectaccesssmartdatadh
Zeng marcia ifla-subjectaccesssmartdatadh
 
Persistent identification: supporting digital humanities
Persistent identification: supporting digital humanitiesPersistent identification: supporting digital humanities
Persistent identification: supporting digital humanities
 
Soci 4385 fall2020 slideshare
Soci 4385 fall2020 slideshareSoci 4385 fall2020 slideshare
Soci 4385 fall2020 slideshare
 
Digital Research Support by Stella Wisdom, for 20th & 21st Century Collection...
Digital Research Support by Stella Wisdom, for 20th & 21st Century Collection...Digital Research Support by Stella Wisdom, for 20th & 21st Century Collection...
Digital Research Support by Stella Wisdom, for 20th & 21st Century Collection...
 
Aleksandar Kapisoda: The semantic approach for tracking scientific publications
Aleksandar Kapisoda: The semantic approach for tracking scientific publicationsAleksandar Kapisoda: The semantic approach for tracking scientific publications
Aleksandar Kapisoda: The semantic approach for tracking scientific publications
 
Linked Statistical Data 101
Linked Statistical Data 101Linked Statistical Data 101
Linked Statistical Data 101
 
GLAM Rocks! London Semantic Web Meetup
GLAM Rocks! London Semantic Web MeetupGLAM Rocks! London Semantic Web Meetup
GLAM Rocks! London Semantic Web Meetup
 
Estermann wd glam-intro_20181204
Estermann wd glam-intro_20181204Estermann wd glam-intro_20181204
Estermann wd glam-intro_20181204
 
2014 Overview of the activities of the Brussels Data Science Community
2014 Overview of the activities of the Brussels Data Science Community2014 Overview of the activities of the Brussels Data Science Community
2014 Overview of the activities of the Brussels Data Science Community
 
Wikidata Introductory Workshop
Wikidata Introductory WorkshopWikidata Introductory Workshop
Wikidata Introductory Workshop
 
Wikidata Introduction, Linked Digital Future Initiative, August 2019
Wikidata Introduction, Linked Digital Future Initiative, August 2019Wikidata Introduction, Linked Digital Future Initiative, August 2019
Wikidata Introduction, Linked Digital Future Initiative, August 2019
 
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
 
Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...
Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...
Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...
 
Three Linked Data choices for Libraries
Three Linked Data choices for LibrariesThree Linked Data choices for Libraries
Three Linked Data choices for Libraries
 
Marc and beyond: 3 Linked Data Choices
 Marc and beyond: 3 Linked Data Choices  Marc and beyond: 3 Linked Data Choices
Marc and beyond: 3 Linked Data Choices
 
JCDL 2015 Tutorial Opening Slides
JCDL 2015 Tutorial Opening SlidesJCDL 2015 Tutorial Opening Slides
JCDL 2015 Tutorial Opening Slides
 

Viewers also liked

All you need to know about Implementing and Managing Change - "The People Fac...
All you need to know about Implementing and Managing Change - "The People Fac...All you need to know about Implementing and Managing Change - "The People Fac...
All you need to know about Implementing and Managing Change - "The People Fac...George Vorster
 
عينة البحث وأدوات جمع البيانات 6
عينة البحث وأدوات جمع البيانات  6عينة البحث وأدوات جمع البيانات  6
عينة البحث وأدوات جمع البيانات 6Dr. Magdy Youness
 
コンポーネントを”つなぐ”時代へ Web&Mobileアプリ開発最新動向
コンポーネントを”つなぐ”時代へ Web&Mobileアプリ開発最新動向コンポーネントを”つなぐ”時代へ Web&Mobileアプリ開発最新動向
コンポーネントを”つなぐ”時代へ Web&Mobileアプリ開発最新動向Mitch Okamoto
 
Piezoelectricity Mk
Piezoelectricity MkPiezoelectricity Mk
Piezoelectricity MkNISEnet
 
Ukraine, khust. karina
Ukraine, khust. karinaUkraine, khust. karina
Ukraine, khust. karinaNatalia Orlyk
 
מצגת - כנס מס ריבוי דירות - 27.1.17
מצגת - כנס מס ריבוי דירות - 27.1.17מצגת - כנס מס ריבוי דירות - 27.1.17
מצגת - כנס מס ריבוי דירות - 27.1.17Dorit Gabay
 

Viewers also liked (20)

Chern-Simons Theory
Chern-Simons TheoryChern-Simons Theory
Chern-Simons Theory
 
All you need to know about Implementing and Managing Change - "The People Fac...
All you need to know about Implementing and Managing Change - "The People Fac...All you need to know about Implementing and Managing Change - "The People Fac...
All you need to know about Implementing and Managing Change - "The People Fac...
 
عينة البحث وأدوات جمع البيانات 6
عينة البحث وأدوات جمع البيانات  6عينة البحث وأدوات جمع البيانات  6
عينة البحث وأدوات جمع البيانات 6
 
Appium
AppiumAppium
Appium
 
Why SOFRI?
Why SOFRI?Why SOFRI?
Why SOFRI?
 
Traslado de residuos
Traslado de residuosTraslado de residuos
Traslado de residuos
 
حكاية
حكايةحكاية
حكاية
 
コンポーネントを”つなぐ”時代へ Web&Mobileアプリ開発最新動向
コンポーネントを”つなぐ”時代へ Web&Mobileアプリ開発最新動向コンポーネントを”つなぐ”時代へ Web&Mobileアプリ開発最新動向
コンポーネントを”つなぐ”時代へ Web&Mobileアプリ開発最新動向
 
Vírus e viroses
Vírus e virosesVírus e viroses
Vírus e viroses
 
DUNHAM
DUNHAMDUNHAM
DUNHAM
 
Slide sobre função
Slide sobre funçãoSlide sobre função
Slide sobre função
 
HT
HTHT
HT
 
Types teeth
Types teethTypes teeth
Types teeth
 
Piezoelectricity Mk
Piezoelectricity MkPiezoelectricity Mk
Piezoelectricity Mk
 
PM Foundation
PM FoundationPM Foundation
PM Foundation
 
Painting
PaintingPainting
Painting
 
BY CC
BY CCBY CC
BY CC
 
Ukraine, khust. karina
Ukraine, khust. karinaUkraine, khust. karina
Ukraine, khust. karina
 
Leadership
LeadershipLeadership
Leadership
 
מצגת - כנס מס ריבוי דירות - 27.1.17
מצגת - כנס מס ריבוי דירות - 27.1.17מצגת - כנס מס ריבוי דירות - 27.1.17
מצגת - כנס מס ריבוי דירות - 27.1.17
 

Similar to Data on the web

Methodological Guidelines for Publishing Linked Data
Methodological Guidelines for Publishing Linked DataMethodological Guidelines for Publishing Linked Data
Methodological Guidelines for Publishing Linked DataBoris Villazón-Terrazas
 
How we can understand the world through open data
How we can understand the world through open dataHow we can understand the world through open data
How we can understand the world through open dataMarie Gustafsson Friberger
 
Top ten-dències tecnològiques
Top ten-dències tecnològiquesTop ten-dències tecnològiques
Top ten-dències tecnològiquesRicard de la Vega
 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesRichard Wallis
 
Contextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data FoundationContextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data FoundationRichard Wallis
 
Data Science in 2016: Moving Up
Data Science in 2016: Moving UpData Science in 2016: Moving Up
Data Science in 2016: Moving UpPaco Nathan
 
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015Big Data Spain
 
Data Con LA 2018 - From the Panama Papers by Mark Quinsland
Data Con LA 2018 - From the Panama Papers by Mark QuinslandData Con LA 2018 - From the Panama Papers by Mark Quinsland
Data Con LA 2018 - From the Panama Papers by Mark QuinslandData Con LA
 
The Web of Data is Our Opportunity
The Web of Data is Our OpportunityThe Web of Data is Our Opportunity
The Web of Data is Our OpportunityRichard Wallis
 
Linked Data for Digital Humanities - Big Data Summerschool
Linked Data for Digital Humanities - Big Data SummerschoolLinked Data for Digital Humanities - Big Data Summerschool
Linked Data for Digital Humanities - Big Data SummerschoolVictor de Boer
 
An open data story
An open data storyAn open data story
An open data storyProgCity
 
RDFa Introductory Course Session 4/4 When RDFa
RDFa Introductory Course Session 4/4 When RDFaRDFa Introductory Course Session 4/4 When RDFa
RDFa Introductory Course Session 4/4 When RDFaPlatypus
 
Schema.org: Where did that come from!
Schema.org: Where did that come from!Schema.org: Where did that come from!
Schema.org: Where did that come from!Richard Wallis
 
Radically Open at the National Archives
Radically Open at the National ArchivesRadically Open at the National Archives
Radically Open at the National ArchivesJon Voss
 
Research into Practice case study 2: Library linked data implementations an...
	Research into Practice case study 2:  Library linked data implementations an...	Research into Practice case study 2:  Library linked data implementations an...
Research into Practice case study 2: Library linked data implementations an...Hazel Hall
 
'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich
'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich
'Your Scholarship. Our World. Preserving the Long Tail' by Vicky ReichEDINA, University of Edinburgh
 

Similar to Data on the web (20)

Methodological Guidelines for Publishing Linked Data
Methodological Guidelines for Publishing Linked DataMethodological Guidelines for Publishing Linked Data
Methodological Guidelines for Publishing Linked Data
 
How we can understand the world through open data
How we can understand the world through open dataHow we can understand the world through open data
How we can understand the world through open data
 
Top ten-dències tecnològiques
Top ten-dències tecnològiquesTop ten-dències tecnològiques
Top ten-dències tecnològiques
 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of Entities
 
Contextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data FoundationContextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data Foundation
 
Semantic Puzzle
Semantic PuzzleSemantic Puzzle
Semantic Puzzle
 
Data Science in 2016: Moving Up
Data Science in 2016: Moving UpData Science in 2016: Moving Up
Data Science in 2016: Moving Up
 
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
 
Data Con LA 2018 - From the Panama Papers by Mark Quinsland
Data Con LA 2018 - From the Panama Papers by Mark QuinslandData Con LA 2018 - From the Panama Papers by Mark Quinsland
Data Con LA 2018 - From the Panama Papers by Mark Quinsland
 
The Web of Data is Our Opportunity
The Web of Data is Our OpportunityThe Web of Data is Our Opportunity
The Web of Data is Our Opportunity
 
Linked Data for Digital Humanities - Big Data Summerschool
Linked Data for Digital Humanities - Big Data SummerschoolLinked Data for Digital Humanities - Big Data Summerschool
Linked Data for Digital Humanities - Big Data Summerschool
 
An open data story
An open data storyAn open data story
An open data story
 
When RDFa?
When RDFa?When RDFa?
When RDFa?
 
RDFa Introductory Course Session 4/4 When RDFa
RDFa Introductory Course Session 4/4 When RDFaRDFa Introductory Course Session 4/4 When RDFa
RDFa Introductory Course Session 4/4 When RDFa
 
Schema.org: Where did that come from!
Schema.org: Where did that come from!Schema.org: Where did that come from!
Schema.org: Where did that come from!
 
Radically Open at the National Archives
Radically Open at the National ArchivesRadically Open at the National Archives
Radically Open at the National Archives
 
Here Comes Everything
Here Comes EverythingHere Comes Everything
Here Comes Everything
 
Research into Practice case study 2: Library linked data implementations an...
	Research into Practice case study 2:  Library linked data implementations an...	Research into Practice case study 2:  Library linked data implementations an...
Research into Practice case study 2: Library linked data implementations an...
 
Linked Data past, present and futures
Linked Datapast, present and futuresLinked Datapast, present and futures
Linked Data past, present and futures
 
'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich
'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich
'Your Scholarship. Our World. Preserving the Long Tail' by Vicky Reich
 

Recently uploaded

2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 

Recently uploaded (20)

2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 

Data on the web

  • 1. Data on the Web By Alejandra Garcia Rojas @aletapool http://lnked.in/alegrm 25/05/16Women in Digital
  • 2. Outline • History of the Web • Big Data • Open Data • Linked Data 25/05/16Women in Digital
  • 13. Web Standards 25/05/16Women in Digital https://www.w3.org/TR/tr-date-stds.html Web Design and Applications HTML, CSS, SVG, Ajax, and other technologies for Web Applications Web Architecture URIs and HTTP Semantic Web RDF, SPARQL, OWL, and SKOS. XML Technology XML, XML Namespaces, XML Schema, XSLT, Efficient XML Interchange (EXI), and other related standards. Web of Services HTTP, XML, SOAP, WSDL, SPARQL, and others. Browsers and Authoring Tools
  • 14. Web Generations 25/05/16Women in Digital 250 000 sites 45 million users 1996 80 million sites 1 billion users 2006 800 million sites 3 billion users 2016 Files, Documents Keyword Search Social Networks Semantic Search Natural Language Search 2026 IoT User Generated Content Streaming Ubiquity Personalized Content http://www.quantumrun.com/future-timeline/2026 3.9 billion users
  • 15. Rows of servers inside a Facebook data center in North Carolina. Photo by Rich Miller 25/05/16Women in Digital
  • 16. Internet Traffic Forecast 25/05/16Women in Digital Cisco Visual Networking Index: Forecast and Methodology, 2014-2019 White Paper
  • 19. Big Data in Use 25/05/16Women in Digital Customer experience Brand perception Target segment identification Demand Forecast Supply Chain Product Design Risk management Fraud detection Research Real time data Health Care Diagnosis Icons made by Freepik from www.flaticon.com licensed by Creative Commons BY 3.0
  • 20. Open Data Open means anyone can freely access, use, modify, and share for any purpose (subject, at most, to requirements that preserve provenance and openness) 25/05/16Women in Digital Source: http://opendefinition.org/od/2.1/en/
  • 21. Open Government Data • Transparency • Public service improvement • Economic and Social Value • Open data != Free data 25/05/16Women in Digital Open Data: The Next Phase in the Technology Revolution BY CASEY COLEMAN – AUGUST 27, 2013 POSTED IN: EMERGING TECHNOLOGY, GOVERNMENT, INNOVATION, UNCATEGORIZED
  • 22. Creating value trough Open Data 25/05/16Women in Digital
  • 23. Where is the data? • DBpedia • Government portals o UK, US o https://opendata.swiss launched last year (2015) o Open Data Barometer 3rd edition (2016) • The World Bank • European Data Portal • Google Public Data Directory • Data Portals search, DataHub by Open Knowledge Foundation • CKAN Instances- http://ckan.org/instances/# 25/05/16Women in Digital
  • 26. Open Data Issues • Provenance, trust and privacy • Timeliness, relevancy, completeness, sufficiency • Licenses, e.g. for creative content : • Public domain (CC0, PDDL) • Attribution (CC-by ODC-by) • Attribution & share-alike (CC-by-sa, ODnL) • Reusability 25/05/16Women in Digital https://research.neustar.biz/2014/09/15/riding- with-the-stars-passenger-privacy-in-the-nyc- taxicab-dataset/
  • 27. 5 Stars Data 25/05/16Women in Digital http://5stardata.info/en/
  • 28. Linked [Open] Data The Semantic Web isn't just about putting data on the web. It is about making links, so that a person or machine can explore the web of data. 25/05/16Women in Digital Tim Berners-Lee https://www.w3.org/DesignIssues/LinkedData.html
  • 29. Linked Data Cloud • http://lod-cloud.net/ 25/05/16Women in Digital
  • 30. Querying the web SPARQL-LD endpoint: http://users.ics.forth.gr/~fafalios/ Recipe: • SPARQL endpoint o Federated query • Annotated Website Ref: P. Fafalios and Y. Tzitzikas, SPARQL-LD: A SPARQL Extension for Fetching and Querying Linked Data,14th International Semantic Web Conference (demo paper), ISWC 2015, Bethlehem, Pennsylvania, USA, October 11-15, 2015. 25/05/16Women in Digital
  • 31. Thank you! 25/05/16Women in Digital @aletapool http://lnked.in/alegrm
  • 32. SPARQL services PREFIX foaf: <http://xmlns.com/foaf/0.1/> SELECT DISTINCT ?authorName (count(DISTINCT ?paper) AS ?numOfPapers) (count(DISTINCT ?series) AS ?numOfDiffConfs) WHERE { SERVICE <http://users.ics.forth.gr/~fafalios> { SELECT DISTINCT ?authorURI WHERE { ?p <http://purl.org/dc/terms/creator> ?authorURI } } SERVICE <http://dblp.l3s.de/d2r/sparql> { ?p2 <http://purl.org/dc/elements/1.1/creator> ?authorURI . ?p2 <http://swrc.ontoware.org/ontology#series> ?series } SERVICE ?authorURI { ?author foaf:name ?authorName . ?paper <http://purl.org/dc/elements/1.1/creator> ?authorURI } } GROUP BY ?authorName ORDER BY DESC(?numOfPapers) 25/05/16Women in Digital

Editor's Notes

  1. Say something about me.
  2. Vint Cerf "Founded the Internet Society (ISOC). Now ISOC leader Internet related standards, education, and policy. It is dedicated to ensuring the open development, evolution, and use of the Internet for the benefit of people throughout the world.ISOC continues to serve as the organizational home of the Internet Engineering Task Force (IETF). Tim Berners-Lee Founded the W3C together with MIT.W3C primarily pursues its mission through the creation of Web standards and guidelines designed to ensure long-term growth for the Web.
  3. User generated content: blogs, tags,
  4. How big the data in the internet is? Internet traffic is the flow of data across the Internet. Because of the distributed nature of the Internet, there is no single point of measurement for total Internet traffic
  5. Big data: Digitalization of services. customer activity, analysis Text -> google correction Gestures analysis of images video Likes, dislikes internet of things, geospatial data Social media
  6. Big data concerns Volume, Velocity and Variety … also mentioned veracity and value One time processing is probably not big data problem… Cannot store all this data in a single computer and cannot be processed and Analyzed How to deal with big data = be able to add resources (computers) on the fly -> scale (up vs out) Distributed Data Hadoop Distributed storage (HDFS) NoSQL Distributed computing * Mapreduce - Spark, Kafka, Each computer maps a computation to a single node, and then the algorithm summarizes (reduces) the computation
  7. Costs Benefits of relying on big data US health $300 billion USD/year increasing the efficiency and quality service (McKinsey) Europe $149 billion USD in government administration costs A lot of investment in Big data projects Technology User experience Internet services (Google)- Smart Cities Jobs generation Computer-Science related jobs Innovation being able to deal with big data has open new doors Data-based services Mobile apps Data Science Industry Increasingly connected to the Internet in order to open up new dimensions in production efficiency. Industry 4.0 is used to refer to the fourth industrial revolution, following those of mechanization, industrialization, and automation. Health Evidence based diagnosis Geonome research Success stories: Real time prodction Uber real time rate calculation Forecast: Shortage of big data talents? New possitions as data chief officers that helps to lead Machine Learning momentum – new tools to use algorithms with big data- computer power, deep learning Spark –consuming streams, Machine Learning Data as a service business models
  8. The importance of opening data started in the 1950s with the Open Scientific Data concept with the formation of the World Data Center system with the aim to share Astronomical and Geophysical data. The International Council of Scientific Unions (now the International Council for Science) established several World Data Centers to minimize the risk of data loss and to maximize data accessibility, further recommending in 1955 that data be made available in machine-readable form. Other movements emerged: open source, open hardware, open content and open access wikipedia.com, on Monday 15 January 2001. In 2001, Lawrence Lessig founded Creative Commons   Tim Berners-Lee (TED 2009): “We want raw data, now!”   Lessig was a candidate for the Democratic Party's nomination for President of the United States in the 2016 U.S. presidential election, but withdrew before the primaries.
  9. Benefits Government spending Public service improvement (movability, education, health) Economic and Social Value   Open Data is not necessarily free -> business opportunities by opening the data, make users to pay a service maintenance quality
  10. Evidence of the quantitative impact of re-use of Open Data is measured by means of key indicators:   Direct benefits are monetized benefits that are realized in market transactions in the form of revenues and Gross Value Added (GVA), the number of jobs involved in producing a service or product, and cost savings. Indirect economic benefits are i.e. new goods and services, time savings for users of applications using Open Data, knowledge economy growth, increased efficiency in public services and growth of related markets. The European Commission, within the context of the launch of the wished to obtain further evidence of the quantitative impact of re-use of Public Data Resources. A study was carried out with the aim to collect, assess and aggregate all economic evidence to forecast the benefits of the re-use of Open Data for all 28 European Member States and the European Free Trade Association (EFTA) countries, further referred to as EU 28+, for the period 2016-2020.
  11. Privacy New York City Taxi and Limousine Commission. It contains details about every taxi ride (yellow cabs) in New York in 2013, including the pickup and drop off times, locations, fare and tip amounts, as well as anonymized (hashed) versions of the taxi’s license and medallion numbers. Open Licenses Public domain license has no restrictions at all (technically, these indicate that the rights owner has waived their rights to the content or data) CC0, PDDL Attribution license just says that you must give attribution to the publisher CC-by ODC-by Attribution & share-alike license says that you must give attribution and share any derived content or data under the same licence CC-by-sa, ODnL
  12. Inportance of standards Schema.org
  13. Owl:sameAs or other kind of links DBpedia -Towards a Public Data Infrastructure for a Large, Multilingual, Semantic Knowledge Graph Cloud by contributors to the Linking Open Data community project and other individuals and organisations. It is based on metadata collected and curated by contributors to the Data Hub as well as on metadata extracted from a crawl of the Linked Data web conducted in April 2014.
  14. https://www.weforum.org/reports/global-information-technology-report-2015