OPEN DATA

  READY, SET, GO!




Paul Groth
Twitter: @pgroth
Blog: thinklinks.wordpress.com
http://www.few.vu.nl/~pgroth
The Science Lifecycle

[Diagram labels: scientists; experimentation; Digital Libraries; Peer-Reviewed Journal & Conference Papers; Preprints & Metadata; Technical Reports; Reprints; Local Web Repositories; Certified Experimental Results & Analyses; Data, Metadata, Provenance, Scripts, Workflows, Services, Ontologies, Blogs, ...; Virtual Learning Environment; Undergraduate Students; Graduate Students; Next Generation Researchers]

Adapted from David De Roure's slides
TWO STORIES

THE CONSUMER AND PRODUCER
MEET JULIE

PhD Student
“institutional influences on
patterns of collaboration in
producing research of
interdisciplinary character”




                               Faculteit der Exacte Wetenschappen
Julie needs data




I AM NOT A LAWYER
    Web of Knowledge Terms of Use


    You are entitled to access the product, download or extract reasonable amounts of data from the
    product that are required for the activities you carry out individually or as part of your employment, and
    include insubstantial portions of extracted data in your work documents and reports, provided that such
    documents or reports are for the benefit of (and belong to) your organization, or where such documents
    or reports are intended for the benefit of third parties (not your organization), extracted data is
    immaterial in the context of such documents or reports and used only for illustrative/demo purposes.

    Thomson Reuters determines a “reasonable amount” of data to download by comparing your download
    activity against the average annual download rates for all Thomson Reuters clients using the product in
    question. Thomson Reuters determines an “insubstantial portion” of downloaded data to mean an
    amount of data taken from the product which (1) would not have significant commercial value of its
    own; and (2) would not act as a substitute for access to a Thomson Reuters product for someone who
    does not have access to the product.

    You are not entitled to do anything that would cause a breach of the terms of the agreement between
    your organization and Thomson Reuters, such as (1) allowing anyone else to use your
    username/password, (2) downloading excessive amounts of data, (3) providing data to anyone else,
    other than in licensed, source-acknowledged documents or reports created as part of your normal work,
    (4) archiving or using downloaded data to create a derivative database or metrics, (5) using the product
    or any downloaded data to provide services to anyone outside your organization, or (6) using the
    product in a way that risks damaging, disabling, overburdening or impairing the operation of the
    product, or any other person’s use or enjoyment of the product.




OPEN DATA: 2 WEEKS → 15 MINUTES
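# Prefix IRIs are assumed (the slide omits them); these are the commonly used namespaces.
PREFIX swrc: <http://swrc.ontoware.org/ontology#>
PREFIX swc:  <http://data.semanticweb.org/ns/swc/ontology#>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
PREFIX dc:   <http://purl.org/dc/elements/1.1/>

# $graph and $article appear to be template placeholders, filled in before the query is sent.
SELECT ?author ?affiliation ?uriAffiliation WHERE
{
    GRAPH <$graph> {
        # An article's authors may be recorded under any of three properties.
        {
            <$article> swrc:author ?author .
            OPTIONAL { ?author swrc:affiliation ?uriAffiliation . }
            OPTIONAL { ?author swc:affiliation ?affiliation . }
        }
        UNION {
            <$article> foaf:maker ?author .
            OPTIONAL { ?author swrc:affiliation ?uriAffiliation . }
            OPTIONAL { ?author swc:affiliation ?affiliation . }
        }
        UNION {
            <$article> dc:creator ?author .
            OPTIONAL { ?author swrc:affiliation ?uriAffiliation . }
            OPTIONAL { ?author swc:affiliation ?affiliation . }
        }
    }
}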
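
The `$graph` and `$article` tokens in the query above look like template placeholders. As a rough sketch only, assuming they are meant to be filled with URIs before the query is sent, Python's standard `string.Template` could do the substitution. The shortened query, the `fill_query` helper, and the example URIs are all hypothetical and not taken from the slides.

```python
from string import Template

# Shortened, hypothetical stand-in for the slide's query; only one of the
# three author properties is kept. $graph and $article are the placeholders.
QUERY = Template("""
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?author WHERE {
  GRAPH <$graph> {
    <$article> foaf:maker ?author .
  }
}
""")

# Characters that may not appear inside a SPARQL IRI reference (<...>).
FORBIDDEN = set('<>"{}|^`\\ ')


def fill_query(graph_uri: str, article_uri: str) -> str:
    """Return the query text with both placeholders replaced by URIs."""
    for uri in (graph_uri, article_uri):
        # Refuse input that could break out of the <...> delimiters.
        if any(ch in FORBIDDEN for ch in uri):
            raise ValueError(f"not a safe IRI: {uri!r}")
    return QUERY.substitute(graph=graph_uri, article=article_uri)


# Example URIs are made up for illustration.
print(fill_query("http://example.org/graph/conf", "http://example.org/paper/42"))
```

The resulting string could then be sent to whichever SPARQL endpoint hosts the data; the check on the URIs is there because plain text substitution would otherwise let a malformed value change the shape of the query.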




PRODUCER

INSTITUTION
PRODUCER

PERSONAL
Photo by IvanClow - http://www.flickr.com/photos/ivanclow/4201955402/
ERR... SUPPORT?




5 TAKE-AWAYS

     1.   Open Data is a boon to young scientists as consumers
     2.   Trade-offs for producers of open data
     3.   Producers need support
     4.   Clear simple guidelines for data publication
     5.   Data citation is a key to open data




18                                                Faculteit der Exacte Wetenschappen

More Related Content

Viewers also liked

JenH2k
JenH2kJenH2k
JenH2k
Golden Team
 
Ict
IctIct
A consumer-first approach to mobile strategy.
A consumer-first approach to mobile strategy.A consumer-first approach to mobile strategy.
A consumer-first approach to mobile strategy.
Kayla Green
 
Dengue hearhgic fever by dr muhammad tuseef javed
Dengue hearhgic fever by dr muhammad tuseef javedDengue hearhgic fever by dr muhammad tuseef javed
Dengue hearhgic fever by dr muhammad tuseef javed
Tauseef Jawaid
 
Value Line
Value LineValue Line
Value Line
Lacey Klemm
 
slideshow
slideshowslideshow
slideshowmelineez
 
Scholastic photojournalists and the publication of graphic, spot news images
Scholastic photojournalists and the publication of graphic, spot news imagesScholastic photojournalists and the publication of graphic, spot news images
Scholastic photojournalists and the publication of graphic, spot news images
Bradley Wilson
 
20130218 NEOBR Zip form training
20130218 NEOBR Zip form training20130218 NEOBR Zip form training
20130218 NEOBR Zip form training
RE/MAX Grand Lake
 
zJOS/Puspa© Pipelined Scheduling
zJOS/Puspa© Pipelined SchedulingzJOS/Puspa© Pipelined Scheduling
zJOS/Puspa© Pipelined Scheduling
Deru Sudibyo
 
World hunger bueno
World hunger buenoWorld hunger bueno
World hunger bueno
Svalent
 
Transmedia 101
Transmedia 101Transmedia 101
Transmedia 101
Kayla Green
 
KÔmsi lak kunsti ainekava pÔhikoolile
KÔmsi lak kunsti ainekava pÔhikoolileKÔmsi lak kunsti ainekava pÔhikoolile
KÔmsi lak kunsti ainekava pÔhikoolile
Annika
 
NRG-Advice | Praktisch Verandermanagement
NRG-Advice | Praktisch VerandermanagementNRG-Advice | Praktisch Verandermanagement
NRG-Advice | Praktisch Verandermanagement
NRG-Advice
 
zJOS System Events Automation Users Guide
zJOS System Events Automation Users GuidezJOS System Events Automation Users Guide
zJOS System Events Automation Users Guide
Deru Sudibyo
 
Whataburger Annette
Whataburger AnnetteWhataburger Annette
Whataburger Annette
abfarrell
 
Perception of Women in the Mining and Mineral Exploration Industries, a Canad...
Perception of Women in the Mining and Mineral Exploration Industries, a Canad...Perception of Women in the Mining and Mineral Exploration Industries, a Canad...
Perception of Women in the Mining and Mineral Exploration Industries, a Canad...
MafaldaArias
 
ĐŸĐŸŃ€Ń‚Ń€Đ”Ń‚ ŃĐŸŃ‚Ń€ŃƒĐŽĐœĐžĐșĐ°
ĐŸĐŸŃ€Ń‚Ń€Đ”Ń‚ ŃĐŸŃ‚Ń€ŃƒĐŽĐœĐžĐșĐ°ĐŸĐŸŃ€Ń‚Ń€Đ”Ń‚ ŃĐŸŃ‚Ń€ŃƒĐŽĐœĐžĐșĐ°
ĐŸĐŸŃ€Ń‚Ń€Đ”Ń‚ ŃĐŸŃ‚Ń€ŃƒĐŽĐœĐžĐșĐ°MEB
 
ĐĄĐż. ĐœĐ”ĐœĐžĐŽĐ¶ŃŠŃ€ - ĐŸŃ€Đ”Đ·Đ”ĐœŃ‚Đ°Ń†ĐžŃ ĐœĐ° ĐœĐ”ĐŽĐžĐ”Đœ ĐżĐ°ĐœĐ°ĐžŃ€ 2008
ĐĄĐż. ĐœĐ”ĐœĐžĐŽĐ¶ŃŠŃ€ - ĐŸŃ€Đ”Đ·Đ”ĐœŃ‚Đ°Ń†ĐžŃ ĐœĐ° ĐœĐ”ĐŽĐžĐ”Đœ ĐżĐ°ĐœĐ°ĐžŃ€ 2008ĐĄĐż. ĐœĐ”ĐœĐžĐŽĐ¶ŃŠŃ€ - ĐŸŃ€Đ”Đ·Đ”ĐœŃ‚Đ°Ń†ĐžŃ ĐœĐ° ĐœĐ”ĐŽĐžĐ”Đœ ĐżĐ°ĐœĐ°ĐžŃ€ 2008
ĐĄĐż. ĐœĐ”ĐœĐžĐŽĐ¶ŃŠŃ€ - ĐŸŃ€Đ”Đ·Đ”ĐœŃ‚Đ°Ń†ĐžŃ ĐœĐ° ĐœĐ”ĐŽĐžĐ”Đœ ĐżĐ°ĐœĐ°ĐžŃ€ 2008MediaFair2010
 
Workshop about hedge funds industry
Workshop about hedge funds industryWorkshop about hedge funds industry
Workshop about hedge funds industry
Davide Zari
 
Bungy Jumping
Bungy JumpingBungy Jumping
Bungy Jumping
julietac
 

Viewers also liked (20)

JenH2k
JenH2kJenH2k
JenH2k
 
Ict
IctIct
Ict
 
A consumer-first approach to mobile strategy.
A consumer-first approach to mobile strategy.A consumer-first approach to mobile strategy.
A consumer-first approach to mobile strategy.
 
Dengue hearhgic fever by dr muhammad tuseef javed
Dengue hearhgic fever by dr muhammad tuseef javedDengue hearhgic fever by dr muhammad tuseef javed
Dengue hearhgic fever by dr muhammad tuseef javed
 
Value Line
Value LineValue Line
Value Line
 
slideshow
slideshowslideshow
slideshow
 
Scholastic photojournalists and the publication of graphic, spot news images
Scholastic photojournalists and the publication of graphic, spot news imagesScholastic photojournalists and the publication of graphic, spot news images
Scholastic photojournalists and the publication of graphic, spot news images
 
20130218 NEOBR Zip form training
20130218 NEOBR Zip form training20130218 NEOBR Zip form training
20130218 NEOBR Zip form training
 
zJOS/Puspa© Pipelined Scheduling
zJOS/Puspa© Pipelined SchedulingzJOS/Puspa© Pipelined Scheduling
zJOS/Puspa© Pipelined Scheduling
 
World hunger bueno
World hunger buenoWorld hunger bueno
World hunger bueno
 
Transmedia 101
Transmedia 101Transmedia 101
Transmedia 101
 
KÔmsi lak kunsti ainekava pÔhikoolile
KÔmsi lak kunsti ainekava pÔhikoolileKÔmsi lak kunsti ainekava pÔhikoolile
KÔmsi lak kunsti ainekava pÔhikoolile
 
NRG-Advice | Praktisch Verandermanagement
NRG-Advice | Praktisch VerandermanagementNRG-Advice | Praktisch Verandermanagement
NRG-Advice | Praktisch Verandermanagement
 
zJOS System Events Automation Users Guide
zJOS System Events Automation Users GuidezJOS System Events Automation Users Guide
zJOS System Events Automation Users Guide
 
Whataburger Annette
Whataburger AnnetteWhataburger Annette
Whataburger Annette
 
Perception of Women in the Mining and Mineral Exploration Industries, a Canad...
Perception of Women in the Mining and Mineral Exploration Industries, a Canad...Perception of Women in the Mining and Mineral Exploration Industries, a Canad...
Perception of Women in the Mining and Mineral Exploration Industries, a Canad...
 
ĐŸĐŸŃ€Ń‚Ń€Đ”Ń‚ ŃĐŸŃ‚Ń€ŃƒĐŽĐœĐžĐșĐ°
ĐŸĐŸŃ€Ń‚Ń€Đ”Ń‚ ŃĐŸŃ‚Ń€ŃƒĐŽĐœĐžĐșĐ°ĐŸĐŸŃ€Ń‚Ń€Đ”Ń‚ ŃĐŸŃ‚Ń€ŃƒĐŽĐœĐžĐșĐ°
ĐŸĐŸŃ€Ń‚Ń€Đ”Ń‚ ŃĐŸŃ‚Ń€ŃƒĐŽĐœĐžĐșĐ°
 
ĐĄĐż. ĐœĐ”ĐœĐžĐŽĐ¶ŃŠŃ€ - ĐŸŃ€Đ”Đ·Đ”ĐœŃ‚Đ°Ń†ĐžŃ ĐœĐ° ĐœĐ”ĐŽĐžĐ”Đœ ĐżĐ°ĐœĐ°ĐžŃ€ 2008
ĐĄĐż. ĐœĐ”ĐœĐžĐŽĐ¶ŃŠŃ€ - ĐŸŃ€Đ”Đ·Đ”ĐœŃ‚Đ°Ń†ĐžŃ ĐœĐ° ĐœĐ”ĐŽĐžĐ”Đœ ĐżĐ°ĐœĐ°ĐžŃ€ 2008ĐĄĐż. ĐœĐ”ĐœĐžĐŽĐ¶ŃŠŃ€ - ĐŸŃ€Đ”Đ·Đ”ĐœŃ‚Đ°Ń†ĐžŃ ĐœĐ° ĐœĐ”ĐŽĐžĐ”Đœ ĐżĐ°ĐœĐ°ĐžŃ€ 2008
ĐĄĐż. ĐœĐ”ĐœĐžĐŽĐ¶ŃŠŃ€ - ĐŸŃ€Đ”Đ·Đ”ĐœŃ‚Đ°Ń†ĐžŃ ĐœĐ° ĐœĐ”ĐŽĐžĐ”Đœ ĐżĐ°ĐœĐ°ĐžŃ€ 2008
 
Workshop about hedge funds industry
Workshop about hedge funds industryWorkshop about hedge funds industry
Workshop about hedge funds industry
 
Bungy Jumping
Bungy JumpingBungy Jumping
Bungy Jumping
 

Similar to Open Data: Ready Set Go

Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3
guestd9aa5
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3
guru122
 
Collaboration and Sharing
Collaboration and SharingCollaboration and Sharing
Collaboration and Sharing
Jisc
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynote
Carole Goble
 
myExperiment - Defining the Social Virtual Research Environment
myExperiment - Defining the Social Virtual Research EnvironmentmyExperiment - Defining the Social Virtual Research Environment
myExperiment - Defining the Social Virtual Research Environment
David De Roure
 
20171003 lancaster data conversations Chue-Hong
20171003 lancaster data conversations Chue-Hong20171003 lancaster data conversations Chue-Hong
20171003 lancaster data conversations Chue-Hong
Lancaster University Library
 
Preserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarship
tsbbbu
 
Software Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceSoftware Sustainability: Better Software Better Science
Software Sustainability: Better Software Better Science
Carole Goble
 
Oscon 2011 schroeder
Oscon 2011 schroederOscon 2011 schroeder
Oscon 2011 schroeder
will-schroeder
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
Carole Goble
 
Data Citation Made Easy
Data Citation Made EasyData Citation Made Easy
Data Citation Made Easy
University of California Curation Center
 
My Experiment
My ExperimentMy Experiment
My Experiment
Francesco Izzo
 
Dave de Roure - The myExperiment approach towards Open Science
Dave de Roure - The myExperiment approach towards Open ScienceDave de Roure - The myExperiment approach towards Open Science
Dave de Roure - The myExperiment approach towards Open Science
shwu
 
Facilitate Research Communities Adoption of Open Science Publishing Principle...
Facilitate Research Communities Adoption of Open Science Publishing Principle...Facilitate Research Communities Adoption of Open Science Publishing Principle...
Facilitate Research Communities Adoption of Open Science Publishing Principle...
OpenAIRE
 
Purdue University Research Repository - OR2013
Purdue University Research Repository - OR2013Purdue University Research Repository - OR2013
Purdue University Research Repository - OR2013
Courtney Matthews
 
Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402
vrij
 
Knowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems ScienceKnowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems Science
David De Roure
 
The New e-Science (Bangalore Edition)
The New e-Science (Bangalore Edition)The New e-Science (Bangalore Edition)
The New e-Science (Bangalore Edition)
David De Roure
 
Do you speak open science
Do you speak open science Do you speak open science
Do you speak open science
Paola Chiara Masuzzo
 
Learning Open Source through GSOC
Learning Open Source through GSOC Learning Open Source through GSOC
Learning Open Source through GSOC
smarru
 

Similar to Open Data: Ready Set Go (20)

Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3
 
Deroure Repo3
Deroure Repo3Deroure Repo3
Deroure Repo3
 
Collaboration and Sharing
Collaboration and SharingCollaboration and Sharing
Collaboration and Sharing
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynote
 
myExperiment - Defining the Social Virtual Research Environment
myExperiment - Defining the Social Virtual Research EnvironmentmyExperiment - Defining the Social Virtual Research Environment
myExperiment - Defining the Social Virtual Research Environment
 
20171003 lancaster data conversations Chue-Hong
20171003 lancaster data conversations Chue-Hong20171003 lancaster data conversations Chue-Hong
20171003 lancaster data conversations Chue-Hong
 
Preserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of ScholarshipPreserving the Inputs and Outputs of Scholarship
Preserving the Inputs and Outputs of Scholarship
 
Software Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceSoftware Sustainability: Better Software Better Science
Software Sustainability: Better Software Better Science
 
Oscon 2011 schroeder
Oscon 2011 schroederOscon 2011 schroeder
Oscon 2011 schroeder
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
 
Data Citation Made Easy
Data Citation Made EasyData Citation Made Easy
Data Citation Made Easy
 
My Experiment
My ExperimentMy Experiment
My Experiment
 
Dave de Roure - The myExperiment approach towards Open Science
Dave de Roure - The myExperiment approach towards Open ScienceDave de Roure - The myExperiment approach towards Open Science
Dave de Roure - The myExperiment approach towards Open Science
 
Facilitate Research Communities Adoption of Open Science Publishing Principle...
Facilitate Research Communities Adoption of Open Science Publishing Principle...Facilitate Research Communities Adoption of Open Science Publishing Principle...
Facilitate Research Communities Adoption of Open Science Publishing Principle...
 
Purdue University Research Repository - OR2013
Purdue University Research Repository - OR2013Purdue University Research Repository - OR2013
Purdue University Research Repository - OR2013
 
Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402
 
Knowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems ScienceKnowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems Science
 
The New e-Science (Bangalore Edition)
The New e-Science (Bangalore Edition)The New e-Science (Bangalore Edition)
The New e-Science (Bangalore Edition)
 
Do you speak open science
Do you speak open science Do you speak open science
Do you speak open science
 
Learning Open Source through GSOC
Learning Open Source through GSOC Learning Open Source through GSOC
Learning Open Source through GSOC
 

More from Paul Groth

To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
Data Curation and Debugging for Data Centric AI
Data Curation and Debugging for Data Centric AIData Curation and Debugging for Data Centric AI
Data Curation and Debugging for Data Centric AI
Paul Groth
 
Content + Signals: The value of the entire data estate for machine learning
Content + Signals: The value of the entire data estate for machine learningContent + Signals: The value of the entire data estate for machine learning
Content + Signals: The value of the entire data estate for machine learning
Paul Groth
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.
Paul Groth
 
Minimal viable-datareuse-czi
Minimal viable-datareuse-cziMinimal viable-datareuse-czi
Minimal viable-datareuse-czi
Paul Groth
 
Knowledge Graph Maintenance
Knowledge Graph MaintenanceKnowledge Graph Maintenance
Knowledge Graph Maintenance
Paul Groth
 
Knowledge Graph Futures
Knowledge Graph FuturesKnowledge Graph Futures
Knowledge Graph Futures
Paul Groth
 
Knowledge Graph Maintenance
Knowledge Graph MaintenanceKnowledge Graph Maintenance
Knowledge Graph Maintenance
Paul Groth
 
Thoughts on Knowledge Graphs & Deeper Provenance
Thoughts on Knowledge Graphs  & Deeper ProvenanceThoughts on Knowledge Graphs  & Deeper Provenance
Thoughts on Knowledge Graphs & Deeper Provenance
Paul Groth
 
Thinking About the Making of Data
Thinking About the Making of DataThinking About the Making of Data
Thinking About the Making of Data
Paul Groth
 
End-to-End Learning for Answering Structured Queries Directly over Text
End-to-End Learning for  Answering Structured Queries Directly over Text End-to-End Learning for  Answering Structured Queries Directly over Text
End-to-End Learning for Answering Structured Queries Directly over Text
Paul Groth
 
From Data Search to Data Showcasing
From Data Search to Data ShowcasingFrom Data Search to Data Showcasing
From Data Search to Data Showcasing
Paul Groth
 
Elsevier’s Healthcare Knowledge Graph
Elsevier’s Healthcare Knowledge GraphElsevier’s Healthcare Knowledge Graph
Elsevier’s Healthcare Knowledge Graph
Paul Groth
 
The Challenge of Deeper Knowledge Graphs for Science
The Challenge of Deeper Knowledge Graphs for ScienceThe Challenge of Deeper Knowledge Graphs for Science
The Challenge of Deeper Knowledge Graphs for Science
Paul Groth
 
More ways of symbol grounding for knowledge graphs?
More ways of symbol grounding for knowledge graphs?More ways of symbol grounding for knowledge graphs?
More ways of symbol grounding for knowledge graphs?
Paul Groth
 
Diversity and Depth: Implementing AI across many long tail domains
Diversity and Depth: Implementing AI across many long tail domainsDiversity and Depth: Implementing AI across many long tail domains
Diversity and Depth: Implementing AI across many long tail domains
Paul Groth
 
Progressive Provenance Capture Through Re-computation
Progressive Provenance Capture Through Re-computationProgressive Provenance Capture Through Re-computation
Progressive Provenance Capture Through Re-computation
Paul Groth
 
From Text to Data to the World: The Future of Knowledge Graphs
From Text to Data to the World: The Future of Knowledge GraphsFrom Text to Data to the World: The Future of Knowledge Graphs
From Text to Data to the World: The Future of Knowledge Graphs
Paul Groth
 
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge GraphsCombining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Paul Groth
 
The need for a transparent data supply chain
The need for a transparent data supply chainThe need for a transparent data supply chain
The need for a transparent data supply chain
Paul Groth
 

More from Paul Groth (20)

To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Data Curation and Debugging for Data Centric AI
Data Curation and Debugging for Data Centric AIData Curation and Debugging for Data Centric AI
Data Curation and Debugging for Data Centric AI
 
Content + Signals: The value of the entire data estate for machine learning
Content + Signals: The value of the entire data estate for machine learningContent + Signals: The value of the entire data estate for machine learning
Content + Signals: The value of the entire data estate for machine learning
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.
 
Minimal viable-datareuse-czi
Minimal viable-datareuse-cziMinimal viable-datareuse-czi
Minimal viable-datareuse-czi
 
Knowledge Graph Maintenance
Knowledge Graph MaintenanceKnowledge Graph Maintenance
Knowledge Graph Maintenance
 
Knowledge Graph Futures
Knowledge Graph FuturesKnowledge Graph Futures
Knowledge Graph Futures
 
Knowledge Graph Maintenance
Knowledge Graph MaintenanceKnowledge Graph Maintenance
Knowledge Graph Maintenance
 
Thoughts on Knowledge Graphs & Deeper Provenance
Thoughts on Knowledge Graphs  & Deeper ProvenanceThoughts on Knowledge Graphs  & Deeper Provenance
Thoughts on Knowledge Graphs & Deeper Provenance
 
Thinking About the Making of Data
Thinking About the Making of DataThinking About the Making of Data
Thinking About the Making of Data
 
End-to-End Learning for Answering Structured Queries Directly over Text
End-to-End Learning for  Answering Structured Queries Directly over Text End-to-End Learning for  Answering Structured Queries Directly over Text
End-to-End Learning for Answering Structured Queries Directly over Text
 
From Data Search to Data Showcasing
From Data Search to Data ShowcasingFrom Data Search to Data Showcasing
From Data Search to Data Showcasing
 
Elsevier’s Healthcare Knowledge Graph
Elsevier’s Healthcare Knowledge GraphElsevier’s Healthcare Knowledge Graph
Elsevier’s Healthcare Knowledge Graph
 
The Challenge of Deeper Knowledge Graphs for Science
The Challenge of Deeper Knowledge Graphs for ScienceThe Challenge of Deeper Knowledge Graphs for Science
The Challenge of Deeper Knowledge Graphs for Science
 
More ways of symbol grounding for knowledge graphs?
More ways of symbol grounding for knowledge graphs?More ways of symbol grounding for knowledge graphs?
More ways of symbol grounding for knowledge graphs?
 
Diversity and Depth: Implementing AI across many long tail domains
Diversity and Depth: Implementing AI across many long tail domainsDiversity and Depth: Implementing AI across many long tail domains
Diversity and Depth: Implementing AI across many long tail domains
 
Progressive Provenance Capture Through Re-computation
Progressive Provenance Capture Through Re-computationProgressive Provenance Capture Through Re-computation
Progressive Provenance Capture Through Re-computation
 
From Text to Data to the World: The Future of Knowledge Graphs
From Text to Data to the World: The Future of Knowledge GraphsFrom Text to Data to the World: The Future of Knowledge Graphs
From Text to Data to the World: The Future of Knowledge Graphs
 
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge GraphsCombining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
Combining Explicit and Latent Web Semantics for Maintaining Knowledge Graphs
 
The need for a transparent data supply chain
The need for a transparent data supply chainThe need for a transparent data supply chain
The need for a transparent data supply chain
 

Recently uploaded

Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
fredae14
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
Webinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data WarehouseWebinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data Warehouse
Federico Razzoli
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-UniversitÀt
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 

Open Data: Ready Set Go

  • 1. OPEN DATA READY, SET, GO! Paul Groth Twitter: @pgroth Blog: thinklinks.wordpress.com http://www.few.vu.nl/~pgroth
  • 2. The Science Lifecycle (diagram linking scientists and experimentation to undergraduate students, graduate students, and next-generation researchers; virtual learning environments; digital libraries; repositories; peer-reviewed journal and conference papers; reprints, preprints, technical reports, and metadata; the local Web; certified experimental results and analyses; and data, metadata, provenance, scripts, workflows, services, ontologies, blogs, ...). Adapted from David De Roure’s slides
  • 4. MEET JULIE PhD Student “institutional influences on patterns of collaboration in producing research of interdisciplinary character” Faculteit der Exacte Wetenschappen
  • 5. Julie needs data
  • 6. I AM NOT A LAWYER. Web of Knowledge Terms of Use:
       You are entitled to access the product, download or extract reasonable amounts of data from the product that are required for the activities you carry out individually or as part of your employment, and include insubstantial portions of extracted data in your work documents and reports, provided that such documents or reports are for the benefit of (and belong to) your organization, or where such documents or reports are intended for the benefit of third parties (not your organization), extracted data is immaterial in the context of such documents or reports and used only for illustrative/demo purposes.
       Thomson Reuters determines a “reasonable amount” of data to download by comparing your download activity against the average annual download rates for all Thomson Reuters clients using the product in question. Thomson Reuters determines an “insubstantial portion” of downloaded data to mean an amount of data taken from the product which (1) would not have significant commercial value of its own; and (2) would not act as a substitute for access to a Thomson Reuters product for someone who does not have access to the product.
       You are not entitled to do anything that would cause a breach of the terms of the agreement between your organization and Thomson Reuters, such as (1) allowing anyone else to use your username/password, (2) downloading excessive amounts of data, (3) providing data to anyone else, other than in licensed, source-acknowledged documents or reports created as part of your normal work, (4) archiving or using downloaded data to create a derivative database or metrics, (5) using the product or any downloaded data to provide services to anyone outside your organization, or (6) using the product in a way that risks damaging, disabling, overburdening or impairing the operation of the product, or any other person’s use or enjoyment of the product.
  • 9. OPEN DATA: 2 WEEKS → 15 MINUTES
       SELECT ?author ?affiliation ?uriAffiliation
       WHERE {
         GRAPH <$graph> {
           { <$article> swrc:author ?author.
             OPTIONAL { ?author swrc:affiliation ?uriAffiliation. }
             OPTIONAL { ?author swc:affiliation ?affiliation. } }
           UNION
           { <$article> foaf:maker ?author.
             OPTIONAL { ?author swrc:affiliation ?uriAffiliation. }
             OPTIONAL { ?author swc:affiliation ?affiliation. } }
           UNION
           { <$article> dc:creator ?author.
             OPTIONAL { ?author swrc:affiliation ?uriAffiliation. }
             OPTIONAL { ?author swc:affiliation ?affiliation. } }
         }
       }
  • 16. Photo by IvanClow - http://www.flickr.com/photos/ivanclow/4201955402/
  • 17. ERR... SUPPORT?
  • 18. 5 TAKE-AWAYS
       1. Open Data is a boon to young scientists as consumers
       2. Trade-offs for producers of open data
       3. Producers need support
       4. Clear simple guidelines for data publication
       5. Data citation is a key to open data
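
The query on slide 9 is written as a template: `$graph` and `$article` are placeholders to be filled in per article before the query is sent to a SPARQL endpoint. A minimal sketch of that substitution step follows, in Python. The slide does not show its PREFIX declarations, its tooling, or its endpoint, so the prefix URIs, the function names, and the URI check are all assumptions made for illustration, and the closing brace of the WHERE clause is added because the slide text appears to be cut off.

```python
from string import Template

# Assumption: the slide omits PREFIX declarations; these are the commonly
# used namespace URIs for the four vocabularies and may differ from the
# ones the author actually used.
PREFIXES = """\
PREFIX swrc: <http://swrc.ontoware.org/ontology#>
PREFIX swc:  <http://data.semanticweb.org/ns/swc/ontology#>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
PREFIX dc:   <http://purl.org/dc/elements/1.1/>
"""

# The slide's query, kept as a template. The three UNION branches try three
# different properties that datasets use to link an article to its authors.
AUTHOR_QUERY = Template(PREFIXES + """\
SELECT ?author ?affiliation ?uriAffiliation
WHERE {
  GRAPH <$graph> {
    { <$article> swrc:author ?author.
      OPTIONAL { ?author swrc:affiliation ?uriAffiliation. }
      OPTIONAL { ?author swc:affiliation ?affiliation. } }
    UNION
    { <$article> foaf:maker ?author.
      OPTIONAL { ?author swrc:affiliation ?uriAffiliation. }
      OPTIONAL { ?author swc:affiliation ?affiliation. } }
    UNION
    { <$article> dc:creator ?author.
      OPTIONAL { ?author swrc:affiliation ?uriAffiliation. }
      OPTIONAL { ?author swc:affiliation ?affiliation. } }
  }
}
""")


def _check_uri(uri: str) -> str:
    """Reject characters that are not allowed inside a SPARQL <...> IRI."""
    if any(ch in '<>"{}|\\^`' or ch.isspace() for ch in uri):
        raise ValueError(f"not a safe URI: {uri!r}")
    return uri


def build_author_query(graph_uri: str, article_uri: str) -> str:
    """Fill the $graph and $article placeholders (hypothetical helper)."""
    return AUTHOR_QUERY.substitute(
        graph=_check_uri(graph_uri),
        article=_check_uri(article_uri),
    )


if __name__ == "__main__":
    # Placeholder URIs for illustration only.
    print(build_author_query("http://example.org/graph",
                             "http://example.org/paper/1"))
```

The resulting string can be passed to whatever SPARQL client is at hand. Rejecting characters such as `>` and `}` before substitution keeps a malformed URI from changing the shape of the query.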

Editor's Notes

  1. Talk about citation data: it is difficult to get, and it took 2 weeks to gather a couple of hundred citation scores
  2. Open data to the rescue...
  3. My own community
  4. Faster; easier to experiment; access to more data
  5. Effective at the institutional level. Examples: UniProt, ChEMBL, astronomical data services, US government weather data
  6. Not as much experience at the personal level, but there are good examples from open source software
  7. Built software during my PhD and released it as open source...
  8. A fairly highly cited paper at the UK e-Science All Hands Meeting (not the biggest outlet in the world)
  9. Led to new collaborators
  10. Exposing your dirty laundry is scary
  11. Lots of questions about the software. People want support. This is a distraction and can take time away from “science”
  12. Name-check e-science center for 3