SlideShare a Scribd company logo
1 of 19
Intro to PROV
Nicholas Car
Data Architect
nicholas.car@ga.gov.au
Outline
• What is PROV?
• How do I use PROV: modelling
• How do I use PROV: data management
• How do I use PROV: with other systems
Intro to PROV
What is PROV?
• W3C Recommendation (standard)
• Completed 2013
• Large number of authors
• The only international provenance standard
• Successor to precursors: PML, OPM.
• Many precursor authors involved
• Simpler than precursors
• No v2 any time soon
• Authors recommend extending the current standard
• Seeing good adoption
Intro to PROV
What is PROV?
• A “Family of documents”
• PROV-OVERVIEW – documentation
• PROV-PRIMER – tutorial
• PROV-DM – Data Model
• PROV-O – OWL Ontology version of DM
• PROV-N – special Notation for DM
• PROV-XML – XML encoding of DM
• PROV-CONSTRAINS – DM constraints
• http://www.w3.org/TR/prov-overview/
Intro to PROV
How do I use PROV: modelling
Not like this:
Do not describe the lineage of something in the metadata
document of that thing
Intro to PROV
ISO19115
or other
standardised
Document
provenance
information
contained in
document
some provenance field
Ref: https://geo-ide.noaa.gov/wiki/index.php?title=ISO_Lineage
How do I use PROV: modelling
Not like this:
Do not link a class of something to a provenance object
Intro to PROV
Data Catalogue Vocabulary (DCAT)
https://www.w3.org/TR/vocab-dcat/
Provenance
field 1
field 2
provenance
How do I use PROV: modelling
Not like this:
Do not link a class of something to a provenance object
Intro to PROV
Data Catalogue Vocabulary (DCAT)
https://www.w3.org/TR/vocab-dcat/
Provenance
field 1
field 2
provenance
Not even by using the
Dublin Core ‘provenance’
Property!
How do I use PROV: modelling
Like this:
Model things you are interested in as either Entities, Agents or
Activities and relate them to one another
Intro to PROV
PROV-DM’s basic classes expressed in a PROV-O style. After https://www.w3.org/TR/prov-o/
How do I use PROV: modelling
Like this:
GA’s “process provenance model”
Intro to PROV
How do I use PROV: data management
• For humans, or systems that log things:
• create Reports
• store them in a document DB
• with all the perks of a graph DB!
Intro to PROV
How do I use PROV: data management
• For humans, or systems that log things:
• create Reports
• store them in a document DB
• with all the perks of a graph DB!
Intro to PROV
A provenance Report
generation form for
human use in PROMS
How do I use PROV: data management
• For humans, or systems that log things:
• create Reports
• store them in a document DB
• For catalogue-like things:
• Add the ability to link Entities, Agents, Activities
Intro to PROV
Dataset
X
Dataset
Y
How do I use PROV: data management
• For humans, or systems that log things:
• create Reports
• store them in a document DB
• For catalogue-like things:
• Add the ability to link Entities, Agents, Activities
Intro to PROV
Dataset
X
Dataset
Y
wasDerivedFrom
Entity YEntity X
How do I use PROV: data management
• For humans, or systems that log things:
• create Reports
• store them in a document DB
• For catalogue-like things:
• Add the ability to link Entities, Agents, Activities
• Ensure relevant properties align with PROV
Intro to PROV
Dataset
X Creator
creator
How do I use PROV: data management
• For humans, or systems that log things:
• create Reports
• store them in a document DB
• For catalogue-like things:
• Add the ability to link Entities, Agents, Activities
• Ensure relevant properties align with PROV
Intro to PROV
Dataset
X
wasAssociatedWith
Creator
creator
Agent Creator
hadRole
How do I use PROV: data management
• For humans, or systems that log things:
• create Reports
• store them in a document DB
• For catalogue-like things:
• Add the ability to link Entities, Agents, Activities
• Ensure relevant properties align with PROV
• For databases:
• Ensure you represent the PROV-DM
Intro to PROV
How do I use PROV: data management
• For humans, or systems that log things:
• create Reports
• store them in a document DB
• For catalogue-like things:
• Add the ability to link Entities, Agents, Activities
• Ensure relevant properties align with PROV
• For databases:
• Ensure you represent the PROV-DM
• prove it via exporting
Intro to PROV
How do I use PROV: with other systems
• PROV & Metadata System X:
1. Full Alignment – Classify all things in MSX in PROV
o Requires a data model for MSX
o May have to reconsider some MSX objects
o Can profile PROV, don’t allow everything
2. Partial Alignment – Classify some of MSX in PROV
o Link classified things only
o Even link to things outside MSX
o Need to demo valid PROV-DM
3. Just PROV – Interpret/create PROV-only data
o Deprecate MSX for PROV
o Or create new data
Intro to PROV
How do I use PROV: data management
Like this:
GA’s “process provenance model”, full version
Intro to PROV

More Related Content

What's hot

What's hot (10)

Linked Data, Ontologies and Inference
Linked Data, Ontologies and InferenceLinked Data, Ontologies and Inference
Linked Data, Ontologies and Inference
 
swib15 ALIADA
swib15 ALIADAswib15 ALIADA
swib15 ALIADA
 
Using the whole web as your dataset
Using the whole web as your datasetUsing the whole web as your dataset
Using the whole web as your dataset
 
Semantic web for ontology chapter4 bynk
Semantic web for ontology chapter4 bynkSemantic web for ontology chapter4 bynk
Semantic web for ontology chapter4 bynk
 
RDF Seminar Presentation
RDF Seminar PresentationRDF Seminar Presentation
RDF Seminar Presentation
 
Incorporating Functions in Mappings to Facilitate the Uplift of CSV Files int...
Incorporating Functions in Mappings to Facilitate the Uplift of CSV Files int...Incorporating Functions in Mappings to Facilitate the Uplift of CSV Files int...
Incorporating Functions in Mappings to Facilitate the Uplift of CSV Files int...
 
Managing RDF data with graph databases
Managing RDF data with graph databasesManaging RDF data with graph databases
Managing RDF data with graph databases
 
Deriving an Emergent Relational Schema from RDF Data
Deriving an Emergent Relational Schema from RDF DataDeriving an Emergent Relational Schema from RDF Data
Deriving an Emergent Relational Schema from RDF Data
 
Shawn-Averkamp-feb25
Shawn-Averkamp-feb25Shawn-Averkamp-feb25
Shawn-Averkamp-feb25
 
The Standards Mosaic Opening the Way to New Technologies
The Standards Mosaic Opening the Way to New TechnologiesThe Standards Mosaic Opening the Way to New Technologies
The Standards Mosaic Opening the Way to New Technologies
 

Viewers also liked

JeremyTan_120058974_FYP_Final
JeremyTan_120058974_FYP_FinalJeremyTan_120058974_FYP_Final
JeremyTan_120058974_FYP_Final
Jeremy Tan
 
Presentación Personal
Presentación PersonalPresentación Personal
Presentación Personal
Bonabellotomas
 

Viewers also liked (16)

Managing provenance in the Social Sciences: the Data Documentation Initiative...
Managing provenance in the Social Sciences: the Data Documentation Initiative...Managing provenance in the Social Sciences: the Data Documentation Initiative...
Managing provenance in the Social Sciences: the Data Documentation Initiative...
 
Documenting Data Transformations
Documenting Data TransformationsDocumenting Data Transformations
Documenting Data Transformations
 
Western sydney wanderers official merchandise
Western sydney wanderers official merchandiseWestern sydney wanderers official merchandise
Western sydney wanderers official merchandise
 
Chapter 16_mcgee
Chapter 16_mcgeeChapter 16_mcgee
Chapter 16_mcgee
 
JeremyTan_120058974_FYP_Final
JeremyTan_120058974_FYP_FinalJeremyTan_120058974_FYP_Final
JeremyTan_120058974_FYP_Final
 
Numeros con signo suma, resta
Numeros con signo suma, restaNumeros con signo suma, resta
Numeros con signo suma, resta
 
Case study
Case studyCase study
Case study
 
RESUME.pdf
RESUME.pdfRESUME.pdf
RESUME.pdf
 
Undefined
UndefinedUndefined
Undefined
 
Sydney Fc
Sydney FcSydney Fc
Sydney Fc
 
Proyecto 6
Proyecto 6Proyecto 6
Proyecto 6
 
All about me kaitlyn
All about me kaitlynAll about me kaitlyn
All about me kaitlyn
 
Transcript of Webinar: Data management plans (DMPs) - audio
Transcript of Webinar: Data management plans (DMPs) - audioTranscript of Webinar: Data management plans (DMPs) - audio
Transcript of Webinar: Data management plans (DMPs) - audio
 
Diapositvas para el blog
Diapositvas para el blogDiapositvas para el blog
Diapositvas para el blog
 
Prac2
Prac2Prac2
Prac2
 
Presentación Personal
Presentación PersonalPresentación Personal
Presentación Personal
 

Similar to Provenance and social science data Nicholas Car - Intro to PROV

DevOps for TYPO3 Teams and Projects
DevOps for TYPO3 Teams and ProjectsDevOps for TYPO3 Teams and Projects
DevOps for TYPO3 Teams and Projects
Fedir RYKHTIK
 
Software engineering practices for the data science and machine learning life...
Software engineering practices for the data science and machine learning life...Software engineering practices for the data science and machine learning life...
Software engineering practices for the data science and machine learning life...
DataWorks Summit
 

Similar to Provenance and social science data Nicholas Car - Intro to PROV (20)

Datasets and GATE Evaluation Framework for Benchmarking Wikipedia Based NER S...
Datasets and GATE Evaluation Framework for Benchmarking Wikipedia Based NER S...Datasets and GATE Evaluation Framework for Benchmarking Wikipedia Based NER S...
Datasets and GATE Evaluation Framework for Benchmarking Wikipedia Based NER S...
 
Lawson ProcessFlow
Lawson ProcessFlowLawson ProcessFlow
Lawson ProcessFlow
 
(BDT303) Construct Your ETL Pipeline with AWS Data Pipeline, Amazon EMR, and ...
(BDT303) Construct Your ETL Pipeline with AWS Data Pipeline, Amazon EMR, and ...(BDT303) Construct Your ETL Pipeline with AWS Data Pipeline, Amazon EMR, and ...
(BDT303) Construct Your ETL Pipeline with AWS Data Pipeline, Amazon EMR, and ...
 
Case study: Life Cycle Management for SAP BusinessObjects platform as well as...
Case study: Life Cycle Management for SAP BusinessObjects platform as well as...Case study: Life Cycle Management for SAP BusinessObjects platform as well as...
Case study: Life Cycle Management for SAP BusinessObjects platform as well as...
 
FOSDEM 2014: Social Network Benchmark (SNB) Graph Generator
FOSDEM 2014:  Social Network Benchmark (SNB) Graph GeneratorFOSDEM 2014:  Social Network Benchmark (SNB) Graph Generator
FOSDEM 2014: Social Network Benchmark (SNB) Graph Generator
 
DevOps for TYPO3 Teams and Projects
DevOps for TYPO3 Teams and ProjectsDevOps for TYPO3 Teams and Projects
DevOps for TYPO3 Teams and Projects
 
Facilitating Release Planning Event
Facilitating Release Planning EventFacilitating Release Planning Event
Facilitating Release Planning Event
 
Migrating Data Warehouse Solutions from Oracle to non-Oracle Databases
Migrating Data Warehouse Solutions from Oracle to non-Oracle DatabasesMigrating Data Warehouse Solutions from Oracle to non-Oracle Databases
Migrating Data Warehouse Solutions from Oracle to non-Oracle Databases
 
Software engineering practices for the data science and machine learning life...
Software engineering practices for the data science and machine learning life...Software engineering practices for the data science and machine learning life...
Software engineering practices for the data science and machine learning life...
 
Tableau and hadoop
Tableau and hadoopTableau and hadoop
Tableau and hadoop
 
Best Practices and Tips on Migrating a Legacy-Based CMS to Drupal
Best Practices and Tips on Migrating a Legacy-Based CMS to DrupalBest Practices and Tips on Migrating a Legacy-Based CMS to Drupal
Best Practices and Tips on Migrating a Legacy-Based CMS to Drupal
 
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
AnalyticOps: Lessons Learned Moving Machine-Learning Algorithms to Production...
 
How city of chicago boosts their sap business objects environment prepares fo...
How city of chicago boosts their sap business objects environment prepares fo...How city of chicago boosts their sap business objects environment prepares fo...
How city of chicago boosts their sap business objects environment prepares fo...
 
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
 
Drupalcampchicago2010.rachel.datamigration.
Drupalcampchicago2010.rachel.datamigration.Drupalcampchicago2010.rachel.datamigration.
Drupalcampchicago2010.rachel.datamigration.
 
SAP Business Objects Trianing
SAP Business Objects TrianingSAP Business Objects Trianing
SAP Business Objects Trianing
 
Which Freaking Database Should I Use?
Which Freaking Database Should I Use?Which Freaking Database Should I Use?
Which Freaking Database Should I Use?
 
AWS Summit Singapore - Managing a Database Migration Project | Best Practices
AWS Summit Singapore - Managing a Database Migration Project | Best PracticesAWS Summit Singapore - Managing a Database Migration Project | Best Practices
AWS Summit Singapore - Managing a Database Migration Project | Best Practices
 
Building bridges - Plone Conference 2015 Bucharest
Building bridges   - Plone Conference 2015 BucharestBuilding bridges   - Plone Conference 2015 Bucharest
Building bridges - Plone Conference 2015 Bucharest
 
OpenSource and the Cloud ApacheCon.pptx
OpenSource and the Cloud  ApacheCon.pptxOpenSource and the Cloud  ApacheCon.pptx
OpenSource and the Cloud ApacheCon.pptx
 

More from ARDC

More from ARDC (20)

Introduction to ADA
Introduction to ADAIntroduction to ADA
Introduction to ADA
 
Architecture and Standards
Architecture and StandardsArchitecture and Standards
Architecture and Standards
 
Data Sharing and Release Legislation
Data Sharing and Release Legislation   Data Sharing and Release Legislation
Data Sharing and Release Legislation
 
Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)
 
Investigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspectiveInvestigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspective
 
NCRIS and the health domain
NCRIS and the health domainNCRIS and the health domain
NCRIS and the health domain
 
International perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataInternational perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research data
 
Clinical trials data sharing
Clinical trials data sharingClinical trials data sharing
Clinical trials data sharing
 
Clinical trials and cohort studies
Clinical trials and cohort studiesClinical trials and cohort studies
Clinical trials and cohort studies
 
Introduction to vision and scope
Introduction to vision and scopeIntroduction to vision and scope
Introduction to vision and scope
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things data
 
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian DuncanARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
 
Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128
 
Research data management and sharing of medical data
Research data management and sharing of medical dataResearch data management and sharing of medical data
Research data management and sharing of medical data
 
Findable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataFindable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) data
 
Applying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and ChallengesApplying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and Challenges
 
How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018
 
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global SprintReady, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
 
How FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of dataHow FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of data
 
Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018
 

Recently uploaded

Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
gajnagarg
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
ahmedjiabur940
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
Health
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
wsppdmt
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
gajnagarg
 

Recently uploaded (20)

Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt
 
20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Kings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themKings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about them
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 
Top Call Girls in Balaghat 9332606886Call Girls Advance Cash On Delivery Ser...
Top Call Girls in Balaghat  9332606886Call Girls Advance Cash On Delivery Ser...Top Call Girls in Balaghat  9332606886Call Girls Advance Cash On Delivery Ser...
Top Call Girls in Balaghat 9332606886Call Girls Advance Cash On Delivery Ser...
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
 

Provenance and social science data Nicholas Car - Intro to PROV

  • 1. Intro to PROV Nicholas Car Data Architect nicholas.car@ga.gov.au
  • 2. Outline • What is PROV? • How do I use PROV: modelling • How do I use PROV: data management • How do I use PROV: with other systems Intro to PROV
  • 3. What is PROV? • W3C Recommendation (standard) • Completed 2013 • Large number of authors • The only international provenance standard • Successor to precursors: PML, OPM. • Many precursor authors involved • Simpler than precursors • No v2 any time soon • Authors recommend extending the current standard • Seeing good adoption Intro to PROV
  • 4. What is PROV? • A “Family of documents” • PROV-OVERVIEW – documentation • PROV-PRIMER – tutorial • PROV-DM – Data Model • PROV-O – OWL Ontology version of DM • PROV-N – special Notation for DM • PROV-XML – XML encoding of DM • PROV-CONSTRAINS – DM constraints • http://www.w3.org/TR/prov-overview/ Intro to PROV
  • 5. How do I use PROV: modelling Not like this: Do not describe the lineage of something in the metadata document of that thing Intro to PROV ISO19115 or other standardised Document provenance information contained in document some provenance field Ref: https://geo-ide.noaa.gov/wiki/index.php?title=ISO_Lineage
  • 6. How do I use PROV: modelling Not like this: Do not link a class of something to a provenance object Intro to PROV Data Catalogue Vocabulary (DCAT) https://www.w3.org/TR/vocab-dcat/ Provenance field 1 field 2 provenance
  • 7. How do I use PROV: modelling Not like this: Do not link a class of something to a provenance object Intro to PROV Data Catalogue Vocabulary (DCAT) https://www.w3.org/TR/vocab-dcat/ Provenance field 1 field 2 provenance Not even by using the Dublin Core ‘provenance’ Property!
  • 8. How do I use PROV: modelling Like this: Model things you are interested in as either Entities, Agents or Activities and relate them to one another Intro to PROV PROV-DM’s basic classes expressed in a PROV-O style. After https://www.w3.org/TR/prov-o/
  • 9. How do I use PROV: modelling Like this: GA’s “process provenance model” Intro to PROV
  • 10. How do I use PROV: data management • For humans, or systems that log things: • create Reports • store them in a document DB • with all the perks of a graph DB! Intro to PROV
  • 11. How do I use PROV: data management • For humans, or systems that log things: • create Reports • store them in a document DB • with all the perks of a graph DB! Intro to PROV A provenance Report generation form for human use in PROMS
  • 12. How do I use PROV: data management • For humans, or systems that log things: • create Reports • store them in a document DB • For catalogue-like things: • Add the ability to link Entities, Agents, Activities Intro to PROV Dataset X Dataset Y
  • 13. How do I use PROV: data management • For humans, or systems that log things: • create Reports • store them in a document DB • For catalogue-like things: • Add the ability to link Entities, Agents, Activities Intro to PROV Dataset X Dataset Y wasDerivedFrom Entity YEntity X
  • 14. How do I use PROV: data management • For humans, or systems that log things: • create Reports • store them in a document DB • For catalogue-like things: • Add the ability to link Entities, Agents, Activities • Ensure relevant properties align with PROV Intro to PROV Dataset X Creator creator
  • 15. How do I use PROV: data management • For humans, or systems that log things: • create Reports • store them in a document DB • For catalogue-like things: • Add the ability to link Entities, Agents, Activities • Ensure relevant properties align with PROV Intro to PROV Dataset X wasAssociatedWith Creator creator Agent Creator hadRole
  • 16. How do I use PROV: data management • For humans, or systems that log things: • create Reports • store them in a document DB • For catalogue-like things: • Add the ability to link Entities, Agents, Activities • Ensure relevant properties align with PROV • For databases: • Ensure you represent the PROV-DM Intro to PROV
  • 17. How do I use PROV: data management • For humans, or systems that log things: • create Reports • store them in a document DB • For catalogue-like things: • Add the ability to link Entities, Agents, Activities • Ensure relevant properties align with PROV • For databases: • Ensure you represent the PROV-DM • prove it via exporting Intro to PROV
  • 18. How do I use PROV: with other systems • PROV & Metadata System X: 1. Full Alignment – Classify all things in MSX in PROV o Requires a data model for MSX o May have to reconsider some MSX objects o Can profile PROV, don’t allow everything 2. Partial Alignment – Classify some of MSX in PROV o Link classified things only o Even link to things outside MSX o Need to demo valid PROV-DM 3. Just PROV – Interpret/create PROV-only data o Deprecate MSX for PROV o Or create new data Intro to PROV
  • 19. How do I use PROV: data management Like this: GA’s “process provenance model”, full version Intro to PROV