SlideShare a Scribd company logo
+
FOOD: FOod in Open Data
Silvio Peroni, Giorgia Lodi, Luigi Asprino, 

Aldo Gangemi, Valentina Presutti
15th International Semantic Web Conference (ISWC 2016)

Kobe, Japan, October 17-23, 2016
Paper: https://w3id.org/people/essepuntato/papers/food-iswc2016.html
Ontologies+Data: http://dx.doi.org/10.6084/m9.figshare.3187903
+
Outline
• Quality schemes and policy documents
• The FOOD project
• FOOD ontologies
• FOOD data
• Conclusions
+
Quality schemes
• European Union (EU) quality schemes for agricultural and
food products refer to agricultural products and foodstuffs,
wines, spirits and aromatised wines, which producers or
producer groups have registered according to specific rules
so as to assure consumers that these are the genuine
products, not imitations seeking to benefit from the good name
and reputation of the original
– PDO (Protected Designation of Origin)
– PGI (Protected Geographical Indication)
– TSG (Traditional Speciality Guaranteed)
• Issue: management of quality schemes is not fully automatised
at the EU level, and no standards are used in the definition of
the rules that should regulate them
+
Policy documents
• Italian situation: the rules one has to follow for
applying for a quality scheme are regulated by
particular policy documents (a.k.a. product
specifications)
• What’s in there
– First article: the protected name and the types of
related products of such name
– Second article: the raw materials which contribute to
the products composition
– Third article: production area
– Rest of the articles: physical/chemical/organoleptic
characteristics of the products + historical/
geographical evidences supporting the production
area + methods used for producing the products of
that protected name
Italian label
applied to
wines
compliant to
a particular
policy
document
+
FOOD project
• The FOod in Open Data (FOOD) project was conceived and carried out by the
Ministry of Agriculture (MIPAAF), the Italian Digital Agency (AgID), and the
STLab
• Main goal: to extract the data contained in the textual content (from DOCX and
PDF files) of the more than more than 800 policy documents of Italian
agricultural PDO, PGI and TSG products, and to make them available as LOD
according to appropriate ontologies to be developed within the project
• Three main concepts:
– protected name, i.e. a trademark label (e.g., “Abruzzo DOC” for a particular Italian
wine), issued by an authority granted for certifying agricultural products and
foodstuffs, that typically belongs to a certification scheme (i.e. PDO, PGI, TSG);
– type, i.e., a possible type for certain products (e.g., “white” or “red” for wines);
– product, i.e., the agricultural product or foodstuff (e.g., “Abruzzo DOC red wine”)
which has a type, refers to a protected name disciplined by a certain policy
document, and is described in terms of its raw materials (the particular wine varietal
used) and physical/chemical/organolectic characteristics (colour, smell, flavour, etc.)
https://w3id.org/food
+
Ontologies
• We needed several ontologies for describing data related
to each of the products introduced in the policy documents
• The ontology development process has been split in three
sequential steps
1.eXtreme Design (XD) methodology for developing an upper
ontology describing the concepts protected name, type, product
and the related contextual information (characteristics, raw
material, etc.)
2.Template-based approach based on the upper ontology for
developing other 20 ontologies (one for each kind of the
products we had to handle)
3.Alignment to other existing and relevant ontologies (i.e.
AGROVOC, DOLCE, DBpedia and Wordnet)
+
Upper Ontology
• Two ontology engineers
analysed all the Italian
policy documents about
agricultural products
• Domain experts of
MIPAAF and AgID have
been involved as well so
as to identify the main
high-level concepts
characterising the
domain
• EU schemes for
protected names were
also considered so as to
take into account how
they are organised at
the European level
+
Other ontologies
For speeding up
the process, we
involved three
more ontology
engineers, and we
provided a generic
Graffoo template
created starting
from the parts of
the upper ontology
that had to be
extended, so as to
guide the
development of all
the other
ontologies in an
homogeneous way
+
Data extraction
563 policy
documents on wines
274 policy documents
on other products
INPUT OUTPUT
EXTRACTION VALIDATION CONVERSION
Scripts
convert
CSV
documents
into RDF
documents
People from our
lab gather data by
reading
documents
Scripts extract
relevant data from
source documents
Scripts
convert
validated
data into
CSV
documents
Experts (MIPAAF
+ AgID)
correct and
validate data
+
Extracting DBpedia entities referring to production places,
revised by humans, and aligned existing Italian PA LOD
+
Data: example
product:wine-abruzzo-red
a upper:Product , wine:Wine ;
rdfs:label
"Wine 'Abruzzo' of type 'Rosso'"@en ,
"Vino 'Abruzzo' Rosso"@it ;
upper:hasProtectedName
name:wine-abruzzo ;
upper:hasDescription
descraw:wine-abruzzo-red-raw-material-1 ,
descraw:wine-abruzzo-red-raw-material-2 ;
upper:hasType type:red-wine .
descraw:wine-abruzzo-red-raw-material-1
a upper:DescriptionOfRawMaterial ;
rdfs:label "Vitigno 'Montepulciano': minimo (>=) 80%" ;
upper:hasRawMaterial rawmat:vine-variety-montepulciano ;
upper:hasMinimumValue "80%" .
A Turtle excerpt of the data about
the wine “Abruzzo Rosso”…
… and its
rendering in HTML
by means of
LOD View
+
Conclusions
• We have described the outcomes of an e-government project named FOOD
– FOod in Open Data – carried out in the context of a collaboration between
STLab, MIPAAF and AgID
– Input: 837 policy documents
– Outcome: 21 ontologies for describing protected names and their products + LOD
– Reuse: Upper Ontology for the Umbrian portal on PDO/PGI/TSG products
• Lessons learnt
– The automatic extraction of data from the policy documents – even when they are
generally structured similarly – it is quite difficult and the results are far from the
quality required for being published – the experts intervention was quite huge
– A manual extraction of such data would have prevented the introduction of such a
large number of mistakes
– The analysis done have enabled the refactor of existing templates for policy
documents so as to write relevant information in an homogeneous manner – a
crucial point for guaranteeing automatic processing in the future
+
Thanks for your attention
Silvio Peroni, Giorgia Lodi, Luigi Asprino, 

Aldo Gangemi, Valentina Presutti
15th International Semantic Web Conference (ISWC 2016)

Kobe, Japan, October 17-23, 2016
Paper: https://w3id.org/people/essepuntato/papers/food-iswc2016.html
Ontologies+Data: http://dx.doi.org/10.6084/m9.figshare.3187903

More Related Content

Similar to FOOD: FOod in Open Data

LIFE FRONT project
LIFE FRONT projectLIFE FRONT project
LIFE FRONT project
Klara Zolcer Skacanova
 
The European Innovation Partnership on Water Online Marketplace
The European Innovation Partnership on Water Online MarketplaceThe European Innovation Partnership on Water Online Marketplace
The European Innovation Partnership on Water Online Marketplace
Martin Kaltenböck
 
Sdc ppt 8 mar 21
Sdc ppt 8 mar 21Sdc ppt 8 mar 21
Sdc ppt 8 mar 21
Vijay Prakash
 
János-István Petrusán
János-István PetrusánJános-István Petrusán
János-István Petrusán
ERIAFF2014
 
H2020 ICT calls
H2020 ICT callsH2020 ICT calls
20120619 EU BC&E Towards a global standard with sustainability criteria for b...
20120619 EU BC&E Towards a global standard with sustainability criteria for b...20120619 EU BC&E Towards a global standard with sustainability criteria for b...
20120619 EU BC&E Towards a global standard with sustainability criteria for b...
Jarno Dakhorst
 
2020-12-10 EC - PEF for SMEs.pdf
2020-12-10 EC - PEF for SMEs.pdf2020-12-10 EC - PEF for SMEs.pdf
2020-12-10 EC - PEF for SMEs.pdf
Anna Zhenova
 
ICOS Services and Products
ICOS Services and Products ICOS Services and Products
ICOS Services and Products
Integrated Carbon Observation System (ICOS)
 
OECD Green Talks LIVE: Global eChemPortal to information on chemical substances
OECD Green Talks LIVE: Global eChemPortal to information on chemical substancesOECD Green Talks LIVE: Global eChemPortal to information on chemical substances
OECD Green Talks LIVE: Global eChemPortal to information on chemical substances
OECD Environment
 
FIspace Infodays Poznan 18 Sep 2013
FIspace Infodays Poznan 18 Sep 2013FIspace Infodays Poznan 18 Sep 2013
FIspace Infodays Poznan 18 Sep 2013
Sjaak Wolfert
 
Technical Status Quo of the SCIP Database, Dissemination Platform & Software ...
Technical Status Quo of the SCIP Database, Dissemination Platform & Software ...Technical Status Quo of the SCIP Database, Dissemination Platform & Software ...
Technical Status Quo of the SCIP Database, Dissemination Platform & Software ...
raj takhar
 
H2020 Open Data Pilot
H2020 Open Data PilotH2020 Open Data Pilot
H2020 Open Data Pilot
Sarah Jones
 
Publication of INSPIRE-based agricultural linked data
Publication of INSPIRE-based agricultural linked dataPublication of INSPIRE-based agricultural linked data
Publication of INSPIRE-based agricultural linked data
Raul Palma
 
GRIN- Global platform and CIP database structure
GRIN- Global platform and CIP database structureGRIN- Global platform and CIP database structure
GRIN- Global platform and CIP database structure
Edwin Rojas
 
Financement de l'innovation par les subsides européens - Le Point du LIEGE sc...
Financement de l'innovation par les subsides européens - Le Point du LIEGE sc...Financement de l'innovation par les subsides européens - Le Point du LIEGE sc...
Financement de l'innovation par les subsides européens - Le Point du LIEGE sc...
Interface ULg, LIEGE science park
 
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
Dr. Haxel Consult
 
Agro know Food Safety Challenge for the Future Food Hack 2015
Agro know Food Safety Challenge for the Future Food Hack 2015Agro know Food Safety Challenge for the Future Food Hack 2015
Agro know Food Safety Challenge for the Future Food Hack 2015
cthanopoulos
 
Ormala - Industrial Innovation in transition; Big data
Ormala - Industrial Innovation in transition; Big dataOrmala - Industrial Innovation in transition; Big data
Ormala - Industrial Innovation in transition; Big data
innovationoecd
 
FP7-ICT Programme
FP7-ICT ProgrammeFP7-ICT Programme
FP7-ICT Programme
Leandro Ciuffo
 
Co2Logistic Tool
Co2Logistic ToolCo2Logistic Tool
Co2Logistic Tool
InclamSoft
 

Similar to FOOD: FOod in Open Data (20)

LIFE FRONT project
LIFE FRONT projectLIFE FRONT project
LIFE FRONT project
 
The European Innovation Partnership on Water Online Marketplace
The European Innovation Partnership on Water Online MarketplaceThe European Innovation Partnership on Water Online Marketplace
The European Innovation Partnership on Water Online Marketplace
 
Sdc ppt 8 mar 21
Sdc ppt 8 mar 21Sdc ppt 8 mar 21
Sdc ppt 8 mar 21
 
János-István Petrusán
János-István PetrusánJános-István Petrusán
János-István Petrusán
 
H2020 ICT calls
H2020 ICT callsH2020 ICT calls
H2020 ICT calls
 
20120619 EU BC&E Towards a global standard with sustainability criteria for b...
20120619 EU BC&E Towards a global standard with sustainability criteria for b...20120619 EU BC&E Towards a global standard with sustainability criteria for b...
20120619 EU BC&E Towards a global standard with sustainability criteria for b...
 
2020-12-10 EC - PEF for SMEs.pdf
2020-12-10 EC - PEF for SMEs.pdf2020-12-10 EC - PEF for SMEs.pdf
2020-12-10 EC - PEF for SMEs.pdf
 
ICOS Services and Products
ICOS Services and Products ICOS Services and Products
ICOS Services and Products
 
OECD Green Talks LIVE: Global eChemPortal to information on chemical substances
OECD Green Talks LIVE: Global eChemPortal to information on chemical substancesOECD Green Talks LIVE: Global eChemPortal to information on chemical substances
OECD Green Talks LIVE: Global eChemPortal to information on chemical substances
 
FIspace Infodays Poznan 18 Sep 2013
FIspace Infodays Poznan 18 Sep 2013FIspace Infodays Poznan 18 Sep 2013
FIspace Infodays Poznan 18 Sep 2013
 
Technical Status Quo of the SCIP Database, Dissemination Platform & Software ...
Technical Status Quo of the SCIP Database, Dissemination Platform & Software ...Technical Status Quo of the SCIP Database, Dissemination Platform & Software ...
Technical Status Quo of the SCIP Database, Dissemination Platform & Software ...
 
H2020 Open Data Pilot
H2020 Open Data PilotH2020 Open Data Pilot
H2020 Open Data Pilot
 
Publication of INSPIRE-based agricultural linked data
Publication of INSPIRE-based agricultural linked dataPublication of INSPIRE-based agricultural linked data
Publication of INSPIRE-based agricultural linked data
 
GRIN- Global platform and CIP database structure
GRIN- Global platform and CIP database structureGRIN- Global platform and CIP database structure
GRIN- Global platform and CIP database structure
 
Financement de l'innovation par les subsides européens - Le Point du LIEGE sc...
Financement de l'innovation par les subsides européens - Le Point du LIEGE sc...Financement de l'innovation par les subsides européens - Le Point du LIEGE sc...
Financement de l'innovation par les subsides européens - Le Point du LIEGE sc...
 
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
 
Agro know Food Safety Challenge for the Future Food Hack 2015
Agro know Food Safety Challenge for the Future Food Hack 2015Agro know Food Safety Challenge for the Future Food Hack 2015
Agro know Food Safety Challenge for the Future Food Hack 2015
 
Ormala - Industrial Innovation in transition; Big data
Ormala - Industrial Innovation in transition; Big dataOrmala - Industrial Innovation in transition; Big data
Ormala - Industrial Innovation in transition; Big data
 
FP7-ICT Programme
FP7-ICT ProgrammeFP7-ICT Programme
FP7-ICT Programme
 
Co2Logistic Tool
Co2Logistic ToolCo2Logistic Tool
Co2Logistic Tool
 

More from University of Bologna

The Initiative for Open Citations and the OpenCitations Corpus
The Initiative for Open Citations and the OpenCitations CorpusThe Initiative for Open Citations and the OpenCitations Corpus
The Initiative for Open Citations and the OpenCitations Corpus
University of Bologna
 
OpenCitations
OpenCitationsOpenCitations
OpenCitations
University of Bologna
 
A document-inspired way for tracking changes of RDF data - The case of the Op...
A document-inspired way for tracking changes of RDF data - The case of the Op...A document-inspired way for tracking changes of RDF data - The case of the Op...
A document-inspired way for tracking changes of RDF data - The case of the Op...
University of Bologna
 
A Simplified Agile Methodology for Ontology Development
A Simplified Agile Methodology for Ontology DevelopmentA Simplified Agile Methodology for Ontology Development
A Simplified Agile Methodology for Ontology Development
University of Bologna
 
Freedom for bibliographic references: OpenCitations arise
Freedom for bibliographic references: OpenCitations ariseFreedom for bibliographic references: OpenCitations arise
Freedom for bibliographic references: OpenCitations arise
University of Bologna
 
A pattern-based ontology for describing publishing workflows
A pattern-based ontology for describing publishing workflowsA pattern-based ontology for describing publishing workflows
A pattern-based ontology for describing publishing workflows
University of Bologna
 
Semantic lenses to bring digital and semantic publishing together
Semantic lenses to bring digital and semantic publishing togetherSemantic lenses to bring digital and semantic publishing together
Semantic lenses to bring digital and semantic publishing together
University of Bologna
 
Zeri e LODE
: Extracting the Zeri photo archive to Linked Open Data: formaliz...
Zeri e LODE
: Extracting the Zeri photo archive to Linked Open Data: formaliz...Zeri e LODE
: Extracting the Zeri photo archive to Linked Open Data: formaliz...
Zeri e LODE
: Extracting the Zeri photo archive to Linked Open Data: formaliz...
University of Bologna
 
Characterising citations in scholarly articles: an experiment
Characterising citations in scholarly articles: an experimentCharacterising citations in scholarly articles: an experiment
Characterising citations in scholarly articles: an experiment
University of Bologna
 
Bringing semantic publishing into TEI: ideas and pointers
Bringing semantic publishing into TEI: ideas and pointersBringing semantic publishing into TEI: ideas and pointers
Bringing semantic publishing into TEI: ideas and pointers
University of Bologna
 
Tracking Changes through EARMARK: a Theoretical Perspective and an Implementa...
Tracking Changes through EARMARK: a Theoretical Perspective and an Implementa...Tracking Changes through EARMARK: a Theoretical Perspective and an Implementa...
Tracking Changes through EARMARK: a Theoretical Perspective and an Implementa...
University of Bologna
 
Towards the automatic identification of the nature of citations
Towards the automatic identification of the nature of citationsTowards the automatic identification of the nature of citations
Towards the automatic identification of the nature of citations
University of Bologna
 
The Live OWL Documentation Environment: a tool for the automatic generation o...
The Live OWL Documentation Environment: a tool for the automatic generation o...The Live OWL Documentation Environment: a tool for the automatic generation o...
The Live OWL Documentation Environment: a tool for the automatic generation o...
University of Bologna
 
Scholarly publishing and Linked Data: describing roles, statuses, temporal an...
Scholarly publishing and Linked Data: describing roles, statuses, temporal an...Scholarly publishing and Linked Data: describing roles, statuses, temporal an...
Scholarly publishing and Linked Data: describing roles, statuses, temporal an...
University of Bologna
 
Embedding semantic annotations within texts: the FRETTA approach
Embedding semantic annotations within texts: the FRETTA approachEmbedding semantic annotations within texts: the FRETTA approach
Embedding semantic annotations within texts: the FRETTA approach
University of Bologna
 
Dealing with Markup Semantics
Dealing with Markup SemanticsDealing with Markup Semantics
Dealing with Markup Semantics
University of Bologna
 
Handling Markup Overlaps Using OWL
Handling Markup Overlaps Using OWLHandling Markup Overlaps Using OWL
Handling Markup Overlaps Using OWL
University of Bologna
 

More from University of Bologna (17)

The Initiative for Open Citations and the OpenCitations Corpus
The Initiative for Open Citations and the OpenCitations CorpusThe Initiative for Open Citations and the OpenCitations Corpus
The Initiative for Open Citations and the OpenCitations Corpus
 
OpenCitations
OpenCitationsOpenCitations
OpenCitations
 
A document-inspired way for tracking changes of RDF data - The case of the Op...
A document-inspired way for tracking changes of RDF data - The case of the Op...A document-inspired way for tracking changes of RDF data - The case of the Op...
A document-inspired way for tracking changes of RDF data - The case of the Op...
 
A Simplified Agile Methodology for Ontology Development
A Simplified Agile Methodology for Ontology DevelopmentA Simplified Agile Methodology for Ontology Development
A Simplified Agile Methodology for Ontology Development
 
Freedom for bibliographic references: OpenCitations arise
Freedom for bibliographic references: OpenCitations ariseFreedom for bibliographic references: OpenCitations arise
Freedom for bibliographic references: OpenCitations arise
 
A pattern-based ontology for describing publishing workflows
A pattern-based ontology for describing publishing workflowsA pattern-based ontology for describing publishing workflows
A pattern-based ontology for describing publishing workflows
 
Semantic lenses to bring digital and semantic publishing together
Semantic lenses to bring digital and semantic publishing togetherSemantic lenses to bring digital and semantic publishing together
Semantic lenses to bring digital and semantic publishing together
 
Zeri e LODE
: Extracting the Zeri photo archive to Linked Open Data: formaliz...
Zeri e LODE
: Extracting the Zeri photo archive to Linked Open Data: formaliz...Zeri e LODE
: Extracting the Zeri photo archive to Linked Open Data: formaliz...
Zeri e LODE
: Extracting the Zeri photo archive to Linked Open Data: formaliz...
 
Characterising citations in scholarly articles: an experiment
Characterising citations in scholarly articles: an experimentCharacterising citations in scholarly articles: an experiment
Characterising citations in scholarly articles: an experiment
 
Bringing semantic publishing into TEI: ideas and pointers
Bringing semantic publishing into TEI: ideas and pointersBringing semantic publishing into TEI: ideas and pointers
Bringing semantic publishing into TEI: ideas and pointers
 
Tracking Changes through EARMARK: a Theoretical Perspective and an Implementa...
Tracking Changes through EARMARK: a Theoretical Perspective and an Implementa...Tracking Changes through EARMARK: a Theoretical Perspective and an Implementa...
Tracking Changes through EARMARK: a Theoretical Perspective and an Implementa...
 
Towards the automatic identification of the nature of citations
Towards the automatic identification of the nature of citationsTowards the automatic identification of the nature of citations
Towards the automatic identification of the nature of citations
 
The Live OWL Documentation Environment: a tool for the automatic generation o...
The Live OWL Documentation Environment: a tool for the automatic generation o...The Live OWL Documentation Environment: a tool for the automatic generation o...
The Live OWL Documentation Environment: a tool for the automatic generation o...
 
Scholarly publishing and Linked Data: describing roles, statuses, temporal an...
Scholarly publishing and Linked Data: describing roles, statuses, temporal an...Scholarly publishing and Linked Data: describing roles, statuses, temporal an...
Scholarly publishing and Linked Data: describing roles, statuses, temporal an...
 
Embedding semantic annotations within texts: the FRETTA approach
Embedding semantic annotations within texts: the FRETTA approachEmbedding semantic annotations within texts: the FRETTA approach
Embedding semantic annotations within texts: the FRETTA approach
 
Dealing with Markup Semantics
Dealing with Markup SemanticsDealing with Markup Semantics
Dealing with Markup Semantics
 
Handling Markup Overlaps Using OWL
Handling Markup Overlaps Using OWLHandling Markup Overlaps Using OWL
Handling Markup Overlaps Using OWL
 

Recently uploaded

Lattice Defects in ionic solid compound.pptx
Lattice Defects in ionic solid compound.pptxLattice Defects in ionic solid compound.pptx
Lattice Defects in ionic solid compound.pptx
DrRajeshDas
 
Injection: Risks and challenges - Injection of CO2 into geological rock forma...
Injection: Risks and challenges - Injection of CO2 into geological rock forma...Injection: Risks and challenges - Injection of CO2 into geological rock forma...
Injection: Risks and challenges - Injection of CO2 into geological rock forma...
Oeko-Institut
 
gastroretentive drug delivery system-PPT.pptx
gastroretentive drug delivery system-PPT.pptxgastroretentive drug delivery system-PPT.pptx
gastroretentive drug delivery system-PPT.pptx
Shekar Boddu
 
Pests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdfPests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdf
PirithiRaju
 
Compositions of iron-meteorite parent bodies constrainthe structure of the pr...
Compositions of iron-meteorite parent bodies constrainthe structure of the pr...Compositions of iron-meteorite parent bodies constrainthe structure of the pr...
Compositions of iron-meteorite parent bodies constrainthe structure of the pr...
Sérgio Sacani
 
Immunotherapy presentation from clinical immunology
Immunotherapy presentation from clinical immunologyImmunotherapy presentation from clinical immunology
Immunotherapy presentation from clinical immunology
VetriVel359477
 
Flow chart.pdf LIFE SCIENCES CSIR UGC NET CONTENT
Flow chart.pdf  LIFE SCIENCES CSIR UGC NET CONTENTFlow chart.pdf  LIFE SCIENCES CSIR UGC NET CONTENT
Flow chart.pdf LIFE SCIENCES CSIR UGC NET CONTENT
savindersingh16
 
cathode ray oscilloscope and its applications
cathode ray oscilloscope and its applicationscathode ray oscilloscope and its applications
cathode ray oscilloscope and its applications
sandertein
 
2001_Book_HumanChromosomes - Genéticapdf
2001_Book_HumanChromosomes - Genéticapdf2001_Book_HumanChromosomes - Genéticapdf
2001_Book_HumanChromosomes - Genéticapdf
lucianamillenium
 
LEARNING TO LIVE WITH LAWS OF MOTION .pptx
LEARNING TO LIVE WITH LAWS OF MOTION .pptxLEARNING TO LIVE WITH LAWS OF MOTION .pptx
LEARNING TO LIVE WITH LAWS OF MOTION .pptx
yourprojectpartner05
 
11.1 Role of physical biological in deterioration of grains.pdf
11.1 Role of physical biological in deterioration of grains.pdf11.1 Role of physical biological in deterioration of grains.pdf
11.1 Role of physical biological in deterioration of grains.pdf
PirithiRaju
 
Methods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdfMethods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdf
PirithiRaju
 
Quality assurance B.pharm 6th semester BP606T UNIT 5
Quality assurance B.pharm 6th semester BP606T UNIT 5Quality assurance B.pharm 6th semester BP606T UNIT 5
Quality assurance B.pharm 6th semester BP606T UNIT 5
vimalveerammal
 
Introduction_Ch_01_Biotech Biotechnology course .pptx
Introduction_Ch_01_Biotech Biotechnology course .pptxIntroduction_Ch_01_Biotech Biotechnology course .pptx
Introduction_Ch_01_Biotech Biotechnology course .pptx
QusayMaghayerh
 
Embracing Deep Variability For Reproducibility and Replicability
Embracing Deep Variability For Reproducibility and ReplicabilityEmbracing Deep Variability For Reproducibility and Replicability
Embracing Deep Variability For Reproducibility and Replicability
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
PirithiRaju
 
SDSS1335+0728: The awakening of a ∼ 106M⊙ black hole⋆
SDSS1335+0728: The awakening of a ∼ 106M⊙ black hole⋆SDSS1335+0728: The awakening of a ∼ 106M⊙ black hole⋆
SDSS1335+0728: The awakening of a ∼ 106M⊙ black hole⋆
Sérgio Sacani
 
Farming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptxFarming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptx
Frédéric Baudron
 
Direct Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart AgricultureDirect Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart Agriculture
International Food Policy Research Institute- South Asia Office
 
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Sérgio Sacani
 

Recently uploaded (20)

Lattice Defects in ionic solid compound.pptx
Lattice Defects in ionic solid compound.pptxLattice Defects in ionic solid compound.pptx
Lattice Defects in ionic solid compound.pptx
 
Injection: Risks and challenges - Injection of CO2 into geological rock forma...
Injection: Risks and challenges - Injection of CO2 into geological rock forma...Injection: Risks and challenges - Injection of CO2 into geological rock forma...
Injection: Risks and challenges - Injection of CO2 into geological rock forma...
 
gastroretentive drug delivery system-PPT.pptx
gastroretentive drug delivery system-PPT.pptxgastroretentive drug delivery system-PPT.pptx
gastroretentive drug delivery system-PPT.pptx
 
Pests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdfPests of Storage_Identification_Dr.UPR.pdf
Pests of Storage_Identification_Dr.UPR.pdf
 
Compositions of iron-meteorite parent bodies constrainthe structure of the pr...
Compositions of iron-meteorite parent bodies constrainthe structure of the pr...Compositions of iron-meteorite parent bodies constrainthe structure of the pr...
Compositions of iron-meteorite parent bodies constrainthe structure of the pr...
 
Immunotherapy presentation from clinical immunology
Immunotherapy presentation from clinical immunologyImmunotherapy presentation from clinical immunology
Immunotherapy presentation from clinical immunology
 
Flow chart.pdf LIFE SCIENCES CSIR UGC NET CONTENT
Flow chart.pdf  LIFE SCIENCES CSIR UGC NET CONTENTFlow chart.pdf  LIFE SCIENCES CSIR UGC NET CONTENT
Flow chart.pdf LIFE SCIENCES CSIR UGC NET CONTENT
 
cathode ray oscilloscope and its applications
cathode ray oscilloscope and its applicationscathode ray oscilloscope and its applications
cathode ray oscilloscope and its applications
 
2001_Book_HumanChromosomes - Genéticapdf
2001_Book_HumanChromosomes - Genéticapdf2001_Book_HumanChromosomes - Genéticapdf
2001_Book_HumanChromosomes - Genéticapdf
 
LEARNING TO LIVE WITH LAWS OF MOTION .pptx
LEARNING TO LIVE WITH LAWS OF MOTION .pptxLEARNING TO LIVE WITH LAWS OF MOTION .pptx
LEARNING TO LIVE WITH LAWS OF MOTION .pptx
 
11.1 Role of physical biological in deterioration of grains.pdf
11.1 Role of physical biological in deterioration of grains.pdf11.1 Role of physical biological in deterioration of grains.pdf
11.1 Role of physical biological in deterioration of grains.pdf
 
Methods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdfMethods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdf
 
Quality assurance B.pharm 6th semester BP606T UNIT 5
Quality assurance B.pharm 6th semester BP606T UNIT 5Quality assurance B.pharm 6th semester BP606T UNIT 5
Quality assurance B.pharm 6th semester BP606T UNIT 5
 
Introduction_Ch_01_Biotech Biotechnology course .pptx
Introduction_Ch_01_Biotech Biotechnology course .pptxIntroduction_Ch_01_Biotech Biotechnology course .pptx
Introduction_Ch_01_Biotech Biotechnology course .pptx
 
Embracing Deep Variability For Reproducibility and Replicability
Embracing Deep Variability For Reproducibility and ReplicabilityEmbracing Deep Variability For Reproducibility and Replicability
Embracing Deep Variability For Reproducibility and Replicability
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
 
SDSS1335+0728: The awakening of a ∼ 106M⊙ black hole⋆
SDSS1335+0728: The awakening of a ∼ 106M⊙ black hole⋆SDSS1335+0728: The awakening of a ∼ 106M⊙ black hole⋆
SDSS1335+0728: The awakening of a ∼ 106M⊙ black hole⋆
 
Farming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptxFarming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptx
 
Direct Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart AgricultureDirect Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart Agriculture
 
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
 

FOOD: FOod in Open Data

  • 1. + FOOD: FOod in Open Data Silvio Peroni, Giorgia Lodi, Luigi Asprino, 
 Aldo Gangemi, Valentina Presutti 15th International Semantic Web Conference (ISWC 2016)
 Kobe, Japan, October 17-23, 2016 Paper: https://w3id.org/people/essepuntato/papers/food-iswc2016.html Ontologies+Data: http://dx.doi.org/10.6084/m9.figshare.3187903
  • 2. + Outline • Quality schemes and policy documents • The FOOD project • FOOD ontologies • FOOD data • Conclusions
  • 3. + Quality schemes • European Union (EU) quality schemes for agricultural and food products refer to agricultural products and foodstuffs, wines, spirits and aromatised wines, which producers or producer groups have registered according to specific rules so as to assure consumers that these are the genuine products, not imitations seeking to benefit from the good name and reputation of the original – PDO (Protected Designation of Origin) – PGI (Protected Geographical Indication) – TSG (Traditional Speciality Guaranteed) • Issue: management of quality schemes is not fully automatised at the EU level, and no standards are used in the definition of the rules that should regulate them
  • 4. + Policy documents • Italian situation: the rules one has to follow for applying for a quality scheme are regulated by particular policy documents (a.k.a. product specifications) • What’s in there – First article: the protected name and the types of related products of such name – Second article: the raw materials which contribute to the products composition – Third article: production area – Rest of the articles: physical/chemical/organoleptic characteristics of the products + historical/ geographical evidences supporting the production area + methods used for producing the products of that protected name Italian label applied to wines compliant to a particular policy document
  • 5. + FOOD project • The FOod in Open Data (FOOD) project was conceived and carried out by the Ministry of Agriculture (MIPAAF), the Italian Digital Agency (AgID), and the STLab • Main goal: to extract the data contained in the textual content (from DOCX and PDF files) of the more than more than 800 policy documents of Italian agricultural PDO, PGI and TSG products, and to make them available as LOD according to appropriate ontologies to be developed within the project • Three main concepts: – protected name, i.e. a trademark label (e.g., “Abruzzo DOC” for a particular Italian wine), issued by an authority granted for certifying agricultural products and foodstuffs, that typically belongs to a certification scheme (i.e. PDO, PGI, TSG); – type, i.e., a possible type for certain products (e.g., “white” or “red” for wines); – product, i.e., the agricultural product or foodstuff (e.g., “Abruzzo DOC red wine”) which has a type, refers to a protected name disciplined by a certain policy document, and is described in terms of its raw materials (the particular wine varietal used) and physical/chemical/organolectic characteristics (colour, smell, flavour, etc.) https://w3id.org/food
  • 6. + Ontologies • We needed several ontologies for describing data related to each of the products introduced in the policy documents • The ontology development process has been split in three sequential steps 1.eXtreme Design (XD) methodology for developing an upper ontology describing the concepts protected name, type, product and the related contextual information (characteristics, raw material, etc.) 2.Template-based approach based on the upper ontology for developing other 20 ontologies (one for each kind of the products we had to handle) 3.Alignment to other existing and relevant ontologies (i.e. AGROVOC, DOLCE, DBpedia and Wordnet)
  • 7. + Upper Ontology • Two ontology engineers analysed all the Italian policy documents about agricultural products • Domain experts of MIPAAF and AgID have been involved as well so as to identify the main high-level concepts characterising the domain • EU schemes for protected names were also considered so as to take into account how they are organised at the European level
  • 8. + Other ontologies For speeding up the process, we involved three more ontology engineers, and we provided a generic Graffoo template created starting from the parts of the upper ontology that had to be extended, so as to guide the development of all the other ontologies in an homogeneous way
  • 9. + Data extraction 563 policy documents on wines 274 policy documents on other products INPUT OUTPUT EXTRACTION VALIDATION CONVERSION Scripts convert CSV documents into RDF documents People from our lab gather data by reading documents Scripts extract relevant data from source documents Scripts convert validated data into CSV documents Experts (MIPAAF + AgID) correct and validate data + Extracting DBpedia entities referring to production places, revised by humans, and aligned existing Italian PA LOD
  • 10. + Data: example product:wine-abruzzo-red a upper:Product , wine:Wine ; rdfs:label "Wine 'Abruzzo' of type 'Rosso'"@en , "Vino 'Abruzzo' Rosso"@it ; upper:hasProtectedName name:wine-abruzzo ; upper:hasDescription descraw:wine-abruzzo-red-raw-material-1 , descraw:wine-abruzzo-red-raw-material-2 ; upper:hasType type:red-wine . descraw:wine-abruzzo-red-raw-material-1 a upper:DescriptionOfRawMaterial ; rdfs:label "Vitigno 'Montepulciano': minimo (>=) 80%" ; upper:hasRawMaterial rawmat:vine-variety-montepulciano ; upper:hasMinimumValue "80%" . A Turtle excerpt of the data about the wine “Abruzzo Rosso”… … and its rendering in HTML by means of LOD View
  • 11. + Conclusions • We have described the outcomes of an e-government project named FOOD – FOod in Open Data – carried out in the context of a collaboration between STLab, MIPAAF and AgID – Input: 837 policy documents – Outcome: 21 ontologies for describing protected names and their products + LOD – Reuse: Upper Ontology for the Umbrian portal on PDO/PGI/TSG products • Lessons learnt – The automatic extraction of data from the policy documents – even when they are generally structured similarly – it is quite difficult and the results are far from the quality required for being published – the experts intervention was quite huge – A manual extraction of such data would have prevented the introduction of such a large number of mistakes – The analysis done have enabled the refactor of existing templates for policy documents so as to write relevant information in an homogeneous manner – a crucial point for guaranteeing automatic processing in the future
  • 12. + Thanks for your attention Silvio Peroni, Giorgia Lodi, Luigi Asprino, 
 Aldo Gangemi, Valentina Presutti 15th International Semantic Web Conference (ISWC 2016)
 Kobe, Japan, October 17-23, 2016 Paper: https://w3id.org/people/essepuntato/papers/food-iswc2016.html Ontologies+Data: http://dx.doi.org/10.6084/m9.figshare.3187903