Building a Knowledge
Graph for Agri-Food Sector
Dr. Raul Palma
Head of Data Analytics and Semantics Department
Poznan Supercomputing and Networking Center
Modeling Sustainability Workshop
Knowledge Graph Conference
3rd May 2021

The agri-food context
• Farm management
• Multiple activities and stakeholders
• Multiple data sources, types and formats
• Multiple applications, tools and devices
Schematic overview of relationships between farm management
and its environment (Sörensen, et al., 2010)

Data & modeling challenges in agri-food sector
Source: Accenture
The rapid advances of IoT technologies, AI and Big Data, among
others, have boosted the adoption of smart farming practices.
This has led to an explosion of data, generated by a wide range
of different systems and platforms that rarely interoperate.
• The resulting lack of integrated data access keeps value creation and
decision support from reaching the full potential of all the available data
Some of the key challenges hampering a seamless exchange and
integration of the data produced/collected by those systems are:
• Data available in different formats and represented according to different
models
• Heterogeneity of data models and semantics used to represent data
• Lack of dominant standards in this space
• Insufficient interoperability mechanisms for connecting existing
agri-food data models

The project(s) behind
Data integration challenges in the agri-food sector have been addressed in various key EU projects
…solutions for (big) data management, including
the harmonization and integration of a large
variety of data from many sources
…open and interoperable cloud-based
solution addressing the integration of
data relevant to farming production

Knowledge Graphs
Knowledge graphs (KGs) provide a flexible and efficient solution to
address many of those challenges.
• They can provide an integrated view over (initially) disconnected and
heterogeneous datasets,
• through the interlinking of different entities, typically by applying
Linked Data principles,
• improving data accessibility for both humans & machines,
• enabling the discovery of new knowledge,
• and in compliance with any privacy and access control needs.
http://lod-cloud.net/
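The interlinking idea above can be illustrated with a minimal sketch. It assumes two small, made-up datasets represented as sets of triples and one owl:sameAs link between them; every example.org IRI and value is hypothetical and not taken from the project data.

```python
# Illustrative only: all example.org IRIs and values below are made up.
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"

# Two initially disconnected datasets, each a set of (subject, predicate, object) triples.
farm_registry = {
    ("http://example.org/registry/field/42", "http://example.org/vocab/cropName", "winter wheat"),
}
sensor_data = {
    ("http://example.org/sensors/plot/A7", "http://example.org/vocab/soilMoisture", "0.31"),
}
# One link asserting that both IRIs identify the same field.
links = {
    ("http://example.org/registry/field/42", OWL_SAME_AS, "http://example.org/sensors/plot/A7"),
}


def integrated_view(entity, graph):
    """Collect predicate/object pairs for an entity and everything linked to it via owl:sameAs."""
    aliases = {entity}
    changed = True
    while changed:  # follow sameAs links in both directions until no new alias appears
        changed = False
        for s, p, o in graph:
            if p != OWL_SAME_AS:
                continue
            if s in aliases and o not in aliases:
                aliases.add(o)
                changed = True
            elif o in aliases and s not in aliases:
                aliases.add(s)
                changed = True
    return {(p, o) for s, p, o in graph if s in aliases and p != OWL_SAME_AS}


graph = farm_registry | sensor_data | links
view = integrated_view("http://example.org/registry/field/42", graph)
```

Asking for the registry field returns both the crop name and the soil moisture reading, although the two facts come from different sources.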

Reuse of a large repository of Linked Data related to agriculture, with
over 1 billion triples
Diagram labels: Linked Data Pipelines, IACS, Glosis, AIM

Use case: national crop data access & monitoring
Goal:
• To enable access to AgroDataCube (AGC), a large collection of both open and derived data from
the Netherlands for use in agri-food applications (provided by Wageningen Environmental Research), and
• To connect it with other open and widely used EU vocabularies (e.g., AgroVoc, Eurostat).
AGC exposes a REST API with various resources:
• Fields (crop registration datasets), Altitude, Meteo, Soil, NDVI
• Data is returned in GeoJSON format
Insights from this data may be relevant for
• Organisations collecting or validating agri-related indicators, e.g.,
paying agencies advisors looking for granular views of crops;
• Advisory organizations, e.g., looking to gain insights into the
distribution of crops in their region;
• Researchers, e.g., looking to find potential demonstration farms;
• Producers, e.g., looking to identify clusters of crops, etc.
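Since AGC returns GeoJSON, a first transformation step towards linked data could look roughly like the sketch below. It is an illustration under assumptions: the property names ("fieldid", "crop_name", "year") and the example.org namespaces are hypothetical and do not reflect the actual AGC schema or the vocabularies used in the project.

```python
import json

# Hypothetical GeoJSON in the general shape of a FeatureCollection; the property
# names ("fieldid", "crop_name", "year") are assumptions, not the AGC schema.
SAMPLE = json.loads("""
{
  "type": "FeatureCollection",
  "features": [
    {
      "type": "Feature",
      "geometry": {"type": "Point", "coordinates": [5.66, 51.97]},
      "properties": {"fieldid": 1001, "crop_name": "Maize", "year": 2019}
    }
  ]
}
""")

BASE = "http://example.org/field/"   # made-up namespace for field IRIs
VOCAB = "http://example.org/vocab/"  # made-up namespace for predicates


def feature_to_triples(feature):
    """Turn one GeoJSON Feature into (subject, predicate, object) triples."""
    props = feature["properties"]
    subject = BASE + str(props["fieldid"])
    triples = [(subject, VOCAB + key, value)
               for key, value in props.items() if key != "fieldid"]
    # Keep the geometry as a WKT-style string (point geometries only in this sketch).
    geom = feature["geometry"]
    if geom["type"] == "Point":
        lon, lat = geom["coordinates"]
        triples.append((subject, VOCAB + "asWKT", f"POINT({lon} {lat})"))
    return triples


triples = [t for f in SAMPLE["features"] for t in feature_to_triples(f)]
```

Each field becomes a subject IRI and each property a triple. Linking the crop name to a concept in a shared vocabulary would be a separate step.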

Visualize and exploit the linked data
Demo app: http://metaphactory.foodie-cloud.org/resource/:AGROVOC-crops

Use case: Farm Productivity and Sustainability Benchmarking
Goal:
• To enable benchmarking of the productivity and sustainability
performance of farms;
• To monitor and compare different conditions and parameters
affecting such indicators, and
• To collect the data and integrate it in a unified layer accessible by
decision support systems (DSS)
Such information is relevant for:
• Organisations collecting or validating agri-related indicators, e.g.,
paying agencies advisors who need a complete view at different
levels and to identify poorly performing regions;
• Advisory organizations, e.g., looking to gain insights into agri-indicators
in their region;
• Researchers, e.g., looking for regions with poor performance, or with
particular conditions;
• Producers, e.g., interested in identifying high-demand regions and
their challenges to customize their offer, etc.

Visualize and exploit the data
Measure: Total outcome, farm income, economic size, …
Type of Farming: Fieldcrops, Horticulture, Wine, …
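As a rough illustration of the benchmarking idea, the sketch below compares one measure across regions within each type of farming and flags the regions that fall below the median. The records, region names, and values are made up for the example, and the median is only one possible threshold, not the method used in the use case.

```python
from statistics import median

# Made-up records; region names and values are purely illustrative.
records = [
    {"region": "R1", "type_of_farming": "Fieldcrops", "farm_income": 30.0},
    {"region": "R2", "type_of_farming": "Fieldcrops", "farm_income": 50.0},
    {"region": "R3", "type_of_farming": "Fieldcrops", "farm_income": 10.0},
    {"region": "R1", "type_of_farming": "Wine", "farm_income": 80.0},
    {"region": "R2", "type_of_farming": "Wine", "farm_income": 60.0},
]


def below_median(records, measure):
    """Return (region, type_of_farming) pairs whose measure is below the median of their farming type."""
    by_type = {}
    for r in records:
        by_type.setdefault(r["type_of_farming"], []).append(r)
    flagged = []
    for farming_type, group in by_type.items():
        threshold = median(r[measure] for r in group)
        flagged += [(r["region"], farming_type) for r in group if r[measure] < threshold]
    return flagged


flagged = below_median(records, "farm_income")
```

Grouping by type of farming before comparing keeps, for example, wine regions from being judged against field-crop regions.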

Use case: Farm machinery management
This use case collects real-time telematics data from machinery in the field.
The data is collected by machinery sensors and is
stored and managed by SensLog.
SensLog is a web-based sensor data management system that:
• receives measured data (observations) directly from sensor devices;
• stores sensor data in the SensLog data model, implemented in an RDBMS;
• can pre-process and/or analyze data;
• publishes data through web services.

Visualize and exploit the linked data
SPARQL endpoint: http://senslogrdf.foodie-cloud.org/sparql
SNORQL search endpoint: http://senslogrdf.foodie-cloud.org/snorql/
Web-based visualization: http://senslogrdf.foodie-cloud.org/
Demo app: http://metaphactory.foodie-cloud.org/resource/:senslog-data
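A SPARQL endpoint such as the one listed above can be queried over HTTP using the standard SPARQL 1.1 Protocol. The sketch below only builds the request. The query is deliberately generic because the vocabulary used in the store is not described here, and whether the endpoint is still reachable is an assumption.

```python
from urllib.parse import urlencode
from urllib.request import Request

ENDPOINT = "http://senslogrdf.foodie-cloud.org/sparql"  # endpoint listed on the slide

# A generic query that makes no assumption about the vocabulary in the store.
QUERY = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 5"


def build_sparql_request(endpoint, query):
    """Build a SPARQL 1.1 Protocol 'query via GET' request asking for JSON results."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


request = build_sparql_request(ENDPOINT, QUERY)

# To actually send it (needs network access and a live endpoint):
#   import json
#   from urllib.request import urlopen
#   with urlopen(request) as resp:
#       bindings = json.load(resp)["results"]["bindings"]
```

Keeping request construction separate from sending makes the query easy to inspect or reuse against another endpoint.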

Behind the scenes peek

Agriculture Information Model - AIM
AIM aims to establish the basis of a common agricultural data
space and to enable the interoperation of different systems and the
analysis of the data produced by those systems in an integrated way.
AIM follows a modular approach in a layered architecture:
• realized as a suite of ontologies with corresponding JSON-LD
contexts and associated SHACL shapes;
• implemented in line with best practices, reusing existing
standards and well-scoped models;
• establishing alignments between base models to enable their
interoperability and the integration of existing data.
Palma R., Roussaki I., Döhmen T., et al. (2021). “Agriculture Information
Model” in D. D. Bochtis, et al. (Eds.). Information and Communication
Technologies for Agriculture—Theme III: Decision. Springer (TBP)
https://w3id.org/demeter/
https://github.com/rapw3k/DEMETER/tree/master/models
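To give a feel for what a JSON-LD context contributes, the sketch below maps the short terms of a plain JSON document to full IRIs. It is a deliberately simplified stand-in for JSON-LD expansion (flat documents only), and the context terms and example.org IRIs are hypothetical, not taken from the AIM contexts.

```python
# Hypothetical context: these terms and IRIs are NOT taken from AIM.
context = {
    "Crop": "http://example.org/agri#Crop",
    "name": "http://example.org/agri#name",
    "area": "http://example.org/agri#area",
}

document = {
    "@id": "http://example.org/crop/1",
    "@type": "Crop",
    "name": "Maize",
    "area": 12.5,
    "internalNote": "not mapped by the context",
}


def expand(doc, ctx):
    """Replace short terms with full IRIs (flat documents only).

    Keys without a mapping in the context are dropped, which mirrors how
    JSON-LD expansion treats terms it cannot resolve to an IRI.
    """
    expanded = {}
    for key, value in doc.items():
        if key == "@type":
            expanded[key] = ctx.get(value, value)
        elif key.startswith("@"):
            expanded[key] = value
        elif key in ctx:
            expanded[ctx[key]] = value
    return expanded


expanded = expand(document, context)
```

Two systems that use different short names can still agree on meaning as long as their contexts map those names to the same IRIs. A real implementation would rely on a JSON-LD processor rather than this hand-written mapping.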

Implementation
https://git.man.poznan.pl/stash/projects/DEM/repos/pipelines/browse

Visualization & exploitation
https://metaphacts.com/
Key features:
• Knowledge Graph Asset Management
• Rapid Application Building
• End-user oriented interaction

Future work & references
Extend pipelines for other preconfigured data types: soil data, weather data, etc.
Extend the implementation with additional capabilities:
• Enrichment, link discovery, etc.
• Additional pre-processing/post-processing methods
• Integration of additional tools to handle other data sources/cases (e.g., NoSQL databases, SPARQL transformations, etc.)
Triplestore endpoint: https://www.foodie-cloud.org/sparql
Faceted search: https://www.foodie-cloud.org/fct/
Demo applications: http://metaphactory.foodie-cloud.org/
Linked data pipelines Web service: https://dpi-enabler-demeter.apps.paas-dev.psnc.pl/api/swagger/
Linked data pipelines CLI source: https://git.man.poznan.pl/stash/projects/DEM/repos/pipelines/browse
AIM ontology: https://github.com/rapw3k/DEMETER/tree/master/models
Read more: https://blog.metaphacts.com/a-knowledge-graph-for-the-agri-food-sector
https://www.slideshare.net/rapw3k/presentations
Thank you!
rpalma@man.poznan.pl

Editor's Notes

  1. Farm management is a complex process that involves multiple activities carried out by farmers and other stakeholders, who have to manage multiple and heterogeneous data sources collected and generated through various applications, services and devices.
  2. Such process, however, has become even more complex in recent years. In particular
  3. Such data integration challenges
  4. The approach we adopted to address the data integration challenges in those projects was through the use of knowledge graphs.
  5. However, as we started creating and reusing more and more KGs to cover multiple use cases in multiple projects, we noticed a recurring process to get things up and running. This led us to the design and implementation of "Linked Data pipelines", which automate as much as possible the steps needed to transform and publish different input datasets from various heterogeneous sources as Linked Data. The pipelines connect different data processing components that carry out the transformation of data into RDF and their linking. They are re-executable, re-usable, adaptable and (semi-)automatic, and they work for both (mostly) static and dynamic (e.g., IoT) data.
  6. so that it can interoperate with other services, and be connected with datasets
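
The pipeline idea from the notes, connecting a transformation component and a linking component so that the same configuration can be re-executed on new input, can be sketched as follows. This is an illustration only and not the project's implementation: the step functions, the example.org namespace, and the concept lookup table are all hypothetical.

```python
EX = "http://example.org/"  # made-up namespace


def transform(rows):
    """Transformation step: turn input rows (dicts) into triples."""
    return [(EX + "field/" + str(r["id"]), EX + "crop", r["crop"]) for r in rows]


def link(triples):
    """Linking step: add a link from each crop name to a concept IRI, when one is known."""
    concepts = {"maize": EX + "concept/maize"}  # hypothetical lookup table
    extra = [(s, EX + "cropConcept", concepts[o.lower()])
             for s, p, o in triples
             if p == EX + "crop" and o.lower() in concepts]
    return triples + extra


def run_pipeline(data, steps):
    """Apply each step to the output of the previous one."""
    for step in steps:
        data = step(data)
    return data


# The list of steps is the reusable configuration; it can be re-executed on new input.
PIPELINE = [transform, link]
result = run_pipeline([{"id": 1, "crop": "Maize"}, {"id": 2, "crop": "Barley"}], PIPELINE)
```

Because each step takes and returns plain data, steps can be swapped or extended (for example with enrichment) without changing the runner.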