SlideShare a Scribd company logo
1 of 28
Download to read offline
Consumer Industry
Agribusiness Initiative
June 11, 2020
Understanding Agri Ontologies, Taxonomies &
Reference Data
Mehdi Charafeddine
Agenda
• Target Audience and How to Use
• Problem Statement
• Knowledge Representation in the Agri Landscape
• The journey into Ontologies
• Ontology Use Case
• How do we build and use Ontologies?
• Next Steps
2
Problem Statement
3
The Open Agri space is busy with emerging, living and Dying initiatives
We need to understand
what exists, what has
worked, and what hasn’t
We need to support our
own tactical and strategic
needs
We need to Establish
where we can and should
fit into this eco-system
Target Audience and How to Use
4
Chief Data Office
Agronomists
Data Scientists
Data Engineers
Data Stewards
Brand Managers
Sustainability and Supply Chain
Personnel
Client Context
Target Audience How to Use
The asset should be used with clients in the context of:
Ø Building marketplaces
Ø Building data platforms
Ø Enabling agronomy AI use cases
Ø Grower Management
IBM Community
Ø Presentation: Lighthouse, Seismic, Solution Gateway
Ø GitHub Repository: IBM GitHub
Ø Recorded Webinar: CBDS Data Platform, Data Architect
Community
Knowledge Representation
5
Knowledge
Management
Information
Management
Data
Management
Data is discrete,
facts have no meaning
in isolation
Data has relevance and
purpose. It informs and
causes the uses to change
state,
Knowledge is actionable
information placed in
context based on
facts and meaning.
Wisdom
Knowledge enables
Understanding necessary
For effective decision-making
Knowledge Modelling Landscape
6
• Ontology:
• Semantic data model
• Classify Things and define more
specific relations and attributes
• Local and global ontologies can be
combined
• Culturally neutral
• Taxonomy:
• Describes the organization’s
vocabulary, common terms, and
synonyms.
• Organize terms into hierarchy
• Reference Data:
• Set of well-known value for
attributes, translates into local
meaning
• Glossary:
• Define terms in a simple way
Ontology
Subject – Predicate - Object
Logical Data Model
Entity - Relationship
Taxonomy
Hierarchy – Tree Structure
Glossary / Dictionary
Term, Definitions, References
SEMANTIC
RICHNESS
LOW
HIGH
Farming Domain Ontologies
7
Field
Crop
Soil
Input
Environmental
Ontology Domains
SCOPE?
REPRESENTATION?
OWNERSHIP?
MAINTENANCE?
LICENSE TERMS?
HOW TO MODIFY?
Open Questions?
Reference Data
8
“Reference data are data that define the set of permissible values to be used
by other data fields. Reference data gain in value when they are widely re-used
and widely referenced. Typically, they do not change overly much in terms of
definition, apart from occasional revisions.” - Wikipedia
What are the common names of
the crops and what are the
correspondences across
languages and regions?? What sources of reference data
can we find for agriculture?
What are the terms and
conditions around the use of
reference data?
Can we find reference data for:
Crops, Growing Practices, Inputs,
Equipment, Soils, Weather
Knowledge Representation Journey
9
Chief
Data
Office
Glossary
Taxonomy
Ontology
Terms &
Conditions
Licensing
• Defining ontologies is a
core data governance
practice
• Building ontologies is a
continuous process
Coffee Farming Use Case
10
As An Agronomist
Manager,
I want to be able to
record known
performance of local
Arabica
and Robusta varieties,
So that agronomists
and nurseries know
the performance of
the variety and can
inform farmers of
potential benefits
KPIs to track
• Yield (metric
tons/hectare):
• Product type: Cherry,
Green Coffee or Parch.
• Measurement Tree
Density (tree / hectare)
• Cup quality expected
(optional): 1.0 (best) to
1.4 (worst)
• Compliance with Green
Coffee Specifications:
Y/N/Unknown
• Granulometry (100 beans
weight) – grams
• Resistance to drought
(Sensitive / Moderate /
Tolerant)
Coffee Quality Map
11
Cup Quality
Measures
• Aroma
• Flavour
• Aftertaste
• Acidity
• Body
• Balance
• Uniformity
• Cup Cleanliness
• Sweetness
• Moisture
• Defects
Farm Metadata
• Owner
• Country of Origin
• Region
• Farm Name
• Lot Number
• Holding Pattern
• Mill (on site)
• Company
• Location and Altitude
• Farm Map
• Farm Area
Bean Metadata
• Processing Method (wet
or dry process, washed or
natural)
• Bean size & density
• Bean Colour
• Species (Arabica /
Robusta)
• Roast appearance and
cup quality in relation to
flavour, characteristics
and cleanliness
Coffee Varieties
12
Two dominant coffee varieties out of 125!
Coffee is a long-term crop
with a lifespan of more than
10 years, and considerably
longer under good
management, thus the
choice of variety (cultivar) is
very important. As quality of
the coffee bean is crucial for
production of high-grade
coffee, choose only varieties
that are recommended for
your area.
Reference Data sources for Coffee
13
Production &
Yield
Disease
Resistance
Soil Type
Plant genotype
& taxonomy
Growing
Practices
Data Available
Modeling an Ontology for our Coffee Use Case: Where to Start?
14
Ontology Development Methodology
16
Define Entities
Define Use Cases
Ontologies
Guidelines
Ontologies
Tooling
Define Reference
Data
Identify Reusable
Ontologies
Create Ontology 1 Create Ontology 2 Create Ontology N
…
Collaboration
Process
Ontology
Repository
Framework
Domain Definition
Execution
Preparation
How does it fit with a CDO organization?
18
LOW
Ontologies
+
Knowledge Graphs
=
• Identify all
• Provide context
• Discover hidden facts
“An enterprise knowledge graph is a representation of an organization’s
knowledge domain and artefacts that is understood by both humans and machines”
Graph Database
Enterprise
Data Sources
Glossaries,
Taxonomies
• Also known as “triple store”
• Collection of references to
knowledge objects in their
source systems
• Store properties for each
object from the various
sources
• Store relationships between
those objects
• Variety of data sources
and systems
• Many disparate
systems
How does it fit with a CDO organization?
19
• Watson Decision Platform for Agriculture:
• The Electronic Field/Regional Record holds domain specific information across growing seasons
• Terms and Reference are used to define the allowable values for attributes such as:
• Crop type,
• Irrigation type,
• Tilling methods, etc.
• Work to extend with Taxonomies & Ontologies covering more extensive information about:
• Growth stages, soil types, planting characteristics is planned.
Ontologies as an IBM asset
Example: potato crop ontology build for Yara ODX
20
Assets for knowledge
modelling
• Architecture for building
ontologies
• Method and framework
for implementation
ontologies
• Domain-specific
ontologies (i.e. farming &
agri business)
Commercialization
• Can be owned by industry
or service line practices,
like industry models
• Can be part of the
industry commercial
offering package
• Can be extended to any
industry
Appendix
21
Open Agri Landscape Data initiatives
22
WHAT CAN WE
LEARN FROM?
WHAT CAN WE
POSITION AGAINST?
WHAT CAN WE
REUSE?
Work Initiatives
23
Understand the
Marketplace
Understand
Ontologies
Understand
What’s Available
Understand Data
Sharing
• Building out the Matrix
(Existence, Liveness,
Usefulness)
• What can we leverage
• What we should not do
• Plants, Growing
Practices, Inputs,
Measurements,
Environmental
• Translations and regional
variability
• Country specific
requirements
• Industry Requirements
• Public Expectations
• Agricultural
• Environmental
• Growers
Using Structured Knowledge
Structure Why Examples
Ontology Graphs representing both simple and complex
relationships and attributes. A mechanism to consistently
capture knowledge about a domain.
Describing environmental rules for
where to plant what crops and why.
Relationships between different inputs
and crops. Typically used in both search
and analytics.
Taxonomy Simple hierarchies to structure and find reference data
and glossary terms – structure implies hierarchical
relationships such as containment or part-of
Crop growth stages, Crop types,
Equipment types - all have implicit
structure. Used in analytics and
presentation.
Reference Data Common value sets with translations and regionalization –
often a missing aspect to integrate data from multiple
systems together.
Think of pull-down lists, kind of crop,
kind of irrigation, colors, places, etc.
Used to consistently communicate
internally and externally.
Glossary Common terminology for communicating internally and
with customers
Understandable terminology on screens
and reports
24
Coffee Crop Production Worldwide
25
Source: http://www.fao.org/
Two dominant coffee varieties are being produced
26
Robusta (aka Canephora)
• Lower Quality
• More robust/ fewer disease problems
• Easy to grow and manage
• Higher yielding (up to 3 tonnes/ha green bean
possible for small holder production)
• Easy to process
• Dominates lower end of the market such as instant
coffee and less discerning markets
• One third of world production
• Average world market price/kg approximately half
of that of Arabica
Arabica
• Higher Quality
• Less tolerant to environmental fluctuations/ More
prone to disease
• More complex physiological management Higher
yielding (up to 3 tonnes/ha green bean possible for
small holder production)
• Lower yielding (up to 1.5 tonnes/ha green bean
possible for small holder production)
• More sophisticated processing needed
• Operates at the higher end of the market such as
roasted and ground coffee
• Two thirds of world production
• Average world market price/kg approximately twice
that of Robusta
VS
Source:
Coffee end market segmentation by quality
27
Source: https://www.cbi.eu/market-information/coffee/belgium/market-entry/
Stages of Coffee Crop Growth
28
Source: https://www.torchcoffee.asia/resources
Reusable Ontology: soil properties and processes (OSP)
29
Source: http://doi.org/10.5518/54
How does it fit with a CDO organization?
30
Identify Subject Areas
Define the Meaning of things in the enterprise
Describe the logical representation of properties
Describe the physical means of data storage
Represent the coding language of the data platform
Store the values of the properties applied to the data in a schema
ONTOLOGY
Ontology, Conceptual ER,
Business Process
Entity-Relation model, Json schema,
XML schema
Physical schema, asset repository
Contextual
Conceptual
Logical
Physical
Definition
Instance

More Related Content

Similar to Ontologies in Agribusiness

IAR4D and benefits and ARC
IAR4D and benefits and ARCIAR4D and benefits and ARC
IAR4D and benefits and ARCAFAAS
 
Charlie Henderson Insetting State of Play
Charlie Henderson Insetting State of PlayCharlie Henderson Insetting State of Play
Charlie Henderson Insetting State of PlayPlanVivo1
 
GFAR / GODAN / CTA webinar #2 "Key data for farmers" - Stephen Kalyesubula - ...
GFAR / GODAN / CTA webinar #2 "Key data for farmers" - Stephen Kalyesubula - ...GFAR / GODAN / CTA webinar #2 "Key data for farmers" - Stephen Kalyesubula - ...
GFAR / GODAN / CTA webinar #2 "Key data for farmers" - Stephen Kalyesubula - ...GCARD Conferences
 
GFAR / GODAN / CTA webinar #1 "Data-driven agriculture. An overview" - Dan Be...
GFAR / GODAN / CTA webinar #1 "Data-driven agriculture. An overview" - Dan Be...GFAR / GODAN / CTA webinar #1 "Data-driven agriculture. An overview" - Dan Be...
GFAR / GODAN / CTA webinar #1 "Data-driven agriculture. An overview" - Dan Be...GCARD Conferences
 
An applied information economics approach to assessing resilience in the Horn...
An applied information economics approach to assessing resilience in the Horn...An applied information economics approach to assessing resilience in the Horn...
An applied information economics approach to assessing resilience in the Horn...ILRI
 
Alternative Agronomic Crops
Alternative Agronomic CropsAlternative Agronomic Crops
Alternative Agronomic CropsElisaMendelsohn
 
Food 4.0: Data Driven Agri-Food Systems
Food 4.0: Data Driven Agri-Food SystemsFood 4.0: Data Driven Agri-Food Systems
Food 4.0: Data Driven Agri-Food SystemsDeepak Pareek
 
East and Southern Africa Flagship Key highlights of our work so far-Polly E...
 East and Southern Africa FlagshipKey highlights of our work so far-Polly E... East and Southern Africa FlagshipKey highlights of our work so far-Polly E...
East and Southern Africa Flagship Key highlights of our work so far-Polly E...CGIAR Research Program on Dryland Systems
 
Information and communications technologies for agricultural research and dev...
Information and communications technologies for agricultural research and dev...Information and communications technologies for agricultural research and dev...
Information and communications technologies for agricultural research and dev...CIAT
 
ICRISAT Global Planning Meeting 2019:Research Program - Innovation Systems fo...
ICRISAT Global Planning Meeting 2019:Research Program - Innovation Systems fo...ICRISAT Global Planning Meeting 2019:Research Program - Innovation Systems fo...
ICRISAT Global Planning Meeting 2019:Research Program - Innovation Systems fo...ICRISAT
 
Smarter Agriculture Handout - v3
Smarter Agriculture Handout - v3Smarter Agriculture Handout - v3
Smarter Agriculture Handout - v3Ann Lambrecht
 
Session 5.2 Combining numerical modeling with a participative approach
Session 5.2 Combining numerical modeling with a participative approachSession 5.2 Combining numerical modeling with a participative approach
Session 5.2 Combining numerical modeling with a participative approachWorld Agroforestry (ICRAF)
 
Inclusive and Efficient Value Chains: Innovations, Scaling, and Way Forward
Inclusive and Efficient Value Chains: Innovations, Scaling, and Way ForwardInclusive and Efficient Value Chains: Innovations, Scaling, and Way Forward
Inclusive and Efficient Value Chains: Innovations, Scaling, and Way ForwardIFPRI-PIM
 
Innovative IT app in agri food.pptx
Innovative IT app in agri food.pptxInnovative IT app in agri food.pptx
Innovative IT app in agri food.pptxkrish408617
 
Product overview_Vertical farming.pdf
Product overview_Vertical farming.pdfProduct overview_Vertical farming.pdf
Product overview_Vertical farming.pdfUmashankarTriplicane
 
Session 5.2 Combining numerical modeling with a participative approach
Session 5.2 Combining numerical modeling with a participative approachSession 5.2 Combining numerical modeling with a participative approach
Session 5.2 Combining numerical modeling with a participative approachWorld Agroforestry (ICRAF)
 

Similar to Ontologies in Agribusiness (20)

IAR4D and benefits and ARC
IAR4D and benefits and ARCIAR4D and benefits and ARC
IAR4D and benefits and ARC
 
Charlie Henderson Insetting State of Play
Charlie Henderson Insetting State of PlayCharlie Henderson Insetting State of Play
Charlie Henderson Insetting State of Play
 
GFAR / GODAN / CTA webinar #2 "Key data for farmers" - Stephen Kalyesubula - ...
GFAR / GODAN / CTA webinar #2 "Key data for farmers" - Stephen Kalyesubula - ...GFAR / GODAN / CTA webinar #2 "Key data for farmers" - Stephen Kalyesubula - ...
GFAR / GODAN / CTA webinar #2 "Key data for farmers" - Stephen Kalyesubula - ...
 
GFAR / GODAN / CTA webinar #1 "Data-driven agriculture. An overview" - Dan Be...
GFAR / GODAN / CTA webinar #1 "Data-driven agriculture. An overview" - Dan Be...GFAR / GODAN / CTA webinar #1 "Data-driven agriculture. An overview" - Dan Be...
GFAR / GODAN / CTA webinar #1 "Data-driven agriculture. An overview" - Dan Be...
 
An applied information economics approach to assessing resilience in the Horn...
An applied information economics approach to assessing resilience in the Horn...An applied information economics approach to assessing resilience in the Horn...
An applied information economics approach to assessing resilience in the Horn...
 
Alternative Agronomic Crops
Alternative Agronomic CropsAlternative Agronomic Crops
Alternative Agronomic Crops
 
IGAD_CODATA
IGAD_CODATAIGAD_CODATA
IGAD_CODATA
 
Food 4.0: Data Driven Agri-Food Systems
Food 4.0: Data Driven Agri-Food SystemsFood 4.0: Data Driven Agri-Food Systems
Food 4.0: Data Driven Agri-Food Systems
 
East and Southern Africa Flagship Key highlights of our work so far-Polly E...
 East and Southern Africa FlagshipKey highlights of our work so far-Polly E... East and Southern Africa FlagshipKey highlights of our work so far-Polly E...
East and Southern Africa Flagship Key highlights of our work so far-Polly E...
 
Information and communications technologies for agricultural research and dev...
Information and communications technologies for agricultural research and dev...Information and communications technologies for agricultural research and dev...
Information and communications technologies for agricultural research and dev...
 
Global agricultural Concept Scheme (GACS)
Global agricultural Concept Scheme (GACS)Global agricultural Concept Scheme (GACS)
Global agricultural Concept Scheme (GACS)
 
ICRISAT Global Planning Meeting 2019:Research Program - Innovation Systems fo...
ICRISAT Global Planning Meeting 2019:Research Program - Innovation Systems fo...ICRISAT Global Planning Meeting 2019:Research Program - Innovation Systems fo...
ICRISAT Global Planning Meeting 2019:Research Program - Innovation Systems fo...
 
Humidtropics ISPC Presentation Sept. 2014
Humidtropics ISPC Presentation Sept. 2014Humidtropics ISPC Presentation Sept. 2014
Humidtropics ISPC Presentation Sept. 2014
 
Smarter Agriculture Handout - v3
Smarter Agriculture Handout - v3Smarter Agriculture Handout - v3
Smarter Agriculture Handout - v3
 
Session 5.2 Combining numerical modeling with a participative approach
Session 5.2 Combining numerical modeling with a participative approachSession 5.2 Combining numerical modeling with a participative approach
Session 5.2 Combining numerical modeling with a participative approach
 
Humid Tropics: How Humid Tropics approaches regional research and selects res...
Humid Tropics: How Humid Tropics approaches regional research and selects res...Humid Tropics: How Humid Tropics approaches regional research and selects res...
Humid Tropics: How Humid Tropics approaches regional research and selects res...
 
Inclusive and Efficient Value Chains: Innovations, Scaling, and Way Forward
Inclusive and Efficient Value Chains: Innovations, Scaling, and Way ForwardInclusive and Efficient Value Chains: Innovations, Scaling, and Way Forward
Inclusive and Efficient Value Chains: Innovations, Scaling, and Way Forward
 
Innovative IT app in agri food.pptx
Innovative IT app in agri food.pptxInnovative IT app in agri food.pptx
Innovative IT app in agri food.pptx
 
Product overview_Vertical farming.pdf
Product overview_Vertical farming.pdfProduct overview_Vertical farming.pdf
Product overview_Vertical farming.pdf
 
Session 5.2 Combining numerical modeling with a participative approach
Session 5.2 Combining numerical modeling with a participative approachSession 5.2 Combining numerical modeling with a participative approach
Session 5.2 Combining numerical modeling with a participative approach
 

Recently uploaded

Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - InfographicHr365.us smith
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...OnePlan Solutions
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptkotipi9215
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesŁukasz Chruściel
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....kzayra69
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfPower Karaoke
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfAlina Yurenko
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based projectAnoyGreter
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样umasea
 

Recently uploaded (20)

Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - Infographic
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.ppt
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New Features
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort ServiceHot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based project
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
 

Ontologies in Agribusiness

  • 1. Consumer Industry Agribusiness Initiative June 11, 2020 Understanding Agri Ontologies, Taxonomies & Reference Data Mehdi Charafeddine
  • 2. Agenda • Target Audience and How to Use • Problem Statement • Knowledge Representation in the Agri Landscape • The journey into Ontologies • Ontology Use Case • How do we build and use Ontologies? • Next Steps 2
  • 3. Problem Statement 3 The Open Agri space is busy with emerging, living and Dying initiatives We need to understand what exists, what has worked, and what hasn’t We need to support our own tactical and strategic needs We need to Establish where we can and should fit into this eco-system
  • 4. Target Audience and How to Use 4 Chief Data Office Agronomists Data Scientists Data Engineers Data Stewards Brand Managers Sustainability and Supply Chain Personnel Client Context Target Audience How to Use The asset should be used with clients in the context of: Ø Building marketplaces Ø Building data platforms Ø Enabling agronomy AI use cases Ø Grower Management IBM Community Ø Presentation: Lighthouse, Seismic, Solution Gateway Ø GitHub Repository: IBM GitHub Ø Recorded Webinar: CBDS Data Platform, Data Architect Community
  • 5. Knowledge Representation 5 Knowledge Management Information Management Data Management Data is discrete, facts have no meaning in isolation Data has relevance and purpose. It informs and causes the uses to change state, Knowledge is actionable information placed in context based on facts and meaning. Wisdom Knowledge enables Understanding necessary For effective decision-making
  • 6. Knowledge Modelling Landscape 6 • Ontology: • Semantic data model • Classify Things and define more specific relations and attributes • Local and global ontologies can be combined • Culturally neutral • Taxonomy: • Describes the organization’s vocabulary, common terms, and synonyms. • Organize terms into hierarchy • Reference Data: • Set of well-known value for attributes, translates into local meaning • Glossary: • Define terms in a simple way Ontology Subject – Predicate - Object Logical Data Model Entity - Relationship Taxonomy Hierarchy – Tree Structure Glossary / Dictionary Term, Definitions, References SEMANTIC RICHNESS LOW HIGH
  • 7. Farming Domain Ontologies 7 Field Crop Soil Input Environmental Ontology Domains SCOPE? REPRESENTATION? OWNERSHIP? MAINTENANCE? LICENSE TERMS? HOW TO MODIFY? Open Questions?
  • 8. Reference Data 8 “Reference data are data that define the set of permissible values to be used by other data fields. Reference data gain in value when they are widely re-used and widely referenced. Typically, they do not change overly much in terms of definition, apart from occasional revisions.” - Wikipedia What are the common names of the crops and what are the correspondences across languages and regions?? What sources of reference data can we find for agriculture? What are the terms and conditions around the use of reference data? Can we find reference data for: Crops, Growing Practices, Inputs, Equipment, Soils, Weather
  • 9. Knowledge Representation Journey 9 Chief Data Office Glossary Taxonomy Ontology Terms & Conditions Licensing • Defining ontologies is a core data governance practice • Building ontologies is a continuous process
  • 10. Coffee Farming Use Case 10 As An Agronomist Manager, I want to be able to record known performance of local Arabica and Robusta varieties, So that agronomists and nurseries know the performance of the variety and can inform farmers of potential benefits KPIs to track • Yield (metric tons/hectare): • Product type: Cherry, Green Coffee or Parch. • Measurement Tree Density (tree / hectare) • Cup quality expected (optional): 1.0 (best) to 1.4 (worst) • Compliance with Green Coffee Specifications: Y/N/Unknown • Granulometry (100 beans weight) – grams • Resistance to drought (Sensitive / Moderate / Tolerant)
  • 11. Coffee Quality Map 11 Cup Quality Measures • Aroma • Flavour • Aftertaste • Acidity • Body • Balance • Uniformity • Cup Cleanliness • Sweetness • Moisture • Defects Farm Metadata • Owner • Country of Origin • Region • Farm Name • Lot Number • Holding Pattern • Mill (on site) • Company • Location and Altitude • Farm Map • Farm Area Bean Metadata • Processing Method (wet or dry process, washed or natural) • Bean size & density • Bean Colour • Species (Arabica / Robusta) • Roast appearance and cup quality in relation to flavour, characteristics and cleanliness
  • 12. Coffee Varieties 12 Two dominant coffee varieties out of 125! Coffee is a long-term crop with a lifespan of more than 10 years, and considerably longer under good management, thus the choice of variety (cultivar) is very important. As quality of the coffee bean is crucial for production of high-grade coffee, choose only varieties that are recommended for your area.
  • 13. Reference Data sources for Coffee 13 Production & Yield Disease Resistance Soil Type Plant genotype & taxonomy Growing Practices Data Available
  • 14. Modeling an Ontology for our Coffee Use Case: Where to Start? 14
  • 15. Ontology Development Methodology 16 Define Entities Define Use Cases Ontologies Guidelines Ontologies Tooling Define Reference Data Identify Reusable Ontologies Create Ontology 1 Create Ontology 2 Create Ontology N … Collaboration Process Ontology Repository Framework Domain Definition Execution Preparation
  • 16. How does it fit with a CDO organization? 18 LOW Ontologies + Knowledge Graphs = • Identify all • Provide context • Discover hidden facts “An enterprise knowledge graph is a representation of an organization’s knowledge domain and artefacts that is understood by both humans and machines” Graph Database Enterprise Data Sources Glossaries, Taxonomies • Also known as “triple store” • Collection of references to knowledge objects in their source systems • Store properties for each object from the various sources • Store relationships between those objects • Variety of data sources and systems • Many disparate systems
  • 17. How does it fit with a CDO organization? 19 • Watson Decision Platform for Agriculture: • The Electronic Field/Regional Record holds domain specific information across growing seasons • Terms and Reference are used to define the allowable values for attributes such as: • Crop type, • Irrigation type, • Tilling methods, etc. • Work to extend with Taxonomies & Ontologies covering more extensive information about: • Growth stages, soil types, planting characteristics is planned.
  • 18. Ontologies as an IBM asset Example: potato crop ontology build for Yara ODX 20 Assets for knowledge modelling • Architecture for building ontologies • Method and framework for implementation ontologies • Domain-specific ontologies (i.e. farming & agri business) Commercialization • Can be owned by industry or service line practices, like industry models • Can be part of the industry commercial offering package • Can be extended to any industry
  • 20. Open Agri Landscape Data initiatives 22 WHAT CAN WE LEARN FROM? WHAT CAN WE POSITION AGAINST? WHAT CAN WE REUSE?
  • 21. Work Initiatives 23 Understand the Marketplace Understand Ontologies Understand What’s Available Understand Data Sharing • Building out the Matrix (Existence, Liveness, Usefulness) • What can we leverage • What we should not do • Plants, Growing Practices, Inputs, Measurements, Environmental • Translations and regional variability • Country specific requirements • Industry Requirements • Public Expectations • Agricultural • Environmental • Growers
  • 22. Using Structured Knowledge Structure Why Examples Ontology Graphs representing both simple and complex relationships and attributes. A mechanism to consistently capture knowledge about a domain. Describing environmental rules for where to plant what crops and why. Relationships between different inputs and crops. Typically used in both search and analytics. Taxonomy Simple hierarchies to structure and find reference data and glossary terms – structure implies hierarchical relationships such as containment or part-of Crop growth stages, Crop types, Equipment types - all have implicit structure. Used in analytics and presentation. Reference Data Common value sets with translations and regionalization – often a missing aspect to integrate data from multiple systems together. Think of pull-down lists, kind of crop, kind of irrigation, colors, places, etc. Used to consistently communicate internally and externally. Glossary Common terminology for communicating internally and with customers Understandable terminology on screens and reports 24
  • 23. Coffee Crop Production Worldwide 25 Source: http://www.fao.org/
  • 24. Two dominant coffee varieties are being produced 26 Robusta (aka Canephora) • Lower Quality • More robust/ fewer disease problems • Easy to grow and manage • Higher yielding (up to 3 tonnes/ha green bean possible for small holder production) • Easy to process • Dominates lower end of the market such as instant coffee and less discerning markets • One third of world production • Average world market price/kg approximately half of that of Arabica Arabica • Higher Quality • Less tolerant to environmental fluctuations/ More prone to disease • More complex physiological management Higher yielding (up to 3 tonnes/ha green bean possible for small holder production) • Lower yielding (up to 1.5 tonnes/ha green bean possible for small holder production) • More sophisticated processing needed • Operates at the higher end of the market such as roasted and ground coffee • Two thirds of world production • Average world market price/kg approximately twice that of Robusta VS Source:
  • 25. Coffee end market segmentation by quality 27 Source: https://www.cbi.eu/market-information/coffee/belgium/market-entry/
  • 26. Stages of Coffee Crop Growth 28 Source: https://www.torchcoffee.asia/resources
  • 27. Reusable Ontology: soil properties and processes (OSP) 29 Source: http://doi.org/10.5518/54
  • 28. How does it fit with a CDO organization? 30 Identify Subject Areas Define the Meaning of things in the enterprise Describe the logical representation of properties Describe the physical means of data storage Represent the coding language of the data platform Store the values of the properties applied to the data in a schema ONTOLOGY Ontology, Conceptual ER, Business Process Entity-Relation model, Json schema, XML schema Physical schema, asset repository Contextual Conceptual Logical Physical Definition Instance