SlideShare a Scribd company logo
Cory Casanave
Copyright © 2017 Model Driven Solutions
Licensed as creative commons, share-alike
https://creativecommons.org/licenses/by-sa/4.0/legalcode
To relate different
representations of the
same things, we need to
understand
• What are the
common concepts
• How do the various
information syntaxes
represent those
concepts
• What are the rules
for translating
between them in
various context
Semantic Mediation
Reference
Concepts
Information
Syntax
RepresentsInformation
Syntax
Information
Syntax
Conceptual Model
Mapping Rules
Information Model
Leveraging Federated Information
T
Reference
Concepts
Information
Syntax
RepresentsInformation
Syntax
Information
Syntax
Conceptual Model
Mapping Rules
Information Model
Modeling Concepts are the meta-concepts we use to build our models
Raising the level of abstraction
Conceptual models need not be limited to computational paradigms
• E.g. OWL-DL is limited by FOL, DL and unary “properties” (no relations)
Examples of modeling concepts (more detail below)
• Context
• First-class n-ary relationships
• Roles and phases
• Temporal friendly “situations” and relationships
Note: Ontology does not mean “OWL-DL”, Ontology is about conception of the world(s).
Note also that the above can be “painted over” OWL and UML with patterns.
High-Level Modeling Concepts
• Linked Data
• Ontologies (e.g. OWL)
• Rules
• Traditional DBMS / SQL
• Big Data
• NoSQL
• Traditional APIs and applications (e.g. Java)
• Functional Programming
• ETL / Data Wearhouse
• Publish/Subscribe
• SOA
• Application and web servers
Multiple technologies can be
leveraged, integrated and
combined based on the
same conceptual models
using model driven
architecture
MultipleTechnologies
Solution Architecture
What are the requirements for
conceptual modeling in support of
federation?
Role of the interpreter
Retention of meaning across languages, communities and cultures
Communicate what is said without judging, coloring or filtering it
Interpreters have substantial preparation, learning syntax, grammar, vocabulary and
cultural idioms.
Interpreters can only communicate what they understand and what can be understood in
the languages they deal with – the common concepts
They then communicate what other people said based on how those concepts are
expressed in different languages – they communicate the provenance
Interpreters are preforming semantic mediation
What we expect of interpreters
Common
concepts
Understandable and precise
Capture, disambiguate and precisely describe
stakeholder concepts such that they can
enable automation.
Represent these concepts so stakeholders can
understand and validate them.
Semantic mediation does
• Capture shared concepts, this includes types, relationships, individuals
and other concept kinds
• Define a relationship between patterns of these concepts and patterns of
data representations – it is not a word for word translation
• Respect the context of the concepts and rules
• Maintain the provenance of information
Semantic mediation does not
• Define applications that produce or consume information
• Assert policy about what is “right”, “common” or “legal”
• Structure information for an application perspective
• Filter for “data quality”
Implications for the role of semantic
mediation (conceptual reference models)
Conceptual models and mappings, intended for the federation role
• Should use stakeholder relevant terms and concepts presented in an understandable
way
• Should have minimal commitment – only committing to what is necessary to
understand and “map” information. E.g. specific units or data representations
• Should keep application specifics out of the concepts – how the information is
developed is not our concern.
• Should contain both the “most generic” sense of a concept as well as any specialization
required (and named appropriately) for precise communications
• Mappings should be able to be “mixed and matched” with multiple concept modules
based on context
• Should only contain semantics that have a “ROI” for federation.
• E.g. should not be implementing “application rules”, it is up to the “end applications” to enforce
rules prior to mediation
Conclusion: Conceptual models for federation may be a subset of (simpler than) conceptual
models or operational ontologies intended for applications.Their semantics only need
to support the mapping rules for representations.
Implications for SMIF
Example from “Operational
threat/risk”
Is this a threat actor? (STIX)
STIX is a large XML Schema
What is a threat actor? (FBI)
The Federal Bureau of Investigation has identified three categories
of cyber threat actors:
“ [1] organized crime groups that are primarily threatening the
financial services sector, and they are expanding the scope of their
attacks; [2] state sponsors — foreign governments that are
interested in pilfering data, including intellectual property and
research and development data from major manufacturers,
government agencies, and defense contractors; and [3] increasingly
there are terrorist groups who want to impact this country the
same way they did on 9/11 by flying planes into buildings.They are
seeking to use the network to challenge the United States by
looking at critical infrastructure to disrupt or harm the viability of
our way of life.[1]
Is this a threat actor?
Threat
Actor?
Our conclusion:Only the “real world” perspective can
provide a pivot point between different representations.
Instances of a conceptual “threat actor” are real threat
actors, not data about them as data is structured for its
application purpose.
What is a threat actor? (Threat Model)
Data represents concepts
Represents
Model Representation
Example of Mapping Pattern (UML
Profile)
SMIF Components – conceptual models
and mappings to schema
Conceptual
Modeling UML
Profile
Mapping UML
Profile
Schema ProfileDependsOn MapTo
SMIF Conceptual
Metamodel
Mapping
Metamodel
SMIF Information
Metamodel
DependsOn MapTo
Represents
SMIF Conceptual Meta Model
UML Representation
Represents
Represents
As easy as possible for domain stakeholders to understand the models as
diagrams, tables or text
“First class” n-ary relationships that may have properties and participate in other
relationships
Recognition and representation of time and context – most relationships are
only true for a limited time and in specific situations
Roles, phases and other “non rigid” classifications that also may be time or
situationally dependent
Hierarchies of types, relationship types and properties
Business values as represented by various systems of units
Patterns as a basis for real-world situations as well as mapping rules
Note that while some of these features may be difficult for generic FOL
reasoners, they are practical for specific conceptual model and mapping rules
Important conceptual modeling and
mapping capabilities
Last years Ontology Summit focused on “Ontologies within Semantic
Interoperability Ecosystems ”. Full document: http://tiny.cc/Ontolog2016
Supports the value proposition of SMIF
• “The United States spent nearly $3.0 trillion on healthcare in 2014 [ . . . ] the efficient
use of health information technology will result in considerable cost savings, in addition
to saving thousands of lives per year”.
Supports the need for capabilities not currently provided, focused on
semantic interoperability
• "More mature conceptual modeling & knowledge engineering tools are required to
support the design, development and storage of ontologies and vocabularies. [ . . .
] these should be integrated into software development and data management tools
that provide support across the IT lifecycle and related operations.“
• "There was broad consensus within the Summit that improved software tools and
environments are required to support the integration of concepts and data."
Ontology Summit: Semantic
Interoperability Communique
SMIF Implementation for information
federation, sharing and analytics
Status: PoC
Federation Engine tm
Prototype model & semantics
implementation preview
“SMIF Full”
RDF Repository
SMIF
UML Models
SMIF Java API
UML
Facet
Eclipse
UML
Java
Facet
RDF/OWL
Facet
Rules
SW Rules
Facet
SQL
Facet
XML
Facet
“Flat”
Facet
Planned
API Generated
from SMIF
Model
CL/IKL
Facet
Each facet provides an adapter
between external data and a SMIF
counterpart at any meta-level. A
SMIFObject may have multiple
facets.
Mappings & federations are
(conceptually) between SMIF
objects.
Analytics
Implements
SMIF Semantics
as Java code
XMI
Facet
Java API for any SMIF model (Including those imported from other sources)
Production of “SMIF Full” RDF/OWL and rules
Bi-directional data mapping – SQL, XML, OWL
Semantic mapping between data repositories
User definable rules in SMIF or Java
Projection to “tabular” data formats for analytics engines
Full support for multiple inheritance and multiple classification
All of the above for FIBO, Concept Library and Threat/Risk
Capabilities (In Progress)
Using the SMIF UML profile – one
projection of the SMIF model
Entities & Facets Propositions
FoundationalConceptualTools (Meta-Concepts)
Characteristics
Associations (Always true between entities)
Relationships (Independent/Contextual)
Rigid (Entity)Types
Roles
Phases
• Reference models are conceptual, information models are application specific
• Multiple inheritance & multiple (instance) classification is required
• A type can have multiple Supertypes
• A thing may have multiple types, these types may change over time and in differing context
• Everything has metadata – source (including derivations), provenance & timeframe
• Things change – time matters (the world is not static or a snapshot)
• E.G. Relationships, situations and characteristics are temporal
• Information is contextual (the world is not first order)
• Statements are not “deleted”, they go in and out of context (Allows functional)
• The world is open, conclusions (computations) are contextual
• Trust in information varies, not everything that can be inferenced should be
• Independence from representation (e.g. schema) and processing technologies
(including inference engines)
• But, we can bind to any of these technologies
Type Based ModelTheory
Roles define what something is for or how it behaves in a certain context,
not “what it is”.
A role is a <<Facet Of>> what it can be a role of.
An entity can play any number of roles and these roles may change over
time.
Roles can be contextual and specialize other roles
Roles are usually established by relationships
Roles
Phases (or states) are classifications of objects over their lifetime. A
“Relative thing”
Examples: Child,Teenager, Adult or Invoiced, Late, Paid
May be combined with other types using unions and intersection (e.g.
teenage driver)
Phases
Relationships are meaningful atomic concepts involving a set of other entities or
relationships. A “Mediating thing”.
There can be any number of related “ends”, but two ends is most common
Relationships are atomic & static “Situations”, the involved ends do not change over
the lifetime of the relationship. The context and “truth value” of the relationship may
change.
Relationships are temporal – exist for a timeframe. The timeframe of a relationship
may or may not match the timeframe of the “ends”
Relationships may be involved in other relationships and may have characteristics
We refer to these independent relationships as “first class” relationships.
Relationships
Modeling with first class roles & relationships
(Note that relationships can be temporal
– be “true” only for a time period)
Associations are similar to relationships but:
• Not “first class”
• Not temporal – they exist for the lifetime of the related “ends”
• Limited to binary
• Also known as “Formal Relations”
• Map most directly to simple properties (e.g. OWL or Java)
Associations
Characteristics are features inherent in another type
Most like “properties” in many languages
Usually have a value type as their range
John has a weight of 60 KG
Characteristics are temporal (have a timeframe) & identifiable
John weighed 60kg on March 3rd, 2011
In conceptual reference models quantity kinds (e.g. Mass) are preferred over
specific units (e.g. kg) or data types (e.g. “Real”) as the types of characteristics
Characteristics
Properties encompass the “ends” of associations and relationships, and
characteristics
Like everything else – properties have types (property types) and instances
(called bindings)
Properties can be specialized and restricted
Models typically define property types
Specialized properties may be “virtually derived”, not define a new concept
but restrict an existing one. These have no name or the same name as their
super-property
Property specialization is either a “subset” or redefines (equivalent set)
Specializing properties
Subset properties define subtypes of other property types (ends of
associations or relationships, characteristics, etc).
Extent of subset property is a subset of super-property
May tighten constraints – multiplicity, type, etc.
Subset properties
Redefined properties define subtypes of other property types (ends of
associations or relationships, characteristics, etc) and replace the super-
property in the given context.
Extent of redefined property is the same as the extent of super-property
May tighten constraints – multiplicity, type, etc.
Redefined Properties
Restriction Example
Disjoint defines a set of classifications
that can never be combined in the same
thing.
• e.g. you can’t have a cat/dog
Complete defines that a set of things
can’t have any more members.
Incomplete says there can be more in a
set.
• Certainly there are more kinds of
animals than cats and dogs.
Incomplete and Disjoint
Note: Disjoint may also use a dependency
It is essential that we understand what quantities
represent .
For numeric characteristics, we want to know
what it means (e.g.Temperature), not the kind of
number (Real).
<<Quantity Kind>> is an aspect common to
mutually comparable quantities represented by
one or more units.
A “unit value” represents a quantity kind, there
are multiple units representing temperature.
A physical representation would then represent
the unit as some kind of number in a specified
unit.
Quantity Kinds & Units
Situations contextualize one or more
propositions – relationships, associations,
characteristics or rules
Situations are “mediating things”
Situations are temporal
Relationships are atomic situations
Situations types / patterns and individuals
may be defined
Situations may be
• Static or dynamic
• Past, present or possible future
Situations
Kinds of situations
Situations capture “truth” in context (any
such truth is subject to trust and likelihood)
Models may be organized into packages (sub-graphs) and reference or assert
other packages
Model Organization
Expressions
• Define computations
Patterns
• Define sets of entities and relationships with variables, acts as a type.
• May be asserted for a type or context
• Used as the basis for mapping
• In UML, uses structured classifiers
Rules
• Define constraints or implications for a context. E.g. Multiplicity.
Mapping Rules
• Defines how an information model relates to a conceptual reference model
• Based on pattern matching
• Multiple mappings are combined to federate or translate information
AdditionalCapabilities, not detailed here
SMIF provides a framework for defining a semantic model
In the general sense of the word, this is an “Ontology”
Tools can generate ontologies in various languages, such as OWL
As the focus of SMIF is federation and integration, not general inference,
there are some differences
• SMIF allows things some ontology languages do not – context, “second
order” relationships and time.
• It includes higher level concepts such as first-class relationships, roles and
phases
• It does not include some capabilities of first order logic, it does not
provide for general inferencing based on arbitrary expressions
SMIF is a good match for the “semantic web” and linked data
Semantics and SMIF
The concept library is defined in and specializes SMIF, it is not
defined as part of SMIF. Concept libraries are open source.
These are candidate concept – refinements and alternatives are
welcome.
https://github.com/ModelDriven/ConceptLibraries/
SMIF
Concept Library
DefinedUsing
Extends
Domain Concept
Models
Extends
Application Model References
To facilitate cross domain and cross system interaction, we must have
concepts common across those domains and systems
We know they will be represented differently and may have different
purposes, but we must understand what things mean
We should be reusing concepts rather then building new stovepipes
There will never be “one true” concept set, but we can define concepts for
broad use and acceptance
Concepts are introduced as:
• They are needed in a domain of interest
• They are an abstraction of one or more concepts needed to fully define
another concept
Concept Library Principles
TopLevel(AnyLevel)
SMIF
Identifiers
Values
Meta
EntityRelationships
SocialAgents
Persons
PhysicalEntities
Places
Organizations
Time&TemporalThings
SituationPartitions
SituationInterdependencies
Occurrences
Process(Base)
Containment
MoveBetweenContainment
Information&InformationActions
Control
Resource
GeopoliticalEntities
Corporations
Objectives
Assessments
Policy
Quantity Kinds
Common Units (2)
The above models were done with two “profiles” of
UML.
Profiles specialize UML for a specific purpose.
Any UML tool can be used with the SMIF
profile
Our preferred tool is Magicdraw®
The “Concept modeler” can produce and
consume OWL ontologies based on
conceptual models
ModelingTools
ConceptualModelProfile
SMIF Rules Profile
http://www.modeldriven.com
https://github.com/ModelDriven/SIMF/
https://github.com/ModelDriven/ConceptLibraries/
SMIF is an “standard in progress” in the Object Management Group (omg.org)
It is currently being used and implemented

More Related Content

Similar to Semantic Modeling for Information Federation

Tropos project toward RE
Tropos project toward RETropos project toward RE
Tropos project toward RESehrish Asif
 
The Revolution Of Cloud Computing
The Revolution Of Cloud ComputingThe Revolution Of Cloud Computing
The Revolution Of Cloud ComputingCarmen Sanborn
 
Journalism and the Semantic Web
Journalism and the Semantic WebJournalism and the Semantic Web
Journalism and the Semantic WebKurt Cagle
 
An introduction to repository reference models
An introduction to repository reference modelsAn introduction to repository reference models
An introduction to repository reference modelsJulie Allinson
 
RIM-Recasted, Value-Added Efficiency Interpolation in the HL7 Development Par...
RIM-Recasted, Value-Added Efficiency Interpolation in the HL7 Development Par...RIM-Recasted, Value-Added Efficiency Interpolation in the HL7 Development Par...
RIM-Recasted, Value-Added Efficiency Interpolation in the HL7 Development Par...ijceronline
 
Implementing information federation
Implementing information federationImplementing information federation
Implementing information federationCory Casanave
 
Business case for concept models
Business case for concept modelsBusiness case for concept models
Business case for concept modelsCory Casanave
 
Model-Based Systems Engineering Demystified
Model-Based Systems Engineering DemystifiedModel-Based Systems Engineering Demystified
Model-Based Systems Engineering DemystifiedElizabeth Steiner
 
Graph-Based Algorithm for a User-Aware SaaS Approach: Computing Optimal Distr...
Graph-Based Algorithm for a User-Aware SaaS Approach: Computing Optimal Distr...Graph-Based Algorithm for a User-Aware SaaS Approach: Computing Optimal Distr...
Graph-Based Algorithm for a User-Aware SaaS Approach: Computing Optimal Distr...IJERA Editor
 
UNITII -MODELING and AGGRIGATION-sss.docx
UNITII -MODELING  and AGGRIGATION-sss.docxUNITII -MODELING  and AGGRIGATION-sss.docx
UNITII -MODELING and AGGRIGATION-sss.docxsharwarisolapur
 
Metadata practice and direction: a community perspective
Metadata practice and direction:a community perspectiveMetadata practice and direction:a community perspective
Metadata practice and direction: a community perspectivelisld
 
Data and Computation Interoperability in Internet Services
Data and Computation Interoperability in Internet ServicesData and Computation Interoperability in Internet Services
Data and Computation Interoperability in Internet ServicesSergey Boldyrev
 
Semantics in Financial Services -David Newman
Semantics in Financial Services -David NewmanSemantics in Financial Services -David Newman
Semantics in Financial Services -David NewmanPeter Berger
 
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information RetrievalKeystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information RetrievalMauro Dragoni
 
Delivering Process-Driven, Dynamic Applications
Delivering Process-Driven, Dynamic ApplicationsDelivering Process-Driven, Dynamic Applications
Delivering Process-Driven, Dynamic ApplicationsNathaniel Palmer
 
Delivering Process-Driven, Dynamic Applications
Delivering Process-Driven, Dynamic ApplicationsDelivering Process-Driven, Dynamic Applications
Delivering Process-Driven, Dynamic ApplicationsNathaniel Palmer
 

Similar to Semantic Modeling for Information Federation (20)

Tropos project toward RE
Tropos project toward RETropos project toward RE
Tropos project toward RE
 
The Revolution Of Cloud Computing
The Revolution Of Cloud ComputingThe Revolution Of Cloud Computing
The Revolution Of Cloud Computing
 
Ultra large scale systems to design interoperability
Ultra large scale systems to design interoperabilityUltra large scale systems to design interoperability
Ultra large scale systems to design interoperability
 
Journalism and the Semantic Web
Journalism and the Semantic WebJournalism and the Semantic Web
Journalism and the Semantic Web
 
An introduction to repository reference models
An introduction to repository reference modelsAn introduction to repository reference models
An introduction to repository reference models
 
RIM-Recasted, Value-Added Efficiency Interpolation in the HL7 Development Par...
RIM-Recasted, Value-Added Efficiency Interpolation in the HL7 Development Par...RIM-Recasted, Value-Added Efficiency Interpolation in the HL7 Development Par...
RIM-Recasted, Value-Added Efficiency Interpolation in the HL7 Development Par...
 
Implementing information federation
Implementing information federationImplementing information federation
Implementing information federation
 
Business case for concept models
Business case for concept modelsBusiness case for concept models
Business case for concept models
 
Model-Based Systems Engineering Demystified
Model-Based Systems Engineering DemystifiedModel-Based Systems Engineering Demystified
Model-Based Systems Engineering Demystified
 
Graph-Based Algorithm for a User-Aware SaaS Approach: Computing Optimal Distr...
Graph-Based Algorithm for a User-Aware SaaS Approach: Computing Optimal Distr...Graph-Based Algorithm for a User-Aware SaaS Approach: Computing Optimal Distr...
Graph-Based Algorithm for a User-Aware SaaS Approach: Computing Optimal Distr...
 
UNITII -MODELING and AGGRIGATION-sss.docx
UNITII -MODELING  and AGGRIGATION-sss.docxUNITII -MODELING  and AGGRIGATION-sss.docx
UNITII -MODELING and AGGRIGATION-sss.docx
 
Metadata practice and direction: a community perspective
Metadata practice and direction:a community perspectiveMetadata practice and direction:a community perspective
Metadata practice and direction: a community perspective
 
Data and Computation Interoperability in Internet Services
Data and Computation Interoperability in Internet ServicesData and Computation Interoperability in Internet Services
Data and Computation Interoperability in Internet Services
 
Semantics in Financial Services -David Newman
Semantics in Financial Services -David NewmanSemantics in Financial Services -David Newman
Semantics in Financial Services -David Newman
 
Data Modeling.docx
Data Modeling.docxData Modeling.docx
Data Modeling.docx
 
Ontologies
OntologiesOntologies
Ontologies
 
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information RetrievalKeystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
Keystone Summer School 2015: Mauro Dragoni, Ontologies For Information Retrieval
 
Mca 504 dotnet_unit4
Mca 504 dotnet_unit4Mca 504 dotnet_unit4
Mca 504 dotnet_unit4
 
Delivering Process-Driven, Dynamic Applications
Delivering Process-Driven, Dynamic ApplicationsDelivering Process-Driven, Dynamic Applications
Delivering Process-Driven, Dynamic Applications
 
Delivering Process-Driven, Dynamic Applications
Delivering Process-Driven, Dynamic ApplicationsDelivering Process-Driven, Dynamic Applications
Delivering Process-Driven, Dynamic Applications
 

Recently uploaded

Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsalex933524
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单ewymefz
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sMAQIB18
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单ewymefz
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIAlejandraGmez176757
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单nscud
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .NABLAS株式会社
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单ukgaet
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBAlireza Kamrani
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Domenico Conte
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJames Polillo
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单ewymefz
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatheahmadsaood
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhArpitMalhotra16
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...correoyaya
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单vcaxypu
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单ewymefz
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单enxupq
 

Recently uploaded (20)

Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDB
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 

Semantic Modeling for Information Federation

  • 1. Cory Casanave Copyright © 2017 Model Driven Solutions Licensed as creative commons, share-alike https://creativecommons.org/licenses/by-sa/4.0/legalcode
  • 2. To relate different representations of the same things, we need to understand • What are the common concepts • How do the various information syntaxes represent those concepts • What are the rules for translating between them in various context Semantic Mediation Reference Concepts Information Syntax RepresentsInformation Syntax Information Syntax Conceptual Model Mapping Rules Information Model
  • 4. Modeling Concepts are the meta-concepts we use to build our models Raising the level of abstraction Conceptual models need not be limited to computational paradigms • E.g. OWL-DL is limited by FOL, DL and unary “properties” (no relations) Examples of modeling concepts (more detail below) • Context • First-class n-ary relationships • Roles and phases • Temporal friendly “situations” and relationships Note: Ontology does not mean “OWL-DL”, Ontology is about conception of the world(s). Note also that the above can be “painted over” OWL and UML with patterns. High-Level Modeling Concepts
  • 5. • Linked Data • Ontologies (e.g. OWL) • Rules • Traditional DBMS / SQL • Big Data • NoSQL • Traditional APIs and applications (e.g. Java) • Functional Programming • ETL / Data Wearhouse • Publish/Subscribe • SOA • Application and web servers Multiple technologies can be leveraged, integrated and combined based on the same conceptual models using model driven architecture MultipleTechnologies
  • 7. What are the requirements for conceptual modeling in support of federation?
  • 8. Role of the interpreter
  • 9. Retention of meaning across languages, communities and cultures Communicate what is said without judging, coloring or filtering it Interpreters have substantial preparation, learning syntax, grammar, vocabulary and cultural idioms. Interpreters can only communicate what they understand and what can be understood in the languages they deal with – the common concepts They then communicate what other people said based on how those concepts are expressed in different languages – they communicate the provenance Interpreters are preforming semantic mediation What we expect of interpreters Common concepts
  • 10. Understandable and precise Capture, disambiguate and precisely describe stakeholder concepts such that they can enable automation. Represent these concepts so stakeholders can understand and validate them.
  • 11. Semantic mediation does • Capture shared concepts, this includes types, relationships, individuals and other concept kinds • Define a relationship between patterns of these concepts and patterns of data representations – it is not a word for word translation • Respect the context of the concepts and rules • Maintain the provenance of information Semantic mediation does not • Define applications that produce or consume information • Assert policy about what is “right”, “common” or “legal” • Structure information for an application perspective • Filter for “data quality” Implications for the role of semantic mediation (conceptual reference models)
  • 12. Conceptual models and mappings, intended for the federation role • Should use stakeholder relevant terms and concepts presented in an understandable way • Should have minimal commitment – only committing to what is necessary to understand and “map” information. E.g. specific units or data representations • Should keep application specifics out of the concepts – how the information is developed is not our concern. • Should contain both the “most generic” sense of a concept as well as any specialization required (and named appropriately) for precise communications • Mappings should be able to be “mixed and matched” with multiple concept modules based on context • Should only contain semantics that have a “ROI” for federation. • E.g. should not be implementing “application rules”, it is up to the “end applications” to enforce rules prior to mediation Conclusion: Conceptual models for federation may be a subset of (simpler than) conceptual models or operational ontologies intended for applications.Their semantics only need to support the mapping rules for representations. Implications for SMIF
  • 14. Is this a threat actor? (STIX) STIX is a large XML Schema
  • 15. What is a threat actor? (FBI) The Federal Bureau of Investigation has identified three categories of cyber threat actors: “ [1] organized crime groups that are primarily threatening the financial services sector, and they are expanding the scope of their attacks; [2] state sponsors — foreign governments that are interested in pilfering data, including intellectual property and research and development data from major manufacturers, government agencies, and defense contractors; and [3] increasingly there are terrorist groups who want to impact this country the same way they did on 9/11 by flying planes into buildings.They are seeking to use the network to challenge the United States by looking at critical infrastructure to disrupt or harm the viability of our way of life.[1]
  • 16. Is this a threat actor? Threat Actor? Our conclusion:Only the “real world” perspective can provide a pivot point between different representations. Instances of a conceptual “threat actor” are real threat actors, not data about them as data is structured for its application purpose.
  • 17. What is a threat actor? (Threat Model)
  • 19. Example of Mapping Pattern (UML Profile)
  • 20. SMIF Components – conceptual models and mappings to schema Conceptual Modeling UML Profile Mapping UML Profile Schema ProfileDependsOn MapTo SMIF Conceptual Metamodel Mapping Metamodel SMIF Information Metamodel DependsOn MapTo Represents SMIF Conceptual Meta Model UML Representation Represents Represents
  • 21. As easy as possible for domain stakeholders to understand the models as diagrams, tables or text “First class” n-ary relationships that may have properties and participate in other relationships Recognition and representation of time and context – most relationships are only true for a limited time and in specific situations Roles, phases and other “non rigid” classifications that also may be time or situationally dependent Hierarchies of types, relationship types and properties Business values as represented by various systems of units Patterns as a basis for real-world situations as well as mapping rules Note that while some of these features may be difficult for generic FOL reasoners, they are practical for specific conceptual model and mapping rules Important conceptual modeling and mapping capabilities
  • 22. Last years Ontology Summit focused on “Ontologies within Semantic Interoperability Ecosystems ”. Full document: http://tiny.cc/Ontolog2016 Supports the value proposition of SMIF • “The United States spent nearly $3.0 trillion on healthcare in 2014 [ . . . ] the efficient use of health information technology will result in considerable cost savings, in addition to saving thousands of lives per year”. Supports the need for capabilities not currently provided, focused on semantic interoperability • "More mature conceptual modeling & knowledge engineering tools are required to support the design, development and storage of ontologies and vocabularies. [ . . . ] these should be integrated into software development and data management tools that provide support across the IT lifecycle and related operations.“ • "There was broad consensus within the Summit that improved software tools and environments are required to support the integration of concepts and data." Ontology Summit: Semantic Interoperability Communique
  • 23. SMIF Implementation for information federation, sharing and analytics Status: PoC Federation Engine tm
  • 24. Prototype model & semantics implementation preview “SMIF Full” RDF Repository SMIF UML Models SMIF Java API UML Facet Eclipse UML Java Facet RDF/OWL Facet Rules SW Rules Facet SQL Facet XML Facet “Flat” Facet Planned API Generated from SMIF Model CL/IKL Facet Each facet provides an adapter between external data and a SMIF counterpart at any meta-level. A SMIFObject may have multiple facets. Mappings & federations are (conceptually) between SMIF objects. Analytics Implements SMIF Semantics as Java code XMI Facet
  • 25. Java API for any SMIF model (Including those imported from other sources) Production of “SMIF Full” RDF/OWL and rules Bi-directional data mapping – SQL, XML, OWL Semantic mapping between data repositories User definable rules in SMIF or Java Projection to “tabular” data formats for analytics engines Full support for multiple inheritance and multiple classification All of the above for FIBO, Concept Library and Threat/Risk Capabilities (In Progress)
  • 26. Using the SMIF UML profile – one projection of the SMIF model
  • 27. Entities & Facets Propositions FoundationalConceptualTools (Meta-Concepts) Characteristics Associations (Always true between entities) Relationships (Independent/Contextual) Rigid (Entity)Types Roles Phases
  • 28. • Reference models are conceptual, information models are application specific • Multiple inheritance & multiple (instance) classification is required • A type can have multiple Supertypes • A thing may have multiple types, these types may change over time and in differing context • Everything has metadata – source (including derivations), provenance & timeframe • Things change – time matters (the world is not static or a snapshot) • E.G. Relationships, situations and characteristics are temporal • Information is contextual (the world is not first order) • Statements are not “deleted”, they go in and out of context (Allows functional) • The world is open, conclusions (computations) are contextual • Trust in information varies, not everything that can be inferenced should be • Independence from representation (e.g. schema) and processing technologies (including inference engines) • But, we can bind to any of these technologies Type Based ModelTheory
  • 29. Roles define what something is for or how it behaves in a certain context, not “what it is”. A role is a <<Facet Of>> what it can be a role of. An entity can play any number of roles and these roles may change over time. Roles can be contextual and specialize other roles Roles are usually established by relationships Roles
  • 30. Phases (or states) are classifications of objects over their lifetime. A “Relative thing” Examples: Child,Teenager, Adult or Invoiced, Late, Paid May be combined with other types using unions and intersection (e.g. teenage driver) Phases
  • 31. Relationships are meaningful atomic concepts involving a set of other entities or relationships. A “Mediating thing”. There can be any number of related “ends”, but two ends is most common Relationships are atomic & static “Situations”, the involved ends do not change over the lifetime of the relationship. The context and “truth value” of the relationship may change. Relationships are temporal – exist for a timeframe. The timeframe of a relationship may or may not match the timeframe of the “ends” Relationships may be involved in other relationships and may have characteristics We refer to these independent relationships as “first class” relationships. Relationships
  • 32. Modeling with first class roles & relationships (Note that relationships can be temporal – be “true” only for a time period)
  • 33. Associations are similar to relationships but: • Not “first class” • Not temporal – they exist for the lifetime of the related “ends” • Limited to binary • Also known as “Formal Relations” • Map most directly to simple properties (e.g. OWL or Java) Associations
  • 34. Characteristics are features inherent in another type Most like “properties” in many languages Usually have a value type as their range John has a weight of 60 KG Characteristics are temporal (have a timeframe) & identifiable John weighed 60kg on March 3rd, 2011 In conceptual reference models quantity kinds (e.g. Mass) are preferred over specific units (e.g. kg) or data types (e.g. “Real”) as the types of characteristics Characteristics
  • 35. Properties encompass the “ends” of associations and relationships, and characteristics Like everything else – properties have types (property types) and instances (called bindings) Properties can be specialized and restricted Models typically define property types Specialized properties may be “virtually derived”, not define a new concept but restrict an existing one. These have no name or the same name as their super-property Property specialization is either a “subset” or redefines (equivalent set) Specializing properties
  • 36. Subset properties define subtypes of other property types (ends of associations or relationships, characteristics, etc). Extent of subset property is a subset of super-property May tighten constraints – multiplicity, type, etc. Subset properties
  • 37. Redefined properties define subtypes of other property types (ends of associations or relationships, characteristics, etc) and replace the super- property in the given context. Extent of redefined property is the same as the extent of super-property May tighten constraints – multiplicity, type, etc. Redefined Properties Restriction Example
  • 38. Disjoint defines a set of classifications that can never be combined in the same thing. • e.g. you can’t have a cat/dog Complete defines that a set of things can’t have any more members. Incomplete says there can be more in a set. • Certainly there are more kinds of animals than cats and dogs. Incomplete and Disjoint Note: Disjoint may also use a dependency
  • 39. It is essential that we understand what quantities represent . For numeric characteristics, we want to know what it means (e.g.Temperature), not the kind of number (Real). <<Quantity Kind>> is an aspect common to mutually comparable quantities represented by one or more units. A “unit value” represents a quantity kind, there are multiple units representing temperature. A physical representation would then represent the unit as some kind of number in a specified unit. Quantity Kinds & Units
  • 40. Situations contextualize one or more propositions – relationships, associations, characteristics or rules Situations are “mediating things” Situations are temporal Relationships are atomic situations Situations types / patterns and individuals may be defined Situations may be • Static or dynamic • Past, present or possible future Situations
  • 42. Situations capture “truth” in context (any such truth is subject to trust and likelihood)
  • 43. Models may be organized into packages (sub-graphs) and reference or assert other packages Model Organization
  • 44. Expressions • Define computations Patterns • Define sets of entities and relationships with variables, acts as a type. • May be asserted for a type or context • Used as the basis for mapping • In UML, uses structured classifiers Rules • Define constraints or implications for a context. E.g. Multiplicity. Mapping Rules • Defines how an information model relates to a conceptual reference model • Based on pattern matching • Multiple mappings are combined to federate or translate information AdditionalCapabilities, not detailed here
  • 45. SMIF provides a framework for defining a semantic model In the general sense of the word, this is an “Ontology” Tools can generate ontologies in various languages, such as OWL As the focus of SMIF is federation and integration, not general inference, there are some differences • SMIF allows things some ontology languages do not – context, “second order” relationships and time. • It includes higher level concepts such as first-class relationships, roles and phases • It does not include some capabilities of first order logic, it does not provide for general inferencing based on arbitrary expressions SMIF is a good match for the “semantic web” and linked data Semantics and SMIF
  • 46.
  • 47. The concept library is defined in and specializes SMIF, it is not defined as part of SMIF. Concept libraries are open source. These are candidate concept – refinements and alternatives are welcome. https://github.com/ModelDriven/ConceptLibraries/ SMIF Concept Library DefinedUsing Extends Domain Concept Models Extends Application Model References
  • 48. To facilitate cross domain and cross system interaction, we must have concepts common across those domains and systems We know they will be represented differently and may have different purposes, but we must understand what things mean We should be reusing concepts rather then building new stovepipes There will never be “one true” concept set, but we can define concepts for broad use and acceptance Concepts are introduced as: • They are needed in a domain of interest • They are an abstraction of one or more concepts needed to fully define another concept Concept Library Principles
  • 74.
  • 76. The above models were done with two “profiles” of UML. Profiles specialize UML for a specific purpose.
  • 77. Any UML tool can be used with the SMIF profile Our preferred tool is Magicdraw® The “Concept modeler” can produce and consume OWL ontologies based on conceptual models ModelingTools
  • 80. http://www.modeldriven.com https://github.com/ModelDriven/SIMF/ https://github.com/ModelDriven/ConceptLibraries/ SMIF is an “standard in progress” in the Object Management Group (omg.org) It is currently being used and implemented