SlideShare a Scribd company logo
1 of 37
Download to read offline
Jena based
 Implementation of a
ISO 11179 Meta-data
      Registry
  A. Anil Sinaci, SRDC
   sinaci@apache.org


                         1 / 37
About me
• PhD student at


• Senior Software Engineer at
• FP7 Projects:




                                2 / 37
Agenda
   Introduction
   Motivation
       FP7 – SALUS & BIVEE Projects
   Background
       ISO/IEC 11179
       Common Data Elements
   Design & Implementation
   Use-case
   Summary


                                       3 / 37
What is Meta-data?
• data about data… (deprecated)
  • at design time the application contains no data
    •   Descriptive metadata
  • Structural Metadata – data about the
    containers of data
• meta-data is data
  • can be stored and managed
  • meta-data registries


                                                  4 / 37
Importance of Meta-data



Meta-data                      Data




                                      5 / 37
Problem of Interoperability
Syntactic vs. Semantic


• The ability to
  exchange information
  •   access
• The ability to use the
  information once it has
  been exchanged
  •   understand


                                                                6 / 37

                            The figure is taken from a presentation of caSDR
Meta-data for Semantic Interoperability
•   Precise knowledge about how data is structured
•   More efficient and productive with a central, well-administered place to seek
    for meta-data
    •        Central, easily consumable

•   Classifications with well-known terminology systems
•   Build (or map) data models based on a common meta-model
        Patient Name

               Surname                          MDR
               Birth Date                                        ISO/IEC
               Sex                        Patient First name     11179
                                                 Last name
        Patient Firstname                        Date of Birth

               Surname                           Sex

               Date of Birth
               Gender



                                                                                7 / 37
Jena based ISO 11179
There are lots of MDR instances out there
•   Most of them are based on ISO/IEC 11179
    •   have the chance to interoperate semantically
•   ISO/IEC 11179 ontology
    •   common vocabulary for meta-data level
•   Manage all items, classifications, inter-relations and links
    to the external world (terminology systems, taxonomies,
    vocabularies)
    •   in a triple-store
    •   easily expose as RDF
    •   easily import as RDF

                                                               8 / 37
Interoperable through LOD

                                                                MDR
                                                   R
                                                   D
                                                   F      TripleStore
       MDR        R
                  D
    TripleStore   F




                                          R             MDR
                                          D
                                          F      TripleStore



                                                                                                    9 / 37

                      Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
Motivation
The SALUS Project: Pharmacovigilance
•   Current post-market safety surveillance and reporting activities are
    largely based on reports of suspected adverse drug reactions sent
    to the regulatory bodies
    •   5% of all hospital admissions in Europe are due to an adverse drug reaction
        (ADR)
    •   ADRs are the fifth most common cause of hospital deaths
    •   drug withdrawals (eg. Vioxx)

•   Interoperability between clinical care and clinical research domains
     A semantic interoperability architecture based on commonly
accepted data elements
                       http://www.salusproject.eu



                                                                                      10 / 37
SALUS Project   Central & Semantic
                Meta-data Registry




                               11 / 37
Motivation
The BIVEE Project: Business Innovation Management
• Software tools for the support of innovation & improvement
  management within Virtual Enterprises
     •   Document Centric Approach
          •   Identify document structures through common building blocks  common data elements

     •   CCTS – OASIS UBL
          •   based on ISO/IEC 11179

     •   Production & Innovation Knowledge Repository  Semantic Descriptors

•   Interoperability among business domains of collaborating partners within a
    Virtual Enterprise
     An interoperability architecture based on commonly accepted
    semantic descriptors (meta-data)
                                  http://www.bivee.eu


                                                                                                   12 / 37
BIVEE Project                                                            Central & Semantic
Virtual Enterprise                                                       Meta-data Registry
Environment                               BIVEE
                       Semantic
                                        Ontologies                Semantic
   Value                                 (I-PIKR)                               Business
   Production          Search/                                    Search/
                       Query/                                     Query/        Innovation
   Space                                                          Reasoning     Space
                       Reasoning
   Business
   Processes                                                                        Innovation
                                                                                    related
                                                        SD
                                                                                    docs
                                   SD
                     Semantic                                      Semantic
      KPIs                              SD                   SD    Annotation
                     Annotation                   SD

                                   SD                        SD
                                             SD
                                                       SD
    Production                                                                          KPIs
    related
    docs                                 F-PIKR                                     Innovation
Production
   Data                     VE Members                                                 Data
                           (competencies)


                                                             External
                                                             Resources                       13 / 37
The requirement
A clear need for a Common Data Element Repository to
  facilitate the semantic interoperability between different
  application domains
   •   to store the building blocks of data models of different domains
       and systems
   •   so that different data models are described through the
       aggregation and association of Common Data Elements
   •   should deal with several annotations and links to external world
       •   several vocabularies, classification schemes and terminology systems are
           currently in use for different domains

   •   should follow the characteristics of the Linked Data approach.




                                                                                      14 / 37
What is ISO/IEC 11179 ?

 • Family of standards addressing the;
   • Semantics of Data
   • Representation of Data
   • Registration of Data


 • ISO/IEC 11179 is;
   • Description of metadata in terms of Data Elements
   • Procedures to manage registry of Data Elements


                                                         15 / 37
Parts of ISO/IEC 11179
 Consists of 6 parts defining
    •   Framework for Specification
    •   Classification
    •   Registry Metamodel
    •   Formulations of Data Definitions
    •   Naming and Identification Principles
    •   Registration


 of Data Elements.


                                               16 / 37
Purpose of ISO/IEC 11179
ISO/IEC 11179 is to promote
 Standard description of data
 Common understanding of data across organizational
 elements and between organizations
 Re-use and standardization of data over time, space, and
 applications
 Harmonization and standardization of data within an
 organization and across organizations
 Management of the components of data
 Re-use of the components of data

                                                       17 / 37
Benefits of ISO/IEC 11179
• Similar CDE’s linked to same Concept’s;
                   reduced search time
• All representations of a CDE can be shown together;
                   increased filexibility
• CDE’s having same value domain can be shown together;
               easy administration of registry
• Concept of Object Class and Property;
            allows Linked Data representation
• Classification through External Vocabularies;
              allows Linked Data integration
                                                        18 / 37
Common Data Element

• Logical unit of data
• Belongs to one kind of information
• Set of attributes specifies;
     •   Identification
     •   Definition
     •   Representation
     •   Permissible value


                                       19 / 37
Common Data Element
                           Data Element



          Data Element
                                          Value Domain
            Concept


 Object
                     Property             Representation
 Class




                                                           20 / 37
Common Data Element
                                                        Data Element
                              Person Birth Date Value


Data Element                                                             Value
  Concept                                                               Domain
         Person Birth Date
                                                         Birth Date Value


                                                         Data type: Calendar
   Person                 Birth Date
Object Class              Property

            The concept                                  The representation
              What?                                            How?


                                                                                 21 / 37
Common Data Element
                                        Linked Data                                   Integration
                                                                                      with other
                                                                                      MDRs




                                           Linked Data
                                                                                 •    ICD9, ICD10
                                                                                 •    SNOMED CT
                                                                                 •    LOINC
                                                                                 •    RxNorm
                                                                                 •    WHO ART
                                                                                 •    MedDRA
                                                                                 •    ….                          22 / 37

  diagram adopted from http://ncicbtraining.nci.nih.gov//TPOnline/TPOnline.dll/Public%20Course/COURSENO=COUR2006121515230703800967
Common Data Element
 • Improves the quality of data
 • Simplifies data sharing
    • Knowledge sharing
 • Promotes standard, consistent, universal data
 • Ease of development
    • data collection tools
 • Data Interoperability between
    • applications
    • development teams
    • enterprises
    • …
         All require precise definitions of data
                                                    23 / 37
ISO/IEC 11179 Implementations
• OneMeta MDR, Data Foundations Inc.
     • extendible and configurable, commercial

• caDSR, US National Cancer Institute
     • Extension to standard, persisted on RDBMS

• CCTS, UN/CEFACT
     • Business data model standard based on 11179

  • UBL is an implementation of CCTS
• US National Information Exchange Model - NIEM


                                                     24 / 37
Organizations using ISO/IEC 11179
 • Australian Institute of Health and Welfare - METeOR
 • US Department of Justice - Global Justice XML Data Model GJXDM
 • US Environmental Protection Agency - Environmental Data Registry
 • US Health Information Knowledgebase (USHIK)
 • Ohio State University - open Metadata Repository (openMDR)
 • Minnesota Department of Education Metadata Registry (K-12 Data)
 • Minnesota Department of Revenue Property Taxation
 • The Census Bureau Corporate Metadata Repository
 • Statistics Canada Integrated MetaDataBase
 • The Environmental Data Registry
                                                                     25 / 37
Design & Implementation
ISO/IEC 11179 Ontology




                          26 / 37
Ontology Design




                  27 / 37
Design & Implementation
  Common Data Element (CDE) Repository




                                 CDE Repository Web GUI




         UML Model                                        Semantic Model
          Importer                                           Importer




                                                          Schema Model
                                                            Importer


                                     CDE Knowledge
                                         Base




                                                                           28 / 37
Design & Implementation

                                       Java API                 REST API
            CDE Knowledge Base




                  Semantic MDR


                                     MDR API
                     (Easy-to-use Semantic ISO 11179 Mapping)




                          Semantic Data Manipulation API
                            (Pure ISO 11179 Mapping)




                                 JENA RDF/OWL API

                                     Triple Store
                                 (Jena TDB | Virtuoso)   Data
                                                                           29 / 37
Use-case
Once we have an implementation for a semantic MDR
•   Need to populate with Common Data Elements
    •   Mining for CDEs
    •   Importers for different languages: XML Schema, UML, and
        ontology languages (RDFS/OWL)
•   Other applications must be built on top of the semantic
    MDR
    •   New content models referring to the CDEs
        •   Matching and mapping  Strong reasoning
    •   Data Warehouses, Web Services, EHR Systems, Content
        Management Systems etc…


                                                                  30 / 37
Use-case
List all “ClassificationScheme”s




List all “ObjectClass”es




                                   31 / 37
Use-case
Get all “Property”s of a Patient




                                   32 / 37
Use-case
List all “DataElement”s which are “classifiedBy”
  Myocardial Infarction
  (ClassificationSchemeItem) and Nifedipine
  (ClassificationSchemeItem) AND which have
  Allergy as “DataElementConcept”




                                                   33 / 37
Use-case II




              34 / 37
Summary
• Meta-data Registry to facilitate Semantic Interoperability through
  Common Data Elements (CDE)
    • For several different domains
• ISO/IEC 11179 based
    • well-established and commonly accepted standard
• Pure triple-store implementation access through Jena API
    • easy integration to Linked Data cloud
        • together with other MDR implementations

•   Importers for CDE identification
    •   XML Schema, UML (v1.x and v2.x), RDFS/OWL based ontologies
•   Apache Wicket based Web interface


                                                                       35 / 37
ISO/IEC 20943
Procedures for achieving metadata registry (MDR) content
  consistency
  •    formalized ontology generation with well-defined concepts
                                                                         Web Ontology
                 Metadata Registry
                  (ISO/IEC 11179)
              DEC             OC             CD

                         DE          ...


                                 realized
                         MDRs
                   (Sets of concepts)                                             build
                                           METeO                          Our Proposal
                                                                        Scope of this Part
            EDR            caDSR
                                             R
      (Environmental Data (US National (Metadata Online   utilized      Process Manager
           Registry)    Cancer Institute) Registry)
                                                                     Mapping Info. and Rulus   36 / 37
Thank you for listening…

Questions


      A. Anil Sinaci
       @aasinaci
                                     37 / 37
                       Special thanks to
                       anilpacaci@gmail.com

More Related Content

What's hot

MDM Strategy & Roadmap
MDM Strategy & RoadmapMDM Strategy & Roadmap
MDM Strategy & Roadmap
victorlbrown
 

What's hot (20)

MDM for product data with Talend
MDM for product data with Talend MDM for product data with Talend
MDM for product data with Talend
 
Data Modeling & Metadata Management
Data Modeling & Metadata ManagementData Modeling & Metadata Management
Data Modeling & Metadata Management
 
How to identify the correct Master Data subject areas & tooling for your MDM...
How to identify the correct Master Data subject areas & tooling for your MDM...How to identify the correct Master Data subject areas & tooling for your MDM...
How to identify the correct Master Data subject areas & tooling for your MDM...
 
Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3
 
How to Strengthen Enterprise Data Governance with Data Quality
How to Strengthen Enterprise Data Governance with Data QualityHow to Strengthen Enterprise Data Governance with Data Quality
How to Strengthen Enterprise Data Governance with Data Quality
 
Data Governance challenges in a major Energy Company
Data Governance challenges in a major Energy CompanyData Governance challenges in a major Energy Company
Data Governance challenges in a major Energy Company
 
IBM Industry Models and Data Lake
IBM Industry Models and Data Lake IBM Industry Models and Data Lake
IBM Industry Models and Data Lake
 
MDM & BI Strategy For Large Enterprises
MDM & BI Strategy For Large EnterprisesMDM & BI Strategy For Large Enterprises
MDM & BI Strategy For Large Enterprises
 
Essential Reference and Master Data Management
Essential Reference and Master Data ManagementEssential Reference and Master Data Management
Essential Reference and Master Data Management
 
5 Level of MDM Maturity
5 Level of MDM Maturity5 Level of MDM Maturity
5 Level of MDM Maturity
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best Practices
 
Requirements for a Master Data Management (MDM) Solution - Presentation
Requirements for a Master Data Management (MDM) Solution - PresentationRequirements for a Master Data Management (MDM) Solution - Presentation
Requirements for a Master Data Management (MDM) Solution - Presentation
 
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationData Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital Transformation
 
MDM and Reference Data
MDM and Reference DataMDM and Reference Data
MDM and Reference Data
 
MDM Strategy & Roadmap
MDM Strategy & RoadmapMDM Strategy & Roadmap
MDM Strategy & Roadmap
 
Dw & etl concepts
Dw & etl conceptsDw & etl concepts
Dw & etl concepts
 
Master Data Management
Master Data ManagementMaster Data Management
Master Data Management
 
Chapter 5: Data Development
Chapter 5: Data Development Chapter 5: Data Development
Chapter 5: Data Development
 
Identity and Access Management
Identity and Access ManagementIdentity and Access Management
Identity and Access Management
 
Master data management (mdm) & plm in context of enterprise product management
Master data management (mdm) & plm in context of enterprise product managementMaster data management (mdm) & plm in context of enterprise product management
Master data management (mdm) & plm in context of enterprise product management
 

Viewers also liked

070517 Jena
070517 Jena070517 Jena
070517 Jena
yuhana
 

Viewers also liked (20)

Code4Lib 2008 Metadata Registry
Code4Lib 2008   Metadata RegistryCode4Lib 2008   Metadata Registry
Code4Lib 2008 Metadata Registry
 
The Library of Congress Medium of Performance Thesaurus: Deployment through t...
The Library of Congress Medium of Performance Thesaurus: Deployment through t...The Library of Congress Medium of Performance Thesaurus: Deployment through t...
The Library of Congress Medium of Performance Thesaurus: Deployment through t...
 
Designing and launching the Clinical Reference Library
Designing and launching the Clinical Reference LibraryDesigning and launching the Clinical Reference Library
Designing and launching the Clinical Reference Library
 
An Ontology for K-12 Education and the NIEM
An Ontology for K-12 Education and the NIEMAn Ontology for K-12 Education and the NIEM
An Ontology for K-12 Education and the NIEM
 
Semantic Integration Patterns
Semantic Integration PatternsSemantic Integration Patterns
Semantic Integration Patterns
 
서울시 열린데이터 광장 문화관광 분야 LOD 서비스
서울시 열린데이터 광장 문화관광 분야 LOD 서비스서울시 열린데이터 광장 문화관광 분야 LOD 서비스
서울시 열린데이터 광장 문화관광 분야 LOD 서비스
 
Jena
JenaJena
Jena
 
070517 Jena
070517 Jena070517 Jena
070517 Jena
 
A Machine Learning Approach to SPARQL Query Performance Prediction
A Machine Learning Approach to SPARQL Query Performance PredictionA Machine Learning Approach to SPARQL Query Performance Prediction
A Machine Learning Approach to SPARQL Query Performance Prediction
 
17 using rules of inference to build arguments
17   using rules of inference to build arguments17   using rules of inference to build arguments
17 using rules of inference to build arguments
 
Linked Open Data in Romania
Linked Open Data in RomaniaLinked Open Data in Romania
Linked Open Data in Romania
 
An Introduction to the Jena API
An Introduction to the Jena APIAn Introduction to the Jena API
An Introduction to the Jena API
 
Semantic Integration with Apache Jena and Stanbol
Semantic Integration with Apache Jena and StanbolSemantic Integration with Apache Jena and Stanbol
Semantic Integration with Apache Jena and Stanbol
 
Unit 1 rules of inference
Unit 1  rules of inferenceUnit 1  rules of inference
Unit 1 rules of inference
 
Inference on the Semantic Web
Inference on the Semantic WebInference on the Semantic Web
Inference on the Semantic Web
 
LOD(Linked Open Data) Recommendations
LOD(Linked Open Data) RecommendationsLOD(Linked Open Data) Recommendations
LOD(Linked Open Data) Recommendations
 
Introduction of Deep Learning
Introduction of Deep LearningIntroduction of Deep Learning
Introduction of Deep Learning
 
LODAC 2017 Linked Open Data Workshop
LODAC 2017 Linked Open Data WorkshopLODAC 2017 Linked Open Data Workshop
LODAC 2017 Linked Open Data Workshop
 
Data Governance
Data GovernanceData Governance
Data Governance
 
Data, Information And Knowledge Management Framework And The Data Management ...
Data, Information And Knowledge Management Framework And The Data Management ...Data, Information And Knowledge Management Framework And The Data Management ...
Data, Information And Knowledge Management Framework And The Data Management ...
 

Similar to Jena based implementation of a iso 11179 meta data registry

10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...
10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...
10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...
Stichting ePortfolio Support
 
Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8
dallemang
 

Similar to Jena based implementation of a iso 11179 meta data registry (20)

Metadata in general and Dublin Core in specific; some experiences
Metadata in general and Dublin Core in specific; some experiencesMetadata in general and Dublin Core in specific; some experiences
Metadata in general and Dublin Core in specific; some experiences
 
MarkLogic Applications in Healthcare
MarkLogic Applications in HealthcareMarkLogic Applications in Healthcare
MarkLogic Applications in Healthcare
 
TCS Innovation Forum 2012 - Kitenga
TCS Innovation Forum 2012 - KitengaTCS Innovation Forum 2012 - Kitenga
TCS Innovation Forum 2012 - Kitenga
 
A Strategic View of Enterprise Reporting and Analytics: The Data Funnel
A Strategic View of Enterprise Reporting and Analytics: The Data FunnelA Strategic View of Enterprise Reporting and Analytics: The Data Funnel
A Strategic View of Enterprise Reporting and Analytics: The Data Funnel
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
 
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found..."Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...
 
Towards a brokering framework for knowledge-based services: Learning from the...
Towards a brokering framework for knowledge-based services: Learning from the...Towards a brokering framework for knowledge-based services: Learning from the...
Towards a brokering framework for knowledge-based services: Learning from the...
 
Pistoia Alliance SESL pilot Bio IT World Hanover 12 Oct 2011
Pistoia Alliance SESL pilot Bio IT World Hanover 12 Oct 2011Pistoia Alliance SESL pilot Bio IT World Hanover 12 Oct 2011
Pistoia Alliance SESL pilot Bio IT World Hanover 12 Oct 2011
 
10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...
10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...
10052012 luc vervenne synergetics van syntax portfolio naar semantische uitwi...
 
EDF2013: Invited Talk Bastiaan Deblieck: Who remembers EDP?
EDF2013: Invited Talk Bastiaan Deblieck: Who remembers EDP?EDF2013: Invited Talk Bastiaan Deblieck: Who remembers EDP?
EDF2013: Invited Talk Bastiaan Deblieck: Who remembers EDP?
 
Teradata Big Data London Seminar
Teradata Big Data London SeminarTeradata Big Data London Seminar
Teradata Big Data London Seminar
 
SplunkLive: New Visibility=New Opportunity: How IT Can Drive Business Value
SplunkLive: New Visibility=New Opportunity: How IT Can Drive Business Value SplunkLive: New Visibility=New Opportunity: How IT Can Drive Business Value
SplunkLive: New Visibility=New Opportunity: How IT Can Drive Business Value
 
What is SDMX-RDF?
What is SDMX-RDF?What is SDMX-RDF?
What is SDMX-RDF?
 
Evaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics PlatformsEvaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics Platforms
 
Ontotext Overview Winter 2012
Ontotext Overview Winter 2012Ontotext Overview Winter 2012
Ontotext Overview Winter 2012
 
Turbo-Charge Your Analytics with IBM Netezza and Revolution R Enterprise: A S...
Turbo-Charge Your Analytics with IBM Netezza and Revolution R Enterprise: A S...Turbo-Charge Your Analytics with IBM Netezza and Revolution R Enterprise: A S...
Turbo-Charge Your Analytics with IBM Netezza and Revolution R Enterprise: A S...
 
Deploying Enterprise Search in PLM Context with Aras
Deploying Enterprise Search in PLM Context with ArasDeploying Enterprise Search in PLM Context with Aras
Deploying Enterprise Search in PLM Context with Aras
 
Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8
 

Recently uploaded

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software Engineering
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Decarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceDecarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational Performance
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 

Jena based implementation of a iso 11179 meta data registry

  • 1. Jena based Implementation of a ISO 11179 Meta-data Registry A. Anil Sinaci, SRDC sinaci@apache.org 1 / 37
  • 2. About me • PhD student at • Senior Software Engineer at • FP7 Projects: 2 / 37
  • 3. Agenda  Introduction  Motivation  FP7 – SALUS & BIVEE Projects  Background  ISO/IEC 11179  Common Data Elements  Design & Implementation  Use-case  Summary 3 / 37
  • 4. What is Meta-data? • data about data… (deprecated) • at design time the application contains no data • Descriptive metadata • Structural Metadata – data about the containers of data • meta-data is data • can be stored and managed • meta-data registries 4 / 37
  • 6. Problem of Interoperability Syntactic vs. Semantic • The ability to exchange information • access • The ability to use the information once it has been exchanged • understand 6 / 37 The figure is taken from a presentation of caSDR
  • 7. Meta-data for Semantic Interoperability • Precise knowledge about how data is structured • More efficient and productive with a central, well-administered place to seek for meta-data • Central, easily consumable • Classifications with well-known terminology systems • Build (or map) data models based on a common meta-model Patient Name Surname MDR Birth Date ISO/IEC Sex Patient First name 11179 Last name Patient Firstname Date of Birth Surname Sex Date of Birth Gender 7 / 37
  • 8. Jena based ISO 11179 There are lots of MDR instances out there • Most of them are based on ISO/IEC 11179 • have the chance to interoperate semantically • ISO/IEC 11179 ontology • common vocabulary for meta-data level • Manage all items, classifications, inter-relations and links to the external world (terminology systems, taxonomies, vocabularies) • in a triple-store • easily expose as RDF • easily import as RDF 8 / 37
  • 9. Interoperable through LOD MDR R D F TripleStore MDR R D TripleStore F R MDR D F TripleStore 9 / 37 Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
  • 10. Motivation The SALUS Project: Pharmacovigilance • Current post-market safety surveillance and reporting activities are largely based on reports of suspected adverse drug reactions sent to the regulatory bodies • 5% of all hospital admissions in Europe are due to an adverse drug reaction (ADR) • ADRs are the fifth most common cause of hospital deaths • drug withdrawals (eg. Vioxx) • Interoperability between clinical care and clinical research domains  A semantic interoperability architecture based on commonly accepted data elements http://www.salusproject.eu 10 / 37
  • 11. SALUS Project Central & Semantic Meta-data Registry 11 / 37
  • 12. Motivation The BIVEE Project: Business Innovation Management • Software tools for the support of innovation & improvement management within Virtual Enterprises • Document Centric Approach • Identify document structures through common building blocks  common data elements • CCTS – OASIS UBL • based on ISO/IEC 11179 • Production & Innovation Knowledge Repository  Semantic Descriptors • Interoperability among business domains of collaborating partners within a Virtual Enterprise  An interoperability architecture based on commonly accepted semantic descriptors (meta-data) http://www.bivee.eu 12 / 37
  • 13. BIVEE Project Central & Semantic Virtual Enterprise Meta-data Registry Environment BIVEE Semantic Ontologies Semantic Value (I-PIKR) Business Production Search/ Search/ Query/ Query/ Innovation Space Reasoning Space Reasoning Business Processes Innovation related SD docs SD Semantic Semantic KPIs SD SD Annotation Annotation SD SD SD SD SD Production KPIs related docs F-PIKR Innovation Production Data VE Members Data (competencies) External Resources 13 / 37
  • 14. The requirement A clear need for a Common Data Element Repository to facilitate the semantic interoperability between different application domains • to store the building blocks of data models of different domains and systems • so that different data models are described through the aggregation and association of Common Data Elements • should deal with several annotations and links to external world • several vocabularies, classification schemes and terminology systems are currently in use for different domains • should follow the characteristics of the Linked Data approach. 14 / 37
  • 15. What is ISO/IEC 11179 ? • Family of standards addressing the; • Semantics of Data • Representation of Data • Registration of Data • ISO/IEC 11179 is; • Description of metadata in terms of Data Elements • Procedures to manage registry of Data Elements 15 / 37
  • 16. Parts of ISO/IEC 11179 Consists of 6 parts defining • Framework for Specification • Classification • Registry Metamodel • Formulations of Data Definitions • Naming and Identification Principles • Registration of Data Elements. 16 / 37
  • 17. Purpose of ISO/IEC 11179 ISO/IEC 11179 is to promote Standard description of data Common understanding of data across organizational elements and between organizations Re-use and standardization of data over time, space, and applications Harmonization and standardization of data within an organization and across organizations Management of the components of data Re-use of the components of data 17 / 37
  • 18. Benefits of ISO/IEC 11179 • Similar CDE’s linked to same Concept’s; reduced search time • All representations of a CDE can be shown together; increased filexibility • CDE’s having same value domain can be shown together; easy administration of registry • Concept of Object Class and Property; allows Linked Data representation • Classification through External Vocabularies; allows Linked Data integration 18 / 37
  • 19. Common Data Element • Logical unit of data • Belongs to one kind of information • Set of attributes specifies; • Identification • Definition • Representation • Permissible value 19 / 37
  • 20. Common Data Element Data Element Data Element Value Domain Concept Object Property Representation Class 20 / 37
  • 21. Common Data Element Data Element Person Birth Date Value Data Element Value Concept Domain Person Birth Date Birth Date Value Data type: Calendar Person Birth Date Object Class Property The concept The representation What? How? 21 / 37
  • 22. Common Data Element Linked Data Integration with other MDRs Linked Data • ICD9, ICD10 • SNOMED CT • LOINC • RxNorm • WHO ART • MedDRA • …. 22 / 37 diagram adopted from http://ncicbtraining.nci.nih.gov//TPOnline/TPOnline.dll/Public%20Course/COURSENO=COUR2006121515230703800967
  • 23. Common Data Element • Improves the quality of data • Simplifies data sharing • Knowledge sharing • Promotes standard, consistent, universal data • Ease of development • data collection tools • Data Interoperability between • applications • development teams • enterprises • …  All require precise definitions of data 23 / 37
  • 24. ISO/IEC 11179 Implementations • OneMeta MDR, Data Foundations Inc. • extendible and configurable, commercial • caDSR, US National Cancer Institute • Extension to standard, persisted on RDBMS • CCTS, UN/CEFACT • Business data model standard based on 11179 • UBL is an implementation of CCTS • US National Information Exchange Model - NIEM 24 / 37
  • 25. Organizations using ISO/IEC 11179 • Australian Institute of Health and Welfare - METeOR • US Department of Justice - Global Justice XML Data Model GJXDM • US Environmental Protection Agency - Environmental Data Registry • US Health Information Knowledgebase (USHIK) • Ohio State University - open Metadata Repository (openMDR) • Minnesota Department of Education Metadata Registry (K-12 Data) • Minnesota Department of Revenue Property Taxation • The Census Bureau Corporate Metadata Repository • Statistics Canada Integrated MetaDataBase • The Environmental Data Registry 25 / 37
  • 26. Design & Implementation ISO/IEC 11179 Ontology 26 / 37
  • 27. Ontology Design 27 / 37
  • 28. Design & Implementation Common Data Element (CDE) Repository CDE Repository Web GUI UML Model Semantic Model Importer Importer Schema Model Importer CDE Knowledge Base 28 / 37
  • 29. Design & Implementation Java API REST API CDE Knowledge Base Semantic MDR MDR API (Easy-to-use Semantic ISO 11179 Mapping) Semantic Data Manipulation API (Pure ISO 11179 Mapping) JENA RDF/OWL API Triple Store (Jena TDB | Virtuoso) Data 29 / 37
  • 30. Use-case Once we have an implementation for a semantic MDR • Need to populate with Common Data Elements • Mining for CDEs • Importers for different languages: XML Schema, UML, and ontology languages (RDFS/OWL) • Other applications must be built on top of the semantic MDR • New content models referring to the CDEs • Matching and mapping  Strong reasoning • Data Warehouses, Web Services, EHR Systems, Content Management Systems etc… 30 / 37
  • 31. Use-case List all “ClassificationScheme”s List all “ObjectClass”es 31 / 37
  • 32. Use-case Get all “Property”s of a Patient 32 / 37
  • 33. Use-case List all “DataElement”s which are “classifiedBy” Myocardial Infarction (ClassificationSchemeItem) and Nifedipine (ClassificationSchemeItem) AND which have Allergy as “DataElementConcept” 33 / 37
  • 34. Use-case II 34 / 37
  • 35. Summary • Meta-data Registry to facilitate Semantic Interoperability through Common Data Elements (CDE) • For several different domains • ISO/IEC 11179 based • well-established and commonly accepted standard • Pure triple-store implementation access through Jena API • easy integration to Linked Data cloud • together with other MDR implementations • Importers for CDE identification • XML Schema, UML (v1.x and v2.x), RDFS/OWL based ontologies • Apache Wicket based Web interface 35 / 37
  • 36. ISO/IEC 20943 Procedures for achieving metadata registry (MDR) content consistency • formalized ontology generation with well-defined concepts Web Ontology Metadata Registry (ISO/IEC 11179) DEC OC CD DE ... realized MDRs (Sets of concepts) build METeO Our Proposal Scope of this Part EDR caDSR R (Environmental Data (US National (Metadata Online utilized Process Manager Registry) Cancer Institute) Registry) Mapping Info. and Rulus 36 / 37
  • 37. Thank you for listening… Questions A. Anil Sinaci @aasinaci 37 / 37 Special thanks to anilpacaci@gmail.com