SlideShare a Scribd company logo
1 of 16
Download to read offline
©2016 Allotrope Foundation
Allotrope Foundation:
Driving Metadata & Master Data Management through
Improved Data Modeling with Semantic Technologies
Dana Vanderwall, Ph.D.
Director, Biology & Preclinical IT (BMS)
Vice Chair, Board of Directors (Allotrope)
Eric Little, Ph.D.
Vice President, Data Science (OSTHUS)
Adjunct Professor (NYU Polytechnic School of Engineering )
©2016 Allotrope Foundation
The Current Situation in the Lab
Many challenges exist for data to be
captured, integrated and shared
• Data Silos
• Incompatible instruments and
software systems
• Legacy architectures are brittle
and rigid
• SME knowledge resides in
people’s heads
• Data schemas are not explicitly
understood
• Lack of common vision between
business units and scientists
2
©2016 Allotrope Foundation
How do we change that?
3
Data in Standard Format
Metadata in a Standard vocabularyRegulatory Guidance
Methods
Recipes
SOPs
…
Vendor-Specific Formats
Process
Material
Equipment
Result
©2016 Allotrope Foundation
Allotrope Foundation: Driving the Change
4
• Subject Matter Experts
• Project Funding
Member Companies
• Project Management
• Legal & Logistical Support
Secretariat
• Framework Development
• Technical Leadership
Professional
Software Firm
• Requirements & Specifications
• Contributions, PoC Applications
Partner Network
AbbVie
Amgen
Baxter
Bayer
Biogen
Boehringer Ingelheim
Bristol-Myers Squibb
Eli Lilly
Genentech/Roche
GlaxoSmithKline
Merck & Co.
Pfizer
ACD/Labs
Agilent
Biovia
Bruker
BSSN
IDBS
LabAnswer
LabVantage
LEAP Technologies
Mestrelab Research
Mettler Toledo
PerkinElmer
Persistent Systems
Riffyn
Sartorius
Shimadzu
Tetra Science
Thermo Scientific
Waters
Erasmus Univ.
Med Center
J. Paul Getty
Trust
(UK) Science and
Technology
Facilities Council
University of
Southampton
University of
Strathclyde
©2016 Allotrope Foundation
Allotrope Data Format (ADF)
5
Data Description
RDF Model
Data Cubes
Universal data container
Data Package
Virtual file system *
Contains:
• Method, instrument, sample,
process, result, etc.
• Data cube metadata
• Data package metadata
• …
Analytical data represented by
one- or multidimensional arrays.
HDF5
Platform Independent File Format
Allotrope Data Format
Analytical data represented by
arbitrary formats, incl. native
instrument formats, images, pdf,
video, etc.
Specifically designed to store and
organize large amounts of
numerical data.
APIs(Java&.NETclasslibraries)
v1.0 ADF, Taxonomies, Class Libraries released Sept 2015, v1.1 April 2016
©2016 Allotrope Foundation
Moving from Data Format to Semantics
• Has its origins in philosophy - generally understood as the abstract study
of meaning
• Distinguished from syntax – which is the rules-based grammar of a
language
6
“Washington”
©2016 Allotrope Foundation
Allotrope Foundation Taxonomies (AFT)
7
©2016 Allotrope Foundation
Result
Process
Equipment
8©2016 Allotrope Foundation
Allotrope Taxonomies Standardize our Metadata
©2016 Allotrope Foundation
Utilizing the Semantic Spectrum
(Moving Beyond Taxonomies)
9
Code (Lists) Terms (Soil, Plant, etc.)
Controlled Vocabulary
(Agreed Upon Terms)
Taxonomy
(Hierarchy)
Thesaurus
(Preferred Labels, Synonyms, etc.)
RDF Models
(Triples as Graphs)
OWL Ontologies
(RDF + Axioms)
Reasoning
(Rule-based Logics:
Discover New Patterns)
Ontologies and Reasoning add
Axioms and Advanced Logic
©2016 Allotrope Foundation
Understanding the 4V’s of Big Data
10
Normally the focus –
Big Data Analysis is
more than just size
Performance is
Critical to Success
Data complexity is
increasing – Model
complexity
Uncertainty abounds
– requires statistics
and probabilities
Majority of Big Data analytics
approaches treat these two V’s
Semantic
technologies provide
clear advantages
Mathematical
Clustering
Techniques
provide clear
advantages
©2016 Allotrope Foundation
Why Semantics Matters for Data Analytics
11
Big Data approaches require
proper metadata and
terminologies to integrate
information well
Relationships matter in the
data
Understanding perspective
(context) is crucial for success
in today’s world
Semantics provides better
data models/schemas
©2016 Allotrope Foundation
The Foundation for Real Data Analytics on the Laboratory
Workflow and Data
12
Plan
Analysis
Prepare
Samples
Submit
Samples
Control Inst.
Acquire Data
Process
Data
Analyze
Data
Reports
Results
Store,
Archive
Data
Request Report
Search &
Reuse
Data
Sample Prep
Data
Instrument
Instructions
Instrument
Data Processed Data Analyzed Data Reported
Results Stored DataAnalytical
Method
Data Description
RDF Model
Data Cubes
Universal data
container
Data Package
Virtual file system
Allotrope Data Format
©2016 Allotrope Foundation
How is the Framework Being Used?
Implementation by Member Companies
13
DevelopmentResearch Commercial
Member non-GMP GMPInstrument
BMS
Bayer
Baxter
Merck & Co.
Amgen
Boehringer-
Ingelheim
GSK Drug Substance Release & Stability
Structure ID, Purification,
In vitro bioanalysis
Method Screening
HPLC-
UV/MS
HPLC-UV
Balance
HPLC-
UV/MS
Structure IDHPLC-MS
Fermentation
Process Control
Bioanalyzer
Small and Large Molecule CMC
Genentech
Elemental Impurities
Assay, PurityHPLC-UV
Biogen CRO IntegrationHPLC-UV
Pfizer LC Data to ADF Converter/AdapterHPLC-UV
ICP-MS
pH, Weighing, GC, Karl Fischer, TGA, NMR ,
Cell Density/Viability, Blood Gas Analyzer, Cell
Culture Analyzer, Capillary Electrophoresis…
©2016 Allotrope Foundation
How is the Framework Being Used?
Implementations by Member Companies
14
DevelopmentResearch Commercial
Member non-GMP GMPInstrument
Member 6
Member 3
Member 2
Member 9
Member 1
Member 5
Member 8 Drug Substance Release & Stability
Structure ID, Purification,
In vitro bioanalysis
Method ScreeningHPLC-UV/MS
HPLC-UV
Balance
HPLC-UV/MS
Structure IDHPLC-MS
Fermentation
Process Control
Bioanalyzer
Small and Large Molecule CMC
Multiple
types
Member 7
Elemental ImpuritiesICP-MS
Assay, PurityHPLC-UV
pH, Weighing, GC, Karl Fischer, TGA,
NMR , Cell Density/Viability, Blood
Gas Analyzer , Cell Culture Analyzer,
Capillary Electrophoresis
…
Member 4 CRO IntegrationHPLC-UV
Member 10 LC Data to ADF Converter/AdapterHPLC-UV
DevelopmentResearch Commercial
Member 6
Member 9
Member 8 Drug Substance Release & Stability
Structure ID, Purification,
In vitro bioanalysis
Method ScreeningHPLC-UV/MS
HPLC-UV
Balance
HPLC-UV/MS
Member 10 LC Data to ADF Converter/AdapterHPLC-UV
Member 1 Small and Large Molecule CMC
Multiple
types
taxonomies
methods
repository
data repository
adapter
instrument adapter
pH, Weighing, GC, Karl Fischer, TGA, NMR ,
Cell Density/Viability, Blood Gas Analyzer, Cell
Culture Analyzer, Capillary Electrophoresis…
©2016 Allotrope Foundation
Smart Labs for the 21st Century
Smart labs in the future will provide the
enterprise with:
• Integrated Data – common reference
data structures (vocabularies)
• Sharable Data – easier interaction
across teams and business units
• Scalability – Big data applications that
can be highly elastic
• Conceptual Representations – context
and perspective are captured
• Advanced Analytics – complex &
automated problem-solving capabilities
©2016 Allotrope Foundation
Thank you!
• Any questions, please contact the Secretariat at
more.info@allotrope.org or james.vergis@dbr.com
• 2016 Workshops
– January 20, 2016: San Francisco, CA @ Genentech
– June 2016: Ingelheim, Germany @ Boehringer Ingelheim
– September 2016: Indianapolis, ID @ Eli Lilly and Co.
http://www.allotrope.org for more information and to register
16

More Related Content

What's hot

IC-SDV 2019: OntoChem
IC-SDV 2019: OntoChemIC-SDV 2019: OntoChem
IC-SDV 2019: OntoChem
Dr. Haxel Consult
 
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
Dr. Haxel Consult
 
eTRIKS Data Harmonization Service Platform
eTRIKS Data Harmonization Service PlatformeTRIKS Data Harmonization Service Platform
eTRIKS Data Harmonization Service Platform
ibemam
 
Building linked data large-scale chemistry platform - challenges, lessons and...
Building linked data large-scale chemistry platform - challenges, lessons and...Building linked data large-scale chemistry platform - challenges, lessons and...
Building linked data large-scale chemistry platform - challenges, lessons and...
Valery Tkachenko
 
How to Rapidly Configure Oracle Life Sciences Data Hub (LSH) to Support the M...
How to Rapidly Configure Oracle Life Sciences Data Hub (LSH) to Support the M...How to Rapidly Configure Oracle Life Sciences Data Hub (LSH) to Support the M...
How to Rapidly Configure Oracle Life Sciences Data Hub (LSH) to Support the M...
Perficient
 

What's hot (20)

Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesApplication of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
 
Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matrices
 
IC-SDV 2019: OntoChem
IC-SDV 2019: OntoChemIC-SDV 2019: OntoChem
IC-SDV 2019: OntoChem
 
Open interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIOpen interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBI
 
Introduction of BJU-BMR-RG and use case study of Applying openEHR archetypes ...
Introduction of BJU-BMR-RG and use case study of Applying openEHR archetypes ...Introduction of BJU-BMR-RG and use case study of Applying openEHR archetypes ...
Introduction of BJU-BMR-RG and use case study of Applying openEHR archetypes ...
 
Automated and Explainable Deep Learning for Clinical Language Understanding a...
Automated and Explainable Deep Learning for Clinical Language Understanding a...Automated and Explainable Deep Learning for Clinical Language Understanding a...
Automated and Explainable Deep Learning for Clinical Language Understanding a...
 
openEHR sll-2015final
openEHR sll-2015finalopenEHR sll-2015final
openEHR sll-2015final
 
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse EnvironmentsEnabling Clinical Data Reuse with openEHR Data Warehouse Environments
Enabling Clinical Data Reuse with openEHR Data Warehouse Environments
 
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
ICIC 2014 Finding Answers in the Data – The Future Role of Text and Data Mini...
 
eHealth Foundations: Can openEHR Provide One Layer?
eHealth Foundations: Can openEHR Provide One Layer?eHealth Foundations: Can openEHR Provide One Layer?
eHealth Foundations: Can openEHR Provide One Layer?
 
eTRIKS Data Harmonization Service Platform
eTRIKS Data Harmonization Service PlatformeTRIKS Data Harmonization Service Platform
eTRIKS Data Harmonization Service Platform
 
Why ICT Fails in Healthcare: Software Maintenance and Maintainability
Why ICT Fails in Healthcare: Software Maintenance and MaintainabilityWhy ICT Fails in Healthcare: Software Maintenance and Maintainability
Why ICT Fails in Healthcare: Software Maintenance and Maintainability
 
Building linked data large-scale chemistry platform - challenges, lessons and...
Building linked data large-scale chemistry platform - challenges, lessons and...Building linked data large-scale chemistry platform - challenges, lessons and...
Building linked data large-scale chemistry platform - challenges, lessons and...
 
What is FHIR
What is FHIRWhat is FHIR
What is FHIR
 
ICIC 2014 New Product Introduction Wiley
ICIC 2014 New Product Introduction WileyICIC 2014 New Product Introduction Wiley
ICIC 2014 New Product Introduction Wiley
 
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTSImplementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
 
openEHR Medinfo2015 Brazil Sponsor Session
openEHR Medinfo2015 Brazil Sponsor SessionopenEHR Medinfo2015 Brazil Sponsor Session
openEHR Medinfo2015 Brazil Sponsor Session
 
How to Rapidly Configure Oracle Life Sciences Data Hub (LSH) to Support the M...
How to Rapidly Configure Oracle Life Sciences Data Hub (LSH) to Support the M...How to Rapidly Configure Oracle Life Sciences Data Hub (LSH) to Support the M...
How to Rapidly Configure Oracle Life Sciences Data Hub (LSH) to Support the M...
 
Clinical modelling with openEHR Archetypes
Clinical modelling with openEHR ArchetypesClinical modelling with openEHR Archetypes
Clinical modelling with openEHR Archetypes
 
ICIC 2014 New Product Introduction InfoChem
ICIC 2014 New Product Introduction InfoChemICIC 2014 New Product Introduction InfoChem
ICIC 2014 New Product Introduction InfoChem
 

Similar to Allotrope foundation vanderwall_and_little_bio_it_world_2016

Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Carole Goble
 
Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Let’s go on a FAIR safari!
Let’s go on a FAIR safari!
Carole Goble
 

Similar to Allotrope foundation vanderwall_and_little_bio_it_world_2016 (20)

Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Resource Description Framework Approach to Data Publication and Federation
Resource Description Framework Approach to Data Publication and FederationResource Description Framework Approach to Data Publication and Federation
Resource Description Framework Approach to Data Publication and Federation
 
Curlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge ManagementCurlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge Management
 
How SAP HANA can provide value for Pharma R&D
How SAP HANA can provide value for Pharma R&DHow SAP HANA can provide value for Pharma R&D
How SAP HANA can provide value for Pharma R&D
 
Common Protocol Template (CPT) Initiative - Implementation Toolkit Executive ...
Common Protocol Template (CPT) Initiative - Implementation Toolkit Executive ...Common Protocol Template (CPT) Initiative - Implementation Toolkit Executive ...
Common Protocol Template (CPT) Initiative - Implementation Toolkit Executive ...
 
Crossref LIVE: The Benefits of Open Infrastructure (APAC time zones) - 29th O...
Crossref LIVE: The Benefits of Open Infrastructure (APAC time zones) - 29th O...Crossref LIVE: The Benefits of Open Infrastructure (APAC time zones) - 29th O...
Crossref LIVE: The Benefits of Open Infrastructure (APAC time zones) - 29th O...
 
FAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology AgencyFAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology Agency
 
Pistoia Chemistry Live Strategy April 2011
Pistoia Chemistry Live Strategy April 2011Pistoia Chemistry Live Strategy April 2011
Pistoia Chemistry Live Strategy April 2011
 
Objects in Motion The Institutional Repositories Landscape
Objects in Motion The Institutional Repositories LandscapeObjects in Motion The Institutional Repositories Landscape
Objects in Motion The Institutional Repositories Landscape
 
Active research management and sharing
Active research management and sharingActive research management and sharing
Active research management and sharing
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
 
An Approach to Combining Disparate Clinical Study Data across Multiple Sponso...
An Approach to Combining Disparate Clinical Study Data across Multiple Sponso...An Approach to Combining Disparate Clinical Study Data across Multiple Sponso...
An Approach to Combining Disparate Clinical Study Data across Multiple Sponso...
 
FAIR BioData Management
FAIR BioData ManagementFAIR BioData Management
FAIR BioData Management
 
Starting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchStarting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer Research
 
Starting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchStarting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer Research
 
Wheat Data Interoperability (1) by Esther DZALE YEUMO KABORE and Richard FULSS
Wheat Data Interoperability (1) by Esther DZALE YEUMO KABORE and Richard FULSSWheat Data Interoperability (1) by Esther DZALE YEUMO KABORE and Richard FULSS
Wheat Data Interoperability (1) by Esther DZALE YEUMO KABORE and Richard FULSS
 
Common Protocol Template Executive Summary
Common Protocol Template Executive SummaryCommon Protocol Template Executive Summary
Common Protocol Template Executive Summary
 
FAIR and metadata standards - FAIRsharing and Neuroscience
FAIR and metadata standards - FAIRsharing and NeuroscienceFAIR and metadata standards - FAIRsharing and Neuroscience
FAIR and metadata standards - FAIRsharing and Neuroscience
 
Pistoia Alliance Debates: Ontologies mapping webinar 23rd Feb 2017
Pistoia Alliance Debates: Ontologies mapping webinar 23rd Feb 2017Pistoia Alliance Debates: Ontologies mapping webinar 23rd Feb 2017
Pistoia Alliance Debates: Ontologies mapping webinar 23rd Feb 2017
 
Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Let’s go on a FAIR safari!
Let’s go on a FAIR safari!
 

More from OSTHUS

More from OSTHUS (14)

The Fast Track to Fair Lab Data
The Fast Track to Fair Lab Data The Fast Track to Fair Lab Data
The Fast Track to Fair Lab Data
 
Challenges & Opportunities of Implementation FAIR in Life Sciences
Challenges & Opportunities of Implementation FAIR in Life SciencesChallenges & Opportunities of Implementation FAIR in Life Sciences
Challenges & Opportunities of Implementation FAIR in Life Sciences
 
Data lifecycle mgt across the enterprise
Data lifecycle mgt across the enterpriseData lifecycle mgt across the enterprise
Data lifecycle mgt across the enterprise
 
From allotrope to reference master data management
From allotrope to reference master data management From allotrope to reference master data management
From allotrope to reference master data management
 
Big Data becomes Big Analysis
Big Data becomes Big Analysis Big Data becomes Big Analysis
Big Data becomes Big Analysis
 
Early AI Adoption Via Advanced Analytics
Early AI Adoption Via  Advanced AnalyticsEarly AI Adoption Via  Advanced Analytics
Early AI Adoption Via Advanced Analytics
 
Why Data is Becoming the Most Valuable Asset Companies Posses
Why Data is Becoming the Most Valuable Asset Companies PossesWhy Data is Becoming the Most Valuable Asset Companies Posses
Why Data is Becoming the Most Valuable Asset Companies Posses
 
Demystifying Semantics:Practical Utilization of Semantic Technologies for Rea...
Demystifying Semantics:Practical Utilization of Semantic Technologies for Rea...Demystifying Semantics:Practical Utilization of Semantic Technologies for Rea...
Demystifying Semantics:Practical Utilization of Semantic Technologies for Rea...
 
Why paperless lab is just the first step towards a smart lab
Why paperless lab is just the first step towards a smart labWhy paperless lab is just the first step towards a smart lab
Why paperless lab is just the first step towards a smart lab
 
Smart Data for Smart Labs
Smart Data for Smart Labs Smart Data for Smart Labs
Smart Data for Smart Labs
 
Reasoning over big data
Reasoning over big dataReasoning over big data
Reasoning over big data
 
Best Practice Reference Architecture for Data Curation
Best Practice Reference Architecture for Data CurationBest Practice Reference Architecture for Data Curation
Best Practice Reference Architecture for Data Curation
 
Data Quality- How to clean up your legacy data
Data Quality- How to clean up your legacy dataData Quality- How to clean up your legacy data
Data Quality- How to clean up your legacy data
 
Data Quality- How to clean up your legacy data?
Data Quality- How to clean up your legacy data?Data Quality- How to clean up your legacy data?
Data Quality- How to clean up your legacy data?
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 

Allotrope foundation vanderwall_and_little_bio_it_world_2016

  • 1. ©2016 Allotrope Foundation Allotrope Foundation: Driving Metadata & Master Data Management through Improved Data Modeling with Semantic Technologies Dana Vanderwall, Ph.D. Director, Biology & Preclinical IT (BMS) Vice Chair, Board of Directors (Allotrope) Eric Little, Ph.D. Vice President, Data Science (OSTHUS) Adjunct Professor (NYU Polytechnic School of Engineering )
  • 2. ©2016 Allotrope Foundation The Current Situation in the Lab Many challenges exist for data to be captured, integrated and shared • Data Silos • Incompatible instruments and software systems • Legacy architectures are brittle and rigid • SME knowledge resides in people’s heads • Data schemas are not explicitly understood • Lack of common vision between business units and scientists 2
  • 3. ©2016 Allotrope Foundation How do we change that? 3 Data in Standard Format Metadata in a Standard vocabularyRegulatory Guidance Methods Recipes SOPs … Vendor-Specific Formats Process Material Equipment Result
  • 4. ©2016 Allotrope Foundation Allotrope Foundation: Driving the Change 4 • Subject Matter Experts • Project Funding Member Companies • Project Management • Legal & Logistical Support Secretariat • Framework Development • Technical Leadership Professional Software Firm • Requirements & Specifications • Contributions, PoC Applications Partner Network AbbVie Amgen Baxter Bayer Biogen Boehringer Ingelheim Bristol-Myers Squibb Eli Lilly Genentech/Roche GlaxoSmithKline Merck & Co. Pfizer ACD/Labs Agilent Biovia Bruker BSSN IDBS LabAnswer LabVantage LEAP Technologies Mestrelab Research Mettler Toledo PerkinElmer Persistent Systems Riffyn Sartorius Shimadzu Tetra Science Thermo Scientific Waters Erasmus Univ. Med Center J. Paul Getty Trust (UK) Science and Technology Facilities Council University of Southampton University of Strathclyde
  • 5. ©2016 Allotrope Foundation Allotrope Data Format (ADF) 5 Data Description RDF Model Data Cubes Universal data container Data Package Virtual file system * Contains: • Method, instrument, sample, process, result, etc. • Data cube metadata • Data package metadata • … Analytical data represented by one- or multidimensional arrays. HDF5 Platform Independent File Format Allotrope Data Format Analytical data represented by arbitrary formats, incl. native instrument formats, images, pdf, video, etc. Specifically designed to store and organize large amounts of numerical data. APIs(Java&.NETclasslibraries) v1.0 ADF, Taxonomies, Class Libraries released Sept 2015, v1.1 April 2016
  • 6. ©2016 Allotrope Foundation Moving from Data Format to Semantics • Has its origins in philosophy - generally understood as the abstract study of meaning • Distinguished from syntax – which is the rules-based grammar of a language 6 “Washington”
  • 7. ©2016 Allotrope Foundation Allotrope Foundation Taxonomies (AFT) 7
  • 8. ©2016 Allotrope Foundation Result Process Equipment 8©2016 Allotrope Foundation Allotrope Taxonomies Standardize our Metadata
  • 9. ©2016 Allotrope Foundation Utilizing the Semantic Spectrum (Moving Beyond Taxonomies) 9 Code (Lists) Terms (Soil, Plant, etc.) Controlled Vocabulary (Agreed Upon Terms) Taxonomy (Hierarchy) Thesaurus (Preferred Labels, Synonyms, etc.) RDF Models (Triples as Graphs) OWL Ontologies (RDF + Axioms) Reasoning (Rule-based Logics: Discover New Patterns) Ontologies and Reasoning add Axioms and Advanced Logic
  • 10. ©2016 Allotrope Foundation Understanding the 4V’s of Big Data 10 Normally the focus – Big Data Analysis is more than just size Performance is Critical to Success Data complexity is increasing – Model complexity Uncertainty abounds – requires statistics and probabilities Majority of Big Data analytics approaches treat these two V’s Semantic technologies provide clear advantages Mathematical Clustering Techniques provide clear advantages
  • 11. ©2016 Allotrope Foundation Why Semantics Matters for Data Analytics 11 Big Data approaches require proper metadata and terminologies to integrate information well Relationships matter in the data Understanding perspective (context) is crucial for success in today’s world Semantics provides better data models/schemas
  • 12. ©2016 Allotrope Foundation The Foundation for Real Data Analytics on the Laboratory Workflow and Data 12 Plan Analysis Prepare Samples Submit Samples Control Inst. Acquire Data Process Data Analyze Data Reports Results Store, Archive Data Request Report Search & Reuse Data Sample Prep Data Instrument Instructions Instrument Data Processed Data Analyzed Data Reported Results Stored DataAnalytical Method Data Description RDF Model Data Cubes Universal data container Data Package Virtual file system Allotrope Data Format
  • 13. ©2016 Allotrope Foundation How is the Framework Being Used? Implementation by Member Companies 13 DevelopmentResearch Commercial Member non-GMP GMPInstrument BMS Bayer Baxter Merck & Co. Amgen Boehringer- Ingelheim GSK Drug Substance Release & Stability Structure ID, Purification, In vitro bioanalysis Method Screening HPLC- UV/MS HPLC-UV Balance HPLC- UV/MS Structure IDHPLC-MS Fermentation Process Control Bioanalyzer Small and Large Molecule CMC Genentech Elemental Impurities Assay, PurityHPLC-UV Biogen CRO IntegrationHPLC-UV Pfizer LC Data to ADF Converter/AdapterHPLC-UV ICP-MS pH, Weighing, GC, Karl Fischer, TGA, NMR , Cell Density/Viability, Blood Gas Analyzer, Cell Culture Analyzer, Capillary Electrophoresis…
  • 14. ©2016 Allotrope Foundation How is the Framework Being Used? Implementations by Member Companies 14 DevelopmentResearch Commercial Member non-GMP GMPInstrument Member 6 Member 3 Member 2 Member 9 Member 1 Member 5 Member 8 Drug Substance Release & Stability Structure ID, Purification, In vitro bioanalysis Method ScreeningHPLC-UV/MS HPLC-UV Balance HPLC-UV/MS Structure IDHPLC-MS Fermentation Process Control Bioanalyzer Small and Large Molecule CMC Multiple types Member 7 Elemental ImpuritiesICP-MS Assay, PurityHPLC-UV pH, Weighing, GC, Karl Fischer, TGA, NMR , Cell Density/Viability, Blood Gas Analyzer , Cell Culture Analyzer, Capillary Electrophoresis … Member 4 CRO IntegrationHPLC-UV Member 10 LC Data to ADF Converter/AdapterHPLC-UV DevelopmentResearch Commercial Member 6 Member 9 Member 8 Drug Substance Release & Stability Structure ID, Purification, In vitro bioanalysis Method ScreeningHPLC-UV/MS HPLC-UV Balance HPLC-UV/MS Member 10 LC Data to ADF Converter/AdapterHPLC-UV Member 1 Small and Large Molecule CMC Multiple types taxonomies methods repository data repository adapter instrument adapter pH, Weighing, GC, Karl Fischer, TGA, NMR , Cell Density/Viability, Blood Gas Analyzer, Cell Culture Analyzer, Capillary Electrophoresis…
  • 15. ©2016 Allotrope Foundation Smart Labs for the 21st Century Smart labs in the future will provide the enterprise with: • Integrated Data – common reference data structures (vocabularies) • Sharable Data – easier interaction across teams and business units • Scalability – Big data applications that can be highly elastic • Conceptual Representations – context and perspective are captured • Advanced Analytics – complex & automated problem-solving capabilities
  • 16. ©2016 Allotrope Foundation Thank you! • Any questions, please contact the Secretariat at more.info@allotrope.org or james.vergis@dbr.com • 2016 Workshops – January 20, 2016: San Francisco, CA @ Genentech – June 2016: Ingelheim, Germany @ Boehringer Ingelheim – September 2016: Indianapolis, ID @ Eli Lilly and Co. http://www.allotrope.org for more information and to register 16