SlideShare a Scribd company logo
1 of 23
Download to read offline
V.2.2
Eric Little, PhD
Chief Data Officer
OSTHUS
eric.little@osthus.com
Data Lifecycle Management
Across The Enterprise
Slide 2
Pharma invests in R&D and has to
make $ back over subsequent years
 Most R&D will fail, so risk is high
Law of Diminishing Returns
 R&D productivity is declining
 Harder treatments have greater costs,
potentially lower returns
 Drugs with minimal improvements
(not as many blockbusters + generics)
The Pharma Industry Is At A Tipping Point
From: Kelvin Stott - https://endpts.com/pharmas-broken-business-model-
an-industry-on-the-brink-of-terminal-decline/
Slide 3
Reduce R&D costs through better use of data
 Many experiments are re-run because scientists cannot find existing data
 Costs of system integration is much higher than data integration
 Standardization upstream can significantly impact costs downstream
Once data is available – automate as much as possible
Connect your internal data to other external data sources
 Many items exist in open source that can be modified easier than built from scratch
How To Help Remedy the Situation
Use the data you have before you generate more!
Start with reoccurring tasks – workflows, models,
query patterns, analytics, etc., then build out!
Don’t reinvent the wheel! Build data communities!
Slide 4
THE MOVE FROM BIG DATA TO
BIG ANALYSIS
STATISTICAL
SEMANTICS
MACHINE
LEARNING
REASONING
Slide 5
Moving to Smart Data
Smart data can be added to existing systems
 Does not require replacement of existing tech
Smart data provides a separation of:
 Model Layer
 Data Layer
Link to the model layer
 Leave data in place
 Smart data links information from the models to instance-level data
Smart Data uses metadata in order to capture context about data
Slide 6
Semantic Spectrum of Knowledge Organization Systems
• Deborah L. McGuinness. "Ontologies Come of Age". In Dieter Fensel, Jim Hendler, Henry Lieberman, and Wolfgang Wahlster, editors. Spinning the Semantic Web: Bringing the World Wide Web to Its Full Potential. MIT Press, 2003.
• Michael Uschold and Michael Gruninger “Ontologies and semantics for seamless connectivity” SIGMOD Rec. 33, 4 (December 2004), 58-64. DOI=http://dx.doi.org/10.1145/1041410.1041420
• Leo Obrst “The Ontology Spectrum”. Book section in of Roberto Poli, Michael Healy, Achilles Kameas “Theory and Applications of Ontology: Computer Applications”. Springer Netherlands, 17 Sep 2010.
• Leo Obrst and Mills Davis "Semantic Wave 2008 Report: Industry Roadmap to Web 3.0 & Multibillion Dollar Market Opportunities”. 2008.
Sources
Slide 7
Advantages of Using This Tech
Use cases where customers report distinct improvement:
 Better defined terms
• Differentiates between Entities and Labels – more specific data dictionary
 Better taxonomic structure
• Hierarchies can be accurately captured – not buried in incorrect tables
 Query Federation
• Can easily use multiple data sources (integration)
 Query Faceting
• Query results can be easily refined (and shared)
 Better use of metadata
• Provides context for users
• Raw data is more valuable over time
 Makes data actionable across an enterprise
• Moves from local data (on people’s machines, in their heads) to explicit sharable resources
• Adding SMART DATA to BIG DATA provides the means to access and use the data
• Requires combining logical data with statistical data in order to find patterns of
interest inside of large data sets
Slide 8
A Semantic Framework can connect the entire enterprise using a common semantics
The Semantic Hub should only focus on metadata (not instance level data)
Benefits: Common Terms, Models, Queries, Rules and Results (End-to-End)
Integrating Data Across the Enterprise
Lab Instruments Clinical Trials Regulatory AffairsProduction eArchiving
Slide 9
Lab Instrument Use Case –
Allotrope Framework
HPLC – UV
Mobile Phase Selection
Slide 10
Ontology for HPLC Example (Allotrope)
resultdevice
material
process
Slide 11
Clinical Trials Use Case –
Astra Zeneca & MedImmune
Slide 12
Connecting The Dots Across AstraZeneca & MedImmune
For Clinical Trials
Slide 13
FAIR Principles Bring Together Clinical Trials Data Across Phases
Slide 14
Domain Knowledge Is Captured In Models
Slide 15
Production Use Case –
Manufacturing Data Integration
Slide 16
Often times R&D and manufacturing cannot easily share data
Competing systems can evolve which cause incompatibilities
Manufacturing data is often lower less complex than R&D data, but significantly
higher in throughput
 QA/QC plays a major role
 Far more interpretation in R&D
 Manufacturing needs results fast
• Alarms
• Trends
 Manufacturing data is less retrospective
Manufacturing Data Vs. R&D Data
Slide 17
Regulatory Use Case –
Unstructured Data Integration
Slide 18
Regulatory compliance requires accessing and mining unstructured data
Linking unstructured data to other data provides significant advantages
 Text to DB links unstructured and structured data
 Text to Public Data Sources leverages open source research
Regulatory Compliance
Regulatory Documentation
Slide 19
E-Archiving: Managing Data
Over Long Lifecycles
Slide 20
Data is made available for easier search and indexing (even after long periods of time)
Archiving is no longer a “vault” concept but is integrated within the Data Mgt. Lifecycle
E-Archiving Using the Allotrope Data Framework
Slide 21
Big Analysis Requires Hybrid Architectures
Semantic DBs
Unstructured Docs
Structured Data
Cloud DBs (NoSQL)Analytics
Dashboards & Reports
Integration Layer
Slide 22
Data Science (machine learning, text analytics, clustering etc.)
FAIR Data Is Now Accessible For Advanced Analytics
Linked Open Data
& Open APIs
Semantic
Graph DB
(Knowledge Graph)
Operational DBs
…
Unstructured
Documents
Analytics Tools
simulations
statistics
reasoning
Visualization
dashboards
exploration
search
…
Semi-structured
Data
Instrument
Data
Lightweight Semantic Integration Layer
(semantic RMDM, APIs, semantic indexing, data annotation, catalogues, meta data and linking)
Reporting
regulatory
internal
external
Slide 23
CONNECTING DATA, PEOPLE AND ORGANIZATIONS
Contact Information:
Email: eric.little@osthus.com
Web: www.osthus.com
www.biganalysis.com
Twitter: OntoEric

More Related Content

What's hot

Faster R & D Analysis Tool - TRG
Faster R & D Analysis Tool - TRG Faster R & D Analysis Tool - TRG
Faster R & D Analysis Tool - TRG TRG
 
Big Data & ML for Clinical Data
Big Data & ML for Clinical DataBig Data & ML for Clinical Data
Big Data & ML for Clinical DataPaul Agapow
 
5th Forum on Laboratory Informatics
5th Forum on Laboratory Informatics5th Forum on Laboratory Informatics
5th Forum on Laboratory InformaticsAbby Lombardi
 
Data Science: An Emerging Field for Future Jobs
Data Science: An Emerging Field for Future JobsData Science: An Emerging Field for Future Jobs
Data Science: An Emerging Field for Future JobsJian Qin
 
To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...
To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...
To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...Cognizant
 
Heartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirtiHeartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirtiPistoia Alliance
 
Acceliant white paper_edc_and_epro
Acceliant white paper_edc_and_eproAcceliant white paper_edc_and_epro
Acceliant white paper_edc_and_eproTrianz
 
Datascienceindia article
Datascienceindia articleDatascienceindia article
Datascienceindia articleHimanshuPise1
 
Understand the Demand of Analyst Opportunity in U.S
Understand the Demand of Analyst Opportunity in U.SUnderstand the Demand of Analyst Opportunity in U.S
Understand the Demand of Analyst Opportunity in U.SJiaming Zhang
 
Data Science
Data ScienceData Science
Data ScienceRabin BK
 
Data science lecture1_doaa_mohey
Data science lecture1_doaa_moheyData science lecture1_doaa_mohey
Data science lecture1_doaa_moheyDoaa Mohey Eldin
 
Pistoia Alliance Debates: PhUSE Framework for the Adoption of Cloud Technolog...
Pistoia Alliance Debates: PhUSE Framework for the Adoption of Cloud Technolog...Pistoia Alliance Debates: PhUSE Framework for the Adoption of Cloud Technolog...
Pistoia Alliance Debates: PhUSE Framework for the Adoption of Cloud Technolog...Pistoia Alliance
 
Hybrid Fuzzy Approches for Networks
Hybrid Fuzzy Approches for NetworksHybrid Fuzzy Approches for Networks
Hybrid Fuzzy Approches for Networksijcoa
 
Finding common ground: integrating the eagle-i and VIVO ontologies
Finding common ground: integrating the eagle-i and VIVO ontologiesFinding common ground: integrating the eagle-i and VIVO ontologies
Finding common ground: integrating the eagle-i and VIVO ontologiesmhaendel
 
Sowmya Raghavan Strand Life
Sowmya Raghavan Strand LifeSowmya Raghavan Strand Life
Sowmya Raghavan Strand LifeEmTech
 
Nvidia why every industry should be thinking about AI today
Nvidia why every industry should be thinking about AI todayNvidia why every industry should be thinking about AI today
Nvidia why every industry should be thinking about AI todayJustin Hayward
 
Data Science Lecture: Overview and Information Collateral
Data Science Lecture: Overview and Information CollateralData Science Lecture: Overview and Information Collateral
Data Science Lecture: Overview and Information CollateralFrank Kienle
 

What's hot (20)

Faster R & D Analysis Tool - TRG
Faster R & D Analysis Tool - TRG Faster R & D Analysis Tool - TRG
Faster R & D Analysis Tool - TRG
 
Big Data & ML for Clinical Data
Big Data & ML for Clinical DataBig Data & ML for Clinical Data
Big Data & ML for Clinical Data
 
5th Forum on Laboratory Informatics
5th Forum on Laboratory Informatics5th Forum on Laboratory Informatics
5th Forum on Laboratory Informatics
 
Data Science: An Emerging Field for Future Jobs
Data Science: An Emerging Field for Future JobsData Science: An Emerging Field for Future Jobs
Data Science: An Emerging Field for Future Jobs
 
To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...
To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...
To Be Digital, Pharma Labs Must Bridge the Gap Between Legacy Systems & Conne...
 
Heartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirtiHeartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirti
 
Acceliant white paper_edc_and_epro
Acceliant white paper_edc_and_eproAcceliant white paper_edc_and_epro
Acceliant white paper_edc_and_epro
 
Datascienceindia article
Datascienceindia articleDatascienceindia article
Datascienceindia article
 
Linked data in pharma
Linked data in pharmaLinked data in pharma
Linked data in pharma
 
Understand the Demand of Analyst Opportunity in U.S
Understand the Demand of Analyst Opportunity in U.SUnderstand the Demand of Analyst Opportunity in U.S
Understand the Demand of Analyst Opportunity in U.S
 
Data Science
Data ScienceData Science
Data Science
 
Data science lecture1_doaa_mohey
Data science lecture1_doaa_moheyData science lecture1_doaa_mohey
Data science lecture1_doaa_mohey
 
SciBite
SciBiteSciBite
SciBite
 
Removing the information bottleneck in R&D
Removing the information bottleneck in R&DRemoving the information bottleneck in R&D
Removing the information bottleneck in R&D
 
Pistoia Alliance Debates: PhUSE Framework for the Adoption of Cloud Technolog...
Pistoia Alliance Debates: PhUSE Framework for the Adoption of Cloud Technolog...Pistoia Alliance Debates: PhUSE Framework for the Adoption of Cloud Technolog...
Pistoia Alliance Debates: PhUSE Framework for the Adoption of Cloud Technolog...
 
Hybrid Fuzzy Approches for Networks
Hybrid Fuzzy Approches for NetworksHybrid Fuzzy Approches for Networks
Hybrid Fuzzy Approches for Networks
 
Finding common ground: integrating the eagle-i and VIVO ontologies
Finding common ground: integrating the eagle-i and VIVO ontologiesFinding common ground: integrating the eagle-i and VIVO ontologies
Finding common ground: integrating the eagle-i and VIVO ontologies
 
Sowmya Raghavan Strand Life
Sowmya Raghavan Strand LifeSowmya Raghavan Strand Life
Sowmya Raghavan Strand Life
 
Nvidia why every industry should be thinking about AI today
Nvidia why every industry should be thinking about AI todayNvidia why every industry should be thinking about AI today
Nvidia why every industry should be thinking about AI today
 
Data Science Lecture: Overview and Information Collateral
Data Science Lecture: Overview and Information CollateralData Science Lecture: Overview and Information Collateral
Data Science Lecture: Overview and Information Collateral
 

Similar to Data lifecycle mgt across the enterprise

Reinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & FasterReinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & FasterOSTHUS
 
Licensing Linked Data
Licensing Linked DataLicensing Linked Data
Licensing Linked Datapellegrinit
 
CLOUD COMPUTING AND BYOD: BENEFITS AND CHALLENGES IN MODERN HEALTHCARE
CLOUD COMPUTING AND BYOD: BENEFITS AND CHALLENGES IN MODERN HEALTHCARECLOUD COMPUTING AND BYOD: BENEFITS AND CHALLENGES IN MODERN HEALTHCARE
CLOUD COMPUTING AND BYOD: BENEFITS AND CHALLENGES IN MODERN HEALTHCAREUsmanYakubuMaaruf
 
The FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdfThe FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdfAlan Morrison
 
Laboratory Integration John Trigg
Laboratory Integration  John TriggLaboratory Integration  John Trigg
Laboratory Integration John TriggJohn Trigg
 
Ontology Tutorial: Semantic Technology for Intelligence, Defense and Security
Ontology Tutorial: Semantic Technology for Intelligence, Defense and SecurityOntology Tutorial: Semantic Technology for Intelligence, Defense and Security
Ontology Tutorial: Semantic Technology for Intelligence, Defense and SecurityBarry Smith
 
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The HyveOpen Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The HyveKees van Bochove
 
Linked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for EntrepreneursLinked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for Entrepreneurs3 Round Stones
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)James Hendler
 
Activate Your Data Lakehouse with an Enterprise Knowledge Graph
Activate Your Data Lakehouse with an Enterprise Knowledge GraphActivate Your Data Lakehouse with an Enterprise Knowledge Graph
Activate Your Data Lakehouse with an Enterprise Knowledge GraphDATAVERSITY
 
Collaboration - theory & Practice
Collaboration - theory & PracticeCollaboration - theory & Practice
Collaboration - theory & PracticeSean Ekins
 
Data accessibility and the role of informatics in predicting the biosphere
Data accessibility and the role of informatics in predicting the biosphereData accessibility and the role of informatics in predicting the biosphere
Data accessibility and the role of informatics in predicting the biosphereAlex Hardisty
 
Considerations and challenges in building an end to-end microbiome workflow
Considerations and challenges in building an end to-end microbiome workflowConsiderations and challenges in building an end to-end microbiome workflow
Considerations and challenges in building an end to-end microbiome workflowEagle Genomics
 
The Science of Data Science
The Science of Data Science The Science of Data Science
The Science of Data Science James Hendler
 
Challenges and outlook with Big Data
Challenges and outlook with Big Data Challenges and outlook with Big Data
Challenges and outlook with Big Data IJCERT JOURNAL
 
IoT 2014 Value Creation Workshop: SDIL
IoT 2014 Value Creation Workshop: SDILIoT 2014 Value Creation Workshop: SDIL
IoT 2014 Value Creation Workshop: SDILTill Riedel
 
A Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
A Framework for Geospatial Web Services for Public Health by Dr. Leslie LenertA Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
A Framework for Geospatial Web Services for Public Health by Dr. Leslie LenertWansoo Im
 
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Denodo
 
BigDataAnalytics_Talk_KOCH_FINAL
BigDataAnalytics_Talk_KOCH_FINALBigDataAnalytics_Talk_KOCH_FINAL
BigDataAnalytics_Talk_KOCH_FINALJohn Koch
 

Similar to Data lifecycle mgt across the enterprise (20)

Reinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & FasterReinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & Faster
 
Licensing Linked Data
Licensing Linked DataLicensing Linked Data
Licensing Linked Data
 
Thesis Defense MBI
Thesis Defense MBIThesis Defense MBI
Thesis Defense MBI
 
CLOUD COMPUTING AND BYOD: BENEFITS AND CHALLENGES IN MODERN HEALTHCARE
CLOUD COMPUTING AND BYOD: BENEFITS AND CHALLENGES IN MODERN HEALTHCARECLOUD COMPUTING AND BYOD: BENEFITS AND CHALLENGES IN MODERN HEALTHCARE
CLOUD COMPUTING AND BYOD: BENEFITS AND CHALLENGES IN MODERN HEALTHCARE
 
The FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdfThe FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdf
 
Laboratory Integration John Trigg
Laboratory Integration  John TriggLaboratory Integration  John Trigg
Laboratory Integration John Trigg
 
Ontology Tutorial: Semantic Technology for Intelligence, Defense and Security
Ontology Tutorial: Semantic Technology for Intelligence, Defense and SecurityOntology Tutorial: Semantic Technology for Intelligence, Defense and Security
Ontology Tutorial: Semantic Technology for Intelligence, Defense and Security
 
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The HyveOpen Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
 
Linked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for EntrepreneursLinked Data: Opportunities for Entrepreneurs
Linked Data: Opportunities for Entrepreneurs
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)
 
Activate Your Data Lakehouse with an Enterprise Knowledge Graph
Activate Your Data Lakehouse with an Enterprise Knowledge GraphActivate Your Data Lakehouse with an Enterprise Knowledge Graph
Activate Your Data Lakehouse with an Enterprise Knowledge Graph
 
Collaboration - theory & Practice
Collaboration - theory & PracticeCollaboration - theory & Practice
Collaboration - theory & Practice
 
Data accessibility and the role of informatics in predicting the biosphere
Data accessibility and the role of informatics in predicting the biosphereData accessibility and the role of informatics in predicting the biosphere
Data accessibility and the role of informatics in predicting the biosphere
 
Considerations and challenges in building an end to-end microbiome workflow
Considerations and challenges in building an end to-end microbiome workflowConsiderations and challenges in building an end to-end microbiome workflow
Considerations and challenges in building an end to-end microbiome workflow
 
The Science of Data Science
The Science of Data Science The Science of Data Science
The Science of Data Science
 
Challenges and outlook with Big Data
Challenges and outlook with Big Data Challenges and outlook with Big Data
Challenges and outlook with Big Data
 
IoT 2014 Value Creation Workshop: SDIL
IoT 2014 Value Creation Workshop: SDILIoT 2014 Value Creation Workshop: SDIL
IoT 2014 Value Creation Workshop: SDIL
 
A Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
A Framework for Geospatial Web Services for Public Health by Dr. Leslie LenertA Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
A Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
 
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
Square Pegs In Round Holes: Rethinking Data Availability in the Age of Automa...
 
BigDataAnalytics_Talk_KOCH_FINAL
BigDataAnalytics_Talk_KOCH_FINALBigDataAnalytics_Talk_KOCH_FINAL
BigDataAnalytics_Talk_KOCH_FINAL
 

More from OSTHUS

The Fast Track to Fair Lab Data
The Fast Track to Fair Lab Data The Fast Track to Fair Lab Data
The Fast Track to Fair Lab Data OSTHUS
 
Early AI Adoption Via Advanced Analytics
Early AI Adoption Via  Advanced AnalyticsEarly AI Adoption Via  Advanced Analytics
Early AI Adoption Via Advanced AnalyticsOSTHUS
 
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...OSTHUS
 
Why paperless lab is just the first step towards a smart lab
Why paperless lab is just the first step towards a smart labWhy paperless lab is just the first step towards a smart lab
Why paperless lab is just the first step towards a smart labOSTHUS
 
Allotrope foundation vanderwall_and_little_bio_it_world_2016
Allotrope foundation vanderwall_and_little_bio_it_world_2016Allotrope foundation vanderwall_and_little_bio_it_world_2016
Allotrope foundation vanderwall_and_little_bio_it_world_2016OSTHUS
 
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...OSTHUS
 
Semantics for integrated laboratory analytical processes - The Allotrope Pers...
Semantics for integrated laboratory analytical processes - The Allotrope Pers...Semantics for integrated laboratory analytical processes - The Allotrope Pers...
Semantics for integrated laboratory analytical processes - The Allotrope Pers...OSTHUS
 
Best Practice Reference Architecture for Data Curation
Best Practice Reference Architecture for Data CurationBest Practice Reference Architecture for Data Curation
Best Practice Reference Architecture for Data CurationOSTHUS
 
Allotrope Foundation & OSTHUS at SmartLab Exchange 2015: Update on the Allotr...
Allotrope Foundation & OSTHUS at SmartLab Exchange 2015: Update on the Allotr...Allotrope Foundation & OSTHUS at SmartLab Exchange 2015: Update on the Allotr...
Allotrope Foundation & OSTHUS at SmartLab Exchange 2015: Update on the Allotr...OSTHUS
 
OSTHUS-Allotrope presents "Laboratory Informatics Strategy" at SmartLab 2015
OSTHUS-Allotrope presents "Laboratory Informatics Strategy" at SmartLab 2015OSTHUS-Allotrope presents "Laboratory Informatics Strategy" at SmartLab 2015
OSTHUS-Allotrope presents "Laboratory Informatics Strategy" at SmartLab 2015OSTHUS
 
Data Quality- How to clean up your legacy data
Data Quality- How to clean up your legacy dataData Quality- How to clean up your legacy data
Data Quality- How to clean up your legacy dataOSTHUS
 
Data Quality- How to clean up your legacy data?
Data Quality- How to clean up your legacy data?Data Quality- How to clean up your legacy data?
Data Quality- How to clean up your legacy data?OSTHUS
 

More from OSTHUS (12)

The Fast Track to Fair Lab Data
The Fast Track to Fair Lab Data The Fast Track to Fair Lab Data
The Fast Track to Fair Lab Data
 
Early AI Adoption Via Advanced Analytics
Early AI Adoption Via  Advanced AnalyticsEarly AI Adoption Via  Advanced Analytics
Early AI Adoption Via Advanced Analytics
 
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
 
Why paperless lab is just the first step towards a smart lab
Why paperless lab is just the first step towards a smart labWhy paperless lab is just the first step towards a smart lab
Why paperless lab is just the first step towards a smart lab
 
Allotrope foundation vanderwall_and_little_bio_it_world_2016
Allotrope foundation vanderwall_and_little_bio_it_world_2016Allotrope foundation vanderwall_and_little_bio_it_world_2016
Allotrope foundation vanderwall_and_little_bio_it_world_2016
 
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...
 
Semantics for integrated laboratory analytical processes - The Allotrope Pers...
Semantics for integrated laboratory analytical processes - The Allotrope Pers...Semantics for integrated laboratory analytical processes - The Allotrope Pers...
Semantics for integrated laboratory analytical processes - The Allotrope Pers...
 
Best Practice Reference Architecture for Data Curation
Best Practice Reference Architecture for Data CurationBest Practice Reference Architecture for Data Curation
Best Practice Reference Architecture for Data Curation
 
Allotrope Foundation & OSTHUS at SmartLab Exchange 2015: Update on the Allotr...
Allotrope Foundation & OSTHUS at SmartLab Exchange 2015: Update on the Allotr...Allotrope Foundation & OSTHUS at SmartLab Exchange 2015: Update on the Allotr...
Allotrope Foundation & OSTHUS at SmartLab Exchange 2015: Update on the Allotr...
 
OSTHUS-Allotrope presents "Laboratory Informatics Strategy" at SmartLab 2015
OSTHUS-Allotrope presents "Laboratory Informatics Strategy" at SmartLab 2015OSTHUS-Allotrope presents "Laboratory Informatics Strategy" at SmartLab 2015
OSTHUS-Allotrope presents "Laboratory Informatics Strategy" at SmartLab 2015
 
Data Quality- How to clean up your legacy data
Data Quality- How to clean up your legacy dataData Quality- How to clean up your legacy data
Data Quality- How to clean up your legacy data
 
Data Quality- How to clean up your legacy data?
Data Quality- How to clean up your legacy data?Data Quality- How to clean up your legacy data?
Data Quality- How to clean up your legacy data?
 

Recently uploaded

Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...aditisharan08
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
Introduction to Decentralized Applications (dApps)
Introduction to Decentralized Applications (dApps)Introduction to Decentralized Applications (dApps)
Introduction to Decentralized Applications (dApps)Intelisync
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 

Recently uploaded (20)

Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
Introduction to Decentralized Applications (dApps)
Introduction to Decentralized Applications (dApps)Introduction to Decentralized Applications (dApps)
Introduction to Decentralized Applications (dApps)
 
Exploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the ProcessExploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the Process
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 

Data lifecycle mgt across the enterprise

  • 1. V.2.2 Eric Little, PhD Chief Data Officer OSTHUS eric.little@osthus.com Data Lifecycle Management Across The Enterprise
  • 2. Slide 2 Pharma invests in R&D and has to make $ back over subsequent years  Most R&D will fail, so risk is high Law of Diminishing Returns  R&D productivity is declining  Harder treatments have greater costs, potentially lower returns  Drugs with minimal improvements (not as many blockbusters + generics) The Pharma Industry Is At A Tipping Point From: Kelvin Stott - https://endpts.com/pharmas-broken-business-model- an-industry-on-the-brink-of-terminal-decline/
  • 3. Slide 3 Reduce R&D costs through better use of data  Many experiments are re-run because scientists cannot find existing data  Costs of system integration is much higher than data integration  Standardization upstream can significantly impact costs downstream Once data is available – automate as much as possible Connect your internal data to other external data sources  Many items exist in open source that can be modified easier than built from scratch How To Help Remedy the Situation Use the data you have before you generate more! Start with reoccurring tasks – workflows, models, query patterns, analytics, etc., then build out! Don’t reinvent the wheel! Build data communities!
  • 4. Slide 4 THE MOVE FROM BIG DATA TO BIG ANALYSIS STATISTICAL SEMANTICS MACHINE LEARNING REASONING
  • 5. Slide 5 Moving to Smart Data Smart data can be added to existing systems  Does not require replacement of existing tech Smart data provides a separation of:  Model Layer  Data Layer Link to the model layer  Leave data in place  Smart data links information from the models to instance-level data Smart Data uses metadata in order to capture context about data
  • 6. Slide 6 Semantic Spectrum of Knowledge Organization Systems • Deborah L. McGuinness. "Ontologies Come of Age". In Dieter Fensel, Jim Hendler, Henry Lieberman, and Wolfgang Wahlster, editors. Spinning the Semantic Web: Bringing the World Wide Web to Its Full Potential. MIT Press, 2003. • Michael Uschold and Michael Gruninger “Ontologies and semantics for seamless connectivity” SIGMOD Rec. 33, 4 (December 2004), 58-64. DOI=http://dx.doi.org/10.1145/1041410.1041420 • Leo Obrst “The Ontology Spectrum”. Book section in of Roberto Poli, Michael Healy, Achilles Kameas “Theory and Applications of Ontology: Computer Applications”. Springer Netherlands, 17 Sep 2010. • Leo Obrst and Mills Davis "Semantic Wave 2008 Report: Industry Roadmap to Web 3.0 & Multibillion Dollar Market Opportunities”. 2008. Sources
  • 7. Slide 7 Advantages of Using This Tech Use cases where customers report distinct improvement:  Better defined terms • Differentiates between Entities and Labels – more specific data dictionary  Better taxonomic structure • Hierarchies can be accurately captured – not buried in incorrect tables  Query Federation • Can easily use multiple data sources (integration)  Query Faceting • Query results can be easily refined (and shared)  Better use of metadata • Provides context for users • Raw data is more valuable over time  Makes data actionable across an enterprise • Moves from local data (on people’s machines, in their heads) to explicit sharable resources • Adding SMART DATA to BIG DATA provides the means to access and use the data • Requires combining logical data with statistical data in order to find patterns of interest inside of large data sets
  • 8. Slide 8 A Semantic Framework can connect the entire enterprise using a common semantics The Semantic Hub should only focus on metadata (not instance level data) Benefits: Common Terms, Models, Queries, Rules and Results (End-to-End) Integrating Data Across the Enterprise Lab Instruments Clinical Trials Regulatory AffairsProduction eArchiving
  • 9. Slide 9 Lab Instrument Use Case – Allotrope Framework HPLC – UV Mobile Phase Selection
  • 10. Slide 10 Ontology for HPLC Example (Allotrope) resultdevice material process
  • 11. Slide 11 Clinical Trials Use Case – Astra Zeneca & MedImmune
  • 12. Slide 12 Connecting The Dots Across AstraZeneca & MedImmune For Clinical Trials
  • 13. Slide 13 FAIR Principles Bring Together Clinical Trials Data Across Phases
  • 14. Slide 14 Domain Knowledge Is Captured In Models
  • 15. Slide 15 Production Use Case – Manufacturing Data Integration
  • 16. Slide 16 Often times R&D and manufacturing cannot easily share data Competing systems can evolve which cause incompatibilities Manufacturing data is often lower less complex than R&D data, but significantly higher in throughput  QA/QC plays a major role  Far more interpretation in R&D  Manufacturing needs results fast • Alarms • Trends  Manufacturing data is less retrospective Manufacturing Data Vs. R&D Data
  • 17. Slide 17 Regulatory Use Case – Unstructured Data Integration
  • 18. Slide 18 Regulatory compliance requires accessing and mining unstructured data Linking unstructured data to other data provides significant advantages  Text to DB links unstructured and structured data  Text to Public Data Sources leverages open source research Regulatory Compliance Regulatory Documentation
  • 19. Slide 19 E-Archiving: Managing Data Over Long Lifecycles
  • 20. Slide 20 Data is made available for easier search and indexing (even after long periods of time) Archiving is no longer a “vault” concept but is integrated within the Data Mgt. Lifecycle E-Archiving Using the Allotrope Data Framework
  • 21. Slide 21 Big Analysis Requires Hybrid Architectures Semantic DBs Unstructured Docs Structured Data Cloud DBs (NoSQL)Analytics Dashboards & Reports Integration Layer
  • 22. Slide 22 Data Science (machine learning, text analytics, clustering etc.) FAIR Data Is Now Accessible For Advanced Analytics Linked Open Data & Open APIs Semantic Graph DB (Knowledge Graph) Operational DBs … Unstructured Documents Analytics Tools simulations statistics reasoning Visualization dashboards exploration search … Semi-structured Data Instrument Data Lightweight Semantic Integration Layer (semantic RMDM, APIs, semantic indexing, data annotation, catalogues, meta data and linking) Reporting regulatory internal external
  • 23. Slide 23 CONNECTING DATA, PEOPLE AND ORGANIZATIONS Contact Information: Email: eric.little@osthus.com Web: www.osthus.com www.biganalysis.com Twitter: OntoEric