SlideShare a Scribd company logo
1 of 44
The Data Commons
Digital Ecosystems for Sharing and
Analyzing biomedical Big Data
Vivien Bonazzi, Ph.D.
Senior Advisor for Data Science
Office of Data Science (ADDS)
National Institutes of Health
Lets Talk About Biomedical Big Data
What Makes Big Data Big?
VOLUME
VELOCITY
VARIETY
VERACITY
It’s a signal of the coming Digital Economy
DATA has VALUE
DATA is CENTRAL to the Digital Economy
But its more than this…..
An economy characterized by
using data to gain a business
advantage
(yes, institutions are a business)
Organizations that are not born
digital will be at a disadvantage
in the new economy
Organizations will be defined by their digital assets
Scientific digital assets
Data
Software
Workflows
The most successful organizations of the future will
be those that can leverage their digital assets and
transform them into a digital enterprise
Make data
The currency of an organization
Usable in a digital ecosystems – Data Commons
The problem with biomedical data
Digital assets includes Data
Challenges Biomedical Data
The Journal Article is the end goal
Data is a means to an ends (low value)
Data is not FAIR
Findable, Accessible, Interoperable,
Reproducible
Limited e-infrastructures to
The Problem
With
Biomedical DATA
https://www.youtube.com/watch?v=N2zK3sAtr-4
What’s
Changing?
FAIR principles drive data to become the currency
Policies that promote data sharing via FAIR help
change the culture
We also need a digital ecosystem that allows
transactions to occur on FAIR data
at scale
The Data Commons
is a platform
that fosters the development of a digital ecosystem
The Data Commons platform that fosters development of a digital
ecosystem
Treats products of research – data, software, methods, papers etc as
digital asset (object)
Digital objects need to conform to FAIR principles
Digital objects exist in a shared virtual space
- Find, Deposit, Manage, Share and Reuse: digital assets
Enables interactions between Producers and Consumers of digital assets
Gives currency to digital assets and the people who develop and support
them
The Data Commons
is a platform?
that fosters the development of a digital ecosystem
“A platform is a plug and play model
that allows multiple participants (producers and
consumers) to connect to it, interact with each
other and create value”
Sangeet Paul Choudary – Platform Scale
A lot of what see today uses a platform approach ”
Sangeet Paul Choudary – Platform Scale
The goal of the a Data Commons Platform is to
enable interactions between producers and
consumers
Sangeet Paul Choudary – Platform Scale
To understand the
Data Commons Platform
(and how it works for biomedical data)
we need to use a Platform stack
to help visualize the concept
Sangeet Paul Choudary – Platform Scale
Platforms have 3 layers
NIH Data Commons - Platform Stack
https://datascience.nih.gov/commons
Technology
Technology
Data
Network/
market place
https://datascience.nih.gov/commons
NIH Data Commons - Platform Stack
Initial Phase
Unique digital object identifiers of resolvable to original authoritative source
Machine readable
A minimal set of searchable metadata
Clear access rules (especially important for human subjects data)
An entry (with metadata) in one or more indices
Future Phases
Standard, community based unique digital object identifiers
Conform to community approved standard metadata and ontologies for
enhanced searching
Digital objects accessible via open standard APIs
NIH Data Commons: Digital Asset Compliance
Making things FAIR
Data Commons Platform drives digital ecosystem
The NIH Data Commons Pilot
The NIH Data Commons Pilot
Co-location of large and/or highly
utilized NIH funded data with
storage and computing infrastructure +
Commonly used tools for analyzing and
sharing digital objects
to create an interoperable resource for
the research community.
Investigators will be able to collaborate
and share digital objects within this
environment and connect with others
NIH Nascent Commons Pilots
An NIH Wide Data Commons Pilot
Indexing
Indexing
Indexing
Authorization /authentication layer
Considerations
Metrics - understanding and accounting of data usage patterns
Cost - Cloud Storage, pay for use cloud compute (NIH credits)
Hybrid Clouds – Mix of research and commercial clouds
Connecting - Interoperability with other Commons, clouds
Consent - Reconsenting data, Dynamic consents
Standards – Metadata, UIDs, APIs
An Australian Commons Experiment?
A Garvan Data Commons Platform?
Garvan DATA
NCI + Cloud
Analysis tools (Inc 3rd party)
Apps Store
Community – Research, Clinical, Public
API connectivity
with other
Commons
An Australian Data Commons?
Australian DATA - Flora and Fauna
Commercial Cloud (NCI)
Analysis tools (Inc 3rd party)
Apps Store
Community – Research, Clinical, Public
API connectivity
with other
Commons
“To achieve great things, two things are
needed: a plan and not quite enough
time”
Leonard Bernstein
Thank you
• ADDS Office
- Phil Bourne, Michelle Dunn, Jennie Larkin, Mark Guyer, Sonynka Ngosso
• NCBI: Jim Ostell, David Lipman, George Komatsoulis
• NHGRI: Valentina di Francesco, Kevin Lee, Eric Green
• NIGMS: John Lorsch, Susan Gregurik
• CIT: Andrea Norris, Debbie Sinmao, Stacy Charland
• NCI: Warren Kibbe, Tony Kerlavage, Lou Staudt, Tanja Davidsen, Ian Fore
• NIAID: JJ McGowan, Nick Weber, Darrell Hurt, Maria Giovanni
• The NIH Common Fund: Betsy Wilder, Jim Anderson, Leslie Derr
• Trans NIH BD2K Executive Committee & Working groups
• Many biomedical researchers, cloud providers, IT professionals
• John Mattick and the Garvan Institute
Stay in Touch
QR Business Card
LinkedIn
@Vivien.Bonazzi
Slideshare
Blog
(Coming soon!)
Vivien Bonazzi
bonazziv@mail.nih.gov

More Related Content

What's hot

Big Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & InnovationBig Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & InnovationPhilip Bourne
 
NDS Relevant Update from the NIH Data Science (ADDS) Office
NDS Relevant Update from the NIH Data Science (ADDS) OfficeNDS Relevant Update from the NIH Data Science (ADDS) Office
NDS Relevant Update from the NIH Data Science (ADDS) OfficePhilip Bourne
 
Why Data Citation Currently Misses the Point
Why Data Citation Currently Misses the PointWhy Data Citation Currently Misses the Point
Why Data Citation Currently Misses the PointMark Parsons
 
Komatsoulis internet2 global forum 2015
Komatsoulis internet2 global forum 2015Komatsoulis internet2 global forum 2015
Komatsoulis internet2 global forum 2015George Komatsoulis
 
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)SEAD
 
SEAD slide set (October 2011)
SEAD slide set (October 2011)SEAD slide set (October 2011)
SEAD slide set (October 2011)SEAD
 
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeo
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeoLicence to Share: Research and Collaboration through Go-Geo! and ShareGeo
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeoEDINA, University of Edinburgh
 
Komatsoulis internet2 executive track
Komatsoulis internet2 executive trackKomatsoulis internet2 executive track
Komatsoulis internet2 executive trackGeorge Komatsoulis
 
ESIP Federation: Community-Driven, Collaborative Governance - Carol Beaton Me...
ESIP Federation: Community-Driven, Collaborative Governance - Carol Beaton Me...ESIP Federation: Community-Driven, Collaborative Governance - Carol Beaton Me...
ESIP Federation: Community-Driven, Collaborative Governance - Carol Beaton Me...ASIS&T
 
Infrastructure, relationships, trust, and RDA
Infrastructure, relationships, trust, and RDAInfrastructure, relationships, trust, and RDA
Infrastructure, relationships, trust, and RDAResearch Data Alliance
 
NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14SEAD
 
Towards the Digital Research Enterprise
Towards the Digital Research EnterpriseTowards the Digital Research Enterprise
Towards the Digital Research EnterprisePhilip Bourne
 
Digital Curation in Libraries: An innovative way of content preservation and...
Digital Curation in Libraries:  An innovative way of content preservation and...Digital Curation in Libraries:  An innovative way of content preservation and...
Digital Curation in Libraries: An innovative way of content preservation and...Bhojaraju Gunjal
 
Introduction to digital curation
Introduction to digital curationIntroduction to digital curation
Introduction to digital curationMichael Day
 
A Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
A Framework for Geospatial Web Services for Public Health by Dr. Leslie LenertA Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
A Framework for Geospatial Web Services for Public Health by Dr. Leslie LenertWansoo Im
 
ESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsSEAD
 

What's hot (20)

Big Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & InnovationBig Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & Innovation
 
NDS Relevant Update from the NIH Data Science (ADDS) Office
NDS Relevant Update from the NIH Data Science (ADDS) OfficeNDS Relevant Update from the NIH Data Science (ADDS) Office
NDS Relevant Update from the NIH Data Science (ADDS) Office
 
Why Data Citation Currently Misses the Point
Why Data Citation Currently Misses the PointWhy Data Citation Currently Misses the Point
Why Data Citation Currently Misses the Point
 
Komatsoulis internet2 global forum 2015
Komatsoulis internet2 global forum 2015Komatsoulis internet2 global forum 2015
Komatsoulis internet2 global forum 2015
 
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
 
Data Policy for Open Science
Data Policy for Open ScienceData Policy for Open Science
Data Policy for Open Science
 
Data 2.0|
Data 2.0|Data 2.0|
Data 2.0|
 
SEAD slide set (October 2011)
SEAD slide set (October 2011)SEAD slide set (October 2011)
SEAD slide set (October 2011)
 
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeo
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeoLicence to Share: Research and Collaboration through Go-Geo! and ShareGeo
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeo
 
Komatsoulis internet2 executive track
Komatsoulis internet2 executive trackKomatsoulis internet2 executive track
Komatsoulis internet2 executive track
 
ESIP Federation: Community-Driven, Collaborative Governance - Carol Beaton Me...
ESIP Federation: Community-Driven, Collaborative Governance - Carol Beaton Me...ESIP Federation: Community-Driven, Collaborative Governance - Carol Beaton Me...
ESIP Federation: Community-Driven, Collaborative Governance - Carol Beaton Me...
 
Infrastructure, relationships, trust, and RDA
Infrastructure, relationships, trust, and RDAInfrastructure, relationships, trust, and RDA
Infrastructure, relationships, trust, and RDA
 
NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14
 
Towards the Digital Research Enterprise
Towards the Digital Research EnterpriseTowards the Digital Research Enterprise
Towards the Digital Research Enterprise
 
Digital Curation in Libraries: An innovative way of content preservation and...
Digital Curation in Libraries:  An innovative way of content preservation and...Digital Curation in Libraries:  An innovative way of content preservation and...
Digital Curation in Libraries: An innovative way of content preservation and...
 
FAIR data
FAIR dataFAIR data
FAIR data
 
Introduction to digital curation
Introduction to digital curationIntroduction to digital curation
Introduction to digital curation
 
A Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
A Framework for Geospatial Web Services for Public Health by Dr. Leslie LenertA Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
A Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
 
ESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and Tools
 
Digital Curation Technology: JHU Summit, October 2015
Digital Curation Technology: JHU Summit, October 2015Digital Curation Technology: JHU Summit, October 2015
Digital Curation Technology: JHU Summit, October 2015
 

Similar to Data Commons Garvan - 2016

The Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataThe Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataPhilip Bourne
 
Innovation in Future Enterprise, by David Osimo
Innovation in Future Enterprise, by David OsimoInnovation in Future Enterprise, by David Osimo
Innovation in Future Enterprise, by David OsimoFutureEnterprise
 
Presentation at board DKV Seguros
Presentation at board DKV SegurosPresentation at board DKV Seguros
Presentation at board DKV Segurososimod
 
Publishing Data on the Web
Publishing Data on the Web Publishing Data on the Web
Publishing Data on the Web Centro Web
 
Big data and data mining
Big data and data miningBig data and data mining
Big data and data miningPolash Halder
 
Discovery: Implementing a vision for a 'virtuous' flow of metadata across the...
Discovery: Implementing a vision for a 'virtuous' flow of metadata across the...Discovery: Implementing a vision for a 'virtuous' flow of metadata across the...
Discovery: Implementing a vision for a 'virtuous' flow of metadata across the...Joy Palmer
 
Computer Applications and Systems - Workshop V
Computer Applications and Systems - Workshop VComputer Applications and Systems - Workshop V
Computer Applications and Systems - Workshop VRaji Gogulapati
 
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest MindsWhitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest MindsHappiest Minds Technologies
 
Impact of DDOD on Data Quality - White House 2016
Impact of DDOD on Data Quality -  White House 2016Impact of DDOD on Data Quality -  White House 2016
Impact of DDOD on Data Quality - White House 2016David Portnoy
 
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...Carole Goble
 
Emerging Forms of Data and Analytics
Emerging Forms of Data and AnalyticsEmerging Forms of Data and Analytics
Emerging Forms of Data and AnalyticsDavid De Roure
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
 
JIMS Rohini IT Flash Monthly Newsletter - October Issue
JIMS Rohini IT Flash Monthly Newsletter  - October IssueJIMS Rohini IT Flash Monthly Newsletter  - October Issue
JIMS Rohini IT Flash Monthly Newsletter - October IssueJIMS Rohini Sector 5
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Anita de Waard
 

Similar to Data Commons Garvan - 2016 (20)

The Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataThe Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big Data
 
Innovation in Future Enterprise, by David Osimo
Innovation in Future Enterprise, by David OsimoInnovation in Future Enterprise, by David Osimo
Innovation in Future Enterprise, by David Osimo
 
Presentation at board DKV Seguros
Presentation at board DKV SegurosPresentation at board DKV Seguros
Presentation at board DKV Seguros
 
Publishing Data on the Web
Publishing Data on the Web Publishing Data on the Web
Publishing Data on the Web
 
Big data and data mining
Big data and data miningBig data and data mining
Big data and data mining
 
Discovery: Implementing a vision for a 'virtuous' flow of metadata across the...
Discovery: Implementing a vision for a 'virtuous' flow of metadata across the...Discovery: Implementing a vision for a 'virtuous' flow of metadata across the...
Discovery: Implementing a vision for a 'virtuous' flow of metadata across the...
 
Computer Applications and Systems - Workshop V
Computer Applications and Systems - Workshop VComputer Applications and Systems - Workshop V
Computer Applications and Systems - Workshop V
 
Conclusion
ConclusionConclusion
Conclusion
 
Applications of Big Data
Applications of Big DataApplications of Big Data
Applications of Big Data
 
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest MindsWhitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
 
Impact of DDOD on Data Quality - White House 2016
Impact of DDOD on Data Quality -  White House 2016Impact of DDOD on Data Quality -  White House 2016
Impact of DDOD on Data Quality - White House 2016
 
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
 
Emerging Forms of Data and Analytics
Emerging Forms of Data and AnalyticsEmerging Forms of Data and Analytics
Emerging Forms of Data and Analytics
 
Big Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARLBig Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARL
 
Big data mining
Big data miningBig data mining
Big data mining
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
BigData
BigDataBigData
BigData
 
JIMS Rohini IT Flash Monthly Newsletter - October Issue
JIMS Rohini IT Flash Monthly Newsletter  - October IssueJIMS Rohini IT Flash Monthly Newsletter  - October Issue
JIMS Rohini IT Flash Monthly Newsletter - October Issue
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
 

Recently uploaded

Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physicsvishikhakeshava1
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxAleenaTreesaSaji
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 

Recently uploaded (20)

Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physics
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptx
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 

Data Commons Garvan - 2016

  • 1. The Data Commons Digital Ecosystems for Sharing and Analyzing biomedical Big Data Vivien Bonazzi, Ph.D. Senior Advisor for Data Science Office of Data Science (ADDS) National Institutes of Health
  • 2. Lets Talk About Biomedical Big Data
  • 3.
  • 4. What Makes Big Data Big? VOLUME VELOCITY VARIETY VERACITY
  • 5. It’s a signal of the coming Digital Economy DATA has VALUE DATA is CENTRAL to the Digital Economy But its more than this…..
  • 6. An economy characterized by using data to gain a business advantage (yes, institutions are a business) Organizations that are not born digital will be at a disadvantage in the new economy
  • 7. Organizations will be defined by their digital assets Scientific digital assets Data Software Workflows
  • 8. The most successful organizations of the future will be those that can leverage their digital assets and transform them into a digital enterprise
  • 9. Make data The currency of an organization Usable in a digital ecosystems – Data Commons
  • 10. The problem with biomedical data Digital assets includes Data
  • 11. Challenges Biomedical Data The Journal Article is the end goal Data is a means to an ends (low value) Data is not FAIR Findable, Accessible, Interoperable, Reproducible Limited e-infrastructures to
  • 14. FAIR principles drive data to become the currency Policies that promote data sharing via FAIR help change the culture
  • 15. We also need a digital ecosystem that allows transactions to occur on FAIR data at scale
  • 16. The Data Commons is a platform that fosters the development of a digital ecosystem
  • 17. The Data Commons platform that fosters development of a digital ecosystem Treats products of research – data, software, methods, papers etc as digital asset (object) Digital objects need to conform to FAIR principles Digital objects exist in a shared virtual space - Find, Deposit, Manage, Share and Reuse: digital assets Enables interactions between Producers and Consumers of digital assets Gives currency to digital assets and the people who develop and support them
  • 18. The Data Commons is a platform? that fosters the development of a digital ecosystem
  • 19. “A platform is a plug and play model that allows multiple participants (producers and consumers) to connect to it, interact with each other and create value” Sangeet Paul Choudary – Platform Scale
  • 20. A lot of what see today uses a platform approach ” Sangeet Paul Choudary – Platform Scale
  • 21. The goal of the a Data Commons Platform is to enable interactions between producers and consumers Sangeet Paul Choudary – Platform Scale
  • 22. To understand the Data Commons Platform (and how it works for biomedical data) we need to use a Platform stack to help visualize the concept
  • 23. Sangeet Paul Choudary – Platform Scale Platforms have 3 layers
  • 24. NIH Data Commons - Platform Stack https://datascience.nih.gov/commons Technology Technology Data Network/ market place
  • 26. Initial Phase Unique digital object identifiers of resolvable to original authoritative source Machine readable A minimal set of searchable metadata Clear access rules (especially important for human subjects data) An entry (with metadata) in one or more indices Future Phases Standard, community based unique digital object identifiers Conform to community approved standard metadata and ontologies for enhanced searching Digital objects accessible via open standard APIs NIH Data Commons: Digital Asset Compliance Making things FAIR
  • 27.
  • 28.
  • 29. Data Commons Platform drives digital ecosystem
  • 30. The NIH Data Commons Pilot
  • 31. The NIH Data Commons Pilot Co-location of large and/or highly utilized NIH funded data with storage and computing infrastructure + Commonly used tools for analyzing and sharing digital objects to create an interoperable resource for the research community. Investigators will be able to collaborate and share digital objects within this environment and connect with others
  • 33. An NIH Wide Data Commons Pilot
  • 34.
  • 38. Considerations Metrics - understanding and accounting of data usage patterns Cost - Cloud Storage, pay for use cloud compute (NIH credits) Hybrid Clouds – Mix of research and commercial clouds Connecting - Interoperability with other Commons, clouds Consent - Reconsenting data, Dynamic consents Standards – Metadata, UIDs, APIs
  • 39. An Australian Commons Experiment?
  • 40. A Garvan Data Commons Platform? Garvan DATA NCI + Cloud Analysis tools (Inc 3rd party) Apps Store Community – Research, Clinical, Public API connectivity with other Commons
  • 41. An Australian Data Commons? Australian DATA - Flora and Fauna Commercial Cloud (NCI) Analysis tools (Inc 3rd party) Apps Store Community – Research, Clinical, Public API connectivity with other Commons
  • 42. “To achieve great things, two things are needed: a plan and not quite enough time” Leonard Bernstein
  • 43. Thank you • ADDS Office - Phil Bourne, Michelle Dunn, Jennie Larkin, Mark Guyer, Sonynka Ngosso • NCBI: Jim Ostell, David Lipman, George Komatsoulis • NHGRI: Valentina di Francesco, Kevin Lee, Eric Green • NIGMS: John Lorsch, Susan Gregurik • CIT: Andrea Norris, Debbie Sinmao, Stacy Charland • NCI: Warren Kibbe, Tony Kerlavage, Lou Staudt, Tanja Davidsen, Ian Fore • NIAID: JJ McGowan, Nick Weber, Darrell Hurt, Maria Giovanni • The NIH Common Fund: Betsy Wilder, Jim Anderson, Leslie Derr • Trans NIH BD2K Executive Committee & Working groups • Many biomedical researchers, cloud providers, IT professionals • John Mattick and the Garvan Institute
  • 44. Stay in Touch QR Business Card LinkedIn @Vivien.Bonazzi Slideshare Blog (Coming soon!) Vivien Bonazzi bonazziv@mail.nih.gov

Editor's Notes

  1. Currencies don’t exist in a vacuum Buy and sell Goods
  2. A nascent platform
  3. Platforms that utilize data as a central currency – enable transactions between producers and consumers
  4. Producers of digital objects - data, tools, workflows - used by consumers The Platform enables these transactions – Accommodates bioinformatics and non bioinformatics users
  5. Framework helps visualize the concept of the platform
  6. * All Garvan Data + Tools in authorized /access control environment – allow access to approved users * Hybrid Clouds: NCI (National Computing Infrastructure) + Commercial (AWS Allow approved users Garvan or others inlcudingcommercial vendors (ie DNA Nexus) to develop tools (SaaS) onto of the Garvan data API connections to other Commons – NY Genome Center * Beacon projects - variation
  7. Develop an Australian Data Commons Make ALL Australian data : flora, fauna incl. human clinical data available in a data commons cloud – (mix of NCI and commercial cloud) Encourage tool development from bioinformatics research or commercial groups Make the commons interoperable with other Cloud Commons Use NCBI and EBI as an archive – learn their annotation methods for metadata and their data distribution methods and cloud access. Embed Postdocs within NCBI and EBI to learn these methods and bring them back to Australia. Develop a team approach Use this as a way to train the next generation of scientists – Bfx and non Bfx