SlideShare a Scribd company logo
1 of 44
The Data Commons
Digital Ecosystems for Sharing and
Analyzing biomedical Big Data
Vivien Bonazzi, Ph.D.
Senior Advisor for Data Science
Office of Data Science (ADDS)
National Institutes of Health
Lets Talk About Biomedical Big Data
What Makes Big Data Big?
VOLUME
VELOCITY
VARIETY
VERACITY
It’s a signal of the coming Digital Economy
DATA has VALUE
DATA is CENTRAL to the Digital Economy
But its more than this…..
An economy characterized by
using data to gain a business
advantage
(yes, institutions are a business)
Organizations that are not born
digital will be at a disadvantage
in the new economy
Organizations will be defined by their digital assets
Scientific digital assets
Data
Software
Workflows
The most successful organizations of the future will
be those that can leverage their digital assets and
transform them into a digital enterprise
Make data
The currency of an organization
Usable in a digital ecosystems – Data Commons
The problem with biomedical data
Digital assets includes Data
Challenges Biomedical Data
The Journal Article is the end goal
Data is a means to an ends (low value)
Data is not FAIR
Findable, Accessible, Interoperable,
Reproducible
Limited e-infrastructures to
The Problem
With
Biomedical DATA
https://www.youtube.com/watch?v=N2zK3sAtr-4
What’s
Changing?
FAIR principles drive data to become the currency
Policies that promote data sharing via FAIR help
change the culture
We also need a digital ecosystem that allows
transactions to occur on FAIR data
at scale
The Data Commons
is a platform
that fosters the development of a digital ecosystem
The Data Commons platform that fosters development of a digital
ecosystem
Treats products of research – data, software, methods, papers etc as
digital asset (object)
Digital objects need to conform to FAIR principles
Digital objects exist in a shared virtual space
- Find, Deposit, Manage, Share and Reuse: digital assets
Enables interactions between Producers and Consumers of digital assets
Gives currency to digital assets and the people who develop and support
them
The Data Commons
is a platform?
that fosters the development of a digital ecosystem
“A platform is a plug and play model
that allows multiple participants (producers and
consumers) to connect to it, interact with each
other and create value”
Sangeet Paul Choudary – Platform Scale
A lot of what see today uses a platform approach ”
Sangeet Paul Choudary – Platform Scale
The goal of the a Data Commons Platform is to
enable interactions between producers and
consumers
Sangeet Paul Choudary – Platform Scale
To understand the
Data Commons Platform
(and how it works for biomedical data)
we need to use a Platform stack
to help visualize the concept
Sangeet Paul Choudary – Platform Scale
Platforms have 3 layers
NIH Data Commons - Platform Stack
https://datascience.nih.gov/commons
Technology
Technology
Data
Network/
market place
https://datascience.nih.gov/commons
NIH Data Commons - Platform Stack
Initial Phase
Unique digital object identifiers of resolvable to original authoritative source
Machine readable
A minimal set of searchable metadata
Clear access rules (especially important for human subjects data)
An entry (with metadata) in one or more indices
Future Phases
Standard, community based unique digital object identifiers
Conform to community approved standard metadata and ontologies for
enhanced searching
Digital objects accessible via open standard APIs
NIH Data Commons: Digital Asset Compliance
Making things FAIR
Data Commons Platform drives digital ecosystem
The NIH Data Commons Pilot
The NIH Data Commons Pilot
Co-location of large and/or highly
utilized NIH funded data with
storage and computing infrastructure +
Commonly used tools for analyzing and
sharing digital objects
to create an interoperable resource for
the research community.
Investigators will be able to collaborate
and share digital objects within this
environment and connect with others
NIH Nascent Commons Pilots
An NIH Wide Data Commons Pilot
Indexing
Indexing
Indexing
Authorization /authentication layer
Considerations
Metrics - understanding and accounting of data usage patterns
Cost - Cloud Storage, pay for use cloud compute (NIH credits)
Hybrid Clouds – Mix of research and commercial clouds
Connecting - Interoperability with other Commons, clouds
Consent - Reconsenting data, Dynamic consents
Standards – Metadata, UIDs, APIs
An Australian Commons Experiment?
A Garvan Data Commons Platform?
Garvan DATA
NCI + Cloud
Analysis tools (Inc 3rd party)
Apps Store
Community – Research, Clinical, Public
API connectivity
with other
Commons
An Australian Data Commons?
Australian DATA - Flora and Fauna
Commercial Cloud (NCI)
Analysis tools (Inc 3rd party)
Apps Store
Community – Research, Clinical, Public
API connectivity
with other
Commons
“To achieve great things, two things are
needed: a plan and not quite enough
time”
Leonard Bernstein
Thank you
• ADDS Office
- Phil Bourne, Michelle Dunn, Jennie Larkin, Mark Guyer, Sonynka Ngosso
• NCBI: Jim Ostell, David Lipman, George Komatsoulis
• NHGRI: Valentina di Francesco, Kevin Lee, Eric Green
• NIGMS: John Lorsch, Susan Gregurik
• CIT: Andrea Norris, Debbie Sinmao, Stacy Charland
• NCI: Warren Kibbe, Tony Kerlavage, Lou Staudt, Tanja Davidsen, Ian Fore
• NIAID: JJ McGowan, Nick Weber, Darrell Hurt, Maria Giovanni
• The NIH Common Fund: Betsy Wilder, Jim Anderson, Leslie Derr
• Trans NIH BD2K Executive Committee & Working groups
• Many biomedical researchers, cloud providers, IT professionals
• John Mattick and the Garvan Institute
Stay in Touch
QR Business Card
LinkedIn
@Vivien.Bonazzi
Slideshare
Blog
(Coming soon!)
Vivien Bonazzi
bonazziv@mail.nih.gov

More Related Content

What's hot

NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14
SEAD
 

What's hot (20)

Big Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & InnovationBig Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & Innovation
 
NDS Relevant Update from the NIH Data Science (ADDS) Office
NDS Relevant Update from the NIH Data Science (ADDS) OfficeNDS Relevant Update from the NIH Data Science (ADDS) Office
NDS Relevant Update from the NIH Data Science (ADDS) Office
 
Why Data Citation Currently Misses the Point
Why Data Citation Currently Misses the PointWhy Data Citation Currently Misses the Point
Why Data Citation Currently Misses the Point
 
Komatsoulis internet2 global forum 2015
Komatsoulis internet2 global forum 2015Komatsoulis internet2 global forum 2015
Komatsoulis internet2 global forum 2015
 
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
 
Data Policy for Open Science
Data Policy for Open ScienceData Policy for Open Science
Data Policy for Open Science
 
Data 2.0|
Data 2.0|Data 2.0|
Data 2.0|
 
SEAD slide set (October 2011)
SEAD slide set (October 2011)SEAD slide set (October 2011)
SEAD slide set (October 2011)
 
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeo
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeoLicence to Share: Research and Collaboration through Go-Geo! and ShareGeo
Licence to Share: Research and Collaboration through Go-Geo! and ShareGeo
 
Komatsoulis internet2 executive track
Komatsoulis internet2 executive trackKomatsoulis internet2 executive track
Komatsoulis internet2 executive track
 
ESIP Federation: Community-Driven, Collaborative Governance - Carol Beaton Me...
ESIP Federation: Community-Driven, Collaborative Governance - Carol Beaton Me...ESIP Federation: Community-Driven, Collaborative Governance - Carol Beaton Me...
ESIP Federation: Community-Driven, Collaborative Governance - Carol Beaton Me...
 
Infrastructure, relationships, trust, and RDA
Infrastructure, relationships, trust, and RDAInfrastructure, relationships, trust, and RDA
Infrastructure, relationships, trust, and RDA
 
NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14
 
Towards the Digital Research Enterprise
Towards the Digital Research EnterpriseTowards the Digital Research Enterprise
Towards the Digital Research Enterprise
 
Digital Curation in Libraries: An innovative way of content preservation and...
Digital Curation in Libraries:  An innovative way of content preservation and...Digital Curation in Libraries:  An innovative way of content preservation and...
Digital Curation in Libraries: An innovative way of content preservation and...
 
FAIR data
FAIR dataFAIR data
FAIR data
 
Introduction to digital curation
Introduction to digital curationIntroduction to digital curation
Introduction to digital curation
 
A Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
A Framework for Geospatial Web Services for Public Health by Dr. Leslie LenertA Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
A Framework for Geospatial Web Services for Public Health by Dr. Leslie Lenert
 
ESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and Tools
 
Digital Curation Technology: JHU Summit, October 2015
Digital Curation Technology: JHU Summit, October 2015Digital Curation Technology: JHU Summit, October 2015
Digital Curation Technology: JHU Summit, October 2015
 

Similar to Data Commons Garvan - 2016

Discovery: Implementing a vision for a 'virtuous' flow of metadata across the...
Discovery: Implementing a vision for a 'virtuous' flow of metadata across the...Discovery: Implementing a vision for a 'virtuous' flow of metadata across the...
Discovery: Implementing a vision for a 'virtuous' flow of metadata across the...
Joy Palmer
 

Similar to Data Commons Garvan - 2016 (20)

The Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataThe Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big Data
 
Innovation in Future Enterprise, by David Osimo
Innovation in Future Enterprise, by David OsimoInnovation in Future Enterprise, by David Osimo
Innovation in Future Enterprise, by David Osimo
 
Presentation at board DKV Seguros
Presentation at board DKV SegurosPresentation at board DKV Seguros
Presentation at board DKV Seguros
 
Publishing Data on the Web
Publishing Data on the Web Publishing Data on the Web
Publishing Data on the Web
 
Big data and data mining
Big data and data miningBig data and data mining
Big data and data mining
 
Discovery: Implementing a vision for a 'virtuous' flow of metadata across the...
Discovery: Implementing a vision for a 'virtuous' flow of metadata across the...Discovery: Implementing a vision for a 'virtuous' flow of metadata across the...
Discovery: Implementing a vision for a 'virtuous' flow of metadata across the...
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
Computer Applications and Systems - Workshop V
Computer Applications and Systems - Workshop VComputer Applications and Systems - Workshop V
Computer Applications and Systems - Workshop V
 
Conclusion
ConclusionConclusion
Conclusion
 
Applications of Big Data
Applications of Big DataApplications of Big Data
Applications of Big Data
 
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest MindsWhitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
 
Impact of DDOD on Data Quality - White House 2016
Impact of DDOD on Data Quality -  White House 2016Impact of DDOD on Data Quality -  White House 2016
Impact of DDOD on Data Quality - White House 2016
 
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
 
Emerging Forms of Data and Analytics
Emerging Forms of Data and AnalyticsEmerging Forms of Data and Analytics
Emerging Forms of Data and Analytics
 
Big Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARLBig Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARL
 
Big data mining
Big data miningBig data mining
Big data mining
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
BigData
BigDataBigData
BigData
 
JIMS Rohini IT Flash Monthly Newsletter - October Issue
JIMS Rohini IT Flash Monthly Newsletter  - October IssueJIMS Rohini IT Flash Monthly Newsletter  - October Issue
JIMS Rohini IT Flash Monthly Newsletter - October Issue
 

Recently uploaded

Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Sérgio Sacani
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
Sérgio Sacani
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
RizalinePalanog2
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
Sérgio Sacani
 

Recently uploaded (20)

Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Creating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening DesignsCreating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening Designs
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 

Data Commons Garvan - 2016

  • 1. The Data Commons Digital Ecosystems for Sharing and Analyzing biomedical Big Data Vivien Bonazzi, Ph.D. Senior Advisor for Data Science Office of Data Science (ADDS) National Institutes of Health
  • 2. Lets Talk About Biomedical Big Data
  • 3.
  • 4. What Makes Big Data Big? VOLUME VELOCITY VARIETY VERACITY
  • 5. It’s a signal of the coming Digital Economy DATA has VALUE DATA is CENTRAL to the Digital Economy But its more than this…..
  • 6. An economy characterized by using data to gain a business advantage (yes, institutions are a business) Organizations that are not born digital will be at a disadvantage in the new economy
  • 7. Organizations will be defined by their digital assets Scientific digital assets Data Software Workflows
  • 8. The most successful organizations of the future will be those that can leverage their digital assets and transform them into a digital enterprise
  • 9. Make data The currency of an organization Usable in a digital ecosystems – Data Commons
  • 10. The problem with biomedical data Digital assets includes Data
  • 11. Challenges Biomedical Data The Journal Article is the end goal Data is a means to an ends (low value) Data is not FAIR Findable, Accessible, Interoperable, Reproducible Limited e-infrastructures to
  • 14. FAIR principles drive data to become the currency Policies that promote data sharing via FAIR help change the culture
  • 15. We also need a digital ecosystem that allows transactions to occur on FAIR data at scale
  • 16. The Data Commons is a platform that fosters the development of a digital ecosystem
  • 17. The Data Commons platform that fosters development of a digital ecosystem Treats products of research – data, software, methods, papers etc as digital asset (object) Digital objects need to conform to FAIR principles Digital objects exist in a shared virtual space - Find, Deposit, Manage, Share and Reuse: digital assets Enables interactions between Producers and Consumers of digital assets Gives currency to digital assets and the people who develop and support them
  • 18. The Data Commons is a platform? that fosters the development of a digital ecosystem
  • 19. “A platform is a plug and play model that allows multiple participants (producers and consumers) to connect to it, interact with each other and create value” Sangeet Paul Choudary – Platform Scale
  • 20. A lot of what see today uses a platform approach ” Sangeet Paul Choudary – Platform Scale
  • 21. The goal of the a Data Commons Platform is to enable interactions between producers and consumers Sangeet Paul Choudary – Platform Scale
  • 22. To understand the Data Commons Platform (and how it works for biomedical data) we need to use a Platform stack to help visualize the concept
  • 23. Sangeet Paul Choudary – Platform Scale Platforms have 3 layers
  • 24. NIH Data Commons - Platform Stack https://datascience.nih.gov/commons Technology Technology Data Network/ market place
  • 26. Initial Phase Unique digital object identifiers of resolvable to original authoritative source Machine readable A minimal set of searchable metadata Clear access rules (especially important for human subjects data) An entry (with metadata) in one or more indices Future Phases Standard, community based unique digital object identifiers Conform to community approved standard metadata and ontologies for enhanced searching Digital objects accessible via open standard APIs NIH Data Commons: Digital Asset Compliance Making things FAIR
  • 27.
  • 28.
  • 29. Data Commons Platform drives digital ecosystem
  • 30. The NIH Data Commons Pilot
  • 31. The NIH Data Commons Pilot Co-location of large and/or highly utilized NIH funded data with storage and computing infrastructure + Commonly used tools for analyzing and sharing digital objects to create an interoperable resource for the research community. Investigators will be able to collaborate and share digital objects within this environment and connect with others
  • 33. An NIH Wide Data Commons Pilot
  • 34.
  • 38. Considerations Metrics - understanding and accounting of data usage patterns Cost - Cloud Storage, pay for use cloud compute (NIH credits) Hybrid Clouds – Mix of research and commercial clouds Connecting - Interoperability with other Commons, clouds Consent - Reconsenting data, Dynamic consents Standards – Metadata, UIDs, APIs
  • 39. An Australian Commons Experiment?
  • 40. A Garvan Data Commons Platform? Garvan DATA NCI + Cloud Analysis tools (Inc 3rd party) Apps Store Community – Research, Clinical, Public API connectivity with other Commons
  • 41. An Australian Data Commons? Australian DATA - Flora and Fauna Commercial Cloud (NCI) Analysis tools (Inc 3rd party) Apps Store Community – Research, Clinical, Public API connectivity with other Commons
  • 42. “To achieve great things, two things are needed: a plan and not quite enough time” Leonard Bernstein
  • 43. Thank you • ADDS Office - Phil Bourne, Michelle Dunn, Jennie Larkin, Mark Guyer, Sonynka Ngosso • NCBI: Jim Ostell, David Lipman, George Komatsoulis • NHGRI: Valentina di Francesco, Kevin Lee, Eric Green • NIGMS: John Lorsch, Susan Gregurik • CIT: Andrea Norris, Debbie Sinmao, Stacy Charland • NCI: Warren Kibbe, Tony Kerlavage, Lou Staudt, Tanja Davidsen, Ian Fore • NIAID: JJ McGowan, Nick Weber, Darrell Hurt, Maria Giovanni • The NIH Common Fund: Betsy Wilder, Jim Anderson, Leslie Derr • Trans NIH BD2K Executive Committee & Working groups • Many biomedical researchers, cloud providers, IT professionals • John Mattick and the Garvan Institute
  • 44. Stay in Touch QR Business Card LinkedIn @Vivien.Bonazzi Slideshare Blog (Coming soon!) Vivien Bonazzi bonazziv@mail.nih.gov

Editor's Notes

  1. Currencies don’t exist in a vacuum Buy and sell Goods
  2. A nascent platform
  3. Platforms that utilize data as a central currency – enable transactions between producers and consumers
  4. Producers of digital objects - data, tools, workflows - used by consumers The Platform enables these transactions – Accommodates bioinformatics and non bioinformatics users
  5. Framework helps visualize the concept of the platform
  6. * All Garvan Data + Tools in authorized /access control environment – allow access to approved users * Hybrid Clouds: NCI (National Computing Infrastructure) + Commercial (AWS Allow approved users Garvan or others inlcudingcommercial vendors (ie DNA Nexus) to develop tools (SaaS) onto of the Garvan data API connections to other Commons – NY Genome Center * Beacon projects - variation
  7. Develop an Australian Data Commons Make ALL Australian data : flora, fauna incl. human clinical data available in a data commons cloud – (mix of NCI and commercial cloud) Encourage tool development from bioinformatics research or commercial groups Make the commons interoperable with other Cloud Commons Use NCBI and EBI as an archive – learn their annotation methods for metadata and their data distribution methods and cloud access. Embed Postdocs within NCBI and EBI to learn these methods and bring them back to Australia. Develop a team approach Use this as a way to train the next generation of scientists – Bfx and non Bfx