SlideShare a Scribd company logo
1 of 17
Human Genomic Data Discoverability
Fiona Nielsen – Data Dialogue, Cambridge – July 28th 2016
The surge of genomics data
• High throughput technologies – biology is moving from the lab to the
computer
2006 2007 2008 2009 2010 2011 2012 2013 2014 2015
Genomes Sequenced
80+PB
Sequenced
every year
Population sequencing projects
• For example 100,000 Genomes project in the UK
Where is the data?
• A researcher in human genomics knows on average 4-5 data sources
The need to redefine data sharing: http://www.sciencedirect.com/science/article/pii/S2212066114000386
Hundreds of data sources
• Content overview of 163 data sources
Assay Types
Dedicated to…
Hundreds of data sources
• Sizes vary from tens to 100s of thousands of samples
1
10
100
1000
10000
100000
1000000
Sample#(Log10)
Top 5:
GEO (1.8M)
PMI Cohort Program (1M)
Auria Biopankki (1M)
EGA (~0.6M)
SRA (~0.5M)
Which populations are represented?
Aboriginals
African Americans
Africans
Australians
Chinese
Malays
Indians
Danish
Dutch Estonian
Russian
European Ancestry
Finnish
Icelandic
Japanese
Korean
Latin Americans
Saudi
Swedish
Where does the data come from?
947560
0
8
8
66
0
2
6
6
8
50
62
3
2
5
0
0
2
3
International
Interesting site to look at:
http://omicsmaps.com/stats
Why is some data not shared?
• Challenges for international research community: How to work across
borders and silos?
Why is some data not shared?
• Additional challenges for biomedical: Data privacy, data governance,
patient consent, medical legislation
Also consider: Community-led resources
• patient groups, academia, the general public
What needs to change?
• Increased data visibility and accessibility positively benefit both
researchers and patients
?
Pain points
FRAGMENTED
Poor visibility of available
genomic data
ADMIN BURDEN
Huge overhead to manage
data access
BAD CULTURE
Lack of data sharing habits in
research culture
Best practices
MAKE DATA
DISCOVERABLE
SIMPLIFY
WORKFLOWS
CONTRIBUTE TO
COMMUNITY
DNAdigest and Repositive – Connecting the world of genomic data
http://journals.plos.org/plosbiology/article?id=10.1371%2Fjournal.pbio.1002418
Panel discussion
• What are best practices for sharing difficult data?
FAIR data: Findable, Accessible, Interoperable, Reuseable
Translating and Commercialising Genomic Research
7-9 December 2016| Wellcome Genome Campus, Hinxton, Cambridge UK
Applications open soon!
Scientific programme committee
Emmanuelle Astoul Wellcome Trust Sanger Institute, UK
Fiona Nielsen Repositive/DNAdigest, UK
Abel Ureta-Vidal Eagle Genomics, UK
Ross Rounsevell Wellcome Trust Sanger Institute, UK
Full details at:
www.wellcomegenomecampus.org/coursesandconferences
Topics will include:
• Commercial opportunities arising from data aggregation
• Exploiting bioinformatics tools
• Externalising bioinformatics pipelines
• Translating biomarkers, genetic signatures or gene panels
CEO Fiona Nielsen, fiona@repositive.io
Try our free platform for discovering human genomic data http://repositive.io
Follow us on twitter @repositiveio

More Related Content

What's hot

The Growing Call for Open Access - Heather Joseph (2007)
The Growing Call for Open Access - Heather Joseph (2007)The Growing Call for Open Access - Heather Joseph (2007)
The Growing Call for Open Access - Heather Joseph (2007)faflrt
 
Open access publishing and open access data sharing for malaria research and ...
Open access publishing and open access data sharing for malaria research and ...Open access publishing and open access data sharing for malaria research and ...
Open access publishing and open access data sharing for malaria research and ...BioMedCentral
 
Research Data Management and the brave new world, By Paul Ayris
Research Data Management and the brave new world, By Paul AyrisResearch Data Management and the brave new world, By Paul Ayris
Research Data Management and the brave new world, By Paul AyrisLEARN Project
 
FAU Open Access Promotion
FAU Open Access PromotionFAU Open Access Promotion
FAU Open Access PromotionBebe Chang
 
Why, how and by whom? A pro-active approach to open access in Africa
Why, how and by whom? A pro-active approach to open access in AfricaWhy, how and by whom? A pro-active approach to open access in Africa
Why, how and by whom? A pro-active approach to open access in AfricaBioMedCentral
 
Opening access to DFID research
Opening access to DFID researchOpening access to DFID research
Opening access to DFID researchBioMedCentral
 
The Needs of stakeholders in the RDM process - the role of LEARN
The Needs of stakeholders in the RDM process - the role of LEARNThe Needs of stakeholders in the RDM process - the role of LEARN
The Needs of stakeholders in the RDM process - the role of LEARNLEARN Project
 
Arch sci symposium presentataion
Arch sci symposium presentataionArch sci symposium presentataion
Arch sci symposium presentataionAhmad Alam
 
Open by default: the challenges of research data in Europe
Open by default: the challenges of research data in EuropeOpen by default: the challenges of research data in Europe
Open by default: the challenges of research data in EuropeLEARN Project
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?LEARN Project
 
Open Access and Publishers - Michael Mabe (2007)
Open Access and Publishers - Michael Mabe (2007)Open Access and Publishers - Michael Mabe (2007)
Open Access and Publishers - Michael Mabe (2007)faflrt
 
Impact Through Innovation: The Wellcome Sanger Institute
Impact Through Innovation: The Wellcome Sanger InstituteImpact Through Innovation: The Wellcome Sanger Institute
Impact Through Innovation: The Wellcome Sanger InstituteVictoria Lebedeva- Baxter ACIM
 
A Strategy for Sharing Your Research: Make Your Work Open Access
A Strategy for Sharing Your Research: Make Your Work Open AccessA Strategy for Sharing Your Research: Make Your Work Open Access
A Strategy for Sharing Your Research: Make Your Work Open AccessSunghae Ress
 
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyJim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyICZN
 
How can we ensure research data is re-usable? The role of Publishers in Resea...
How can we ensure research data is re-usable? The role of Publishers in Resea...How can we ensure research data is re-usable? The role of Publishers in Resea...
How can we ensure research data is re-usable? The role of Publishers in Resea...LEARN Project
 

What's hot (20)

The Growing Call for Open Access - Heather Joseph (2007)
The Growing Call for Open Access - Heather Joseph (2007)The Growing Call for Open Access - Heather Joseph (2007)
The Growing Call for Open Access - Heather Joseph (2007)
 
Open access publishing and open access data sharing for malaria research and ...
Open access publishing and open access data sharing for malaria research and ...Open access publishing and open access data sharing for malaria research and ...
Open access publishing and open access data sharing for malaria research and ...
 
Research Data Management and the brave new world, By Paul Ayris
Research Data Management and the brave new world, By Paul AyrisResearch Data Management and the brave new world, By Paul Ayris
Research Data Management and the brave new world, By Paul Ayris
 
FAU Open Access Promotion
FAU Open Access PromotionFAU Open Access Promotion
FAU Open Access Promotion
 
Why, how and by whom? A pro-active approach to open access in Africa
Why, how and by whom? A pro-active approach to open access in AfricaWhy, how and by whom? A pro-active approach to open access in Africa
Why, how and by whom? A pro-active approach to open access in Africa
 
Oauk jw oe_conf_final
Oauk jw oe_conf_finalOauk jw oe_conf_final
Oauk jw oe_conf_final
 
Opening access to DFID research
Opening access to DFID researchOpening access to DFID research
Opening access to DFID research
 
The Needs of stakeholders in the RDM process - the role of LEARN
The Needs of stakeholders in the RDM process - the role of LEARNThe Needs of stakeholders in the RDM process - the role of LEARN
The Needs of stakeholders in the RDM process - the role of LEARN
 
Arch sci symposium presentataion
Arch sci symposium presentataionArch sci symposium presentataion
Arch sci symposium presentataion
 
Open by default: the challenges of research data in Europe
Open by default: the challenges of research data in EuropeOpen by default: the challenges of research data in Europe
Open by default: the challenges of research data in Europe
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?
 
Open-access publishing and the transformation of scholarly communication
Open-access publishing and the transformation of scholarly communicationOpen-access publishing and the transformation of scholarly communication
Open-access publishing and the transformation of scholarly communication
 
Developing a community of researchers: A role of an academic librarian
Developing a community of researchers: A role of an academic librarianDeveloping a community of researchers: A role of an academic librarian
Developing a community of researchers: A role of an academic librarian
 
Open Access and Publishers - Michael Mabe (2007)
Open Access and Publishers - Michael Mabe (2007)Open Access and Publishers - Michael Mabe (2007)
Open Access and Publishers - Michael Mabe (2007)
 
Tyler poster v2
Tyler poster  v2Tyler poster  v2
Tyler poster v2
 
Impact Through Innovation: The Wellcome Sanger Institute
Impact Through Innovation: The Wellcome Sanger InstituteImpact Through Innovation: The Wellcome Sanger Institute
Impact Through Innovation: The Wellcome Sanger Institute
 
NISO Webinar May 8: Taking Full Advantage: Discovery of Open Access Content.
NISO Webinar May 8: Taking Full Advantage: Discovery of Open Access Content.NISO Webinar May 8: Taking Full Advantage: Discovery of Open Access Content.
NISO Webinar May 8: Taking Full Advantage: Discovery of Open Access Content.
 
A Strategy for Sharing Your Research: Make Your Work Open Access
A Strategy for Sharing Your Research: Make Your Work Open AccessA Strategy for Sharing Your Research: Make Your Work Open Access
A Strategy for Sharing Your Research: Make Your Work Open Access
 
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyJim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
 
How can we ensure research data is re-usable? The role of Publishers in Resea...
How can we ensure research data is re-usable? The role of Publishers in Resea...How can we ensure research data is re-usable? The role of Publishers in Resea...
How can we ensure research data is re-usable? The role of Publishers in Resea...
 

Viewers also liked

КАРТКА ОБЛІКУ
КАРТКА ОБЛІКУКАРТКА ОБЛІКУ
КАРТКА ОБЛІКУsasha884
 
London traffic through the lens of Behavioral economics
London traffic through the lens of Behavioral economicsLondon traffic through the lens of Behavioral economics
London traffic through the lens of Behavioral economicsSankar Nagarajan
 
EricDrain_Resume
EricDrain_ResumeEricDrain_Resume
EricDrain_ResumeEric Drain
 
Cognitive politics Uncovering the Second presidential debate
Cognitive politics   Uncovering the Second presidential debateCognitive politics   Uncovering the Second presidential debate
Cognitive politics Uncovering the Second presidential debateSankar Nagarajan
 
Study_Food_Fortification_en_WEB-2
Study_Food_Fortification_en_WEB-2Study_Food_Fortification_en_WEB-2
Study_Food_Fortification_en_WEB-2Judith Hodge
 
მსოფლიო ისტორიის ტესტები
მსოფლიო ისტორიის ტესტებიმსოფლიო ისტორიის ტესტები
მსოფლიო ისტორიის ტესტებიmarika rekhviashvili
 
Integrative bioinformatics analysis of Parkinson's disease related omics data
Integrative bioinformatics analysis of Parkinson's disease related omics dataIntegrative bioinformatics analysis of Parkinson's disease related omics data
Integrative bioinformatics analysis of Parkinson's disease related omics dataEnrico Glaab
 
Online education ppt
Online education pptOnline education ppt
Online education pptkriti013
 

Viewers also liked (12)

Presentation1
Presentation1Presentation1
Presentation1
 
КАРТКА ОБЛІКУ
КАРТКА ОБЛІКУКАРТКА ОБЛІКУ
КАРТКА ОБЛІКУ
 
London traffic through the lens of Behavioral economics
London traffic through the lens of Behavioral economicsLondon traffic through the lens of Behavioral economics
London traffic through the lens of Behavioral economics
 
EricDrain_Resume
EricDrain_ResumeEricDrain_Resume
EricDrain_Resume
 
L269
L269L269
L269
 
Cognitive politics Uncovering the Second presidential debate
Cognitive politics   Uncovering the Second presidential debateCognitive politics   Uncovering the Second presidential debate
Cognitive politics Uncovering the Second presidential debate
 
ELUSIVE LUHYA UNITY
ELUSIVE LUHYA UNITYELUSIVE LUHYA UNITY
ELUSIVE LUHYA UNITY
 
Study_Food_Fortification_en_WEB-2
Study_Food_Fortification_en_WEB-2Study_Food_Fortification_en_WEB-2
Study_Food_Fortification_en_WEB-2
 
Weihnachtsaufgabe
WeihnachtsaufgabeWeihnachtsaufgabe
Weihnachtsaufgabe
 
მსოფლიო ისტორიის ტესტები
მსოფლიო ისტორიის ტესტებიმსოფლიო ისტორიის ტესტები
მსოფლიო ისტორიის ტესტები
 
Integrative bioinformatics analysis of Parkinson's disease related omics data
Integrative bioinformatics analysis of Parkinson's disease related omics dataIntegrative bioinformatics analysis of Parkinson's disease related omics data
Integrative bioinformatics analysis of Parkinson's disease related omics data
 
Online education ppt
Online education pptOnline education ppt
Online education ppt
 

Similar to Data dialogue - Human Genomic Data Discovery

Workshop finding and accessing data - fiona - lunteren april 18 2016
Workshop   finding and accessing data - fiona - lunteren april 18 2016Workshop   finding and accessing data - fiona - lunteren april 18 2016
Workshop finding and accessing data - fiona - lunteren april 18 2016Fiona Nielsen
 
Workshop finding and accessing data - fiona nadia charlotte - cambridge apr...
Workshop   finding and accessing data - fiona nadia charlotte - cambridge apr...Workshop   finding and accessing data - fiona nadia charlotte - cambridge apr...
Workshop finding and accessing data - fiona nadia charlotte - cambridge apr...Fiona Nielsen
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015Fiona Nielsen
 
SciDataCon - How to increase accessibility and reuse for clinical and persona...
SciDataCon - How to increase accessibility and reuse for clinical and persona...SciDataCon - How to increase accessibility and reuse for clinical and persona...
SciDataCon - How to increase accessibility and reuse for clinical and persona...Fiona Nielsen
 
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Philip Bourne
 
Open Science: Where Theory Meets Practice
Open Science: Where Theory Meets PracticeOpen Science: Where Theory Meets Practice
Open Science: Where Theory Meets PracticePhilip Bourne
 
Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Susanna-Assunta Sansone
 
Finding and Accessing Human Genomics Datasets
Finding and Accessing Human Genomics DatasetsFinding and Accessing Human Genomics Datasets
Finding and Accessing Human Genomics DatasetsManuel Corpas
 
Stories of “Glocality"—Nations in a Global Infrastructure
Stories of “Glocality"—Nations in a Global InfrastructureStories of “Glocality"—Nations in a Global Infrastructure
Stories of “Glocality"—Nations in a Global InfrastructureResearch Data Alliance
 
ContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UKContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UKpetermurrayrust
 
Pistoia Alliance European Conference 2015 - Julia Wilson / Global Alliance fo...
Pistoia Alliance European Conference 2015 - Julia Wilson / Global Alliance fo...Pistoia Alliance European Conference 2015 - Julia Wilson / Global Alliance fo...
Pistoia Alliance European Conference 2015 - Julia Wilson / Global Alliance fo...Pistoia Alliance
 
Jisc's new shared data centre
Jisc's new shared data centreJisc's new shared data centre
Jisc's new shared data centreJisc
 
International perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataInternational perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataARDC
 
Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016Warren Kibbe
 
The Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeThe Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeVince Smith
 
Public Data Archiving in Ecology and Evolution: How well are we doing?
Public Data Archiving in Ecology and Evolution: How well are we doing?Public Data Archiving in Ecology and Evolution: How well are we doing?
Public Data Archiving in Ecology and Evolution: How well are we doing?Sandra Binning
 
SPARC 2013 Data Management Presentation
SPARC 2013 Data Management Presentation SPARC 2013 Data Management Presentation
SPARC 2013 Data Management Presentation Jackie Wirz, PhD
 

Similar to Data dialogue - Human Genomic Data Discovery (20)

Workshop finding and accessing data - fiona - lunteren april 18 2016
Workshop   finding and accessing data - fiona - lunteren april 18 2016Workshop   finding and accessing data - fiona - lunteren april 18 2016
Workshop finding and accessing data - fiona - lunteren april 18 2016
 
Workshop finding and accessing data - fiona nadia charlotte - cambridge apr...
Workshop   finding and accessing data - fiona nadia charlotte - cambridge apr...Workshop   finding and accessing data - fiona nadia charlotte - cambridge apr...
Workshop finding and accessing data - fiona nadia charlotte - cambridge apr...
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015
 
SciDataCon - How to increase accessibility and reuse for clinical and persona...
SciDataCon - How to increase accessibility and reuse for clinical and persona...SciDataCon - How to increase accessibility and reuse for clinical and persona...
SciDataCon - How to increase accessibility and reuse for clinical and persona...
 
Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?Will Biomedical Research Fundamentally Change in the Era of Big Data?
Will Biomedical Research Fundamentally Change in the Era of Big Data?
 
Open Science: Where Theory Meets Practice
Open Science: Where Theory Meets PracticeOpen Science: Where Theory Meets Practice
Open Science: Where Theory Meets Practice
 
Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014
 
Finding and Accessing Human Genomics Datasets
Finding and Accessing Human Genomics DatasetsFinding and Accessing Human Genomics Datasets
Finding and Accessing Human Genomics Datasets
 
Stories of “Glocality"—Nations in a Global Infrastructure
Stories of “Glocality"—Nations in a Global InfrastructureStories of “Glocality"—Nations in a Global Infrastructure
Stories of “Glocality"—Nations in a Global Infrastructure
 
IRDiRC: progress and expectations
IRDiRC: progress and expectationsIRDiRC: progress and expectations
IRDiRC: progress and expectations
 
ContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UKContentMining for France and Europe; Lessons from 2 years in UK
ContentMining for France and Europe; Lessons from 2 years in UK
 
Pistoia Alliance European Conference 2015 - Julia Wilson / Global Alliance fo...
Pistoia Alliance European Conference 2015 - Julia Wilson / Global Alliance fo...Pistoia Alliance European Conference 2015 - Julia Wilson / Global Alliance fo...
Pistoia Alliance European Conference 2015 - Julia Wilson / Global Alliance fo...
 
Jisc's new shared data centre
Jisc's new shared data centreJisc's new shared data centre
Jisc's new shared data centre
 
International perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataInternational perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research data
 
Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016Nci clinical genomics data sharing ncra sept 2016
Nci clinical genomics data sharing ncra sept 2016
 
The Biodiversity Informatics Landscape
The Biodiversity Informatics LandscapeThe Biodiversity Informatics Landscape
The Biodiversity Informatics Landscape
 
Open data genomics_palermo_2017_ver03
Open data genomics_palermo_2017_ver03Open data genomics_palermo_2017_ver03
Open data genomics_palermo_2017_ver03
 
Public Data Archiving in Ecology and Evolution: How well are we doing?
Public Data Archiving in Ecology and Evolution: How well are we doing?Public Data Archiving in Ecology and Evolution: How well are we doing?
Public Data Archiving in Ecology and Evolution: How well are we doing?
 
VIVO Keynote
VIVO KeynoteVIVO Keynote
VIVO Keynote
 
SPARC 2013 Data Management Presentation
SPARC 2013 Data Management Presentation SPARC 2013 Data Management Presentation
SPARC 2013 Data Management Presentation
 

More from Fiona Nielsen

EICT Summer School August 2023 - Things I never knew I never knew - about bu...
EICT Summer School August 2023 - Things I never knew  I never knew - about bu...EICT Summer School August 2023 - Things I never knew  I never knew - about bu...
EICT Summer School August 2023 - Things I never knew I never knew - about bu...Fiona Nielsen
 
Challenges with pre-clinical studies in immuno oncology - by Fiona Nielsen
Challenges with pre-clinical studies in immuno oncology - by Fiona NielsenChallenges with pre-clinical studies in immuno oncology - by Fiona Nielsen
Challenges with pre-clinical studies in immuno oncology - by Fiona NielsenFiona Nielsen
 
AIDR2019 - standards - tools - incentives - what does it take to enable data ...
AIDR2019 - standards - tools - incentives - what does it take to enable data ...AIDR2019 - standards - tools - incentives - what does it take to enable data ...
AIDR2019 - standards - tools - incentives - what does it take to enable data ...Fiona Nielsen
 
Genomics for the public is coming - are you ready or not?
Genomics for the public is coming - are you ready or not?Genomics for the public is coming - are you ready or not?
Genomics for the public is coming - are you ready or not?Fiona Nielsen
 
Investing in innovation for genomic medicine - sept 5 2017
Investing in innovation for genomic medicine - sept 5 2017Investing in innovation for genomic medicine - sept 5 2017
Investing in innovation for genomic medicine - sept 5 2017Fiona Nielsen
 
Investing in innovation for genomic medicine - the journey of Repositive
Investing in innovation for genomic medicine - the journey of RepositiveInvesting in innovation for genomic medicine - the journey of Repositive
Investing in innovation for genomic medicine - the journey of RepositiveFiona Nielsen
 
From bioinformatics scientist to entrepreneur - Women in Omics - ICG11 - 2016
From bioinformatics scientist to entrepreneur - Women in Omics - ICG11 - 2016From bioinformatics scientist to entrepreneur - Women in Omics - ICG11 - 2016
From bioinformatics scientist to entrepreneur - Women in Omics - ICG11 - 2016Fiona Nielsen
 
ICG-11 - genomic data projects around the world - nov 5 2016
ICG-11 - genomic data projects around the world - nov 5 2016ICG-11 - genomic data projects around the world - nov 5 2016
ICG-11 - genomic data projects around the world - nov 5 2016Fiona Nielsen
 
Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016Fiona Nielsen
 
Genome sharing projects around the world - Open Access is not enough
Genome sharing projects around the world - Open Access is not enough Genome sharing projects around the world - Open Access is not enough
Genome sharing projects around the world - Open Access is not enough Fiona Nielsen
 
From Bioinformatics Scientist to Entrepreneur
From Bioinformatics Scientist to EntrepreneurFrom Bioinformatics Scientist to Entrepreneur
From Bioinformatics Scientist to EntrepreneurFiona Nielsen
 
Session 3 - big (biomedical) data
Session 3 - big (biomedical) dataSession 3 - big (biomedical) data
Session 3 - big (biomedical) dataFiona Nielsen
 
Why i left my job in genomics R&D - Lunteren - april 18 - 2016
Why i left my job in genomics R&D - Lunteren - april 18 - 2016Why i left my job in genomics R&D - Lunteren - april 18 - 2016
Why i left my job in genomics R&D - Lunteren - april 18 - 2016Fiona Nielsen
 
Overcoming barriers for genomic data sharing yaac presentation may 23 2015
Overcoming barriers for genomic data sharing   yaac presentation may 23 2015Overcoming barriers for genomic data sharing   yaac presentation may 23 2015
Overcoming barriers for genomic data sharing yaac presentation may 23 2015Fiona Nielsen
 
The need to redefine genomic data sharing - moving towards Open Science Oct ...
The need to redefine genomic data sharing - moving towards Open Science  Oct ...The need to redefine genomic data sharing - moving towards Open Science  Oct ...
The need to redefine genomic data sharing - moving towards Open Science Oct ...Fiona Nielsen
 
DNAdigest Eagle Genomics Symposium March 27, 2014
DNAdigest Eagle Genomics Symposium March 27, 2014DNAdigest Eagle Genomics Symposium March 27, 2014
DNAdigest Eagle Genomics Symposium March 27, 2014Fiona Nielsen
 

More from Fiona Nielsen (16)

EICT Summer School August 2023 - Things I never knew I never knew - about bu...
EICT Summer School August 2023 - Things I never knew  I never knew - about bu...EICT Summer School August 2023 - Things I never knew  I never knew - about bu...
EICT Summer School August 2023 - Things I never knew I never knew - about bu...
 
Challenges with pre-clinical studies in immuno oncology - by Fiona Nielsen
Challenges with pre-clinical studies in immuno oncology - by Fiona NielsenChallenges with pre-clinical studies in immuno oncology - by Fiona Nielsen
Challenges with pre-clinical studies in immuno oncology - by Fiona Nielsen
 
AIDR2019 - standards - tools - incentives - what does it take to enable data ...
AIDR2019 - standards - tools - incentives - what does it take to enable data ...AIDR2019 - standards - tools - incentives - what does it take to enable data ...
AIDR2019 - standards - tools - incentives - what does it take to enable data ...
 
Genomics for the public is coming - are you ready or not?
Genomics for the public is coming - are you ready or not?Genomics for the public is coming - are you ready or not?
Genomics for the public is coming - are you ready or not?
 
Investing in innovation for genomic medicine - sept 5 2017
Investing in innovation for genomic medicine - sept 5 2017Investing in innovation for genomic medicine - sept 5 2017
Investing in innovation for genomic medicine - sept 5 2017
 
Investing in innovation for genomic medicine - the journey of Repositive
Investing in innovation for genomic medicine - the journey of RepositiveInvesting in innovation for genomic medicine - the journey of Repositive
Investing in innovation for genomic medicine - the journey of Repositive
 
From bioinformatics scientist to entrepreneur - Women in Omics - ICG11 - 2016
From bioinformatics scientist to entrepreneur - Women in Omics - ICG11 - 2016From bioinformatics scientist to entrepreneur - Women in Omics - ICG11 - 2016
From bioinformatics scientist to entrepreneur - Women in Omics - ICG11 - 2016
 
ICG-11 - genomic data projects around the world - nov 5 2016
ICG-11 - genomic data projects around the world - nov 5 2016ICG-11 - genomic data projects around the world - nov 5 2016
ICG-11 - genomic data projects around the world - nov 5 2016
 
Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016
 
Genome sharing projects around the world - Open Access is not enough
Genome sharing projects around the world - Open Access is not enough Genome sharing projects around the world - Open Access is not enough
Genome sharing projects around the world - Open Access is not enough
 
From Bioinformatics Scientist to Entrepreneur
From Bioinformatics Scientist to EntrepreneurFrom Bioinformatics Scientist to Entrepreneur
From Bioinformatics Scientist to Entrepreneur
 
Session 3 - big (biomedical) data
Session 3 - big (biomedical) dataSession 3 - big (biomedical) data
Session 3 - big (biomedical) data
 
Why i left my job in genomics R&D - Lunteren - april 18 - 2016
Why i left my job in genomics R&D - Lunteren - april 18 - 2016Why i left my job in genomics R&D - Lunteren - april 18 - 2016
Why i left my job in genomics R&D - Lunteren - april 18 - 2016
 
Overcoming barriers for genomic data sharing yaac presentation may 23 2015
Overcoming barriers for genomic data sharing   yaac presentation may 23 2015Overcoming barriers for genomic data sharing   yaac presentation may 23 2015
Overcoming barriers for genomic data sharing yaac presentation may 23 2015
 
The need to redefine genomic data sharing - moving towards Open Science Oct ...
The need to redefine genomic data sharing - moving towards Open Science  Oct ...The need to redefine genomic data sharing - moving towards Open Science  Oct ...
The need to redefine genomic data sharing - moving towards Open Science Oct ...
 
DNAdigest Eagle Genomics Symposium March 27, 2014
DNAdigest Eagle Genomics Symposium March 27, 2014DNAdigest Eagle Genomics Symposium March 27, 2014
DNAdigest Eagle Genomics Symposium March 27, 2014
 

Recently uploaded

GBSN - Microbiology (Unit 5) Concept of isolation
GBSN - Microbiology (Unit 5) Concept of isolationGBSN - Microbiology (Unit 5) Concept of isolation
GBSN - Microbiology (Unit 5) Concept of isolationAreesha Ahmad
 
Plasmid: types, structure and functions.
Plasmid: types, structure and functions.Plasmid: types, structure and functions.
Plasmid: types, structure and functions.Cherry
 
ONLINE VOTING SYSTEM SE Project for vote
ONLINE VOTING SYSTEM SE Project for voteONLINE VOTING SYSTEM SE Project for vote
ONLINE VOTING SYSTEM SE Project for voteRaunakRastogi4
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusNazaninKarimi6
 
PODOCARPUS...........................pptx
PODOCARPUS...........................pptxPODOCARPUS...........................pptx
PODOCARPUS...........................pptxCherry
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceAlex Henderson
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professormuralinath2
 
Lipids: types, structure and important functions.
Lipids: types, structure and important functions.Lipids: types, structure and important functions.
Lipids: types, structure and important functions.Cherry
 
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.Cherry
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...Scintica Instrumentation
 
Site specific recombination and transposition.........pdf
Site specific recombination and transposition.........pdfSite specific recombination and transposition.........pdf
Site specific recombination and transposition.........pdfCherry
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.Cherry
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspectsmuralinath2
 
Cot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNACot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNACherry
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Cherry
 
Daily Lesson Log in Science 9 Fourth Quarter Physics
Daily Lesson Log in Science 9 Fourth Quarter PhysicsDaily Lesson Log in Science 9 Fourth Quarter Physics
Daily Lesson Log in Science 9 Fourth Quarter PhysicsWILSONROMA4
 
Cyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptxCyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptxCherry
 
X-rays from a Central “Exhaust Vent” of the Galactic Center Chimney
X-rays from a Central “Exhaust Vent” of the Galactic Center ChimneyX-rays from a Central “Exhaust Vent” of the Galactic Center Chimney
X-rays from a Central “Exhaust Vent” of the Galactic Center ChimneySérgio Sacani
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxCherry
 
GBSN - Biochemistry (Unit 3) Metabolism
GBSN - Biochemistry (Unit 3) MetabolismGBSN - Biochemistry (Unit 3) Metabolism
GBSN - Biochemistry (Unit 3) MetabolismAreesha Ahmad
 

Recently uploaded (20)

GBSN - Microbiology (Unit 5) Concept of isolation
GBSN - Microbiology (Unit 5) Concept of isolationGBSN - Microbiology (Unit 5) Concept of isolation
GBSN - Microbiology (Unit 5) Concept of isolation
 
Plasmid: types, structure and functions.
Plasmid: types, structure and functions.Plasmid: types, structure and functions.
Plasmid: types, structure and functions.
 
ONLINE VOTING SYSTEM SE Project for vote
ONLINE VOTING SYSTEM SE Project for voteONLINE VOTING SYSTEM SE Project for vote
ONLINE VOTING SYSTEM SE Project for vote
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
PODOCARPUS...........................pptx
PODOCARPUS...........................pptxPODOCARPUS...........................pptx
PODOCARPUS...........................pptx
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Lipids: types, structure and important functions.
Lipids: types, structure and important functions.Lipids: types, structure and important functions.
Lipids: types, structure and important functions.
 
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 
Site specific recombination and transposition.........pdf
Site specific recombination and transposition.........pdfSite specific recombination and transposition.........pdf
Site specific recombination and transposition.........pdf
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
Cot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNACot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNA
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
 
Daily Lesson Log in Science 9 Fourth Quarter Physics
Daily Lesson Log in Science 9 Fourth Quarter PhysicsDaily Lesson Log in Science 9 Fourth Quarter Physics
Daily Lesson Log in Science 9 Fourth Quarter Physics
 
Cyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptxCyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptx
 
X-rays from a Central “Exhaust Vent” of the Galactic Center Chimney
X-rays from a Central “Exhaust Vent” of the Galactic Center ChimneyX-rays from a Central “Exhaust Vent” of the Galactic Center Chimney
X-rays from a Central “Exhaust Vent” of the Galactic Center Chimney
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptx
 
GBSN - Biochemistry (Unit 3) Metabolism
GBSN - Biochemistry (Unit 3) MetabolismGBSN - Biochemistry (Unit 3) Metabolism
GBSN - Biochemistry (Unit 3) Metabolism
 

Data dialogue - Human Genomic Data Discovery

  • 1. Human Genomic Data Discoverability Fiona Nielsen – Data Dialogue, Cambridge – July 28th 2016
  • 2. The surge of genomics data • High throughput technologies – biology is moving from the lab to the computer 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 Genomes Sequenced 80+PB Sequenced every year
  • 3. Population sequencing projects • For example 100,000 Genomes project in the UK
  • 4. Where is the data? • A researcher in human genomics knows on average 4-5 data sources The need to redefine data sharing: http://www.sciencedirect.com/science/article/pii/S2212066114000386
  • 5. Hundreds of data sources • Content overview of 163 data sources Assay Types Dedicated to…
  • 6. Hundreds of data sources • Sizes vary from tens to 100s of thousands of samples 1 10 100 1000 10000 100000 1000000 Sample#(Log10) Top 5: GEO (1.8M) PMI Cohort Program (1M) Auria Biopankki (1M) EGA (~0.6M) SRA (~0.5M)
  • 7. Which populations are represented? Aboriginals African Americans Africans Australians Chinese Malays Indians Danish Dutch Estonian Russian European Ancestry Finnish Icelandic Japanese Korean Latin Americans Saudi Swedish
  • 8. Where does the data come from? 947560 0 8 8 66 0 2 6 6 8 50 62 3 2 5 0 0 2 3 International Interesting site to look at: http://omicsmaps.com/stats
  • 9. Why is some data not shared? • Challenges for international research community: How to work across borders and silos?
  • 10. Why is some data not shared? • Additional challenges for biomedical: Data privacy, data governance, patient consent, medical legislation
  • 11. Also consider: Community-led resources • patient groups, academia, the general public
  • 12. What needs to change? • Increased data visibility and accessibility positively benefit both researchers and patients ?
  • 13. Pain points FRAGMENTED Poor visibility of available genomic data ADMIN BURDEN Huge overhead to manage data access BAD CULTURE Lack of data sharing habits in research culture
  • 14. Best practices MAKE DATA DISCOVERABLE SIMPLIFY WORKFLOWS CONTRIBUTE TO COMMUNITY DNAdigest and Repositive – Connecting the world of genomic data http://journals.plos.org/plosbiology/article?id=10.1371%2Fjournal.pbio.1002418
  • 15. Panel discussion • What are best practices for sharing difficult data? FAIR data: Findable, Accessible, Interoperable, Reuseable
  • 16. Translating and Commercialising Genomic Research 7-9 December 2016| Wellcome Genome Campus, Hinxton, Cambridge UK Applications open soon! Scientific programme committee Emmanuelle Astoul Wellcome Trust Sanger Institute, UK Fiona Nielsen Repositive/DNAdigest, UK Abel Ureta-Vidal Eagle Genomics, UK Ross Rounsevell Wellcome Trust Sanger Institute, UK Full details at: www.wellcomegenomecampus.org/coursesandconferences Topics will include: • Commercial opportunities arising from data aggregation • Exploiting bioinformatics tools • Externalising bioinformatics pipelines • Translating biomarkers, genetic signatures or gene panels
  • 17. CEO Fiona Nielsen, fiona@repositive.io Try our free platform for discovering human genomic data http://repositive.io Follow us on twitter @repositiveio

Editor's Notes

  1. Our mission is to speed up research and diagnostics for genetic diseases by enabling efficient and ethical access to genomic research data
  2. Falling cost of sequencing and techological advances General intro to the subject – assume audience are novices but from technical background What’s hot, what’s not Major recent advances Key tactical challenges Strategic issues faced Relevance to Pistoia Alliance activity and strategy (if appropriate)
  3. General intro to the subject – assume audience are novices but from technical background What’s hot, what’s not Major recent advances Key tactical challenges Strategic issues faced Relevance to Pistoia Alliance activity and strategy (if appropriate)
  4. The need to redefine data sharing: http://www.sciencedirect.com/science/article/pii/S2212066114000386
  5. DNA.land OpenSNP PersonalGenomesProject Direct to consumer genetic tests & microbiome
  6. In the light of the increasing costs of drug development, this is an opportunity not to miss!
  7. The Repositive platform is an online community and marketplace connecting data consumers with data providers. On Repositive, Jenn has Easy, Interactive search Faster data access workflow Easy access to new data collaborators Benefiting from reading feedback on data from community, colleagues, to assess data quality and utility The Repositive platform and technology will remove barriers to data sharing and will incentivise users to explore, contribute and collaborate in alignment with best practices
  8. Data is fragmented in unconnected silos – makes it very difficult to discover data Tracking data and working with data access requests is a time-consuming and bureaucratic exercise Difficult to build a user community without best practices and tools/platforms where users can share their data experience / findings
  9. FAIR data: https://www.force11.org/group/fairgroup/fairprinciples
  10. General intro to the subject – assume audience are novices but from technical background What’s hot, what’s not Major recent advances Key tactical challenges Strategic issues faced Relevance to Pistoia Alliance activity and strategy (if appropriate)
  11. Our mission is to speed up research and diagnostics for genetic diseases by enabling efficient and ethical access to genomic research data