NeuroVault and the vision for
data sharing in neuroimaging
Chris Gorgolewski
Max Planck Research Group: Neuroanatomy & Connectivity
Leipzig, Germany
Data sharing?
•  Ok, ok so we should share data.
•  We all know it’s good.
•  But almost no one does it.
–  You have to prepare data
–  You risk that your mistakes will be found!
Part I: Large scale data sharing is a fact
Part II: Data sharing does not have to be
expensive
Part III: Making data sharing count
Part IV: Implications of data sharing
The data out there is calling you…

PART I: LARGE SCALE DATA
SHARING IS A FACT
NKI Enhanced
•  329 subjects (will reach 1000)
–  Representative sample: young and old, some with
mental health history

•  1 hour's worth of MRI (3T) scanning:
–  MPRAGE (TR = 1900 ms; voxel size = 1 mm isotropic)
–  3x resting-state scans (TR = 645 ms, 1400 ms, and 2500 ms)
–  Diffusion Tensor Imaging (137 directions; voxel size = 2 mm isotropic)
–  Visual checkerboard and breath-holding manipulations

fcon_1000.projects.nitrc.org/indi/enhanced/
Human Connectome Project
•  ~232 subjects (will reach 1200)
–  Young and healthy (22-35yrs)
–  200 twins!

•  1 hour's worth of MRI scanning:
–  Resting-state fMRI (R-fMRI)
–  Task-evoked fMRI (T-fMRI)
   •  Working Memory
   •  Gambling
   •  Motor
   •  Language
   •  Social Cognition
   •  Relational Processing
   •  Emotion Processing
–  Diffusion MRI (dMRI)
Human Connectome Project
•  Rich phenotypical data
–  Cognition, personality, substance abuse etc.

•  Genotyping!
•  Methodological developments
(not yet available)

–  Fine tuned sequences
–  New preprocessing techniques

•  Ready to use preprocessed data
humanconnectome.org
Test-retest datasets
•  NKI multiband Test-retest
•  Classification learning and stop-signal
(1 year test-retest)
•  A test-retest fMRI dataset for motor,
language and spatial attention functions
FCP/INDI Usage Survey
FCP/INDI Data Usage Description

Master's thesis research                              11.94%
Doctoral dissertation research                        38.81%
Teaching resource (projects or examples)              13.43%
Pilot data for grant applications                     16.42%
Research intended for publication                     76.12%
Independent study (e.g., teach self about analysis)   37.31%

FCP/INDI users; 10% response rate

Survey Courtesy of Stan Colcombe & Cameron Craddock
Sharing little things…

PART II: DATA SHARING DOES
NOT HAVE TO BE EXPENSIVE
Just coordinates?
•  Databases such as Neurosynth or
BrainMap rely on peak coordinates
reported in papers (only strong effects)
Are we throwing money away?
Baby steps
•  Everything is a question of cost and benefit
–  If we keep the cost low, even a small benefit (or
just the conviction that data sharing is GOOD) will
suffice
NeuroVault.org
simple data sharing
•  Minimize the cost!
•  We just want your statistical maps with
a minimal description (DOI)
–  If you want, you can add more metadata, but
you don’t have to

•  We streamline the login process (Google,
Facebook)
Benefits?
•  In return authors get an interactive web-based
visualization of their statistical maps
–  Something they can embed on their lab
website

•  We are keeping both cost and benefit low…
–  …but we also plan to work with journal editors
to popularize the idea
Live demo
Using NeuroVault…
•  Improves collaboration
•  Makes your paper more attractive
•  Shows you care about transparency
•  Takes only five minutes
•  Gives you a warm and fuzzy feeling that you
helped future meta-analyses
NeuroVault for developers
•  RESTful API (field tested by Neurosynth)
•  Source code available on GitHub
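
As a rough illustration of what the RESTful API makes possible, the sketch below builds a request URL for the images in one collection and pulls file links out of a decoded JSON page. The base URL, the `collections/<id>/images/` path, the `limit`/`offset` parameters and the `results`, `name` and `file` keys are assumptions made for this example, not details taken from the slides; check the NeuroVault API documentation before relying on them. The helper names are hypothetical.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: base URL and endpoint layout; verify against the API docs.
API_ROOT = "https://neurovault.org/api"


def collection_images_url(collection_id, limit=100, offset=0):
    """Build the URL that lists the images of one collection (assumed layout)."""
    query = urlencode({"limit": limit, "offset": offset})
    return f"{API_ROOT}/collections/{collection_id}/images/?{query}"


def extract_map_files(page):
    """Return (name, file URL) pairs from one decoded JSON page.

    Assumes a paginated payload with a "results" list whose items carry
    "name" and "file" keys; missing keys come back as None.
    """
    return [(item.get("name"), item.get("file")) for item in page.get("results", [])]


def fetch_page(url):
    """Download and decode one JSON page (performs a network request)."""
    with urlopen(url) as response:
        return json.load(response)
```

A caller would combine the three helpers, for example `extract_map_files(fetch_page(collection_images_url(some_id)))`, where `some_id` is the numeric ID of a public collection.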
NeuroVault.org
Credit where credit’s due

PART III: MAKING DATA SHARING
COUNT
Motivation
•  Share your stat maps!

vs.
institutions

scientists
Quality control
•  Share your stat maps!
Complex datasets require
elaborate descriptions
Solution – data papers
•  Authors get recognizable credit for their
work.
–  Even smaller contributors such as RAs can be
included.

•  Acquisition methods are described in
detail.
•  Quality of metadata is controlled by
peer review.
Gorgolewski, Milham, and Margulies, 2013
Where to publish data papers?
•  Neuroinformatics (Springer)
•  Frontiers in Human Brain Methods
(Frontiers Media)
•  GigaScience (BGI, BioMed Central)
•  Scientific Data (Nature Publishing Group,
coming soon)
Where to publish data papers?
•  Neuroinformatics (Springer)
•  Frontiers in Human Brain Methods
(Nature Publishing Group)
•  GigaScience (BGI, BioMed Central)
•  Scientific Data (Nature Publishing Group,
coming soon)
PART IV: IMPLICATIONS OF
DATA SHARING
Sample sizes will grow

By combining multiple shared datasets or
using one of the big data sharing initiatives
we will gain access to bigger samples
Bigger samples
•  Better parameter estimates
•  Lower ratio of false positives (and false
negatives)
•  Lower risk of inflated effect sizes
•  Higher power: better sensitivity to small
effects
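
To make the link between sample size and power concrete, here is a minimal sketch that approximates the power of a two-sided, two-sample comparison for a given standardized effect size. The normal (z-test) approximation, the function name and the example sample sizes are assumptions chosen for illustration; they do not come from the slides.

```python
from math import sqrt
from statistics import NormalDist


def two_sample_power(effect_size, n_per_group, alpha=0.05):
    """Approximate power of a two-sided, two-sample z-test.

    effect_size is the standardized mean difference (Cohen's d). The normal
    approximation is an assumption and ignores small-sample t corrections.
    """
    norm = NormalDist()
    z_crit = norm.inv_cdf(1 - alpha / 2)
    # Expected value of the test statistic under the alternative.
    shift = effect_size * sqrt(n_per_group / 2)
    # Probability of landing in either rejection region.
    return norm.cdf(shift - z_crit) + norm.cdf(-shift - z_crit)


if __name__ == "__main__":
    # Hypothetical group sizes, from a single-lab study to a pooled dataset.
    for n in (20, 100, 500, 1000):
        print(n, round(two_sample_power(0.2, n), 3))
```

Under these assumptions, power for a small effect rises steadily as the per-group sample grows, which is the sensitivity gain the slide refers to.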
Is more power bad?
•  In classical hypothesis testing the null
hypothesis usually states no difference
•  However, nothing in nature is exactly the
same
•  In most cases we just don’t have enough
power to see the difference
•  Some differences are more important than
others
Ridgway, Gerard (2013): Illustrative effect sizes for sex differences. figshare.
http://dx.doi.org/10.6084/m9.figshare.866802
Is more power bad?
•  No – an “overpowered study” is an oxymoron
•  But we will need to revise our methods
•  Incorporate our assumptions about what counts as a
trivial effect size into our analyses
–  Either through a Bayesian framework or different
null hypotheses

•  Start looking at effect and confidence interval
maps instead of just thresholded p-maps
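
One simple way to act on these points is to report a confidence interval and compare it against a band of effects considered trivial, rather than testing against exactly zero. The sketch below is an interval-based check under a normal approximation; the function name, the `trivial` threshold and the example numbers are hypothetical, and this is only one of several possible ways to encode a non-zero null hypothesis.

```python
from statistics import NormalDist


def effect_summary(effect, std_error, trivial=0.1, alpha=0.05):
    """Summarize an estimated effect against zero and against a trivial band.

    effect and std_error are the estimate and its standard error; trivial is
    the largest absolute effect we would still regard as unimportant
    (an assumption the analyst has to justify).
    """
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    lower = effect - z_crit * std_error
    upper = effect + z_crit * std_error
    return {
        "ci": (lower, upper),
        # Classical point null: the interval excludes zero.
        "differs_from_zero": lower > 0 or upper < 0,
        # Stricter check: the whole interval lies outside the trivial band.
        "exceeds_trivial": lower > trivial or upper < -trivial,
    }


if __name__ == "__main__":
    # A tiny effect measured very precisely, as in a very large sample.
    print(effect_summary(effect=0.02, std_error=0.005))
```

With a very large sample, a tiny effect can differ reliably from zero while its whole interval still sits inside the trivial band, which is the situation the slide warns about.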
Vibration effect
•  While analyzing an MRI dataset we face a
plethora of choices
•  Some alternatives have no clear bad or
good options
•  The vibration effect is the ratio of the largest
to the smallest effect size obtained across all
processing options
Ioannidis, J. P. A. (2008). Why most discovered true associations are inflated. Epidemiology.
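
The definition above can be sketched in a few lines: collect the effect size obtained under each processing option and divide the largest by the smallest. The pipeline names and values below are made up for illustration, and the sketch assumes all effects are positive so that the ratio is meaningful.

```python
def vibration_ratio(effects):
    """Ratio of the largest to the smallest effect size across pipelines.

    effects maps a pipeline label to the effect size it produced. All values
    are assumed to be positive; otherwise the ratio is not well defined.
    """
    sizes = list(effects.values())
    if not sizes or min(sizes) <= 0:
        raise ValueError("need at least one effect size, all positive")
    return max(sizes) / min(sizes)


# Hypothetical effect sizes for the same contrast under different choices.
pipelines = {
    "smooth_4mm_no_motion_regressors": 0.2,
    "smooth_8mm_no_motion_regressors": 0.4,
    "smooth_8mm_motion_regressors": 0.5,
}

if __name__ == "__main__":
    print(vibration_ratio(pipelines))
```

A ratio close to 1 would mean the result barely depends on the analysis choices; larger values indicate more analytic flexibility.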
Vibration effect


Carp J (2012) On the plurality of (methodological) worlds: estimating the
analytic flexibility of fMRI experiments. Front. Neurosci. 6:149.
doi: 10.3389/fnins.2012.00149
Vibration effect
•  We will finally see how much (or little) our
analyses replicate over different datasets
and methods
Take home message(s)
I.  Take advantage of shared data
II.  Share your statistical maps at
NeuroVault.org
III.  Share your data and publish a data paper
IV.  Expect changes in the way we analyze
our data
Acknowledgements
(my personal giants)
Pierre-Louie Bazin
Haakon Engen
Satrajit Ghosh
Russell A. Poldrack
Jean-Baptiste Poline
Yannick Schwarz
Tal Yarkoni
Michael Milham
Daniel Margulies
Benjamin Baird

Jonathan Smallwood
Yannick Schwarz
Florence J.M. Ruby
Melaina Vinski
Camille Maumet
Dan Lurie
Sebastian Urchs
Judy A. Kipping
R. Cameron Craddock
MPI CBS Resting state group
	
  
THANK YOU!
“I swear I’ve heard it before”
•  In the past there were many attempts to
promote data sharing
–  For example fMRIDC, which struggled with:
•  technical issues
•  …and the amount of time it took to prepare data for
submission (a week, a very frustrating week)

•  fMRIDC was, however, very ambitious for
its time:
–  They wanted to collect raw data and all
metadata required to reproduce the analysis
Van Horn & Gazzaniga (2013). Why share data? Lessons learned from the fMRIDC. NeuroImage

NeuroVault and the vision for data sharing in neuroimaging

  • 1. NeuroVault and the vision for data sharing in neuroimaging Chris Gorgolewski Max Planck Research Group: Neuroanatomy & Connectivity Leipzig, Germany
  • 3.
  • 4. Data sharing? •  Ok, ok so we should share data. •  We all know it’s good. •  But almost no one does it. –  You have to prepare data –  You risk that your mistakes will be found!
  • 5. Part I: Large scale data sharing is a fact Part II: Data sharing does not have to be expensive Part III: Making data sharing count Part IV: Implications of data sharing
  • 6. The data out there is calling you… PART I: LARGE SCALE DATA SHARING IS A FACT
  • 7. NKI Enhanced •  329 subjects (will reach 1000) –  Representative sample: young and old, some with mental health history •  1 hour worth of MRI (3T) scanning: –  MPRAGE (TR = 1900; voxel size = 1mm isotropic) –  3x resting state scans (645msec, 1400msec, and 2500msec) –  Diffusion Tensor Imaging (137 direction; voxel size = 2mm isotropic) –  Visual Checkboard and Breath Holding manipulations  
  • 8.
  • 10. Human Connectome Project •  ~232 subjects (will reach 1200) –  Young and healthy (22-35yrs) –  200 twins! •  1 hour worth of MRI scanning: –  Resting-state fMRI (R-fMRI) –  Task-evoked fMRI (T-fMRI) •  •  •  •  •  •  •  Working Memory Gambling Motor Language Social Cognition Relational Processing Emotion Processing –  Diffusion MRI (dMRI)  
  • 11. Human Connectome Project •  Rich phenotypical data –  Cognition, personality, substance abuse etc. •  Genotyping! •  Methodological developments (not yet available) –  Fine tuned sequences –  New preprocessing techniques •  Ready to use preprocessed data
  • 13. Test-retest datasets •  NKI multiband Test-retest •  Classification learning and stop-signal (1 year test-retest) •  A test-retest fMRI dataset for motor, language and spatial attention functions
  • 14. FCP/INDI Usage Survey FCP/INDI Data Usage Description   Master's thesis research Doctoral dissertation research Teaching resource (projects or examples) Pilot data for grant applications Research intended for publication Independent study (e.g., teach self about analysis)   11.94% 38.81% 13.43% 16.42% 76.12% 37.31% FCP/INDI Users; 10% respondent rate Survey Courtesy of Stan Colcombe & Cameron Craddock
  • 15. Sharing little things… PART II: DATA SHARING DOES NOT HAVE TO BE EXPENSIVE
  • 16. Just coordinates? •  Databases such as Neurosynth or BrainMap rely on peak coordinates reported in papers (only strong effects)
  • 17. Are we throwing money away?
  • 18.
  • 19. Baby steps •  Everything is a question of cost and benefit –  If we keep the cost low even small benefit (or just conviction that data sharing is GOOD) will suffice
  • 20. NeuroVault.org simple data sharing •  Minimize the cost! •  We just want your statistical maps with minimum description (DOI) –  If you want you can put more metadata, but you don’t have to •  We streamline login process (Google, Facebook)
  • 21. Benefits? •  In return authors get interactive web based visualization of their statistical maps –  Something they can embed on their lab website •  We are keeping both cost and benefit low… –  …but we also plan to work with journal editors to popularize the idea
  • 23. Using NeuroVault… •  Improves collaboration •  Makes your paper more attractive •  Shows you care about transparency •  Takes only five minutes •  Gives you a warm and fuzzy feeling that you helped future meta-analyses
  • 24. NeuroVault for developers •  RESTful API (field tested by Neurosynth) •  Source code available on GitHub
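To give a feel for what working against a RESTful API like this could look like, here is a minimal Python sketch. The base URL, the `/collections/<id>/images/` path and the `results`/`file` field names are assumptions made for illustration; they are not taken from the slides and should be checked against the current NeuroVault API documentation.

```python
import json
from urllib.request import urlopen

# Assumption: base URL and path layout are illustrative, not from the slides.
API_ROOT = "https://neurovault.org/api"


def collection_images_url(collection_id):
    """Build the (assumed) URL that lists the statistical maps of one collection."""
    return f"{API_ROOT}/collections/{int(collection_id)}/images/"


def extract_map_files(payload):
    """Return the 'file' entry of every image in an already parsed JSON response.

    Assumes a response shaped like {"results": [{"file": ...}, ...]}.
    Images without a 'file' entry are skipped.
    """
    return [img["file"] for img in payload.get("results", []) if "file" in img]


def fetch_map_files(collection_id):
    """Fetch the listing for one collection (requires network access)."""
    with urlopen(collection_images_url(collection_id)) as response:
        return extract_map_files(json.load(response))
```

Keeping the URL building and the JSON parsing separate from the network call means the parsing can be checked offline against a saved response.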
  • 26. Credit where credit’s due PART III: MAKING DATA SHARING COUNT
  • 27. Motivation •  Share your stat maps! Institutions vs. scientists
  • 28. Quality control •  Complex datasets require elaborate descriptions
  • 29. Solution – data papers •  Authors get recognizable credit for their work. –  Even smaller contributors such as RAs can be included. •  Acquisition methods are described in detail. •  The quality of metadata is controlled by peer review.
  • 30. Gorgolewski, Milham, and Margulies, 2013
  • 31. Where to publish data papers? •  Neuroinformatics (Springer) •  Frontiers in Human Brain Methods (Frontiers Media) •  GigaScience (BGI, BioMed Central) •  Scientific Data (Nature Publishing Group, coming soon)
  • 32. Where to publish data papers? •  Neuroinformatics (Springer) •  Frontiers in Human Brain Methods (Nature Publishing Group) •  GigaScience (BGI, BioMed Central) •  Scientific Data (Nature Publishing Group, coming soon)
  • 35. PART IV: IMPLICATIONS OF DATA SHARING
  • 36. Sample sizes will grow •  By combining multiple shared datasets or using one of the big data sharing initiatives, we will gain access to bigger samples
  • 37. Bigger samples •  Better parameter estimates •  Lower ratio of false positives (and false negatives) •  Lower risk of inflated effect sizes •  Higher power: better sensitivity to small effects
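The link between sample size and power can be made concrete with a small calculation. The sketch below is an illustration added here, not something from the talk: it uses the textbook normal approximation for a two-sided, two-sample comparison with equal group sizes, where `effect_size` is a standardized mean difference (Cohen's d). An exact calculation would use the noncentral t distribution instead.

```python
from statistics import NormalDist


def approx_power(effect_size, n_per_group, alpha=0.05):
    """Approximate power of a two-sided two-sample test (normal approximation).

    effect_size is a standardized mean difference; both groups are assumed
    to have n_per_group subjects and equal variance.
    """
    z = NormalDist()
    z_crit = z.inv_cdf(1 - alpha / 2)
    # Expected value of the test statistic under the alternative.
    noncentrality = effect_size * (n_per_group / 2) ** 0.5
    # Probability of landing in either rejection tail.
    return z.cdf(noncentrality - z_crit) + z.cdf(-noncentrality - z_crit)


if __name__ == "__main__":
    # A small effect (d = 0.2) at increasing sample sizes.
    for n in (20, 100, 500, 1000):
        print(n, round(approx_power(0.2, n), 3))
```

With a zero effect the function returns the false positive rate `alpha`, and for any non-zero effect the returned power rises with `n_per_group`.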
  • 38. Is more power bad? •  In classical hypothesis testing, the null hypothesis usually states no difference •  However, nothing in nature is exactly the same •  In most cases we just don’t have enough power to see the difference •  Some differences are more important than others
  • 42. Ridgway, Gerard (2013): Illustrative effect sizes for sex differences. figshare. http://dx.doi.org/10.6084/m9.figshare.866802
  • 43. Is more power bad? •  No – an “overpowered study” is an oxymoron •  But we will need to revise our methods •  Incorporate our assumption of what counts as a trivial effect size into our analyses –  Either through a Bayesian framework or different null hypotheses •  Start looking at effect and confidence interval maps instead of just thresholded p-maps
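One simple way to picture the "different null hypotheses" idea is to report an effect estimate with its confidence interval and ask whether the whole interval lies beyond a chosen trivial effect size. The sketch below is one possible reading of that idea, not the method proposed in the talk. It handles a single location (for example one voxel) across subjects, uses a normal approximation for the interval, and the threshold `trivial_effect` is a value the analyst has to supply.

```python
from statistics import NormalDist, mean, stdev


def effect_with_ci(values, confidence=0.95):
    """Return (estimate, lower, upper) for the mean of values.

    Uses a normal approximation; for small samples a t-based interval
    would be wider.
    """
    n = len(values)
    estimate = mean(values)
    standard_error = stdev(values) / n ** 0.5
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    return estimate, estimate - z * standard_error, estimate + z * standard_error


def exceeds_trivial(values, trivial_effect, confidence=0.95):
    """True if the whole interval lies outside [-trivial_effect, +trivial_effect]."""
    _, lower, upper = effect_with_ci(values, confidence)
    return lower > trivial_effect or upper < -trivial_effect
```

Applied to every voxel, the three numbers from `effect_with_ci` would give an effect map plus lower and upper bound maps, which can be inspected alongside or instead of a thresholded p-map.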
  • 44. Vibration effect •  While analyzing an MRI dataset we face a plethora of choices •  For some choices there is no clearly good or bad option •  The vibration effect is the ratio of the largest to the smallest effect size across all processing options Ioannidis, J. P. A. (2008). Why most discovered true associations are inflated.
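The ratio described on this slide is easy to compute once the same effect has been estimated under several processing options. The sketch below is a minimal illustration; the pipeline names and effect sizes are made up, and it assumes all effect sizes are positive so that the ratio is meaningful.

```python
def vibration_ratio(effect_sizes):
    """Ratio of the largest to the smallest effect size across pipelines.

    effect_sizes maps a pipeline label to the effect size it produced.
    """
    values = list(effect_sizes.values())
    if not values:
        raise ValueError("need at least one pipeline")
    smallest, largest = min(values), max(values)
    if smallest <= 0:
        raise ValueError("ratio is only meaningful for positive effect sizes")
    return largest / smallest


if __name__ == "__main__":
    # Hypothetical effect sizes for one contrast under three pipelines.
    example = {
        "smoothing_4mm": 0.2,
        "smoothing_8mm": 0.5,
        "no_motion_regressors": 0.4,
    }
    print(vibration_ratio(example))
```

A ratio close to 1 means the result barely depends on the analysis choices; a large ratio means the reported effect could be mostly a product of which pipeline was picked.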
  • 45. Vibration effect •  Carp J (2012) On the plurality of (methodological) worlds: estimating the analytic flexibility of fMRI experiments. Front. Neurosci. 6:149. doi: 10.3389/fnins.2012.00149
  • 47. Vibration effect •  We will finally see how much (or little) our analyses replicate over different datasets and methods
  • 48. Take home message(s) I.  Take advantage of shared data II.  Share your statistical maps at NeuroVault.org III.  Share your data and publish a data paper IV.  Expect changes in the way we analyze our data
  • 49. Acknowledgements (my personal giants) Pierre-Louie Bazin, Haakon Engen, Satrajit Ghosh, Russell A. Poldrack, Jean-Baptiste Poline, Yannick Schwarz, Tal Yarkoni, Michael Milham, Daniel Margulies, Benjamin Baird, Jonathan Smallwood, Yannick Schwarz, Florence J.M. Ruby, Melaina Vinski, Camille Maumet, Dan Lurie, Sebastian Urchs, Judy A. Kipping, R. Cameron Craddock, MPI CBS Resting state group
  • 51. “I swear I’ve heard it before” •  In the past there were many attempts to promote data sharing –  For example, fMRIDC: •  technical issues •  …and the amount of time it took to prepare data for submission (a week, a very frustrating week) •  fMRIDC was, however, very ambitious for its time: –  They wanted to collect raw data and all metadata required to reproduce the analysis Van Horn & Gazzaniga (2013). Why share data? Lessons learned from the fMRIDC. NeuroImage