SlideShare a Scribd company logo
METASPACE training course
OurCon’16, 17.10.2016
Theodore Alexandrov (EMBL/UCSD/SCiLS)
Andy Palmer (EMBL)
Vitaly Kovalev (EMBL)
Artem Tarasov (EMBL)
@METASPACE2020
Welcome everyone!
Artem
Tarasov
“hacker”
Andy
Palmer
“scientist”
Vitaly
Kovalev
“developer”
Theodore
Alexandrov
“leader”
Part 1: Theory
14:00-14:10 Welcome
14:10-14:15 Introduction into the METASPACE project
14:15-14:45 Metabolite annotation in HR imaging MS
14:45-15:00 Overview of the annotation engine
Coffee Break 15:00-15:30
Part 2: Tutorial
15:30-16:30 Step-by-step analysis of datasets provided in
advance, questions
- Data requirements: 10 min
- Upload UI: 5 min
- Webapp UI: 15 min
- Interpretation: 15 min
- Split into 2 groups: 5 min
- Export to imzML, ideally parallel sessions: 15 min
- SCiLS, FTICR
- EMBL, Orbitrap
Coffee Break 16:35-17:00
Part 3: Hands-on training
17:00-18:00 questions, data analysis
Agenda
Introduction
Theodore Alexandrov (EMBL, UCSD, SCiLS)
What we hope you will learn today
● Ins and outs of metabolite annotation in HR imaging MS
● Bioinformatics we developed for this problem
○ Metabolite Signal Match (MSM) score
○ False Discovery Rate estimation
○ FDR-controlled annotation
● The online engine we implemented
○ How to prepare data for submission to our service
○ How to submit your data
○ How to view molecular annotations in our webapp
Project overview: slides on slideshare
Bioinformatics: slides on slideshare
Theodore Alexandrov (EMBL, UCSD, SCiLS)
Overview of the annotation engine
Vitaly Kovalev (EMBL)
Outline
● Inputs (data and metadata)
● Online Software
● Data Submission
● Annotation Browsing
● Use Cases
a. mouse brain, MALDI-FTICR (UoR1)
b. human colorectal tumor, DESI-Orbitrap (ICL)
Input
● Centroided imzML
(http://ms-imaging.org/)
Input
● Centroided imzML
(http://ms-imaging.org/)
● Dataset metadata
Online Software
annotations
database
browse
annotations
task scheduler
upload
Upload Web UI
● http://upload.metasp.eu
● Easy upload
Upload Web App
● http://upload.metasp.eu
● Easy upload
● Metadata collection
SM Web UI
● http://alpha.metasp.eu
● Annotation browsing
● Technical details
● Feedback form
Use Case 1
Mouse brain
(MALDI-FTICR)
Data provided by
Regis Lavigne, Charles Pineau,
University of Rennes 1
Data provided by
James McKenzie, Zoltan Takats,
Imperial College London
Use Case 2
Human colorectal tumor
(DESI-Orbitrap)
Tutorial
Andy Palmer (EMBL)
Artem Tarasov (EMBL)
Part 1: Theory
14:00-14:10 Welcome, Outline, Learning objectives
14:10-14:15 Introduction into the METASPACE project
14:15-14:45 Metabolite annotation in HR imaging MS
14:45-15:00 Overview of the annotation engine
Coffee Break 15:00-15:30
Part 2: Tutorial
15:30-16:30 Step-by-step analysis of datasets provided in
advance, questions
- Data requirements: 10 min
- Upload UI: 5 min
- Webapp UI: 15 min
- Interpretation: 15 min
- Split into 2 groups: 5 min
- Export to imzML, ideally parallel sessions: 15 min
- SCiLS, FTICR
- EMBL, Orbitrap
Coffee Break 16:35-17:00
Part 3: Hands-on training
17:00-18:00 questions, data analysis
Agenda
Learning Outcomes
1. Preparing data for submission
2. Submitting data
3. Browsing results
Data Requirements
Imaging mass spectrometry data
- Any ionisation source
- Any spatial resolution
- Any tissue
- One section per dataset
Data Requirements
High resolving power
RFWHM
(@400) > 90K
Well-calibrated
ideally < 3 ppm
Data Requirements
Data Format
- imzML
Centroided
- vendor preferred
- http://metasp.eu/imzml
http://imzml.org/wp/introduction/
Customised Processing
Processing is tailored to your data!
- Technical metadata
- Resolving power
- isotope prediction
- Polarity
- adducts
R200
=70K R200
=280K
[C41
H78
NO7
P+K]+
Data Requirements
Your responsibility:
- Data is processed ‘as is’
- Check metadata is correct
- Report resolving power accurately (check within data-set)
- Low numbers of annotations often correspond to poor quality mass spectra
- Calibration inaccuracy
- Lock-mass errors
Data Submission
1. Follow conversion instructions for your instrument
2. Select the centroided files, imzML and ibd
3. Click the Upload button.
The dataset will be copied to the cloud storage
(accessible only to our team)
Data upload
Metadata form
● Appears once the upload is started
● Please fill truthfully
○ Most fields have ‘Other…’ option
○ Don’t want to disclose → put ‘-‘
● Click (at the very bottom)
Browsing Results
Main web interface
http://alpha.metasp.eu
Results are public
Datasets are not
Annotation table
Sign in with a Google ID to provide feedback
Currently selected
molecule
(click to select)
MSM scoreprincipal peak m/z
Sorting/filtering annotations
Click on column headers to sortStart typing a formula or a metabolite name
Filter by database
or dataset Select an adduct Set minimum
MSM score
Enter m/z of interest
FDR color-coding
Green = annotated @ chosen FDR level
Red = not annotated @ chosen FDR
Details for highlighted annotation
molecule distribution
(sum of isotope images)
Putative metabolite IDs
from the database
Feedback!
Thumbs up: reasonable
Thumbs down: dubious
- tell us why it could be wrong!
Feedback is not public
Visual insight into MSM score assignment
Adduct
Exact m/z of
each ion image
Zoom plot
Ion images for each
isotope peak
Isotopic patterns
Blue: theoretical abundance
(at instrument resolving power)
Red: measured image intensity
Step-by-step search
Choose molecular formula database
Step-by-step search
Choose dataset
Step-by-step search
Type molecular class
Step-by-step search
Type single metabolite name Potassium adduct
Step-by-step search
Type single metabolite name Sodium adduct
Step-by-step search
Type single metabolite name Hydrogen adduct
Results Browsing Summary
1. Choose database
2. Choose data-set
3. Type ‘PC’
a. molecular class filter
4. Type ‘PC(16:0/18:0)
a. single metabolite filter
5. Select row of table
a. single ion filter
6. Simple comparison of spatial distributions
between adducts
Also possible
● Filter by m/z
● Formula search
● Comparison across datasets
Interpretation
FDR Controlled Annotation
False Discovery Rate - the fraction of incorrect annotations
Control - request a set of annotations at a fixed estimated FDR
Setting the level:
- Adjust the number of molecules for follow-up analysis
- When only limited numbers of molecules can be reviewed, adjust the
FDR so that fewer/great numbers of molecules are annotated
- Compare annotations between datasets
- A principled way of selecting molecules to compare between
datasets
True annotation
False discovery
MSM
score
FDR = 0.1
FDR = 0.2
FDR = nTrue
nFalse
+ nTrue
Choice of metabolite database
synthesized/recorded
88M CAS registry
biologically occurring/active
50M PubChem compounds
single biological system
40K HMDB
sample specific
1K LC-MS
Choice of metabolite database
Impacts search and False-Discovery-Rate estimation
● Use one that’s relevant
● Larger database
○ more false-hits --> fewer annotations at a fixed FDR
● Different databases give different annotations
○ even for molecules in both databases due to FDR control
○ for data-set comparison, use the same database
Annotating at level of molecular formula
● Possibility of multiple metabolites per sum formula
○ webapp shows all hits from the database search (learn the ambiguity!)
○ other databases can be searched (e.g. PubChem)
○ use enrichment analysis to get biological leads
● Use an orthogonal technique for reporting individual metabolites
○ not directly integrated (yet)
○ use web-app results help to target MS/MS studies (e.g. purchase of standards)
● we annotate molecular formula along with several putative metabolites
■ MSI Levels of classification:
1. identified metabolites
2. putatively annotated compounds
3. putatively characterised compound classes
4. unknown compounds
● In preparation: formal guidelines for reporting imaging mass spectrometry annotations
Guidelines for reporting
The role of reporting standards for metabolite annotation and identification in metabolomic studies,
Salek et al, 2013, gigascience
● Preparing data for submission
○ imzML export
○ metadata
● Submitting data
○ upload web-app
upload.metasp.eu
● Browsing results
○ results web-app
alpha.metasp.eu
Learning Summary
● METASPACE team:
○ web: metaspace2020.eu
○ email: contact@metaspace2020.eu
○ twitter: @metaspace2020
○ github: github.com/spatialmetabolomics
● FTICR data conversion
○ SCiLS: support@scils.de
● Orbitrap data conversion
○ Thermo Fisher Scientific:
kerstin.strupat@thermofisher.com
How to get help?
Export to imzML
(Group 1) Export into imzML: FT-ICR data
Using SCiLS Lab’s METASPACE export
Export to METASPACE
● Export your centroided high-resolution spectra in the imzML format
● Only available for “FT-ICR type” SCiLS Lab files in SCiLS Lab 2016b
● Best results in METASPACE if peak list is required for centroiding
● Two different Bruker data formats
○ SQLite peak list data: Peak list provided during import
○ FT-ICR profile data: Generate a peak list after import
Create imzML file for METASPACE
● In the objects tab, click the export symbol of
the region to be exported and select
“Export to METASPACE”
● The Export Spectra dialog opens
● Set your normalization of choice
● Select your peak list of choice
for example “Imported Peaks” in case of
SQLite
● Provide your scan polarity
● Click OK to save imzML file
SQLite peak list data
● Data must have been acquired with
on-the-fly centroid detection
i.e. there is a file called ‘peaks.sqlite’ within the .d
folder of the data-set
● In SCiLS Lab a peak list “Imported peaks” is
available, selecting most frequent peaks
By default all peaks appearing more frequently
than 1% of spectra
FT-ICR profile data
● Older Solarix Files do not directly contain a peak
list to perform centroiding
● Create peak list with Data Analysis
SCiLS Lab Help Section 7.4
● Use METASPACE tool for peak finding
https://spatialmetabolomics.github.io/centroidize/
● Use other external tools (mMass, …)
● Import the external peak list into SCiLS Lab
File > Import > m/z intervals from CSV or Clipboard
Use METASPACE tool for peak finding
● Select the overview spectrum CSV exported from SCiLS
● Upload CSV file to METASPACE tool
● Copy values to clipboard
● Use File > Import > m/z intervals from CSV
Upload imzML files to METASPACE
● Go to http://upload.metasp.eu/
SCiLS Cloud: Exchange within the Scientific Community
1. SCiLS Lab: computational analysis
2. SCiLS Cloud: data & results can be
shared and viewed in web browser, e.g.,
○ MALDI imaging data,
○ Discriminative m/z markers,
○ Regions of interest, …
Comparison of mean spectra for ROIs m/z images of co-localized ions
Future Vision: SCiLS Cloud and METASPACE
SCiLS Lab
Statistical
analysis
METASPACE SCiLS Cloud
Upload data
and findings
Export data
to imzML
and upload
prospect: direct interface
(Group 2) Export into imzML: Orbitrap data (.raw)
Instructions: metaspace2020.eu/imzML
Software tools:
imageQuest / raw-converter
- Recommended for: MALDI images (Thermo MALDI- / TransMIT AP-S-MALDI-)
imzmlConverter
- Recommended for: DESI/flowProbe with separate files per row
Recommended for bioinformaticians: pyimzML (Python parser)
.raw -> imzML
● Commercial
○ Thermo
Scientific
imageQuest
Raw-imzml converter
.raw -> imzML
● Free
● http://ms-imaging.org/wp/raw-to-imzm
l-converter/
.raw -> mzML -> imzML
● MSConvert
○ free (link)
● imzMLConverter
○ free
○ requires registration
○ http://www.cs.bham.ac.uk/~ibs/imzMLC
onverter/>
imzmlConverter
This project has received funding from the European Union’s Horizon 2020 research and
innovation programme under grant agreement № 634402.
Acknowledgments
Example data was provided by:
University of Rennes 1
Regis Lavigne
Charles Pineau
EMBL
Ksenija Radic
Alexandra Koumoutsi
Andrew Palmer
EMBL
Theodore Alexandrov
Vitaly Kovalev
Artem Tarasov
Andrew Palmer
Dominik Fay
SCiLS
Dennis Trede
Jan Hendrik Kobarg

More Related Content

Similar to ARCHIVED: new version available. 2016 - METASPACE Training Course

2017 - METASPACE training guide
2017 - METASPACE training guide2017 - METASPACE training guide
2017 - METASPACE training guide
METASPACE
 
ProFET - Protein Feature Engineering Toolki
ProFET - Protein Feature Engineering ToolkiProFET - Protein Feature Engineering Toolki
ProFET - Protein Feature Engineering Toolki
Dan Ofer
 
Integrative information management for systems biology
Integrative information management for systems biologyIntegrative information management for systems biology
Integrative information management for systems biology
Neil Swainston
 
PPT
PPTPPT
PPT
butest
 
Informatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems BiologyInformatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems Biology
Neil Swainston
 
Open and Automated Machine Learning
Open and Automated Machine LearningOpen and Automated Machine Learning
Open and Automated Machine Learning
Joaquin Vanschoren
 
Folker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data AnnotationFolker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data Annotation
GigaScience, BGI Hong Kong
 
Pittcon06 Auto Chrom
Pittcon06 Auto ChromPittcon06 Auto Chrom
Pittcon06 Auto Chrom
niharaina
 
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)
r-kor
 
Standarization in Proteomics: From raw data to metadata files
Standarization in Proteomics: From raw data to metadata filesStandarization in Proteomics: From raw data to metadata files
Standarization in Proteomics: From raw data to metadata files
Yasset Perez-Riverol
 
Use of spark for proteomic scoring seattle presentation
Use of spark for  proteomic scoring   seattle presentationUse of spark for  proteomic scoring   seattle presentation
Use of spark for proteomic scoring seattle presentation
lordjoe
 
HPC Visualization
HPC VisualizationHPC Visualization
HPC Visualization
Michael Karpov
 
Data analytcis-first-steps
Data analytcis-first-stepsData analytcis-first-steps
Data analytcis-first-steps
Shesha R
 
Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)
Dmitry Grapov
 
Practical Large Scale Experiences with Spark 2.0 Machine Learning: Spark Summ...
Practical Large Scale Experiences with Spark 2.0 Machine Learning: Spark Summ...Practical Large Scale Experiences with Spark 2.0 Machine Learning: Spark Summ...
Practical Large Scale Experiences with Spark 2.0 Machine Learning: Spark Summ...
Spark Summit
 
Automating Machine Learning - Is it feasible?
Automating Machine Learning - Is it feasible?Automating Machine Learning - Is it feasible?
Automating Machine Learning - Is it feasible?
Manuel Martín
 
ifip2008albashiri.pdf
ifip2008albashiri.pdfifip2008albashiri.pdf
ifip2008albashiri.pdf
KamalAlbashiri
 
Fostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked DataFostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked Data
Muhammad Saleem
 
Identification of toxicants and metabolites
Identification of toxicants and metabolitesIdentification of toxicants and metabolites
Identification of toxicants and metabolites
Steffen Neumann
 
MPDB Presentation
MPDB PresentationMPDB Presentation
MPDB Presentation
Alexander Raskind
 

Similar to ARCHIVED: new version available. 2016 - METASPACE Training Course (20)

2017 - METASPACE training guide
2017 - METASPACE training guide2017 - METASPACE training guide
2017 - METASPACE training guide
 
ProFET - Protein Feature Engineering Toolki
ProFET - Protein Feature Engineering ToolkiProFET - Protein Feature Engineering Toolki
ProFET - Protein Feature Engineering Toolki
 
Integrative information management for systems biology
Integrative information management for systems biologyIntegrative information management for systems biology
Integrative information management for systems biology
 
PPT
PPTPPT
PPT
 
Informatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems BiologyInformatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems Biology
 
Open and Automated Machine Learning
Open and Automated Machine LearningOpen and Automated Machine Learning
Open and Automated Machine Learning
 
Folker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data AnnotationFolker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data Annotation
 
Pittcon06 Auto Chrom
Pittcon06 Auto ChromPittcon06 Auto Chrom
Pittcon06 Auto Chrom
 
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)
RUCK 2017 김성환 R 패키지 메타주성분분석(MetaPCA)
 
Standarization in Proteomics: From raw data to metadata files
Standarization in Proteomics: From raw data to metadata filesStandarization in Proteomics: From raw data to metadata files
Standarization in Proteomics: From raw data to metadata files
 
Use of spark for proteomic scoring seattle presentation
Use of spark for  proteomic scoring   seattle presentationUse of spark for  proteomic scoring   seattle presentation
Use of spark for proteomic scoring seattle presentation
 
HPC Visualization
HPC VisualizationHPC Visualization
HPC Visualization
 
Data analytcis-first-steps
Data analytcis-first-stepsData analytcis-first-steps
Data analytcis-first-steps
 
Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)
 
Practical Large Scale Experiences with Spark 2.0 Machine Learning: Spark Summ...
Practical Large Scale Experiences with Spark 2.0 Machine Learning: Spark Summ...Practical Large Scale Experiences with Spark 2.0 Machine Learning: Spark Summ...
Practical Large Scale Experiences with Spark 2.0 Machine Learning: Spark Summ...
 
Automating Machine Learning - Is it feasible?
Automating Machine Learning - Is it feasible?Automating Machine Learning - Is it feasible?
Automating Machine Learning - Is it feasible?
 
ifip2008albashiri.pdf
ifip2008albashiri.pdfifip2008albashiri.pdf
ifip2008albashiri.pdf
 
Fostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked DataFostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked Data
 
Identification of toxicants and metabolites
Identification of toxicants and metabolitesIdentification of toxicants and metabolites
Identification of toxicants and metabolites
 
MPDB Presentation
MPDB PresentationMPDB Presentation
MPDB Presentation
 

Recently uploaded

Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
University of Maribor
 
MICROBIAL INTERACTION PPT/ MICROBIAL INTERACTION AND THEIR TYPES // PLANT MIC...
MICROBIAL INTERACTION PPT/ MICROBIAL INTERACTION AND THEIR TYPES // PLANT MIC...MICROBIAL INTERACTION PPT/ MICROBIAL INTERACTION AND THEIR TYPES // PLANT MIC...
MICROBIAL INTERACTION PPT/ MICROBIAL INTERACTION AND THEIR TYPES // PLANT MIC...
ABHISHEK SONI NIMT INSTITUTE OF MEDICAL AND PARAMEDCIAL SCIENCES , GOVT PG COLLEGE NOIDA
 
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills MN
 
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
Sérgio Sacani
 
HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1
Shashank Shekhar Pandey
 
fermented food science of sauerkraut.pptx
fermented food science of sauerkraut.pptxfermented food science of sauerkraut.pptx
fermented food science of sauerkraut.pptx
ananya23nair
 
Farming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptxFarming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptx
Frédéric Baudron
 
23PH301 - Optics - Optical Lenses.pptx
23PH301 - Optics  -  Optical Lenses.pptx23PH301 - Optics  -  Optical Lenses.pptx
23PH301 - Optics - Optical Lenses.pptx
RDhivya6
 
Direct Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart AgricultureDirect Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart Agriculture
International Food Policy Research Institute- South Asia Office
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
by6843629
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
Leonel Morgado
 
Anti-Universe And Emergent Gravity and the Dark Universe
Anti-Universe And Emergent Gravity and the Dark UniverseAnti-Universe And Emergent Gravity and the Dark Universe
Anti-Universe And Emergent Gravity and the Dark Universe
Sérgio Sacani
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
PirithiRaju
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
University of Hertfordshire
 
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
Scintica Instrumentation
 
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
hozt8xgk
 
Tissue fluids_etiology_volume regulation_pressure.pptx
Tissue fluids_etiology_volume regulation_pressure.pptxTissue fluids_etiology_volume regulation_pressure.pptx
Tissue fluids_etiology_volume regulation_pressure.pptx
muralinath2
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
Sérgio Sacani
 
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
PsychoTech Services
 
Methods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdfMethods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdf
PirithiRaju
 

Recently uploaded (20)

Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
 
MICROBIAL INTERACTION PPT/ MICROBIAL INTERACTION AND THEIR TYPES // PLANT MIC...
MICROBIAL INTERACTION PPT/ MICROBIAL INTERACTION AND THEIR TYPES // PLANT MIC...MICROBIAL INTERACTION PPT/ MICROBIAL INTERACTION AND THEIR TYPES // PLANT MIC...
MICROBIAL INTERACTION PPT/ MICROBIAL INTERACTION AND THEIR TYPES // PLANT MIC...
 
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
 
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
Evidence of Jet Activity from the Secondary Black Hole in the OJ 287 Binary S...
 
HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1
 
fermented food science of sauerkraut.pptx
fermented food science of sauerkraut.pptxfermented food science of sauerkraut.pptx
fermented food science of sauerkraut.pptx
 
Farming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptxFarming systems analysis: what have we learnt?.pptx
Farming systems analysis: what have we learnt?.pptx
 
23PH301 - Optics - Optical Lenses.pptx
23PH301 - Optics  -  Optical Lenses.pptx23PH301 - Optics  -  Optical Lenses.pptx
23PH301 - Optics - Optical Lenses.pptx
 
Direct Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart AgricultureDirect Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart Agriculture
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
 
Anti-Universe And Emergent Gravity and the Dark Universe
Anti-Universe And Emergent Gravity and the Dark UniverseAnti-Universe And Emergent Gravity and the Dark Universe
Anti-Universe And Emergent Gravity and the Dark Universe
 
Gadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdfGadgets for management of stored product pests_Dr.UPR.pdf
Gadgets for management of stored product pests_Dr.UPR.pdf
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
 
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
 
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
 
Tissue fluids_etiology_volume regulation_pressure.pptx
Tissue fluids_etiology_volume regulation_pressure.pptxTissue fluids_etiology_volume regulation_pressure.pptx
Tissue fluids_etiology_volume regulation_pressure.pptx
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
 
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
 
Methods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdfMethods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdf
 

ARCHIVED: new version available. 2016 - METASPACE Training Course

  • 1. METASPACE training course OurCon’16, 17.10.2016 Theodore Alexandrov (EMBL/UCSD/SCiLS) Andy Palmer (EMBL) Vitaly Kovalev (EMBL) Artem Tarasov (EMBL) @METASPACE2020
  • 3. Part 1: Theory 14:00-14:10 Welcome 14:10-14:15 Introduction into the METASPACE project 14:15-14:45 Metabolite annotation in HR imaging MS 14:45-15:00 Overview of the annotation engine Coffee Break 15:00-15:30 Part 2: Tutorial 15:30-16:30 Step-by-step analysis of datasets provided in advance, questions - Data requirements: 10 min - Upload UI: 5 min - Webapp UI: 15 min - Interpretation: 15 min - Split into 2 groups: 5 min - Export to imzML, ideally parallel sessions: 15 min - SCiLS, FTICR - EMBL, Orbitrap Coffee Break 16:35-17:00 Part 3: Hands-on training 17:00-18:00 questions, data analysis Agenda
  • 5. What we hope you will learn today ● Ins and outs of metabolite annotation in HR imaging MS ● Bioinformatics we developed for this problem ○ Metabolite Signal Match (MSM) score ○ False Discovery Rate estimation ○ FDR-controlled annotation ● The online engine we implemented ○ How to prepare data for submission to our service ○ How to submit your data ○ How to view molecular annotations in our webapp
  • 6. Project overview: slides on slideshare Bioinformatics: slides on slideshare Theodore Alexandrov (EMBL, UCSD, SCiLS)
  • 7. Overview of the annotation engine Vitaly Kovalev (EMBL)
  • 8. Outline ● Inputs (data and metadata) ● Online Software ● Data Submission ● Annotation Browsing ● Use Cases a. mouse brain, MALDI-FTICR (UoR1) b. human colorectal tumor, DESI-Orbitrap (ICL)
  • 12. Upload Web UI ● http://upload.metasp.eu ● Easy upload
  • 13. Upload Web App ● http://upload.metasp.eu ● Easy upload ● Metadata collection
  • 14. SM Web UI ● http://alpha.metasp.eu ● Annotation browsing ● Technical details ● Feedback form
  • 15. Use Case 1 Mouse brain (MALDI-FTICR) Data provided by Regis Lavigne, Charles Pineau, University of Rennes 1
  • 16. Data provided by James McKenzie, Zoltan Takats, Imperial College London Use Case 2 Human colorectal tumor (DESI-Orbitrap)
  • 18. Part 1: Theory 14:00-14:10 Welcome, Outline, Learning objectives 14:10-14:15 Introduction into the METASPACE project 14:15-14:45 Metabolite annotation in HR imaging MS 14:45-15:00 Overview of the annotation engine Coffee Break 15:00-15:30 Part 2: Tutorial 15:30-16:30 Step-by-step analysis of datasets provided in advance, questions - Data requirements: 10 min - Upload UI: 5 min - Webapp UI: 15 min - Interpretation: 15 min - Split into 2 groups: 5 min - Export to imzML, ideally parallel sessions: 15 min - SCiLS, FTICR - EMBL, Orbitrap Coffee Break 16:35-17:00 Part 3: Hands-on training 17:00-18:00 questions, data analysis Agenda
  • 19. Learning Outcomes 1. Preparing data for submission 2. Submitting data 3. Browsing results
  • 20. Data Requirements Imaging mass spectrometry data - Any ionisation source - Any spatial resolution - Any tissue - One section per dataset
  • 21. Data Requirements High resolving power RFWHM (@400) > 90K Well-calibrated ideally < 3 ppm
  • 22. Data Requirements Data Format - imzML Centroided - vendor preferred - http://metasp.eu/imzml http://imzml.org/wp/introduction/
  • 23. Customised Processing Processing is tailored to your data! - Technical metadata - Resolving power - isotope prediction - Polarity - adducts R200 =70K R200 =280K [C41 H78 NO7 P+K]+
  • 24. Data Requirements Your responsibility: - Data is processed ‘as is’ - Check metadata is correct - Report resolving power accurately (check within data-set) - Low numbers of annotations often correspond to poor quality mass spectra - Calibration inaccuracy - Lock-mass errors
  • 26. 1. Follow conversion instructions for your instrument 2. Select the centroided files, imzML and ibd 3. Click the Upload button. The dataset will be copied to the cloud storage (accessible only to our team) Data upload
  • 27. Metadata form ● Appears once the upload is started ● Please fill truthfully ○ Most fields have ‘Other…’ option ○ Don’t want to disclose → put ‘-‘ ● Click (at the very bottom)
  • 30. Annotation table Sign in with a Google ID to provide feedback Currently selected molecule (click to select) MSM scoreprincipal peak m/z
  • 31. Sorting/filtering annotations Click on column headers to sortStart typing a formula or a metabolite name Filter by database or dataset Select an adduct Set minimum MSM score Enter m/z of interest
  • 32. FDR color-coding Green = annotated @ chosen FDR level Red = not annotated @ chosen FDR
  • 33. Details for highlighted annotation molecule distribution (sum of isotope images) Putative metabolite IDs from the database Feedback! Thumbs up: reasonable Thumbs down: dubious - tell us why it could be wrong! Feedback is not public
  • 34. Visual insight into MSM score assignment Adduct Exact m/z of each ion image Zoom plot Ion images for each isotope peak Isotopic patterns Blue: theoretical abundance (at instrument resolving power) Red: measured image intensity
  • 38. Step-by-step search Type single metabolite name Potassium adduct
  • 39. Step-by-step search Type single metabolite name Sodium adduct
  • 40. Step-by-step search Type single metabolite name Hydrogen adduct
  • 41. Results Browsing Summary 1. Choose database 2. Choose data-set 3. Type ‘PC’ a. molecular class filter 4. Type ‘PC(16:0/18:0) a. single metabolite filter 5. Select row of table a. single ion filter 6. Simple comparison of spatial distributions between adducts Also possible ● Filter by m/z ● Formula search ● Comparison across datasets
  • 43. FDR Controlled Annotation False Discovery Rate - the fraction of incorrect annotations Control - request a set of annotations at a fixed estimated FDR Setting the level: - Adjust the number of molecules for follow-up analysis - When only limited numbers of molecules can be reviewed, adjust the FDR so that fewer/great numbers of molecules are annotated - Compare annotations between datasets - A principled way of selecting molecules to compare between datasets True annotation False discovery MSM score FDR = 0.1 FDR = 0.2 FDR = nTrue nFalse + nTrue
  • 44. Choice of metabolite database synthesized/recorded 88M CAS registry biologically occurring/active 50M PubChem compounds single biological system 40K HMDB sample specific 1K LC-MS
  • 45. Choice of metabolite database Impacts search and False-Discovery-Rate estimation ● Use one that’s relevant ● Larger database ○ more false-hits --> fewer annotations at a fixed FDR ● Different databases give different annotations ○ even for molecules in both databases due to FDR control ○ for data-set comparison, use the same database
  • 46. Annotating at level of molecular formula ● Possibility of multiple metabolites per sum formula ○ webapp shows all hits from the database search (learn the ambiguity!) ○ other databases can be searched (e.g. PubChem) ○ use enrichment analysis to get biological leads ● Use an orthogonal technique for reporting individual metabolites ○ not directly integrated (yet) ○ use web-app results help to target MS/MS studies (e.g. purchase of standards)
  • 47. ● we annotate molecular formula along with several putative metabolites ■ MSI Levels of classification: 1. identified metabolites 2. putatively annotated compounds 3. putatively characterised compound classes 4. unknown compounds ● In preparation: formal guidelines for reporting imaging mass spectrometry annotations Guidelines for reporting The role of reporting standards for metabolite annotation and identification in metabolomic studies, Salek et al, 2013, gigascience
  • 48. ● Preparing data for submission ○ imzML export ○ metadata ● Submitting data ○ upload web-app upload.metasp.eu ● Browsing results ○ results web-app alpha.metasp.eu Learning Summary ● METASPACE team: ○ web: metaspace2020.eu ○ email: contact@metaspace2020.eu ○ twitter: @metaspace2020 ○ github: github.com/spatialmetabolomics ● FTICR data conversion ○ SCiLS: support@scils.de ● Orbitrap data conversion ○ Thermo Fisher Scientific: kerstin.strupat@thermofisher.com How to get help?
  • 50. (Group 1) Export into imzML: FT-ICR data Using SCiLS Lab’s METASPACE export
  • 51. Export to METASPACE ● Export your centroided high-resolution spectra in the imzML format ● Only available for “FT-ICR type” SCiLS Lab files in SCiLS Lab 2016b ● Best results in METASPACE if peak list is required for centroiding ● Two different Bruker data formats ○ SQLite peak list data: Peak list provided during import ○ FT-ICR profile data: Generate a peak list after import
  • 52. Create imzML file for METASPACE ● In the objects tab, click the export symbol of the region to be exported and select “Export to METASPACE” ● The Export Spectra dialog opens ● Set your normalization of choice ● Select your peak list of choice for example “Imported Peaks” in case of SQLite ● Provide your scan polarity ● Click OK to save imzML file
  • 53. SQLite peak list data ● Data must have been acquired with on-the-fly centroid detection i.e. there is a file called ‘peaks.sqlite’ within the .d folder of the data-set ● In SCiLS Lab a peak list “Imported peaks” is available, selecting most frequent peaks By default all peaks appearing more frequently than 1% of spectra
  • 54. FT-ICR profile data ● Older Solarix Files do not directly contain a peak list to perform centroiding ● Create peak list with Data Analysis SCiLS Lab Help Section 7.4 ● Use METASPACE tool for peak finding https://spatialmetabolomics.github.io/centroidize/ ● Use other external tools (mMass, …) ● Import the external peak list into SCiLS Lab File > Import > m/z intervals from CSV or Clipboard
  • 55. Use METASPACE tool for peak finding ● Select the overview spectrum CSV exported from SCiLS ● Upload CSV file to METASPACE tool ● Copy values to clipboard ● Use File > Import > m/z intervals from CSV
  • 56. Upload imzML files to METASPACE ● Go to http://upload.metasp.eu/
  • 57. SCiLS Cloud: Exchange within the Scientific Community 1. SCiLS Lab: computational analysis 2. SCiLS Cloud: data & results can be shared and viewed in web browser, e.g., ○ MALDI imaging data, ○ Discriminative m/z markers, ○ Regions of interest, … Comparison of mean spectra for ROIs m/z images of co-localized ions
  • 58. Future Vision: SCiLS Cloud and METASPACE SCiLS Lab Statistical analysis METASPACE SCiLS Cloud Upload data and findings Export data to imzML and upload prospect: direct interface
  • 59. (Group 2) Export into imzML: Orbitrap data (.raw) Instructions: metaspace2020.eu/imzML Software tools: imageQuest / raw-converter - Recommended for: MALDI images (Thermo MALDI- / TransMIT AP-S-MALDI-) imzmlConverter - Recommended for: DESI/flowProbe with separate files per row Recommended for bioinformaticians: pyimzML (Python parser)
  • 60. .raw -> imzML ● Commercial ○ Thermo Scientific imageQuest
  • 61. Raw-imzml converter .raw -> imzML ● Free ● http://ms-imaging.org/wp/raw-to-imzm l-converter/
  • 62. .raw -> mzML -> imzML ● MSConvert ○ free (link) ● imzMLConverter ○ free ○ requires registration ○ http://www.cs.bham.ac.uk/~ibs/imzMLC onverter/> imzmlConverter
  • 63. This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement № 634402. Acknowledgments Example data was provided by: University of Rennes 1 Regis Lavigne Charles Pineau EMBL Ksenija Radic Alexandra Koumoutsi Andrew Palmer EMBL Theodore Alexandrov Vitaly Kovalev Artem Tarasov Andrew Palmer Dominik Fay SCiLS Dennis Trede Jan Hendrik Kobarg