SlideShare a Scribd company logo
Rafael Jimenez
rafael@ebi.ac.uk
TavernaWorkflow Management System
Proteomics Bioinformatics
Wellcome Trust Genome Campus
Hinxton, Cambridge, UK
15-19 July 2011
Exercises
• Search workflows using PICR in MyExperiment
• Look for workflows prepared for this workshop
• Open a workflow from MyExperiment and run it in
Taverna
• Try more workflows from MyExperiment
• Build a workflow of nested workflows
• Running a workflow several times
• Running workflows from the command line
EXERCISE 1
Search workflows using PICR in MyExperiment
• Let’s find workflows using PICR (The Protein Identifier Cross-
Reference Service)
– Open Taverna
– Click on the “myExperiment” button
– Click on “Search” tab
– Search for “EBI PICR”
1. How many workflows were built using Taverna 2?
2. Which workflow includes other nested workflows?
3. Which workflow could you use to find UniProt cross-references for
an Ensembl protein accession?
EXERCISE 2
Look for workflows prepared for this workshop
• We grouped some interested workflows for this workshop in a
“pack”. Let’s have a look.
– Open Taverna
– Click on the “myExperiment” button
– Click on “Search” tab
– Just tick the “packs” checkbox
– Search for “proteomics bioinformatics”
– Click on the “Preview” button to see the selected workflows
available in the “Proteomics Bioinformatics Workshop 2011”
pack
– Click on one workflow to see more information
1. How many workflows are in the pack about DAS?
2. What is DAS?
EXERCISE 3
Open a workflow from MyExperiment and run it in Taverna
• Let’s open and run one of these workflows in Taverna. For instance
let’s find Ensembl and SwissProt cross-references for the protein
accession “TGFR2_HUMAN”
– In the “Proteomics Bioinformatics Workshop 2011” pack find a
click on the “EBI PICR” workflow
– Read the description to get familiar with the workflow
– Click on the “Open” button
– Have a look to the graphical representation of the workflow
• What it is important now is to recognize there are 4 inputs for this workflow
(protein accession, active accessions, taxonomy identification and
databases) and 1 output (a table with mapping results)
– In the menu click on “File” and “Run workflow”
• Set a value for each input (Read the content of “Port description” and
“Example value” for some advice). For the “searchDatabases” input you will
have to set two values:
– onlyActive: true
– Protein_accessions: TGFR2_HUMAN
– searchDatabases: SWISSPROT and ENSEMBL
– taxonId: 9606
• Click on the “Run workflow” button
– Run the workflow and check the results in the “mapping_table”
tab selecting “Value 1”
EXERCISE 4
Try more workflows from MyExperiment
• Following “Exercise 3” open and run other workflows from the
“Proteomics Bioinformatics Workshop 2011” pack that could be
interesting for you.
EXERCISE 5
Build a workflow of nested workflows
Reactome & Biomodels
• Let’s put together two workflows. For instance let’s find Biological
Models from Biomodels and Pathways from Reactome using an
Gene Ontology term as a query.
– Create a new workflow (Menu -> File/New workflow)
– From the list of workflows, available in the “Proteomics
Bioinformatics Workshop 2011” pack, click on “Find Biological
Models by GO”.
• Click on the “Import” button (as a nested worflow)
– From the list of workflows click on “Find Reactome pathways
and reactions by GO”.
• Click on the “Import” button (as a nested worflow)
– Click on the “Design” tab to see the nested workflows
– Create a new input
• In the menu: Insert/Input Workflow port
– Name: goAccession
– Type: Single value
– Click on the input box and drag towards “GoId” (Biomodels
input) and let go. An arrow will connect the two boxes.
– Click on the input box and drag towards “Go_accession”
(Reactome input) and let go. An arrow will connect the two
– Create a new output to store Biomodels names
• In the menu: Insert/Input Workflow port
– Name: biomodels
– Create a new output to store Reaction names
• In the menu: Insert/Input Workflow port
– Name: reactomeReactions
– Create a new output to store Pathway names
• In the menu: Insert/Input Workflow port
– Name: reactomePathways
– Connect biomodels with MoldelsNames
– Connect reactomeReactions with reaction_name
– Connect reactomePathways with pathway_name
– In the menu click on “File” and “Run workflow”
– Try the workflow using a GO accession as input: “0006915”
– Run the workflow and check the results (values for biomodels,
reactomePathways and reactomeReactions)
1. How many models were found?
2. How many reactions?
3. How many pathways?
EXERCISE 6
Running a workflow several times
BLASTing multiple protein sequences
• We might find a workflow we would like to run several times. This
example will show you how to run Blast using a list of sequences.
– Find a workflow to make Blast searches
• Create a new workflow (Menu -> File/New workflow)
• Click on the “myExperiment” button
• Click on “Search” tab
• Search for “EBI NCBI BLAST”
• Find the workflow from “Katy Wolstencroft” for “Taverna2”
• Click on the “Import” button (as a nested worflow)
– Find a workflow to convert an input with multiple FASTA
sequences in an list of FASTA sequences
• Click on the “myExperiment” button
• Click on “Search” tab
• Search for “split fasta sequence”
• Find the “Fasta_string_to_fasta_list” workflow from “Hamish McWilliam” for
“Taverna1”
• Click on the “Import” button (as a nested worflow)
– Connect the “fasta_list” output with the “sequence” input
• Click on the “fasta_list” output box and drag towards the “sequence” input
box and let go. An arrow will connect the two boxes.
– Create a new input
• In the menu: Insert/Input Workflow port
– Name: email
– Type: Single value
– Click on the new “email” input box and drag towards “email”
(EBI_NCBI_BLAST input) and let go. An arrow will connect the
two boxes.
– Create a new output for each “EBI_NCBI_BLAST” output
• In the menu: Insert/Input Workflow port
– Name: chose a name
– Type: Single value
– In the menu click on “File” and “Run workflow”
– Try the workflow using your mail as input and a couple of protein
sequences in FASTA format
– Example of two sequences in FASTA format:
>protein1
CIMKEKKKPGMCSYSETFFMCSCSSDECNDNIIFSEEYNTSNPDLLLVIFQVTGISLLPPSVIIIFYCYRVNRQQKLSSTWETG
KTRKLMEFSEHCAMCSYSIILEDDRSDISSTCANNI
>protein2
SKANGEANRLLTNARITDSSIWSPQPGQHISIQMCSYSTYRELNPAPTSSPTSTRTEIQLNGENSRSTADLPMIHM
– Run the workflow and check the results
– If you want to make a blast search for more than 10 sequences I would
recommend you to run the workflow using the command line tool.
EXERCISE 7
Running workflows from the command line
BLASTing multiple protein sequences
Sometimes we might have memory problems Taverna. For instance if
we send many blast queries Taverna will straggle to get all the results
in memory. One way to avoid this problem is to run the workflow we
have created through the command line and store the results in files.
Following the Blast exercise if you want to make a blast search for
more than 10 sequences I would recommend you to run the workflow
using the command line.
•Copy your workflow and data in the Taverna folder
– Copy your workflow in the Taverna folder
• C:Program FilesTaverna Workbench 2.3.0
– Create a file (yourfilename.txt) with your FASTA sequences.
– Save the file in the Taverna folder
• C:Program FilesTaverna Workbench 2.3.0
• Run the workflow from the command line
– Open the command line
• Start / Run
• In the open field type “cmd” and click on “OK”
– Get into the Taverna directory
• If the last line you do not see “C:>” type “c:” and click Enter.
• Get into the Taverna folder
– Type “cd C:Program FilesTaverna Workbench 2.3.0”
– Run the workflow
• Type “executeworkflow.bat -embedded -inputvalue email your@email.com
-inputfile yourfilename.txt EBI_NCBI_BLAST-multifasta.t2flow
– More docuemtnation about how to use the command line tool ...
http://www.taverna.org.uk/documentation/taverna-2-x/command-line-tool/2-3/
ProteomicsServicesTeam
Acknowledgements & thanks
All the myGrid team
Specially to …Katy Wolstencroft, Shoaib
Sufi, Peter Li, Eric Nzuobontane

More Related Content

Similar to Taverna tutorial

BRiffle DQ - Import and open samples
BRiffle DQ - Import and open samplesBRiffle DQ - Import and open samples
BRiffle DQ - Import and open samples
flexriver
 
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Rafael C. Jimenez
 
Less02 2 e_testermodule_1
Less02 2 e_testermodule_1Less02 2 e_testermodule_1
Less02 2 e_testermodule_1
Suresh Mishra
 
Hands on workshop with the CLARIAH Media Suite: introduction to television hi...
Hands on workshop with the CLARIAH Media Suite: introduction to television hi...Hands on workshop with the CLARIAH Media Suite: introduction to television hi...
Hands on workshop with the CLARIAH Media Suite: introduction to television hi...
clariah_mediasuite
 
IMPACT/myGrid Hackathon - Introduction to Taverna
IMPACT/myGrid Hackathon - Introduction to TavernaIMPACT/myGrid Hackathon - Introduction to Taverna
IMPACT/myGrid Hackathon - Introduction to Taverna
IMPACT Centre of Competence
 
POST/CON 2019 Workshop: Experts
POST/CON 2019 Workshop: ExpertsPOST/CON 2019 Workshop: Experts
POST/CON 2019 Workshop: Experts
Postman
 
Introduction to STATA(2).pdf
Introduction to STATA(2).pdfIntroduction to STATA(2).pdf
Introduction to STATA(2).pdf
Yomif3
 
ACD
ACDACD
Getting Started with Selenium
Getting Started with SeleniumGetting Started with Selenium
Getting Started with Selenium
Dave Haeffner
 
Automated Testing Environment by Bugzilla, Testopia and Jenkins
Automated Testing Environment by Bugzilla, Testopia and JenkinsAutomated Testing Environment by Bugzilla, Testopia and Jenkins
Automated Testing Environment by Bugzilla, Testopia and Jenkins
walkerchang
 
Automated testing with Cypress
Automated testing with CypressAutomated testing with Cypress
Automated testing with Cypress
Yong Shean Chong
 
2014 Taverna tutorial Advanced Taverna
2014 Taverna tutorial Advanced Taverna2014 Taverna tutorial Advanced Taverna
2014 Taverna tutorial Advanced Taverna
myGrid team
 
Bug zillatestopiajenkins
Bug zillatestopiajenkinsBug zillatestopiajenkins
Bug zillatestopiajenkins
Samira Kumar Nanda
 
MPI Sessions: a proposal to the MPI Forum
MPI Sessions: a proposal to the MPI ForumMPI Sessions: a proposal to the MPI Forum
MPI Sessions: a proposal to the MPI Forum
Jeff Squyres
 
Technology Enabled Research Handout
Technology Enabled Research HandoutTechnology Enabled Research Handout
Technology Enabled Research Handout
Alex Hardman
 
Efficient Automated Test Creation With Selenium IDE Plugins
Efficient Automated Test Creation With Selenium IDE PluginsEfficient Automated Test Creation With Selenium IDE Plugins
Efficient Automated Test Creation With Selenium IDE Plugins
Samit Badle
 
700 Tons of Code Later
700 Tons of Code Later700 Tons of Code Later
700 Tons of Code Later
Alexander Shopov
 
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 7, 8 - Pr...
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 7, 8 - Pr...ICPSR - Complex Systems Models in the Social Sciences - Lab Session 7, 8 - Pr...
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 7, 8 - Pr...
Daniel Katz
 
What is UFT? HP's unified functional testing.
What is UFT? HP's unified functional testing.What is UFT? HP's unified functional testing.
What is UFT? HP's unified functional testing.
Confiz
 
The Scientific Filesystem
The Scientific FilesystemThe Scientific Filesystem
The Scientific Filesystem
Vanessa S
 

Similar to Taverna tutorial (20)

BRiffle DQ - Import and open samples
BRiffle DQ - Import and open samplesBRiffle DQ - Import and open samples
BRiffle DQ - Import and open samples
 
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
Non technical introduction to Web Services & Workflows. Taverna, Biocatalogue...
 
Less02 2 e_testermodule_1
Less02 2 e_testermodule_1Less02 2 e_testermodule_1
Less02 2 e_testermodule_1
 
Hands on workshop with the CLARIAH Media Suite: introduction to television hi...
Hands on workshop with the CLARIAH Media Suite: introduction to television hi...Hands on workshop with the CLARIAH Media Suite: introduction to television hi...
Hands on workshop with the CLARIAH Media Suite: introduction to television hi...
 
IMPACT/myGrid Hackathon - Introduction to Taverna
IMPACT/myGrid Hackathon - Introduction to TavernaIMPACT/myGrid Hackathon - Introduction to Taverna
IMPACT/myGrid Hackathon - Introduction to Taverna
 
POST/CON 2019 Workshop: Experts
POST/CON 2019 Workshop: ExpertsPOST/CON 2019 Workshop: Experts
POST/CON 2019 Workshop: Experts
 
Introduction to STATA(2).pdf
Introduction to STATA(2).pdfIntroduction to STATA(2).pdf
Introduction to STATA(2).pdf
 
ACD
ACDACD
ACD
 
Getting Started with Selenium
Getting Started with SeleniumGetting Started with Selenium
Getting Started with Selenium
 
Automated Testing Environment by Bugzilla, Testopia and Jenkins
Automated Testing Environment by Bugzilla, Testopia and JenkinsAutomated Testing Environment by Bugzilla, Testopia and Jenkins
Automated Testing Environment by Bugzilla, Testopia and Jenkins
 
Automated testing with Cypress
Automated testing with CypressAutomated testing with Cypress
Automated testing with Cypress
 
2014 Taverna tutorial Advanced Taverna
2014 Taverna tutorial Advanced Taverna2014 Taverna tutorial Advanced Taverna
2014 Taverna tutorial Advanced Taverna
 
Bug zillatestopiajenkins
Bug zillatestopiajenkinsBug zillatestopiajenkins
Bug zillatestopiajenkins
 
MPI Sessions: a proposal to the MPI Forum
MPI Sessions: a proposal to the MPI ForumMPI Sessions: a proposal to the MPI Forum
MPI Sessions: a proposal to the MPI Forum
 
Technology Enabled Research Handout
Technology Enabled Research HandoutTechnology Enabled Research Handout
Technology Enabled Research Handout
 
Efficient Automated Test Creation With Selenium IDE Plugins
Efficient Automated Test Creation With Selenium IDE PluginsEfficient Automated Test Creation With Selenium IDE Plugins
Efficient Automated Test Creation With Selenium IDE Plugins
 
700 Tons of Code Later
700 Tons of Code Later700 Tons of Code Later
700 Tons of Code Later
 
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 7, 8 - Pr...
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 7, 8 - Pr...ICPSR - Complex Systems Models in the Social Sciences - Lab Session 7, 8 - Pr...
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 7, 8 - Pr...
 
What is UFT? HP's unified functional testing.
What is UFT? HP's unified functional testing.What is UFT? HP's unified functional testing.
What is UFT? HP's unified functional testing.
 
The Scientific Filesystem
The Scientific FilesystemThe Scientific Filesystem
The Scientific Filesystem
 

More from Rafael C. Jimenez

BMB Resource Integration Workshop
BMB Resource Integration WorkshopBMB Resource Integration Workshop
BMB Resource Integration Workshop
Rafael C. Jimenez
 
ELIXIR
ELIXIRELIXIR
Proteomics repositories integration using EUDAT resources
Proteomics repositories integration using EUDAT resourcesProteomics repositories integration using EUDAT resources
Proteomics repositories integration using EUDAT resources
Rafael C. Jimenez
 
ELIXIR
ELIXIRELIXIR
Summary of Technical Coordinators discussions
Summary of Technical Coordinators discussionsSummary of Technical Coordinators discussions
Summary of Technical Coordinators discussions
Rafael C. Jimenez
 
ELIXIR
ELIXIRELIXIR
The European life-science data infrastructure: Data, Computing and Services ...
The European life-science data infrastructure: Data, Computing and Services ...The European life-science data infrastructure: Data, Computing and Services ...
The European life-science data infrastructure: Data, Computing and Services ...
Rafael C. Jimenez
 
Standardisation in BMS European infrastructures
Standardisation in BMS European infrastructuresStandardisation in BMS European infrastructures
Standardisation in BMS European infrastructures
Rafael C. Jimenez
 
ELIXIR
ELIXIRELIXIR
ELIXIR
ELIXIRELIXIR
Standards
StandardsStandards
ELIXIR TCG update
ELIXIR TCG updateELIXIR TCG update
ELIXIR TCG update
Rafael C. Jimenez
 
Life science requirements from e-infrastructure: initial results from a joint...
Life science requirements from e-infrastructure:initial results from a joint...Life science requirements from e-infrastructure:initial results from a joint...
Life science requirements from e-infrastructure: initial results from a joint...
Rafael C. Jimenez
 
Technical activities in ELIXIR Europe
Technical activities in ELIXIR EuropeTechnical activities in ELIXIR Europe
Technical activities in ELIXIR Europe
Rafael C. Jimenez
 
Challenges of big data. Summary day 1.
Challenges of big data. Summary day 1.Challenges of big data. Summary day 1.
Challenges of big data. Summary day 1.
Rafael C. Jimenez
 
Challenges of big data. Aims of the workshop.
Challenges of big data. Aims of the workshop.Challenges of big data. Aims of the workshop.
Challenges of big data. Aims of the workshop.
Rafael C. Jimenez
 
Data submissions and archiving raw data in life sciences. A pilot with Proteo...
Data submissions and archiving raw data in life sciences. A pilot with Proteo...Data submissions and archiving raw data in life sciences. A pilot with Proteo...
Data submissions and archiving raw data in life sciences. A pilot with Proteo...
Rafael C. Jimenez
 
ELIXIR and data grand challenges in life sciences
ELIXIR and data grand challenges in life sciencesELIXIR and data grand challenges in life sciences
ELIXIR and data grand challenges in life sciences
Rafael C. Jimenez
 
SASI, A lightweight standard for exchanging course information
SASI, A lightweight standard for exchanging course informationSASI, A lightweight standard for exchanging course information
SASI, A lightweight standard for exchanging course information
Rafael C. Jimenez
 
ELIXIR
ELIXIRELIXIR

More from Rafael C. Jimenez (20)

BMB Resource Integration Workshop
BMB Resource Integration WorkshopBMB Resource Integration Workshop
BMB Resource Integration Workshop
 
ELIXIR
ELIXIRELIXIR
ELIXIR
 
Proteomics repositories integration using EUDAT resources
Proteomics repositories integration using EUDAT resourcesProteomics repositories integration using EUDAT resources
Proteomics repositories integration using EUDAT resources
 
ELIXIR
ELIXIRELIXIR
ELIXIR
 
Summary of Technical Coordinators discussions
Summary of Technical Coordinators discussionsSummary of Technical Coordinators discussions
Summary of Technical Coordinators discussions
 
ELIXIR
ELIXIRELIXIR
ELIXIR
 
The European life-science data infrastructure: Data, Computing and Services ...
The European life-science data infrastructure: Data, Computing and Services ...The European life-science data infrastructure: Data, Computing and Services ...
The European life-science data infrastructure: Data, Computing and Services ...
 
Standardisation in BMS European infrastructures
Standardisation in BMS European infrastructuresStandardisation in BMS European infrastructures
Standardisation in BMS European infrastructures
 
ELIXIR
ELIXIRELIXIR
ELIXIR
 
ELIXIR
ELIXIRELIXIR
ELIXIR
 
Standards
StandardsStandards
Standards
 
ELIXIR TCG update
ELIXIR TCG updateELIXIR TCG update
ELIXIR TCG update
 
Life science requirements from e-infrastructure: initial results from a joint...
Life science requirements from e-infrastructure:initial results from a joint...Life science requirements from e-infrastructure:initial results from a joint...
Life science requirements from e-infrastructure: initial results from a joint...
 
Technical activities in ELIXIR Europe
Technical activities in ELIXIR EuropeTechnical activities in ELIXIR Europe
Technical activities in ELIXIR Europe
 
Challenges of big data. Summary day 1.
Challenges of big data. Summary day 1.Challenges of big data. Summary day 1.
Challenges of big data. Summary day 1.
 
Challenges of big data. Aims of the workshop.
Challenges of big data. Aims of the workshop.Challenges of big data. Aims of the workshop.
Challenges of big data. Aims of the workshop.
 
Data submissions and archiving raw data in life sciences. A pilot with Proteo...
Data submissions and archiving raw data in life sciences. A pilot with Proteo...Data submissions and archiving raw data in life sciences. A pilot with Proteo...
Data submissions and archiving raw data in life sciences. A pilot with Proteo...
 
ELIXIR and data grand challenges in life sciences
ELIXIR and data grand challenges in life sciencesELIXIR and data grand challenges in life sciences
ELIXIR and data grand challenges in life sciences
 
SASI, A lightweight standard for exchanging course information
SASI, A lightweight standard for exchanging course informationSASI, A lightweight standard for exchanging course information
SASI, A lightweight standard for exchanging course information
 
ELIXIR
ELIXIRELIXIR
ELIXIR
 

Recently uploaded

EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
Sérgio Sacani
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
University of Maribor
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
terusbelajar5
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
HongcNguyn6
 
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
AbdullaAlAsif1
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
PRIYANKA PATEL
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
Gokturk Mehmet Dilci
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
tonzsalvador2222
 
Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.
Aditi Bajpai
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills MN
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
RitabrataSarkar3
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
pablovgd
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
yqqaatn0
 
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptxBREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
RASHMI M G
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
Sharon Liu
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
TinyAnderson
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
by6843629
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
yqqaatn0
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
Sérgio Sacani
 

Recently uploaded (20)

EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
 
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
 
Chapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisisChapter 12 - climate change and the energy crisis
Chapter 12 - climate change and the energy crisis
 
Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
 
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptxBREEDING METHODS FOR DISEASE RESISTANCE.pptx
BREEDING METHODS FOR DISEASE RESISTANCE.pptx
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
 

Taverna tutorial

  • 1. Rafael Jimenez rafael@ebi.ac.uk TavernaWorkflow Management System Proteomics Bioinformatics Wellcome Trust Genome Campus Hinxton, Cambridge, UK 15-19 July 2011
  • 2. Exercises • Search workflows using PICR in MyExperiment • Look for workflows prepared for this workshop • Open a workflow from MyExperiment and run it in Taverna • Try more workflows from MyExperiment • Build a workflow of nested workflows • Running a workflow several times • Running workflows from the command line
  • 3. EXERCISE 1 Search workflows using PICR in MyExperiment
  • 4. • Let’s find workflows using PICR (The Protein Identifier Cross- Reference Service) – Open Taverna – Click on the “myExperiment” button – Click on “Search” tab – Search for “EBI PICR” 1. How many workflows were built using Taverna 2? 2. Which workflow includes other nested workflows? 3. Which workflow could you use to find UniProt cross-references for an Ensembl protein accession?
  • 5.
  • 6. EXERCISE 2 Look for workflows prepared for this workshop
  • 7. • We grouped some interested workflows for this workshop in a “pack”. Let’s have a look. – Open Taverna – Click on the “myExperiment” button – Click on “Search” tab – Just tick the “packs” checkbox – Search for “proteomics bioinformatics” – Click on the “Preview” button to see the selected workflows available in the “Proteomics Bioinformatics Workshop 2011” pack – Click on one workflow to see more information 1. How many workflows are in the pack about DAS? 2. What is DAS?
  • 8.
  • 9. EXERCISE 3 Open a workflow from MyExperiment and run it in Taverna
  • 10. • Let’s open and run one of these workflows in Taverna. For instance let’s find Ensembl and SwissProt cross-references for the protein accession “TGFR2_HUMAN” – In the “Proteomics Bioinformatics Workshop 2011” pack find a click on the “EBI PICR” workflow – Read the description to get familiar with the workflow – Click on the “Open” button – Have a look to the graphical representation of the workflow • What it is important now is to recognize there are 4 inputs for this workflow (protein accession, active accessions, taxonomy identification and databases) and 1 output (a table with mapping results) – In the menu click on “File” and “Run workflow” • Set a value for each input (Read the content of “Port description” and “Example value” for some advice). For the “searchDatabases” input you will have to set two values: – onlyActive: true – Protein_accessions: TGFR2_HUMAN – searchDatabases: SWISSPROT and ENSEMBL – taxonId: 9606 • Click on the “Run workflow” button – Run the workflow and check the results in the “mapping_table” tab selecting “Value 1”
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16. EXERCISE 4 Try more workflows from MyExperiment
  • 17. • Following “Exercise 3” open and run other workflows from the “Proteomics Bioinformatics Workshop 2011” pack that could be interesting for you.
  • 18. EXERCISE 5 Build a workflow of nested workflows Reactome & Biomodels
  • 19. • Let’s put together two workflows. For instance let’s find Biological Models from Biomodels and Pathways from Reactome using an Gene Ontology term as a query. – Create a new workflow (Menu -> File/New workflow) – From the list of workflows, available in the “Proteomics Bioinformatics Workshop 2011” pack, click on “Find Biological Models by GO”. • Click on the “Import” button (as a nested worflow) – From the list of workflows click on “Find Reactome pathways and reactions by GO”. • Click on the “Import” button (as a nested worflow) – Click on the “Design” tab to see the nested workflows – Create a new input • In the menu: Insert/Input Workflow port – Name: goAccession – Type: Single value – Click on the input box and drag towards “GoId” (Biomodels input) and let go. An arrow will connect the two boxes. – Click on the input box and drag towards “Go_accession” (Reactome input) and let go. An arrow will connect the two
  • 20. – Create a new output to store Biomodels names • In the menu: Insert/Input Workflow port – Name: biomodels – Create a new output to store Reaction names • In the menu: Insert/Input Workflow port – Name: reactomeReactions – Create a new output to store Pathway names • In the menu: Insert/Input Workflow port – Name: reactomePathways – Connect biomodels with MoldelsNames – Connect reactomeReactions with reaction_name – Connect reactomePathways with pathway_name – In the menu click on “File” and “Run workflow” – Try the workflow using a GO accession as input: “0006915” – Run the workflow and check the results (values for biomodels, reactomePathways and reactomeReactions) 1. How many models were found? 2. How many reactions? 3. How many pathways?
  • 21.
  • 22.
  • 23.
  • 24.
  • 25. EXERCISE 6 Running a workflow several times BLASTing multiple protein sequences
  • 26. • We might find a workflow we would like to run several times. This example will show you how to run Blast using a list of sequences. – Find a workflow to make Blast searches • Create a new workflow (Menu -> File/New workflow) • Click on the “myExperiment” button • Click on “Search” tab • Search for “EBI NCBI BLAST” • Find the workflow from “Katy Wolstencroft” for “Taverna2” • Click on the “Import” button (as a nested worflow) – Find a workflow to convert an input with multiple FASTA sequences in an list of FASTA sequences • Click on the “myExperiment” button • Click on “Search” tab • Search for “split fasta sequence” • Find the “Fasta_string_to_fasta_list” workflow from “Hamish McWilliam” for “Taverna1” • Click on the “Import” button (as a nested worflow) – Connect the “fasta_list” output with the “sequence” input • Click on the “fasta_list” output box and drag towards the “sequence” input box and let go. An arrow will connect the two boxes.
  • 27. – Create a new input • In the menu: Insert/Input Workflow port – Name: email – Type: Single value – Click on the new “email” input box and drag towards “email” (EBI_NCBI_BLAST input) and let go. An arrow will connect the two boxes. – Create a new output for each “EBI_NCBI_BLAST” output • In the menu: Insert/Input Workflow port – Name: chose a name – Type: Single value – In the menu click on “File” and “Run workflow” – Try the workflow using your mail as input and a couple of protein sequences in FASTA format – Example of two sequences in FASTA format: >protein1 CIMKEKKKPGMCSYSETFFMCSCSSDECNDNIIFSEEYNTSNPDLLLVIFQVTGISLLPPSVIIIFYCYRVNRQQKLSSTWETG KTRKLMEFSEHCAMCSYSIILEDDRSDISSTCANNI >protein2 SKANGEANRLLTNARITDSSIWSPQPGQHISIQMCSYSTYRELNPAPTSSPTSTRTEIQLNGENSRSTADLPMIHM – Run the workflow and check the results – If you want to make a blast search for more than 10 sequences I would recommend you to run the workflow using the command line tool.
  • 28.
  • 29. EXERCISE 7 Running workflows from the command line BLASTing multiple protein sequences
  • 30. Sometimes we might have memory problems Taverna. For instance if we send many blast queries Taverna will straggle to get all the results in memory. One way to avoid this problem is to run the workflow we have created through the command line and store the results in files. Following the Blast exercise if you want to make a blast search for more than 10 sequences I would recommend you to run the workflow using the command line. •Copy your workflow and data in the Taverna folder – Copy your workflow in the Taverna folder • C:Program FilesTaverna Workbench 2.3.0 – Create a file (yourfilename.txt) with your FASTA sequences. – Save the file in the Taverna folder • C:Program FilesTaverna Workbench 2.3.0
  • 31. • Run the workflow from the command line – Open the command line • Start / Run • In the open field type “cmd” and click on “OK” – Get into the Taverna directory • If the last line you do not see “C:>” type “c:” and click Enter. • Get into the Taverna folder – Type “cd C:Program FilesTaverna Workbench 2.3.0” – Run the workflow • Type “executeworkflow.bat -embedded -inputvalue email your@email.com -inputfile yourfilename.txt EBI_NCBI_BLAST-multifasta.t2flow – More docuemtnation about how to use the command line tool ... http://www.taverna.org.uk/documentation/taverna-2-x/command-line-tool/2-3/
  • 32. ProteomicsServicesTeam Acknowledgements & thanks All the myGrid team Specially to …Katy Wolstencroft, Shoaib Sufi, Peter Li, Eric Nzuobontane