Shigehiro Kuraku
Unit Leader
Genome Resource & Analysis Unit, RIKEN CDB
http://www.cdb.riken.jp/gra/skuraku.html
The extended version of this presentation as well as its Japanese version
is available at SlideShare ( http://www.slideshare.net/cdb_gras/ )
aLeaves: web server (http://aleaves.cdb.riken.jp/aleaves/)
for handy phylogenetic analysis
Tutorial movies available
“Collecting amino acid sequences and
building a phylogenetic tree on the aLeaves
and MAFFT servers”
https://www.youtube.com/watch?v=0hpp-IqhpyQ
“Inferring a phylogenetic tree from a single amino acid sequence using aLeaves and MAFFT” (in Japanese)
https://www.youtube.com/watch?v=N9qPLRhHfIQ
Motivation for aLeaves development
Although various methods for molecular phylogenetic tree inference and
abundant sequence data from large-scale sequencing projects are available,
building a phylogenetic tree remains cumbersome for biologists working in labs.
Goal: launch an online tool that performs comprehensive sequence
searches across scattered large-scale resources and systematically
slims down the data using biologist-friendly cues.
Background
What is hidden paralogy? Example: zebrafish Emx3
Derobert et al., 2002 etc.
Morita et al., 1995
Reviewed in Kuraku, 2010. Integ. Comp. Biol.
How do you prepare a homolog set?
・Heuristic collection of homologs
・Exhaustive search of homologs
Using the BLAST server at NCBI
“Every BLAST search is an experiment” by
Scattered information hinders smooth work
・Ensembl
・NCBI Protein (annotated)
・Individual web sites of genome projects
・Your sequences
・NCBI RefSeq (annotated)
・Ensembl Metazoa
Dataset
Collaborators
Osamu Nishimura (GRAS, RIKEN CDB)
Kazutaka Katoh (CBRC, AIST & iFReC, Osaka Univ.)
Christian M. Zmasek (Sanford-Burnham Medical Research Institute, USA)
aLeaves – http://aleaves.cdb.riken.jp
・Enter a peptide query sequence
・A single search covers diverse species
・Outputs a multi-FASTA sequence file in several minutes
Taxonomic coverage (1)
Taxonomic coverage (2)
Downstream analysis on the MAFFT server
Systematic selection/deletion of sequences based on various criteria
・Sequence length filter
・Delete identical/similar sequences (CD-HIT)
・Delete sequences with large gaps (MaxAlign)
・Select only particular species
・Select/delete particular subgroups in a guide-tree
Managed by K. Katoh
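To make two of the criteria above concrete, here is a minimal Python sketch of a sequence length filter and of removing identical sequences from a multi-FASTA file. It is not the MAFFT server's implementation: exact-duplicate removal is a much simpler stand-in for CD-HIT, which clusters similar sequences, and the function names, headers, and length thresholds are hypothetical.

```python
from typing import Iterable, Iterator, List, Tuple

Record = Tuple[str, str]  # (header without ">", sequence)


def parse_fasta(text: str) -> Iterator[Record]:
    """Yield (header, sequence) pairs from multi-FASTA text."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def filter_by_length(records: Iterable[Record], min_len: int, max_len: int) -> List[Record]:
    """Keep sequences whose length lies within [min_len, max_len]."""
    return [(h, s) for h, s in records if min_len <= len(s) <= max_len]


def drop_exact_duplicates(records: Iterable[Record]) -> List[Record]:
    """Keep only the first record of each identical sequence (case-insensitive)."""
    seen = set()
    kept = []
    for h, s in records:
        key = s.upper()
        if key not in seen:
            seen.add(key)
            kept.append((h, s))
    return kept


# Hypothetical input; real aLeaves headers may look different.
example = """>seqA hypothetical header
MKTAYIAKQR
QISFVKSHFS
>seqB hypothetical header
MKTAYIAKQRQISFVKSHFS
>seqC hypothetical header
MKT
"""

records = list(parse_fasta(example))
filtered = drop_exact_duplicates(filter_by_length(records, 5, 100))
print([h for h, _ in filtered])
```

In this toy input, seqC is dropped by the length filter and seqB is dropped as an exact duplicate of seqA.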
Workflow using aLeaves-MAFFT
・Heuristic identification of homologs (in publications, etc.) → retrieval of a limited number of sequences
・Exhaustive collection of homologs (on the aLeaves server at CDB, RIKEN) → careful refinement of the data set by deleting unnecessary sequences (on the MAFFT server at CBRC, AIST)
・Phylogenetic tree inference
Warning
・aLeaves relies on sequence resources already made public in other
online databases and does not release original sequence information.
・The aLeaves project does not predict or validate the protein-coding sequences
available at other web sites; it simply adopts them for integrative searches.
・The aLeaves-MAFFT link lets you refine a sequence data set and run a
preliminary molecular phylogenetic analysis, but please download the
data set and perform more sophisticated analyses on your local system.
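As one possible starting point for the local analysis mentioned in the last point, the sketch below aligns a downloaded multi-FASTA file with a locally installed MAFFT. It assumes the `mafft` executable is on your PATH and uses its `--auto` option. The helper names and file names are hypothetical, and tree inference itself is left to whichever program you prefer.

```python
import shutil
import subprocess
from typing import List


def build_mafft_command(input_fasta: str) -> List[str]:
    """Build the command line; --auto lets MAFFT choose an alignment strategy."""
    return ["mafft", "--auto", input_fasta]


def align_locally(input_fasta: str, output_fasta: str) -> None:
    """Run MAFFT on input_fasta and write the alignment to output_fasta."""
    if shutil.which("mafft") is None:
        raise RuntimeError("mafft executable not found on PATH")
    # MAFFT writes the alignment to standard output, so redirect it to a file.
    with open(output_fasta, "w") as out:
        subprocess.run(build_mafft_command(input_fasta), stdout=out, check=True)


if __name__ == "__main__":
    # Hypothetical file names for a data set downloaded from the servers.
    print(" ".join(build_mafft_command("downloaded_dataset.fasta")))
```

Calling `align_locally("downloaded_dataset.fasta", "aligned.fasta")` would then produce an alignment file that can be passed to a tree inference program.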
Citing aLeaves
http://nar.oxfordjournals.org/content/41/W1/W22.long
