SlideShare a Scribd company logo
1 of 36
Download to read offline
Ensembl Plants:
Visualising, mining and analysing crop
genomics data
Dan Bolser
Ensembl Plants project leader
EMBL-EBI
http://plants.ensembl.org
#EnsemblGenomes
Visualising, mining and
analysing data:
● The Ensembl
genome browser
● BioMart
● Tools for processing
your own data
Overview
Background:
● Ensembl Plants
● History
● Data
● Recent updates
● Wheat
● Barley
EBI Ensembl is developed
jointly by the EBI and
the Wellcome Trust
Sanger Institute
Ensembl Plants uses Ensembl technology
Ensembl:
● A platform for genome browsing, annotation and analysis
developed jointly by the EBI and Wellcome Trust Sanger Institute.
● Has modules for handling:
● Genomic data, Variations, Comparative genomics, Gene prediction, ...
● Multiple points of access to data:
● Browser-based application, Perl and REST APIs, direct access
(MySQL), BioMart data mining tool, DAS (client and server), FTP.
● Upload your own data and compare it to the reference seq. and annotation.
Ensembl was originally developed for vertebrate genomes, subsequently
extended to non-vertebrate species:
● Ensembl Genomes → Ensembl Plants
Currently 33 genomes in
Ensembl Plants
http://plants.ensembl.org
Dicots in
Ensembl Plants
(10)
Brassicales
Fabales
Malpighiales
Rosales
Solanales
Vitales
Monocots in
Ensembl Plants
(12+5)
Poales
Zingiberales
'Others' (5)
Types of data in Ensembl (Ensembl Plants)
● Genomic sequence
● Gene, transcript, and protein annotations
● External references and ontology terms
● Mapped sequences: cDNAs, proteins,
probes, BACs, repeats, markers, ...
● Variation data:
● sequence variants
● structural variants
● Comparative data:
● gene trees, orthologues, paralogues
● whole genome alignments and synteny
Recent data updates
Wheat data in Ensembl Plants
● The chromosome survey sequence
from the International Wheat Genome
Sequencing Consortium.
● Version 2.1 of the IWGSC gene models called
on the chromosome survey sequence.
● Repeats
● Repbase
● The Triticeae Repeat Sequence Database
(TREP)
● Alignments
● RNA-seq from various studies in ENA
● ESTs and UniGene clusters
● 5x 454 Brenchley et al.
● Triticum turgidum cDNA assemblies
Wheat data in Ensembl Plants
● Whole genome alignments
● Between wheat(s) and:
● Rice
● Brachypodium
● Within wheat
● A vs. B
● A vs. D
● B vs. D
● Gene trees
● Aegilops tauschii
● Triticum urartu
● and other more
distant relatives
WGA between wheat, rice and brachy
WGA within wheat A, B and D sub-genomes
Gene trees
Gene trees
Walk through ‘demo’ for
Ensembl Plants
Search
Variant Effect Predictor (VEP)
● Predicts functional consequences of known and
unknown variants
● For substitutions, insertions, deletions and structural
variants
● Web interface (for up to 750 variants), standalone Perl
script, Perl API and REST API
Visualise your own data
Upload data:
● Data saved on server
● 5 MB limit
● Large file formats?
Attach remote files:
● URL-based
● HTTP or FTP
● No size limit
Upload formats:
● BED genes / features
● Gbrowse genes / features
● GFF/GTF genes / features
● PSL sequence alignments
● WIG continuous-valued data
● BedGraph continuous-valued data
● TrackHub collections of tracks
Attach formats:
● BigBed genes / features
● BAM sequence alignments
● BigWig continuous-valued data
● VCF variants
User added tracks:
● Can be saved or shared
● Only trivial security, do not use for sensitive data!
The barley Gene-ome
● Step 1 – Dataset
● Choose your dataset
and species
● Step 2 – Filters
● Limit your dataset
● Step 3 – Attributes
● Specify what
information you want
to output
● Step 4 – Results
● Preview and output
your results
Blast and
BioMart...
pkersey@ebi.ac.uk10/01/2014
Funding (Ensembl Plants)
• Ensembl Genomes Funded by
• EMBL
• EU (INFRAVEC, Microme, transPLANT, AllBio)
• BBSRC (PhytoPath, wheat, barley and midge sequencing,
UK-US collaboration, RNAcentral)
• Wellcome Trust (PomBase)
• NIH/NIAID (VectorBase)
• NSF (Gramene collaboration)
• Bill and Melinda Gates Foundation (wheat rust)
pkersey@ebi.ac.uk10/01/2014
People (Ensembl Plants)
• James Allen, Irina Armean, Dan Bolser, Mikkel
Christensen, Paul Davies, Christoph Grabmueller, Kevin
Howe, Malcolm Hinsley, Jay Humphrey, Arnaud
Kerhornou, Paul Kersey, Julia Khobdova, Eugene
Kulesha, Nick Langridge, Dan Lawson, Mark McDowall,
Uma Maheswari, Gareth Maslen, Michael Nuhn, Chuang
Kee Ong, Michael Paulini, Helder Pedro, Anton Petrov,
Dan Staines, Mary Ann Tuli, Brandon Walts, Gary
Williams
• If you have a question that is not answered here,
please Contact our HelpDesk:
• helpdesk@ensemblgenomes.org

More Related Content

What's hot

Genomic selection
Genomic  selectionGenomic  selection
Genomic selection
pandadebadatta
 

What's hot (20)

Introduction to NGS
Introduction to NGSIntroduction to NGS
Introduction to NGS
 
Proteome databases
Proteome databasesProteome databases
Proteome databases
 
Genomics(functional genomics)
Genomics(functional genomics)Genomics(functional genomics)
Genomics(functional genomics)
 
Molecular markers
Molecular markersMolecular markers
Molecular markers
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
RFLP ,RAPD ,AFLP, STS, SCAR ,SSCP & QTL
RFLP ,RAPD ,AFLP, STS, SCAR ,SSCP &  QTLRFLP ,RAPD ,AFLP, STS, SCAR ,SSCP &  QTL
RFLP ,RAPD ,AFLP, STS, SCAR ,SSCP & QTL
 
Genomic mapping, genetic mapping
Genomic mapping, genetic mappingGenomic mapping, genetic mapping
Genomic mapping, genetic mapping
 
Overview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data AnalysisOverview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data Analysis
 
Genotyping by sequencing
Genotyping by sequencingGenotyping by sequencing
Genotyping by sequencing
 
Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1
 
Expressed sequence tag (EST), molecular marker
Expressed sequence tag (EST), molecular markerExpressed sequence tag (EST), molecular marker
Expressed sequence tag (EST), molecular marker
 
COMPARATIVE GENOMICS.ppt
COMPARATIVE GENOMICS.pptCOMPARATIVE GENOMICS.ppt
COMPARATIVE GENOMICS.ppt
 
Genomic selection
Genomic  selectionGenomic  selection
Genomic selection
 
Molecular Markers
Molecular MarkersMolecular Markers
Molecular Markers
 
Whole Genome Sequencing Analysis
Whole Genome Sequencing AnalysisWhole Genome Sequencing Analysis
Whole Genome Sequencing Analysis
 
Dna sequencing methods
Dna sequencing methodsDna sequencing methods
Dna sequencing methods
 
QTL mapping for crop improvement
QTL mapping for crop improvementQTL mapping for crop improvement
QTL mapping for crop improvement
 
2 whole genome sequencing and analysis
2 whole genome sequencing and analysis2 whole genome sequencing and analysis
2 whole genome sequencing and analysis
 
Bioinformatics tools for NGS data analysis
Bioinformatics tools for NGS data analysisBioinformatics tools for NGS data analysis
Bioinformatics tools for NGS data analysis
 
Next Generation Sequencing of DNA
Next Generation Sequencing of DNANext Generation Sequencing of DNA
Next Generation Sequencing of DNA
 

Viewers also liked

Chuong 7 doi moi tu duy va cai cach the che
Chuong 7   doi moi tu duy va cai cach the cheChuong 7   doi moi tu duy va cai cach the che
Chuong 7 doi moi tu duy va cai cach the che
Le Thuy Hanh
 
Installation Instructions Tachometerwith Counter Drehzahlmessermit Zaehler
Installation Instructions Tachometerwith Counter Drehzahlmessermit ZaehlerInstallation Instructions Tachometerwith Counter Drehzahlmessermit Zaehler
Installation Instructions Tachometerwith Counter Drehzahlmessermit Zaehler
guestfe21f2
 
Chuong 2 rui ro tham hut tai khoa
Chuong 2   rui ro tham hut tai khoaChuong 2   rui ro tham hut tai khoa
Chuong 2 rui ro tham hut tai khoa
Le Thuy Hanh
 
Interacting Galaxies
Interacting GalaxiesInteracting Galaxies
Interacting Galaxies
ninabean47
 

Viewers also liked (20)

20-Line Lifesavers: Coding simple solutions in the GATK
20-Line Lifesavers: Coding simple solutions in the GATK20-Line Lifesavers: Coding simple solutions in the GATK
20-Line Lifesavers: Coding simple solutions in the GATK
 
Creating a SNP calling pipeline
Creating a SNP calling pipelineCreating a SNP calling pipeline
Creating a SNP calling pipeline
 
Amazon Ec2
Amazon Ec2Amazon Ec2
Amazon Ec2
 
IBM MQ v8 enhancements
IBM MQ v8 enhancementsIBM MQ v8 enhancements
IBM MQ v8 enhancements
 
wchh2014 Wordpress ChildThemes - wieso, weshalb, warum?
wchh2014 Wordpress ChildThemes - wieso, weshalb, warum?wchh2014 Wordpress ChildThemes - wieso, weshalb, warum?
wchh2014 Wordpress ChildThemes - wieso, weshalb, warum?
 
Wycisnąć IR-owca jak cytrynę. Jak inwestorzy indywidualni mogą zdobyć więcej ...
Wycisnąć IR-owca jak cytrynę. Jak inwestorzy indywidualni mogą zdobyć więcej ...Wycisnąć IR-owca jak cytrynę. Jak inwestorzy indywidualni mogą zdobyć więcej ...
Wycisnąć IR-owca jak cytrynę. Jak inwestorzy indywidualni mogą zdobyć więcej ...
 
Pecha Kucha
Pecha KuchaPecha Kucha
Pecha Kucha
 
Portuguese Hidden Champions
Portuguese Hidden ChampionsPortuguese Hidden Champions
Portuguese Hidden Champions
 
Chuong 7 doi moi tu duy va cai cach the che
Chuong 7   doi moi tu duy va cai cach the cheChuong 7   doi moi tu duy va cai cach the che
Chuong 7 doi moi tu duy va cai cach the che
 
NETTAB 2012 flyer
NETTAB 2012 flyerNETTAB 2012 flyer
NETTAB 2012 flyer
 
41035
4103541035
41035
 
Installation Instructions Tachometerwith Counter Drehzahlmessermit Zaehler
Installation Instructions Tachometerwith Counter Drehzahlmessermit ZaehlerInstallation Instructions Tachometerwith Counter Drehzahlmessermit Zaehler
Installation Instructions Tachometerwith Counter Drehzahlmessermit Zaehler
 
Chuong 2 rui ro tham hut tai khoa
Chuong 2   rui ro tham hut tai khoaChuong 2   rui ro tham hut tai khoa
Chuong 2 rui ro tham hut tai khoa
 
Photofraphy by Solve Sundsbo
Photofraphy by Solve SundsboPhotofraphy by Solve Sundsbo
Photofraphy by Solve Sundsbo
 
Nice 2012, BioWikis and DASWiki
Nice 2012, BioWikis and DASWikiNice 2012, BioWikis and DASWiki
Nice 2012, BioWikis and DASWiki
 
Blood Diamond
Blood DiamondBlood Diamond
Blood Diamond
 
如何开展社会化媒体营销?品牌拟人化
如何开展社会化媒体营销?品牌拟人化如何开展社会化媒体营销?品牌拟人化
如何开展社会化媒体营销?品牌拟人化
 
The Trust Economy
The Trust EconomyThe Trust Economy
The Trust Economy
 
Cellnetrix brochure 2013
Cellnetrix brochure 2013Cellnetrix brochure 2013
Cellnetrix brochure 2013
 
Interacting Galaxies
Interacting GalaxiesInteracting Galaxies
Interacting Galaxies
 

Similar to Ensembl Plants: Visualising, mining and analysing crop genomics data

Genome resources at EMBL-EBI: Ensembl and Ensembl Genomes
Genome resources at EMBL-EBI: Ensembl and Ensembl GenomesGenome resources at EMBL-EBI: Ensembl and Ensembl Genomes
Genome resources at EMBL-EBI: Ensembl and Ensembl Genomes
EBI
 
PAG XXII 2014 – The Crop Ontology: A resource for enabling access to breeders...
PAG XXII 2014 – The Crop Ontology: A resource for enabling access to breeders...PAG XXII 2014 – The Crop Ontology: A resource for enabling access to breeders...
PAG XXII 2014 – The Crop Ontology: A resource for enabling access to breeders...
CGIAR Generation Challenge Programme
 
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M Sawkins
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M SawkinsGRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M Sawkins
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M Sawkins
CGIAR Generation Challenge Programme
 
Genomics and bioinformatics
Genomics and bioinformatics Genomics and bioinformatics
Genomics and bioinformatics
Senthil Natesan
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical Research
David Ruau
 

Similar to Ensembl Plants: Visualising, mining and analysing crop genomics data (20)

Genome resources at EMBL-EBI: Ensembl and Ensembl Genomes
Genome resources at EMBL-EBI: Ensembl and Ensembl GenomesGenome resources at EMBL-EBI: Ensembl and Ensembl Genomes
Genome resources at EMBL-EBI: Ensembl and Ensembl Genomes
 
Role of ensembl in genome browsing
Role of ensembl in genome browsingRole of ensembl in genome browsing
Role of ensembl in genome browsing
 
Ramil Mauleon: IRRI GALAXY: bioinformatics for rice scientists
Ramil Mauleon: IRRI GALAXY: bioinformatics for rice scientistsRamil Mauleon: IRRI GALAXY: bioinformatics for rice scientists
Ramil Mauleon: IRRI GALAXY: bioinformatics for rice scientists
 
Browsing Genes, Variation and Regulation data with Ensembl
Browsing Genes, Variation and Regulation data with EnsemblBrowsing Genes, Variation and Regulation data with Ensembl
Browsing Genes, Variation and Regulation data with Ensembl
 
PAG XXII 2014 – The Crop Ontology: A resource for enabling access to breeders...
PAG XXII 2014 – The Crop Ontology: A resource for enabling access to breeders...PAG XXII 2014 – The Crop Ontology: A resource for enabling access to breeders...
PAG XXII 2014 – The Crop Ontology: A resource for enabling access to breeders...
 
Gramene
GrameneGramene
Gramene
 
GIAB-GRC workshop oct2015 giab introduction 151005
GIAB-GRC workshop oct2015 giab introduction 151005GIAB-GRC workshop oct2015 giab introduction 151005
GIAB-GRC workshop oct2015 giab introduction 151005
 
Data cycle microbes
Data cycle microbesData cycle microbes
Data cycle microbes
 
Understanding Genome
Understanding Genome Understanding Genome
Understanding Genome
 
Ramil Mauleon: Galaxy: bioinformatics for rice scientists
Ramil Mauleon: Galaxy: bioinformatics for rice scientistsRamil Mauleon: Galaxy: bioinformatics for rice scientists
Ramil Mauleon: Galaxy: bioinformatics for rice scientists
 
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M Sawkins
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M SawkinsGRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M Sawkins
GRM 2013: The Integrated Breeding Platform: Overview -- G McLaren and M Sawkins
 
Genomics and bioinformatics
Genomics and bioinformatics Genomics and bioinformatics
Genomics and bioinformatics
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical Research
 
Functional ANNOTATION OF GENOME.pptx
Functional ANNOTATION OF GENOME.pptxFunctional ANNOTATION OF GENOME.pptx
Functional ANNOTATION OF GENOME.pptx
 
Pipeline or pipe dream - Midlands Micro Meeting UK - mon 15 sep 2014
Pipeline or pipe dream - Midlands Micro Meeting UK - mon 15 sep 2014Pipeline or pipe dream - Midlands Micro Meeting UK - mon 15 sep 2014
Pipeline or pipe dream - Midlands Micro Meeting UK - mon 15 sep 2014
 
Major germplasm data sources and referatories
Major germplasm data sources and referatoriesMajor germplasm data sources and referatories
Major germplasm data sources and referatories
 
Bioinformatics Introduction
Bioinformatics IntroductionBioinformatics Introduction
Bioinformatics Introduction
 
Giab jan2016 intro and update 160128
Giab jan2016 intro and update 160128Giab jan2016 intro and update 160128
Giab jan2016 intro and update 160128
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences research
 
Cloud bioinformatics 2
Cloud bioinformatics 2Cloud bioinformatics 2
Cloud bioinformatics 2
 

More from Dan Bolser

Semantic MediaWiki Workshop
Semantic MediaWiki WorkshopSemantic MediaWiki Workshop
Semantic MediaWiki Workshop
Dan Bolser
 

More from Dan Bolser (6)

Ramona Tăme - Email Encryption and Digital SIgning
Ramona Tăme - Email Encryption and Digital SIgningRamona Tăme - Email Encryption and Digital SIgning
Ramona Tăme - Email Encryption and Digital SIgning
 
Ensembl plants hsf_d_bolser_2012
Ensembl plants hsf_d_bolser_2012Ensembl plants hsf_d_bolser_2012
Ensembl plants hsf_d_bolser_2012
 
Semantic MediaWiki Workshop
Semantic MediaWiki WorkshopSemantic MediaWiki Workshop
Semantic MediaWiki Workshop
 
Wikis at work
Wikis at workWikis at work
Wikis at work
 
BioWikis BSB10
BioWikis BSB10BioWikis BSB10
BioWikis BSB10
 
Wikipedia and the Global Brain
Wikipedia and the Global BrainWikipedia and the Global Brain
Wikipedia and the Global Brain
 

Recently uploaded

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 

Recently uploaded (20)

ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 

Ensembl Plants: Visualising, mining and analysing crop genomics data

  • 1. Ensembl Plants: Visualising, mining and analysing crop genomics data Dan Bolser Ensembl Plants project leader EMBL-EBI http://plants.ensembl.org #EnsemblGenomes
  • 2. Visualising, mining and analysing data: ● The Ensembl genome browser ● BioMart ● Tools for processing your own data Overview Background: ● Ensembl Plants ● History ● Data ● Recent updates ● Wheat ● Barley
  • 3. EBI Ensembl is developed jointly by the EBI and the Wellcome Trust Sanger Institute
  • 4. Ensembl Plants uses Ensembl technology Ensembl: ● A platform for genome browsing, annotation and analysis developed jointly by the EBI and Wellcome Trust Sanger Institute. ● Has modules for handling: ● Genomic data, Variations, Comparative genomics, Gene prediction, ... ● Multiple points of access to data: ● Browser-based application, Perl and REST APIs, direct access (MySQL), BioMart data mining tool, DAS (client and server), FTP. ● Upload your own data and compare it to the reference seq. and annotation. Ensembl was originally developed for vertebrate genomes, subsequently extended to non-vertebrate species: ● Ensembl Genomes → Ensembl Plants
  • 5. Currently 33 genomes in Ensembl Plants http://plants.ensembl.org
  • 9. Types of data in Ensembl (Ensembl Plants) ● Genomic sequence ● Gene, transcript, and protein annotations ● External references and ontology terms ● Mapped sequences: cDNAs, proteins, probes, BACs, repeats, markers, ... ● Variation data: ● sequence variants ● structural variants ● Comparative data: ● gene trees, orthologues, paralogues ● whole genome alignments and synteny
  • 11.
  • 12. Wheat data in Ensembl Plants ● The chromosome survey sequence from the International Wheat Genome Sequencing Consortium. ● Version 2.1 of the IWGSC gene models called on the chromosome survey sequence. ● Repeats ● Repbase ● The Triticeae Repeat Sequence Database (TREP) ● Alignments ● RNA-seq from various studies in ENA ● ESTs and UniGene clusters ● 5x 454 Brenchley et al. ● Triticum turgidum cDNA assemblies
  • 13. Wheat data in Ensembl Plants ● Whole genome alignments ● Between wheat(s) and: ● Rice ● Brachypodium ● Within wheat ● A vs. B ● A vs. D ● B vs. D ● Gene trees ● Aegilops tauschii ● Triticum urartu ● and other more distant relatives
  • 14. WGA between wheat, rice and brachy
  • 15. WGA within wheat A, B and D sub-genomes
  • 18. Walk through ‘demo’ for Ensembl Plants
  • 19.
  • 21.
  • 22.
  • 23.
  • 24.
  • 25. Variant Effect Predictor (VEP) ● Predicts functional consequences of known and unknown variants ● For substitutions, insertions, deletions and structural variants ● Web interface (for up to 750 variants), standalone Perl script, Perl API and REST API
  • 26. Visualise your own data Upload data: ● Data saved on server ● 5 MB limit ● Large file formats? Attach remote files: ● URL-based ● HTTP or FTP ● No size limit Upload formats: ● BED genes / features ● Gbrowse genes / features ● GFF/GTF genes / features ● PSL sequence alignments ● WIG continuous-valued data ● BedGraph continuous-valued data ● TrackHub collections of tracks Attach formats: ● BigBed genes / features ● BAM sequence alignments ● BigWig continuous-valued data ● VCF variants User added tracks: ● Can be saved or shared ● Only trivial security, do not use for sensitive data!
  • 28.
  • 29.
  • 30.
  • 31.
  • 32.
  • 33. ● Step 1 – Dataset ● Choose your dataset and species ● Step 2 – Filters ● Limit your dataset ● Step 3 – Attributes ● Specify what information you want to output ● Step 4 – Results ● Preview and output your results Blast and BioMart...
  • 34.
  • 35. pkersey@ebi.ac.uk10/01/2014 Funding (Ensembl Plants) • Ensembl Genomes Funded by • EMBL • EU (INFRAVEC, Microme, transPLANT, AllBio) • BBSRC (PhytoPath, wheat, barley and midge sequencing, UK-US collaboration, RNAcentral) • Wellcome Trust (PomBase) • NIH/NIAID (VectorBase) • NSF (Gramene collaboration) • Bill and Melinda Gates Foundation (wheat rust)
  • 36. pkersey@ebi.ac.uk10/01/2014 People (Ensembl Plants) • James Allen, Irina Armean, Dan Bolser, Mikkel Christensen, Paul Davies, Christoph Grabmueller, Kevin Howe, Malcolm Hinsley, Jay Humphrey, Arnaud Kerhornou, Paul Kersey, Julia Khobdova, Eugene Kulesha, Nick Langridge, Dan Lawson, Mark McDowall, Uma Maheswari, Gareth Maslen, Michael Nuhn, Chuang Kee Ong, Michael Paulini, Helder Pedro, Anton Petrov, Dan Staines, Mary Ann Tuli, Brandon Walts, Gary Williams • If you have a question that is not answered here, please Contact our HelpDesk: • helpdesk@ensemblgenomes.org