SlideShare a Scribd company logo
1 of 26
Curation
Ewan Birney (tweetable)
Who am I?
โ€ข Associate Director at
  European Bioinformatics
  Institute (EBI)
โ€ข Involved in genomics since I
  was 19 (> 20 years!)
โ€ข Trained as a biochemist โ€“
  most people think I am CS
                                 EBI is in Hinxton, South
โ€ข Analysed โ€“ sometimes lead
                                 Cambridgeshire
  โ€“
  human/mouse/rat/platypus
                                 EBI is part of EMBL, ~like
  etc genomes, ENCODE,
                                 CERN for molecular biology
  Others.
Molecular Biology
โ€ข The study of how life works โ€“ at a molecular level

โ€ข Key molecules:
  โ€ข DNA โ€“ Information store (Disk)
  โ€ข RNA โ€“ Key information transformer, also does stuff (RAM)
  โ€ข Proteins โ€“ The business end of life (Chip, robotic arms)
  โ€ข Metabolites โ€“ Fuel and signalling molecules (electricity)
โ€ข Theories of how these interact โ€“ no theories of to predict what
  they are
โ€ข Instead we determine attributes of molecules and store them in
  globally accessible, open, databases
Theory ๏ƒณ Observation


                    Can accurately predict from models




 Must directly observe
    Molecular Geology,  Climate        High Energy
    Biology   Astronomy modelling      Physics
This ratio is not well correlated with data size


   ~60PB                        High Energy Physics

Data Size
             Molecular Astronomy
             Biology
    ~5PB                      Climate Models




             Ratio of model predictability
โ€œKnowing stuffโ€ is critical to biologyโ€ฆ

โ€ข The bases of the human genome
  โ€ข โ€ฆ and the Mouse, Rat, Wheat, Ecoli, Plasmodium, Cowโ€ฆ.
โ€ข The functions of proteins
  โ€ข Enzymes, Transcription Factors, Signallingโ€ฆ.
โ€ข The types of cells, their lineages and organ composition
  โ€ข โ€ฆand all the molecular components in each cell
โ€ข Small molecules
  โ€ข โ€ฆ and their conversions, binding partners
โ€ข Structures of molecules, complexes and cells
  โ€ข โ€ฆ at atomic and higher resolution
Two fundamental types of information

โ€ข Experimental data           โ€ข Consensus Knowledge

โ€ข The result of a specific    โ€ข Integration of different
  experiment                    strands of information on a
โ€ข Often an experiment           topic
  specific, data heavy part   โ€ข Realised as a
  plus a โ€œmeta-dataโ€ part       computationally accessible
โ€ข Might be contradictory        scheme


โ€ข โ€œPrimary paperโ€             โ€ข โ€œReview articleโ€
Five types of curation
Experimental Data Entry

โ€ข Intact โ€“ Protein:Protein
  interactions


โ€ข GWAS Catalog โ€“
  extraction of summary
  statistics
Experimental Meta data capture

โ€ข Sample, CDS lines in
  ENA
โ€ข Sample in Metabolights,
  PRIDE etc
โ€ข Machine and analysis
  specification in PDB,
  PRIDE, ENA
Consensus integration of information

โ€ข GenCode gene models in
  human
โ€ข Summaries and GO
  assignment in UniProt
โ€ข Pathway information in
  Reactome
โ€ข GO assignment and
  summaries in MODs (eg,
  PomBase, WormBase,
  PhytoPathDB etc)
Knowledge frameworks

โ€ข   The EC classification
โ€ข   Cell type ontologies
โ€ข   Cell lineages โ€“ Worms!
โ€ข   SnowMed, HPO etc
โ€ข   GO ontologies
Knowledge management

โ€ข Creation of rules
  representing ENA
  standards compliance
โ€ข Cross-ontology
  coordination (eg, EFO) or
  tieing (GO ๏ƒณ ChEBI)
โ€ข RuleBase / UniRule
  curation processes
Data Entry vs Programming

 Direct                                    Programmatic
 Data Entry                                Data Entry




                      โ€œMessyโ€ Scripting
         Improved
         Data entry
         tools              RuleBase,
                            Computational Accessible
                            Standards
Thank You!
Curation Dilema

โ€ข If you do your job wellโ€ฆ   โ€ข If you do your job badlyโ€ฆ

โ€ข Everyone assumes itโ€™s      โ€ข Everyone assumes itโ€™s
  easy                         easy
โ€ข People forget about the    โ€ข People forget about the
  complexity                   complexity


โ€ข You are ignored ๏Š          โ€ข People complain ๏Š
Why we need an infrastructureโ€ฆ
Infrastructures are criticalโ€ฆ
But we only notice them when they go wrong
Biology already needs an information
infrastructure

โ€ข For the human genome
  โ€ข (โ€ฆand the mouse, and the rat, andโ€ฆ x 150 now, 1000 in the
    future!) - Ensembl
โ€ข For the function of genes and proteins
  โ€ข For all genes, in text and computational โ€“ UniProt and GO
โ€ข For all 3D structures
  โ€ข To understand how proteins work โ€“ PDBe
โ€ข For where things are expressed
  โ€ข The differences and functionality of cells - Atlas
..But this keeps on goingโ€ฆ

โ€ข We have to scale across all of (interesting) life
  โ€ข There are a lot of species out there!
โ€ข We have to handle new areas, in particular medicine
  โ€ข A set of European haplotypes for good imputation
  โ€ข A set of actionable variants in germline and cancers
โ€ข We have to improve our chemical understanding
  โ€ข Of biological chemicals
  โ€ข Of chemicals which interfere with Biology
ELIXIRโ€™s mission
To build a sustainable
European infrastructure for
biological
information, supporting life
science research and its
                                                  medicine
translation to:

                                    environment


                         bioindustries

            society


              22
How?

Fully Centralised                                 Fully Distributed




Pros: Stability, reuse,             Pros: Responsive, Geographic
Learning ease                       Language responsive
Cons: Hard to concentrate           Cons: Internal communication overhead
Expertise across of life science    Harder for end users to learn
Geographic, language placement      Harder to provide multi-decade stability
Bottlenecks and lack of diversity
Research        Healthcare




    International    National
    EBI / Elixir     Healthcare
    English          National Language
    Low legalities   Complex legalities

2
Other infrastructures needed for biology
โ€ข EuroBioImaging
  โ€ข Cellular and whole organism Imaging
โ€ข BioBanks (BBMRI)
  โ€ข We need numbers โ€“ European populations โ€“ in particular for rare
    diseases, but also for specific sub types of common disease
โ€ข Mouse models and phenotypes (Infrafrontier)
  โ€ข A baseline set of knockouts and phenotypes in our most tractable
    mammalian model
  โ€ข (itโ€™s hard to prove something in human)
โ€ข Robust molecular assays in a clinical setting (EATRIS)
  โ€ข The ability to reliably use state of the art molecular techniques in a
    clinical research setting
(you can follow me on twitter @ewanbirney)
I blog and update this on Google Plus publically

More Related Content

What's hot

Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Fokhruz Zaman
ย 
UniProt-GOA
UniProt-GOAUniProt-GOA
UniProt-GOA
EBI
ย 
UniProt and the Semantic Web
UniProt and the Semantic WebUniProt and the Semantic Web
UniProt and the Semantic Web
Chimezie Ogbuji
ย 
bioinformatics simple
bioinformatics simple bioinformatics simple
bioinformatics simple
nadeem akhter
ย 

What's hot (20)

Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
ย 
Ensembl Browser Workshop
Ensembl Browser WorkshopEnsembl Browser Workshop
Ensembl Browser Workshop
ย 
UniProt & Ontologies
UniProt & OntologiesUniProt & Ontologies
UniProt & Ontologies
ย 
Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)
ย 
BioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsBioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomics
ย 
Pโ€ฉ Systems โ€ฉModel โ€ฉOptimisation โ€ฉbyโ€ฉ Means โ€ฉof โ€ฉEvolutionary โ€ฉBased โ€ฉSearchโ€ฉ ...
Pโ€ฉ Systems โ€ฉModel โ€ฉOptimisation โ€ฉbyโ€ฉ Means โ€ฉof โ€ฉEvolutionary โ€ฉBased โ€ฉSearchโ€ฉ ...Pโ€ฉ Systems โ€ฉModel โ€ฉOptimisation โ€ฉbyโ€ฉ Means โ€ฉof โ€ฉEvolutionary โ€ฉBased โ€ฉSearchโ€ฉ ...
Pโ€ฉ Systems โ€ฉModel โ€ฉOptimisation โ€ฉbyโ€ฉ Means โ€ฉof โ€ฉEvolutionary โ€ฉBased โ€ฉSearchโ€ฉ ...
ย 
TGAC Browser bosc 2014
TGAC Browser bosc 2014TGAC Browser bosc 2014
TGAC Browser bosc 2014
ย 
UniProt-GOA
UniProt-GOAUniProt-GOA
UniProt-GOA
ย 
PhoenixBio 2020 Stanford Workshop on PhyloGenes
PhoenixBio 2020 Stanford Workshop on PhyloGenesPhoenixBio 2020 Stanford Workshop on PhyloGenes
PhoenixBio 2020 Stanford Workshop on PhyloGenes
ย 
Introduction to proteomics
Introduction to proteomicsIntroduction to proteomics
Introduction to proteomics
ย 
UniProt and the Semantic Web
UniProt and the Semantic WebUniProt and the Semantic Web
UniProt and the Semantic Web
ย 
University of Toronto Chemistry Librarians Workshop June 2012
University of Toronto Chemistry Librarians Workshop June 2012University of Toronto Chemistry Librarians Workshop June 2012
University of Toronto Chemistry Librarians Workshop June 2012
ย 
Intro bioinfo
Intro bioinfoIntro bioinfo
Intro bioinfo
ย 
bioinformatics simple
bioinformatics simple bioinformatics simple
bioinformatics simple
ย 
Computing on Phenotypes AMP 2015
Computing on Phenotypes AMP 2015Computing on Phenotypes AMP 2015
Computing on Phenotypes AMP 2015
ย 
Shorter bioinformatics
Shorter bioinformaticsShorter bioinformatics
Shorter bioinformatics
ย 
Kegg
KeggKegg
Kegg
ย 
Collaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of LifeCollaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of Life
ย 
Ontologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyOntologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontology
ย 
Mungall keynote-biocurator-2017
Mungall keynote-biocurator-2017Mungall keynote-biocurator-2017
Mungall keynote-biocurator-2017
ย 

Similar to Ewan Birney Biocuration 2013

Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomics
Nikhil Aggarwal
ย 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical Sciences
Connected Data World
ย 
Visualizing the Structural Variome (VMLS-Eurovis 2013)
Visualizing the Structural Variome (VMLS-Eurovis 2013)Visualizing the Structural Variome (VMLS-Eurovis 2013)
Visualizing the Structural Variome (VMLS-Eurovis 2013)
Jan Aerts
ย 
Genomics and bioinformatics
Genomics and bioinformatics Genomics and bioinformatics
Genomics and bioinformatics
Senthil Natesan
ย 

Similar to Ewan Birney Biocuration 2013 (20)

Computer science history.pdf
Computer science history.pdfComputer science history.pdf
Computer science history.pdf
ย 
Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomics
ย 
The Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in BiologyThe Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in Biology
ย 
Biocurator2012.41.hu
Biocurator2012.41.huBiocurator2012.41.hu
Biocurator2012.41.hu
ย 
Plant Pathogen Genome Data: My Life In Sequences
Plant Pathogen Genome Data: My Life In SequencesPlant Pathogen Genome Data: My Life In Sequences
Plant Pathogen Genome Data: My Life In Sequences
ย 
Building and Using Ontologies to do biology
Building and Using Ontologies to do biologyBuilding and Using Ontologies to do biology
Building and Using Ontologies to do biology
ย 
Amia tb-review-08
Amia tb-review-08Amia tb-review-08
Amia tb-review-08
ย 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical Sciences
ย 
Synthetic Biology and Data-Driven Synthetic Biology for Personalized Medicine...
Synthetic Biology and Data-Driven Synthetic Biology for Personalized Medicine...Synthetic Biology and Data-Driven Synthetic Biology for Personalized Medicine...
Synthetic Biology and Data-Driven Synthetic Biology for Personalized Medicine...
ย 
2014 bangkok-talk
2014 bangkok-talk2014 bangkok-talk
2014 bangkok-talk
ย 
Intro bioinformatics
Intro bioinformaticsIntro bioinformatics
Intro bioinformatics
ย 
Big Data
Big DataBig Data
Big Data
ย 
Molecular basis of evolution and softwares used in phylogenetic tree contruction
Molecular basis of evolution and softwares used in phylogenetic tree contructionMolecular basis of evolution and softwares used in phylogenetic tree contruction
Molecular basis of evolution and softwares used in phylogenetic tree contruction
ย 
Visualizing the Structural Variome (VMLS-Eurovis 2013)
Visualizing the Structural Variome (VMLS-Eurovis 2013)Visualizing the Structural Variome (VMLS-Eurovis 2013)
Visualizing the Structural Variome (VMLS-Eurovis 2013)
ย 
Bi 140 science, technology and society module 4
Bi 140 science, technology and society module 4Bi 140 science, technology and society module 4
Bi 140 science, technology and society module 4
ย 
Lecture 1
Lecture 1Lecture 1
Lecture 1
ย 
World-wide data exchange in metabolomics, Wageningen, October 2016
World-wide data exchange in metabolomics, Wageningen, October 2016World-wide data exchange in metabolomics, Wageningen, October 2016
World-wide data exchange in metabolomics, Wageningen, October 2016
ย 
Using public databases to inform research questions
Using public databases to inform research questionsUsing public databases to inform research questions
Using public databases to inform research questions
ย 
Introduction to epigenetics and study design
Introduction to epigenetics and study designIntroduction to epigenetics and study design
Introduction to epigenetics and study design
ย 
Genomics and bioinformatics
Genomics and bioinformatics Genomics and bioinformatics
Genomics and bioinformatics
ย 

More from Iddo

CAFA poster presented at CSHL Genome Informatics 2013
CAFA poster presented at CSHL Genome Informatics 2013CAFA poster presented at CSHL Genome Informatics 2013
CAFA poster presented at CSHL Genome Informatics 2013
Iddo
ย 
Metagenomics Biocuration 2013
Metagenomics Biocuration 2013Metagenomics Biocuration 2013
Metagenomics Biocuration 2013
Iddo
ย 
Ismb grant-writing-2012
Ismb grant-writing-2012Ismb grant-writing-2012
Ismb grant-writing-2012
Iddo
ย 
David Jones AFP/CAFA2011
David Jones AFP/CAFA2011David Jones AFP/CAFA2011
David Jones AFP/CAFA2011
Iddo
ย 
Vienna afp2011
Vienna afp2011Vienna afp2011
Vienna afp2011
Iddo
ย 
Afp cafa djuric
Afp cafa djuricAfp cafa djuric
Afp cafa djuric
Iddo
ย 
Go camp 2010_cacao
Go camp 2010_cacaoGo camp 2010_cacao
Go camp 2010_cacao
Iddo
ย 

More from Iddo (20)

What can Community Challenges do for You?
What can Community Challenges do for You?What can Community Challenges do for You?
What can Community Challenges do for You?
ย 
Surviving Scientific Presentations
Surviving Scientific PresentationsSurviving Scientific Presentations
Surviving Scientific Presentations
ย 
Friedberg lab-overview-grad-students-2019-nr
Friedberg lab-overview-grad-students-2019-nrFriedberg lab-overview-grad-students-2019-nr
Friedberg lab-overview-grad-students-2019-nr
ย 
The roles communities play in improving bioinformatics: better software, bett...
The roles communities play in improving bioinformatics: better software, bett...The roles communities play in improving bioinformatics: better software, bett...
The roles communities play in improving bioinformatics: better software, bett...
ย 
Why Your Microbiome Analysis is Wrong
Why Your Microbiome Analysis is WrongWhy Your Microbiome Analysis is Wrong
Why Your Microbiome Analysis is Wrong
ย 
Tracing the Ancestry of Genomes in Bacteria
Tracing the Ancestry of Genomes in BacteriaTracing the Ancestry of Genomes in Bacteria
Tracing the Ancestry of Genomes in Bacteria
ย 
Computational Challenges in Biological Data Science: an Optimistically Cautio...
Computational Challenges in Biological Data Science: an Optimistically Cautio...Computational Challenges in Biological Data Science: an Optimistically Cautio...
Computational Challenges in Biological Data Science: an Optimistically Cautio...
ย 
Friedberg lab-overview-grad-students
Friedberg lab-overview-grad-studentsFriedberg lab-overview-grad-students
Friedberg lab-overview-grad-students
ย 
Understanding Biological Function in Times of High Throughput and Low Output
Understanding Biological Function in Times of High Throughput and Low OutputUnderstanding Biological Function in Times of High Throughput and Low Output
Understanding Biological Function in Times of High Throughput and Low Output
ย 
Random Musings on Fixing Data Shambles in Science
Random Musings on Fixing Data Shambles in ScienceRandom Musings on Fixing Data Shambles in Science
Random Musings on Fixing Data Shambles in Science
ย 
Genome Informatics 2015 Bacteriocin Discovery
Genome Informatics 2015 Bacteriocin DiscoveryGenome Informatics 2015 Bacteriocin Discovery
Genome Informatics 2015 Bacteriocin Discovery
ย 
Convergent divergent
Convergent divergentConvergent divergent
Convergent divergent
ย 
Some US Science Funding sources
Some US Science Funding sourcesSome US Science Funding sources
Some US Science Funding sources
ย 
CAFA poster presented at CSHL Genome Informatics 2013
CAFA poster presented at CSHL Genome Informatics 2013CAFA poster presented at CSHL Genome Informatics 2013
CAFA poster presented at CSHL Genome Informatics 2013
ย 
Metagenomics Biocuration 2013
Metagenomics Biocuration 2013Metagenomics Biocuration 2013
Metagenomics Biocuration 2013
ย 
Ismb grant-writing-2012
Ismb grant-writing-2012Ismb grant-writing-2012
Ismb grant-writing-2012
ย 
David Jones AFP/CAFA2011
David Jones AFP/CAFA2011David Jones AFP/CAFA2011
David Jones AFP/CAFA2011
ย 
Vienna afp2011
Vienna afp2011Vienna afp2011
Vienna afp2011
ย 
Afp cafa djuric
Afp cafa djuricAfp cafa djuric
Afp cafa djuric
ย 
Go camp 2010_cacao
Go camp 2010_cacaoGo camp 2010_cacao
Go camp 2010_cacao
ย 

Recently uploaded

๐Ÿ’šChandigarh Call Girls ๐Ÿ’ฏRiya ๐Ÿ“ฒ๐Ÿ”8868886958๐Ÿ”Call Girls In Chandigarh No๐Ÿ’ฐAdvance...
๐Ÿ’šChandigarh Call Girls ๐Ÿ’ฏRiya ๐Ÿ“ฒ๐Ÿ”8868886958๐Ÿ”Call Girls In Chandigarh No๐Ÿ’ฐAdvance...๐Ÿ’šChandigarh Call Girls ๐Ÿ’ฏRiya ๐Ÿ“ฒ๐Ÿ”8868886958๐Ÿ”Call Girls In Chandigarh No๐Ÿ’ฐAdvance...
๐Ÿ’šChandigarh Call Girls ๐Ÿ’ฏRiya ๐Ÿ“ฒ๐Ÿ”8868886958๐Ÿ”Call Girls In Chandigarh No๐Ÿ’ฐAdvance...
Sheetaleventcompany
ย 
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...
Sheetaleventcompany
ย 
๐Ÿ‘‰ Chennai Sexy Auntyโ€™s WhatsApp Number ๐Ÿ‘‰๐Ÿ“ž 7427069034 ๐Ÿ‘‰๐Ÿ“ž Just๐Ÿ“ฒ Call Ruhi Colle...
๐Ÿ‘‰ Chennai Sexy Auntyโ€™s WhatsApp Number ๐Ÿ‘‰๐Ÿ“ž 7427069034 ๐Ÿ‘‰๐Ÿ“ž Just๐Ÿ“ฒ Call Ruhi Colle...๐Ÿ‘‰ Chennai Sexy Auntyโ€™s WhatsApp Number ๐Ÿ‘‰๐Ÿ“ž 7427069034 ๐Ÿ‘‰๐Ÿ“ž Just๐Ÿ“ฒ Call Ruhi Colle...
๐Ÿ‘‰ Chennai Sexy Auntyโ€™s WhatsApp Number ๐Ÿ‘‰๐Ÿ“ž 7427069034 ๐Ÿ‘‰๐Ÿ“ž Just๐Ÿ“ฒ Call Ruhi Colle...
rajnisinghkjn
ย 
Nagpur Call Girl Service ๐Ÿ“ž9xx000xx09๐Ÿ“žJust Call Divya๐Ÿ“ฒ Call Girl In Nagpur No๐Ÿ’ฐ...
Nagpur Call Girl Service ๐Ÿ“ž9xx000xx09๐Ÿ“žJust Call Divya๐Ÿ“ฒ Call Girl In Nagpur No๐Ÿ’ฐ...Nagpur Call Girl Service ๐Ÿ“ž9xx000xx09๐Ÿ“žJust Call Divya๐Ÿ“ฒ Call Girl In Nagpur No๐Ÿ’ฐ...
Nagpur Call Girl Service ๐Ÿ“ž9xx000xx09๐Ÿ“žJust Call Divya๐Ÿ“ฒ Call Girl In Nagpur No๐Ÿ’ฐ...
Sheetaleventcompany
ย 
๐Ÿ’ฐCall Girl In Bangaloreโ˜Ž๏ธ63788-78445๐Ÿ’ฐ Call Girl service in Bangaloreโ˜Ž๏ธBangalo...
๐Ÿ’ฐCall Girl In Bangaloreโ˜Ž๏ธ63788-78445๐Ÿ’ฐ Call Girl service in Bangaloreโ˜Ž๏ธBangalo...๐Ÿ’ฐCall Girl In Bangaloreโ˜Ž๏ธ63788-78445๐Ÿ’ฐ Call Girl service in Bangaloreโ˜Ž๏ธBangalo...
๐Ÿ’ฐCall Girl In Bangaloreโ˜Ž๏ธ63788-78445๐Ÿ’ฐ Call Girl service in Bangaloreโ˜Ž๏ธBangalo...
gragneelam30
ย 
Gorgeous Call Girls Dehradun {8854095900} โค๏ธVVIP ROCKY Call Girls in Dehradun...
Gorgeous Call Girls Dehradun {8854095900} โค๏ธVVIP ROCKY Call Girls in Dehradun...Gorgeous Call Girls Dehradun {8854095900} โค๏ธVVIP ROCKY Call Girls in Dehradun...
Gorgeous Call Girls Dehradun {8854095900} โค๏ธVVIP ROCKY Call Girls in Dehradun...
Sheetaleventcompany
ย 
Whitefield { Call Girl in Bangalore โ‚น7.5k Pick Up & Drop With Cash Payment 63...
Whitefield { Call Girl in Bangalore โ‚น7.5k Pick Up & Drop With Cash Payment 63...Whitefield { Call Girl in Bangalore โ‚น7.5k Pick Up & Drop With Cash Payment 63...
Whitefield { Call Girl in Bangalore โ‚น7.5k Pick Up & Drop With Cash Payment 63...
dishamehta3332
ย 
๐Ÿ’šCall Girls In Amritsar ๐Ÿ’ฏAnvi ๐Ÿ“ฒ๐Ÿ”8725944379๐Ÿ”Amritsar Call Girl No๐Ÿ’ฐAdvance Cash...
๐Ÿ’šCall Girls In Amritsar ๐Ÿ’ฏAnvi ๐Ÿ“ฒ๐Ÿ”8725944379๐Ÿ”Amritsar Call Girl No๐Ÿ’ฐAdvance Cash...๐Ÿ’šCall Girls In Amritsar ๐Ÿ’ฏAnvi ๐Ÿ“ฒ๐Ÿ”8725944379๐Ÿ”Amritsar Call Girl No๐Ÿ’ฐAdvance Cash...
๐Ÿ’šCall Girls In Amritsar ๐Ÿ’ฏAnvi ๐Ÿ“ฒ๐Ÿ”8725944379๐Ÿ”Amritsar Call Girl No๐Ÿ’ฐAdvance Cash...
Sheetaleventcompany
ย 
Control of Local Blood Flow: acute and chronic
Control of Local Blood Flow: acute and chronicControl of Local Blood Flow: acute and chronic
Control of Local Blood Flow: acute and chronic
MedicoseAcademics
ย 
Dehradun Call Girl Service โค๏ธ๐Ÿ‘ 8854095900 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Dehradun
Dehradun Call Girl Service โค๏ธ๐Ÿ‘ 8854095900 ๐Ÿ‘„๐ŸซฆIndependent Escort Service DehradunDehradun Call Girl Service โค๏ธ๐Ÿ‘ 8854095900 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Dehradun
Dehradun Call Girl Service โค๏ธ๐Ÿ‘ 8854095900 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Dehradun
Sheetaleventcompany
ย 
Jual Obat Aborsi Di Dubai UAE Wa 0838-4800-7379 Obat Penggugur Kandungan Cytotec
Jual Obat Aborsi Di Dubai UAE Wa 0838-4800-7379 Obat Penggugur Kandungan CytotecJual Obat Aborsi Di Dubai UAE Wa 0838-4800-7379 Obat Penggugur Kandungan Cytotec
Jual Obat Aborsi Di Dubai UAE Wa 0838-4800-7379 Obat Penggugur Kandungan Cytotec
jualobat34
ย 

Recently uploaded (20)

Kolkata Call Girls Shobhabazar ๐Ÿ’ฏCall Us ๐Ÿ” 8005736733 ๐Ÿ” ๐Ÿ’ƒ Top Class Call Gir...
Kolkata Call Girls Shobhabazar  ๐Ÿ’ฏCall Us ๐Ÿ” 8005736733 ๐Ÿ” ๐Ÿ’ƒ  Top Class Call Gir...Kolkata Call Girls Shobhabazar  ๐Ÿ’ฏCall Us ๐Ÿ” 8005736733 ๐Ÿ” ๐Ÿ’ƒ  Top Class Call Gir...
Kolkata Call Girls Shobhabazar ๐Ÿ’ฏCall Us ๐Ÿ” 8005736733 ๐Ÿ” ๐Ÿ’ƒ Top Class Call Gir...
ย 
๐Ÿ’šChandigarh Call Girls ๐Ÿ’ฏRiya ๐Ÿ“ฒ๐Ÿ”8868886958๐Ÿ”Call Girls In Chandigarh No๐Ÿ’ฐAdvance...
๐Ÿ’šChandigarh Call Girls ๐Ÿ’ฏRiya ๐Ÿ“ฒ๐Ÿ”8868886958๐Ÿ”Call Girls In Chandigarh No๐Ÿ’ฐAdvance...๐Ÿ’šChandigarh Call Girls ๐Ÿ’ฏRiya ๐Ÿ“ฒ๐Ÿ”8868886958๐Ÿ”Call Girls In Chandigarh No๐Ÿ’ฐAdvance...
๐Ÿ’šChandigarh Call Girls ๐Ÿ’ฏRiya ๐Ÿ“ฒ๐Ÿ”8868886958๐Ÿ”Call Girls In Chandigarh No๐Ÿ’ฐAdvance...
ย 
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...
ย 
๐Ÿ‘‰ Chennai Sexy Auntyโ€™s WhatsApp Number ๐Ÿ‘‰๐Ÿ“ž 7427069034 ๐Ÿ‘‰๐Ÿ“ž Just๐Ÿ“ฒ Call Ruhi Colle...
๐Ÿ‘‰ Chennai Sexy Auntyโ€™s WhatsApp Number ๐Ÿ‘‰๐Ÿ“ž 7427069034 ๐Ÿ‘‰๐Ÿ“ž Just๐Ÿ“ฒ Call Ruhi Colle...๐Ÿ‘‰ Chennai Sexy Auntyโ€™s WhatsApp Number ๐Ÿ‘‰๐Ÿ“ž 7427069034 ๐Ÿ‘‰๐Ÿ“ž Just๐Ÿ“ฒ Call Ruhi Colle...
๐Ÿ‘‰ Chennai Sexy Auntyโ€™s WhatsApp Number ๐Ÿ‘‰๐Ÿ“ž 7427069034 ๐Ÿ‘‰๐Ÿ“ž Just๐Ÿ“ฒ Call Ruhi Colle...
ย 
Kolkata Call Girls Naktala ๐Ÿ’ฏCall Us ๐Ÿ” 8005736733 ๐Ÿ” ๐Ÿ’ƒ Top Class Call Girl Se...
Kolkata Call Girls Naktala  ๐Ÿ’ฏCall Us ๐Ÿ” 8005736733 ๐Ÿ” ๐Ÿ’ƒ  Top Class Call Girl Se...Kolkata Call Girls Naktala  ๐Ÿ’ฏCall Us ๐Ÿ” 8005736733 ๐Ÿ” ๐Ÿ’ƒ  Top Class Call Girl Se...
Kolkata Call Girls Naktala ๐Ÿ’ฏCall Us ๐Ÿ” 8005736733 ๐Ÿ” ๐Ÿ’ƒ Top Class Call Girl Se...
ย 
Nagpur Call Girl Service ๐Ÿ“ž9xx000xx09๐Ÿ“žJust Call Divya๐Ÿ“ฒ Call Girl In Nagpur No๐Ÿ’ฐ...
Nagpur Call Girl Service ๐Ÿ“ž9xx000xx09๐Ÿ“žJust Call Divya๐Ÿ“ฒ Call Girl In Nagpur No๐Ÿ’ฐ...Nagpur Call Girl Service ๐Ÿ“ž9xx000xx09๐Ÿ“žJust Call Divya๐Ÿ“ฒ Call Girl In Nagpur No๐Ÿ’ฐ...
Nagpur Call Girl Service ๐Ÿ“ž9xx000xx09๐Ÿ“žJust Call Divya๐Ÿ“ฒ Call Girl In Nagpur No๐Ÿ’ฐ...
ย 
7 steps How to prevent Thalassemia : Dr Sharda Jain & Vandana Gupta
7 steps How to prevent Thalassemia : Dr Sharda Jain & Vandana Gupta7 steps How to prevent Thalassemia : Dr Sharda Jain & Vandana Gupta
7 steps How to prevent Thalassemia : Dr Sharda Jain & Vandana Gupta
ย 
Cardiac Output, Venous Return, and Their Regulation
Cardiac Output, Venous Return, and Their RegulationCardiac Output, Venous Return, and Their Regulation
Cardiac Output, Venous Return, and Their Regulation
ย 
๐Ÿ’ฐCall Girl In Bangaloreโ˜Ž๏ธ63788-78445๐Ÿ’ฐ Call Girl service in Bangaloreโ˜Ž๏ธBangalo...
๐Ÿ’ฐCall Girl In Bangaloreโ˜Ž๏ธ63788-78445๐Ÿ’ฐ Call Girl service in Bangaloreโ˜Ž๏ธBangalo...๐Ÿ’ฐCall Girl In Bangaloreโ˜Ž๏ธ63788-78445๐Ÿ’ฐ Call Girl service in Bangaloreโ˜Ž๏ธBangalo...
๐Ÿ’ฐCall Girl In Bangaloreโ˜Ž๏ธ63788-78445๐Ÿ’ฐ Call Girl service in Bangaloreโ˜Ž๏ธBangalo...
ย 
Gorgeous Call Girls Dehradun {8854095900} โค๏ธVVIP ROCKY Call Girls in Dehradun...
Gorgeous Call Girls Dehradun {8854095900} โค๏ธVVIP ROCKY Call Girls in Dehradun...Gorgeous Call Girls Dehradun {8854095900} โค๏ธVVIP ROCKY Call Girls in Dehradun...
Gorgeous Call Girls Dehradun {8854095900} โค๏ธVVIP ROCKY Call Girls in Dehradun...
ย 
Call Girls Shahdol Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Shahdol Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Shahdol Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Shahdol Just Call 8250077686 Top Class Call Girl Service Available
ย 
Shazia Iqbal 2024 - Bioorganic Chemistry.pdf
Shazia Iqbal 2024 - Bioorganic Chemistry.pdfShazia Iqbal 2024 - Bioorganic Chemistry.pdf
Shazia Iqbal 2024 - Bioorganic Chemistry.pdf
ย 
Whitefield { Call Girl in Bangalore โ‚น7.5k Pick Up & Drop With Cash Payment 63...
Whitefield { Call Girl in Bangalore โ‚น7.5k Pick Up & Drop With Cash Payment 63...Whitefield { Call Girl in Bangalore โ‚น7.5k Pick Up & Drop With Cash Payment 63...
Whitefield { Call Girl in Bangalore โ‚น7.5k Pick Up & Drop With Cash Payment 63...
ย 
๐Ÿ’šCall Girls In Amritsar ๐Ÿ’ฏAnvi ๐Ÿ“ฒ๐Ÿ”8725944379๐Ÿ”Amritsar Call Girl No๐Ÿ’ฐAdvance Cash...
๐Ÿ’šCall Girls In Amritsar ๐Ÿ’ฏAnvi ๐Ÿ“ฒ๐Ÿ”8725944379๐Ÿ”Amritsar Call Girl No๐Ÿ’ฐAdvance Cash...๐Ÿ’šCall Girls In Amritsar ๐Ÿ’ฏAnvi ๐Ÿ“ฒ๐Ÿ”8725944379๐Ÿ”Amritsar Call Girl No๐Ÿ’ฐAdvance Cash...
๐Ÿ’šCall Girls In Amritsar ๐Ÿ’ฏAnvi ๐Ÿ“ฒ๐Ÿ”8725944379๐Ÿ”Amritsar Call Girl No๐Ÿ’ฐAdvance Cash...
ย 
Control of Local Blood Flow: acute and chronic
Control of Local Blood Flow: acute and chronicControl of Local Blood Flow: acute and chronic
Control of Local Blood Flow: acute and chronic
ย 
Dehradun Call Girl Service โค๏ธ๐Ÿ‘ 8854095900 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Dehradun
Dehradun Call Girl Service โค๏ธ๐Ÿ‘ 8854095900 ๐Ÿ‘„๐ŸซฆIndependent Escort Service DehradunDehradun Call Girl Service โค๏ธ๐Ÿ‘ 8854095900 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Dehradun
Dehradun Call Girl Service โค๏ธ๐Ÿ‘ 8854095900 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Dehradun
ย 
Bhawanipatna Call Girls ๐Ÿ“ž9332606886 Call Girls in Bhawanipatna Escorts servic...
Bhawanipatna Call Girls ๐Ÿ“ž9332606886 Call Girls in Bhawanipatna Escorts servic...Bhawanipatna Call Girls ๐Ÿ“ž9332606886 Call Girls in Bhawanipatna Escorts servic...
Bhawanipatna Call Girls ๐Ÿ“ž9332606886 Call Girls in Bhawanipatna Escorts servic...
ย 
Call Girls Bangalore - 450+ Call Girl Cash Payment ๐Ÿ’ฏCall Us ๐Ÿ” 6378878445 ๐Ÿ” ๐Ÿ’ƒ ...
Call Girls Bangalore - 450+ Call Girl Cash Payment ๐Ÿ’ฏCall Us ๐Ÿ” 6378878445 ๐Ÿ” ๐Ÿ’ƒ ...Call Girls Bangalore - 450+ Call Girl Cash Payment ๐Ÿ’ฏCall Us ๐Ÿ” 6378878445 ๐Ÿ” ๐Ÿ’ƒ ...
Call Girls Bangalore - 450+ Call Girl Cash Payment ๐Ÿ’ฏCall Us ๐Ÿ” 6378878445 ๐Ÿ” ๐Ÿ’ƒ ...
ย 
Jual Obat Aborsi Di Dubai UAE Wa 0838-4800-7379 Obat Penggugur Kandungan Cytotec
Jual Obat Aborsi Di Dubai UAE Wa 0838-4800-7379 Obat Penggugur Kandungan CytotecJual Obat Aborsi Di Dubai UAE Wa 0838-4800-7379 Obat Penggugur Kandungan Cytotec
Jual Obat Aborsi Di Dubai UAE Wa 0838-4800-7379 Obat Penggugur Kandungan Cytotec
ย 
Call Girls Mussoorie Just Call 8854095900 Top Class Call Girl Service Available
Call Girls Mussoorie Just Call 8854095900 Top Class Call Girl Service AvailableCall Girls Mussoorie Just Call 8854095900 Top Class Call Girl Service Available
Call Girls Mussoorie Just Call 8854095900 Top Class Call Girl Service Available
ย 

Ewan Birney Biocuration 2013

  • 2. Who am I? โ€ข Associate Director at European Bioinformatics Institute (EBI) โ€ข Involved in genomics since I was 19 (> 20 years!) โ€ข Trained as a biochemist โ€“ most people think I am CS EBI is in Hinxton, South โ€ข Analysed โ€“ sometimes lead Cambridgeshire โ€“ human/mouse/rat/platypus EBI is part of EMBL, ~like etc genomes, ENCODE, CERN for molecular biology Others.
  • 3. Molecular Biology โ€ข The study of how life works โ€“ at a molecular level โ€ข Key molecules: โ€ข DNA โ€“ Information store (Disk) โ€ข RNA โ€“ Key information transformer, also does stuff (RAM) โ€ข Proteins โ€“ The business end of life (Chip, robotic arms) โ€ข Metabolites โ€“ Fuel and signalling molecules (electricity) โ€ข Theories of how these interact โ€“ no theories of to predict what they are โ€ข Instead we determine attributes of molecules and store them in globally accessible, open, databases
  • 4. Theory ๏ƒณ Observation Can accurately predict from models Must directly observe Molecular Geology, Climate High Energy Biology Astronomy modelling Physics
  • 5. This ratio is not well correlated with data size ~60PB High Energy Physics Data Size Molecular Astronomy Biology ~5PB Climate Models Ratio of model predictability
  • 6. โ€œKnowing stuffโ€ is critical to biologyโ€ฆ โ€ข The bases of the human genome โ€ข โ€ฆ and the Mouse, Rat, Wheat, Ecoli, Plasmodium, Cowโ€ฆ. โ€ข The functions of proteins โ€ข Enzymes, Transcription Factors, Signallingโ€ฆ. โ€ข The types of cells, their lineages and organ composition โ€ข โ€ฆand all the molecular components in each cell โ€ข Small molecules โ€ข โ€ฆ and their conversions, binding partners โ€ข Structures of molecules, complexes and cells โ€ข โ€ฆ at atomic and higher resolution
  • 7. Two fundamental types of information โ€ข Experimental data โ€ข Consensus Knowledge โ€ข The result of a specific โ€ข Integration of different experiment strands of information on a โ€ข Often an experiment topic specific, data heavy part โ€ข Realised as a plus a โ€œmeta-dataโ€ part computationally accessible โ€ข Might be contradictory scheme โ€ข โ€œPrimary paperโ€ โ€ข โ€œReview articleโ€
  • 8. Five types of curation
  • 9. Experimental Data Entry โ€ข Intact โ€“ Protein:Protein interactions โ€ข GWAS Catalog โ€“ extraction of summary statistics
  • 10. Experimental Meta data capture โ€ข Sample, CDS lines in ENA โ€ข Sample in Metabolights, PRIDE etc โ€ข Machine and analysis specification in PDB, PRIDE, ENA
  • 11. Consensus integration of information โ€ข GenCode gene models in human โ€ข Summaries and GO assignment in UniProt โ€ข Pathway information in Reactome โ€ข GO assignment and summaries in MODs (eg, PomBase, WormBase, PhytoPathDB etc)
  • 12. Knowledge frameworks โ€ข The EC classification โ€ข Cell type ontologies โ€ข Cell lineages โ€“ Worms! โ€ข SnowMed, HPO etc โ€ข GO ontologies
  • 13. Knowledge management โ€ข Creation of rules representing ENA standards compliance โ€ข Cross-ontology coordination (eg, EFO) or tieing (GO ๏ƒณ ChEBI) โ€ข RuleBase / UniRule curation processes
  • 14. Data Entry vs Programming Direct Programmatic Data Entry Data Entry โ€œMessyโ€ Scripting Improved Data entry tools RuleBase, Computational Accessible Standards
  • 16. Curation Dilema โ€ข If you do your job wellโ€ฆ โ€ข If you do your job badlyโ€ฆ โ€ข Everyone assumes itโ€™s โ€ข Everyone assumes itโ€™s easy easy โ€ข People forget about the โ€ข People forget about the complexity complexity โ€ข You are ignored ๏Š โ€ข People complain ๏Š
  • 17. Why we need an infrastructureโ€ฆ
  • 19. But we only notice them when they go wrong
  • 20. Biology already needs an information infrastructure โ€ข For the human genome โ€ข (โ€ฆand the mouse, and the rat, andโ€ฆ x 150 now, 1000 in the future!) - Ensembl โ€ข For the function of genes and proteins โ€ข For all genes, in text and computational โ€“ UniProt and GO โ€ข For all 3D structures โ€ข To understand how proteins work โ€“ PDBe โ€ข For where things are expressed โ€ข The differences and functionality of cells - Atlas
  • 21. ..But this keeps on goingโ€ฆ โ€ข We have to scale across all of (interesting) life โ€ข There are a lot of species out there! โ€ข We have to handle new areas, in particular medicine โ€ข A set of European haplotypes for good imputation โ€ข A set of actionable variants in germline and cancers โ€ข We have to improve our chemical understanding โ€ข Of biological chemicals โ€ข Of chemicals which interfere with Biology
  • 22. ELIXIRโ€™s mission To build a sustainable European infrastructure for biological information, supporting life science research and its medicine translation to: environment bioindustries society 22
  • 23. How? Fully Centralised Fully Distributed Pros: Stability, reuse, Pros: Responsive, Geographic Learning ease Language responsive Cons: Hard to concentrate Cons: Internal communication overhead Expertise across of life science Harder for end users to learn Geographic, language placement Harder to provide multi-decade stability Bottlenecks and lack of diversity
  • 24. Research Healthcare International National EBI / Elixir Healthcare English National Language Low legalities Complex legalities 2
  • 25. Other infrastructures needed for biology โ€ข EuroBioImaging โ€ข Cellular and whole organism Imaging โ€ข BioBanks (BBMRI) โ€ข We need numbers โ€“ European populations โ€“ in particular for rare diseases, but also for specific sub types of common disease โ€ข Mouse models and phenotypes (Infrafrontier) โ€ข A baseline set of knockouts and phenotypes in our most tractable mammalian model โ€ข (itโ€™s hard to prove something in human) โ€ข Robust molecular assays in a clinical setting (EATRIS) โ€ข The ability to reliably use state of the art molecular techniques in a clinical research setting
  • 26. (you can follow me on twitter @ewanbirney) I blog and update this on Google Plus publically