SlideShare a Scribd company logo
1 of 14
Download to read offline
My	
  Experience	
  with	
  454	
  
                         Sequencing	
  
Sequencing	
  @	
  BitLab	
  
                                                            Dec,	
  17,	
  2010	
  

Raoul	
  Jean	
  Pierre	
  Bonnal	
  <bonnal@ingm.it>	
  
Topics	
  
•  Bacteria	
  
   –  De-­‐novo	
  
   –  Re-­‐sequencing	
  
•  Human	
  
   –  Snp	
  
   –  Mitochondria	
  
       •  Ancient	
  
       •  CharacterizaRon	
  of	
  mutaRons	
  &	
  deleRons	
  
   –  Etc…	
  
Topics	
  
•  Bacteria	
  
   –  De-­‐novo	
  
   –  Re-­‐sequencing	
  
•  Human	
  
   –  Mitochondria	
  
       •  Ancient	
  
       •  CharacterizaRon	
  of	
  mutaRons	
  &	
  deleRons	
  
   –  Snp	
  
   –  Etc…	
  
A	
  case	
  study:	
  
              Acinetobacter	
  Baumannii	
  
•  Epidemic	
  mulRdrug-­‐
   resistant	
  strain	
  
•  Single	
  chromosome	
  
    –  	
  3.904.116	
  bp	
  
•  2	
  plasmids	
  
    –  pACICU1:	
  28.279	
  bp	
  
    –  pACICU2:	
  64.366	
  bp	
  
Sequencing	
  &	
  Assembly	
  
•  Sequencing:	
  
   –  mixed	
  data	
  from	
  GS20	
  and	
  GS	
  FLX,	
  100	
  and	
  ~250	
  reads	
  
      lenght	
  
   –  	
  	
  	
  900.363	
  reads	
  
   –  96.454.548	
  total	
  bp	
  	
  
•  Assembly	
  
   –  23	
  fold	
  coverage	
  of	
  the	
  genome	
  
   –  1.036	
  conRgs	
  with	
  a	
  maximum	
  length	
  of	
  200.179	
  bp	
  
      (automaRc)	
  
   –  47	
  scaffolds	
  using	
  Paired	
  End	
  and	
  a	
  semi	
  automaRc	
  
      assembly	
  
   –  Manual	
  check	
  
Newbler’s	
  weakness	
  
                       is	
  the	
  same	
  for	
  the	
  others	
  



•  Hey,	
  there	
  is	
  something	
  strange	
  in	
  my	
  
   sequences.	
  Clean!	
  

•  Keep	
  in	
  mind	
  that	
  living	
  beings	
  are	
  complex.	
  
   Repeated	
  regions.	
  
Cleaning	
  
•  With	
  Newbler	
  by	
  Roche/454	
  you	
  can	
  define	
  a	
  
   database	
  of	
  known	
  sequences	
  and	
  the	
  reads	
  
   matching	
  against	
  it	
  will	
  be	
  removed	
  from	
  the	
  
   alignment	
  
•  What	
  to	
  clean	
  up?	
  
    –  Plasmids	
  
    –  Contaminants	
  
Cleaning:	
  Plasmids	
  
•  They	
  are	
  mixed	
  in	
  our	
  target	
  genome	
  
•  We	
  can	
  try	
  to	
  reduce	
  the	
  complexity	
  
    –  MidiPrep	
  kit	
  separate	
  the	
  plasmids	
  from	
  main	
  
       chromosome	
  
         •  Easy,	
  Quick,	
  Cheap	
  
    –  Sanger	
  sequencing	
  	
  
    –  Remove	
  the	
  plasmid	
  sequence	
  from	
  the	
  dataset	
  
•  A	
  public	
  database	
  ?	
  	
  
    –  VectorDB	
  –old-­‐	
  
    –  hpp://www.lablife.org/vectordb	
  
Repeated	
  regions	
  
•  InserRon	
  sequences,	
  transposases,	
  rRNA	
  
   operons,	
  …	
  
•  Soqware	
  tends	
  to	
  collapse	
  repeated	
  regions	
  in	
  
   single	
  conRgs	
  
   –  For	
  few	
  variaRons,	
  we	
  used	
  degenerated	
  code	
  
      (IUB)	
  
   –  Re-­‐assembly	
  each	
  conRg	
  separately	
  with	
  more	
  
      stringent	
  parameters	
  
   –  Check	
  wrong	
  frame	
  shiq	
  due	
  to	
  homopolymeric	
  
      stretches	
  
Final	
  approach	
  
•  7	
  conRgs	
  each	
  one	
  with	
  an	
  interrupRon	
  at	
  the	
  
   4.4-­‐kb	
  rRNA	
  gene	
  clusters,	
  then	
  ?	
  Who	
  knows!	
  
•  Brute	
  force	
  PCR	
  strategy	
  
    1.    14	
  primer	
  pairs	
  designed	
  inside	
  the	
  flanking	
  regions	
  
    2.    all	
  combinaRons	
  of	
  primers	
  and	
  conRgs	
  using	
  Elongase	
  
    3.    7	
  amplicons	
  sequenced	
  with	
  an	
  ABI	
  3730	
  DNA	
  sequencer	
  
    4.    Manual	
  &	
  final	
  alignment	
  
•  We	
  can’t	
  skip	
  the	
  PCR	
  we	
  must	
  test	
  our	
  
   hypotheses	
  like	
  in	
  programming	
  
    –  Usually	
  I	
  don’t	
  trust	
  myself	
  	
  
General	
  problems	
  
•  Original	
  design	
  
    –  Which	
  Paired	
  End	
  ?	
  
          •    None	
  
          •    3kb	
  
          •    8kb	
  
          •    20kb	
  
•  MulR	
  assembly	
  
    –  New	
  data	
  are	
  coming	
  in	
  
    –  New	
  soqware	
  update	
  
•  In	
  the	
  mean	
  Rme	
  the	
  experts	
  are	
  doing	
  hypotheses	
  
   and	
  they	
  want	
  to	
  keep	
  track	
  of	
  them	
  
    –  Comparison	
  of	
  different	
  assemblies	
  
Chat	
  
•  We	
  don’t	
  talk,	
  each	
  other,	
  enough!	
  
    –  Avoid	
  black	
  boxes	
  
•  The	
  team	
  is	
  made	
  of	
  	
  
    –  BioinformaRcians	
  
         •  Output	
  is	
  not	
  always	
  user	
  friendly	
  	
  
    –  Microbiologists	
  
         •  Are	
  there	
  similar	
  strains	
  or	
  parents?	
  
         •  Are	
  there	
  important	
  elements	
  that	
  we	
  must	
  find?	
  
Soqware	
  used	
  
•  Assembler	
  
     –  454	
  Newbler	
  
     –  DNAStar	
  Lasergene	
  soqware	
  hpp://www.dnastar.com/products/lasergene.php	
  
              •    Good	
  visual	
  tool	
  for	
  exploring	
  

     –  Applied	
  MATHS	
  
     –  Celera	
  asseblers	
  
•  AnnotaRon	
  
     –    FGENESB	
  hpp://www.soqberry.com/	
  
     –    GeneMark	
  
     –    GLIMMER	
  
     –    IS	
  Finder	
  hpp://www-­‐is.biotoul.fr	
  
     –  TIGR/JCVI	
  	
  &	
  Manatee	
  
•  Synteny	
  
     –  Mauve	
  
•  View	
  
     –  Circos	
  for	
  represenRng	
  informaRon	
  in	
  a	
  circular	
  way,	
  cute	
  images.	
  
     –  Consed	
  
Acknowledgements	
  
BitLab	
                          Fondazione	
  INGM	
  
Oswaldo,	
  Trelles	
             Massimiliano,	
  Pagani	
  
Antonio,	
  Muñoz	
  Mérida	
  

                                  Roche	
  
                                  Michele,	
  Iacono	
  

More Related Content

What's hot

De novo genome assembly - IMB Winter School - 7 July 2015
De novo genome assembly - IMB Winter School - 7 July 2015De novo genome assembly - IMB Winter School - 7 July 2015
De novo genome assembly - IMB Winter School - 7 July 2015Torsten Seemann
 
2014 khmer protocols
2014 khmer protocols2014 khmer protocols
2014 khmer protocolsc.titus.brown
 
RNA sequencing: advances and opportunities
RNA sequencing: advances and opportunities RNA sequencing: advances and opportunities
RNA sequencing: advances and opportunities Paolo Dametto
 
Comparing bacterial isolates - T.Seemann - IMB winter school 2016 - fri 8 jul...
Comparing bacterial isolates - T.Seemann - IMB winter school 2016 - fri 8 jul...Comparing bacterial isolates - T.Seemann - IMB winter school 2016 - fri 8 jul...
Comparing bacterial isolates - T.Seemann - IMB winter school 2016 - fri 8 jul...Torsten Seemann
 
RNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential ExpressionRNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential ExpressionJatinder Singh
 
ECCMID 2015 - So I have sequenced my genome ... what now?
ECCMID 2015 - So I have sequenced my genome ... what now?ECCMID 2015 - So I have sequenced my genome ... what now?
ECCMID 2015 - So I have sequenced my genome ... what now?Nick Loman
 
Eccmid meet the-expert
Eccmid meet the-expertEccmid meet the-expert
Eccmid meet the-expertNick Loman
 
ASM Microbe 2017: Reaching the Parts Other Methods Can't: Long Reads for Micr...
ASM Microbe 2017: Reaching the Parts Other Methods Can't: Long Reads for Micr...ASM Microbe 2017: Reaching the Parts Other Methods Can't: Long Reads for Micr...
ASM Microbe 2017: Reaching the Parts Other Methods Can't: Long Reads for Micr...Nick Loman
 
Long read sequencing - LSCC lab talk - fri 5 june 2015
Long read sequencing - LSCC lab talk - fri 5 june 2015Long read sequencing - LSCC lab talk - fri 5 june 2015
Long read sequencing - LSCC lab talk - fri 5 june 2015Torsten Seemann
 
Next Generation Sequencing (NGS)
Next Generation Sequencing (NGS)Next Generation Sequencing (NGS)
Next Generation Sequencing (NGS)LOGESWARAN KA
 
RNA-seq Data Analysis Overview
RNA-seq Data Analysis OverviewRNA-seq Data Analysis Overview
RNA-seq Data Analysis OverviewSean Davis
 

What's hot (11)

De novo genome assembly - IMB Winter School - 7 July 2015
De novo genome assembly - IMB Winter School - 7 July 2015De novo genome assembly - IMB Winter School - 7 July 2015
De novo genome assembly - IMB Winter School - 7 July 2015
 
2014 khmer protocols
2014 khmer protocols2014 khmer protocols
2014 khmer protocols
 
RNA sequencing: advances and opportunities
RNA sequencing: advances and opportunities RNA sequencing: advances and opportunities
RNA sequencing: advances and opportunities
 
Comparing bacterial isolates - T.Seemann - IMB winter school 2016 - fri 8 jul...
Comparing bacterial isolates - T.Seemann - IMB winter school 2016 - fri 8 jul...Comparing bacterial isolates - T.Seemann - IMB winter school 2016 - fri 8 jul...
Comparing bacterial isolates - T.Seemann - IMB winter school 2016 - fri 8 jul...
 
RNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential ExpressionRNASeq - Analysis Pipeline for Differential Expression
RNASeq - Analysis Pipeline for Differential Expression
 
ECCMID 2015 - So I have sequenced my genome ... what now?
ECCMID 2015 - So I have sequenced my genome ... what now?ECCMID 2015 - So I have sequenced my genome ... what now?
ECCMID 2015 - So I have sequenced my genome ... what now?
 
Eccmid meet the-expert
Eccmid meet the-expertEccmid meet the-expert
Eccmid meet the-expert
 
ASM Microbe 2017: Reaching the Parts Other Methods Can't: Long Reads for Micr...
ASM Microbe 2017: Reaching the Parts Other Methods Can't: Long Reads for Micr...ASM Microbe 2017: Reaching the Parts Other Methods Can't: Long Reads for Micr...
ASM Microbe 2017: Reaching the Parts Other Methods Can't: Long Reads for Micr...
 
Long read sequencing - LSCC lab talk - fri 5 june 2015
Long read sequencing - LSCC lab talk - fri 5 june 2015Long read sequencing - LSCC lab talk - fri 5 june 2015
Long read sequencing - LSCC lab talk - fri 5 june 2015
 
Next Generation Sequencing (NGS)
Next Generation Sequencing (NGS)Next Generation Sequencing (NGS)
Next Generation Sequencing (NGS)
 
RNA-seq Data Analysis Overview
RNA-seq Data Analysis OverviewRNA-seq Data Analysis Overview
RNA-seq Data Analysis Overview
 

Viewers also liked

Enlace para trabajar con los euros
Enlace para trabajar con los eurosEnlace para trabajar con los euros
Enlace para trabajar con los euroscsanieves
 
Karen Fuller Final Online Presentation
Karen Fuller Final Online PresentationKaren Fuller Final Online Presentation
Karen Fuller Final Online PresentationKaren Fuller
 
1160 S Michigan Ave. # 2707
1160 S Michigan Ave.  # 27071160 S Michigan Ave.  # 2707
1160 S Michigan Ave. # 2707Anne Rossley
 
Unseen Things in Thailandt+Loyal and Projects2+ป.1+108+dltvengp1+55t2eng p01 ...
Unseen Things in Thailandt+Loyal and Projects2+ป.1+108+dltvengp1+55t2eng p01 ...Unseen Things in Thailandt+Loyal and Projects2+ป.1+108+dltvengp1+55t2eng p01 ...
Unseen Things in Thailandt+Loyal and Projects2+ป.1+108+dltvengp1+55t2eng p01 ...Prachoom Rangkasikorn
 

Viewers also liked (6)

Presentación13
Presentación13Presentación13
Presentación13
 
Enlace para trabajar con los euros
Enlace para trabajar con los eurosEnlace para trabajar con los euros
Enlace para trabajar con los euros
 
Karen Fuller Final Online Presentation
Karen Fuller Final Online PresentationKaren Fuller Final Online Presentation
Karen Fuller Final Online Presentation
 
1160 S Michigan Ave. # 2707
1160 S Michigan Ave.  # 27071160 S Michigan Ave.  # 2707
1160 S Michigan Ave. # 2707
 
Unseen Things in Thailandt+Loyal and Projects2+ป.1+108+dltvengp1+55t2eng p01 ...
Unseen Things in Thailandt+Loyal and Projects2+ป.1+108+dltvengp1+55t2eng p01 ...Unseen Things in Thailandt+Loyal and Projects2+ป.1+108+dltvengp1+55t2eng p01 ...
Unseen Things in Thailandt+Loyal and Projects2+ป.1+108+dltvengp1+55t2eng p01 ...
 
JAMAL_RESUME
JAMAL_RESUMEJAMAL_RESUME
JAMAL_RESUME
 

Similar to Sequencing @ BitLab

Next Gen Sequencing (NGS) Technology Overview
Next Gen Sequencing (NGS) Technology OverviewNext Gen Sequencing (NGS) Technology Overview
Next Gen Sequencing (NGS) Technology OverviewDominic Suciu
 
CRISPR presentation extended Mouse Modeling
CRISPR presentation extended Mouse ModelingCRISPR presentation extended Mouse Modeling
CRISPR presentation extended Mouse ModelingTristan Kempston
 
Introduction to Single-cell RNA-seq
Introduction to Single-cell RNA-seqIntroduction to Single-cell RNA-seq
Introduction to Single-cell RNA-seqTimothy Tickle
 
Tools and Techniques (Molecular & Biochemical) to Study Physiological Process...
Tools and Techniques (Molecular & Biochemical) to Study Physiological Process...Tools and Techniques (Molecular & Biochemical) to Study Physiological Process...
Tools and Techniques (Molecular & Biochemical) to Study Physiological Process...Agronomist Wasim
 
Genomics: Organization of Genome, Strategies of Genome Sequencing, Model Plan...
Genomics: Organization of Genome, Strategies of Genome Sequencing, Model Plan...Genomics: Organization of Genome, Strategies of Genome Sequencing, Model Plan...
Genomics: Organization of Genome, Strategies of Genome Sequencing, Model Plan...Promila Sheoran
 
Next generation sequencing methods
Next generation sequencing methods Next generation sequencing methods
Next generation sequencing methods Mrinal Vashisth
 
Enabling Discoveries at High Throughput - Small molecule and RNAi HTS at the ...
Enabling Discoveries at High Throughput - Small molecule and RNAi HTS at the ...Enabling Discoveries at High Throughput - Small molecule and RNAi HTS at the ...
Enabling Discoveries at High Throughput - Small molecule and RNAi HTS at the ...Rajarshi Guha
 
NEXT GENERATION SEQUENCING
NEXT GENERATION SEQUENCINGNEXT GENERATION SEQUENCING
NEXT GENERATION SEQUENCINGBilal Nizami
 
Advanced diagnostic techniques
Advanced diagnostic techniquesAdvanced diagnostic techniques
Advanced diagnostic techniquesD Venkatesh Kumar
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation SequencingAtifa Ambreen
 
High Throughput Sequencing Technologies: What We Can Know
High Throughput Sequencing Technologies: What We Can KnowHigh Throughput Sequencing Technologies: What We Can Know
High Throughput Sequencing Technologies: What We Can KnowBrian Krueger
 
40 Years of Genome Assembly: Are We Done Yet?
40 Years of Genome Assembly: Are We Done Yet?40 Years of Genome Assembly: Are We Done Yet?
40 Years of Genome Assembly: Are We Done Yet?Adam Phillippy
 
Measurements for Reference Material Characterization Working Group Summary Au...
Measurements for Reference Material Characterization Working Group Summary Au...Measurements for Reference Material Characterization Working Group Summary Au...
Measurements for Reference Material Characterization Working Group Summary Au...GenomeInABottle
 
Time and Money: Techniques for Neural Gene Expression Profiling
Time and Money: Techniques for Neural Gene Expression ProfilingTime and Money: Techniques for Neural Gene Expression Profiling
Time and Money: Techniques for Neural Gene Expression ProfilingRayna Harris
 
DNA Microarray introdution and application
DNA Microarray introdution and applicationDNA Microarray introdution and application
DNA Microarray introdution and applicationNeeraj Sharma
 
Microarray
MicroarrayMicroarray
Microarrayjain7177
 

Similar to Sequencing @ BitLab (20)

Next Gen Sequencing (NGS) Technology Overview
Next Gen Sequencing (NGS) Technology OverviewNext Gen Sequencing (NGS) Technology Overview
Next Gen Sequencing (NGS) Technology Overview
 
CRISPR presentation extended Mouse Modeling
CRISPR presentation extended Mouse ModelingCRISPR presentation extended Mouse Modeling
CRISPR presentation extended Mouse Modeling
 
Introduction to Single-cell RNA-seq
Introduction to Single-cell RNA-seqIntroduction to Single-cell RNA-seq
Introduction to Single-cell RNA-seq
 
Tools and Techniques (Molecular & Biochemical) to Study Physiological Process...
Tools and Techniques (Molecular & Biochemical) to Study Physiological Process...Tools and Techniques (Molecular & Biochemical) to Study Physiological Process...
Tools and Techniques (Molecular & Biochemical) to Study Physiological Process...
 
Microbial physiology in genomic era
Microbial physiology in genomic eraMicrobial physiology in genomic era
Microbial physiology in genomic era
 
Genomics: Organization of Genome, Strategies of Genome Sequencing, Model Plan...
Genomics: Organization of Genome, Strategies of Genome Sequencing, Model Plan...Genomics: Organization of Genome, Strategies of Genome Sequencing, Model Plan...
Genomics: Organization of Genome, Strategies of Genome Sequencing, Model Plan...
 
Next generation sequencing methods
Next generation sequencing methods Next generation sequencing methods
Next generation sequencing methods
 
Hamas 1
Hamas 1Hamas 1
Hamas 1
 
Enabling Discoveries at High Throughput - Small molecule and RNAi HTS at the ...
Enabling Discoveries at High Throughput - Small molecule and RNAi HTS at the ...Enabling Discoveries at High Throughput - Small molecule and RNAi HTS at the ...
Enabling Discoveries at High Throughput - Small molecule and RNAi HTS at the ...
 
NEXT GENERATION SEQUENCING
NEXT GENERATION SEQUENCINGNEXT GENERATION SEQUENCING
NEXT GENERATION SEQUENCING
 
Advanced diagnostic techniques
Advanced diagnostic techniquesAdvanced diagnostic techniques
Advanced diagnostic techniques
 
Snps and microarray
Snps and microarraySnps and microarray
Snps and microarray
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation Sequencing
 
High Throughput Sequencing Technologies: What We Can Know
High Throughput Sequencing Technologies: What We Can KnowHigh Throughput Sequencing Technologies: What We Can Know
High Throughput Sequencing Technologies: What We Can Know
 
40 Years of Genome Assembly: Are We Done Yet?
40 Years of Genome Assembly: Are We Done Yet?40 Years of Genome Assembly: Are We Done Yet?
40 Years of Genome Assembly: Are We Done Yet?
 
Measurements for Reference Material Characterization Working Group Summary Au...
Measurements for Reference Material Characterization Working Group Summary Au...Measurements for Reference Material Characterization Working Group Summary Au...
Measurements for Reference Material Characterization Working Group Summary Au...
 
Time and Money: Techniques for Neural Gene Expression Profiling
Time and Money: Techniques for Neural Gene Expression ProfilingTime and Money: Techniques for Neural Gene Expression Profiling
Time and Money: Techniques for Neural Gene Expression Profiling
 
DNA Microarray introdution and application
DNA Microarray introdution and applicationDNA Microarray introdution and application
DNA Microarray introdution and application
 
NGS.pptx
NGS.pptxNGS.pptx
NGS.pptx
 
Microarray
MicroarrayMicroarray
Microarray
 

Recently uploaded

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 

Recently uploaded (20)

Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 

Sequencing @ BitLab

  • 1. My  Experience  with  454   Sequencing   Sequencing  @  BitLab   Dec,  17,  2010   Raoul  Jean  Pierre  Bonnal  <bonnal@ingm.it>  
  • 2. Topics   •  Bacteria   –  De-­‐novo   –  Re-­‐sequencing   •  Human   –  Snp   –  Mitochondria   •  Ancient   •  CharacterizaRon  of  mutaRons  &  deleRons   –  Etc…  
  • 3. Topics   •  Bacteria   –  De-­‐novo   –  Re-­‐sequencing   •  Human   –  Mitochondria   •  Ancient   •  CharacterizaRon  of  mutaRons  &  deleRons   –  Snp   –  Etc…  
  • 4. A  case  study:   Acinetobacter  Baumannii   •  Epidemic  mulRdrug-­‐ resistant  strain   •  Single  chromosome   –   3.904.116  bp   •  2  plasmids   –  pACICU1:  28.279  bp   –  pACICU2:  64.366  bp  
  • 5. Sequencing  &  Assembly   •  Sequencing:   –  mixed  data  from  GS20  and  GS  FLX,  100  and  ~250  reads   lenght   –       900.363  reads   –  96.454.548  total  bp     •  Assembly   –  23  fold  coverage  of  the  genome   –  1.036  conRgs  with  a  maximum  length  of  200.179  bp   (automaRc)   –  47  scaffolds  using  Paired  End  and  a  semi  automaRc   assembly   –  Manual  check  
  • 6. Newbler’s  weakness   is  the  same  for  the  others   •  Hey,  there  is  something  strange  in  my   sequences.  Clean!   •  Keep  in  mind  that  living  beings  are  complex.   Repeated  regions.  
  • 7. Cleaning   •  With  Newbler  by  Roche/454  you  can  define  a   database  of  known  sequences  and  the  reads   matching  against  it  will  be  removed  from  the   alignment   •  What  to  clean  up?   –  Plasmids   –  Contaminants  
  • 8. Cleaning:  Plasmids   •  They  are  mixed  in  our  target  genome   •  We  can  try  to  reduce  the  complexity   –  MidiPrep  kit  separate  the  plasmids  from  main   chromosome   •  Easy,  Quick,  Cheap   –  Sanger  sequencing     –  Remove  the  plasmid  sequence  from  the  dataset   •  A  public  database  ?     –  VectorDB  –old-­‐   –  hpp://www.lablife.org/vectordb  
  • 9. Repeated  regions   •  InserRon  sequences,  transposases,  rRNA   operons,  …   •  Soqware  tends  to  collapse  repeated  regions  in   single  conRgs   –  For  few  variaRons,  we  used  degenerated  code   (IUB)   –  Re-­‐assembly  each  conRg  separately  with  more   stringent  parameters   –  Check  wrong  frame  shiq  due  to  homopolymeric   stretches  
  • 10. Final  approach   •  7  conRgs  each  one  with  an  interrupRon  at  the   4.4-­‐kb  rRNA  gene  clusters,  then  ?  Who  knows!   •  Brute  force  PCR  strategy   1.  14  primer  pairs  designed  inside  the  flanking  regions   2.  all  combinaRons  of  primers  and  conRgs  using  Elongase   3.  7  amplicons  sequenced  with  an  ABI  3730  DNA  sequencer   4.  Manual  &  final  alignment   •  We  can’t  skip  the  PCR  we  must  test  our   hypotheses  like  in  programming   –  Usually  I  don’t  trust  myself    
  • 11. General  problems   •  Original  design   –  Which  Paired  End  ?   •  None   •  3kb   •  8kb   •  20kb   •  MulR  assembly   –  New  data  are  coming  in   –  New  soqware  update   •  In  the  mean  Rme  the  experts  are  doing  hypotheses   and  they  want  to  keep  track  of  them   –  Comparison  of  different  assemblies  
  • 12. Chat   •  We  don’t  talk,  each  other,  enough!   –  Avoid  black  boxes   •  The  team  is  made  of     –  BioinformaRcians   •  Output  is  not  always  user  friendly     –  Microbiologists   •  Are  there  similar  strains  or  parents?   •  Are  there  important  elements  that  we  must  find?  
  • 13. Soqware  used   •  Assembler   –  454  Newbler   –  DNAStar  Lasergene  soqware  hpp://www.dnastar.com/products/lasergene.php   •  Good  visual  tool  for  exploring   –  Applied  MATHS   –  Celera  asseblers   •  AnnotaRon   –  FGENESB  hpp://www.soqberry.com/   –  GeneMark   –  GLIMMER   –  IS  Finder  hpp://www-­‐is.biotoul.fr   –  TIGR/JCVI    &  Manatee   •  Synteny   –  Mauve   •  View   –  Circos  for  represenRng  informaRon  in  a  circular  way,  cute  images.   –  Consed  
  • 14. Acknowledgements   BitLab   Fondazione  INGM   Oswaldo,  Trelles   Massimiliano,  Pagani   Antonio,  Muñoz  Mérida   Roche   Michele,  Iacono