RNA-Seq


                       (yag_ays)
http://yag-ays.jp/pdf/20110602labseminar_pub.pdf
censored
usagi



usamimi
NGS
  (Next Generation Sequencing)

         RNA-Seq
    (Transcriptome Analysis)

        de novo
Transcriptome Assembly
Central Dogma

 DNA (A T G C)
 mRNA (Transcriptome)
 Protein
NGS RNA-Seq
A T G C
 mRNA
 NGS
  • Illumina / Solexa GA
  • ABI / SOLiD
  • Roche / 454
  • PacBio
  • Helicos / HeliScope
  • Ion Torrent    etc...

  TTAGCCTTAGCTTCC
  GTCGCAACTTCCTTA
  TTCACGAGCTTGATG
  TTGCGGATCACTTTG
RNA-Seq

 'align-then-assemble' approach    'assemble-then-align' approach

 • 454
 • cDNA
Sujai Kumar and Mark L Blaxter: Comparing de novo assemblers for 454 transcriptome data (2010)
 Newbler 2.5
 ...
 Trinity...!!
1. Newbler 2.5
    • Roche 454
    • 454
2. Trinity
    • Broad Institute
    • 454 ( )
    • Nat Biotechnol. 2011 May *
* Grabherr MG, Haas BJ, Yassour M et al.: Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011 May 15
1. Newbler 2.5
    • Overlap-Layout-Consensus (OLC)
2. Trinity
    I. Inchworm: k-mer graph
    II. Chrysalis: contig pool
    III. Butterfly: de Bruijn graph
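The de Bruijn graph idea named for Butterfly can be illustrated with a minimal sketch. This is a generic textbook construction, not Trinity's actual implementation; the function name `de_bruijn_graph`, the toy reads, and the choice `k=4` are assumptions made only for illustration.

```python
from collections import defaultdict


def de_bruijn_graph(reads, k):
    """Build a de Bruijn graph: nodes are (k-1)-mers, and every k-mer
    found in a read adds one edge from its prefix to its suffix."""
    graph = defaultdict(list)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].append(kmer[1:])
    return dict(graph)


# Two toy reads that overlap on "AGCC" (hypothetical data).
reads = ["TTAGCC", "AGCCTT"]
graph = de_bruijn_graph(reads, k=4)
for node, successors in graph.items():
    print(node, "->", successors)
```

Because both reads contain the k-mer AGCC, the edge AGC -> GCC appears twice; assemblers typically use such edge multiplicities as a coverage signal when choosing paths through the graph.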


                       2
Roche 454 pyrosequencing
 usamimi, 0.3M reads (sff or fastq format)
        ↓
 Newbler 2.5 (fasta format)    |    Trinity (fasta format)
        ↓
 GMAP with usagi CDS
        ↓
 (gff format)                  |    (gff format)
S. Kumar et al. (2010)
Newbler 2.5 Trinity

                      Newbler 2.5    Trinity
 Number of contigs    19,753         20,758
 Total bases          9,651,390      10,275,166
 Max contig length    2,878          2,151
 Mean contig length   488.6          495
 N50                  581            616
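The summary statistics in the table above can be computed from a list of contig lengths roughly as follows. This is a sketch using the common definition of N50 (the length of the contig at which the cumulative sum over contigs sorted longest-first reaches half of the total bases); the slides do not say which tool produced the numbers, and the function names and toy lengths are hypothetical.

```python
def n50(lengths):
    """Return N50: walk the contigs from longest to shortest and report
    the length at which the running total reaches half of all bases."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("no contigs given")


def summarize(lengths):
    """Collect the same rows as the comparison table."""
    return {
        "number_of_contigs": len(lengths),
        "total_bases": sum(lengths),
        "max_contig_length": max(lengths),
        "mean_contig_length": sum(lengths) / len(lengths),
        "n50": n50(lengths),
    }


# Toy contig lengths (hypothetical, not the data from the slides).
stats = summarize([100, 200, 300, 400, 500])
print(stats)
```

With these toy lengths the total is 1,500 bases; the two longest contigs (500 + 400) already cover more than half, so N50 is 400.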
[Figure: PRINSEQ plots for Newbler 2.5 (N = 19,753) and Trinity (N = 20,758)]
http://edwards.sdsu.edu/prinseq_beta/
usagi CDS

 usagi CDS: 30,000

                      Newbler 2.5    Trinity
 all                  15,498         15,524
 ≥ 80% alignment      14,583         14,697
 ≥ 90% alignment      8,466          8,665
 ≥ 95% alignment      1,059          1,191
 100% alignment       66             30

[Figure: bar chart of the same counts for Newbler 2.5 and Trinity]
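Cumulative counts like the rows above could be tallied as in the sketch below. The slides do not define how the alignment percentage was derived from the GMAP output, so this assumes one coverage fraction per aligned CDS (0.0 to 1.0); the function name and the toy values are hypothetical.

```python
def count_by_threshold(coverages, thresholds=(0.0, 0.80, 0.90, 0.95, 1.0)):
    """For each threshold, count the aligned CDS whose coverage fraction
    is at least that threshold (0.0 corresponds to the 'all' row)."""
    return {t: sum(1 for c in coverages if c >= t) for t in thresholds}


# Toy coverage fractions for five aligned CDS (hypothetical data).
coverages = [1.0, 0.97, 0.92, 0.85, 0.5]
counts = count_by_threshold(coverages)
for threshold, n in counts.items():
    print(f">= {threshold:.0%}: {n}")
```

Because every threshold is applied to the same list, the counts can only decrease as the threshold rises, which matches the pattern in the table.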
usagi

 Newbler 2.5: 12,417 genes    Trinity: 10,433 genes

 [Venn diagram] Newbler 2.5 only: 2,990    Shared: 9,427    Trinity only: 1,006
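The three numbers are consistent with a two-set Venn diagram (2,990 + 9,427 = 12,417 and 9,427 + 1,006 = 10,433). A minimal sketch of how such counts are obtained from two lists of gene identifiers follows; the function name and the toy identifiers are hypothetical.

```python
def venn_counts(genes_a, genes_b):
    """Return (only in A, shared, only in B) for two collections of gene IDs."""
    a, b = set(genes_a), set(genes_b)
    return len(a - b), len(a & b), len(b - a)


# Toy gene IDs hit by each assembly (hypothetical data).
newbler_hits = ["g1", "g2", "g3"]
trinity_hits = ["g2", "g3", "g4", "g5"]
regions = venn_counts(newbler_hits, trinity_hits)
print(regions)

# Consistency check on the numbers shown on the slide.
newbler_only, shared, trinity_only = 2990, 9427, 1006
print(newbler_only + shared, shared + trinity_only)  # prints 12417 10433
```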
...
S. Kumar et al. (2010)
Poly(A/T)
Poly(A/T): Trinity > Newbler 2.5

            Newbler 2.5      Trinity
 Poly T     257 (1.30%)      3,773 (18.18%)
 Poly A     539 (2.73%)      2,349 (11.32%)

[Figure: PRINSEQ plots, each panel labeled "20 bp"]
http://edwards.sdsu.edu/prinseq_beta/
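The percentages are the counts divided by each assembly's contig total (for example 257 / 19,753 ≈ 1.30%). The sketch below shows one way contigs with a poly(A) or poly(T) run at either end could be counted. It is not PRINSEQ's implementation; the function name, the `min_len` cutoff, and the toy contigs are assumptions for illustration only.

```python
import re


def count_poly_tails(contigs, base, min_len=5):
    """Count contigs that start or end with a run of `base` at least
    `min_len` long, and return (count, percentage of all contigs)."""
    if not contigs:
        raise ValueError("no contigs given")
    run = re.compile(rf"^{base}{{{min_len},}}|{base}{{{min_len},}}$")
    hits = sum(1 for seq in contigs if run.search(seq.upper()))
    return hits, 100.0 * hits / len(contigs)


# Toy contigs (hypothetical data).
contigs = ["AAAAAAGCT", "GCTTTTTT", "GCATGCAT", "ACGTAAAAA"]
poly_a = count_poly_tails(contigs, "A")
poly_t = count_poly_tails(contigs, "T")
print("Poly A:", poly_a)
print("Poly T:", poly_t)
```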
Poly(A/T) Quality Value
 → Newbler Quality trimming ...?
Trinity Newbler 2.5

1     usagi CDS
                      Trinity
                      454
2     Poly(A/T)       Trinity
      Newbler 2.5
Method: Parameters
• Newbler 2.5
 • -notrim
 • -urt
• Trinity (20110519 ver.)
 • --seqType=fq
 • --single
 • --min_contig_length 50
 • --run_butterfly
 • --CPU 4
 • --bfly_opts "--compatible_path_extension --stderr"
