pipeline_structure_overview
Published in Design
Transcript

  • 1. NPG Pipeline Overview: analyse_RTA; PB_cal; post_qseq; post_qc_review; manual qc
  • 2. analyse_RTA: OLB (Bustard) to create qseq; demultiplex; CASAVA (Gerald); Recalibration
  • 3. PB_cal: OLB (bcl2qseq) to create qseq; demultiplex; PB_cal lane recalibration; No recalibration
  • 4. post_qseq: Produce per lane fastq (qseq2fastq.pl); Produce per lane srf (illumina2srf & srf_index_hash); Split out nonconsented data; Split fastqs by multiplex tag; qX_yield; insert_size; adapter; sequence_error; gc_fraction
  • 5. post_qseq: Create run analysis schema information; contamination; bam file generation; md5 generation; bam_markduplicates
  • 6. post_qseq: gc_bias; bam indexing; Check cluster counts; Manual QC Stage
  • 7. post_qc_review: archive_to_sra; archive_to_irods; Upload fastqcheck; Upload auto_qc; Upload illumina analysis; Tidy up staging area
  • 8. Additional Notes ● Spider runs at the start, and Finish at the end, of all the pipelines. ● Spider caches web pages that are used throughout the pipeline and sets an environment variable so that all launched jobs can access them. ● Finish is very important: it ties off the log files and writes a JSON string of the processes launched, which is needed for the schema generation.
  • 9. Additional Notes ● Status changes have been left out, along with some file checking and the creation of tag-specific lane files, which occur at the start of the primary analysis pipelines. These happen, but are not directly responsible for the files and QC that you see. ● The production version of the primary pipeline launches a version of the secondary pipeline which creates a Latest_Summary link to its archival files and QC.
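
The Spider and Finish notes in the transcript can be illustrated with a minimal Python sketch. It is not the NPG implementation: the function names, the `NPG_CACHE_DIR_EXAMPLE` environment variable, the JSON layout, and the assumption that a run consists of one primary pipeline followed by post_qseq and post_qc_review are all invented for illustration. It only shows the shape described in the slides: a caching step first, the stages in order, and a final step that records the launched processes as JSON.

```python
import json
import os

# Stage names are taken from the slide titles. The choice of a single primary
# pipeline (analyse_RTA) followed by the two later stages is an assumption.
STAGES = ["analyse_RTA", "post_qseq", "post_qc_review"]


def spider(cache_dir):
    """Stand-in for Spider: prepare a cache and advertise it via the environment."""
    os.makedirs(cache_dir, exist_ok=True)
    # Hypothetical variable name; the real pipeline's variable is not given
    # in the slides.
    os.environ["NPG_CACHE_DIR_EXAMPLE"] = cache_dir
    return cache_dir


def finish(launched, log_path):
    """Stand-in for Finish: write the list of launched processes as JSON."""
    launched.append("finish")
    with open(log_path, "w") as handle:
        json.dump({"launched": launched}, handle)
    return launched


def run_pipeline(stages, cache_dir, log_path):
    """Run Spider first, then each stage in order, then Finish."""
    launched = ["spider"]
    spider(cache_dir)
    for stage in stages:
        # A real stage would submit jobs that read the cache location from
        # the environment; here the stage name is only recorded.
        launched.append(stage)
    return finish(launched, log_path)
```

Calling `run_pipeline(STAGES, "cache", "launched.json")` returns the ordered list of process names and leaves the same list in `launched.json`, which is the kind of record the slides say the schema generation depends on.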