SlideShare a Scribd company logo
1 of 1
ArthropodEST: K-State Bioinformatics EST analysis pipeline * Sanjay Chellapilla 1 , Yoonseong Park 2 , Doina Caragea 3  and Susan J. Brown 1 1 Bioinformatics Center, Division of Biology  2 Department of Entomology   3 Department of Computing and Information Sciences Kansas State University, Manhattan KS 66506 ABSTRACT Expressed Sequence Tags (ESTs), produced by single-pass end-sequencing of cDNA clones, generate large datasets that are instrumental in gene discovery and gene sequence determination. Although several EST data analysis pipelines are available on the WWW ( e.g.  ESTpass, EGassembler, ESTexplorer etc.), the WWW-accessible K-State Bioinformatics EST analysis pipeline  ‘ArthropodEST’  goes further than these existing pipelines in providing more options and analyses, along with a user-friendly interface. The pipeline was developed utilizing freely available bioinformatics and system software (academic or F/OSS licenses). Available options in the pipeline include input sequence cleaning and screening for vectors and contaminants, masking repetitive sequences using repeat databases, clustering and assembly into contigs, computing ORFs (Open Reading Frames) and/or signal-peptide predictions, and assigning functional annotations to the contigs and singletons. The pipeline sends out automatic result notification email(s) containing a unique URL to download results from, to the user‘s email address.  A summary report (automatically generated) of the analyses is included in the results available for download. The pipeline is accessible at  http://bioinformatics.ksu.edu/ArthropodEST/ Acknowledgements:   Supported by KSU-TE-AGC (SC), KSU Bioinformatics Center (DC, SC) and K-INBRE (DC, SC). KANSAS STATE UNIVERSITY   KSU BIOINFORMATICS CENTER KSU ARTHROPOD GENOMICS CENTER  K-INBRE Input sequences cleaning Vector/contaminant screening Assembly with optional prior clustering into contigs, singletons User downloads results and report from unique URL automatically sent by email Process user inputs, display project-receipt confirmation and summary, send automatic confirmation email, invoke pipeline shell script Further analyses: functional annotations and/or signal-peptide predictions server-side CGI script server-side Pipeline shell-script client-side (User) client-side (User) ArthropodEST homepage COMPONENTS OF THE PIPELINE (a) System software: GNU/Linux Ubuntu 2.6.24-23-server, bash  3.2.39, Apache 2.2.8 with mod_perl/2.0.3, PERL 5.8.8 with PERL modules CGI 3.29, Mail:Mailer 1.74, File::Temp 0.18, MySQL 5.0 and Postfix 2.5.4 Mail Transport Agent (MTA). (b) Bioinformatics software: - TGICL software suite [ http://compbio.dfci.harvard.edu/tgi/software/ ] -   Vector databases: NCBI UniVec [ http://www.ncbi.nlm.nih.gov/VecScreen/UniVec.html ] EMBL EmVec [ ftp://ftp.ebi.ac.uk/pub/databases/emvec/ ] -   RepeatMasker [ http://www.RepeatMasker.org/ ]  and associated RepBase libraries [ http://www.girinst.org/ ] requires either  cross_match  [ http://www.phrap.org/phredphrapconsed.html ]   or  wu-blastall  [ http://blast.wustl.edu/ ] - CAP3 sequence-assembly program [ http://seq.cs.iastate.edu/ ]     - NCBI BLAST suite [ http://www.ncbi.nlm.nih.gov/BLAST/download.shtml ]   and/or  wu-blastall  [ http://blast.wustl.edu/ ] - blast2GO pipeline version B2G4PIPE [ http://blast2go.bioinfo.cipf.es/ ] -   signalp   [ http://www.cbs.dtu.dk/services/SignalP/ ] and EMBOSS [ http://emboss.sourceforge.net/ ] (c) In-house developed software: WWW-interface HTML/CSS, server-side CGI, PERL, bash shell and awk scripts User-input: project name, e-mail address, input files  and options/parameters for analyses Repeat-masking with standard RepBase libraries WORKFLOW

More Related Content

Similar to Arthropod es tpipeline_poster

Accelerating GWAS epistatic interaction analysis methods
Accelerating GWAS epistatic interaction analysis methodsAccelerating GWAS epistatic interaction analysis methods
Accelerating GWAS epistatic interaction analysis methodsPriscill Orue Esquivel
 
2014 Taverna Tutorial Introduction to eScience and workflows
2014 Taverna Tutorial Introduction to eScience and workflows2014 Taverna Tutorial Introduction to eScience and workflows
2014 Taverna Tutorial Introduction to eScience and workflowsmyGrid team
 
Open Source Software Tools for Synchrophasor Applications
Open Source Software Tools for  Synchrophasor ApplicationsOpen Source Software Tools for  Synchrophasor Applications
Open Source Software Tools for Synchrophasor ApplicationsLuigi Vanfretti
 
How can you access PubChem programmatically?
How can you access PubChem programmatically?How can you access PubChem programmatically?
How can you access PubChem programmatically?Sunghwan Kim
 
Imgc2011 bioinformatics tutorial
Imgc2011 bioinformatics tutorialImgc2011 bioinformatics tutorial
Imgc2011 bioinformatics tutorialDeanna Church
 
How to be a bioinformatician
How to be a bioinformaticianHow to be a bioinformatician
How to be a bioinformaticianChristian Frech
 
CromoCat: New developpments to genetic diversity databasing
CromoCat: New developpments to genetic diversity databasingCromoCat: New developpments to genetic diversity databasing
CromoCat: New developpments to genetic diversity databasingssuser90148d
 
1st KeyStone Summer School - Hackathon Challenge
1st KeyStone Summer School - Hackathon Challenge1st KeyStone Summer School - Hackathon Challenge
1st KeyStone Summer School - Hackathon ChallengeJoel Azzopardi
 
Jeff Grethe: CAMERA
Jeff Grethe: CAMERAJeff Grethe: CAMERA
Jeff Grethe: CAMERAIddo
 
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817Ben Busby
 
Una estrategia para la integración de ontologías, servicios web y PLN en el a...
Una estrategia para la integración de ontologías, servicios web y PLN en el a...Una estrategia para la integración de ontologías, servicios web y PLN en el a...
Una estrategia para la integración de ontologías, servicios web y PLN en el a...Anubis Hosein
 
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016Prof. Wim Van Criekinge
 
Making Use of NGS Data: From Reads to Trees and Annotations
Making Use of NGS Data: From Reads to Trees and AnnotationsMaking Use of NGS Data: From Reads to Trees and Annotations
Making Use of NGS Data: From Reads to Trees and AnnotationsJoão André Carriço
 

Similar to Arthropod es tpipeline_poster (20)

Accelerating GWAS epistatic interaction analysis methods
Accelerating GWAS epistatic interaction analysis methodsAccelerating GWAS epistatic interaction analysis methods
Accelerating GWAS epistatic interaction analysis methods
 
Understanding Genome
Understanding Genome Understanding Genome
Understanding Genome
 
2014 Taverna Tutorial Introduction to eScience and workflows
2014 Taverna Tutorial Introduction to eScience and workflows2014 Taverna Tutorial Introduction to eScience and workflows
2014 Taverna Tutorial Introduction to eScience and workflows
 
biorepository
biorepositorybiorepository
biorepository
 
Open Source Software Tools for Synchrophasor Applications
Open Source Software Tools for  Synchrophasor ApplicationsOpen Source Software Tools for  Synchrophasor Applications
Open Source Software Tools for Synchrophasor Applications
 
D1803012022
D1803012022D1803012022
D1803012022
 
How can you access PubChem programmatically?
How can you access PubChem programmatically?How can you access PubChem programmatically?
How can you access PubChem programmatically?
 
Imgc2011 bioinformatics tutorial
Imgc2011 bioinformatics tutorialImgc2011 bioinformatics tutorial
Imgc2011 bioinformatics tutorial
 
How to be a bioinformatician
How to be a bioinformaticianHow to be a bioinformatician
How to be a bioinformatician
 
Full Resume
Full ResumeFull Resume
Full Resume
 
EMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology LaboratoryEMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology Laboratory
 
Genome comparision
Genome comparisionGenome comparision
Genome comparision
 
CromoCat: New developpments to genetic diversity databasing
CromoCat: New developpments to genetic diversity databasingCromoCat: New developpments to genetic diversity databasing
CromoCat: New developpments to genetic diversity databasing
 
1st KeyStone Summer School - Hackathon Challenge
1st KeyStone Summer School - Hackathon Challenge1st KeyStone Summer School - Hackathon Challenge
1st KeyStone Summer School - Hackathon Challenge
 
Jeff Grethe: CAMERA
Jeff Grethe: CAMERAJeff Grethe: CAMERA
Jeff Grethe: CAMERA
 
cpc-152-2-2003
cpc-152-2-2003cpc-152-2-2003
cpc-152-2-2003
 
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
 
Una estrategia para la integración de ontologías, servicios web y PLN en el a...
Una estrategia para la integración de ontologías, servicios web y PLN en el a...Una estrategia para la integración de ontologías, servicios web y PLN en el a...
Una estrategia para la integración de ontologías, servicios web y PLN en el a...
 
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
 
Making Use of NGS Data: From Reads to Trees and Annotations
Making Use of NGS Data: From Reads to Trees and AnnotationsMaking Use of NGS Data: From Reads to Trees and Annotations
Making Use of NGS Data: From Reads to Trees and Annotations
 

More from Tamizhmuhil

Ayeesha tamil story by Era.Natarajan
Ayeesha tamil story by Era.NatarajanAyeesha tamil story by Era.Natarajan
Ayeesha tamil story by Era.NatarajanTamizhmuhil
 
Ayeesha by Era.Natarajan
Ayeesha by Era.NatarajanAyeesha by Era.Natarajan
Ayeesha by Era.NatarajanTamizhmuhil
 
Tn min-ithazh-pankuni-nanthana
Tn min-ithazh-pankuni-nanthanaTn min-ithazh-pankuni-nanthana
Tn min-ithazh-pankuni-nanthanaTamizhmuhil
 
Algebra formulae
Algebra formulaeAlgebra formulae
Algebra formulaeTamizhmuhil
 
Tn min-ithazh-thai-nanthana
Tn min-ithazh-thai-nanthanaTn min-ithazh-thai-nanthana
Tn min-ithazh-thai-nanthanaTamizhmuhil
 
இந்த வாரம் கலாரசிகன் Dinamani - tamil daily news
இந்த வாரம் கலாரசிகன்   Dinamani - tamil daily newsஇந்த வாரம் கலாரசிகன்   Dinamani - tamil daily news
இந்த வாரம் கலாரசிகன் Dinamani - tamil daily newsTamizhmuhil
 
Kavi visai e-book realesed by tamilaka kavinjar kalai ilakkiya sangam
Kavi visai   e-book realesed by tamilaka kavinjar kalai ilakkiya sangamKavi visai   e-book realesed by tamilaka kavinjar kalai ilakkiya sangam
Kavi visai e-book realesed by tamilaka kavinjar kalai ilakkiya sangamTamizhmuhil
 
Birdhouse gift basket
Birdhouse gift basketBirdhouse gift basket
Birdhouse gift basketTamizhmuhil
 
Cursors in oracle
Cursors in oracleCursors in oracle
Cursors in oracleTamizhmuhil
 

More from Tamizhmuhil (12)

Ayeesha tamil story by Era.Natarajan
Ayeesha tamil story by Era.NatarajanAyeesha tamil story by Era.Natarajan
Ayeesha tamil story by Era.Natarajan
 
Ayeesha by Era.Natarajan
Ayeesha by Era.NatarajanAyeesha by Era.Natarajan
Ayeesha by Era.Natarajan
 
Lecture 343
Lecture 343Lecture 343
Lecture 343
 
Lecture 839
Lecture 839Lecture 839
Lecture 839
 
Tn min-ithazh-pankuni-nanthana
Tn min-ithazh-pankuni-nanthanaTn min-ithazh-pankuni-nanthana
Tn min-ithazh-pankuni-nanthana
 
Kaatruveli
KaatruveliKaatruveli
Kaatruveli
 
Algebra formulae
Algebra formulaeAlgebra formulae
Algebra formulae
 
Tn min-ithazh-thai-nanthana
Tn min-ithazh-thai-nanthanaTn min-ithazh-thai-nanthana
Tn min-ithazh-thai-nanthana
 
இந்த வாரம் கலாரசிகன் Dinamani - tamil daily news
இந்த வாரம் கலாரசிகன்   Dinamani - tamil daily newsஇந்த வாரம் கலாரசிகன்   Dinamani - tamil daily news
இந்த வாரம் கலாரசிகன் Dinamani - tamil daily news
 
Kavi visai e-book realesed by tamilaka kavinjar kalai ilakkiya sangam
Kavi visai   e-book realesed by tamilaka kavinjar kalai ilakkiya sangamKavi visai   e-book realesed by tamilaka kavinjar kalai ilakkiya sangam
Kavi visai e-book realesed by tamilaka kavinjar kalai ilakkiya sangam
 
Birdhouse gift basket
Birdhouse gift basketBirdhouse gift basket
Birdhouse gift basket
 
Cursors in oracle
Cursors in oracleCursors in oracle
Cursors in oracle
 

Recently uploaded

Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptxiammrhaywood
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
FILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipinoFILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipinojohnmickonozaleda
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 

Recently uploaded (20)

Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
FILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipinoFILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipino
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 

Arthropod es tpipeline_poster

  • 1. ArthropodEST: K-State Bioinformatics EST analysis pipeline * Sanjay Chellapilla 1 , Yoonseong Park 2 , Doina Caragea 3 and Susan J. Brown 1 1 Bioinformatics Center, Division of Biology 2 Department of Entomology 3 Department of Computing and Information Sciences Kansas State University, Manhattan KS 66506 ABSTRACT Expressed Sequence Tags (ESTs), produced by single-pass end-sequencing of cDNA clones, generate large datasets that are instrumental in gene discovery and gene sequence determination. Although several EST data analysis pipelines are available on the WWW ( e.g. ESTpass, EGassembler, ESTexplorer etc.), the WWW-accessible K-State Bioinformatics EST analysis pipeline ‘ArthropodEST’ goes further than these existing pipelines in providing more options and analyses, along with a user-friendly interface. The pipeline was developed utilizing freely available bioinformatics and system software (academic or F/OSS licenses). Available options in the pipeline include input sequence cleaning and screening for vectors and contaminants, masking repetitive sequences using repeat databases, clustering and assembly into contigs, computing ORFs (Open Reading Frames) and/or signal-peptide predictions, and assigning functional annotations to the contigs and singletons. The pipeline sends out automatic result notification email(s) containing a unique URL to download results from, to the user‘s email address. A summary report (automatically generated) of the analyses is included in the results available for download. The pipeline is accessible at http://bioinformatics.ksu.edu/ArthropodEST/ Acknowledgements: Supported by KSU-TE-AGC (SC), KSU Bioinformatics Center (DC, SC) and K-INBRE (DC, SC). KANSAS STATE UNIVERSITY KSU BIOINFORMATICS CENTER KSU ARTHROPOD GENOMICS CENTER K-INBRE Input sequences cleaning Vector/contaminant screening Assembly with optional prior clustering into contigs, singletons User downloads results and report from unique URL automatically sent by email Process user inputs, display project-receipt confirmation and summary, send automatic confirmation email, invoke pipeline shell script Further analyses: functional annotations and/or signal-peptide predictions server-side CGI script server-side Pipeline shell-script client-side (User) client-side (User) ArthropodEST homepage COMPONENTS OF THE PIPELINE (a) System software: GNU/Linux Ubuntu 2.6.24-23-server, bash 3.2.39, Apache 2.2.8 with mod_perl/2.0.3, PERL 5.8.8 with PERL modules CGI 3.29, Mail:Mailer 1.74, File::Temp 0.18, MySQL 5.0 and Postfix 2.5.4 Mail Transport Agent (MTA). (b) Bioinformatics software: - TGICL software suite [ http://compbio.dfci.harvard.edu/tgi/software/ ] - Vector databases: NCBI UniVec [ http://www.ncbi.nlm.nih.gov/VecScreen/UniVec.html ] EMBL EmVec [ ftp://ftp.ebi.ac.uk/pub/databases/emvec/ ] - RepeatMasker [ http://www.RepeatMasker.org/ ] and associated RepBase libraries [ http://www.girinst.org/ ] requires either cross_match [ http://www.phrap.org/phredphrapconsed.html ] or wu-blastall [ http://blast.wustl.edu/ ] - CAP3 sequence-assembly program [ http://seq.cs.iastate.edu/ ]     - NCBI BLAST suite [ http://www.ncbi.nlm.nih.gov/BLAST/download.shtml ] and/or wu-blastall [ http://blast.wustl.edu/ ] - blast2GO pipeline version B2G4PIPE [ http://blast2go.bioinfo.cipf.es/ ] - signalp [ http://www.cbs.dtu.dk/services/SignalP/ ] and EMBOSS [ http://emboss.sourceforge.net/ ] (c) In-house developed software: WWW-interface HTML/CSS, server-side CGI, PERL, bash shell and awk scripts User-input: project name, e-mail address, input files and options/parameters for analyses Repeat-masking with standard RepBase libraries WORKFLOW