SlideShare a Scribd company logo
1 of 2
Download to read offline
Computational and Bioinformatics experience:
T-Lex Web Interfacing and Data Generation and Analysis: (Internship at Stanford University, CA)
Designed a sample Web-Interface for T-Lex tool and generated, annotated the data using the tool and
worked on Drosophila data of 12 different species and also testing on different platforms using the T-Lex
Tool.
(Please refer to webpage : http://petrov.stanford.edu/cgi-bin/Tlex_databases/home.cgi! ).
Large-Scale Data Analysis of p53 Gene Expression in Embryonic Stem Cell Populations (Master’s Thesis)
To explore sets of genes that are correlated with the p 5 3 expression within and between different
groups of embryonic stem cells using A g i l e n t W o r k B e n c h T o o l , and also predicted if p53 might
be regulating one single set of genes in all different stem cell populations or whether its expression is
different in different populations. This was then combined to determine which genes actually are
similar using ChIP-seq data and those which bind to p53 and also measured their expression profile
across different populations.
A re-analysis of previously available microarray dataset followed by analysis of the ChIP-seq data was
done and comparison of these two datasets provided appropriate expression profile of p53 and also
detection of a new pathway which was the Ca+2 Signaling Pathway.
To depict an ideal target site for prevention of African Sleeping Sickness caused by T.Brucei
An ideal target site was detected to prevent the growth of the parasite. Glycolytic Pathway is
considered to be an important source of energy for the parasite survival especially in Trypanosomes
and Leishminia. Enzymes are considered to be the best targets thereby GAPDH enzyme in this pathway
was chosen as a target and worked on.
Genome assembly using short reads, its challenges, methods and future of de novo assembly
A next generation genomics review of data on short reads generated by high sequencing technology
was made and methods which were put to use, the challenges faced in handling the short read data
and appropriate solutions to overcome these challenges along with future technologies like de novo
assembly was studied.
Knowledge about usage of Mapping Reads and worked with Velvet algorithm for assembly.
To map illumine reads of E.coli on to a reference genome and exam the data for single nucleotide
polymorphisms, insertions and deletions. BWA was used to align the reads and Genome Viewer was used
to view the aligned reads.
Velvet was also used to assembly of the short reads.
Modeling generation and Evaluation of the protein (TbAUK1)
TbAUK1 is a protein kinase which is concentrated in the mitotic phase of the cell cycle of blood
stream form of
Trypanosoma Brucei and can act as an important target site for production of anti-parasitic drug. This
required model development and validation of the model using SAVES server and also considering the
Ramachandran Plot and generation of ligands using LIDUEAS tool and further by checking the binding of
the ligand with human and that of the modelled protein and a presentation on the work was also done.
RevTrans Tool Creation
Developed RevTrans tool using NetBeans (BioJava packages) and HTML. This tool helps in providing a
reverse complement of an input sequences, and can be majorly used in the areas of Bioinformatics for
the exploring DNA sequences to find reverse complement.
Protein Structure Prediction and Database creation
A Protein database for Human X chromosome proteins whose structures have not been published in
the public databases were modelled and fully annotated data was created.

More Related Content

What's hot

BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES nadeem akhter
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databasesPranavathiyani G
 
Computational biology bls 303
Computational biology bls 303Computational biology bls 303
Computational biology bls 303Bruno Mmassy
 
Ncbi basic intro_v_pitt_kent_osu
Ncbi basic intro_v_pitt_kent_osuNcbi basic intro_v_pitt_kent_osu
Ncbi basic intro_v_pitt_kent_osuBen Busby
 
Data analytics challenges in genomics
Data analytics challenges in genomicsData analytics challenges in genomics
Data analytics challenges in genomicsmikaelhuss
 
100505 koenig biological_databases
100505 koenig biological_databases100505 koenig biological_databases
100505 koenig biological_databasesMeetika Gupta
 
bioinfomatics
bioinfomaticsbioinfomatics
bioinfomaticsnguyenpg
 
Biological databases
Biological databasesBiological databases
Biological databasesQamar iqbal
 
Bioinformatics for beginners (exam point of view)
Bioinformatics for beginners (exam point of view)Bioinformatics for beginners (exam point of view)
Bioinformatics for beginners (exam point of view)Sijo A
 
BITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequencesBITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequencesBITS
 
Finding Allelic Frequencies Using MapReduce/Hadoop
Finding Allelic Frequencies Using MapReduce/HadoopFinding Allelic Frequencies Using MapReduce/Hadoop
Finding Allelic Frequencies Using MapReduce/HadoopMahmoud Parsian
 
Biological database by kk sahu
Biological database by kk sahuBiological database by kk sahu
Biological database by kk sahuKAUSHAL SAHU
 

What's hot (20)

BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES
 
Databases ii
Databases iiDatabases ii
Databases ii
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
Computational biology bls 303
Computational biology bls 303Computational biology bls 303
Computational biology bls 303
 
Ncbi basic intro_v_pitt_kent_osu
Ncbi basic intro_v_pitt_kent_osuNcbi basic intro_v_pitt_kent_osu
Ncbi basic intro_v_pitt_kent_osu
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Biological data base
Biological data baseBiological data base
Biological data base
 
Gen bank (genetic sequence databank)
Gen bank (genetic sequence databank)Gen bank (genetic sequence databank)
Gen bank (genetic sequence databank)
 
Data analytics challenges in genomics
Data analytics challenges in genomicsData analytics challenges in genomics
Data analytics challenges in genomics
 
100505 koenig biological_databases
100505 koenig biological_databases100505 koenig biological_databases
100505 koenig biological_databases
 
bioinfomatics
bioinfomaticsbioinfomatics
bioinfomatics
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Bioinformatics for beginners (exam point of view)
Bioinformatics for beginners (exam point of view)Bioinformatics for beginners (exam point of view)
Bioinformatics for beginners (exam point of view)
 
BITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequencesBITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequences
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Finding Allelic Frequencies Using MapReduce/Hadoop
Finding Allelic Frequencies Using MapReduce/HadoopFinding Allelic Frequencies Using MapReduce/Hadoop
Finding Allelic Frequencies Using MapReduce/Hadoop
 
Biological database by kk sahu
Biological database by kk sahuBiological database by kk sahu
Biological database by kk sahu
 
Biological databases
Biological databasesBiological databases
Biological databases
 

Viewers also liked

Letter Of recomendation (2)
Letter Of recomendation (2)Letter Of recomendation (2)
Letter Of recomendation (2)Sherif Abdalla
 
Plantilla de diseño catalina
Plantilla de diseño catalinaPlantilla de diseño catalina
Plantilla de diseño catalinaroboticanicoa
 
1с договорчики
1с договорчики1с договорчики
1с договорчики1C-KPD
 
Planeación de clase mayo 10-13
Planeación de clase mayo 10-13Planeación de clase mayo 10-13
Planeación de clase mayo 10-13Katherin Diaz
 
Evolving Duke's Learning Technology Ecosystem
Evolving Duke's Learning Technology EcosystemEvolving Duke's Learning Technology Ecosystem
Evolving Duke's Learning Technology EcosystemMichael Greene
 
Festivals in Turkey
Festivals in TurkeyFestivals in Turkey
Festivals in Turkeysp5bielskp
 
Sleep deprivation to lead some serious consequences
Sleep deprivation to lead some serious consequencesSleep deprivation to lead some serious consequences
Sleep deprivation to lead some serious consequencesSleepingtabs
 
Injection on Steroids: Codeless code injection and 0-day techniques
Injection on Steroids: Codeless code injection and 0-day techniquesInjection on Steroids: Codeless code injection and 0-day techniques
Injection on Steroids: Codeless code injection and 0-day techniquesenSilo
 

Viewers also liked (15)

Joe's OSHA 510
Joe's OSHA 510Joe's OSHA 510
Joe's OSHA 510
 
δδσδσδδσσδ
δδσδσδδσσδδδσδσδδσσδ
δδσδσδδσσδ
 
Origen de las computadoras
Origen de las computadorasOrigen de las computadoras
Origen de las computadoras
 
Letter Of recomendation (2)
Letter Of recomendation (2)Letter Of recomendation (2)
Letter Of recomendation (2)
 
Plantilla de diseño catalina
Plantilla de diseño catalinaPlantilla de diseño catalina
Plantilla de diseño catalina
 
Pp Pd
Pp PdPp Pd
Pp Pd
 
Chorizo
ChorizoChorizo
Chorizo
 
1с договорчики
1с договорчики1с договорчики
1с договорчики
 
dd-214
dd-214dd-214
dd-214
 
Planeación de clase mayo 10-13
Planeación de clase mayo 10-13Planeación de clase mayo 10-13
Planeación de clase mayo 10-13
 
Evolving Duke's Learning Technology Ecosystem
Evolving Duke's Learning Technology EcosystemEvolving Duke's Learning Technology Ecosystem
Evolving Duke's Learning Technology Ecosystem
 
Festivals in Turkey
Festivals in TurkeyFestivals in Turkey
Festivals in Turkey
 
Sleep deprivation to lead some serious consequences
Sleep deprivation to lead some serious consequencesSleep deprivation to lead some serious consequences
Sleep deprivation to lead some serious consequences
 
Dormir
DormirDormir
Dormir
 
Injection on Steroids: Codeless code injection and 0-day techniques
Injection on Steroids: Codeless code injection and 0-day techniquesInjection on Steroids: Codeless code injection and 0-day techniques
Injection on Steroids: Codeless code injection and 0-day techniques
 

Similar to Computational and Bioinformatics Experience

Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformaticsAtai Rabby
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGLong Pei
 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuKAUSHAL SAHU
 
Chapter 20 ppt
Chapter 20 pptChapter 20 ppt
Chapter 20 pptrehman2009
 
Bioinformatics Final Presentation
Bioinformatics Final PresentationBioinformatics Final Presentation
Bioinformatics Final PresentationShruthi Choudary
 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data setsimprovemed
 
Bioinformatics data mining
Bioinformatics data miningBioinformatics data mining
Bioinformatics data miningSangeeta Das
 
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA SequenceEfficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA SequenceIJSTA
 
Internship Report
Internship ReportInternship Report
Internship ReportNeha Gupta
 
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICSPROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICSLubna MRL
 
Transcriptome Analysis & Applications
Transcriptome Analysis & ApplicationsTranscriptome Analysis & Applications
Transcriptome Analysis & Applications1010Genome Pte Ltd
 
RNA Sequencing Research
RNA Sequencing ResearchRNA Sequencing Research
RNA Sequencing ResearchTanmay Ghai
 
Visualization Approaches for Biomedical Omics Data: Putting It All Together
Visualization Approaches for Biomedical Omics Data: Putting It All TogetherVisualization Approaches for Biomedical Omics Data: Putting It All Together
Visualization Approaches for Biomedical Omics Data: Putting It All TogetherNils Gehlenborg
 
Bioinformatics_1_ChenS.pptx
Bioinformatics_1_ChenS.pptxBioinformatics_1_ChenS.pptx
Bioinformatics_1_ChenS.pptxxRowlet
 

Similar to Computational and Bioinformatics Experience (20)

Myers CV_2015
Myers CV_2015Myers CV_2015
Myers CV_2015
 
Protein databases
Protein databasesProtein databases
Protein databases
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformatics
 
Bioinformatics seminar
Bioinformatics seminarBioinformatics seminar
Bioinformatics seminar
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEG
 
Bioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahuBioinformatic, and tools by kk sahu
Bioinformatic, and tools by kk sahu
 
Chapter 20 ppt
Chapter 20 pptChapter 20 ppt
Chapter 20 ppt
 
Bioinformatics Final Presentation
Bioinformatics Final PresentationBioinformatics Final Presentation
Bioinformatics Final Presentation
 
Bioinformatics.pptx
Bioinformatics.pptxBioinformatics.pptx
Bioinformatics.pptx
 
Applications of bioinformatics
Applications of bioinformaticsApplications of bioinformatics
Applications of bioinformatics
 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data sets
 
Bioinformatics data mining
Bioinformatics data miningBioinformatics data mining
Bioinformatics data mining
 
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA SequenceEfficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
 
Internship Report
Internship ReportInternship Report
Internship Report
 
0032-Ijabpt-Imed pub
0032-Ijabpt-Imed pub0032-Ijabpt-Imed pub
0032-Ijabpt-Imed pub
 
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICSPROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
PROKARYOTIC TRANSCRIPTOMICS AND METAGENOMICS
 
Transcriptome Analysis & Applications
Transcriptome Analysis & ApplicationsTranscriptome Analysis & Applications
Transcriptome Analysis & Applications
 
RNA Sequencing Research
RNA Sequencing ResearchRNA Sequencing Research
RNA Sequencing Research
 
Visualization Approaches for Biomedical Omics Data: Putting It All Together
Visualization Approaches for Biomedical Omics Data: Putting It All TogetherVisualization Approaches for Biomedical Omics Data: Putting It All Together
Visualization Approaches for Biomedical Omics Data: Putting It All Together
 
Bioinformatics_1_ChenS.pptx
Bioinformatics_1_ChenS.pptxBioinformatics_1_ChenS.pptx
Bioinformatics_1_ChenS.pptx
 

Computational and Bioinformatics Experience

  • 1. Computational and Bioinformatics experience: T-Lex Web Interfacing and Data Generation and Analysis: (Internship at Stanford University, CA) Designed a sample Web-Interface for T-Lex tool and generated, annotated the data using the tool and worked on Drosophila data of 12 different species and also testing on different platforms using the T-Lex Tool. (Please refer to webpage : http://petrov.stanford.edu/cgi-bin/Tlex_databases/home.cgi! ). Large-Scale Data Analysis of p53 Gene Expression in Embryonic Stem Cell Populations (Master’s Thesis) To explore sets of genes that are correlated with the p 5 3 expression within and between different groups of embryonic stem cells using A g i l e n t W o r k B e n c h T o o l , and also predicted if p53 might be regulating one single set of genes in all different stem cell populations or whether its expression is different in different populations. This was then combined to determine which genes actually are similar using ChIP-seq data and those which bind to p53 and also measured their expression profile across different populations. A re-analysis of previously available microarray dataset followed by analysis of the ChIP-seq data was done and comparison of these two datasets provided appropriate expression profile of p53 and also detection of a new pathway which was the Ca+2 Signaling Pathway. To depict an ideal target site for prevention of African Sleeping Sickness caused by T.Brucei An ideal target site was detected to prevent the growth of the parasite. Glycolytic Pathway is considered to be an important source of energy for the parasite survival especially in Trypanosomes and Leishminia. Enzymes are considered to be the best targets thereby GAPDH enzyme in this pathway was chosen as a target and worked on. Genome assembly using short reads, its challenges, methods and future of de novo assembly A next generation genomics review of data on short reads generated by high sequencing technology was made and methods which were put to use, the challenges faced in handling the short read data and appropriate solutions to overcome these challenges along with future technologies like de novo assembly was studied. Knowledge about usage of Mapping Reads and worked with Velvet algorithm for assembly. To map illumine reads of E.coli on to a reference genome and exam the data for single nucleotide polymorphisms, insertions and deletions. BWA was used to align the reads and Genome Viewer was used to view the aligned reads. Velvet was also used to assembly of the short reads. Modeling generation and Evaluation of the protein (TbAUK1) TbAUK1 is a protein kinase which is concentrated in the mitotic phase of the cell cycle of blood stream form of Trypanosoma Brucei and can act as an important target site for production of anti-parasitic drug. This
  • 2. required model development and validation of the model using SAVES server and also considering the Ramachandran Plot and generation of ligands using LIDUEAS tool and further by checking the binding of the ligand with human and that of the modelled protein and a presentation on the work was also done. RevTrans Tool Creation Developed RevTrans tool using NetBeans (BioJava packages) and HTML. This tool helps in providing a reverse complement of an input sequences, and can be majorly used in the areas of Bioinformatics for the exploring DNA sequences to find reverse complement. Protein Structure Prediction and Database creation A Protein database for Human X chromosome proteins whose structures have not been published in the public databases were modelled and fully annotated data was created.