SlideShare a Scribd company logo
1 of 2
Download to read offline
ISSN: 2312-7694 
Anop et al. / International Journal of Computer and Communication System Engineering (IJCCSE) 
90 | P a g e 
© 2014, IJCCSE All Rights Reserved Vol. 1 No.03 October 2014 www.ijccse.com 
Retrieval and Statistical Analysis of Genbank Data (RASA-GD) Anop Singh Ranawat 
School of Life Sciences, main campus, Jaipur National University, Jaipur, Rajasthan, India 
Mohan Kumar Yadav School of Life Sciences, main campus, Jaipur National University, Jaipur, Rajasthan, India 
Abstract— GenBank is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences. It comprises the DNA DataBank of Japan (DDBJ), the European Molecular Biology Laboratory (EMBL), and GenBank at NCB[3] .The information retrieved in a genbank file is complex and requires condsiderable time for analyzing the data. In this work we wish to convert the data present in Genbank file into a more readable, graphical format by developing a computational tool RASA-GD, using Perl code, Perl module and php. The tool accept file in genbank format and will extract information such as coding region, exon, intron, gene, mRNA,tRNA, and other data contained in gene bank file and also calculate the length of these elements and will generate a graphical output of length and statistical values such as mean, median, mode, standard deviation, variance, sample range, minimum value, maximum value, unique gene number. This tool will be very useful for various kinds of analysis to both molecular biologists and computational biologists and will help in hypothesis generation. We have used ourtool for visualizing variation of foxp2 gene from various species such as in human, monkey, gorilla, house mouse, gallusgallus. This gene is responsible for speech behavior. In order to explore whether there is an evolutionary significance of the length of various genetic elements of this gene, The data generated has provided useful insight into the evolutionary history of this gene. Keywords— GI number, Genbank, foxp2 gene, RASA-GD 
I. INTRODUCTION 
Bioinformatics is an interdisciplinary scientific field where biological problems can be solve easily and efficiently by using computational approach[1,2] .It plays a very important role in the field of medical research, now it is the time of genomic era so there is a need of computers with biology mergeapproach which can be used to deal with the genomic data, to study and analyze the variation in gene. Nowadays,experimentaldatas are stored in databases to study and analyze the result computationally. Various computational tools have been designed to analyze the data and predict the result by applying mathematical and statistical approach to perform the experiment Insilco, using computational tools and database. The handling of databases and selection of desirable data from the databases is a tough task for the researchers and students from a non bioinformaticsbackground.To make the retrieval work easier we have designed a tool RASA-GD, which helps to retrieve the desire data from the complex file obtained from the database of Genbank. 
II. METHODS AND RESULTS 
We have designed this tool by using perlcode,perl module and PHP.The tool used statistical and graphical module to calculate statistical value and visualize the desire output graphically. For study we have taken six samples files from genbank database(www.genbank.org) of foxp2 gene in different organisms such as human GI:568815591, chimpanzee GI:319999821, monkey GI:109156890, gorilla GI:401623081, house mouse GI:372099104, gallusgallusGI:358485511. We have puted all genbank sample files insingle genbank file and browse in tool. 
Fig.1: This is home page of RASA-GD tool
ISSN: 2312-7694 
Anop et al. / International Journal of Computer and Communication System Engineering (IJCCSE) 
91 | P a g e 
© 2014, IJCCSE All Rights Reserved Vol. 1 No.03 October 2014 www.ijccse.com 
Fig.2: First output file generated from RASA-GD 
Fig.3: This is a output result showing lengths of genebank elements according to different GI 
Fig .4: This shows the second input values. 
Fig .5: This is graphical and statistical value of different gene lengths are obtained from RASA-GD. In first result we have obtained lengths of differenent elements contained in different GI number. Then we have selected the gene lenths of foxp2 in different organisms. The selected lengths are puted in second box shownin fig.2, as GI:568815591,607463,GI:109156890,260557,GI:401623081,276119,GI:372099104,540725,GI:358485511,413384 . The gene length variation shown graphically in fig.4, can easily visualized. The statistical value gave following information such as unique sequence is 5 that are different from other sequences, count is 5 that is total number of sample genes,sum is 2098298 that is total sum of all gene length, min. value 260557 that is minimum value, max. is 607463 that is value of maximum length of the gene, mean419649.6 gave the mean value of total gene length and also gave standard deviation that is 15425.6050, variance is 23970967989.6, sample range is 346906, mode is 260557 and median is 413384. 
III. DISCUSSION AND ANALYSIS 
The tool RASA-GD used in retrieval and graphical visualization of genbank elements such as length of genes, exon, intron, mRNA,CDS, tRNAetc from genbank file and also used to calculate the statistical values of desire length of elements contained in genbank file. The different organisms of foxp2 genes found different lengths. REFERENCES [1] David W. Mount [2] Essential of Bioinformatics 
[3] http://www.ncbi.nlm.nih.gov/gene/?term=foxp2 
[4] http://search.cpan.org/dist/GDGraph/Graph.pm 
[5].http://search.cpan.org/~rhettbull/Statistics-Descriptive- Discrete-0.07/Discrete.pm

More Related Content

What's hot

EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017 EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017 EITESANGO
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformaticsMakarand Bhale
 
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0Fokhruz Zaman
 
Basics in bioinformatics
Basics in bioinformaticsBasics in bioinformatics
Basics in bioinformaticsMamun Billah
 
IRJET- Disease Identification using Proteins Values and Regulatory Modules
IRJET-  	  Disease Identification using Proteins Values and Regulatory  ModulesIRJET-  	  Disease Identification using Proteins Values and Regulatory  Modules
IRJET- Disease Identification using Proteins Values and Regulatory ModulesIRJET Journal
 
OVium Bioinformatic Solutions
OVium Bioinformatic SolutionsOVium Bioinformatic Solutions
OVium Bioinformatic SolutionsOVium Solutions
 
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesBioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesUniversity of Malaya
 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data setsimprovemed
 
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013Functional Genomics Data Society
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformaticsAtai Rabby
 
The Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resourcesThe Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resourcesMelanie Courtot
 
Functional annotation of invertebrate genomes
Functional annotation of invertebrate genomesFunctional annotation of invertebrate genomes
Functional annotation of invertebrate genomesSurya Saha
 
Biological database by kk sahu
Biological database by kk sahuBiological database by kk sahu
Biological database by kk sahuKAUSHAL SAHU
 
Introduction to Bioinformatics.
 Introduction to Bioinformatics. Introduction to Bioinformatics.
Introduction to Bioinformatics.Elena Sügis
 
Analytical Study of Hexapod miRNAs using Phylogenetic Methods
Analytical Study of Hexapod miRNAs using Phylogenetic MethodsAnalytical Study of Hexapod miRNAs using Phylogenetic Methods
Analytical Study of Hexapod miRNAs using Phylogenetic Methodscscpconf
 
Project report-on-bio-informatics
Project report-on-bio-informaticsProject report-on-bio-informatics
Project report-on-bio-informaticsDaniela Rotariu
 

What's hot (20)

EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017 EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
 
2016 davis-plantbio
2016 davis-plantbio2016 davis-plantbio
2016 davis-plantbio
 
Basics in bioinformatics
Basics in bioinformaticsBasics in bioinformatics
Basics in bioinformatics
 
IRJET- Disease Identification using Proteins Values and Regulatory Modules
IRJET-  	  Disease Identification using Proteins Values and Regulatory  ModulesIRJET-  	  Disease Identification using Proteins Values and Regulatory  Modules
IRJET- Disease Identification using Proteins Values and Regulatory Modules
 
OVium Bioinformatic Solutions
OVium Bioinformatic SolutionsOVium Bioinformatic Solutions
OVium Bioinformatic Solutions
 
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesBioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future Perspectives
 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data sets
 
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformatics
 
The Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resourcesThe Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resources
 
Functional annotation of invertebrate genomes
Functional annotation of invertebrate genomesFunctional annotation of invertebrate genomes
Functional annotation of invertebrate genomes
 
Biological database by kk sahu
Biological database by kk sahuBiological database by kk sahu
Biological database by kk sahu
 
Bioinformatics Analysis of Nucleotide Sequences
Bioinformatics Analysis of Nucleotide SequencesBioinformatics Analysis of Nucleotide Sequences
Bioinformatics Analysis of Nucleotide Sequences
 
NCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology InformationNCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology Information
 
Introduction to Bioinformatics.
 Introduction to Bioinformatics. Introduction to Bioinformatics.
Introduction to Bioinformatics.
 
Michael Reich, GenomeSpace Workshop, fged_seattle_2013
Michael Reich, GenomeSpace Workshop, fged_seattle_2013Michael Reich, GenomeSpace Workshop, fged_seattle_2013
Michael Reich, GenomeSpace Workshop, fged_seattle_2013
 
Analytical Study of Hexapod miRNAs using Phylogenetic Methods
Analytical Study of Hexapod miRNAs using Phylogenetic MethodsAnalytical Study of Hexapod miRNAs using Phylogenetic Methods
Analytical Study of Hexapod miRNAs using Phylogenetic Methods
 
Project report-on-bio-informatics
Project report-on-bio-informaticsProject report-on-bio-informatics
Project report-on-bio-informatics
 

Similar to Retrieval and Statistical Analysis of Genbank Data (RASA-GD)

170120 giab stanford genetics seminar
170120 giab stanford genetics seminar170120 giab stanford genetics seminar
170120 giab stanford genetics seminarGenomeInABottle
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchDavid Ruau
 
Longevity Genie, 01.10.2023
Longevity Genie, 01.10.2023Longevity Genie, 01.10.2023
Longevity Genie, 01.10.2023Anton Kulaga
 
RNA Sequencing Research
RNA Sequencing ResearchRNA Sequencing Research
RNA Sequencing ResearchTanmay Ghai
 
EPGP Informatics Publication - nihms-369795
EPGP Informatics Publication - nihms-369795EPGP Informatics Publication - nihms-369795
EPGP Informatics Publication - nihms-369795Michael Williams
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchAnshika Bansal
 
Sept2016 plenary nist_intro
Sept2016 plenary nist_introSept2016 plenary nist_intro
Sept2016 plenary nist_introGenomeInABottle
 
Internship Report
Internship ReportInternship Report
Internship ReportNeha Gupta
 
GIAB GRC Workshop slides
GIAB GRC Workshop slidesGIAB GRC Workshop slides
GIAB GRC Workshop slidesGenomeInABottle
 
171017 giab for giab grc workshop
171017 giab for giab grc workshop171017 giab for giab grc workshop
171017 giab for giab grc workshopGenomeInABottle
 
Research Statement Chien-Wei Lin
Research Statement Chien-Wei LinResearch Statement Chien-Wei Lin
Research Statement Chien-Wei LinChien-Wei Lin
 
Recognition of emotional states using EEG signals based on time-frequency ana...
Recognition of emotional states using EEG signals based on time-frequency ana...Recognition of emotional states using EEG signals based on time-frequency ana...
Recognition of emotional states using EEG signals based on time-frequency ana...IJECEIAES
 
Ramil Mauleon: Galaxy: bioinformatics for rice scientists
Ramil Mauleon: Galaxy: bioinformatics for rice scientistsRamil Mauleon: Galaxy: bioinformatics for rice scientists
Ramil Mauleon: Galaxy: bioinformatics for rice scientistsGigaScience, BGI Hong Kong
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGLong Pei
 
Graziano Pesole - il progetto EPIGEN
Graziano Pesole - il progetto EPIGENGraziano Pesole - il progetto EPIGEN
Graziano Pesole - il progetto EPIGENeventi-ITBbari
 

Similar to Retrieval and Statistical Analysis of Genbank Data (RASA-GD) (20)

170326 giab abrf
170326 giab abrf170326 giab abrf
170326 giab abrf
 
170120 giab stanford genetics seminar
170120 giab stanford genetics seminar170120 giab stanford genetics seminar
170120 giab stanford genetics seminar
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical Research
 
call for papers, research paper publishing, where to publish research paper, ...
call for papers, research paper publishing, where to publish research paper, ...call for papers, research paper publishing, where to publish research paper, ...
call for papers, research paper publishing, where to publish research paper, ...
 
Longevity Genie, 01.10.2023
Longevity Genie, 01.10.2023Longevity Genie, 01.10.2023
Longevity Genie, 01.10.2023
 
RNA Sequencing Research
RNA Sequencing ResearchRNA Sequencing Research
RNA Sequencing Research
 
EPGP Informatics Publication - nihms-369795
EPGP Informatics Publication - nihms-369795EPGP Informatics Publication - nihms-369795
EPGP Informatics Publication - nihms-369795
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences research
 
Sept2016 plenary nist_intro
Sept2016 plenary nist_introSept2016 plenary nist_intro
Sept2016 plenary nist_intro
 
Internship Report
Internship ReportInternship Report
Internship Report
 
Genome in a Bottle
Genome in a BottleGenome in a Bottle
Genome in a Bottle
 
GIAB GRC Workshop slides
GIAB GRC Workshop slidesGIAB GRC Workshop slides
GIAB GRC Workshop slides
 
Ijetcas14 325
Ijetcas14 325Ijetcas14 325
Ijetcas14 325
 
171017 giab for giab grc workshop
171017 giab for giab grc workshop171017 giab for giab grc workshop
171017 giab for giab grc workshop
 
Research Statement Chien-Wei Lin
Research Statement Chien-Wei LinResearch Statement Chien-Wei Lin
Research Statement Chien-Wei Lin
 
Recognition of emotional states using EEG signals based on time-frequency ana...
Recognition of emotional states using EEG signals based on time-frequency ana...Recognition of emotional states using EEG signals based on time-frequency ana...
Recognition of emotional states using EEG signals based on time-frequency ana...
 
Ramil Mauleon: Galaxy: bioinformatics for rice scientists
Ramil Mauleon: Galaxy: bioinformatics for rice scientistsRamil Mauleon: Galaxy: bioinformatics for rice scientists
Ramil Mauleon: Galaxy: bioinformatics for rice scientists
 
Impact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEGImpact_of_gene_length_on_DEG
Impact_of_gene_length_on_DEG
 
171017 giab for giab grc workshop
171017 giab for giab grc workshop171017 giab for giab grc workshop
171017 giab for giab grc workshop
 
Graziano Pesole - il progetto EPIGEN
Graziano Pesole - il progetto EPIGENGraziano Pesole - il progetto EPIGEN
Graziano Pesole - il progetto EPIGEN
 

More from International Journal of Computer and Communication System Engineering

More from International Journal of Computer and Communication System Engineering (20)

Cloud Security Analysis for Health Care Systems
Cloud Security Analysis for Health Care SystemsCloud Security Analysis for Health Care Systems
Cloud Security Analysis for Health Care Systems
 
Efficient stbc for the data rate of mimo ofdma
Efficient stbc for the data rate of mimo ofdmaEfficient stbc for the data rate of mimo ofdma
Efficient stbc for the data rate of mimo ofdma
 
A novel adaptive algorithm for removal of power line interference from ecg si...
A novel adaptive algorithm for removal of power line interference from ecg si...A novel adaptive algorithm for removal of power line interference from ecg si...
A novel adaptive algorithm for removal of power line interference from ecg si...
 
Modified MD5 Algorithm for Password Encryption
Modified MD5 Algorithm for Password EncryptionModified MD5 Algorithm for Password Encryption
Modified MD5 Algorithm for Password Encryption
 
Implementing Pareto Analysis of Total Quality Management for Service Industri...
Implementing Pareto Analysis of Total Quality Management for Service Industri...Implementing Pareto Analysis of Total Quality Management for Service Industri...
Implementing Pareto Analysis of Total Quality Management for Service Industri...
 
Real Time Parking Information Provider System on Android Phones
Real Time Parking Information Provider System on Android PhonesReal Time Parking Information Provider System on Android Phones
Real Time Parking Information Provider System on Android Phones
 
An Image-Based Bone fracture Detection Using AForge Library
An Image-Based Bone fracture Detection Using AForge LibraryAn Image-Based Bone fracture Detection Using AForge Library
An Image-Based Bone fracture Detection Using AForge Library
 
Compact Fractal Based UWB Band Notch Antenna
Compact Fractal Based UWB Band Notch AntennaCompact Fractal Based UWB Band Notch Antenna
Compact Fractal Based UWB Band Notch Antenna
 
Dynamic Key Based User Authentication (DKBUA) Framework for MobiCloud Environ...
Dynamic Key Based User Authentication (DKBUA) Framework for MobiCloud Environ...Dynamic Key Based User Authentication (DKBUA) Framework for MobiCloud Environ...
Dynamic Key Based User Authentication (DKBUA) Framework for MobiCloud Environ...
 
A Learning Automata Based Prediction Mechanism for Target Tracking in Wireles...
A Learning Automata Based Prediction Mechanism for Target Tracking in Wireles...A Learning Automata Based Prediction Mechanism for Target Tracking in Wireles...
A Learning Automata Based Prediction Mechanism for Target Tracking in Wireles...
 
An Approach of Improvisation in Efficiency of Apriori Algorithm
An Approach of Improvisation in Efficiency of Apriori AlgorithmAn Approach of Improvisation in Efficiency of Apriori Algorithm
An Approach of Improvisation in Efficiency of Apriori Algorithm
 
Cloud Computing for Exploring to Scope in Business
Cloud Computing for Exploring to Scope in BusinessCloud Computing for Exploring to Scope in Business
Cloud Computing for Exploring to Scope in Business
 
Mobile Effects on Human Body
Mobile Effects on Human BodyMobile Effects on Human Body
Mobile Effects on Human Body
 
Performance Analysis of WiMAX Based Vehicular Ad hoc Networks with Realistic ...
Performance Analysis of WiMAX Based Vehicular Ad hoc Networks with Realistic ...Performance Analysis of WiMAX Based Vehicular Ad hoc Networks with Realistic ...
Performance Analysis of WiMAX Based Vehicular Ad hoc Networks with Realistic ...
 
Prevention of Denial-of-Service Attack In Wireless Sensor Network via NS-2
Prevention of Denial-of-Service Attack In Wireless Sensor Network via NS-2Prevention of Denial-of-Service Attack In Wireless Sensor Network via NS-2
Prevention of Denial-of-Service Attack In Wireless Sensor Network via NS-2
 
CLOUD TESTING MODEL – BENEFITS, LIMITATIONS AND CHALLENGES
CLOUD TESTING MODEL – BENEFITS, LIMITATIONS AND CHALLENGESCLOUD TESTING MODEL – BENEFITS, LIMITATIONS AND CHALLENGES
CLOUD TESTING MODEL – BENEFITS, LIMITATIONS AND CHALLENGES
 
Exploratory Analysis of AI Techniques in Computer Games and Challenges faced ...
Exploratory Analysis of AI Techniques in Computer Games and Challenges faced ...Exploratory Analysis of AI Techniques in Computer Games and Challenges faced ...
Exploratory Analysis of AI Techniques in Computer Games and Challenges faced ...
 
Feasibility Study on e-Voting System
Feasibility Study on e-Voting SystemFeasibility Study on e-Voting System
Feasibility Study on e-Voting System
 
Rp 3010 5814
Rp 3010 5814Rp 3010 5814
Rp 3010 5814
 
Gabor Filter
Gabor FilterGabor Filter
Gabor Filter
 

Recently uploaded

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 

Recently uploaded (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 

Retrieval and Statistical Analysis of Genbank Data (RASA-GD)

  • 1. ISSN: 2312-7694 Anop et al. / International Journal of Computer and Communication System Engineering (IJCCSE) 90 | P a g e © 2014, IJCCSE All Rights Reserved Vol. 1 No.03 October 2014 www.ijccse.com Retrieval and Statistical Analysis of Genbank Data (RASA-GD) Anop Singh Ranawat School of Life Sciences, main campus, Jaipur National University, Jaipur, Rajasthan, India Mohan Kumar Yadav School of Life Sciences, main campus, Jaipur National University, Jaipur, Rajasthan, India Abstract— GenBank is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences. It comprises the DNA DataBank of Japan (DDBJ), the European Molecular Biology Laboratory (EMBL), and GenBank at NCB[3] .The information retrieved in a genbank file is complex and requires condsiderable time for analyzing the data. In this work we wish to convert the data present in Genbank file into a more readable, graphical format by developing a computational tool RASA-GD, using Perl code, Perl module and php. The tool accept file in genbank format and will extract information such as coding region, exon, intron, gene, mRNA,tRNA, and other data contained in gene bank file and also calculate the length of these elements and will generate a graphical output of length and statistical values such as mean, median, mode, standard deviation, variance, sample range, minimum value, maximum value, unique gene number. This tool will be very useful for various kinds of analysis to both molecular biologists and computational biologists and will help in hypothesis generation. We have used ourtool for visualizing variation of foxp2 gene from various species such as in human, monkey, gorilla, house mouse, gallusgallus. This gene is responsible for speech behavior. In order to explore whether there is an evolutionary significance of the length of various genetic elements of this gene, The data generated has provided useful insight into the evolutionary history of this gene. Keywords— GI number, Genbank, foxp2 gene, RASA-GD I. INTRODUCTION Bioinformatics is an interdisciplinary scientific field where biological problems can be solve easily and efficiently by using computational approach[1,2] .It plays a very important role in the field of medical research, now it is the time of genomic era so there is a need of computers with biology mergeapproach which can be used to deal with the genomic data, to study and analyze the variation in gene. Nowadays,experimentaldatas are stored in databases to study and analyze the result computationally. Various computational tools have been designed to analyze the data and predict the result by applying mathematical and statistical approach to perform the experiment Insilco, using computational tools and database. The handling of databases and selection of desirable data from the databases is a tough task for the researchers and students from a non bioinformaticsbackground.To make the retrieval work easier we have designed a tool RASA-GD, which helps to retrieve the desire data from the complex file obtained from the database of Genbank. II. METHODS AND RESULTS We have designed this tool by using perlcode,perl module and PHP.The tool used statistical and graphical module to calculate statistical value and visualize the desire output graphically. For study we have taken six samples files from genbank database(www.genbank.org) of foxp2 gene in different organisms such as human GI:568815591, chimpanzee GI:319999821, monkey GI:109156890, gorilla GI:401623081, house mouse GI:372099104, gallusgallusGI:358485511. We have puted all genbank sample files insingle genbank file and browse in tool. Fig.1: This is home page of RASA-GD tool
  • 2. ISSN: 2312-7694 Anop et al. / International Journal of Computer and Communication System Engineering (IJCCSE) 91 | P a g e © 2014, IJCCSE All Rights Reserved Vol. 1 No.03 October 2014 www.ijccse.com Fig.2: First output file generated from RASA-GD Fig.3: This is a output result showing lengths of genebank elements according to different GI Fig .4: This shows the second input values. Fig .5: This is graphical and statistical value of different gene lengths are obtained from RASA-GD. In first result we have obtained lengths of differenent elements contained in different GI number. Then we have selected the gene lenths of foxp2 in different organisms. The selected lengths are puted in second box shownin fig.2, as GI:568815591,607463,GI:109156890,260557,GI:401623081,276119,GI:372099104,540725,GI:358485511,413384 . The gene length variation shown graphically in fig.4, can easily visualized. The statistical value gave following information such as unique sequence is 5 that are different from other sequences, count is 5 that is total number of sample genes,sum is 2098298 that is total sum of all gene length, min. value 260557 that is minimum value, max. is 607463 that is value of maximum length of the gene, mean419649.6 gave the mean value of total gene length and also gave standard deviation that is 15425.6050, variance is 23970967989.6, sample range is 346906, mode is 260557 and median is 413384. III. DISCUSSION AND ANALYSIS The tool RASA-GD used in retrieval and graphical visualization of genbank elements such as length of genes, exon, intron, mRNA,CDS, tRNAetc from genbank file and also used to calculate the statistical values of desire length of elements contained in genbank file. The different organisms of foxp2 genes found different lengths. REFERENCES [1] David W. Mount [2] Essential of Bioinformatics [3] http://www.ncbi.nlm.nih.gov/gene/?term=foxp2 [4] http://search.cpan.org/dist/GDGraph/Graph.pm [5].http://search.cpan.org/~rhettbull/Statistics-Descriptive- Discrete-0.07/Discrete.pm