SlideShare a Scribd company logo
1 of 29
GENOME CLOUD COMPUTING
By.A.Arputha Selvaraj
Introduction
๏ฌ The impending collapse of the genome
informatics
๏ฌ Major sequence-data relateddatabases:
GenBank, EMBL, DDBJ, SRA, GEO, and
ArrayExpress
๏ฌ Value-added integrators of genomic data:
Ensembl, UCSC Genome Browser, Galaxy,
and many model organism databases.
The old genome informatics ecosystem
Basis for the production and
consumption of genomic information
๏ฌ Moore's Law: โ€œthe number of transistors that
can be placed on an integrated circuit board
is increasing exponentially, with a doubling
time of roughly 18 months.โ€
๏ฌ Similar laws for disk storage and network
capacity have also been observed.
๏ฌ Until now the doubling time for DNA
sequencing was just a bit slower than the
growth of compute and storage capacity.
Historical trends in storage prices vs DNA
sequencing costs
Notice: this is a
logarithmic plot โ€“
exponential
curves appear as
straight lines.
โ€œNext generationโ€ sequencing technologies in the mid-2000s changed the trends and
now threatens the conventional genome informatics ecosystem
Current Situation
๏‚จ Soon it will cost less to sequence a base of
DNA than to store it on hard disk
๏‚จ We are facing potential tsunami of genome
data that will swamp our storage system and
crush our compute clusters
๏‚จ Who is affected:
๏‚ค Genome sequence repositories who need
maintain and timely update their data
๏‚ค โ€œPower usersโ€ who accustomed to download the
data to local computer for analysis
Cloud Computing
๏‚จ Cloud computing is a general term for
computation-as-a-service
๏‚จ Computation-as-a-service means that customers
rent the hardware and the storage only for the
time needed to achieve their goals
๏‚จ In cloud computing the rentals are virtual
๏‚จ The service provider implements the
infrastructure to give users the ability to create,
upload, and launch virtual machines on their
extremely large and powerful compute farm.
Virtual Machine
๏‚จ Virtual machine is software that emulates the properties
of a computer.
๏‚จ Any operating system can be chosen to run on the
virtual machine and all the tasks we usually perform on
the computer can be performed on virtual machine.
๏‚จ Virtual machine bootable disk can be stored as an
image. This image can used as a template to start up
multiple virtual machines โ€“ launching a virtual compute
cluster is very easy.
๏‚จ Storing large data sets is usually done with virtual
disks. Virtual disk image can be attached to virtual
machine as local or shared hard drive.
The โ€œnewโ€ genome informatics ecosystem
based on cloud computing
Continue to
work with the
data via web-
pages in their
accustomed
way
Have two options:
continue
downloading data
to local clusters as
before, or use
cloud computing โ€“
move the cluster to
the data.
Cloud computing service providers and
their offered services
๏‚จ Some of the cloud computing service providers are: Amazon
(Elastic Cloud Computing), Rackspace Cloud, Flexiant.
๏‚จ Amazon offers a variety of bioinformatics-oriented virtual machine
images:
๏‚ค Images prepopulated by Galaxy
๏‚ค Bioconductor โ€“ programming environment with R statistic package
๏‚ค GBrowser โ€“ genome browser
๏‚ค BioPerl
๏‚ค JCVI Cloud BioLinux โ€“ collection of bioinformatics tools including
Celera Assembler
๏‚จ Amazon also provides several large genomic datasets in its cloud:
๏‚ค Complete copy of GeneBank (200 Gb)
๏‚ค 30x coverage sequencing reads of a trio of individuals from 1000
Genome Project (700Gb)
๏‚ค Genome databases from Ensembl including annotated human and 50
other species genomes
Cloud computing service providers
from Nature Biotechnology news article โ€œUp in a cloud?โ€
Academic cloud computing
๏‚จ Growing number of academic compute cloud
projects are based on open source cloud
management software.
๏‚จ Open Cloud Consortium is an example of one
such project
Cloud computing disadvantages
๏‚จ Cost of migrating existing systems into the
new and unfamiliar environment
๏‚จ Security and privacy of the data
๏‚ค Need additional level of data encryption
๏‚ค Need new software that will restrict access to
only authorized users
๏‚จ Major question: does cloud computing make
economic sense?
Comparing relative costs of renting vs.
buying computational services
๏‚จ According to technical report on Cloud
Computing by UC Berkeley Reliable Adaptive
Distributed Systems Laboratory
๏‚ค When all the costs of maintaining a large
compute cluster are considered the cost of
renting a data center from Amazon is marginally
more expensive than buying one.
๏‚ค When the flexibility of the cloud to support a
virtual data center that downsizes or grows as
needed is factored in, the economics start to look
better
Obstacles in moving toward cloud
computing for genomics
๏‚จ Network bandwidth โ€“ transferring 100 gigabit
next-generation sequencing data file across
10Gb/sec connection will take less than a day
only if hogging much of the institutionโ€™s
bandwidth.
๏‚จ Cloud computing service providers will have to
offer some flexibility in how large datasets get
into system.
By Jaliya Ekanayake, Thilina Gunarathne, and Judy Qui
IEEE transactions on parallel and distributed systems
Cloud Technologies for
Bioinformatics Applications
Motivation for presenting the article
๏‚จ Exponential increase in the size of datasets
implies that parallelism is essential to process the
information in a timely fashion.
๏‚จ The algorithms presented in this paper might
reduce computational analysis time significantly
whether used on our own cluster or on rented
Amazon virtual cluster.
๏‚จ This paper presents a nice example of how these
new parallel computing algorithms can be applied
for bioinformatics
MapReduce
๏‚จ MapReduce is a software framework introduced by Google to
support distributed computing on large data sets on computer
clusters.
๏‚จ Users specify a map function that processes a key/value pair to
generate a set of intermediate key/value pairs, and a reduce
function that merges all intermediate values associated with the
same intermediate key.
๏‚จ Programs written in this functional style are automatically
parallelized and executed on a large cluster of commodity
machines. The run-time system takes care of the details of
partitioning the input data, scheduling the program's execution
across a set of machines, handling machine failures, and managing
the required inter-machine communication. This allows
programmers without any experience with parallel and distributed
systems to easily utilize the resources of a large distributed system.
Tow steps of MapReduce
๏‚ค Map: The master node takes the input, chops it
up into smaller sub-problems, and distributes
those to worker nodes. The worker node
processes that smaller problem, and passes the
answer back to its master node.
๏‚ค Reduce: The master node then takes the
answers to all the sub-problems and combines
them in a way to get the output - the answer to
the problem it was originally trying to solve.
MapReduce Illustrations
1
2
3
4
Objectives
๏‚จ Examine 3 technologies:
๏‚ค Microsoft Drayd/DryadLINQ
๏‚ค Apache Hadoop
๏‚ค Message Passing Interface (MPI)
๏‚จ These 3 technologies are applied to 2
bioinformatics applications:
๏‚ค Expressed Sequence Tags (EST)
๏‚ค Alu clustering
Technolgies
๏‚จ MPI โ€“ The goal of MPI is to provide a widely used
standard for writing message passing programs. MPI is
used in distributed and parallel computing.
๏‚จ Dryad/DryadLINQ is a distributed execution engine for
coarse grain data parallel applications. It combines the
MapReduce programming style with dataflow graphs to
solve computational problems.
๏‚จ Apache Hadoop is an open source Java implementation
of MapReduce. Hadoop schedules the MapReduce
computation tasks depending on the data locality.
Bioinformatics applications
๏‚จ Alu Sequence Classification is one of the most
challenging problems for sequence clustering because
Alus represent the largest repeat families in human
genome. For the specific objectives of the paper an open
source version of the Smith Waterman โ€“ Gotoh (SW-G)
algorithm was used.
๏‚จ EST (Expressed Sequence Tag) corresponds to
messenger RNAs transcribed from genes residing on
chromosomes. Each individual EST sequence represents
a fragment of mRNA, and the EST assembly aims to
reconstruct full-length mRNA sequence for each
expressed gene. Specific implementation used for this
research is CAP3 software.
Scalability of SW-G implementations
Scalability of CAP3 implementations
How different technologies perform on
cloud computing clusters
๏‚จ The authors were interested to study the overhead of Virtual
Machines for application that use frameworks like Hadoop and
Dryad.
Performance degradation of
Hadoop SWG application on
virtual environment ranges
from 15% to 25%
How different technologies perform on
cloud computing clusters contโ€™d
The performance degradation in CAP3 remains constant near 20% .
CAP3 application does not show the decrease of VM overhead with the
increase of problem size as with SWG application.
Conclusions
๏‚จ The flexibility of clouds and MapReduce suggest they will
become the preferred approaches.
๏‚จ The support for handling large data sets, the concept of
moving computation to data, and the better quality of service
provided by the cloud technologies simplify the
implementation of some problems over traditional systems.
๏‚จ Experiments showed that DryadLINQ and Hadoop get similar
performance and that virtual machines five overheads of
around 20%.
๏‚จ The support for inhomogeneous data is important and
currently Hadoop performs better than DryadLINQ.
๏‚จ MapReduce offer higher level interface and the user needs
less explicit control of the parallelism.
Thank you

More Related Content

What's hot

International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
ย 
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Progres...
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Progres...Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Progres...
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Progres...AtakanAral
ย 
vistech2015
vistech2015vistech2015
vistech2015Jie Jiang
ย 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD Editor
ย 
Task Scheduling methodology in cloud computing
Task Scheduling methodology in cloud computing Task Scheduling methodology in cloud computing
Task Scheduling methodology in cloud computing Qutub-ud- Din
ย 
Cloud colonography distributed medical testbed over cloud
Cloud colonography distributed medical testbed over cloudCloud colonography distributed medical testbed over cloud
Cloud colonography distributed medical testbed over cloudVenkat Projects
ย 
Grid'5000: Running a Large Instrument for Parallel and Distributed Computing ...
Grid'5000: Running a Large Instrument for Parallel and Distributed Computing ...Grid'5000: Running a Large Instrument for Parallel and Distributed Computing ...
Grid'5000: Running a Large Instrument for Parallel and Distributed Computing ...Frederic Desprez
ย 
Fault tolerant mechanisms in Big Data
Fault tolerant mechanisms in Big DataFault tolerant mechanisms in Big Data
Fault tolerant mechanisms in Big DataKaran Pardeshi
ย 
Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...
Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...
Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...IOSR Journals
ย 
Multicloud Deployment of Computing Clusters for Loosely Coupled Multi Task C...
Multicloud Deployment of Computing Clusters for Loosely  Coupled Multi Task C...Multicloud Deployment of Computing Clusters for Loosely  Coupled Multi Task C...
Multicloud Deployment of Computing Clusters for Loosely Coupled Multi Task C...IOSR Journals
ย 
IEEE Emerging topic in computing Title and Abstract 2016
IEEE Emerging topic in computing Title and Abstract 2016 IEEE Emerging topic in computing Title and Abstract 2016
IEEE Emerging topic in computing Title and Abstract 2016 tsysglobalsolutions
ย 
Iaetsd implementation of secure audit process
Iaetsd implementation of secure audit processIaetsd implementation of secure audit process
Iaetsd implementation of secure audit processIaetsd Iaetsd
ย 
An optimized scientific workflow scheduling in cloud computing
An optimized scientific workflow scheduling in cloud computingAn optimized scientific workflow scheduling in cloud computing
An optimized scientific workflow scheduling in cloud computingDIGVIJAY SHINDE
ย 
Cloud Computing and PSo
Cloud Computing and PSoCloud Computing and PSo
Cloud Computing and PSosurya kumar palla
ย 
Resource Mapping Optimization for Distributed Cloud Services - PhD Thesis Def...
Resource Mapping Optimization for Distributed Cloud Services - PhD Thesis Def...Resource Mapping Optimization for Distributed Cloud Services - PhD Thesis Def...
Resource Mapping Optimization for Distributed Cloud Services - PhD Thesis Def...AtakanAral
ย 

What's hot (19)

ICICCE0298
ICICCE0298ICICCE0298
ICICCE0298
ย 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
ย 
ENERGY EFFICIENT UNEQUAL CLUSTERING ALGORITHM FOR CLUSTERED WIRELESS SENSOR N...
ENERGY EFFICIENT UNEQUAL CLUSTERING ALGORITHM FOR CLUSTERED WIRELESS SENSOR N...ENERGY EFFICIENT UNEQUAL CLUSTERING ALGORITHM FOR CLUSTERED WIRELESS SENSOR N...
ENERGY EFFICIENT UNEQUAL CLUSTERING ALGORITHM FOR CLUSTERED WIRELESS SENSOR N...
ย 
Coca1
Coca1Coca1
Coca1
ย 
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Progres...
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Progres...Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Progres...
Modeling and Optimization of Resource Allocation in Cloud [PhD Thesis Progres...
ย 
vistech2015
vistech2015vistech2015
vistech2015
ย 
International Journal of Engineering Inventions (IJEI)
International Journal of Engineering Inventions (IJEI)International Journal of Engineering Inventions (IJEI)
International Journal of Engineering Inventions (IJEI)
ย 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
ย 
Task Scheduling methodology in cloud computing
Task Scheduling methodology in cloud computing Task Scheduling methodology in cloud computing
Task Scheduling methodology in cloud computing
ย 
Cloud colonography distributed medical testbed over cloud
Cloud colonography distributed medical testbed over cloudCloud colonography distributed medical testbed over cloud
Cloud colonography distributed medical testbed over cloud
ย 
Grid'5000: Running a Large Instrument for Parallel and Distributed Computing ...
Grid'5000: Running a Large Instrument for Parallel and Distributed Computing ...Grid'5000: Running a Large Instrument for Parallel and Distributed Computing ...
Grid'5000: Running a Large Instrument for Parallel and Distributed Computing ...
ย 
Fault tolerant mechanisms in Big Data
Fault tolerant mechanisms in Big DataFault tolerant mechanisms in Big Data
Fault tolerant mechanisms in Big Data
ย 
Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...
Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...
Comparative Analysis, Security Aspects & Optimization of Workload in Gfs Base...
ย 
Multicloud Deployment of Computing Clusters for Loosely Coupled Multi Task C...
Multicloud Deployment of Computing Clusters for Loosely  Coupled Multi Task C...Multicloud Deployment of Computing Clusters for Loosely  Coupled Multi Task C...
Multicloud Deployment of Computing Clusters for Loosely Coupled Multi Task C...
ย 
IEEE Emerging topic in computing Title and Abstract 2016
IEEE Emerging topic in computing Title and Abstract 2016 IEEE Emerging topic in computing Title and Abstract 2016
IEEE Emerging topic in computing Title and Abstract 2016
ย 
Iaetsd implementation of secure audit process
Iaetsd implementation of secure audit processIaetsd implementation of secure audit process
Iaetsd implementation of secure audit process
ย 
An optimized scientific workflow scheduling in cloud computing
An optimized scientific workflow scheduling in cloud computingAn optimized scientific workflow scheduling in cloud computing
An optimized scientific workflow scheduling in cloud computing
ย 
Cloud Computing and PSo
Cloud Computing and PSoCloud Computing and PSo
Cloud Computing and PSo
ย 
Resource Mapping Optimization for Distributed Cloud Services - PhD Thesis Def...
Resource Mapping Optimization for Distributed Cloud Services - PhD Thesis Def...Resource Mapping Optimization for Distributed Cloud Services - PhD Thesis Def...
Resource Mapping Optimization for Distributed Cloud Services - PhD Thesis Def...
ย 

Viewers also liked

Presentation of the project "quadrat" - Quality Assurance Driver for adult tr...
Presentation of the project "quadrat" - Quality Assurance Driver for adult tr...Presentation of the project "quadrat" - Quality Assurance Driver for adult tr...
Presentation of the project "quadrat" - Quality Assurance Driver for adult tr...GIANLUCA COPPOLA
ย 
Vayakhel pekudei 1
Vayakhel pekudei 1Vayakhel pekudei 1
Vayakhel pekudei 1el9360
ย 
Jbossav formation-jboss-administration-avancee
Jbossav formation-jboss-administration-avanceeJbossav formation-jboss-administration-avancee
Jbossav formation-jboss-administration-avanceeCERTyou Formation
ย 
Cifocursos2015 sabadell
Cifocursos2015 sabadellCifocursos2015 sabadell
Cifocursos2015 sabadellCop de Mร 
ย 
Lin05 formation-linux-avance-serveur-d-entreprises-avec-samba-ldap-netfilter-...
Lin05 formation-linux-avance-serveur-d-entreprises-avec-samba-ldap-netfilter-...Lin05 formation-linux-avance-serveur-d-entreprises-avec-samba-ldap-netfilter-...
Lin05 formation-linux-avance-serveur-d-entreprises-avec-samba-ldap-netfilter-...CERTyou Formation
ย 
Jee008 formation-jee-jsf-java-server-faces
Jee008 formation-jee-jsf-java-server-facesJee008 formation-jee-jsf-java-server-faces
Jee008 formation-jee-jsf-java-server-facesCERTyou Formation
ย 
Las personas emprendedoras son inquietas, ambiciosas y no tienen miedo a caer
Las personas emprendedoras son inquietas, ambiciosas y no tienen miedo a caerLas personas emprendedoras son inquietas, ambiciosas y no tienen miedo a caer
Las personas emprendedoras son inquietas, ambiciosas y no tienen miedo a caerTACTIO
ย 
Bmng7312 presentation lu1
Bmng7312 presentation lu1Bmng7312 presentation lu1
Bmng7312 presentation lu1varsitycollege
ย 
Web intelligence and big data
Web intelligence and big dataWeb intelligence and big data
Web intelligence and big dataRafael Mendes
ย 
Lirik lagu rukun solat tahun 4
Lirik lagu rukun solat tahun 4Lirik lagu rukun solat tahun 4
Lirik lagu rukun solat tahun 4nurul293
ย 
เธก.3
เธก.3เธก.3
เธก.3Pak Ubss
ย 
Ijos formation-introduction-au-systeme-d-exploitation-junos-juniper
Ijos formation-introduction-au-systeme-d-exploitation-junos-juniperIjos formation-introduction-au-systeme-d-exploitation-junos-juniper
Ijos formation-introduction-au-systeme-d-exploitation-junos-juniperCERTyou Formation
ย 
Subtraรงรฃo com Pernalonga
Subtraรงรฃo com PernalongaSubtraรงรฃo com Pernalonga
Subtraรงรฃo com PernalongaGracita Fraga
ย 
samir julio
samir juliosamir julio
samir juliosamir julio
ย 

Viewers also liked (15)

Resume 2015
Resume 2015Resume 2015
Resume 2015
ย 
Presentation of the project "quadrat" - Quality Assurance Driver for adult tr...
Presentation of the project "quadrat" - Quality Assurance Driver for adult tr...Presentation of the project "quadrat" - Quality Assurance Driver for adult tr...
Presentation of the project "quadrat" - Quality Assurance Driver for adult tr...
ย 
Vayakhel pekudei 1
Vayakhel pekudei 1Vayakhel pekudei 1
Vayakhel pekudei 1
ย 
Jbossav formation-jboss-administration-avancee
Jbossav formation-jboss-administration-avanceeJbossav formation-jboss-administration-avancee
Jbossav formation-jboss-administration-avancee
ย 
Cifocursos2015 sabadell
Cifocursos2015 sabadellCifocursos2015 sabadell
Cifocursos2015 sabadell
ย 
Lin05 formation-linux-avance-serveur-d-entreprises-avec-samba-ldap-netfilter-...
Lin05 formation-linux-avance-serveur-d-entreprises-avec-samba-ldap-netfilter-...Lin05 formation-linux-avance-serveur-d-entreprises-avec-samba-ldap-netfilter-...
Lin05 formation-linux-avance-serveur-d-entreprises-avec-samba-ldap-netfilter-...
ย 
Jee008 formation-jee-jsf-java-server-faces
Jee008 formation-jee-jsf-java-server-facesJee008 formation-jee-jsf-java-server-faces
Jee008 formation-jee-jsf-java-server-faces
ย 
Las personas emprendedoras son inquietas, ambiciosas y no tienen miedo a caer
Las personas emprendedoras son inquietas, ambiciosas y no tienen miedo a caerLas personas emprendedoras son inquietas, ambiciosas y no tienen miedo a caer
Las personas emprendedoras son inquietas, ambiciosas y no tienen miedo a caer
ย 
Bmng7312 presentation lu1
Bmng7312 presentation lu1Bmng7312 presentation lu1
Bmng7312 presentation lu1
ย 
Web intelligence and big data
Web intelligence and big dataWeb intelligence and big data
Web intelligence and big data
ย 
Lirik lagu rukun solat tahun 4
Lirik lagu rukun solat tahun 4Lirik lagu rukun solat tahun 4
Lirik lagu rukun solat tahun 4
ย 
เธก.3
เธก.3เธก.3
เธก.3
ย 
Ijos formation-introduction-au-systeme-d-exploitation-junos-juniper
Ijos formation-introduction-au-systeme-d-exploitation-junos-juniperIjos formation-introduction-au-systeme-d-exploitation-junos-juniper
Ijos formation-introduction-au-systeme-d-exploitation-junos-juniper
ย 
Subtraรงรฃo com Pernalonga
Subtraรงรฃo com PernalongaSubtraรงรฃo com Pernalonga
Subtraรงรฃo com Pernalonga
ย 
samir julio
samir juliosamir julio
samir julio
ย 

Similar to CLOUD BIOINFORMATICS Part1

Cluster computing
Cluster computingCluster computing
Cluster computingpooja khatana
ย 
CloudComputing_UNIT5.pdf
CloudComputing_UNIT5.pdfCloudComputing_UNIT5.pdf
CloudComputing_UNIT5.pdfkhan593595
ย 
Parallel and distributed system projects for java and dot net
Parallel and distributed system projects for java and dot netParallel and distributed system projects for java and dot net
Parallel and distributed system projects for java and dot netredpel dot com
ย 
Gridcomputingppt
GridcomputingpptGridcomputingppt
Gridcomputingpptnavjasser
ย 
Eg4301808811
Eg4301808811Eg4301808811
Eg4301808811IJERA Editor
ย 
A Study on Replication and Failover Cluster to Maximize System Uptime
A Study on Replication and Failover Cluster to Maximize System UptimeA Study on Replication and Failover Cluster to Maximize System Uptime
A Study on Replication and Failover Cluster to Maximize System UptimeYogeshIJTSRD
ย 
ZCloud Consensus on Hardware for Distributed Systems
ZCloud Consensus on Hardware for Distributed SystemsZCloud Consensus on Hardware for Distributed Systems
ZCloud Consensus on Hardware for Distributed SystemsGokhan Boranalp
ย 
Computing Outside The Box September 2009
Computing Outside The Box September 2009Computing Outside The Box September 2009
Computing Outside The Box September 2009Ian Foster
ย 
Application-Aware Big Data Deduplication in Cloud Environment
Application-Aware Big Data Deduplication in Cloud EnvironmentApplication-Aware Big Data Deduplication in Cloud Environment
Application-Aware Big Data Deduplication in Cloud EnvironmentSafayet Hossain
ย 
Data Analysis In The Cloud
Data Analysis In The CloudData Analysis In The Cloud
Data Analysis In The CloudMonica Carter
ย 
Paralyzing Bioinformatics Applications Using Conducive Hadoop Cluster
Paralyzing Bioinformatics Applications Using Conducive Hadoop ClusterParalyzing Bioinformatics Applications Using Conducive Hadoop Cluster
Paralyzing Bioinformatics Applications Using Conducive Hadoop ClusterIOSR Journals
ย 
Ieee transactions on 2018 network and service management
Ieee transactions on 2018 network and service managementIeee transactions on 2018 network and service management
Ieee transactions on 2018 network and service managementtsysglobalsolutions
ย 
Clustering by AKASHMSHAH
Clustering by AKASHMSHAHClustering by AKASHMSHAH
Clustering by AKASHMSHAHAkash M Shah
ย 
2014 IEEE JAVA CLOUD COMPUTING PROJECT Adaptive algorithm for minimizing clou...
2014 IEEE JAVA CLOUD COMPUTING PROJECT Adaptive algorithm for minimizing clou...2014 IEEE JAVA CLOUD COMPUTING PROJECT Adaptive algorithm for minimizing clou...
2014 IEEE JAVA CLOUD COMPUTING PROJECT Adaptive algorithm for minimizing clou...IEEEFINALSEMSTUDENTPROJECTS
ย 
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Adaptive algorithm for minimizing clo...
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Adaptive algorithm for minimizing clo...IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Adaptive algorithm for minimizing clo...
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Adaptive algorithm for minimizing clo...IEEEGLOBALSOFTSTUDENTPROJECTS
ย 
Qiu bosc2010
Qiu bosc2010Qiu bosc2010
Qiu bosc2010BOSC 2010
ย 
Cluster Computers
Cluster ComputersCluster Computers
Cluster Computersshopnil786
ย 

Similar to CLOUD BIOINFORMATICS Part1 (20)

Cluster computing
Cluster computingCluster computing
Cluster computing
ย 
Grid Computing
Grid ComputingGrid Computing
Grid Computing
ย 
CloudComputing_UNIT5.pdf
CloudComputing_UNIT5.pdfCloudComputing_UNIT5.pdf
CloudComputing_UNIT5.pdf
ย 
Parallel and distributed system projects for java and dot net
Parallel and distributed system projects for java and dot netParallel and distributed system projects for java and dot net
Parallel and distributed system projects for java and dot net
ย 
Gridcomputingppt
GridcomputingpptGridcomputingppt
Gridcomputingppt
ย 
Eg4301808811
Eg4301808811Eg4301808811
Eg4301808811
ย 
A Study on Replication and Failover Cluster to Maximize System Uptime
A Study on Replication and Failover Cluster to Maximize System UptimeA Study on Replication and Failover Cluster to Maximize System Uptime
A Study on Replication and Failover Cluster to Maximize System Uptime
ย 
ZCloud Consensus on Hardware for Distributed Systems
ZCloud Consensus on Hardware for Distributed SystemsZCloud Consensus on Hardware for Distributed Systems
ZCloud Consensus on Hardware for Distributed Systems
ย 
Computing Outside The Box September 2009
Computing Outside The Box September 2009Computing Outside The Box September 2009
Computing Outside The Box September 2009
ย 
Application-Aware Big Data Deduplication in Cloud Environment
Application-Aware Big Data Deduplication in Cloud EnvironmentApplication-Aware Big Data Deduplication in Cloud Environment
Application-Aware Big Data Deduplication in Cloud Environment
ย 
[IJET V2I5P18] Authors:Pooja Mangla, Dr. Sandip Kumar Goyal
[IJET V2I5P18] Authors:Pooja Mangla, Dr. Sandip Kumar Goyal[IJET V2I5P18] Authors:Pooja Mangla, Dr. Sandip Kumar Goyal
[IJET V2I5P18] Authors:Pooja Mangla, Dr. Sandip Kumar Goyal
ย 
Facade
FacadeFacade
Facade
ย 
Data Analysis In The Cloud
Data Analysis In The CloudData Analysis In The Cloud
Data Analysis In The Cloud
ย 
Paralyzing Bioinformatics Applications Using Conducive Hadoop Cluster
Paralyzing Bioinformatics Applications Using Conducive Hadoop ClusterParalyzing Bioinformatics Applications Using Conducive Hadoop Cluster
Paralyzing Bioinformatics Applications Using Conducive Hadoop Cluster
ย 
Ieee transactions on 2018 network and service management
Ieee transactions on 2018 network and service managementIeee transactions on 2018 network and service management
Ieee transactions on 2018 network and service management
ย 
Clustering by AKASHMSHAH
Clustering by AKASHMSHAHClustering by AKASHMSHAH
Clustering by AKASHMSHAH
ย 
2014 IEEE JAVA CLOUD COMPUTING PROJECT Adaptive algorithm for minimizing clou...
2014 IEEE JAVA CLOUD COMPUTING PROJECT Adaptive algorithm for minimizing clou...2014 IEEE JAVA CLOUD COMPUTING PROJECT Adaptive algorithm for minimizing clou...
2014 IEEE JAVA CLOUD COMPUTING PROJECT Adaptive algorithm for minimizing clou...
ย 
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Adaptive algorithm for minimizing clo...
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Adaptive algorithm for minimizing clo...IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Adaptive algorithm for minimizing clo...
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS Adaptive algorithm for minimizing clo...
ย 
Qiu bosc2010
Qiu bosc2010Qiu bosc2010
Qiu bosc2010
ย 
Cluster Computers
Cluster ComputersCluster Computers
Cluster Computers
ย 

Recently uploaded

Shazia Iqbal 2024 - Bioorganic Chemistry.pdf
Shazia Iqbal 2024 - Bioorganic Chemistry.pdfShazia Iqbal 2024 - Bioorganic Chemistry.pdf
Shazia Iqbal 2024 - Bioorganic Chemistry.pdfTrustlife
ย 
Call Girl In Indore ๐Ÿ“ž9235973566๐Ÿ“ž Just๐Ÿ“ฒ Call Inaaya Indore Call Girls Service ...
Call Girl In Indore ๐Ÿ“ž9235973566๐Ÿ“ž Just๐Ÿ“ฒ Call Inaaya Indore Call Girls Service ...Call Girl In Indore ๐Ÿ“ž9235973566๐Ÿ“ž Just๐Ÿ“ฒ Call Inaaya Indore Call Girls Service ...
Call Girl In Indore ๐Ÿ“ž9235973566๐Ÿ“ž Just๐Ÿ“ฒ Call Inaaya Indore Call Girls Service ...Sheetaleventcompany
ย 
Goa Call Girl Service ๐Ÿ“ž9xx000xx09๐Ÿ“žJust Call Divya๐Ÿ“ฒ Call Girl In Goa No๐Ÿ’ฐAdvanc...
Goa Call Girl Service ๐Ÿ“ž9xx000xx09๐Ÿ“žJust Call Divya๐Ÿ“ฒ Call Girl In Goa No๐Ÿ’ฐAdvanc...Goa Call Girl Service ๐Ÿ“ž9xx000xx09๐Ÿ“žJust Call Divya๐Ÿ“ฒ Call Girl In Goa No๐Ÿ’ฐAdvanc...
Goa Call Girl Service ๐Ÿ“ž9xx000xx09๐Ÿ“žJust Call Divya๐Ÿ“ฒ Call Girl In Goa No๐Ÿ’ฐAdvanc...Sheetaleventcompany
ย 
Kolkata Call Girls Shobhabazar ๐Ÿ’ฏCall Us ๐Ÿ” 8005736733 ๐Ÿ” ๐Ÿ’ƒ Top Class Call Gir...
Kolkata Call Girls Shobhabazar  ๐Ÿ’ฏCall Us ๐Ÿ” 8005736733 ๐Ÿ” ๐Ÿ’ƒ  Top Class Call Gir...Kolkata Call Girls Shobhabazar  ๐Ÿ’ฏCall Us ๐Ÿ” 8005736733 ๐Ÿ” ๐Ÿ’ƒ  Top Class Call Gir...
Kolkata Call Girls Shobhabazar ๐Ÿ’ฏCall Us ๐Ÿ” 8005736733 ๐Ÿ” ๐Ÿ’ƒ Top Class Call Gir...Namrata Singh
ย 
ANATOMY AND PHYSIOLOGY OF RESPIRATORY SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF RESPIRATORY SYSTEM.pptxANATOMY AND PHYSIOLOGY OF RESPIRATORY SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF RESPIRATORY SYSTEM.pptxSwetaba Besh
ย 
๐Ÿ’šChandigarh Call Girls ๐Ÿ’ฏRiya ๐Ÿ“ฒ๐Ÿ”8868886958๐Ÿ”Call Girls In Chandigarh No๐Ÿ’ฐAdvance...
๐Ÿ’šChandigarh Call Girls ๐Ÿ’ฏRiya ๐Ÿ“ฒ๐Ÿ”8868886958๐Ÿ”Call Girls In Chandigarh No๐Ÿ’ฐAdvance...๐Ÿ’šChandigarh Call Girls ๐Ÿ’ฏRiya ๐Ÿ“ฒ๐Ÿ”8868886958๐Ÿ”Call Girls In Chandigarh No๐Ÿ’ฐAdvance...
๐Ÿ’šChandigarh Call Girls ๐Ÿ’ฏRiya ๐Ÿ“ฒ๐Ÿ”8868886958๐Ÿ”Call Girls In Chandigarh No๐Ÿ’ฐAdvance...Sheetaleventcompany
ย 
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...Sheetaleventcompany
ย 
Most Beautiful Call Girl in Chennai 7427069034 Contact on WhatsApp
Most Beautiful Call Girl in Chennai 7427069034 Contact on WhatsAppMost Beautiful Call Girl in Chennai 7427069034 Contact on WhatsApp
Most Beautiful Call Girl in Chennai 7427069034 Contact on WhatsAppjimmihoslasi
ย 
Genuine Call Girls Hyderabad 9630942363 Book High Profile Call Girl in Hydera...
Genuine Call Girls Hyderabad 9630942363 Book High Profile Call Girl in Hydera...Genuine Call Girls Hyderabad 9630942363 Book High Profile Call Girl in Hydera...
Genuine Call Girls Hyderabad 9630942363 Book High Profile Call Girl in Hydera...GENUINE ESCORT AGENCY
ย 
Circulatory Shock, types and stages, compensatory mechanisms
Circulatory Shock, types and stages, compensatory mechanismsCirculatory Shock, types and stages, compensatory mechanisms
Circulatory Shock, types and stages, compensatory mechanismsMedicoseAcademics
ย 
Bandra East [ best call girls in Mumbai Get 50% Off On VIP Escorts Service 90...
Bandra East [ best call girls in Mumbai Get 50% Off On VIP Escorts Service 90...Bandra East [ best call girls in Mumbai Get 50% Off On VIP Escorts Service 90...
Bandra East [ best call girls in Mumbai Get 50% Off On VIP Escorts Service 90...Angel
ย 
Dehradun Call Girl Service โค๏ธ๐Ÿ‘ 8854095900 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Dehradun
Dehradun Call Girl Service โค๏ธ๐Ÿ‘ 8854095900 ๐Ÿ‘„๐ŸซฆIndependent Escort Service DehradunDehradun Call Girl Service โค๏ธ๐Ÿ‘ 8854095900 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Dehradun
Dehradun Call Girl Service โค๏ธ๐Ÿ‘ 8854095900 ๐Ÿ‘„๐ŸซฆIndependent Escort Service DehradunSheetaleventcompany
ย 
Premium Call Girls Dehradun {8854095900} โค๏ธVVIP ANJU Call Girls in Dehradun U...
Premium Call Girls Dehradun {8854095900} โค๏ธVVIP ANJU Call Girls in Dehradun U...Premium Call Girls Dehradun {8854095900} โค๏ธVVIP ANJU Call Girls in Dehradun U...
Premium Call Girls Dehradun {8854095900} โค๏ธVVIP ANJU Call Girls in Dehradun U...Sheetaleventcompany
ย 
tongue disease lecture Dr Assadawy legacy
tongue disease lecture Dr Assadawy legacytongue disease lecture Dr Assadawy legacy
tongue disease lecture Dr Assadawy legacyDrMohamed Assadawy
ย 
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...Sheetaleventcompany
ย 
โค๏ธCall Girl Service In Chandigarhโ˜Ž๏ธ9814379184โ˜Ž๏ธ Call Girl in Chandigarhโ˜Ž๏ธ Cha...
โค๏ธCall Girl Service In Chandigarhโ˜Ž๏ธ9814379184โ˜Ž๏ธ Call Girl in Chandigarhโ˜Ž๏ธ Cha...โค๏ธCall Girl Service In Chandigarhโ˜Ž๏ธ9814379184โ˜Ž๏ธ Call Girl in Chandigarhโ˜Ž๏ธ Cha...
โค๏ธCall Girl Service In Chandigarhโ˜Ž๏ธ9814379184โ˜Ž๏ธ Call Girl in Chandigarhโ˜Ž๏ธ Cha...Sheetaleventcompany
ย 
Intramuscular & Intravenous Injection.pptx
Intramuscular & Intravenous Injection.pptxIntramuscular & Intravenous Injection.pptx
Intramuscular & Intravenous Injection.pptxsaranpratha12
ย 
ANATOMY AND PHYSIOLOGY OF REPRODUCTIVE SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF REPRODUCTIVE SYSTEM.pptxANATOMY AND PHYSIOLOGY OF REPRODUCTIVE SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF REPRODUCTIVE SYSTEM.pptxSwetaba Besh
ย 
โค๏ธChandigarh Escorts Serviceโ˜Ž๏ธ9814379184โ˜Ž๏ธ Call Girl service in Chandigarhโ˜Ž๏ธ ...
โค๏ธChandigarh Escorts Serviceโ˜Ž๏ธ9814379184โ˜Ž๏ธ Call Girl service in Chandigarhโ˜Ž๏ธ ...โค๏ธChandigarh Escorts Serviceโ˜Ž๏ธ9814379184โ˜Ž๏ธ Call Girl service in Chandigarhโ˜Ž๏ธ ...
โค๏ธChandigarh Escorts Serviceโ˜Ž๏ธ9814379184โ˜Ž๏ธ Call Girl service in Chandigarhโ˜Ž๏ธ ...Sheetaleventcompany
ย 
Whitefield { Call Girl in Bangalore โ‚น7.5k Pick Up & Drop With Cash Payment 63...
Whitefield { Call Girl in Bangalore โ‚น7.5k Pick Up & Drop With Cash Payment 63...Whitefield { Call Girl in Bangalore โ‚น7.5k Pick Up & Drop With Cash Payment 63...
Whitefield { Call Girl in Bangalore โ‚น7.5k Pick Up & Drop With Cash Payment 63...dishamehta3332
ย 

Recently uploaded (20)

Shazia Iqbal 2024 - Bioorganic Chemistry.pdf
Shazia Iqbal 2024 - Bioorganic Chemistry.pdfShazia Iqbal 2024 - Bioorganic Chemistry.pdf
Shazia Iqbal 2024 - Bioorganic Chemistry.pdf
ย 
Call Girl In Indore ๐Ÿ“ž9235973566๐Ÿ“ž Just๐Ÿ“ฒ Call Inaaya Indore Call Girls Service ...
Call Girl In Indore ๐Ÿ“ž9235973566๐Ÿ“ž Just๐Ÿ“ฒ Call Inaaya Indore Call Girls Service ...Call Girl In Indore ๐Ÿ“ž9235973566๐Ÿ“ž Just๐Ÿ“ฒ Call Inaaya Indore Call Girls Service ...
Call Girl In Indore ๐Ÿ“ž9235973566๐Ÿ“ž Just๐Ÿ“ฒ Call Inaaya Indore Call Girls Service ...
ย 
Goa Call Girl Service ๐Ÿ“ž9xx000xx09๐Ÿ“žJust Call Divya๐Ÿ“ฒ Call Girl In Goa No๐Ÿ’ฐAdvanc...
Goa Call Girl Service ๐Ÿ“ž9xx000xx09๐Ÿ“žJust Call Divya๐Ÿ“ฒ Call Girl In Goa No๐Ÿ’ฐAdvanc...Goa Call Girl Service ๐Ÿ“ž9xx000xx09๐Ÿ“žJust Call Divya๐Ÿ“ฒ Call Girl In Goa No๐Ÿ’ฐAdvanc...
Goa Call Girl Service ๐Ÿ“ž9xx000xx09๐Ÿ“žJust Call Divya๐Ÿ“ฒ Call Girl In Goa No๐Ÿ’ฐAdvanc...
ย 
Kolkata Call Girls Shobhabazar ๐Ÿ’ฏCall Us ๐Ÿ” 8005736733 ๐Ÿ” ๐Ÿ’ƒ Top Class Call Gir...
Kolkata Call Girls Shobhabazar  ๐Ÿ’ฏCall Us ๐Ÿ” 8005736733 ๐Ÿ” ๐Ÿ’ƒ  Top Class Call Gir...Kolkata Call Girls Shobhabazar  ๐Ÿ’ฏCall Us ๐Ÿ” 8005736733 ๐Ÿ” ๐Ÿ’ƒ  Top Class Call Gir...
Kolkata Call Girls Shobhabazar ๐Ÿ’ฏCall Us ๐Ÿ” 8005736733 ๐Ÿ” ๐Ÿ’ƒ Top Class Call Gir...
ย 
ANATOMY AND PHYSIOLOGY OF RESPIRATORY SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF RESPIRATORY SYSTEM.pptxANATOMY AND PHYSIOLOGY OF RESPIRATORY SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF RESPIRATORY SYSTEM.pptx
ย 
๐Ÿ’šChandigarh Call Girls ๐Ÿ’ฏRiya ๐Ÿ“ฒ๐Ÿ”8868886958๐Ÿ”Call Girls In Chandigarh No๐Ÿ’ฐAdvance...
๐Ÿ’šChandigarh Call Girls ๐Ÿ’ฏRiya ๐Ÿ“ฒ๐Ÿ”8868886958๐Ÿ”Call Girls In Chandigarh No๐Ÿ’ฐAdvance...๐Ÿ’šChandigarh Call Girls ๐Ÿ’ฏRiya ๐Ÿ“ฒ๐Ÿ”8868886958๐Ÿ”Call Girls In Chandigarh No๐Ÿ’ฐAdvance...
๐Ÿ’šChandigarh Call Girls ๐Ÿ’ฏRiya ๐Ÿ“ฒ๐Ÿ”8868886958๐Ÿ”Call Girls In Chandigarh No๐Ÿ’ฐAdvance...
ย 
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...
ย 
Most Beautiful Call Girl in Chennai 7427069034 Contact on WhatsApp
Most Beautiful Call Girl in Chennai 7427069034 Contact on WhatsAppMost Beautiful Call Girl in Chennai 7427069034 Contact on WhatsApp
Most Beautiful Call Girl in Chennai 7427069034 Contact on WhatsApp
ย 
Genuine Call Girls Hyderabad 9630942363 Book High Profile Call Girl in Hydera...
Genuine Call Girls Hyderabad 9630942363 Book High Profile Call Girl in Hydera...Genuine Call Girls Hyderabad 9630942363 Book High Profile Call Girl in Hydera...
Genuine Call Girls Hyderabad 9630942363 Book High Profile Call Girl in Hydera...
ย 
Circulatory Shock, types and stages, compensatory mechanisms
Circulatory Shock, types and stages, compensatory mechanismsCirculatory Shock, types and stages, compensatory mechanisms
Circulatory Shock, types and stages, compensatory mechanisms
ย 
Bandra East [ best call girls in Mumbai Get 50% Off On VIP Escorts Service 90...
Bandra East [ best call girls in Mumbai Get 50% Off On VIP Escorts Service 90...Bandra East [ best call girls in Mumbai Get 50% Off On VIP Escorts Service 90...
Bandra East [ best call girls in Mumbai Get 50% Off On VIP Escorts Service 90...
ย 
Dehradun Call Girl Service โค๏ธ๐Ÿ‘ 8854095900 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Dehradun
Dehradun Call Girl Service โค๏ธ๐Ÿ‘ 8854095900 ๐Ÿ‘„๐ŸซฆIndependent Escort Service DehradunDehradun Call Girl Service โค๏ธ๐Ÿ‘ 8854095900 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Dehradun
Dehradun Call Girl Service โค๏ธ๐Ÿ‘ 8854095900 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Dehradun
ย 
Premium Call Girls Dehradun {8854095900} โค๏ธVVIP ANJU Call Girls in Dehradun U...
Premium Call Girls Dehradun {8854095900} โค๏ธVVIP ANJU Call Girls in Dehradun U...Premium Call Girls Dehradun {8854095900} โค๏ธVVIP ANJU Call Girls in Dehradun U...
Premium Call Girls Dehradun {8854095900} โค๏ธVVIP ANJU Call Girls in Dehradun U...
ย 
tongue disease lecture Dr Assadawy legacy
tongue disease lecture Dr Assadawy legacytongue disease lecture Dr Assadawy legacy
tongue disease lecture Dr Assadawy legacy
ย 
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...
Chandigarh Call Girls Service โค๏ธ๐Ÿ‘ 9809698092 ๐Ÿ‘„๐ŸซฆIndependent Escort Service Cha...
ย 
โค๏ธCall Girl Service In Chandigarhโ˜Ž๏ธ9814379184โ˜Ž๏ธ Call Girl in Chandigarhโ˜Ž๏ธ Cha...
โค๏ธCall Girl Service In Chandigarhโ˜Ž๏ธ9814379184โ˜Ž๏ธ Call Girl in Chandigarhโ˜Ž๏ธ Cha...โค๏ธCall Girl Service In Chandigarhโ˜Ž๏ธ9814379184โ˜Ž๏ธ Call Girl in Chandigarhโ˜Ž๏ธ Cha...
โค๏ธCall Girl Service In Chandigarhโ˜Ž๏ธ9814379184โ˜Ž๏ธ Call Girl in Chandigarhโ˜Ž๏ธ Cha...
ย 
Intramuscular & Intravenous Injection.pptx
Intramuscular & Intravenous Injection.pptxIntramuscular & Intravenous Injection.pptx
Intramuscular & Intravenous Injection.pptx
ย 
ANATOMY AND PHYSIOLOGY OF REPRODUCTIVE SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF REPRODUCTIVE SYSTEM.pptxANATOMY AND PHYSIOLOGY OF REPRODUCTIVE SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF REPRODUCTIVE SYSTEM.pptx
ย 
โค๏ธChandigarh Escorts Serviceโ˜Ž๏ธ9814379184โ˜Ž๏ธ Call Girl service in Chandigarhโ˜Ž๏ธ ...
โค๏ธChandigarh Escorts Serviceโ˜Ž๏ธ9814379184โ˜Ž๏ธ Call Girl service in Chandigarhโ˜Ž๏ธ ...โค๏ธChandigarh Escorts Serviceโ˜Ž๏ธ9814379184โ˜Ž๏ธ Call Girl service in Chandigarhโ˜Ž๏ธ ...
โค๏ธChandigarh Escorts Serviceโ˜Ž๏ธ9814379184โ˜Ž๏ธ Call Girl service in Chandigarhโ˜Ž๏ธ ...
ย 
Whitefield { Call Girl in Bangalore โ‚น7.5k Pick Up & Drop With Cash Payment 63...
Whitefield { Call Girl in Bangalore โ‚น7.5k Pick Up & Drop With Cash Payment 63...Whitefield { Call Girl in Bangalore โ‚น7.5k Pick Up & Drop With Cash Payment 63...
Whitefield { Call Girl in Bangalore โ‚น7.5k Pick Up & Drop With Cash Payment 63...
ย 

CLOUD BIOINFORMATICS Part1

  • 2. Introduction ๏ฌ The impending collapse of the genome informatics ๏ฌ Major sequence-data relateddatabases: GenBank, EMBL, DDBJ, SRA, GEO, and ArrayExpress ๏ฌ Value-added integrators of genomic data: Ensembl, UCSC Genome Browser, Galaxy, and many model organism databases.
  • 3. The old genome informatics ecosystem
  • 4. Basis for the production and consumption of genomic information ๏ฌ Moore's Law: โ€œthe number of transistors that can be placed on an integrated circuit board is increasing exponentially, with a doubling time of roughly 18 months.โ€ ๏ฌ Similar laws for disk storage and network capacity have also been observed. ๏ฌ Until now the doubling time for DNA sequencing was just a bit slower than the growth of compute and storage capacity.
  • 5. Historical trends in storage prices vs DNA sequencing costs Notice: this is a logarithmic plot โ€“ exponential curves appear as straight lines. โ€œNext generationโ€ sequencing technologies in the mid-2000s changed the trends and now threatens the conventional genome informatics ecosystem
  • 6. Current Situation ๏‚จ Soon it will cost less to sequence a base of DNA than to store it on hard disk ๏‚จ We are facing potential tsunami of genome data that will swamp our storage system and crush our compute clusters ๏‚จ Who is affected: ๏‚ค Genome sequence repositories who need maintain and timely update their data ๏‚ค โ€œPower usersโ€ who accustomed to download the data to local computer for analysis
  • 7. Cloud Computing ๏‚จ Cloud computing is a general term for computation-as-a-service ๏‚จ Computation-as-a-service means that customers rent the hardware and the storage only for the time needed to achieve their goals ๏‚จ In cloud computing the rentals are virtual ๏‚จ The service provider implements the infrastructure to give users the ability to create, upload, and launch virtual machines on their extremely large and powerful compute farm.
  • 8. Virtual Machine ๏‚จ Virtual machine is software that emulates the properties of a computer. ๏‚จ Any operating system can be chosen to run on the virtual machine and all the tasks we usually perform on the computer can be performed on virtual machine. ๏‚จ Virtual machine bootable disk can be stored as an image. This image can used as a template to start up multiple virtual machines โ€“ launching a virtual compute cluster is very easy. ๏‚จ Storing large data sets is usually done with virtual disks. Virtual disk image can be attached to virtual machine as local or shared hard drive.
  • 9. The โ€œnewโ€ genome informatics ecosystem based on cloud computing Continue to work with the data via web- pages in their accustomed way Have two options: continue downloading data to local clusters as before, or use cloud computing โ€“ move the cluster to the data.
  • 10. Cloud computing service providers and their offered services ๏‚จ Some of the cloud computing service providers are: Amazon (Elastic Cloud Computing), Rackspace Cloud, Flexiant. ๏‚จ Amazon offers a variety of bioinformatics-oriented virtual machine images: ๏‚ค Images prepopulated by Galaxy ๏‚ค Bioconductor โ€“ programming environment with R statistic package ๏‚ค GBrowser โ€“ genome browser ๏‚ค BioPerl ๏‚ค JCVI Cloud BioLinux โ€“ collection of bioinformatics tools including Celera Assembler ๏‚จ Amazon also provides several large genomic datasets in its cloud: ๏‚ค Complete copy of GeneBank (200 Gb) ๏‚ค 30x coverage sequencing reads of a trio of individuals from 1000 Genome Project (700Gb) ๏‚ค Genome databases from Ensembl including annotated human and 50 other species genomes
  • 11. Cloud computing service providers from Nature Biotechnology news article โ€œUp in a cloud?โ€
  • 12. Academic cloud computing ๏‚จ Growing number of academic compute cloud projects are based on open source cloud management software. ๏‚จ Open Cloud Consortium is an example of one such project
  • 13. Cloud computing disadvantages ๏‚จ Cost of migrating existing systems into the new and unfamiliar environment ๏‚จ Security and privacy of the data ๏‚ค Need additional level of data encryption ๏‚ค Need new software that will restrict access to only authorized users ๏‚จ Major question: does cloud computing make economic sense?
  • 14. Comparing relative costs of renting vs. buying computational services ๏‚จ According to technical report on Cloud Computing by UC Berkeley Reliable Adaptive Distributed Systems Laboratory ๏‚ค When all the costs of maintaining a large compute cluster are considered the cost of renting a data center from Amazon is marginally more expensive than buying one. ๏‚ค When the flexibility of the cloud to support a virtual data center that downsizes or grows as needed is factored in, the economics start to look better
  • 15. Obstacles in moving toward cloud computing for genomics ๏‚จ Network bandwidth โ€“ transferring 100 gigabit next-generation sequencing data file across 10Gb/sec connection will take less than a day only if hogging much of the institutionโ€™s bandwidth. ๏‚จ Cloud computing service providers will have to offer some flexibility in how large datasets get into system.
  • 16. By Jaliya Ekanayake, Thilina Gunarathne, and Judy Qui IEEE transactions on parallel and distributed systems Cloud Technologies for Bioinformatics Applications
  • 17. Motivation for presenting the article ๏‚จ Exponential increase in the size of datasets implies that parallelism is essential to process the information in a timely fashion. ๏‚จ The algorithms presented in this paper might reduce computational analysis time significantly whether used on our own cluster or on rented Amazon virtual cluster. ๏‚จ This paper presents a nice example of how these new parallel computing algorithms can be applied for bioinformatics
  • 18. MapReduce ๏‚จ MapReduce is a software framework introduced by Google to support distributed computing on large data sets on computer clusters. ๏‚จ Users specify a map function that processes a key/value pair to generate a set of intermediate key/value pairs, and a reduce function that merges all intermediate values associated with the same intermediate key. ๏‚จ Programs written in this functional style are automatically parallelized and executed on a large cluster of commodity machines. The run-time system takes care of the details of partitioning the input data, scheduling the program's execution across a set of machines, handling machine failures, and managing the required inter-machine communication. This allows programmers without any experience with parallel and distributed systems to easily utilize the resources of a large distributed system.
  • 19. Tow steps of MapReduce ๏‚ค Map: The master node takes the input, chops it up into smaller sub-problems, and distributes those to worker nodes. The worker node processes that smaller problem, and passes the answer back to its master node. ๏‚ค Reduce: The master node then takes the answers to all the sub-problems and combines them in a way to get the output - the answer to the problem it was originally trying to solve.
  • 21. Objectives ๏‚จ Examine 3 technologies: ๏‚ค Microsoft Drayd/DryadLINQ ๏‚ค Apache Hadoop ๏‚ค Message Passing Interface (MPI) ๏‚จ These 3 technologies are applied to 2 bioinformatics applications: ๏‚ค Expressed Sequence Tags (EST) ๏‚ค Alu clustering
  • 22. Technolgies ๏‚จ MPI โ€“ The goal of MPI is to provide a widely used standard for writing message passing programs. MPI is used in distributed and parallel computing. ๏‚จ Dryad/DryadLINQ is a distributed execution engine for coarse grain data parallel applications. It combines the MapReduce programming style with dataflow graphs to solve computational problems. ๏‚จ Apache Hadoop is an open source Java implementation of MapReduce. Hadoop schedules the MapReduce computation tasks depending on the data locality.
  • 23. Bioinformatics applications ๏‚จ Alu Sequence Classification is one of the most challenging problems for sequence clustering because Alus represent the largest repeat families in human genome. For the specific objectives of the paper an open source version of the Smith Waterman โ€“ Gotoh (SW-G) algorithm was used. ๏‚จ EST (Expressed Sequence Tag) corresponds to messenger RNAs transcribed from genes residing on chromosomes. Each individual EST sequence represents a fragment of mRNA, and the EST assembly aims to reconstruct full-length mRNA sequence for each expressed gene. Specific implementation used for this research is CAP3 software.
  • 24. Scalability of SW-G implementations
  • 25. Scalability of CAP3 implementations
  • 26. How different technologies perform on cloud computing clusters ๏‚จ The authors were interested to study the overhead of Virtual Machines for application that use frameworks like Hadoop and Dryad. Performance degradation of Hadoop SWG application on virtual environment ranges from 15% to 25%
  • 27. How different technologies perform on cloud computing clusters contโ€™d The performance degradation in CAP3 remains constant near 20% . CAP3 application does not show the decrease of VM overhead with the increase of problem size as with SWG application.
  • 28. Conclusions ๏‚จ The flexibility of clouds and MapReduce suggest they will become the preferred approaches. ๏‚จ The support for handling large data sets, the concept of moving computation to data, and the better quality of service provided by the cloud technologies simplify the implementation of some problems over traditional systems. ๏‚จ Experiments showed that DryadLINQ and Hadoop get similar performance and that virtual machines five overheads of around 20%. ๏‚จ The support for inhomogeneous data is important and currently Hadoop performs better than DryadLINQ. ๏‚จ MapReduce offer higher level interface and the user needs less explicit control of the parallelism.