SlideShare a Scribd company logo
1 of 4
26LinuxUser/June 2001
REAL LIFE LINUX
Hackingthegenome
Don’t be surprised at the strength of the open source ethic in the human genome
project and in the coming bioinformatics explosion, says Damian Counsell:
science has always been about public collaboration
T
he planet is inhabited
by elaborate, water-
rich organisms of var-
ious sizes, every one
built to a tight specifi-
cation. But even the
most sophisticated of these creatures
is merely a container. The inhabi-
tants of this world are soft, fleshy
bags. Inside these vessels are the true
masters: stored programs of massive
complexity. These encoded intelli-
gences influence their hosts’ appear-
ance, the time of their deaths and
much of their behaviour.
Recently some of the beings have
become aware that they are
enslaved. They may soon have the
technology to escape from internal
domination. However, from our
privileged viewpoint outside this
ecosystem, we can see an obstacle.
Some of the slave organisms are
witholding from their fellows the
information they all need to escape.
Restricting this information may
give their rulers, the selfish replica-
tors, the upper hand. We observe the
alien slaves as their hopes for free-
dom hang in the balance.
If you haven’t guessed yet, the
planet is Earth and we, Homo sapiens
sapiens, are the ‘slaves’. To quote
Steve Jones, Professor of Genetics at
University College London - presum-
ably writing of a relatively rich pop-
ulation like the British – “most peo-
ple die because of the genes they
carry.” In case you haven’t noticed a
few months’ worth of headlines, the
‘good’ draft database of those genes,
the human genome, is out.
This final draft is pretty small by
modern storage standards: three mil-
lion MB – that’s megabases (2-bit)
not megabytes (8-bit) – of our own
specifications. What is amazing is
that this library of biological infor-
mation is available for download to
your own PC and can be browsed for
free from the comfort of your own
home using the Human Genome
Browser. To cap this, in the process of
bringing home the genome,
researchers have given brought a
new science to maturity.
Bioinformatics is born
Even if you are familiar with the par-
ents – computing and biology – you
shouldn’t be ashamed if you find the
work of their offspring – bioinfor-
matics – incomprehensible. The text
of human DNA makes past winners
of obfuscated Perl competitions look
like Plain English Awardees. Some of
the biggest brains in science are
working on an interpretation even
as I write. For that job most of them
are using the same sorts of open
source tools and open source operat-
ing systems - including Linux - that
they applied to liberating our
genomes in the first place.
The human genome is the soft-
ware that programs the building and
maintenance of your body. Your
body’s ‘listing’ of the code is written
in a language of just four letters, four
different chemical modules. The
names of these chemicals are usually
shortened to their initials: A, C, G
and T. When you look at an image of
DNA, the double helical ‘staircase’
stuff of your genetic material, com-
plementary pairs of these four letters
form the ‘stairs’.
DNA is supercoiled and packed
with proteins into the complex
structures we call chromosomes. If a
genome is a library of code, each
chromosome can be thought of as a
shelf full of books of program print-
outs. One of the most amazing dis-
coveries of the genome project has
been that most of the pages in the
human library are nonsense. Less
than five per cent of all the code can
be shown to make something tangi-
ble. These biological algorithms
describe, for example, the haemoglo-
bin that carries oxygen in your blood
(and colours it red), the keratin that
makes up your skin and hair and
even the receptors in your brain that
collect the message that you are
bleeding.
Why are we so interested in the
few pages that actually make sense?
One reason is inherited genetic dis-
ease. If one letter in the copies of
your ‘book’ for haemoglobin is
wrong, for example, then, from early
childhood on you can lurch into
excruciating sickle-cell disease
‘crises’, caused by the distortion and
destruction of your blood cells by the
products of your own genes.
Disorders like this derive from errors
in the copying or repair of the books
of life or even of the ‘shelves’ - people
with Down’s Syndrome carry an
extra copy of chromosome 21.
It’s not just medical research
funds and universities who are inter-
ested in the contents of the library –
the drug companies see massive
potential too. If they can identify,
say, a gene that codes for an impor-
tant component of the brain they
might be able to develop drugs
which bind to it and alter the way it
works in whole humans, perhaps to
treat schizophrenia or depression.
More controversially, if a company
patents its knowledge of that gene, it
can demand payment from anyone
else who targets that gene product to
treat the same illnesses. The genome
is a matter of life, death and big
money.
In the beginning was The
Tape
There’s nothing new about the open
source model; science has always
been about public collaboration,
even between rivals. Anyone who
has ever been assigned a partner for
teaching practicals will understand
that the participants in this sharing
of data are not always entirely will-
ing, of course.
Sharing used to be the rule in
computer science too. In the begin-
ning was The Tape. And The Tape
was distributed to anyone who need-
ed it. The users contributed their
fixes back to the programmers and
everyone benefitted. With the
advent of the microcomputer came
the earliest suggestions of a mass
market. With the possibility of seri-
ous money came serious change.
Many people with their roots in that
LinuxUser/June 2001 27
early telecoms, academic computing
and hobbyist world felt payment for
their hard work was due from their
peers – including a certain William H.
Gates III. Perhaps this was more about
pride than cash. Perhaps the program-
mers no longer felt that the users were
truly their peers. Such disgruntled
quasi-hackers and their employers felt
the best way to ensure their just reward
was to close the source of their programs
and charge for the executable. This was
a ‘business model’ that was to prove
extremely lucrative. Some say this is a
model with strong parallels in biotech-
nology.
Many scientists don’t like this model.
Science works because you can only
publish experiments that other people
can do too. If there are any mystical
tools needed to perform your tricks you
have to be prepared to distribute them
to fellow professionals. The white-coat-
ed conjurors of science have to hand
over their wands to anyone in the magic
circle who asks. This is crucial; if an
experiment is not repeatable than it
might as well not have happened - as
the ‘discoverers’ of cold fusion might
tell you.
In biology the classic openly distrib-
uted tools are antibodies, exquisitely
specific biological molecules which can
be used to recognise and label parts of
living things. Interestingly biotech com-
panies still manage to make money
from selling antibodies in bulk, despite
their relatively free circulation among
scientists.
Nowadays, the open tools of science
include software like Linux. Even when
essential programs are ‘closed’, like the
Staden programs used since the early
days by single labs sequencing genes,
they are distributed at cost to other aca-
demics – who identify bugs and suggest
features.
A cunning plan
For years scientists had been mapping
and sequencing single genes in small
groups storing, assembling and manipu-
lating their data on single workstations
running programs like ‘Staden’. Often
these investigators were trying to find
out whereabouts and on which chro-
mosome the gene for a particular dis-
ease was hiding away. This was a long
and difficult process and it was rarely in
any one group’s interest or power to
map and sequence huge regions of the
genome – especially as much DNA
turned out to be ‘rubbish’.
From the mid ’80s, a small, ambi-
tious group of scientists, including
James Watson (one of the discoverers of
the structure of DNA) believed that hav-
ing the whole map of the human
genome at much higher resolution was
not only possible, but would massively
simplify gene hunting, increase the
power of gene research and reduce the
duplication of effort.
Not only would scientists be able to
look up their own gene of interest, but
they would immediately be able to find
the relationship between it and all the
others. This is very important; neigh-
bouring genes are far more likely to be
inherited together for example. Just like
Linux, the combined effort of geeks
worldwide would add up to something
much greater than the work of isolated
geeks.
The HuGeP, as it is affectionately
known among researchers, was carried
out at 16 centres around the world,
including substantial proportions at the
Sanger Centre on the ‘Genome Campus’
near Cambridge, UK, the Whitehead
Institute and Washington University in
the US and Keio University, Tokyo,
Japan.
The human genome is actually one
of several different species’ genomes
you can download to your hard disk
today. The previous projects on various
bugs and simple animals had already
given us some idea of approaches to
reading a genome, but on a much small-
er scale.
You don’t just put some cells into a
machine, crank the handle and read off
the code. A genome has to be assembled
at various levels, like building a gigantic
jigsaw. As you might separate out the
sky from the sea, from the people and
from the buildings, the 23 kinds of chro-
mosome found in humans were par-
celled out to various groups of scientists.
Further, the various groups had to break
their designated chromosomes into
chunks small enough to be stuffed into
an unsuspecting yeast or bacteria. The
cells of these creatures would then be
hijacked to make trillions of copies of
the human genes.
Putting it all together
This wasn’t smallest level of the puzzle.
Although the Internet was crucial to the
co-ordination of the sequencing effort
and the sharing of its fruits, the biggest
single technological advance making
the large-scale sequencing of humans
possible was the development of bril-
liantly engineered laser fluourescence
sequencing robots. Ironically, these
were sold first to private genome
hunters who had been closely involved
in their development. The main cus-
tomer was Celera Genomics. When
Craig Venter, currently Celera’s presi-
dent and chief scientific officer, vowed
in 1998 that he would use these robots
to complete a draft of the human
genome by 2001, the public project
knew it had a race on.
Many perceived Celera’s US$70 mil-
lion challenge as a threat. Although
there had already had been a controver-
sy over the US National Institutes of
Molecular graphic courtesy of Elaine
Meng, Computer Graphics Laboratory,
University of California, San
Francisco. images were produced
using the MidasPlus program
(supported by NIH RR-01081)
28LinuxUser/June 2001
REAL LIFE LINUX
Health and its intention to patent certain genes identi-
fied by Venter’s so-called ‘EST’ method of gene finding,
this was nothing compared to the growing fear of a large-
scale stakeout of our genetic heritage by corporate inter-
ests. Despite Watson’s initial scathing comments on
robot sequencers, leaders of the public project knew they
had to have the best engines to have any chance in the
race. To quote the chairman of Perkin Elmer who made
the robot sequencers, “The day we announced Celera we
set off an arms race, and we were in the arms business.”
But even the amazing PRISM 3700 sequencers he was
referring to could not read any more than 550 bases of
genomic sequence in each ‘lane’ or column of output.
The public first draft of the project emerged in the 15
months to February 2001 – just before the private effort.
Many believe this version of the genome to be better than
the ‘commercial product’ – even allowing for the fact
that Celera was free to use information from the public
effort while the project’s researchers could not do the
same themselves. The Genome Center at the Whitehead
Institute for Genome Research made a map of the
sequence freeze made on 24 May 2000. For this most
recent draft a new and more rigorous map was needed.
96 per cent of the sequence of this
working draft has a lower than one
in 1,000 error rate.
Enter perhaps the most exciting
open source contribution to the
genome Project race: James Kent
and David Haussler, respectively
from the Biology and Computer
Science Departments of the
University of California at Santa
Cruz, developed an algorithm
called GigAssembler for the final
assembly of the overlapping frag-
ments of the code.
Students hacking code is noth-
ing new, but this was rather special
code for a rather special code and
James Kent is no ordinary student,
more of an academic late developer,
as one might say. His first career
was running a computer animation
business. At the time of his involve-
ment last summer he was Haussler’s
40-year-old PhD student. For one
month he coded the algorithm in C
to run on a cluster of 100 800MHz
Pentium boxen Haussler had persuaded
the University to buy especially for the
purpose. To quote Haussler: “Jim in four
weeks created the GigAssembler by
working night and day. He had to ice
his wrists at night because of the fury
with which he created this extraordi-
narily complex piece of code.”
Even one error in 1,000 is not good
enough, but now we are at a stage where
the humble Linux enthusiast can not
only read the code, but actually con-
tribute to its interpretation.
Bigger, slicker and free
One of the most interesting parts of
bioinformatics is the what comes after
you have your sequence. The first tools
to manipulate this DNA data in silico also
started in academia. A collection of sci-
entists in the Department of Genetics at
the University of Wisconsin-Madison in
1982 created a bunch of command line
tools which went on to be used by computer-savvy biolo-
gists everywhere to manipulate sequence data. GCG
(Genetics Computer Group) continued as a service of the
UW Biotechnology Center from 1985 to 1989. Then it
went commercial.
Since then most academic centres have continued to
use it at (what most academics thought was) a reasonable
price. But a recent price hike of the sort that would
shame Microsoft pushed the Human Genome Mapping
Project Resource Centre (the HGMP-RC) to the point
where they recommend that researchers give up paying
them for GCG licences and turn to a free alternative.
Worse, even programs written by outsiders to extend the
original package started to get mired in the complexities
of commercial licences.
Ironically it’s taken another group of academics to ‘re-
free’ the same algorithms that made GCG worth selling.
The HGMP-RC lives on the same Genome Campus out-
side Cambridge as the Sanger Centre where about a third
of the human genome was sequenced. It is part of
EMBnet, “a science-based group of collaborating nodes
throughout Europe”. They used to write extensions to
Further reading
Genome
For a fuller account of the race
to sequence the genome itself
try ‘The Sequence’ by Kevin
Davies [Weidenfeld; ISBN
0297646982].
Find out more about the
biology behind the genome in
Matt Ridley’s ‘Genome’
[Fourth Estate; ISBN
185702835X] – both an inter-
esting layman’s introduction
to the issues raised by the
bioinformatic revolution and
an overview of its enormous
scope.
Go back to the ‘real begin-
ning’ of the race and read
James Watson’s entertaining
and indiscreet memoir of his
and Francis Crick’s determina-
tion of the structure of DNA,
‘The Double Helix’ [Penguin;
ISBN 0140268774] – now
updated with an introduction
by media don Steve Jones.
Hardcore
It’s notoriously difficult to find
any books on bioinformatics
itself that cater well for all of
those coming from comput-
ing, from mathematics and
from biology backgrounds.
The few textbooks available in
the field tend to be eyewater-
ingly expensive as well. If you
are a hardcore maths/com-
puting person Michael
Waterman’s ‘Introduction to
Computational Biology’
[Chapman & Hall/CRC
Statistics and Mathematics;
ISBN 0412993910].
Alternatively, Pavel
Pevzner’s ‘Computational
Molecular Biology – An
Algorithmic Approach’ [The
MIT Press (A Bradford Book);
ISBN 0262161974] will give
you all the discrete maths you
can shake a stick at, but per-
functory introductions to the
biology. If you’re coming to the
subject as a Linux user with a
biological background, looking
to exploit the many tools
available, you might want to
try Terry Attwood and David
Parry-Smith’s ‘Introduction to
Bioinformatics’ [Longman
Higher Education; ISBN
0582327881], or Des Higgins
and Willie Taylor’s
‘Bioinformatics: Sequence
Structure and Databanks’
[Oxford University Press;
ISBN 0199637903].
Cancer
For a gentle introduction to
cancer biology and its links
with genetics try Robert
Weinberg’s ‘One Renegade
Cell’ [Phoenix Press; ISBN:
0753807866].
A stylised image repre-
senting the structure of
a gene product, a pro-
tein, found in the brain,
showing normal (A) and
mutant (B) forms of the
proteins. Although the
sequence of both gene
products is known, no
one has yet determined
the exact structure of
the protein in the state
shown in the images.
Instead the researchers
involved (Liang Zhong
Xu and colleagues at
Howard Hughes Medical
Institute and The
Rockefeller University,
New York, NY) have
published a representa-
tion of a model, rather
than an image of a real
object. The model was
generated using a Linux
cluster running the pro-
gram MODELLER.
MODELLER was written
by Andrej Sali at the
Rockefeller and the pro-
gram was distributed
across a cluster admin-
istered with the Clustor
system (currently part
of TurboLinux’s
enFuzion 6.0).
99LinuxUser/May-June 2000
REAL LIFE LINUX
GCG when it was cheap now they’re writing the
free (as in beer and speech) alternative. Being
hardcore nerds, these guys have eschewed the
messy-but-fast approach of the average open
source project for a very disciplined, object-ori-
ented form of C. Not only does it work, but it
looks pretty too.
The applications themselves are a boon to
practising biologists less interested in bioinfor-
matics for its own sake. Many of them simply
replace the functionality of their GCG predeces-
sors and, unlike GCG and, it must be said, a lot
of previous biological software packages, they
have been released under the GPL.
Even better, EMBnet’s hackers have, in pass-
ing, had to build a set of amazingly powerful
biological libraries which are available under
the LGPL. Slowly and surely users are migrating
to the new system and I have given EMBOSS my
personal ‘growing frog’ award: it gets bigger and
slicker with every change.
Post-genomic hacking
One of the biggest surprises of the Final Draft is
that humans appear, according to the latest esti-
mates, to have about half as many genes again as
a fly. If we want to refine this count we need to
figure out better ways of recognising genes
whose instructions are followed by the body to
make bits of ourselves. Biologists talk about such
parts of the code being ‘expressed’. How can we
find the ‘good bits’ that are in the genetic code?
Ensembl is another European molecular biology
project that is completely open source and uses
various statistical techniques to spot the charac-
teristic signatures of actual genes in the vast
amounts of noise in the DNA signal.
DNA has been sexy for half a decade, but it’s
proteins that do the work. If the genes are the
software then the proteins are the hardware.
Whenever a gene is expressed, it is ‘made flesh’
as a protein. Last year IBM promised a tidy sum
to the development of a machine which can
take the one-dimensional DNA code for a gene
and predict the elaborate three-dimensional
shape of the protein machine it specifies.
Blue Gene, the resulting new supercomputer,
will follow IBM’s attempt to beat man at chess
by tackling this, one of the classic questions in
biology, the so-called ‘protein folding problem’.
In the first week of April 2001 Hitachi and
Oracle announced that they were about to spend
US$500 million on the human proteome - a
detailed description all the products of all the
genes in the human genome.
The proteome is a much bigger and much
fuzzier target to throw hard cash at. The goal
with sequencing the genome was pretty well
defined by comparison. People in research were
similarly sceptical about private efforts to tackle
the genome and the cloning of large mammals,
however. The new race is on.
Damian Counsell does bioinformatics in the Protein Folding and
Assembly Group at the Institute of Cancer Research, London. He has
been using Linux since 1995.
Key links
Genome
Zoom in and out on our genetic heritage
with the Genome Browser
http://genome.ucsc.edu/goldenPath/
hgTracks.html
Visit the focus of the UK’s human
genome effort at the Sanger Centre
www.sanger.ac.uk/Info/Intro
Bioinformatics
Try your hand at some real-life bioinfor-
matics questions; visit the Open
Bioinformatics Project at the Institute of
Cancer Research...
www.icr.ac.uk/cmb/bioinformatics/Open
...or Damian Counsell’s own website at
the Institute
www.icr.ac.uk/cmb/bioinformatics
For answers to more general questions
visit The Open Lab’s Bioinformatics FAQ
http://bioinformatics.org/FAQ
Selected bioinformatics Linux clusters
The BLAST farm at the Sanger Centre
www.sanger.ac.uk/Info/IT/sld012.htm
The cluster at the Jena Genome
Sequencing Center
http://gen100.imb-jena.de/cluster
The YAC at the Mount Sinai Hospital at
the Samuel Lunenfeld Research Institute
http://bioinfo.mshri.on.ca/yac
The Collective at the University of Idaho
www.cs.uidaho.edu/~beowulf

More Related Content

What's hot

Stalking the Fourth Domain in Metagenomic Data: Searching for, Discovering, a...
Stalking the Fourth Domain in Metagenomic Data: Searching for, Discovering, a...Stalking the Fourth Domain in Metagenomic Data: Searching for, Discovering, a...
Stalking the Fourth Domain in Metagenomic Data: Searching for, Discovering, a...Jonathan Eisen
 
Sequencing your poo with a usb stick - Linux.conf.au 2016 miniconf - mon 1 ...
Sequencing your poo with a usb stick -  Linux.conf.au 2016 miniconf  - mon 1 ...Sequencing your poo with a usb stick -  Linux.conf.au 2016 miniconf  - mon 1 ...
Sequencing your poo with a usb stick - Linux.conf.au 2016 miniconf - mon 1 ...Torsten Seemann
 
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of GenomesApollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of GenomesMonica Munoz-Torres
 
Beiko taconic-nov3
Beiko taconic-nov3Beiko taconic-nov3
Beiko taconic-nov3beiko
 
Biotechnology the next growth area for grassroots entrepreneurship? The Indep...
Biotechnology the next growth area for grassroots entrepreneurship? The Indep...Biotechnology the next growth area for grassroots entrepreneurship? The Indep...
Biotechnology the next growth area for grassroots entrepreneurship? The Indep...Shiwen Yap
 
Munoz torres web-apollo-workshop_exeter-2014_ss
Munoz torres web-apollo-workshop_exeter-2014_ssMunoz torres web-apollo-workshop_exeter-2014_ss
Munoz torres web-apollo-workshop_exeter-2014_ssMonica Munoz-Torres
 
ContentMining in Neuroscience
ContentMining in NeuroscienceContentMining in Neuroscience
ContentMining in NeuroscienceTheContentMine
 
Dario Lijtmaer - Brief introduction to barcoding and the current goals and ca...
Dario Lijtmaer - Brief introduction to barcoding and the current goals and ca...Dario Lijtmaer - Brief introduction to barcoding and the current goals and ca...
Dario Lijtmaer - Brief introduction to barcoding and the current goals and ca...Consortium for the Barcode of Life (CBOL)
 
Forensic Identification using skin bacterial communities
Forensic Identification using skin bacterial communitiesForensic Identification using skin bacterial communities
Forensic Identification using skin bacterial communitiesChristine Kelly
 
Dna Barcoding and Undergraduate Science
Dna Barcoding and Undergraduate ScienceDna Barcoding and Undergraduate Science
Dna Barcoding and Undergraduate ScienceJoe Cross
 

What's hot (14)

Stalking the Fourth Domain in Metagenomic Data: Searching for, Discovering, a...
Stalking the Fourth Domain in Metagenomic Data: Searching for, Discovering, a...Stalking the Fourth Domain in Metagenomic Data: Searching for, Discovering, a...
Stalking the Fourth Domain in Metagenomic Data: Searching for, Discovering, a...
 
My name is luca
My name is lucaMy name is luca
My name is luca
 
Sequencing your poo with a usb stick - Linux.conf.au 2016 miniconf - mon 1 ...
Sequencing your poo with a usb stick -  Linux.conf.au 2016 miniconf  - mon 1 ...Sequencing your poo with a usb stick -  Linux.conf.au 2016 miniconf  - mon 1 ...
Sequencing your poo with a usb stick - Linux.conf.au 2016 miniconf - mon 1 ...
 
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of GenomesApollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
Apollo and i5K: Collaborative Curation and Interactive Analysis of Genomes
 
Beiko taconic-nov3
Beiko taconic-nov3Beiko taconic-nov3
Beiko taconic-nov3
 
Biotechnology the next growth area for grassroots entrepreneurship? The Indep...
Biotechnology the next growth area for grassroots entrepreneurship? The Indep...Biotechnology the next growth area for grassroots entrepreneurship? The Indep...
Biotechnology the next growth area for grassroots entrepreneurship? The Indep...
 
Munoz torres web-apollo-workshop_exeter-2014_ss
Munoz torres web-apollo-workshop_exeter-2014_ssMunoz torres web-apollo-workshop_exeter-2014_ss
Munoz torres web-apollo-workshop_exeter-2014_ss
 
ContentMining in Neuroscience
ContentMining in NeuroscienceContentMining in Neuroscience
ContentMining in Neuroscience
 
Dario Lijtmaer - Brief introduction to barcoding and the current goals and ca...
Dario Lijtmaer - Brief introduction to barcoding and the current goals and ca...Dario Lijtmaer - Brief introduction to barcoding and the current goals and ca...
Dario Lijtmaer - Brief introduction to barcoding and the current goals and ca...
 
Forensic Identification using skin bacterial communities
Forensic Identification using skin bacterial communitiesForensic Identification using skin bacterial communities
Forensic Identification using skin bacterial communities
 
SARS2 CoVID-19
SARS2 CoVID-19SARS2 CoVID-19
SARS2 CoVID-19
 
Dna Barcoding and Undergraduate Science
Dna Barcoding and Undergraduate ScienceDna Barcoding and Undergraduate Science
Dna Barcoding and Undergraduate Science
 
Dmpstr diver poster
Dmpstr diver posterDmpstr diver poster
Dmpstr diver poster
 
D02-NextGenSeq-MOLGENIS
D02-NextGenSeq-MOLGENISD02-NextGenSeq-MOLGENIS
D02-NextGenSeq-MOLGENIS
 

Viewers also liked

amrapali builders @@ hardware hacking and robotics using the raspberry pi.pdf
amrapali builders @@ hardware hacking and robotics using the raspberry pi.pdfamrapali builders @@ hardware hacking and robotics using the raspberry pi.pdf
amrapali builders @@ hardware hacking and robotics using the raspberry pi.pdfamrapalibuildersreviews
 
amrapali builders @@ hacking challenges.pdf
amrapali builders @@ hacking challenges.pdfamrapali builders @@ hacking challenges.pdf
amrapali builders @@ hacking challenges.pdfamrapalibuildersreviews
 
Amrapali builders -- google cheatsheet.pdf
Amrapali builders -- google cheatsheet.pdfAmrapali builders -- google cheatsheet.pdf
Amrapali builders -- google cheatsheet.pdfamrapalibuildersreviews
 
amrapali builders -- maroochy water-services-case-study briefing.pdf
amrapali builders -- maroochy water-services-case-study briefing.pdfamrapali builders -- maroochy water-services-case-study briefing.pdf
amrapali builders -- maroochy water-services-case-study briefing.pdfamrapalibuildersreviews
 
amrapali builders @@ hacking with basic command.pdf
amrapali builders @@ hacking with basic command.pdfamrapali builders @@ hacking with basic command.pdf
amrapali builders @@ hacking with basic command.pdfamrapalibuildersreviews
 
amrapali builders@@@hacking d link routers with hnap.pdf
amrapali builders@@@hacking d link routers with hnap.pdfamrapali builders@@@hacking d link routers with hnap.pdf
amrapali builders@@@hacking d link routers with hnap.pdfamrapalibuildersreviews
 
amrapali builders @@hacking printers.pdf
amrapali builders @@hacking printers.pdfamrapali builders @@hacking printers.pdf
amrapali builders @@hacking printers.pdfamrapalibuildersreviews
 
PyQt: rapid application development
PyQt: rapid application developmentPyQt: rapid application development
PyQt: rapid application developmentDeveler S.r.l.
 
GUI toolkits comparison for python
GUI toolkits comparison for pythonGUI toolkits comparison for python
GUI toolkits comparison for pythonDarren Su
 

Viewers also liked (13)

amrapali builders @@ hardware hacking and robotics using the raspberry pi.pdf
amrapali builders @@ hardware hacking and robotics using the raspberry pi.pdfamrapali builders @@ hardware hacking and robotics using the raspberry pi.pdf
amrapali builders @@ hardware hacking and robotics using the raspberry pi.pdf
 
amrapali builders @@ hacking challenges.pdf
amrapali builders @@ hacking challenges.pdfamrapali builders @@ hacking challenges.pdf
amrapali builders @@ hacking challenges.pdf
 
Amrapali builders -- google cheatsheet.pdf
Amrapali builders -- google cheatsheet.pdfAmrapali builders -- google cheatsheet.pdf
Amrapali builders -- google cheatsheet.pdf
 
amrapali builders -- maroochy water-services-case-study briefing.pdf
amrapali builders -- maroochy water-services-case-study briefing.pdfamrapali builders -- maroochy water-services-case-study briefing.pdf
amrapali builders -- maroochy water-services-case-study briefing.pdf
 
amrapali builders @@ hacking with basic command.pdf
amrapali builders @@ hacking with basic command.pdfamrapali builders @@ hacking with basic command.pdf
amrapali builders @@ hacking with basic command.pdf
 
amrapali builders@@@hacking d link routers with hnap.pdf
amrapali builders@@@hacking d link routers with hnap.pdfamrapali builders@@@hacking d link routers with hnap.pdf
amrapali builders@@@hacking d link routers with hnap.pdf
 
amrapali builders@@hacking ciphers.pdf
amrapali builders@@hacking ciphers.pdfamrapali builders@@hacking ciphers.pdf
amrapali builders@@hacking ciphers.pdf
 
amrapali builders @@hacking printers.pdf
amrapali builders @@hacking printers.pdfamrapali builders @@hacking printers.pdf
amrapali builders @@hacking printers.pdf
 
Printer Security
Printer SecurityPrinter Security
Printer Security
 
amrapali builders @@ google hacking.pdf
amrapali builders @@ google hacking.pdfamrapali builders @@ google hacking.pdf
amrapali builders @@ google hacking.pdf
 
PyQt: rapid application development
PyQt: rapid application developmentPyQt: rapid application development
PyQt: rapid application development
 
GUI toolkits comparison for python
GUI toolkits comparison for pythonGUI toolkits comparison for python
GUI toolkits comparison for python
 
amrapali builders @@hacking cctv.pdf
amrapali builders @@hacking cctv.pdfamrapali builders @@hacking cctv.pdf
amrapali builders @@hacking cctv.pdf
 

Similar to amrapali builders -- hacking the genome.pdf

Bioinformatics group presentation
Bioinformatics group presentationBioinformatics group presentation
Bioinformatics group presentationNaeem Ahmed
 
Bioinformatics group presentation
Bioinformatics group presentationBioinformatics group presentation
Bioinformatics group presentationNaeem Ahmed
 
Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.Monica Munoz-Torres
 
Genome sequencing and the development of our current information library
Genome sequencing and the development of our current information libraryGenome sequencing and the development of our current information library
Genome sequencing and the development of our current information libraryZarlishAttique1
 
Newsletter 224
Newsletter 224Newsletter 224
Newsletter 224ESTHHUB
 
Scott Edmunds: Publishing in the Open Data Era, talk at Hackerspace.sg
Scott Edmunds: Publishing in the Open Data Era, talk at Hackerspace.sgScott Edmunds: Publishing in the Open Data Era, talk at Hackerspace.sg
Scott Edmunds: Publishing in the Open Data Era, talk at Hackerspace.sgGigaScience, BGI Hong Kong
 
Basics Of Essay Writing And Essay Prompt Vocabulary
Basics Of Essay Writing And Essay Prompt VocabularyBasics Of Essay Writing And Essay Prompt Vocabulary
Basics Of Essay Writing And Essay Prompt VocabularyTeresa Oakman
 
Emerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomicsEmerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomicsmikaelhuss
 
Top 10 upcomming innovation
Top 10 upcomming innovationTop 10 upcomming innovation
Top 10 upcomming innovationShajith Shaji
 
New e-Science Edinburgh Late Edition
New e-Science Edinburgh Late EditionNew e-Science Edinburgh Late Edition
New e-Science Edinburgh Late EditionDavid De Roure
 
It walks, It talks and it will conduct economic espionage by Greg Carpenter
It walks, It talks and it will conduct economic espionage by Greg CarpenterIt walks, It talks and it will conduct economic espionage by Greg Carpenter
It walks, It talks and it will conduct economic espionage by Greg CarpenterEC-Council
 

Similar to amrapali builders -- hacking the genome.pdf (20)

crowding out cancer
crowding out cancercrowding out cancer
crowding out cancer
 
Bioinformatics group presentation
Bioinformatics group presentationBioinformatics group presentation
Bioinformatics group presentation
 
Bioinformatics group presentation
Bioinformatics group presentationBioinformatics group presentation
Bioinformatics group presentation
 
Xenobots
XenobotsXenobots
Xenobots
 
Presentation1ba
Presentation1baPresentation1ba
Presentation1ba
 
Lesson 14.3
Lesson 14.3Lesson 14.3
Lesson 14.3
 
2015 pycon-talk
2015 pycon-talk2015 pycon-talk
2015 pycon-talk
 
Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.Web Apollo: Lessons learned from community-based biocuration efforts.
Web Apollo: Lessons learned from community-based biocuration efforts.
 
Introduction
IntroductionIntroduction
Introduction
 
Genome sequencing and the development of our current information library
Genome sequencing and the development of our current information libraryGenome sequencing and the development of our current information library
Genome sequencing and the development of our current information library
 
rheumatoid arthritis
rheumatoid arthritisrheumatoid arthritis
rheumatoid arthritis
 
Newsletter 224
Newsletter 224Newsletter 224
Newsletter 224
 
Scott Edmunds: Publishing in the Open Data Era, talk at Hackerspace.sg
Scott Edmunds: Publishing in the Open Data Era, talk at Hackerspace.sgScott Edmunds: Publishing in the Open Data Era, talk at Hackerspace.sg
Scott Edmunds: Publishing in the Open Data Era, talk at Hackerspace.sg
 
Basics Of Essay Writing And Essay Prompt Vocabulary
Basics Of Essay Writing And Essay Prompt VocabularyBasics Of Essay Writing And Essay Prompt Vocabulary
Basics Of Essay Writing And Essay Prompt Vocabulary
 
Emerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomicsEmerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomics
 
Top 10 upcomming innovation
Top 10 upcomming innovationTop 10 upcomming innovation
Top 10 upcomming innovation
 
2014 naples
2014 naples2014 naples
2014 naples
 
New e-Science Edinburgh Late Edition
New e-Science Edinburgh Late EditionNew e-Science Edinburgh Late Edition
New e-Science Edinburgh Late Edition
 
Open science 2014
Open science 2014Open science 2014
Open science 2014
 
It walks, It talks and it will conduct economic espionage by Greg Carpenter
It walks, It talks and it will conduct economic espionage by Greg CarpenterIt walks, It talks and it will conduct economic espionage by Greg Carpenter
It walks, It talks and it will conduct economic espionage by Greg Carpenter
 

More from amrapalibuildersreviews

amrapali builders@@@bluetooth hacking.pdf
amrapali builders@@@bluetooth hacking.pdfamrapali builders@@@bluetooth hacking.pdf
amrapali builders@@@bluetooth hacking.pdfamrapalibuildersreviews
 
amrapali builders@@@@hacking windows internals.pdf
amrapali builders@@@@hacking windows internals.pdfamrapali builders@@@@hacking windows internals.pdf
amrapali builders@@@@hacking windows internals.pdfamrapalibuildersreviews
 
Amrapali reviews(specification for amrapali titanium (low rise))Amrapali Buil...
Amrapali reviews(specification for amrapali titanium (low rise))Amrapali Buil...Amrapali reviews(specification for amrapali titanium (low rise))Amrapali Buil...
Amrapali reviews(specification for amrapali titanium (low rise))Amrapali Buil...amrapalibuildersreviews
 
Amrapali reviews(profile bn-basu)Amrapali Builders,Amrapali Group
Amrapali reviews(profile bn-basu)Amrapali Builders,Amrapali GroupAmrapali reviews(profile bn-basu)Amrapali Builders,Amrapali Group
Amrapali reviews(profile bn-basu)Amrapali Builders,Amrapali Groupamrapalibuildersreviews
 
Amrapali reviews(list-of-aproved-projects)Amrapali Builders,Amrapali Group
Amrapali reviews(list-of-aproved-projects)Amrapali Builders,Amrapali GroupAmrapali reviews(list-of-aproved-projects)Amrapali Builders,Amrapali Group
Amrapali reviews(list-of-aproved-projects)Amrapali Builders,Amrapali Groupamrapalibuildersreviews
 
Amrapali reviews(application form 4 t ech park g.noida)Amrapali Builders,Am...
Amrapali reviews(application form 4   t ech park g.noida)Amrapali Builders,Am...Amrapali reviews(application form 4   t ech park g.noida)Amrapali Builders,Am...
Amrapali reviews(application form 4 t ech park g.noida)Amrapali Builders,Am...amrapalibuildersreviews
 
Amrapali reviews(doc-cmd)Amrapali Builders,Amrapali Group
Amrapali reviews(doc-cmd)Amrapali Builders,Amrapali GroupAmrapali reviews(doc-cmd)Amrapali Builders,Amrapali Group
Amrapali reviews(doc-cmd)Amrapali Builders,Amrapali Groupamrapalibuildersreviews
 
Amrapali reviews(terrace homes)Amrapali Builders,Amrapali Group
Amrapali reviews(terrace homes)Amrapali Builders,Amrapali GroupAmrapali reviews(terrace homes)Amrapali Builders,Amrapali Group
Amrapali reviews(terrace homes)Amrapali Builders,Amrapali Groupamrapalibuildersreviews
 
Amrapali builders(terrace homes)Amrapali Reviews,Amrapali Group
Amrapali builders(terrace homes)Amrapali Reviews,Amrapali GroupAmrapali builders(terrace homes)Amrapali Reviews,Amrapali Group
Amrapali builders(terrace homes)Amrapali Reviews,Amrapali Groupamrapalibuildersreviews
 
Amrapali builders(specification for amrapali titanium (low rise))Amrapali Rev...
Amrapali builders(specification for amrapali titanium (low rise))Amrapali Rev...Amrapali builders(specification for amrapali titanium (low rise))Amrapali Rev...
Amrapali builders(specification for amrapali titanium (low rise))Amrapali Rev...amrapalibuildersreviews
 
Amrapali builders(profile bn-basu)Amrapali Reviews,Amrapali Group
Amrapali builders(profile bn-basu)Amrapali Reviews,Amrapali GroupAmrapali builders(profile bn-basu)Amrapali Reviews,Amrapali Group
Amrapali builders(profile bn-basu)Amrapali Reviews,Amrapali Groupamrapalibuildersreviews
 
Amrapali builders(list-of-aproved-projects)Amrapali Reviews,Amrapali Group
Amrapali builders(list-of-aproved-projects)Amrapali Reviews,Amrapali GroupAmrapali builders(list-of-aproved-projects)Amrapali Reviews,Amrapali Group
Amrapali builders(list-of-aproved-projects)Amrapali Reviews,Amrapali Groupamrapalibuildersreviews
 
Amrapali builders(application form 4 t ech park g.noida)Amrapali Reviews,Am...
Amrapali builders(application form 4   t ech park g.noida)Amrapali Reviews,Am...Amrapali builders(application form 4   t ech park g.noida)Amrapali Reviews,Am...
Amrapali builders(application form 4 t ech park g.noida)Amrapali Reviews,Am...amrapalibuildersreviews
 
Amrapali builders(amrapali dream valley highrise brochure)Amrapali Reviews,Am...
Amrapali builders(amrapali dream valley highrise brochure)Amrapali Reviews,Am...Amrapali builders(amrapali dream valley highrise brochure)Amrapali Reviews,Am...
Amrapali builders(amrapali dream valley highrise brochure)Amrapali Reviews,Am...amrapalibuildersreviews
 
Amrapali builders(doc-cmd)Amrapali Reviews,Amrapali Group
Amrapali builders(doc-cmd)Amrapali Reviews,Amrapali GroupAmrapali builders(doc-cmd)Amrapali Reviews,Amrapali Group
Amrapali builders(doc-cmd)Amrapali Reviews,Amrapali Groupamrapalibuildersreviews
 

More from amrapalibuildersreviews (16)

amrapali builders@@@bluetooth hacking.pdf
amrapali builders@@@bluetooth hacking.pdfamrapali builders@@@bluetooth hacking.pdf
amrapali builders@@@bluetooth hacking.pdf
 
amrapali builders@@sub way hacking.pdf
amrapali builders@@sub way hacking.pdfamrapali builders@@sub way hacking.pdf
amrapali builders@@sub way hacking.pdf
 
amrapali builders@@@@hacking windows internals.pdf
amrapali builders@@@@hacking windows internals.pdfamrapali builders@@@@hacking windows internals.pdf
amrapali builders@@@@hacking windows internals.pdf
 
Amrapali reviews(specification for amrapali titanium (low rise))Amrapali Buil...
Amrapali reviews(specification for amrapali titanium (low rise))Amrapali Buil...Amrapali reviews(specification for amrapali titanium (low rise))Amrapali Buil...
Amrapali reviews(specification for amrapali titanium (low rise))Amrapali Buil...
 
Amrapali reviews(profile bn-basu)Amrapali Builders,Amrapali Group
Amrapali reviews(profile bn-basu)Amrapali Builders,Amrapali GroupAmrapali reviews(profile bn-basu)Amrapali Builders,Amrapali Group
Amrapali reviews(profile bn-basu)Amrapali Builders,Amrapali Group
 
Amrapali reviews(list-of-aproved-projects)Amrapali Builders,Amrapali Group
Amrapali reviews(list-of-aproved-projects)Amrapali Builders,Amrapali GroupAmrapali reviews(list-of-aproved-projects)Amrapali Builders,Amrapali Group
Amrapali reviews(list-of-aproved-projects)Amrapali Builders,Amrapali Group
 
Amrapali reviews(application form 4 t ech park g.noida)Amrapali Builders,Am...
Amrapali reviews(application form 4   t ech park g.noida)Amrapali Builders,Am...Amrapali reviews(application form 4   t ech park g.noida)Amrapali Builders,Am...
Amrapali reviews(application form 4 t ech park g.noida)Amrapali Builders,Am...
 
Amrapali reviews(doc-cmd)Amrapali Builders,Amrapali Group
Amrapali reviews(doc-cmd)Amrapali Builders,Amrapali GroupAmrapali reviews(doc-cmd)Amrapali Builders,Amrapali Group
Amrapali reviews(doc-cmd)Amrapali Builders,Amrapali Group
 
Amrapali reviews(terrace homes)Amrapali Builders,Amrapali Group
Amrapali reviews(terrace homes)Amrapali Builders,Amrapali GroupAmrapali reviews(terrace homes)Amrapali Builders,Amrapali Group
Amrapali reviews(terrace homes)Amrapali Builders,Amrapali Group
 
Amrapali builders(terrace homes)Amrapali Reviews,Amrapali Group
Amrapali builders(terrace homes)Amrapali Reviews,Amrapali GroupAmrapali builders(terrace homes)Amrapali Reviews,Amrapali Group
Amrapali builders(terrace homes)Amrapali Reviews,Amrapali Group
 
Amrapali builders(specification for amrapali titanium (low rise))Amrapali Rev...
Amrapali builders(specification for amrapali titanium (low rise))Amrapali Rev...Amrapali builders(specification for amrapali titanium (low rise))Amrapali Rev...
Amrapali builders(specification for amrapali titanium (low rise))Amrapali Rev...
 
Amrapali builders(profile bn-basu)Amrapali Reviews,Amrapali Group
Amrapali builders(profile bn-basu)Amrapali Reviews,Amrapali GroupAmrapali builders(profile bn-basu)Amrapali Reviews,Amrapali Group
Amrapali builders(profile bn-basu)Amrapali Reviews,Amrapali Group
 
Amrapali builders(list-of-aproved-projects)Amrapali Reviews,Amrapali Group
Amrapali builders(list-of-aproved-projects)Amrapali Reviews,Amrapali GroupAmrapali builders(list-of-aproved-projects)Amrapali Reviews,Amrapali Group
Amrapali builders(list-of-aproved-projects)Amrapali Reviews,Amrapali Group
 
Amrapali builders(application form 4 t ech park g.noida)Amrapali Reviews,Am...
Amrapali builders(application form 4   t ech park g.noida)Amrapali Reviews,Am...Amrapali builders(application form 4   t ech park g.noida)Amrapali Reviews,Am...
Amrapali builders(application form 4 t ech park g.noida)Amrapali Reviews,Am...
 
Amrapali builders(amrapali dream valley highrise brochure)Amrapali Reviews,Am...
Amrapali builders(amrapali dream valley highrise brochure)Amrapali Reviews,Am...Amrapali builders(amrapali dream valley highrise brochure)Amrapali Reviews,Am...
Amrapali builders(amrapali dream valley highrise brochure)Amrapali Reviews,Am...
 
Amrapali builders(doc-cmd)Amrapali Reviews,Amrapali Group
Amrapali builders(doc-cmd)Amrapali Reviews,Amrapali GroupAmrapali builders(doc-cmd)Amrapali Reviews,Amrapali Group
Amrapali builders(doc-cmd)Amrapali Reviews,Amrapali Group
 

Recently uploaded

Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.pptamreenkhanum0307
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 

Recently uploaded (20)

Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.ppt
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 

amrapali builders -- hacking the genome.pdf

  • 1. 26LinuxUser/June 2001 REAL LIFE LINUX Hackingthegenome Don’t be surprised at the strength of the open source ethic in the human genome project and in the coming bioinformatics explosion, says Damian Counsell: science has always been about public collaboration T he planet is inhabited by elaborate, water- rich organisms of var- ious sizes, every one built to a tight specifi- cation. But even the most sophisticated of these creatures is merely a container. The inhabi- tants of this world are soft, fleshy bags. Inside these vessels are the true masters: stored programs of massive complexity. These encoded intelli- gences influence their hosts’ appear- ance, the time of their deaths and much of their behaviour. Recently some of the beings have become aware that they are enslaved. They may soon have the technology to escape from internal domination. However, from our privileged viewpoint outside this ecosystem, we can see an obstacle. Some of the slave organisms are witholding from their fellows the information they all need to escape. Restricting this information may give their rulers, the selfish replica- tors, the upper hand. We observe the alien slaves as their hopes for free- dom hang in the balance. If you haven’t guessed yet, the planet is Earth and we, Homo sapiens sapiens, are the ‘slaves’. To quote Steve Jones, Professor of Genetics at University College London - presum- ably writing of a relatively rich pop- ulation like the British – “most peo- ple die because of the genes they carry.” In case you haven’t noticed a few months’ worth of headlines, the ‘good’ draft database of those genes, the human genome, is out. This final draft is pretty small by modern storage standards: three mil- lion MB – that’s megabases (2-bit) not megabytes (8-bit) – of our own specifications. What is amazing is that this library of biological infor- mation is available for download to your own PC and can be browsed for free from the comfort of your own home using the Human Genome Browser. To cap this, in the process of bringing home the genome, researchers have given brought a new science to maturity. Bioinformatics is born Even if you are familiar with the par- ents – computing and biology – you shouldn’t be ashamed if you find the work of their offspring – bioinfor- matics – incomprehensible. The text of human DNA makes past winners of obfuscated Perl competitions look like Plain English Awardees. Some of the biggest brains in science are working on an interpretation even as I write. For that job most of them are using the same sorts of open source tools and open source operat- ing systems - including Linux - that they applied to liberating our genomes in the first place. The human genome is the soft- ware that programs the building and maintenance of your body. Your body’s ‘listing’ of the code is written in a language of just four letters, four different chemical modules. The names of these chemicals are usually shortened to their initials: A, C, G and T. When you look at an image of DNA, the double helical ‘staircase’ stuff of your genetic material, com- plementary pairs of these four letters form the ‘stairs’. DNA is supercoiled and packed with proteins into the complex structures we call chromosomes. If a genome is a library of code, each chromosome can be thought of as a shelf full of books of program print- outs. One of the most amazing dis- coveries of the genome project has been that most of the pages in the human library are nonsense. Less than five per cent of all the code can be shown to make something tangi- ble. These biological algorithms describe, for example, the haemoglo- bin that carries oxygen in your blood (and colours it red), the keratin that makes up your skin and hair and even the receptors in your brain that collect the message that you are bleeding. Why are we so interested in the few pages that actually make sense? One reason is inherited genetic dis- ease. If one letter in the copies of your ‘book’ for haemoglobin is wrong, for example, then, from early childhood on you can lurch into excruciating sickle-cell disease ‘crises’, caused by the distortion and destruction of your blood cells by the products of your own genes. Disorders like this derive from errors in the copying or repair of the books of life or even of the ‘shelves’ - people with Down’s Syndrome carry an extra copy of chromosome 21. It’s not just medical research funds and universities who are inter- ested in the contents of the library – the drug companies see massive potential too. If they can identify, say, a gene that codes for an impor- tant component of the brain they might be able to develop drugs which bind to it and alter the way it works in whole humans, perhaps to treat schizophrenia or depression. More controversially, if a company patents its knowledge of that gene, it can demand payment from anyone else who targets that gene product to treat the same illnesses. The genome is a matter of life, death and big money. In the beginning was The Tape There’s nothing new about the open source model; science has always been about public collaboration, even between rivals. Anyone who has ever been assigned a partner for teaching practicals will understand that the participants in this sharing of data are not always entirely will- ing, of course. Sharing used to be the rule in computer science too. In the begin- ning was The Tape. And The Tape was distributed to anyone who need- ed it. The users contributed their fixes back to the programmers and everyone benefitted. With the advent of the microcomputer came the earliest suggestions of a mass market. With the possibility of seri- ous money came serious change. Many people with their roots in that
  • 2. LinuxUser/June 2001 27 early telecoms, academic computing and hobbyist world felt payment for their hard work was due from their peers – including a certain William H. Gates III. Perhaps this was more about pride than cash. Perhaps the program- mers no longer felt that the users were truly their peers. Such disgruntled quasi-hackers and their employers felt the best way to ensure their just reward was to close the source of their programs and charge for the executable. This was a ‘business model’ that was to prove extremely lucrative. Some say this is a model with strong parallels in biotech- nology. Many scientists don’t like this model. Science works because you can only publish experiments that other people can do too. If there are any mystical tools needed to perform your tricks you have to be prepared to distribute them to fellow professionals. The white-coat- ed conjurors of science have to hand over their wands to anyone in the magic circle who asks. This is crucial; if an experiment is not repeatable than it might as well not have happened - as the ‘discoverers’ of cold fusion might tell you. In biology the classic openly distrib- uted tools are antibodies, exquisitely specific biological molecules which can be used to recognise and label parts of living things. Interestingly biotech com- panies still manage to make money from selling antibodies in bulk, despite their relatively free circulation among scientists. Nowadays, the open tools of science include software like Linux. Even when essential programs are ‘closed’, like the Staden programs used since the early days by single labs sequencing genes, they are distributed at cost to other aca- demics – who identify bugs and suggest features. A cunning plan For years scientists had been mapping and sequencing single genes in small groups storing, assembling and manipu- lating their data on single workstations running programs like ‘Staden’. Often these investigators were trying to find out whereabouts and on which chro- mosome the gene for a particular dis- ease was hiding away. This was a long and difficult process and it was rarely in any one group’s interest or power to map and sequence huge regions of the genome – especially as much DNA turned out to be ‘rubbish’. From the mid ’80s, a small, ambi- tious group of scientists, including James Watson (one of the discoverers of the structure of DNA) believed that hav- ing the whole map of the human genome at much higher resolution was not only possible, but would massively simplify gene hunting, increase the power of gene research and reduce the duplication of effort. Not only would scientists be able to look up their own gene of interest, but they would immediately be able to find the relationship between it and all the others. This is very important; neigh- bouring genes are far more likely to be inherited together for example. Just like Linux, the combined effort of geeks worldwide would add up to something much greater than the work of isolated geeks. The HuGeP, as it is affectionately known among researchers, was carried out at 16 centres around the world, including substantial proportions at the Sanger Centre on the ‘Genome Campus’ near Cambridge, UK, the Whitehead Institute and Washington University in the US and Keio University, Tokyo, Japan. The human genome is actually one of several different species’ genomes you can download to your hard disk today. The previous projects on various bugs and simple animals had already given us some idea of approaches to reading a genome, but on a much small- er scale. You don’t just put some cells into a machine, crank the handle and read off the code. A genome has to be assembled at various levels, like building a gigantic jigsaw. As you might separate out the sky from the sea, from the people and from the buildings, the 23 kinds of chro- mosome found in humans were par- celled out to various groups of scientists. Further, the various groups had to break their designated chromosomes into chunks small enough to be stuffed into an unsuspecting yeast or bacteria. The cells of these creatures would then be hijacked to make trillions of copies of the human genes. Putting it all together This wasn’t smallest level of the puzzle. Although the Internet was crucial to the co-ordination of the sequencing effort and the sharing of its fruits, the biggest single technological advance making the large-scale sequencing of humans possible was the development of bril- liantly engineered laser fluourescence sequencing robots. Ironically, these were sold first to private genome hunters who had been closely involved in their development. The main cus- tomer was Celera Genomics. When Craig Venter, currently Celera’s presi- dent and chief scientific officer, vowed in 1998 that he would use these robots to complete a draft of the human genome by 2001, the public project knew it had a race on. Many perceived Celera’s US$70 mil- lion challenge as a threat. Although there had already had been a controver- sy over the US National Institutes of Molecular graphic courtesy of Elaine Meng, Computer Graphics Laboratory, University of California, San Francisco. images were produced using the MidasPlus program (supported by NIH RR-01081)
  • 3. 28LinuxUser/June 2001 REAL LIFE LINUX Health and its intention to patent certain genes identi- fied by Venter’s so-called ‘EST’ method of gene finding, this was nothing compared to the growing fear of a large- scale stakeout of our genetic heritage by corporate inter- ests. Despite Watson’s initial scathing comments on robot sequencers, leaders of the public project knew they had to have the best engines to have any chance in the race. To quote the chairman of Perkin Elmer who made the robot sequencers, “The day we announced Celera we set off an arms race, and we were in the arms business.” But even the amazing PRISM 3700 sequencers he was referring to could not read any more than 550 bases of genomic sequence in each ‘lane’ or column of output. The public first draft of the project emerged in the 15 months to February 2001 – just before the private effort. Many believe this version of the genome to be better than the ‘commercial product’ – even allowing for the fact that Celera was free to use information from the public effort while the project’s researchers could not do the same themselves. The Genome Center at the Whitehead Institute for Genome Research made a map of the sequence freeze made on 24 May 2000. For this most recent draft a new and more rigorous map was needed. 96 per cent of the sequence of this working draft has a lower than one in 1,000 error rate. Enter perhaps the most exciting open source contribution to the genome Project race: James Kent and David Haussler, respectively from the Biology and Computer Science Departments of the University of California at Santa Cruz, developed an algorithm called GigAssembler for the final assembly of the overlapping frag- ments of the code. Students hacking code is noth- ing new, but this was rather special code for a rather special code and James Kent is no ordinary student, more of an academic late developer, as one might say. His first career was running a computer animation business. At the time of his involve- ment last summer he was Haussler’s 40-year-old PhD student. For one month he coded the algorithm in C to run on a cluster of 100 800MHz Pentium boxen Haussler had persuaded the University to buy especially for the purpose. To quote Haussler: “Jim in four weeks created the GigAssembler by working night and day. He had to ice his wrists at night because of the fury with which he created this extraordi- narily complex piece of code.” Even one error in 1,000 is not good enough, but now we are at a stage where the humble Linux enthusiast can not only read the code, but actually con- tribute to its interpretation. Bigger, slicker and free One of the most interesting parts of bioinformatics is the what comes after you have your sequence. The first tools to manipulate this DNA data in silico also started in academia. A collection of sci- entists in the Department of Genetics at the University of Wisconsin-Madison in 1982 created a bunch of command line tools which went on to be used by computer-savvy biolo- gists everywhere to manipulate sequence data. GCG (Genetics Computer Group) continued as a service of the UW Biotechnology Center from 1985 to 1989. Then it went commercial. Since then most academic centres have continued to use it at (what most academics thought was) a reasonable price. But a recent price hike of the sort that would shame Microsoft pushed the Human Genome Mapping Project Resource Centre (the HGMP-RC) to the point where they recommend that researchers give up paying them for GCG licences and turn to a free alternative. Worse, even programs written by outsiders to extend the original package started to get mired in the complexities of commercial licences. Ironically it’s taken another group of academics to ‘re- free’ the same algorithms that made GCG worth selling. The HGMP-RC lives on the same Genome Campus out- side Cambridge as the Sanger Centre where about a third of the human genome was sequenced. It is part of EMBnet, “a science-based group of collaborating nodes throughout Europe”. They used to write extensions to Further reading Genome For a fuller account of the race to sequence the genome itself try ‘The Sequence’ by Kevin Davies [Weidenfeld; ISBN 0297646982]. Find out more about the biology behind the genome in Matt Ridley’s ‘Genome’ [Fourth Estate; ISBN 185702835X] – both an inter- esting layman’s introduction to the issues raised by the bioinformatic revolution and an overview of its enormous scope. Go back to the ‘real begin- ning’ of the race and read James Watson’s entertaining and indiscreet memoir of his and Francis Crick’s determina- tion of the structure of DNA, ‘The Double Helix’ [Penguin; ISBN 0140268774] – now updated with an introduction by media don Steve Jones. Hardcore It’s notoriously difficult to find any books on bioinformatics itself that cater well for all of those coming from comput- ing, from mathematics and from biology backgrounds. The few textbooks available in the field tend to be eyewater- ingly expensive as well. If you are a hardcore maths/com- puting person Michael Waterman’s ‘Introduction to Computational Biology’ [Chapman & Hall/CRC Statistics and Mathematics; ISBN 0412993910]. Alternatively, Pavel Pevzner’s ‘Computational Molecular Biology – An Algorithmic Approach’ [The MIT Press (A Bradford Book); ISBN 0262161974] will give you all the discrete maths you can shake a stick at, but per- functory introductions to the biology. If you’re coming to the subject as a Linux user with a biological background, looking to exploit the many tools available, you might want to try Terry Attwood and David Parry-Smith’s ‘Introduction to Bioinformatics’ [Longman Higher Education; ISBN 0582327881], or Des Higgins and Willie Taylor’s ‘Bioinformatics: Sequence Structure and Databanks’ [Oxford University Press; ISBN 0199637903]. Cancer For a gentle introduction to cancer biology and its links with genetics try Robert Weinberg’s ‘One Renegade Cell’ [Phoenix Press; ISBN: 0753807866]. A stylised image repre- senting the structure of a gene product, a pro- tein, found in the brain, showing normal (A) and mutant (B) forms of the proteins. Although the sequence of both gene products is known, no one has yet determined the exact structure of the protein in the state shown in the images. Instead the researchers involved (Liang Zhong Xu and colleagues at Howard Hughes Medical Institute and The Rockefeller University, New York, NY) have published a representa- tion of a model, rather than an image of a real object. The model was generated using a Linux cluster running the pro- gram MODELLER. MODELLER was written by Andrej Sali at the Rockefeller and the pro- gram was distributed across a cluster admin- istered with the Clustor system (currently part of TurboLinux’s enFuzion 6.0).
  • 4. 99LinuxUser/May-June 2000 REAL LIFE LINUX GCG when it was cheap now they’re writing the free (as in beer and speech) alternative. Being hardcore nerds, these guys have eschewed the messy-but-fast approach of the average open source project for a very disciplined, object-ori- ented form of C. Not only does it work, but it looks pretty too. The applications themselves are a boon to practising biologists less interested in bioinfor- matics for its own sake. Many of them simply replace the functionality of their GCG predeces- sors and, unlike GCG and, it must be said, a lot of previous biological software packages, they have been released under the GPL. Even better, EMBnet’s hackers have, in pass- ing, had to build a set of amazingly powerful biological libraries which are available under the LGPL. Slowly and surely users are migrating to the new system and I have given EMBOSS my personal ‘growing frog’ award: it gets bigger and slicker with every change. Post-genomic hacking One of the biggest surprises of the Final Draft is that humans appear, according to the latest esti- mates, to have about half as many genes again as a fly. If we want to refine this count we need to figure out better ways of recognising genes whose instructions are followed by the body to make bits of ourselves. Biologists talk about such parts of the code being ‘expressed’. How can we find the ‘good bits’ that are in the genetic code? Ensembl is another European molecular biology project that is completely open source and uses various statistical techniques to spot the charac- teristic signatures of actual genes in the vast amounts of noise in the DNA signal. DNA has been sexy for half a decade, but it’s proteins that do the work. If the genes are the software then the proteins are the hardware. Whenever a gene is expressed, it is ‘made flesh’ as a protein. Last year IBM promised a tidy sum to the development of a machine which can take the one-dimensional DNA code for a gene and predict the elaborate three-dimensional shape of the protein machine it specifies. Blue Gene, the resulting new supercomputer, will follow IBM’s attempt to beat man at chess by tackling this, one of the classic questions in biology, the so-called ‘protein folding problem’. In the first week of April 2001 Hitachi and Oracle announced that they were about to spend US$500 million on the human proteome - a detailed description all the products of all the genes in the human genome. The proteome is a much bigger and much fuzzier target to throw hard cash at. The goal with sequencing the genome was pretty well defined by comparison. People in research were similarly sceptical about private efforts to tackle the genome and the cloning of large mammals, however. The new race is on. Damian Counsell does bioinformatics in the Protein Folding and Assembly Group at the Institute of Cancer Research, London. He has been using Linux since 1995. Key links Genome Zoom in and out on our genetic heritage with the Genome Browser http://genome.ucsc.edu/goldenPath/ hgTracks.html Visit the focus of the UK’s human genome effort at the Sanger Centre www.sanger.ac.uk/Info/Intro Bioinformatics Try your hand at some real-life bioinfor- matics questions; visit the Open Bioinformatics Project at the Institute of Cancer Research... www.icr.ac.uk/cmb/bioinformatics/Open ...or Damian Counsell’s own website at the Institute www.icr.ac.uk/cmb/bioinformatics For answers to more general questions visit The Open Lab’s Bioinformatics FAQ http://bioinformatics.org/FAQ Selected bioinformatics Linux clusters The BLAST farm at the Sanger Centre www.sanger.ac.uk/Info/IT/sld012.htm The cluster at the Jena Genome Sequencing Center http://gen100.imb-jena.de/cluster The YAC at the Mount Sinai Hospital at the Samuel Lunenfeld Research Institute http://bioinfo.mshri.on.ca/yac The Collective at the University of Idaho www.cs.uidaho.edu/~beowulf