SlideShare a Scribd company logo
1 of 31
Course name : Seminar(IT331)
Faculty Guide : Prof. Nikita Patel
Presented By : Ravi Vaniya (15IT036)
Sanat Dhobi (15IT027)
From Magnetic drive to Genomic drive
Synopsis
 Introduction
 History (Evolution of Memory Storage Devices)
 Challenges of BigData
 What is DNA? , Why DNA? (A Biological perspective)
 DNA Data storage
 How data is stored? (Algorithms , Techniques etc.)
 Current research in world (case study by Microsoft)
 Pro’s and Con’s
 Application and Future scope
Introduction
 Deoxyribonucleic acid (DNA) is a molecule that
carries the genetic (hereditary) instructions used
in the growth, development and functioning of all
known living organism and many viruses.
 Most DNA molecules consist of
two biopolymer strands coiled around each other
to form a double helix.
 The information in DNA is stored as a code made
up of four nitrogen bases: adenine (A), guanine
(G), cytosine (C), and thymine (T).
 Nucleotide = Nitrogen base + Sugar + Phosphate.
Some images
History
(Evolution of Memory Storage Devices)
Earlier devices
 In mid-1700 – Punch card
It was used for input both of programs and data.
Used as early as 1725 in the textile industry (for controlling
mechanized textile looms).
 In 1946 – Selectron tube
Capacity - 32 to 512 bytes.
4096-bit Selectron was 10 inches long and 3 inches wide.
Con’s - expensive and production problems.
Courtesy: Wikipedia
Earlier devices …
 In 1932 – Magnetic drum memory
Memory capacity - 10 kB.
 In 1951 – Magnetic tape
 In 1956 – Hard disk drive
 IBM Model 350 - It had 50 24-inch discs with a total storage
capacity of 5 million characters (just under 5 MB).
 In 1971 – First Floppy drive (Diskette).
 In 1978 – Compact disc
 In 1980 – Hard disk drive (First 1 GB drive)
After 1990s …
 DVD and Flask storage (like SD card).
 Micro drive
 Holography.
 Cloud storage.
History
 The idea about the possibility of recording,
storage and retrieval of information on DNA
molecules were originally made by Mikhail
Neiman
 He published his idea in 1964–65 in the
Radiotekhnika journal, USSR(now Russia),
and the technology during that time was
referred to as MNeimONics(Mikhail Neiman
OligoNucleotides).
Challenges of Big Data
Introduction
 What is big data ?
Big data is a term for data sets that are so large or complex that
traditional data processing application software is inadequate to deal
with them.
 Problem for existing DBMS…
 Solutions..
1. Use software/framework
2. Some new technology
Issues
1. Data Volume
2. Data Velocity
3. Data Variety
4. Data Value
5. Data Complexity
Example : Google map
Challenges
 Privacy and security
 Data access and sharing of information
 Analytical challenges
 Human resource and manpower
 Technical – Fault tolerance , Scalability , Quality of data
Solution – 1 : Framework/Software
 Hadoop
Hadoop is an open-source framework(by Apache) that allows to store and process big
data in a distributed environment across clusters of computers using simple
programming models. It is designed to scale up from single servers to thousands of
machines, each offering local computation and storage.
 Let’s see how Hadoop works?
Traditional Approach Google’s Solution
Hadoop
Why DNA ?
1. Density of information that can be stored
- one gram of single-strand DNA could store as much as an exabyte
(1018 bytes).
2. DNA storage is not re-writable
- good for archiving records
3. Preservation
- DNA can still be sequenced from dried mummies thousands of
years old , but such sequences are rarely complete.
Polymerase Chain Reaction
 PCR is a technique to make many copies of a specific DNA region in
vitro (in a test tube rather than an organism).
 PCR relies on a thermostable DNA polymerase, Taq polymerase, and
requires DNA primers designed specifically for the DNA region of
interest.
 In PCR, the reaction is repeatedly cycled through a series of
temperature changes, which allow many copies of the target region to
be produced.
 PCR has many research and practical applications. It is routinely used
in DNA cloning, medical diagnostics, and forensic analysis of DNA.
Data Storage
Data put(Key,Value) process
Data get(Key) process
Advantages
 Density of information that can be stored is very high i.e. one gram of
single-strand DNA could store as much as an Exabyte.
 DNA storage is not re-writable means it is good for archiving records.
 DNA can be preserved for long time.
 DNA can maintain its integrity without any power supply. Also, its
small size and weight make it easy to store and transport.
 DNA is less susceptible to technical failures.
Disadvantages
 High cost of DNA synthesis per data stored (around US$12,400 per
megabyte of data stored).
 Data is read back at low speed.
 DNA is not rewritable, i.e. it can’t update the information it holds
without redoing the entire information storing process.
 DNA does not allow random access either, meaning, to access a
particular part of the data stored, the entire stored information should
be decoded.
References ...
 www.google.co.in
 Official website : University of Washington
 Official website : Microsoft Inc.
 Research paper by Siddhant Shrivastava and Rohan Badlani International
Journal of Electrical Energy, Vol. 2, No. 2, June 2014
 https://en.wikipedia.org/wiki/DNA
 http://www.the-scientist.com/?articles.view/articleNo/32494/title/DNA-Data-
Storage
 https://www.khanacademy.org/science/biology/biotech-dna-technology/dna-
sequencing-pcr-electrophoresis/a/polymerase-chain-reaction-pcr
IT331 Seminar on DNA Data Storage

More Related Content

What's hot

Dna storage
Dna storageDna storage
Dna storageCareerIn
 
DNA storage by Anushka jha
DNA storage by Anushka jhaDNA storage by Anushka jha
DNA storage by Anushka jhaAnushka Jha
 
Data Storage in DNA Documentation
Data Storage in DNA Documentation Data Storage in DNA Documentation
Data Storage in DNA Documentation Aditya Nag
 
DNA Storage at AGBT 2018
DNA Storage at AGBT 2018DNA Storage at AGBT 2018
DNA Storage at AGBT 2018Yaniv Erlich
 
Dna computing
Dna computingDna computing
Dna computingcsr522
 
DNA as memory storage device
DNA as memory storage deviceDNA as memory storage device
DNA as memory storage deviceKiran Gajare
 
DNA computing seminar
DNA computing seminarDNA computing seminar
DNA computing seminarAmit Yerva
 
Power point presentation of saminer topic DNA based computing
Power point presentation of saminer topic  DNA based computingPower point presentation of saminer topic  DNA based computing
Power point presentation of saminer topic DNA based computingPaushali Sen
 
Dna computing
Dna computing Dna computing
Dna computing busyking03
 
Dna computing
Dna computingDna computing
Dna computingNaveen Ch
 
Rainbow technology-ppt
Rainbow technology-pptRainbow technology-ppt
Rainbow technology-pptRajesh Roky
 

What's hot (20)

Dna as data storage device
Dna as data storage deviceDna as data storage device
Dna as data storage device
 
Dna storage
Dna storageDna storage
Dna storage
 
Data Storage in DNA
Data Storage in DNAData Storage in DNA
Data Storage in DNA
 
DNA storage by Anushka jha
DNA storage by Anushka jhaDNA storage by Anushka jha
DNA storage by Anushka jha
 
Data Storage in DNA Documentation
Data Storage in DNA Documentation Data Storage in DNA Documentation
Data Storage in DNA Documentation
 
DNA Storage at AGBT 2018
DNA Storage at AGBT 2018DNA Storage at AGBT 2018
DNA Storage at AGBT 2018
 
Dna computing
Dna computingDna computing
Dna computing
 
DNA as memory storage device
DNA as memory storage deviceDNA as memory storage device
DNA as memory storage device
 
DNA computing
DNA computingDNA computing
DNA computing
 
DNA computing
DNA computingDNA computing
DNA computing
 
DNA computing seminar
DNA computing seminarDNA computing seminar
DNA computing seminar
 
Power point presentation of saminer topic DNA based computing
Power point presentation of saminer topic  DNA based computingPower point presentation of saminer topic  DNA based computing
Power point presentation of saminer topic DNA based computing
 
Holographic memory
Holographic memoryHolographic memory
Holographic memory
 
DNA Computing
DNA ComputingDNA Computing
DNA Computing
 
Dna computing
Dna computing Dna computing
Dna computing
 
DNA Based Computing
DNA Based ComputingDNA Based Computing
DNA Based Computing
 
Dna computing
Dna computingDna computing
Dna computing
 
Dna computing
Dna computingDna computing
Dna computing
 
Rainbow technology-ppt
Rainbow technology-pptRainbow technology-ppt
Rainbow technology-ppt
 
DNA Computing
DNA ComputingDNA Computing
DNA Computing
 

Similar to IT331 Seminar on DNA Data Storage

Datastorage in DNA
Datastorage in DNADatastorage in DNA
Datastorage in DNAAditya Nag
 
Module 1 introduction Dna data storage
Module 1 introduction Dna data storageModule 1 introduction Dna data storage
Module 1 introduction Dna data storageRavi Vaniya
 
Liquid Steganography presentation.pptx
Liquid Steganography  presentation.pptxLiquid Steganography  presentation.pptx
Liquid Steganography presentation.pptxChandniA5
 
Secure data transmission using dna encryption
Secure data transmission using dna encryptionSecure data transmission using dna encryption
Secure data transmission using dna encryptionAlexander Decker
 
Digital preservation
Digital preservationDigital preservation
Digital preservationMichael Day
 
Next generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciencesNext generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciencesGuy Coates
 
dna-digital-data-storage_compress.pdf
dna-digital-data-storage_compress.pdfdna-digital-data-storage_compress.pdf
dna-digital-data-storage_compress.pdfAzimGamer1
 
Why DNA can be used as storage material
Why DNA can be used as storage materialWhy DNA can be used as storage material
Why DNA can be used as storage materialRavi Vaniya
 
A Study on DNA based Computation and Memory Devices
A Study on DNA based Computation and Memory DevicesA Study on DNA based Computation and Memory Devices
A Study on DNA based Computation and Memory DevicesEditor IJCATR
 
DNA based computer : present & future
DNA based computer : present & futureDNA based computer : present & future
DNA based computer : present & futureKinjal Mondal
 
DNA computing.pptx
DNA computing.pptxDNA computing.pptx
DNA computing.pptxKushal150906
 

Similar to IT331 Seminar on DNA Data Storage (20)

Datastorage in DNA
Datastorage in DNADatastorage in DNA
Datastorage in DNA
 
DNA digital data storage.pptx
DNA digital data storage.pptxDNA digital data storage.pptx
DNA digital data storage.pptx
 
Dna computing
Dna computingDna computing
Dna computing
 
Module 1 introduction Dna data storage
Module 1 introduction Dna data storageModule 1 introduction Dna data storage
Module 1 introduction Dna data storage
 
Dna ppt
Dna pptDna ppt
Dna ppt
 
Liquid Steganography presentation.pptx
Liquid Steganography  presentation.pptxLiquid Steganography  presentation.pptx
Liquid Steganography presentation.pptx
 
Dna synopsis
Dna synopsisDna synopsis
Dna synopsis
 
Dna computing
Dna computingDna computing
Dna computing
 
Secure data transmission using dna encryption
Secure data transmission using dna encryptionSecure data transmission using dna encryption
Secure data transmission using dna encryption
 
Digital preservation
Digital preservationDigital preservation
Digital preservation
 
Dna tech
Dna techDna tech
Dna tech
 
Next generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciencesNext generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciences
 
dna-digital-data-storage_compress.pdf
dna-digital-data-storage_compress.pdfdna-digital-data-storage_compress.pdf
dna-digital-data-storage_compress.pdf
 
Cloud bioinformatics 2
Cloud bioinformatics 2Cloud bioinformatics 2
Cloud bioinformatics 2
 
Why DNA can be used as storage material
Why DNA can be used as storage materialWhy DNA can be used as storage material
Why DNA can be used as storage material
 
A Study on DNA based Computation and Memory Devices
A Study on DNA based Computation and Memory DevicesA Study on DNA based Computation and Memory Devices
A Study on DNA based Computation and Memory Devices
 
Dna cryptography
Dna cryptographyDna cryptography
Dna cryptography
 
DNA based computer : present & future
DNA based computer : present & futureDNA based computer : present & future
DNA based computer : present & future
 
Genetic data storage
Genetic data storageGenetic data storage
Genetic data storage
 
DNA computing.pptx
DNA computing.pptxDNA computing.pptx
DNA computing.pptx
 

Recently uploaded

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 

Recently uploaded (20)

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 

IT331 Seminar on DNA Data Storage

  • 1. Course name : Seminar(IT331) Faculty Guide : Prof. Nikita Patel Presented By : Ravi Vaniya (15IT036) Sanat Dhobi (15IT027)
  • 2. From Magnetic drive to Genomic drive
  • 3. Synopsis  Introduction  History (Evolution of Memory Storage Devices)  Challenges of BigData  What is DNA? , Why DNA? (A Biological perspective)  DNA Data storage  How data is stored? (Algorithms , Techniques etc.)  Current research in world (case study by Microsoft)  Pro’s and Con’s  Application and Future scope
  • 4. Introduction  Deoxyribonucleic acid (DNA) is a molecule that carries the genetic (hereditary) instructions used in the growth, development and functioning of all known living organism and many viruses.  Most DNA molecules consist of two biopolymer strands coiled around each other to form a double helix.  The information in DNA is stored as a code made up of four nitrogen bases: adenine (A), guanine (G), cytosine (C), and thymine (T).  Nucleotide = Nitrogen base + Sugar + Phosphate.
  • 6. History (Evolution of Memory Storage Devices)
  • 7. Earlier devices  In mid-1700 – Punch card It was used for input both of programs and data. Used as early as 1725 in the textile industry (for controlling mechanized textile looms).  In 1946 – Selectron tube Capacity - 32 to 512 bytes. 4096-bit Selectron was 10 inches long and 3 inches wide. Con’s - expensive and production problems.
  • 9. Earlier devices …  In 1932 – Magnetic drum memory Memory capacity - 10 kB.  In 1951 – Magnetic tape  In 1956 – Hard disk drive  IBM Model 350 - It had 50 24-inch discs with a total storage capacity of 5 million characters (just under 5 MB).  In 1971 – First Floppy drive (Diskette).  In 1978 – Compact disc  In 1980 – Hard disk drive (First 1 GB drive)
  • 10. After 1990s …  DVD and Flask storage (like SD card).  Micro drive  Holography.  Cloud storage.
  • 11. History  The idea about the possibility of recording, storage and retrieval of information on DNA molecules were originally made by Mikhail Neiman  He published his idea in 1964–65 in the Radiotekhnika journal, USSR(now Russia), and the technology during that time was referred to as MNeimONics(Mikhail Neiman OligoNucleotides).
  • 13. Introduction  What is big data ? Big data is a term for data sets that are so large or complex that traditional data processing application software is inadequate to deal with them.  Problem for existing DBMS…  Solutions.. 1. Use software/framework 2. Some new technology
  • 14. Issues 1. Data Volume 2. Data Velocity 3. Data Variety 4. Data Value 5. Data Complexity Example : Google map
  • 15.
  • 16. Challenges  Privacy and security  Data access and sharing of information  Analytical challenges  Human resource and manpower  Technical – Fault tolerance , Scalability , Quality of data
  • 17. Solution – 1 : Framework/Software  Hadoop Hadoop is an open-source framework(by Apache) that allows to store and process big data in a distributed environment across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.  Let’s see how Hadoop works?
  • 20. Why DNA ? 1. Density of information that can be stored - one gram of single-strand DNA could store as much as an exabyte (1018 bytes). 2. DNA storage is not re-writable - good for archiving records 3. Preservation - DNA can still be sequenced from dried mummies thousands of years old , but such sequences are rarely complete.
  • 21. Polymerase Chain Reaction  PCR is a technique to make many copies of a specific DNA region in vitro (in a test tube rather than an organism).  PCR relies on a thermostable DNA polymerase, Taq polymerase, and requires DNA primers designed specifically for the DNA region of interest.  In PCR, the reaction is repeatedly cycled through a series of temperature changes, which allow many copies of the target region to be produced.  PCR has many research and practical applications. It is routinely used in DNA cloning, medical diagnostics, and forensic analysis of DNA.
  • 22.
  • 23.
  • 24.
  • 28. Advantages  Density of information that can be stored is very high i.e. one gram of single-strand DNA could store as much as an Exabyte.  DNA storage is not re-writable means it is good for archiving records.  DNA can be preserved for long time.  DNA can maintain its integrity without any power supply. Also, its small size and weight make it easy to store and transport.  DNA is less susceptible to technical failures.
  • 29. Disadvantages  High cost of DNA synthesis per data stored (around US$12,400 per megabyte of data stored).  Data is read back at low speed.  DNA is not rewritable, i.e. it can’t update the information it holds without redoing the entire information storing process.  DNA does not allow random access either, meaning, to access a particular part of the data stored, the entire stored information should be decoded.
  • 30. References ...  www.google.co.in  Official website : University of Washington  Official website : Microsoft Inc.  Research paper by Siddhant Shrivastava and Rohan Badlani International Journal of Electrical Energy, Vol. 2, No. 2, June 2014  https://en.wikipedia.org/wiki/DNA  http://www.the-scientist.com/?articles.view/articleNo/32494/title/DNA-Data- Storage  https://www.khanacademy.org/science/biology/biotech-dna-technology/dna- sequencing-pcr-electrophoresis/a/polymerase-chain-reaction-pcr