SlideShare a Scribd company logo
1 of 40
CONTENTS
 INTRODUCTION
 ADVANTAGE OF CLOUD COMPUTING
 VIRTUAL MACHINE
 ADVANTAGES OF VIRTUAL MACHINE
 DISADVANTAGES OF VIRTUAL MACHINE
 DEDUPLICATION
 BENEFITSOFDEDUPLICATION
 EXISTING SYSTEM
 ISSUES IN VM STORAGE
 LIQUID SYSTEM ARCHITECTURE
 DEDUPLICATION IN LIQUID
CONTENTS(cont)
 OPTIMIZATION ON FINGER PRINT CALCULATION
 FILE SYSTEM LAYOUT
 COMMUNICATION AMONG COMPONENTS
 HEART BEAT PROTOCOL
 FAST CLONING FOR VM IMAGE
 FAULT TOLERANCE
 GARBAGE COLLECTIONS
 ADVANTAGES OF LIQUID
 CONCLUSION
 REFERENCES
INTRODUCTION
Cloud computing means storing and accessing
data programs over internet instead of yours
computers hard drive.
Figure 1: A Sample Cloud Computing Network[1]
ADVANTAGE CLOUD COMPUTING
Lower computer cost.
Improved performance.
Reduced software cost.
Instant software cost.
Unlimited storage capacity.
Increased data reliability.
Device independence and the “always on!,anywhere
and any place”.
Free from maintenance and the “no-need-to-know”.
VIRTUAL MACHINE
A virtual machine is a software that creates a
virtualized environment between the computer
platform and the end user in which the end user can
operate software.
Vitualization deals with extending or replacing an
existing interface so as to mimic the behavior of
another system.
Crucial component in cloud computing.
VIRTUAL MACHINE(cont)
Virtual machine – Hypothetical computer.
Execute programs like a physical machine.
Initial state of a virtual machine is stored in a file
called virtual machine image.
VIRTUAL MACHINE(cont)
Figure 2 :Virtual Machine Representation[1]
ADVANTAGES OF VIRTUAL MACHINE
Familiar interfaces
Isolation
-Each OS run seperately with its own virtual
resources.
High Availability
-If one VM server is failed then easily access data
from another one.
Scalability
-Add Or remove resources easily.
ADVANTAGES OF VIRTUAL
MACHINE(cont)
Back up with fast recovery
-Using VMDK data recovery tool.
Reduction of cost
-it save cost by running multi OS on single
machine.
-sharing of hardware.
DISADVANTAGES OF VIRTUAL
MACHINE
Difficulty in direct access to hardware.
Great RAM consumption since each virtual machine
will occupy a separate area of the same.
Greate use of disk space , since it takes all the files for
each operating system installed on each virtual
machine.
A virtual machine is less efficient than an actual
machine when it access the host hard drive indirectly.
DEDUPLICATION
Data Deduplication – data compression technology.
Eliminate duplicate copies of repeating data.
A redundant data block is replaced instead of storing
multiple times.
Improves storage utilization.
DEDUPLICATION(cont)
Figure 3: Deduplicated File System[2]
DEDUPLICATION(cont)
14
Traditional
File system
copycopy
Deduplication
copy
Idea. Eliminate
storage of data with
same content No new
blocks!
User’s point of
view
BENEFITS OF DEDUPLICATION
 Lower storage space requirements.
 Minimize the additional storage cost.
Performance increased.
Increase Network efficiency.
 Efficient Volume replication.
EXISTING SYSTEM
Hypervisors such as xen ,KVM etc.
Network Attached Storage(NAS)
Storage Area Network(SAN)
Direct Attached storage(DAS)
ISSUES IN VM STORAGE
High demand on VM storage remains a challenging
problem.
Existing systems have made efforts to reduce storage
consuptions.
Uses SAN cluster.
Cannot satisfy increasing demand due to cost
limitatations.
Hence we propose LIQUID.
LIQUID SYSTEM ARCHITECTURE
Three compononts – single meta server with hot back
up , multiple data server and multiple clients.
Runs on user – level service process.
VM images are split into fixed size data blocks.
Meta server – namespace , finger print , reference
count.
Meta server – mirrored to hot back up shadow meta
server.
LIQUID SYSTEM ARCHITECTURE(cont)
Data servers – charge of managing data blocks in VM
images.
Organized in a distributed hash table.
A liquid client provides a POSIX compatible file
system.
Client – critical component (provides deduplication).
Fault tolerance – mirroring the meta server.
Replicas of data blocks are stored.
LIQUID SYSTEM ARCHITECTURE(cont)
Figure 4: Liquid Architecture[3]
DEDUPLICATION IN LIQUID
Liquid choose fixed size chunking instead of variable
size chunking.
Better since all files stored in vm images will be aligned
on disk block boundaries.
Advantage – simplicity.
Block size choice.
Block size – balancing factor which is hard to choose.
Great impact on both deduplication and IO
performance.
DEDUPLICATION IN LIQUID(cont)
Smaller block size – more random seeks when
accessing a VM image.
Not tolerable.
A block size also not preferable , it will reduce
deduplication ratio.
Liquid choose different block size under different
situation.
Advised to use a multiplication of 4 kb between 256 kb
and 1 MB to achieve good balance between IO
performance and deduplication ratio.
DEDUPLICATION IN LIQUID(cont)
DEDUPLICATION IN LIQUID(cont)
OPTIMIZATION ON FINGER PRINT
CALCULATION
Rely on comparison of data block finger prints for
redundancy.
Finger print – collision resistant hash value calculated
from data block contents.
MD5(26) and SHA-1[12] are frequently used for this
purpose.
Finger print collision – very small , orders of
magnitude smaller than hardware error rates.
OPTIMIZATION ON FINGER PRINT
CALCULATION(cont)
So we could safely assume that two data blocks are
identical.
Finger print calculation – expensive.
Delays finger print calculation for recently modified
data blocks.
Runs deduplication lazily only when it s necessary.
Client side maintains a shared cache which contains
recently accessed data blocks.
OPTIMIZATION ON FINGER PRINT
CALCULATION(cont)
A portion of memory is used by the client side of
liquid as private cache.
Private cache hold – modified data blocks and delay
finger print calculation on them.
Modified data block ejected from shared cache and
added to private cache.
Modified data will be ejected if private cache becomes
full.
OPTIMIZATION ON FINGER PRINT
CALCULATION(cont)
And ejected based on LRU policy.
Only then will the modified data block’s finger print
be calculated.
Liquid uses multiple threads for finger print
calculation.
Multiple threads will process different data blocks
currently.
Provides good IO performance.
FILE SYSTEM LAYOUT
All file system meta data are stored on the meta server.
Organized in a file system tree.
Client side could cache portions of file systems meta
data for fast accesses.
When a VM is stopped , modified meta data and data
blocks.
Will be pushed back to meta server.
Data servers ensures modification on VM image is
visible to other client nodes.
FILE SYSTEM LAYOUT(cont)
Figure 5: Process Of Look-up by Fingerprint[4]
COMMUNICATION AMONG
COMPONENTS
 Heartbeatprotocol
 P2P Data Block Sharing
 On-Demand Data Block Fetching
HEART BEAT PROTOCOL
META SERVER – manages all data servers.
Exchange regular heart beat message with each data
server in a ROUND ROBIN FASHION.
Detect failed data servers when there are many data
servers.
To speed up failure detection data servers send an
error signal to meta server.
FAST CLONING FOR VM IMAGE
Copying large images may be time consuming.
Liquid provide efficient solution by means of fast
cloning.
VM images represented by meta data files having
reference to data blocks.
By copying meta data file and updating reference
count a clone VM image is achieved.
Modification on cloned images will not effect the
original image.
FAULT TOLERANCE
Data replication
Data migration
Hot backup of meta server
GARBAGE COLLECTIONS
Removes unused garbage data blocks when running
out of space.
Reference counting of all data blocks are maintained
by meta servers.
Garbage collection request is issued periodically to
data server.
Garbage collection is executed based on the data block
membership in the bloom filter.
ADVANTAGES OF LIQUID
Fast virtual machine deployment with peer to peer
data transfer.
Low storage consumption by means of deduplication.
Instant cloning for virtual machine images.
On demand fetching through a network caching with
local disks.
LIQUID files has no specific limit.
CONCLUSION
Presented LIQUID which is a deduplication file system
with good IO performance.
Achieve by caching frequently accessed data blocks in
memory cache.
Avoids additional disk operations.
Deduplication of VM images proved to be effective.
REFERENCES
[1] www.slideshare.net
[2]www.IJTRD1303.pdf
[3] www.cs.newpaltz.edu/~lik/publications/Xun-
Zhao-IEEE-TPDS-2014.pd.
[4]www.internationaljournalssrg.org/IJCSE/2015/Speci
al.../IJCSE-ICETSH-P118.pdf
LIQUID-A Scalable Deduplication File System For Virtual Machine Images
LIQUID-A Scalable Deduplication File System For Virtual Machine Images

More Related Content

What's hot

Warehouse scale computer
Warehouse scale computerWarehouse scale computer
Warehouse scale computerHassan A-j
 
Double-Take for Migrations - thinkASG University Series
Double-Take for Migrations - thinkASG University SeriesDouble-Take for Migrations - thinkASG University Series
Double-Take for Migrations - thinkASG University SeriesthinkASG
 
Xen Cloud Platform Installation Guide
Xen Cloud Platform Installation GuideXen Cloud Platform Installation Guide
Xen Cloud Platform Installation GuideSusheel Thakur
 
Hyper V In Windows Server 2008 R2.Son Vu
Hyper V In Windows Server 2008 R2.Son VuHyper V In Windows Server 2008 R2.Son Vu
Hyper V In Windows Server 2008 R2.Son Vuvncson
 
Storage Virtualization Challenges
Storage Virtualization ChallengesStorage Virtualization Challenges
Storage Virtualization ChallengesRandy Weis
 
Cluster Computers
Cluster ComputersCluster Computers
Cluster Computersshopnil786
 
Double-Take Availability - Technical Presentation
Double-Take Availability - Technical PresentationDouble-Take Availability - Technical Presentation
Double-Take Availability - Technical PresentationMücahid Akçay
 
[IC Manage] Workspace Acceleration & Network Storage Reduction
[IC Manage] Workspace Acceleration & Network Storage Reduction[IC Manage] Workspace Acceleration & Network Storage Reduction
[IC Manage] Workspace Acceleration & Network Storage ReductionPerforce
 
TSM 6.4.1 intro
TSM 6.4.1 intro TSM 6.4.1 intro
TSM 6.4.1 intro Solv AS
 
Nutanix vdi workshop presentation
Nutanix vdi workshop presentationNutanix vdi workshop presentation
Nutanix vdi workshop presentationHe Hariyadi
 
Gridstore's Software-Defined-Storage Architecture
Gridstore's Software-Defined-Storage ArchitectureGridstore's Software-Defined-Storage Architecture
Gridstore's Software-Defined-Storage ArchitectureGridstore
 
Resumption of virtual machines after adaptive deduplication of virtual machin...
Resumption of virtual machines after adaptive deduplication of virtual machin...Resumption of virtual machines after adaptive deduplication of virtual machin...
Resumption of virtual machines after adaptive deduplication of virtual machin...IJECEIAES
 
A Survey of Performance Comparison between Virtual Machines and Containers
A Survey of Performance Comparison between Virtual Machines and ContainersA Survey of Performance Comparison between Virtual Machines and Containers
A Survey of Performance Comparison between Virtual Machines and Containersprashant desai
 

What's hot (20)

Warehouse scale computer
Warehouse scale computerWarehouse scale computer
Warehouse scale computer
 
Double-Take for Migrations - thinkASG University Series
Double-Take for Migrations - thinkASG University SeriesDouble-Take for Migrations - thinkASG University Series
Double-Take for Migrations - thinkASG University Series
 
3 Hyper V
3 Hyper V3 Hyper V
3 Hyper V
 
Xen Cloud Platform Installation Guide
Xen Cloud Platform Installation GuideXen Cloud Platform Installation Guide
Xen Cloud Platform Installation Guide
 
Lecture 4 Cluster Computing
Lecture 4 Cluster ComputingLecture 4 Cluster Computing
Lecture 4 Cluster Computing
 
Hyper V In Windows Server 2008 R2.Son Vu
Hyper V In Windows Server 2008 R2.Son VuHyper V In Windows Server 2008 R2.Son Vu
Hyper V In Windows Server 2008 R2.Son Vu
 
Storage Virtualization Challenges
Storage Virtualization ChallengesStorage Virtualization Challenges
Storage Virtualization Challenges
 
High performance computing
High performance computingHigh performance computing
High performance computing
 
En
EnEn
En
 
Scaling Small App
Scaling Small AppScaling Small App
Scaling Small App
 
Cluster Computers
Cluster ComputersCluster Computers
Cluster Computers
 
Double-Take Availability - Technical Presentation
Double-Take Availability - Technical PresentationDouble-Take Availability - Technical Presentation
Double-Take Availability - Technical Presentation
 
[IC Manage] Workspace Acceleration & Network Storage Reduction
[IC Manage] Workspace Acceleration & Network Storage Reduction[IC Manage] Workspace Acceleration & Network Storage Reduction
[IC Manage] Workspace Acceleration & Network Storage Reduction
 
TSM 6.4.1 intro
TSM 6.4.1 intro TSM 6.4.1 intro
TSM 6.4.1 intro
 
Nutanix vdi workshop presentation
Nutanix vdi workshop presentationNutanix vdi workshop presentation
Nutanix vdi workshop presentation
 
Gridstore's Software-Defined-Storage Architecture
Gridstore's Software-Defined-Storage ArchitectureGridstore's Software-Defined-Storage Architecture
Gridstore's Software-Defined-Storage Architecture
 
Tombolo
TomboloTombolo
Tombolo
 
Resumption of virtual machines after adaptive deduplication of virtual machin...
Resumption of virtual machines after adaptive deduplication of virtual machin...Resumption of virtual machines after adaptive deduplication of virtual machin...
Resumption of virtual machines after adaptive deduplication of virtual machin...
 
A Survey of Performance Comparison between Virtual Machines and Containers
A Survey of Performance Comparison between Virtual Machines and ContainersA Survey of Performance Comparison between Virtual Machines and Containers
A Survey of Performance Comparison between Virtual Machines and Containers
 
Live VM Migration
Live VM MigrationLive VM Migration
Live VM Migration
 

Similar to LIQUID-A Scalable Deduplication File System For Virtual Machine Images

Windows Server 2019.pptx
Windows Server 2019.pptxWindows Server 2019.pptx
Windows Server 2019.pptxmasbulosoke
 
WINDOWS SERVER JALAN INSTALASINYA DAN IMPLEMENTASINYA
WINDOWS SERVER JALAN INSTALASINYA DAN IMPLEMENTASINYAWINDOWS SERVER JALAN INSTALASINYA DAN IMPLEMENTASINYA
WINDOWS SERVER JALAN INSTALASINYA DAN IMPLEMENTASINYAAditiaHamid1
 
DB2 for z/O S Data Sharing
DB2 for z/O S  Data  SharingDB2 for z/O S  Data  Sharing
DB2 for z/O S Data SharingSurekha Parekh
 
Introduction to Cloud computing
Introduction to Cloud computingIntroduction to Cloud computing
Introduction to Cloud computingMathews Job
 
Presentation v mware performance overview
Presentation   v mware performance overviewPresentation   v mware performance overview
Presentation v mware performance overviewsolarisyourep
 
Benchmarking a Scalable and Highly Available Architecture for Virtual Desktops
Benchmarking a Scalable and Highly Available Architecture for Virtual DesktopsBenchmarking a Scalable and Highly Available Architecture for Virtual Desktops
Benchmarking a Scalable and Highly Available Architecture for Virtual DesktopsDataCore Software
 
Cio Breakfast Roundtable 05142009 Final Virtualization
Cio Breakfast Roundtable 05142009 Final VirtualizationCio Breakfast Roundtable 05142009 Final Virtualization
Cio Breakfast Roundtable 05142009 Final Virtualizationguestc900809
 
Whitepaper nebucom intelligent application broking and provisioning in a hybr...
Whitepaper nebucom intelligent application broking and provisioning in a hybr...Whitepaper nebucom intelligent application broking and provisioning in a hybr...
Whitepaper nebucom intelligent application broking and provisioning in a hybr...Nebucom
 
Deconstructing the brian paradox
Deconstructing the brian paradoxDeconstructing the brian paradox
Deconstructing the brian paradoxChetan Venkatesh
 
Data Grids with Oracle Coherence
Data Grids with Oracle CoherenceData Grids with Oracle Coherence
Data Grids with Oracle CoherenceBen Stopford
 
School of computer & information sciences its 532 cloud c
School of computer & information sciences its 532 cloud cSchool of computer & information sciences its 532 cloud c
School of computer & information sciences its 532 cloud cssusere73ce3
 
Handling Data in Mega Scale Systems
Handling Data in Mega Scale SystemsHandling Data in Mega Scale Systems
Handling Data in Mega Scale SystemsDirecti Group
 
Virtual Storage Center
Virtual Storage CenterVirtual Storage Center
Virtual Storage CenterIBM Danmark
 
ClickOS_EE80777777777777777777777777777.pptx
ClickOS_EE80777777777777777777777777777.pptxClickOS_EE80777777777777777777777777777.pptx
ClickOS_EE80777777777777777777777777777.pptxBiHongPhc
 

Similar to LIQUID-A Scalable Deduplication File System For Virtual Machine Images (20)

Database as a Service - Tutorial @ICDE 2010
Database as a Service - Tutorial @ICDE 2010Database as a Service - Tutorial @ICDE 2010
Database as a Service - Tutorial @ICDE 2010
 
Future prediction-ds
Future prediction-dsFuture prediction-ds
Future prediction-ds
 
Windows Server 2019.pptx
Windows Server 2019.pptxWindows Server 2019.pptx
Windows Server 2019.pptx
 
WINDOWS SERVER JALAN INSTALASINYA DAN IMPLEMENTASINYA
WINDOWS SERVER JALAN INSTALASINYA DAN IMPLEMENTASINYAWINDOWS SERVER JALAN INSTALASINYA DAN IMPLEMENTASINYA
WINDOWS SERVER JALAN INSTALASINYA DAN IMPLEMENTASINYA
 
DB2 for z/O S Data Sharing
DB2 for z/O S  Data  SharingDB2 for z/O S  Data  Sharing
DB2 for z/O S Data Sharing
 
[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia
[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia
[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia
 
NoSQL
NoSQLNoSQL
NoSQL
 
Introduction to Cloud computing
Introduction to Cloud computingIntroduction to Cloud computing
Introduction to Cloud computing
 
cluster computing
cluster computingcluster computing
cluster computing
 
Presentation v mware performance overview
Presentation   v mware performance overviewPresentation   v mware performance overview
Presentation v mware performance overview
 
Azure and cloud design patterns
Azure and cloud design patternsAzure and cloud design patterns
Azure and cloud design patterns
 
Benchmarking a Scalable and Highly Available Architecture for Virtual Desktops
Benchmarking a Scalable and Highly Available Architecture for Virtual DesktopsBenchmarking a Scalable and Highly Available Architecture for Virtual Desktops
Benchmarking a Scalable and Highly Available Architecture for Virtual Desktops
 
Cio Breakfast Roundtable 05142009 Final Virtualization
Cio Breakfast Roundtable 05142009 Final VirtualizationCio Breakfast Roundtable 05142009 Final Virtualization
Cio Breakfast Roundtable 05142009 Final Virtualization
 
Whitepaper nebucom intelligent application broking and provisioning in a hybr...
Whitepaper nebucom intelligent application broking and provisioning in a hybr...Whitepaper nebucom intelligent application broking and provisioning in a hybr...
Whitepaper nebucom intelligent application broking and provisioning in a hybr...
 
Deconstructing the brian paradox
Deconstructing the brian paradoxDeconstructing the brian paradox
Deconstructing the brian paradox
 
Data Grids with Oracle Coherence
Data Grids with Oracle CoherenceData Grids with Oracle Coherence
Data Grids with Oracle Coherence
 
School of computer & information sciences its 532 cloud c
School of computer & information sciences its 532 cloud cSchool of computer & information sciences its 532 cloud c
School of computer & information sciences its 532 cloud c
 
Handling Data in Mega Scale Systems
Handling Data in Mega Scale SystemsHandling Data in Mega Scale Systems
Handling Data in Mega Scale Systems
 
Virtual Storage Center
Virtual Storage CenterVirtual Storage Center
Virtual Storage Center
 
ClickOS_EE80777777777777777777777777777.pptx
ClickOS_EE80777777777777777777777777777.pptxClickOS_EE80777777777777777777777777777.pptx
ClickOS_EE80777777777777777777777777777.pptx
 

Recently uploaded

Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.eptoze12
 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxvipinkmenon1
 

Recently uploaded (20)

Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.
 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptx
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 

LIQUID-A Scalable Deduplication File System For Virtual Machine Images

  • 1.
  • 2. CONTENTS  INTRODUCTION  ADVANTAGE OF CLOUD COMPUTING  VIRTUAL MACHINE  ADVANTAGES OF VIRTUAL MACHINE  DISADVANTAGES OF VIRTUAL MACHINE  DEDUPLICATION  BENEFITSOFDEDUPLICATION  EXISTING SYSTEM  ISSUES IN VM STORAGE  LIQUID SYSTEM ARCHITECTURE  DEDUPLICATION IN LIQUID
  • 3. CONTENTS(cont)  OPTIMIZATION ON FINGER PRINT CALCULATION  FILE SYSTEM LAYOUT  COMMUNICATION AMONG COMPONENTS  HEART BEAT PROTOCOL  FAST CLONING FOR VM IMAGE  FAULT TOLERANCE  GARBAGE COLLECTIONS  ADVANTAGES OF LIQUID  CONCLUSION  REFERENCES
  • 4. INTRODUCTION Cloud computing means storing and accessing data programs over internet instead of yours computers hard drive. Figure 1: A Sample Cloud Computing Network[1]
  • 5. ADVANTAGE CLOUD COMPUTING Lower computer cost. Improved performance. Reduced software cost. Instant software cost. Unlimited storage capacity. Increased data reliability. Device independence and the “always on!,anywhere and any place”. Free from maintenance and the “no-need-to-know”.
  • 6. VIRTUAL MACHINE A virtual machine is a software that creates a virtualized environment between the computer platform and the end user in which the end user can operate software. Vitualization deals with extending or replacing an existing interface so as to mimic the behavior of another system. Crucial component in cloud computing.
  • 7. VIRTUAL MACHINE(cont) Virtual machine – Hypothetical computer. Execute programs like a physical machine. Initial state of a virtual machine is stored in a file called virtual machine image.
  • 8. VIRTUAL MACHINE(cont) Figure 2 :Virtual Machine Representation[1]
  • 9. ADVANTAGES OF VIRTUAL MACHINE Familiar interfaces Isolation -Each OS run seperately with its own virtual resources. High Availability -If one VM server is failed then easily access data from another one. Scalability -Add Or remove resources easily.
  • 10. ADVANTAGES OF VIRTUAL MACHINE(cont) Back up with fast recovery -Using VMDK data recovery tool. Reduction of cost -it save cost by running multi OS on single machine. -sharing of hardware.
  • 11. DISADVANTAGES OF VIRTUAL MACHINE Difficulty in direct access to hardware. Great RAM consumption since each virtual machine will occupy a separate area of the same. Greate use of disk space , since it takes all the files for each operating system installed on each virtual machine. A virtual machine is less efficient than an actual machine when it access the host hard drive indirectly.
  • 12. DEDUPLICATION Data Deduplication – data compression technology. Eliminate duplicate copies of repeating data. A redundant data block is replaced instead of storing multiple times. Improves storage utilization.
  • 14. DEDUPLICATION(cont) 14 Traditional File system copycopy Deduplication copy Idea. Eliminate storage of data with same content No new blocks! User’s point of view
  • 15. BENEFITS OF DEDUPLICATION  Lower storage space requirements.  Minimize the additional storage cost. Performance increased. Increase Network efficiency.  Efficient Volume replication.
  • 16. EXISTING SYSTEM Hypervisors such as xen ,KVM etc. Network Attached Storage(NAS) Storage Area Network(SAN) Direct Attached storage(DAS)
  • 17. ISSUES IN VM STORAGE High demand on VM storage remains a challenging problem. Existing systems have made efforts to reduce storage consuptions. Uses SAN cluster. Cannot satisfy increasing demand due to cost limitatations. Hence we propose LIQUID.
  • 18. LIQUID SYSTEM ARCHITECTURE Three compononts – single meta server with hot back up , multiple data server and multiple clients. Runs on user – level service process. VM images are split into fixed size data blocks. Meta server – namespace , finger print , reference count. Meta server – mirrored to hot back up shadow meta server.
  • 19. LIQUID SYSTEM ARCHITECTURE(cont) Data servers – charge of managing data blocks in VM images. Organized in a distributed hash table. A liquid client provides a POSIX compatible file system. Client – critical component (provides deduplication). Fault tolerance – mirroring the meta server. Replicas of data blocks are stored.
  • 20. LIQUID SYSTEM ARCHITECTURE(cont) Figure 4: Liquid Architecture[3]
  • 21. DEDUPLICATION IN LIQUID Liquid choose fixed size chunking instead of variable size chunking. Better since all files stored in vm images will be aligned on disk block boundaries. Advantage – simplicity. Block size choice. Block size – balancing factor which is hard to choose. Great impact on both deduplication and IO performance.
  • 22. DEDUPLICATION IN LIQUID(cont) Smaller block size – more random seeks when accessing a VM image. Not tolerable. A block size also not preferable , it will reduce deduplication ratio. Liquid choose different block size under different situation. Advised to use a multiplication of 4 kb between 256 kb and 1 MB to achieve good balance between IO performance and deduplication ratio.
  • 25. OPTIMIZATION ON FINGER PRINT CALCULATION Rely on comparison of data block finger prints for redundancy. Finger print – collision resistant hash value calculated from data block contents. MD5(26) and SHA-1[12] are frequently used for this purpose. Finger print collision – very small , orders of magnitude smaller than hardware error rates.
  • 26. OPTIMIZATION ON FINGER PRINT CALCULATION(cont) So we could safely assume that two data blocks are identical. Finger print calculation – expensive. Delays finger print calculation for recently modified data blocks. Runs deduplication lazily only when it s necessary. Client side maintains a shared cache which contains recently accessed data blocks.
  • 27. OPTIMIZATION ON FINGER PRINT CALCULATION(cont) A portion of memory is used by the client side of liquid as private cache. Private cache hold – modified data blocks and delay finger print calculation on them. Modified data block ejected from shared cache and added to private cache. Modified data will be ejected if private cache becomes full.
  • 28. OPTIMIZATION ON FINGER PRINT CALCULATION(cont) And ejected based on LRU policy. Only then will the modified data block’s finger print be calculated. Liquid uses multiple threads for finger print calculation. Multiple threads will process different data blocks currently. Provides good IO performance.
  • 29. FILE SYSTEM LAYOUT All file system meta data are stored on the meta server. Organized in a file system tree. Client side could cache portions of file systems meta data for fast accesses. When a VM is stopped , modified meta data and data blocks. Will be pushed back to meta server. Data servers ensures modification on VM image is visible to other client nodes.
  • 30. FILE SYSTEM LAYOUT(cont) Figure 5: Process Of Look-up by Fingerprint[4]
  • 31. COMMUNICATION AMONG COMPONENTS  Heartbeatprotocol  P2P Data Block Sharing  On-Demand Data Block Fetching
  • 32. HEART BEAT PROTOCOL META SERVER – manages all data servers. Exchange regular heart beat message with each data server in a ROUND ROBIN FASHION. Detect failed data servers when there are many data servers. To speed up failure detection data servers send an error signal to meta server.
  • 33. FAST CLONING FOR VM IMAGE Copying large images may be time consuming. Liquid provide efficient solution by means of fast cloning. VM images represented by meta data files having reference to data blocks. By copying meta data file and updating reference count a clone VM image is achieved. Modification on cloned images will not effect the original image.
  • 34. FAULT TOLERANCE Data replication Data migration Hot backup of meta server
  • 35. GARBAGE COLLECTIONS Removes unused garbage data blocks when running out of space. Reference counting of all data blocks are maintained by meta servers. Garbage collection request is issued periodically to data server. Garbage collection is executed based on the data block membership in the bloom filter.
  • 36. ADVANTAGES OF LIQUID Fast virtual machine deployment with peer to peer data transfer. Low storage consumption by means of deduplication. Instant cloning for virtual machine images. On demand fetching through a network caching with local disks. LIQUID files has no specific limit.
  • 37. CONCLUSION Presented LIQUID which is a deduplication file system with good IO performance. Achieve by caching frequently accessed data blocks in memory cache. Avoids additional disk operations. Deduplication of VM images proved to be effective.