SlideShare a Scribd company logo
1 of 28
Download to read offline
Liquid- A Scalable Deduplication File System For
Virtual Machine Images
Anamika G V(12143630)
S7 CSA
College of Engineering, Cherthala
August 8, 2016
Guided By
Josna Jose
Assistant Professor
Computer Science & Engineering
CONTENTS
1 INTRODUCTION
2 VIRTUAL MACHINE
3 DEDUPLICATION
4 EXISTING SYSTEM
5 ISSUES IN VM STORAGE
6 LIQUID SYSTEM ARCHITECTURE
7 DEDUPLICATION IN LIQUID
8 OPTIMIZATIONS ON FINGER PRINT CALCULATION
9 FILE SYSTEM LAYOUT
10 COMMUNICATION AMONG COMPONENTS HEART BEAT
PROTOCOL
11 FAST CLONING FOR VM IMAGES
12 FAULT TOLERANCE
13 GARBAGE COLLECTION
14 ADVANTAGES OF LIQUID
15 CONCLUSION
INTRODUCTION
Cloud computing is the practice of using a remote servers hosted
on the internet to store manage and process data rather than a
local server or a personnel computer.
Figure : 1.A sample cloud computing network
VIRTUALIZATION and VIRTUAL
MACHINE
1 Virtualization deals with extending or replacing an existing
interface so as to mimic the behavior of another system
2 A Virtual Machine is a software that creates a virtualized
environment between the computer platform and the end user
in which the end user can operate software.
3 Crucial component in cloud computing.
4 Virtual Machine - Hypothetical Computer.
5 Executes programs like a physical machine.
6 Initial state of a virtual machine is stored in a file called
virtual Machine image.
VIRTUAL MACHINE
Figure : 2.Virtual Machine representation
DEDUPLICATION
1 Data Deduplication data compression technology.
2 Eliminates duplicate copies of repeating data.
3 A redundant data blocks are replaced so as to avoid the
problems regarding storage consumption of a large number of
VM images.
4 Improves storage utilization.
Figure : 3.Deduplicated file system
EXISTING SYSTEM
1 Hypervisors such as Xen, KVM etc.
2 Network Attached Storage (NAS)
3 Storage Area Network (SAN)
4 Direct Attached Storage (DAS)
ISSUES IN VM STORAGE
1 High demand on VM storage remains a challenging problem.
2 Existing systems have made efforts to reduce storage
consumption.
3 Uses SAN cluster.
4 Cannot satisfy increasing demand due to cost limitation.
5 Hence we propose LIQUID.
LIQUID SYSTEM ARCHITECTURE
1 Three components - Single meta server with hot back up
multiple data server and multiple clients.
2 Runs on user-level service process.
3 VM images are split into fixed size data blocks.
4 Meta server namespace , finger print , reference count.
5 Meta server mirrored to hot back up shadow meta server.
6 Data servers change of managing data blocks in VM images.
7 Organized in a distributed hash table.
8 A liquid client provides a POSIX compatible file system.
9 Client critical component (provides deduplication)
10 Fault tolerance Mirroring the meta server.
11 Replicas of data blocks are stored.
LIQUID SYSTEM ARCHITECTURE
(CONT.)
Figure : 4.Liquid Architecture
DEDUPLICATION IN LIQUID
1 Liquid chooses fixed size chunking instead of variable size
chunking.
2 Better since all files stored in VM images will be aligned on
disk block boundaries.
3 Advantage-simplicity.
4 Block size choice.
5 Block size- balancing factor which is hard to choose. Great
impact on both deduplication and io performance.
DEDUPLICATION IN LIQUID(CONT.)
1 Smaller block size-more random seeks when accessing a VM
image.
2 Not tolerable.
3 A large block size is also not preferable, it will reduce
deduplication ratio.
4 Liquid choose different block size under different situation.
5 Advised to use a multiplication of 4 kb between 256 kb and 1
MB to achieve good balance between IO performance and
deduplication ratio.
DEDUPLICATION IN LIQUID(CONT.)
Figure : 5.Pictorial representation of Deduplication
OPTIMIZATIONS ON FINGERPRINT
CALCULATION
∗ Rely on comparison of data block finger prints for redundancy.
∗ Finger print-collision resistant hash value calculated from data
block contents.
∗ MD5[26] and SHA-1[12] are frequently used for this purpose.
∗ Finger print collision - very small, orders of magnitude smaller
than hardware error rates.
∗ So we could safely assume that two data blocks are identical.
∗ Finger print calculation - expensive. ∗ Delays finger print
calculation for recently modified data blocks.
∗ Runs deduplication lazily only when it is necessary.
∗ Client side maintains a shared cache which contains recently
accessed data blocks.
OPTIMIZATIONS ON FINGERPRINT
CALCULATION (CONT.)
∗ A portion of memory is used by the client side of liquid as private
cache.
∗ Private cache hold-modified data blocks and delay finger print
calculation on them.
∗ Modified data block ejected from shared cache and added to
private cache.
∗ Modified data will be ejected if private cache becomes full.
∗ And ejected based on LRU policy.
∗ Only then will the modified data block’s finger print be
calculated.
∗ Liquid uses multiple threads for finger print calculation.
∗ Multiple threads will process different data blocks currently.
∗ Provides good IO performance.
FILE SYSTEM LAY OUT
1 All file system meta data are stored on the meta server.
2 Organized in a file system tree.
3 Client side could cache portions of file system meta data for
fast accesses.
4 When a VM is stopped ,modified meta data and data blocks.
5 Will be pushed back to meta server.
6 Data servers ensures modification on VM image is visible to
other client nodes.
FILE SYSTEM LAY OUT
Figure : 6:File System Structure
COMMUNICATION AMONG COMPONENTS:
HEART BEAT PROTOCOL
1 META SERVER-manages all data servers.
2 Exchange regular heart beat message with each data server in
a ROUND ROBIN FASHION.
3 Detect failed data servers when there are many data servers.
4 To speed up failure detection data servers send an error signal
to meta server.
FAST CLONING FOR VM IMAGES
1 Copying large images may be time consuming.
2 Liquid provide efficient solution by means of fast cloning.
3 VM images represented by meta data files having reference to
data blocks.
4 By copying meta data file and updating reference count a
clone VM image is achieved.
5 Modification on cloned images will not effect the original
image.
FAULT TOLERANCE
1 Data replication
2 Data migration
3 Hot backup of Meta server
GARBAGE COLLECTION
1 Removes unused garbage data blocks when running out of
space.
2 Reference counting of all data blocks are maintained by mete
servers.
3 Garbage collection request is issued periodically to data server.
4 Garbage collection is executed based on the data block
membership in the Bloom filter.
ADVANTAGES OF LIQUID
1 Fast Virtual Machine deployment with peer to peer data
transfer.
2 Low storage consumption by means of deduplication.
3 Instant cloning for virtual machine images.
4 On demand fetching through a network caching with local
disks.
5 LIQUID files has no specific limit.
CONCLUSION
1 Presented LIQUID which is a deduplication file system with
good IO performance.
2 Achieved by caching frequently accessed data blocks in
memory cache.
3 Avoids additional disk operations.
4 Deduplication of VM images proved to be effective.
REFERENCES
[1].Xun Zhao, Yang Zhang, Yongwei Wu, Kang Chen, Jinlei Jiang,
and Keqin Li, Senior Member, IEEE, Liquid: A Scalable
Deduplication File System for Virtual Machine ImagesIEEE
TRANSACTIONS ON PARALLEL AND DISTRIBUTED
SYSTEMS, VOL. 25, NO. 5, MAY 2014.
[2]AmazonMachineImage,Sept.2001.[Online]. Available
http://en.wikipedia.org/wiki/Amazon/Machine/Image.
[3]BittorrentProtocol,Sept.2011.[Online]. Available
http://en.wikipedia.org/wiki/BitTorrent/protocol.
[4]BloomFilter,Sept.2011.[Online]. Available
http://en.wikipedia.org/wiki/Bloom/filter
[5]Xfs:AHigh Performance Journaling Filesystem,
Sept.2011.[Online].http://oss.sgi.com/projects/xfs
[6]RabinFingerprint,Sept.2011.[Online]. Available
http://en.wikipedia.org/wiki/Rabin/fingerprint.
[7]DataDeduplication,Sept.2013.[Online].
Available:http://en.wikipedia.org/wiki/Data/deduplication.
[8]A.T. Clements, I. Ahmad, M. Vilayannur, and J. Li,
Decentralized Deduplication in San Cluster File Systems, in Proc.
Conf. USENIX Annu. Techn. Conf., 2009, p. 8, USENIX
Association.
[9]K. Jin and E.L. Miller, The Effectiveness of Deduplication on
VirtualMachine Disk Images, in Proc. SYSTOR, Israeli Exp. Syst.
Conf., New York, NY, USA, 2009, pp.1-12.
[10]A. Liguori and E. Hensbergen, Experiences with Content
Addressable Storage and Virtual Disks,,in Proc. WIOV08, San
Diego, CA, USA, 2008, p. 5.
[11]M. McLoughlin,The qcow2 Image Format, Sept. 2011.
[Online]. Available:
http://people.gnome.org/markmc/qcow-image-format.html.
[12]C. Tang, Fvd: A High-Performance Virtual Machine Image
Format for Cloud, in Proc. USENIX Conf. USENIX Annu. Tech.
Conf., 2011, p. 18.
[13]B. Zhu, K. Li, and H. Patterson,Avoiding the Disk Bottleneck
in the Data Domain Deduplication File System, in Proc. 6th
USENIX Conf. FAST, Berkeley, CA, USA, 2008, pp. 269-282.
Liquid File System For Efficient VM Storage
Liquid File System For Efficient VM Storage

More Related Content

What's hot

Microsoft India - Windows Server 2008 R2 Hyper - V Live Migration Whitepaper
Microsoft India - Windows Server 2008 R2 Hyper - V Live Migration Whitepaper Microsoft India - Windows Server 2008 R2 Hyper - V Live Migration Whitepaper
Microsoft India - Windows Server 2008 R2 Hyper - V Live Migration Whitepaper Microsoft Private Cloud
 
An introduction to the Design of Warehouse-Scale Computers
An introduction to the Design of Warehouse-Scale ComputersAn introduction to the Design of Warehouse-Scale Computers
An introduction to the Design of Warehouse-Scale ComputersAlessio Villardita
 
Diablo-MCS-with-SQL-Server-on-VSAN-WP
Diablo-MCS-with-SQL-Server-on-VSAN-WPDiablo-MCS-with-SQL-Server-on-VSAN-WP
Diablo-MCS-with-SQL-Server-on-VSAN-WPRicky Trigalo
 
Maginatics Cloud Storage Platform - MCSP 3.0 Technical Highlights
Maginatics Cloud Storage Platform - MCSP 3.0 Technical HighlightsMaginatics Cloud Storage Platform - MCSP 3.0 Technical Highlights
Maginatics Cloud Storage Platform - MCSP 3.0 Technical HighlightsMaginatics
 
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...Maginatics
 
Directory Write Leases in MagFS
Directory Write Leases in MagFSDirectory Write Leases in MagFS
Directory Write Leases in MagFSMaginatics
 
Storage as a Service with Gluster
Storage as a Service with GlusterStorage as a Service with Gluster
Storage as a Service with GlusterVijay Bellur
 
Benchmarking a Scalable and Highly Available Architecture for Virtual Desktops
Benchmarking a Scalable and Highly Available Architecture for Virtual DesktopsBenchmarking a Scalable and Highly Available Architecture for Virtual Desktops
Benchmarking a Scalable and Highly Available Architecture for Virtual DesktopsDataCore Software
 
Application layer
Application layerApplication layer
Application layerNeha Kurale
 
Red Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFSRed Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFSGlusterFS
 
ClustrixDB 7.5 Announcement
ClustrixDB 7.5 AnnouncementClustrixDB 7.5 Announcement
ClustrixDB 7.5 AnnouncementClustrix
 
#ITsubbotnik Spring 2017: Roman Dimitrenko "Building Paas with the HashiStack"
#ITsubbotnik Spring 2017: Roman Dimitrenko "Building Paas with the HashiStack"#ITsubbotnik Spring 2017: Roman Dimitrenko "Building Paas with the HashiStack"
#ITsubbotnik Spring 2017: Roman Dimitrenko "Building Paas with the HashiStack"epamspb
 
Introduction to storage technologies
Introduction to storage technologiesIntroduction to storage technologies
Introduction to storage technologiesNuno Alves
 
Voldemort on Solid State Drives
Voldemort on Solid State DrivesVoldemort on Solid State Drives
Voldemort on Solid State DrivesAmy W. Tang
 
Warehouse scale computer
Warehouse scale computerWarehouse scale computer
Warehouse scale computerHassan A-j
 
EOUG95 - Client Server Very Large Databases - Paper
EOUG95 - Client Server Very Large Databases - PaperEOUG95 - Client Server Very Large Databases - Paper
EOUG95 - Client Server Very Large Databases - PaperDavid Walker
 
IMCSummit 2015 - Day 2 Developer Track - The NVM Revolution
IMCSummit 2015 - Day 2 Developer Track - The NVM RevolutionIMCSummit 2015 - Day 2 Developer Track - The NVM Revolution
IMCSummit 2015 - Day 2 Developer Track - The NVM RevolutionIn-Memory Computing Summit
 
Containers and Databases
Containers and DatabasesContainers and Databases
Containers and DatabasesFernando Ike
 
Cost and performance comparison for OpenStack compute and storage infrastructure
Cost and performance comparison for OpenStack compute and storage infrastructureCost and performance comparison for OpenStack compute and storage infrastructure
Cost and performance comparison for OpenStack compute and storage infrastructurePrincipled Technologies
 

What's hot (20)

Microsoft India - Windows Server 2008 R2 Hyper - V Live Migration Whitepaper
Microsoft India - Windows Server 2008 R2 Hyper - V Live Migration Whitepaper Microsoft India - Windows Server 2008 R2 Hyper - V Live Migration Whitepaper
Microsoft India - Windows Server 2008 R2 Hyper - V Live Migration Whitepaper
 
An introduction to the Design of Warehouse-Scale Computers
An introduction to the Design of Warehouse-Scale ComputersAn introduction to the Design of Warehouse-Scale Computers
An introduction to the Design of Warehouse-Scale Computers
 
Diablo-MCS-with-SQL-Server-on-VSAN-WP
Diablo-MCS-with-SQL-Server-on-VSAN-WPDiablo-MCS-with-SQL-Server-on-VSAN-WP
Diablo-MCS-with-SQL-Server-on-VSAN-WP
 
Maginatics Cloud Storage Platform - MCSP 3.0 Technical Highlights
Maginatics Cloud Storage Platform - MCSP 3.0 Technical HighlightsMaginatics Cloud Storage Platform - MCSP 3.0 Technical Highlights
Maginatics Cloud Storage Platform - MCSP 3.0 Technical Highlights
 
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
 
Directory Write Leases in MagFS
Directory Write Leases in MagFSDirectory Write Leases in MagFS
Directory Write Leases in MagFS
 
Storage as a Service with Gluster
Storage as a Service with GlusterStorage as a Service with Gluster
Storage as a Service with Gluster
 
Benchmarking a Scalable and Highly Available Architecture for Virtual Desktops
Benchmarking a Scalable and Highly Available Architecture for Virtual DesktopsBenchmarking a Scalable and Highly Available Architecture for Virtual Desktops
Benchmarking a Scalable and Highly Available Architecture for Virtual Desktops
 
Application layer
Application layerApplication layer
Application layer
 
Red Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFSRed Hat Storage - Introduction to GlusterFS
Red Hat Storage - Introduction to GlusterFS
 
ClustrixDB 7.5 Announcement
ClustrixDB 7.5 AnnouncementClustrixDB 7.5 Announcement
ClustrixDB 7.5 Announcement
 
#ITsubbotnik Spring 2017: Roman Dimitrenko "Building Paas with the HashiStack"
#ITsubbotnik Spring 2017: Roman Dimitrenko "Building Paas with the HashiStack"#ITsubbotnik Spring 2017: Roman Dimitrenko "Building Paas with the HashiStack"
#ITsubbotnik Spring 2017: Roman Dimitrenko "Building Paas with the HashiStack"
 
Introduction to storage technologies
Introduction to storage technologiesIntroduction to storage technologies
Introduction to storage technologies
 
Voldemort on Solid State Drives
Voldemort on Solid State DrivesVoldemort on Solid State Drives
Voldemort on Solid State Drives
 
Warehouse scale computer
Warehouse scale computerWarehouse scale computer
Warehouse scale computer
 
EOUG95 - Client Server Very Large Databases - Paper
EOUG95 - Client Server Very Large Databases - PaperEOUG95 - Client Server Very Large Databases - Paper
EOUG95 - Client Server Very Large Databases - Paper
 
IMCSummit 2015 - Day 2 Developer Track - The NVM Revolution
IMCSummit 2015 - Day 2 Developer Track - The NVM RevolutionIMCSummit 2015 - Day 2 Developer Track - The NVM Revolution
IMCSummit 2015 - Day 2 Developer Track - The NVM Revolution
 
Containers and Databases
Containers and DatabasesContainers and Databases
Containers and Databases
 
En
EnEn
En
 
Cost and performance comparison for OpenStack compute and storage infrastructure
Cost and performance comparison for OpenStack compute and storage infrastructureCost and performance comparison for OpenStack compute and storage infrastructure
Cost and performance comparison for OpenStack compute and storage infrastructure
 

Viewers also liked

SecCloudPro: A Novel Secure Cloud Storage System for Auditing and Deduplication
SecCloudPro: A Novel Secure Cloud Storage System for Auditing and DeduplicationSecCloudPro: A Novel Secure Cloud Storage System for Auditing and Deduplication
SecCloudPro: A Novel Secure Cloud Storage System for Auditing and DeduplicationIJCERT
 
Securededuplicationschemeforcloudstorage 141128075306-conversion-gate01
Securededuplicationschemeforcloudstorage 141128075306-conversion-gate01Securededuplicationschemeforcloudstorage 141128075306-conversion-gate01
Securededuplicationschemeforcloudstorage 141128075306-conversion-gate01shobhiya kumar
 
A Hybrid Cloud Approach for Secure Authorized Deduplication
A Hybrid Cloud Approach for Secure Authorized DeduplicationA Hybrid Cloud Approach for Secure Authorized Deduplication
A Hybrid Cloud Approach for Secure Authorized DeduplicationSWAMI06
 
A hybrid cloud approach for secure authorized deduplication
A hybrid cloud approach for secure authorized deduplicationA hybrid cloud approach for secure authorized deduplication
A hybrid cloud approach for secure authorized deduplicationPapitha Velumani
 
A hybrid cloud approach for secure authorized deduplication.
A hybrid cloud approach for secure authorized deduplication.A hybrid cloud approach for secure authorized deduplication.
A hybrid cloud approach for secure authorized deduplication.prudhvikumar madithati
 
A survey of web clustering engines
A survey of web clustering enginesA survey of web clustering engines
A survey of web clustering enginesunyil96
 
Mesh network presentation
Mesh network presentationMesh network presentation
Mesh network presentationIftach Ian Amit
 
Security in distributed systems
Security in distributed systems Security in distributed systems
Security in distributed systems Haitham Ahmed
 
web clustering engines
web clustering enginesweb clustering engines
web clustering enginesArun TR
 
Secure shell ppt
Secure shell pptSecure shell ppt
Secure shell pptsravya raju
 

Viewers also liked (20)

Web clustring engine
Web clustring engineWeb clustring engine
Web clustring engine
 
SecCloudPro: A Novel Secure Cloud Storage System for Auditing and Deduplication
SecCloudPro: A Novel Secure Cloud Storage System for Auditing and DeduplicationSecCloudPro: A Novel Secure Cloud Storage System for Auditing and Deduplication
SecCloudPro: A Novel Secure Cloud Storage System for Auditing and Deduplication
 
Securededuplicationschemeforcloudstorage 141128075306-conversion-gate01
Securededuplicationschemeforcloudstorage 141128075306-conversion-gate01Securededuplicationschemeforcloudstorage 141128075306-conversion-gate01
Securededuplicationschemeforcloudstorage 141128075306-conversion-gate01
 
Deduplication
DeduplicationDeduplication
Deduplication
 
A Hybrid Cloud Approach for Secure Authorized Deduplication
A Hybrid Cloud Approach for Secure Authorized DeduplicationA Hybrid Cloud Approach for Secure Authorized Deduplication
A Hybrid Cloud Approach for Secure Authorized Deduplication
 
HydraFS
HydraFSHydraFS
HydraFS
 
A hybrid cloud approach for secure authorized deduplication
A hybrid cloud approach for secure authorized deduplicationA hybrid cloud approach for secure authorized deduplication
A hybrid cloud approach for secure authorized deduplication
 
A hybrid cloud approach for secure authorized deduplication.
A hybrid cloud approach for secure authorized deduplication.A hybrid cloud approach for secure authorized deduplication.
A hybrid cloud approach for secure authorized deduplication.
 
A survey of web clustering engines
A survey of web clustering enginesA survey of web clustering engines
A survey of web clustering engines
 
Project ara report 2
Project ara report 2Project ara report 2
Project ara report 2
 
Mesh network presentation
Mesh network presentationMesh network presentation
Mesh network presentation
 
Electronic Voting System
Electronic Voting SystemElectronic Voting System
Electronic Voting System
 
Freenet
FreenetFreenet
Freenet
 
Wibree
Wibree Wibree
Wibree
 
RAID seminar
RAID seminarRAID seminar
RAID seminar
 
E voting
E voting E voting
E voting
 
RAID CONCEPT
RAID CONCEPTRAID CONCEPT
RAID CONCEPT
 
Security in distributed systems
Security in distributed systems Security in distributed systems
Security in distributed systems
 
web clustering engines
web clustering enginesweb clustering engines
web clustering engines
 
Secure shell ppt
Secure shell pptSecure shell ppt
Secure shell ppt
 

Similar to Liquid File System For Efficient VM Storage

Midwest PHP Presentation - New MSQL Features
Midwest PHP Presentation - New MSQL FeaturesMidwest PHP Presentation - New MSQL Features
Midwest PHP Presentation - New MSQL FeaturesDave Stokes
 
Design and implementation of a reliable and cost-effective cloud computing in...
Design and implementation of a reliable and cost-effective cloud computing in...Design and implementation of a reliable and cost-effective cloud computing in...
Design and implementation of a reliable and cost-effective cloud computing in...Francesco Taurino
 
VMworld 2013: Maximize Database Performance in Your Software-Defined Data Center
VMworld 2013: Maximize Database Performance in Your Software-Defined Data CenterVMworld 2013: Maximize Database Performance in Your Software-Defined Data Center
VMworld 2013: Maximize Database Performance in Your Software-Defined Data CenterVMworld
 
DBaaS with VMware vCAC, EMC XtremIO, and Cisco UCS
DBaaS with VMware vCAC, EMC XtremIO, and Cisco UCSDBaaS with VMware vCAC, EMC XtremIO, and Cisco UCS
DBaaS with VMware vCAC, EMC XtremIO, and Cisco UCSPrincipled Technologies
 
Scaling MySQL -- Swanseacon.co.uk
Scaling MySQL -- Swanseacon.co.uk Scaling MySQL -- Swanseacon.co.uk
Scaling MySQL -- Swanseacon.co.uk Dave Stokes
 
Using Distributed In-Memory Computing for Fast Data Analysis
Using Distributed In-Memory Computing for Fast Data AnalysisUsing Distributed In-Memory Computing for Fast Data Analysis
Using Distributed In-Memory Computing for Fast Data AnalysisScaleOut Software
 
Building the Perfect SharePoint 2010 Farm - TechEd Australia 2011
Building the Perfect SharePoint 2010 Farm - TechEd Australia 2011Building the Perfect SharePoint 2010 Farm - TechEd Australia 2011
Building the Perfect SharePoint 2010 Farm - TechEd Australia 2011Michael Noel
 
RedisConf17 - Redis Enterprise on IBM Power Systems
RedisConf17 - Redis Enterprise on IBM Power SystemsRedisConf17 - Redis Enterprise on IBM Power Systems
RedisConf17 - Redis Enterprise on IBM Power SystemsRedis Labs
 
High availability solutions bakostech
High availability solutions bakostechHigh availability solutions bakostech
High availability solutions bakostechViktoria Bakos
 
Building the Perfect SharePoint 2010 Farm - SPS Brisbane 2011
Building the Perfect SharePoint 2010 Farm - SPS Brisbane 2011Building the Perfect SharePoint 2010 Farm - SPS Brisbane 2011
Building the Perfect SharePoint 2010 Farm - SPS Brisbane 2011Michael Noel
 
Sql server 2019 New Features by Yevhen Nedaskivskyi
Sql server 2019 New Features by Yevhen NedaskivskyiSql server 2019 New Features by Yevhen Nedaskivskyi
Sql server 2019 New Features by Yevhen NedaskivskyiAlex Tumanoff
 
Sql 2012 always on
Sql 2012 always onSql 2012 always on
Sql 2012 always ondilip nayak
 
MySQL Replication Performance in the Cloud
MySQL Replication Performance in the CloudMySQL Replication Performance in the Cloud
MySQL Replication Performance in the CloudVitor Oliveira
 
WindowsAzureSDK1.7
WindowsAzureSDK1.7WindowsAzureSDK1.7
WindowsAzureSDK1.7Saravanan G
 

Similar to Liquid File System For Efficient VM Storage (20)

Midwest PHP Presentation - New MSQL Features
Midwest PHP Presentation - New MSQL FeaturesMidwest PHP Presentation - New MSQL Features
Midwest PHP Presentation - New MSQL Features
 
Design and implementation of a reliable and cost-effective cloud computing in...
Design and implementation of a reliable and cost-effective cloud computing in...Design and implementation of a reliable and cost-effective cloud computing in...
Design and implementation of a reliable and cost-effective cloud computing in...
 
VMworld 2013: Maximize Database Performance in Your Software-Defined Data Center
VMworld 2013: Maximize Database Performance in Your Software-Defined Data CenterVMworld 2013: Maximize Database Performance in Your Software-Defined Data Center
VMworld 2013: Maximize Database Performance in Your Software-Defined Data Center
 
DBaaS with VMware vCAC, EMC XtremIO, and Cisco UCS
DBaaS with VMware vCAC, EMC XtremIO, and Cisco UCSDBaaS with VMware vCAC, EMC XtremIO, and Cisco UCS
DBaaS with VMware vCAC, EMC XtremIO, and Cisco UCS
 
Scaling MySQL -- Swanseacon.co.uk
Scaling MySQL -- Swanseacon.co.uk Scaling MySQL -- Swanseacon.co.uk
Scaling MySQL -- Swanseacon.co.uk
 
Using Distributed In-Memory Computing for Fast Data Analysis
Using Distributed In-Memory Computing for Fast Data AnalysisUsing Distributed In-Memory Computing for Fast Data Analysis
Using Distributed In-Memory Computing for Fast Data Analysis
 
Building the Perfect SharePoint 2010 Farm - TechEd Australia 2011
Building the Perfect SharePoint 2010 Farm - TechEd Australia 2011Building the Perfect SharePoint 2010 Farm - TechEd Australia 2011
Building the Perfect SharePoint 2010 Farm - TechEd Australia 2011
 
Future prediction-ds
Future prediction-dsFuture prediction-ds
Future prediction-ds
 
RedisConf17 - Redis Enterprise on IBM Power Systems
RedisConf17 - Redis Enterprise on IBM Power SystemsRedisConf17 - Redis Enterprise on IBM Power Systems
RedisConf17 - Redis Enterprise on IBM Power Systems
 
High availability solutions bakostech
High availability solutions bakostechHigh availability solutions bakostech
High availability solutions bakostech
 
Mysql wp memcached
Mysql wp memcachedMysql wp memcached
Mysql wp memcached
 
Building the Perfect SharePoint 2010 Farm - SPS Brisbane 2011
Building the Perfect SharePoint 2010 Farm - SPS Brisbane 2011Building the Perfect SharePoint 2010 Farm - SPS Brisbane 2011
Building the Perfect SharePoint 2010 Farm - SPS Brisbane 2011
 
Redshift
RedshiftRedshift
Redshift
 
Sql server 2019 New Features by Yevhen Nedaskivskyi
Sql server 2019 New Features by Yevhen NedaskivskyiSql server 2019 New Features by Yevhen Nedaskivskyi
Sql server 2019 New Features by Yevhen Nedaskivskyi
 
Sql 2012 always on
Sql 2012 always onSql 2012 always on
Sql 2012 always on
 
Hyper v r2 deep dive
Hyper v r2 deep diveHyper v r2 deep dive
Hyper v r2 deep dive
 
MySQL Replication Performance in the Cloud
MySQL Replication Performance in the CloudMySQL Replication Performance in the Cloud
MySQL Replication Performance in the Cloud
 
NoSQL
NoSQLNoSQL
NoSQL
 
[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia
[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia
[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia
 
WindowsAzureSDK1.7
WindowsAzureSDK1.7WindowsAzureSDK1.7
WindowsAzureSDK1.7
 

Recently uploaded

APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLDeelipZope
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 

Recently uploaded (20)

APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCL
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 

Liquid File System For Efficient VM Storage

  • 1. Liquid- A Scalable Deduplication File System For Virtual Machine Images Anamika G V(12143630) S7 CSA College of Engineering, Cherthala August 8, 2016 Guided By Josna Jose Assistant Professor Computer Science & Engineering
  • 2. CONTENTS 1 INTRODUCTION 2 VIRTUAL MACHINE 3 DEDUPLICATION 4 EXISTING SYSTEM 5 ISSUES IN VM STORAGE 6 LIQUID SYSTEM ARCHITECTURE 7 DEDUPLICATION IN LIQUID 8 OPTIMIZATIONS ON FINGER PRINT CALCULATION 9 FILE SYSTEM LAYOUT 10 COMMUNICATION AMONG COMPONENTS HEART BEAT PROTOCOL 11 FAST CLONING FOR VM IMAGES 12 FAULT TOLERANCE 13 GARBAGE COLLECTION 14 ADVANTAGES OF LIQUID 15 CONCLUSION
  • 3. INTRODUCTION Cloud computing is the practice of using a remote servers hosted on the internet to store manage and process data rather than a local server or a personnel computer. Figure : 1.A sample cloud computing network
  • 4. VIRTUALIZATION and VIRTUAL MACHINE 1 Virtualization deals with extending or replacing an existing interface so as to mimic the behavior of another system 2 A Virtual Machine is a software that creates a virtualized environment between the computer platform and the end user in which the end user can operate software. 3 Crucial component in cloud computing. 4 Virtual Machine - Hypothetical Computer. 5 Executes programs like a physical machine. 6 Initial state of a virtual machine is stored in a file called virtual Machine image.
  • 5. VIRTUAL MACHINE Figure : 2.Virtual Machine representation
  • 6. DEDUPLICATION 1 Data Deduplication data compression technology. 2 Eliminates duplicate copies of repeating data. 3 A redundant data blocks are replaced so as to avoid the problems regarding storage consumption of a large number of VM images. 4 Improves storage utilization. Figure : 3.Deduplicated file system
  • 7. EXISTING SYSTEM 1 Hypervisors such as Xen, KVM etc. 2 Network Attached Storage (NAS) 3 Storage Area Network (SAN) 4 Direct Attached Storage (DAS)
  • 8. ISSUES IN VM STORAGE 1 High demand on VM storage remains a challenging problem. 2 Existing systems have made efforts to reduce storage consumption. 3 Uses SAN cluster. 4 Cannot satisfy increasing demand due to cost limitation. 5 Hence we propose LIQUID.
  • 9. LIQUID SYSTEM ARCHITECTURE 1 Three components - Single meta server with hot back up multiple data server and multiple clients. 2 Runs on user-level service process. 3 VM images are split into fixed size data blocks. 4 Meta server namespace , finger print , reference count. 5 Meta server mirrored to hot back up shadow meta server. 6 Data servers change of managing data blocks in VM images. 7 Organized in a distributed hash table. 8 A liquid client provides a POSIX compatible file system. 9 Client critical component (provides deduplication) 10 Fault tolerance Mirroring the meta server. 11 Replicas of data blocks are stored.
  • 10. LIQUID SYSTEM ARCHITECTURE (CONT.) Figure : 4.Liquid Architecture
  • 11. DEDUPLICATION IN LIQUID 1 Liquid chooses fixed size chunking instead of variable size chunking. 2 Better since all files stored in VM images will be aligned on disk block boundaries. 3 Advantage-simplicity. 4 Block size choice. 5 Block size- balancing factor which is hard to choose. Great impact on both deduplication and io performance.
  • 12. DEDUPLICATION IN LIQUID(CONT.) 1 Smaller block size-more random seeks when accessing a VM image. 2 Not tolerable. 3 A large block size is also not preferable, it will reduce deduplication ratio. 4 Liquid choose different block size under different situation. 5 Advised to use a multiplication of 4 kb between 256 kb and 1 MB to achieve good balance between IO performance and deduplication ratio.
  • 13. DEDUPLICATION IN LIQUID(CONT.) Figure : 5.Pictorial representation of Deduplication
  • 14. OPTIMIZATIONS ON FINGERPRINT CALCULATION ∗ Rely on comparison of data block finger prints for redundancy. ∗ Finger print-collision resistant hash value calculated from data block contents. ∗ MD5[26] and SHA-1[12] are frequently used for this purpose. ∗ Finger print collision - very small, orders of magnitude smaller than hardware error rates. ∗ So we could safely assume that two data blocks are identical. ∗ Finger print calculation - expensive. ∗ Delays finger print calculation for recently modified data blocks. ∗ Runs deduplication lazily only when it is necessary. ∗ Client side maintains a shared cache which contains recently accessed data blocks.
  • 15. OPTIMIZATIONS ON FINGERPRINT CALCULATION (CONT.) ∗ A portion of memory is used by the client side of liquid as private cache. ∗ Private cache hold-modified data blocks and delay finger print calculation on them. ∗ Modified data block ejected from shared cache and added to private cache. ∗ Modified data will be ejected if private cache becomes full. ∗ And ejected based on LRU policy. ∗ Only then will the modified data block’s finger print be calculated. ∗ Liquid uses multiple threads for finger print calculation. ∗ Multiple threads will process different data blocks currently. ∗ Provides good IO performance.
  • 16. FILE SYSTEM LAY OUT 1 All file system meta data are stored on the meta server. 2 Organized in a file system tree. 3 Client side could cache portions of file system meta data for fast accesses. 4 When a VM is stopped ,modified meta data and data blocks. 5 Will be pushed back to meta server. 6 Data servers ensures modification on VM image is visible to other client nodes.
  • 17. FILE SYSTEM LAY OUT Figure : 6:File System Structure
  • 18. COMMUNICATION AMONG COMPONENTS: HEART BEAT PROTOCOL 1 META SERVER-manages all data servers. 2 Exchange regular heart beat message with each data server in a ROUND ROBIN FASHION. 3 Detect failed data servers when there are many data servers. 4 To speed up failure detection data servers send an error signal to meta server.
  • 19. FAST CLONING FOR VM IMAGES 1 Copying large images may be time consuming. 2 Liquid provide efficient solution by means of fast cloning. 3 VM images represented by meta data files having reference to data blocks. 4 By copying meta data file and updating reference count a clone VM image is achieved. 5 Modification on cloned images will not effect the original image.
  • 20. FAULT TOLERANCE 1 Data replication 2 Data migration 3 Hot backup of Meta server
  • 21. GARBAGE COLLECTION 1 Removes unused garbage data blocks when running out of space. 2 Reference counting of all data blocks are maintained by mete servers. 3 Garbage collection request is issued periodically to data server. 4 Garbage collection is executed based on the data block membership in the Bloom filter.
  • 22. ADVANTAGES OF LIQUID 1 Fast Virtual Machine deployment with peer to peer data transfer. 2 Low storage consumption by means of deduplication. 3 Instant cloning for virtual machine images. 4 On demand fetching through a network caching with local disks. 5 LIQUID files has no specific limit.
  • 23. CONCLUSION 1 Presented LIQUID which is a deduplication file system with good IO performance. 2 Achieved by caching frequently accessed data blocks in memory cache. 3 Avoids additional disk operations. 4 Deduplication of VM images proved to be effective.
  • 24. REFERENCES [1].Xun Zhao, Yang Zhang, Yongwei Wu, Kang Chen, Jinlei Jiang, and Keqin Li, Senior Member, IEEE, Liquid: A Scalable Deduplication File System for Virtual Machine ImagesIEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, VOL. 25, NO. 5, MAY 2014. [2]AmazonMachineImage,Sept.2001.[Online]. Available http://en.wikipedia.org/wiki/Amazon/Machine/Image. [3]BittorrentProtocol,Sept.2011.[Online]. Available http://en.wikipedia.org/wiki/BitTorrent/protocol. [4]BloomFilter,Sept.2011.[Online]. Available http://en.wikipedia.org/wiki/Bloom/filter [5]Xfs:AHigh Performance Journaling Filesystem, Sept.2011.[Online].http://oss.sgi.com/projects/xfs
  • 25. [6]RabinFingerprint,Sept.2011.[Online]. Available http://en.wikipedia.org/wiki/Rabin/fingerprint. [7]DataDeduplication,Sept.2013.[Online]. Available:http://en.wikipedia.org/wiki/Data/deduplication. [8]A.T. Clements, I. Ahmad, M. Vilayannur, and J. Li, Decentralized Deduplication in San Cluster File Systems, in Proc. Conf. USENIX Annu. Techn. Conf., 2009, p. 8, USENIX Association. [9]K. Jin and E.L. Miller, The Effectiveness of Deduplication on VirtualMachine Disk Images, in Proc. SYSTOR, Israeli Exp. Syst. Conf., New York, NY, USA, 2009, pp.1-12. [10]A. Liguori and E. Hensbergen, Experiences with Content Addressable Storage and Virtual Disks,,in Proc. WIOV08, San Diego, CA, USA, 2008, p. 5.
  • 26. [11]M. McLoughlin,The qcow2 Image Format, Sept. 2011. [Online]. Available: http://people.gnome.org/markmc/qcow-image-format.html. [12]C. Tang, Fvd: A High-Performance Virtual Machine Image Format for Cloud, in Proc. USENIX Conf. USENIX Annu. Tech. Conf., 2011, p. 18. [13]B. Zhu, K. Li, and H. Patterson,Avoiding the Disk Bottleneck in the Data Domain Deduplication File System, in Proc. 6th USENIX Conf. FAST, Berkeley, CA, USA, 2008, pp. 269-282.