SlideShare a Scribd company logo
1 of 29
Download to read offline
Understanding and Surpassing Dropbox: Efficient
Incremental Synchronization in Cloud Storage Services
Shenglong Li 1 Quanlu Zhang 1 Zhi Yang 1 Yafei Dai 1
1Peking University
(lishenglong, zql, yangzhi, dyf)@net.pku.edu.cn
June 18, 2016
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 1 / 29
Outline
1 Introduction
Background
Objective
Contribution
2 Related Work
Measurement of cloud storage
services
Similarity detection technique
State of The Art
3 Understanding Incremental Sync
Of Cloud Storage Services
Rsync Algorithm
Sync Mechanism on Dropbox
Detail Measurement and
Analysis
4 System Design and
Implementation
System Architecture
Delta Sharing
Chunk-Based Rsync with
Similarity Detection
Efficient conflict resolution
5 Evaluation
Modification Benchmark
File Conflict
Comparison with other cloud
services
Evaluation of Additional
Overhead
6 Conclusion
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 2 / 29
Introduction Background
Cloud Storage Services
With increasing demand of users for high data reliability and convenient
data access, cloud storage services have become extremely prevalent and
reached phenomenal levels of success. These are famous for file sharing
scenarios.
Sea File
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 3 / 29
Introduction Objective
Understanding and Surpassing Dropbox
Data synchronization is the heart of cloud storage
services with incremental data synchronization is applied
to minimize network traffic.
Whether the ”modified data = uploaded data” for
active client.
Whether the ”downloaded data (passive client) =
uploaded data (active client)”.
Whether both active and passive client still presents
efficiency during file conflict.
Create an improved prototype based on findings.
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 4 / 29
Introduction Contribution
3
Measurement on Dropbox
Conduct intensive measurements on Dropbox in file
sharing scenarios.
Mechanism on Dropbox
Unravel the sync mechanisms employed in Dropbox on
both active and passive clients.
Minbox
Design several novel mechanisms, which resolve the traffic
problems, and apply them in an efficient incremental
synchronization system
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 5 / 29
Related Work Measurement of cloud storage services
Measurement of cloud storage services
Drago first uncovers the Dropbox system architecture
and data sync mechanism through an ISP-level
large-scale measurement.
Li reveals the traffic overuse problem in Dropbox when
user frequently modifies the files in synced folder and
he proposes an efficient batched sync mechanism to
avoid massive metadata interaction.
Li focuses on quantifying and understanding traffic
usage effectiveness through the measurements of
several popular cloud storage services on different
devices.
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 6 / 29
Related Work Similarity detection technique
Similarity detection technique
Xia proposes a new similarity detection
algorithm to better exploit similarity with low
RAM overhead and high throughput.
Google deploys SimHash to improve space
efficiency and query quality for web crawling.
Mark Manasse implements MinHash using
shingle sampling technique to extract features.
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 7 / 29
Related Work State of The Art
1
While these previous works cover the data sync
mechanism as one of the key operations, none of
them tries to fully understand the mechanism of
incremental sync technique in file sharing scenario,
and measure the network traffic with different
write behaviours. Moreover, we reveal the network
traffic waste problems that are not explored before
and design several sync mechanisms to solve them.
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 8 / 29
Related Work State of The Art
2
Our system design and implementation are
different from these works. Specifically, we design
an efficient chunk-based delta encoding
mechanism embedding similarity detection
technique, which combines locality-sensitive hash
and content defined chunking technique to
optimize the computation overhead while
guaranteeing precision. Moreover, this mechanism
can integrate with other deduplication techniques
seamlessly.
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 9 / 29
Understanding Incremental Sync Of Cloud Storage Services Rsync Algorithm
An Incremental Data Sync Algorithm
The whole point of rsync is when a file is modified on remote host is not
to send the whole file to the client but to send only the modified
part.
When a file is modified, the client retrieves a signature it which consists of
strong checksums (e.g., black2, MD5) and weak checksums (e.g., Adler-32,
a type of rolling checksum).
The client first computes weak checksums of the blocks in the changed file.
If the checksum matches one of the retrieved checksums, the client
calculates its strong checksum to verify if the two blocks are indeed the
same.
While if not match, the client rolls one byte forward and calculates weak
checksum again to find the same blocks,vwhich appeals to finding out the
skewed content.
Finally, all the different parts, called delta, can be found and sent back to
the server. The changed file is generated on the server by merging delta and
the original file, which is called patch the new file.
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 10 / 29
Understanding Incremental Sync Of Cloud Storage Services Rsync Algorithm
Illustration of Rsync
Old File
signature
New File
Delta
(patch)
+
+
ServerClient
signature
signature
Delta
(patch)
Old File
Delta
(patch)
New File
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 11 / 29
Understanding Incremental Sync Of Cloud Storage Services Sync Mechanism on Dropbox
Dropbox Index Server and Amazon Data Server
Index
Server
Client
Data Server
1. Request file
location
2. Sends
file
location
3. Sync file using rsync on
certain file location
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 12 / 29
Understanding Incremental Sync Of Cloud Storage Services Detail Measurement and Analysis
Active and Passive Client
Dropbox Servers
P
a
s
s
i
v
e
C
li
e
n
t
s
Active Client
10MB + 1B
sync sync
10MB
Add or
Modified
=
or
10MB1B
1B
10MBor 1B
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 13 / 29
Understanding Incremental Sync Of Cloud Storage Services Detail Measurement and Analysis
Experiment 1: Replacement at Different Positions
1. Divide both files into 4MB chunks. For example:
4MB 4MB 2MB
4MB 4MB 2MB
2. Check each chunks whether they are identical,
if not then execute rsync.
4MB 4MB 2MB
Same Same rsync
4MB 4MB 2MB
For Figure 1 rsync is executed on:
4MB 4MB 2MB
4MB 4MB 2MB
or
4MB 4MB 2MB
4MB 4MB 2MB
or
4MB 4MB 2MB
4MB 4MB 2MB
Uplink (A) &
Downlink (B)
(based on delta)
should be the
same size:
Downlink for
client A is the
same because
“active client”
already stores
signature data.
Passive clients
have to sent
the signature
to data server
and that’s why
there’s uplink.
Uplink when
“end” modified is
smaller because
4KB block for
rsync. Librsync
uses 256-bit strong
checksum and 32-
bit weak checksum ((256b+32b)/8)*(4MB/4KB) =
36KB
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 14 / 29
Understanding Incremental Sync Of Cloud Storage Services Detail Measurement and Analysis
Experiment 1: Insertion at Different Positions
4MB 4MB 2MB
4MB 4MB 2MB
Rsync on every
block. Signature
sent = 36KB + 36
KB + 18KB =
90KB
4MB 4MB 2MB
4MB 4MB 2MB
Rsync on block 2
and 3. Signature
sent = 36 KB +
18KB = 54KB
4MB 4MB 2MB
4MB 4MB 2MB
Rsync on last
block only.
Signature sent =
18KB
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 15 / 29
Understanding Incremental Sync Of Cloud Storage Services Detail Measurement and Analysis
Experiment 2: Modification with different amounts of data
Replace or insert different amounts of data, ranging from 4KB to
4MB, in the middle of a 4MB file and a 8MB file.
When replaced content is larger than 100KB, the amounts of data is
less than modified due to data compression in Dropbox.
Data insertion may show waste problem on larger data because of the
fixed lenght skewing, where rsync should have been able to deal with
it normally.
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 16 / 29
Understanding Incremental Sync Of Cloud Storage Services Detail Measurement and Analysis
Experiment 3: File conflict
Figure 3 A and B modifies at the same time and both sync to server.
B reaches first and A sync from server.
But when A’s modified data reaches the server and sync, B treats it as a
new file and redownload whole.
For Figure 4 the case is complicated but the case is similar to Figure 3 but
with 3 file conflicts.
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 17 / 29
System Design and Implementation System Architecture
System Architecture
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 18 / 29
System Design and Implementation Delta Sharing
Delta Sharing
Usually passive client always executes repetitive rsync
to sync update timely which is a waste.
Since passive clients tends to stay online the delta
generated by the active client can be reused.
In other words passive clients doesn’t have to execute
rsync but retrieve delta from delta server.
Passive clients doesn’t have to maintain the online
state since it can be marked through index server.
If passive clients is offline for long, the previous
mechanism is used (execute rsync).
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 19 / 29
System Design and Implementation Chunk-Based Rsync with Similarity Detection
Similarity Detection Mechanism
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 20 / 29
System Design and Implementation Chunk-Based Rsync with Similarity Detection
Algorithm
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 21 / 29
System Design and Implementation Chunk-Based Rsync with Similarity Detection
Algorithm Summary
Use locality-sensitive hash to detect similar chunks.
To reduce computation overhead while guaranteeing
detection precision, it is employed ImpMinHash
algorithm.
The non-deduplicated chunk were sliced into
sub-blocks using Rabin fingerprint.
Then find smallest cyclic redundant check (CRC)
checksums to identify this chunk. Finally used
Jaccard Index to compute similarity between chunks.
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 22 / 29
System Design and Implementation Efficient conflict resolution
Efficient conflict resolution
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 23 / 29
Evaluation Modification Benchmark
Modification Benchmark
Replay experiment 1, and the result is unlike Dropbox, no uplink on Minbox’s passive
client.
Replay experiment 2 that the results can be seen on Figure 8 and Figure 9 where Minbox
implements similarity detection algorithm that outperforms Dropbox.
MinboxFD (native) used fixed length chunking 4MB while MinboxVD uses content
defined chunking (CDC) with average 4MB chunking.
In most cases, MinboxVD performs the best by taking advantage of CDC to avoid the
impact of content skewing.
However, for large modification workloads in 8MB file, MinboxFD outperforms MinboxVD,
because MinboxVD slices the new chunks which are not similar to original chunks.
After the matching for these chunks, MinboxVD may generate more redundant delta
compared with MinboxFD.
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 24 / 29
Evaluation File Conflict
File Conflict
Dropbox downloads the whole file while Minbox only needs to
download the delta.
High network efficiency on Minbox.
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 25 / 29
Evaluation Comparison with other cloud services
Comparison with other cloud services
Figure 12 shows comparison between Seafile and Minbox that also uses
CDC.
Seafile have to send the whole modified chunk, and client download whole
file in each case while Minbox only deals with the rsync part.
Comparison with others such as Google Drive and One Drive, Minbox took
advantage of the incremental sync mechanism.
For file conflict others downloads the whole file, while Minbox uses rsync.
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 26 / 29
Evaluation Evaluation of Additional Overhead
Evaluation of Additional Overhead
Finally, it is necessary to discuss the overhead of
Imp-MinHash in Minbox. We generate
ImpMinHash of a 4MB file and record the
signature size and computation time. The result is
that ImpMinHash has the same size as MinHash
which consumes little bytes compared with Rsync
signature. For computation time of signature,
ImpMinHash consumes two additional CPU ticks
in comparison to Rsync.
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 27 / 29
Conclusion
Efficient Incremental Synchronization in Cloud Storage
Services
Understanding Dropbox
In this paper, it is conducted comprehensive measurements on
Dropbox in file sharing scenario and unravel the incremental sync
mechanism inside Dropbox.
Surpassing Dropbox
Meanwhile, it is revealed the significant network traffic waste existing
in Dropbox, then designed and implemented an efficient incremental
sync system to solve these problems.
In the evaluation, Minbox significantly reduces the network traffic
during sync and solves the problem of file conflict with little overhead.
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 28 / 29
Conclusion
Thank you
Any comments or questions?
Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 29 / 29

More Related Content

What's hot

A location based least-cost scheduling for data-intensive applications
A location based least-cost scheduling for data-intensive applicationsA location based least-cost scheduling for data-intensive applications
A location based least-cost scheduling for data-intensive applicationsIAEME Publication
 
International Journal of Computational Science and Information Technology (...
  International Journal of Computational Science and Information Technology (...  International Journal of Computational Science and Information Technology (...
International Journal of Computational Science and Information Technology (...ijcsity
 
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...Laurens De Vocht
 
Flink Forward Berlin 2017: Francesco Versaci - Integrating Flink and Kafka in...
Flink Forward Berlin 2017: Francesco Versaci - Integrating Flink and Kafka in...Flink Forward Berlin 2017: Francesco Versaci - Integrating Flink and Kafka in...
Flink Forward Berlin 2017: Francesco Versaci - Integrating Flink and Kafka in...Flink Forward
 
Reflections on Almost Two Decades of Research into Stream Processing
Reflections on Almost Two Decades of Research into Stream ProcessingReflections on Almost Two Decades of Research into Stream Processing
Reflections on Almost Two Decades of Research into Stream ProcessingKyumars Sheykh Esmaili
 
CODA-ccgrid21
CODA-ccgrid21CODA-ccgrid21
CODA-ccgrid21sina2127
 
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...Jonas Traub
 
Improvement of Congestion window and Link utilization of High Speed Protocols...
Improvement of Congestion window and Link utilization of High Speed Protocols...Improvement of Congestion window and Link utilization of High Speed Protocols...
Improvement of Congestion window and Link utilization of High Speed Protocols...IOSR Journals
 
Continuous Processing with Apache Flink - Strata London 2016
Continuous Processing with Apache Flink - Strata London 2016Continuous Processing with Apache Flink - Strata London 2016
Continuous Processing with Apache Flink - Strata London 2016Stephan Ewen
 
Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...
Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...
Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...Flink Forward
 
Flink Streaming Hadoop Summit San Jose
Flink Streaming Hadoop Summit San JoseFlink Streaming Hadoop Summit San Jose
Flink Streaming Hadoop Summit San JoseKostas Tzoumas
 
cis97003
cis97003cis97003
cis97003perfj
 
Many Task Applications for Grids and Supercomputers
Many Task Applications for Grids and SupercomputersMany Task Applications for Grids and Supercomputers
Many Task Applications for Grids and SupercomputersIan Foster
 
Flink Forward San Francisco 2019: Moving from Lambda and Kappa Architectures ...
Flink Forward San Francisco 2019: Moving from Lambda and Kappa Architectures ...Flink Forward San Francisco 2019: Moving from Lambda and Kappa Architectures ...
Flink Forward San Francisco 2019: Moving from Lambda and Kappa Architectures ...Flink Forward
 
Ling liu part 01:big graph processing
Ling liu part 01:big graph processingLing liu part 01:big graph processing
Ling liu part 01:big graph processingjins0618
 

What's hot (19)

Spark
SparkSpark
Spark
 
A location based least-cost scheduling for data-intensive applications
A location based least-cost scheduling for data-intensive applicationsA location based least-cost scheduling for data-intensive applications
A location based least-cost scheduling for data-intensive applications
 
Journal paper 1
Journal paper 1Journal paper 1
Journal paper 1
 
International Journal of Computational Science and Information Technology (...
  International Journal of Computational Science and Information Technology (...  International Journal of Computational Science and Information Technology (...
International Journal of Computational Science and Information Technology (...
 
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...
Using Triple Pattern Fragments To Enable Streaming of Top-k Shortest Paths vi...
 
Flink Forward Berlin 2017: Francesco Versaci - Integrating Flink and Kafka in...
Flink Forward Berlin 2017: Francesco Versaci - Integrating Flink and Kafka in...Flink Forward Berlin 2017: Francesco Versaci - Integrating Flink and Kafka in...
Flink Forward Berlin 2017: Francesco Versaci - Integrating Flink and Kafka in...
 
Reflections on Almost Two Decades of Research into Stream Processing
Reflections on Almost Two Decades of Research into Stream ProcessingReflections on Almost Two Decades of Research into Stream Processing
Reflections on Almost Two Decades of Research into Stream Processing
 
CODA-ccgrid21
CODA-ccgrid21CODA-ccgrid21
CODA-ccgrid21
 
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
 
Improvement of Congestion window and Link utilization of High Speed Protocols...
Improvement of Congestion window and Link utilization of High Speed Protocols...Improvement of Congestion window and Link utilization of High Speed Protocols...
Improvement of Congestion window and Link utilization of High Speed Protocols...
 
H04845157
H04845157H04845157
H04845157
 
Continuous Processing with Apache Flink - Strata London 2016
Continuous Processing with Apache Flink - Strata London 2016Continuous Processing with Apache Flink - Strata London 2016
Continuous Processing with Apache Flink - Strata London 2016
 
Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...
Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...
Flink Forward San Francisco 2019: Massive Scale Data Processing at Netflix us...
 
Eryk_Kulikowski_a4
Eryk_Kulikowski_a4Eryk_Kulikowski_a4
Eryk_Kulikowski_a4
 
Flink Streaming Hadoop Summit San Jose
Flink Streaming Hadoop Summit San JoseFlink Streaming Hadoop Summit San Jose
Flink Streaming Hadoop Summit San Jose
 
cis97003
cis97003cis97003
cis97003
 
Many Task Applications for Grids and Supercomputers
Many Task Applications for Grids and SupercomputersMany Task Applications for Grids and Supercomputers
Many Task Applications for Grids and Supercomputers
 
Flink Forward San Francisco 2019: Moving from Lambda and Kappa Architectures ...
Flink Forward San Francisco 2019: Moving from Lambda and Kappa Architectures ...Flink Forward San Francisco 2019: Moving from Lambda and Kappa Architectures ...
Flink Forward San Francisco 2019: Moving from Lambda and Kappa Architectures ...
 
Ling liu part 01:big graph processing
Ling liu part 01:big graph processingLing liu part 01:big graph processing
Ling liu part 01:big graph processing
 

Viewers also liked (10)

Act 1 equipo rojo
Act 1 equipo rojoAct 1 equipo rojo
Act 1 equipo rojo
 
Ada 2 Bloque 3
Ada 2 Bloque 3Ada 2 Bloque 3
Ada 2 Bloque 3
 
Participacion ciudadana soberania
Participacion ciudadana soberaniaParticipacion ciudadana soberania
Participacion ciudadana soberania
 
Kdd 2016 slide
Kdd 2016 slideKdd 2016 slide
Kdd 2016 slide
 
Class Presentation on WhatsApp
Class Presentation on WhatsAppClass Presentation on WhatsApp
Class Presentation on WhatsApp
 
Dropbox - Architecture and Business Prospective
Dropbox - Architecture and Business ProspectiveDropbox - Architecture and Business Prospective
Dropbox - Architecture and Business Prospective
 
DÍA DE LA HISPANIDAD
DÍA DE LA HISPANIDADDÍA DE LA HISPANIDAD
DÍA DE LA HISPANIDAD
 
Exposición sistema solar
Exposición sistema solarExposición sistema solar
Exposición sistema solar
 
Programa pisa de la ocde
Programa pisa de la ocdePrograma pisa de la ocde
Programa pisa de la ocde
 
Imagenes animacion
Imagenes  animacionImagenes  animacion
Imagenes animacion
 

Similar to Presentation of Understanding and Surpassing Dropbox Globecom 2015

Frequency and similarity aware partitioning for cloud storage based on space ...
Frequency and similarity aware partitioning for cloud storage based on space ...Frequency and similarity aware partitioning for cloud storage based on space ...
Frequency and similarity aware partitioning for cloud storage based on space ...redpel dot com
 
2023comp90024_Spartan.pdf
2023comp90024_Spartan.pdf2023comp90024_Spartan.pdf
2023comp90024_Spartan.pdfLevLafayette1
 
Evaluation of load balancing approaches for Erlang concurrent application in ...
Evaluation of load balancing approaches for Erlang concurrent application in ...Evaluation of load balancing approaches for Erlang concurrent application in ...
Evaluation of load balancing approaches for Erlang concurrent application in ...TELKOMNIKA JOURNAL
 
PROPOSED LOAD BALANCING ALGORITHM TO REDUCE RESPONSE TIME AND PROCESSING TIME...
PROPOSED LOAD BALANCING ALGORITHM TO REDUCE RESPONSE TIME AND PROCESSING TIME...PROPOSED LOAD BALANCING ALGORITHM TO REDUCE RESPONSE TIME AND PROCESSING TIME...
PROPOSED LOAD BALANCING ALGORITHM TO REDUCE RESPONSE TIME AND PROCESSING TIME...IJCNCJournal
 
STUDY THE EFFECT OF PARAMETERS TO LOAD BALANCING IN CLOUD COMPUTING
STUDY THE EFFECT OF PARAMETERS TO LOAD BALANCING IN CLOUD COMPUTINGSTUDY THE EFFECT OF PARAMETERS TO LOAD BALANCING IN CLOUD COMPUTING
STUDY THE EFFECT OF PARAMETERS TO LOAD BALANCING IN CLOUD COMPUTINGIJCNCJournal
 
Efficient load rebalancing for distributed file system in Clouds
Efficient load rebalancing for distributed file system in CloudsEfficient load rebalancing for distributed file system in Clouds
Efficient load rebalancing for distributed file system in CloudsIJERA Editor
 
AN INITIAL PEER CONFIGURATION ALGORITHM FOR MULTI-STREAMING PEER-TO-PEER NETW...
AN INITIAL PEER CONFIGURATION ALGORITHM FOR MULTI-STREAMING PEER-TO-PEER NETW...AN INITIAL PEER CONFIGURATION ALGORITHM FOR MULTI-STREAMING PEER-TO-PEER NETW...
AN INITIAL PEER CONFIGURATION ALGORITHM FOR MULTI-STREAMING PEER-TO-PEER NETW...ijp2p
 
AN INITIAL PEER CONFIGURATION ALGORITHM FOR MULTI-STREAMING PEER-TO-PEER NETW...
AN INITIAL PEER CONFIGURATION ALGORITHM FOR MULTI-STREAMING PEER-TO-PEER NETW...AN INITIAL PEER CONFIGURATION ALGORITHM FOR MULTI-STREAMING PEER-TO-PEER NETW...
AN INITIAL PEER CONFIGURATION ALGORITHM FOR MULTI-STREAMING PEER-TO-PEER NETW...ijp2p
 
Cost-Minimizing Dynamic Migration of Content Distribution Services into Hybri...
Cost-Minimizing Dynamic Migration of Content Distribution Services into Hybri...Cost-Minimizing Dynamic Migration of Content Distribution Services into Hybri...
Cost-Minimizing Dynamic Migration of Content Distribution Services into Hybri...1crore projects
 
IEEE ICC 2013 - Symbiotic Coupling of P2P and Cloud Systems: The Wikipedia Case
IEEE ICC 2013 - Symbiotic Coupling of P2P and Cloud Systems: The Wikipedia CaseIEEE ICC 2013 - Symbiotic Coupling of P2P and Cloud Systems: The Wikipedia Case
IEEE ICC 2013 - Symbiotic Coupling of P2P and Cloud Systems: The Wikipedia CaseKalman Graffi
 
ABOUT THE SUITABILITY OF CLOUDS IN HIGH-PERFORMANCE COMPUTING
ABOUT THE SUITABILITY OF CLOUDS IN HIGH-PERFORMANCE COMPUTINGABOUT THE SUITABILITY OF CLOUDS IN HIGH-PERFORMANCE COMPUTING
ABOUT THE SUITABILITY OF CLOUDS IN HIGH-PERFORMANCE COMPUTINGcscpconf
 
ABOUT THE SUITABILITY OF CLOUDS IN HIGH-PERFORMANCE COMPUTING
ABOUT THE SUITABILITY OF CLOUDS IN HIGH-PERFORMANCE COMPUTINGABOUT THE SUITABILITY OF CLOUDS IN HIGH-PERFORMANCE COMPUTING
ABOUT THE SUITABILITY OF CLOUDS IN HIGH-PERFORMANCE COMPUTINGcsandit
 
Task Scheduling using Hybrid Algorithm in Cloud Computing Environments
Task Scheduling using Hybrid Algorithm in Cloud Computing EnvironmentsTask Scheduling using Hybrid Algorithm in Cloud Computing Environments
Task Scheduling using Hybrid Algorithm in Cloud Computing Environmentsiosrjce
 
Cooperative Architectures and Algorithms for Discovery and ...
Cooperative Architectures and Algorithms for Discovery and ...Cooperative Architectures and Algorithms for Discovery and ...
Cooperative Architectures and Algorithms for Discovery and ...Videoguy
 
SummerStudentReport-HamzaZafar
SummerStudentReport-HamzaZafarSummerStudentReport-HamzaZafar
SummerStudentReport-HamzaZafarHamza Zafar
 
Event-Driven, Client-Server Archetypes for E-Commerce
Event-Driven, Client-Server Archetypes for E-CommerceEvent-Driven, Client-Server Archetypes for E-Commerce
Event-Driven, Client-Server Archetypes for E-Commerceijtsrd
 
IEEE Datamining 2016 Title and Abstract
IEEE  Datamining 2016 Title and AbstractIEEE  Datamining 2016 Title and Abstract
IEEE Datamining 2016 Title and Abstracttsysglobalsolutions
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 

Similar to Presentation of Understanding and Surpassing Dropbox Globecom 2015 (20)

Thesis presentation
Thesis presentationThesis presentation
Thesis presentation
 
Frequency and similarity aware partitioning for cloud storage based on space ...
Frequency and similarity aware partitioning for cloud storage based on space ...Frequency and similarity aware partitioning for cloud storage based on space ...
Frequency and similarity aware partitioning for cloud storage based on space ...
 
2023comp90024_Spartan.pdf
2023comp90024_Spartan.pdf2023comp90024_Spartan.pdf
2023comp90024_Spartan.pdf
 
Evaluation of load balancing approaches for Erlang concurrent application in ...
Evaluation of load balancing approaches for Erlang concurrent application in ...Evaluation of load balancing approaches for Erlang concurrent application in ...
Evaluation of load balancing approaches for Erlang concurrent application in ...
 
PROPOSED LOAD BALANCING ALGORITHM TO REDUCE RESPONSE TIME AND PROCESSING TIME...
PROPOSED LOAD BALANCING ALGORITHM TO REDUCE RESPONSE TIME AND PROCESSING TIME...PROPOSED LOAD BALANCING ALGORITHM TO REDUCE RESPONSE TIME AND PROCESSING TIME...
PROPOSED LOAD BALANCING ALGORITHM TO REDUCE RESPONSE TIME AND PROCESSING TIME...
 
STUDY THE EFFECT OF PARAMETERS TO LOAD BALANCING IN CLOUD COMPUTING
STUDY THE EFFECT OF PARAMETERS TO LOAD BALANCING IN CLOUD COMPUTINGSTUDY THE EFFECT OF PARAMETERS TO LOAD BALANCING IN CLOUD COMPUTING
STUDY THE EFFECT OF PARAMETERS TO LOAD BALANCING IN CLOUD COMPUTING
 
Efficient load rebalancing for distributed file system in Clouds
Efficient load rebalancing for distributed file system in CloudsEfficient load rebalancing for distributed file system in Clouds
Efficient load rebalancing for distributed file system in Clouds
 
AN INITIAL PEER CONFIGURATION ALGORITHM FOR MULTI-STREAMING PEER-TO-PEER NETW...
AN INITIAL PEER CONFIGURATION ALGORITHM FOR MULTI-STREAMING PEER-TO-PEER NETW...AN INITIAL PEER CONFIGURATION ALGORITHM FOR MULTI-STREAMING PEER-TO-PEER NETW...
AN INITIAL PEER CONFIGURATION ALGORITHM FOR MULTI-STREAMING PEER-TO-PEER NETW...
 
AN INITIAL PEER CONFIGURATION ALGORITHM FOR MULTI-STREAMING PEER-TO-PEER NETW...
AN INITIAL PEER CONFIGURATION ALGORITHM FOR MULTI-STREAMING PEER-TO-PEER NETW...AN INITIAL PEER CONFIGURATION ALGORITHM FOR MULTI-STREAMING PEER-TO-PEER NETW...
AN INITIAL PEER CONFIGURATION ALGORITHM FOR MULTI-STREAMING PEER-TO-PEER NETW...
 
Cost-Minimizing Dynamic Migration of Content Distribution Services into Hybri...
Cost-Minimizing Dynamic Migration of Content Distribution Services into Hybri...Cost-Minimizing Dynamic Migration of Content Distribution Services into Hybri...
Cost-Minimizing Dynamic Migration of Content Distribution Services into Hybri...
 
IEEE ICC 2013 - Symbiotic Coupling of P2P and Cloud Systems: The Wikipedia Case
IEEE ICC 2013 - Symbiotic Coupling of P2P and Cloud Systems: The Wikipedia CaseIEEE ICC 2013 - Symbiotic Coupling of P2P and Cloud Systems: The Wikipedia Case
IEEE ICC 2013 - Symbiotic Coupling of P2P and Cloud Systems: The Wikipedia Case
 
ABOUT THE SUITABILITY OF CLOUDS IN HIGH-PERFORMANCE COMPUTING
ABOUT THE SUITABILITY OF CLOUDS IN HIGH-PERFORMANCE COMPUTINGABOUT THE SUITABILITY OF CLOUDS IN HIGH-PERFORMANCE COMPUTING
ABOUT THE SUITABILITY OF CLOUDS IN HIGH-PERFORMANCE COMPUTING
 
ABOUT THE SUITABILITY OF CLOUDS IN HIGH-PERFORMANCE COMPUTING
ABOUT THE SUITABILITY OF CLOUDS IN HIGH-PERFORMANCE COMPUTINGABOUT THE SUITABILITY OF CLOUDS IN HIGH-PERFORMANCE COMPUTING
ABOUT THE SUITABILITY OF CLOUDS IN HIGH-PERFORMANCE COMPUTING
 
Task Scheduling using Hybrid Algorithm in Cloud Computing Environments
Task Scheduling using Hybrid Algorithm in Cloud Computing EnvironmentsTask Scheduling using Hybrid Algorithm in Cloud Computing Environments
Task Scheduling using Hybrid Algorithm in Cloud Computing Environments
 
N0173696106
N0173696106N0173696106
N0173696106
 
Cooperative Architectures and Algorithms for Discovery and ...
Cooperative Architectures and Algorithms for Discovery and ...Cooperative Architectures and Algorithms for Discovery and ...
Cooperative Architectures and Algorithms for Discovery and ...
 
SummerStudentReport-HamzaZafar
SummerStudentReport-HamzaZafarSummerStudentReport-HamzaZafar
SummerStudentReport-HamzaZafar
 
Event-Driven, Client-Server Archetypes for E-Commerce
Event-Driven, Client-Server Archetypes for E-CommerceEvent-Driven, Client-Server Archetypes for E-Commerce
Event-Driven, Client-Server Archetypes for E-Commerce
 
IEEE Datamining 2016 Title and Abstract
IEEE  Datamining 2016 Title and AbstractIEEE  Datamining 2016 Title and Abstract
IEEE Datamining 2016 Title and Abstract
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 

More from Fajar Purnama

Nimesha Ranasinghe's Research on Digital Flavours.pptx
Nimesha Ranasinghe's Research on Digital Flavours.pptxNimesha Ranasinghe's Research on Digital Flavours.pptx
Nimesha Ranasinghe's Research on Digital Flavours.pptxFajar Purnama
 
Mata Uang Kripto 101 untuk Pengguna: BAB 1 Cerita dan Filosofi
Mata Uang Kripto 101 untuk Pengguna: BAB 1 Cerita dan FilosofiMata Uang Kripto 101 untuk Pengguna: BAB 1 Cerita dan Filosofi
Mata Uang Kripto 101 untuk Pengguna: BAB 1 Cerita dan FilosofiFajar Purnama
 
Tugas Kuliah Flow Control Error dan Selective Repeat ARQ
Tugas Kuliah Flow Control Error dan Selective Repeat ARQTugas Kuliah Flow Control Error dan Selective Repeat ARQ
Tugas Kuliah Flow Control Error dan Selective Repeat ARQFajar Purnama
 
Transmisi citra pada perangkat jaringan sensor visual nirkabel platform imote...
Transmisi citra pada perangkat jaringan sensor visual nirkabel platform imote...Transmisi citra pada perangkat jaringan sensor visual nirkabel platform imote...
Transmisi citra pada perangkat jaringan sensor visual nirkabel platform imote...Fajar Purnama
 
Development of a lossy online mouse tracking method for capturing user intera...
Development of a lossy online mouse tracking method for capturing user intera...Development of a lossy online mouse tracking method for capturing user intera...
Development of a lossy online mouse tracking method for capturing user intera...Fajar Purnama
 
Recruiting future backup cryptocurrency miners with solar power bank on singl...
Recruiting future backup cryptocurrency miners with solar power bank on singl...Recruiting future backup cryptocurrency miners with solar power bank on singl...
Recruiting future backup cryptocurrency miners with solar power bank on singl...Fajar Purnama
 
fp slide on deep investigation of cross language plagiarism detection methods
fp slide on deep investigation of cross language plagiarism detection methodsfp slide on deep investigation of cross language plagiarism detection methods
fp slide on deep investigation of cross language plagiarism detection methodsFajar Purnama
 
I present someone's paper on eye tracking and web design.
I present someone's paper on eye tracking and web design.I present someone's paper on eye tracking and web design.
I present someone's paper on eye tracking and web design.Fajar Purnama
 
Portable and Synchronized Distributed Learning Management System in Severe Ne...
Portable and Synchronized Distributed Learning Management System in Severe Ne...Portable and Synchronized Distributed Learning Management System in Severe Ne...
Portable and Synchronized Distributed Learning Management System in Severe Ne...Fajar Purnama
 
Hand-Carry-Data-Collecting-Through-Questionnaire-and-Quiz-Alike-on-Crowds-usi...
Hand-Carry-Data-Collecting-Through-Questionnaire-and-Quiz-Alike-on-Crowds-usi...Hand-Carry-Data-Collecting-Through-Questionnaire-and-Quiz-Alike-on-Crowds-usi...
Hand-Carry-Data-Collecting-Through-Questionnaire-and-Quiz-Alike-on-Crowds-usi...Fajar Purnama
 
Incremental synchronization-implementation-on-survey-using-hand-carry-server-...
Incremental synchronization-implementation-on-survey-using-hand-carry-server-...Incremental synchronization-implementation-on-survey-using-hand-carry-server-...
Incremental synchronization-implementation-on-survey-using-hand-carry-server-...Fajar Purnama
 
Origin of biosignals fajar purnama 152D-8713
Origin of biosignals fajar purnama 152D-8713Origin of biosignals fajar purnama 152D-8713
Origin of biosignals fajar purnama 152D-8713Fajar Purnama
 
Demonstration on extending_the_pageview_feature_to_page_section_based_present...
Demonstration on extending_the_pageview_feature_to_page_section_based_present...Demonstration on extending_the_pageview_feature_to_page_section_based_present...
Demonstration on extending_the_pageview_feature_to_page_section_based_present...Fajar Purnama
 
Introductory Work on Section Based Page View of Web Contents Presentation
Introductory Work on Section Based Page View of Web Contents PresentationIntroductory Work on Section Based Page View of Web Contents Presentation
Introductory Work on Section Based Page View of Web Contents PresentationFajar Purnama
 
Compatible course content synchronization model for course distribution over ...
Compatible course content synchronization model for course distribution over ...Compatible course content synchronization model for course distribution over ...
Compatible course content synchronization model for course distribution over ...Fajar Purnama
 
Rdiff and Rsync Implementation on Moodle's Backup and Restore Feature of Cour...
Rdiff and Rsync Implementation on Moodle's Backup and Restore Feature of Cour...Rdiff and Rsync Implementation on Moodle's Backup and Restore Feature of Cour...
Rdiff and Rsync Implementation on Moodle's Backup and Restore Feature of Cour...Fajar Purnama
 
Fajar Purnama 152-d8713 Dynamic Content Synchronization Distributed LMS
Fajar Purnama 152-d8713 Dynamic Content Synchronization Distributed LMSFajar Purnama 152-d8713 Dynamic Content Synchronization Distributed LMS
Fajar Purnama 152-d8713 Dynamic Content Synchronization Distributed LMSFajar Purnama
 

More from Fajar Purnama (17)

Nimesha Ranasinghe's Research on Digital Flavours.pptx
Nimesha Ranasinghe's Research on Digital Flavours.pptxNimesha Ranasinghe's Research on Digital Flavours.pptx
Nimesha Ranasinghe's Research on Digital Flavours.pptx
 
Mata Uang Kripto 101 untuk Pengguna: BAB 1 Cerita dan Filosofi
Mata Uang Kripto 101 untuk Pengguna: BAB 1 Cerita dan FilosofiMata Uang Kripto 101 untuk Pengguna: BAB 1 Cerita dan Filosofi
Mata Uang Kripto 101 untuk Pengguna: BAB 1 Cerita dan Filosofi
 
Tugas Kuliah Flow Control Error dan Selective Repeat ARQ
Tugas Kuliah Flow Control Error dan Selective Repeat ARQTugas Kuliah Flow Control Error dan Selective Repeat ARQ
Tugas Kuliah Flow Control Error dan Selective Repeat ARQ
 
Transmisi citra pada perangkat jaringan sensor visual nirkabel platform imote...
Transmisi citra pada perangkat jaringan sensor visual nirkabel platform imote...Transmisi citra pada perangkat jaringan sensor visual nirkabel platform imote...
Transmisi citra pada perangkat jaringan sensor visual nirkabel platform imote...
 
Development of a lossy online mouse tracking method for capturing user intera...
Development of a lossy online mouse tracking method for capturing user intera...Development of a lossy online mouse tracking method for capturing user intera...
Development of a lossy online mouse tracking method for capturing user intera...
 
Recruiting future backup cryptocurrency miners with solar power bank on singl...
Recruiting future backup cryptocurrency miners with solar power bank on singl...Recruiting future backup cryptocurrency miners with solar power bank on singl...
Recruiting future backup cryptocurrency miners with solar power bank on singl...
 
fp slide on deep investigation of cross language plagiarism detection methods
fp slide on deep investigation of cross language plagiarism detection methodsfp slide on deep investigation of cross language plagiarism detection methods
fp slide on deep investigation of cross language plagiarism detection methods
 
I present someone's paper on eye tracking and web design.
I present someone's paper on eye tracking and web design.I present someone's paper on eye tracking and web design.
I present someone's paper on eye tracking and web design.
 
Portable and Synchronized Distributed Learning Management System in Severe Ne...
Portable and Synchronized Distributed Learning Management System in Severe Ne...Portable and Synchronized Distributed Learning Management System in Severe Ne...
Portable and Synchronized Distributed Learning Management System in Severe Ne...
 
Hand-Carry-Data-Collecting-Through-Questionnaire-and-Quiz-Alike-on-Crowds-usi...
Hand-Carry-Data-Collecting-Through-Questionnaire-and-Quiz-Alike-on-Crowds-usi...Hand-Carry-Data-Collecting-Through-Questionnaire-and-Quiz-Alike-on-Crowds-usi...
Hand-Carry-Data-Collecting-Through-Questionnaire-and-Quiz-Alike-on-Crowds-usi...
 
Incremental synchronization-implementation-on-survey-using-hand-carry-server-...
Incremental synchronization-implementation-on-survey-using-hand-carry-server-...Incremental synchronization-implementation-on-survey-using-hand-carry-server-...
Incremental synchronization-implementation-on-survey-using-hand-carry-server-...
 
Origin of biosignals fajar purnama 152D-8713
Origin of biosignals fajar purnama 152D-8713Origin of biosignals fajar purnama 152D-8713
Origin of biosignals fajar purnama 152D-8713
 
Demonstration on extending_the_pageview_feature_to_page_section_based_present...
Demonstration on extending_the_pageview_feature_to_page_section_based_present...Demonstration on extending_the_pageview_feature_to_page_section_based_present...
Demonstration on extending_the_pageview_feature_to_page_section_based_present...
 
Introductory Work on Section Based Page View of Web Contents Presentation
Introductory Work on Section Based Page View of Web Contents PresentationIntroductory Work on Section Based Page View of Web Contents Presentation
Introductory Work on Section Based Page View of Web Contents Presentation
 
Compatible course content synchronization model for course distribution over ...
Compatible course content synchronization model for course distribution over ...Compatible course content synchronization model for course distribution over ...
Compatible course content synchronization model for course distribution over ...
 
Rdiff and Rsync Implementation on Moodle's Backup and Restore Feature of Cour...
Rdiff and Rsync Implementation on Moodle's Backup and Restore Feature of Cour...Rdiff and Rsync Implementation on Moodle's Backup and Restore Feature of Cour...
Rdiff and Rsync Implementation on Moodle's Backup and Restore Feature of Cour...
 
Fajar Purnama 152-d8713 Dynamic Content Synchronization Distributed LMS
Fajar Purnama 152-d8713 Dynamic Content Synchronization Distributed LMSFajar Purnama 152-d8713 Dynamic Content Synchronization Distributed LMS
Fajar Purnama 152-d8713 Dynamic Content Synchronization Distributed LMS
 

Recently uploaded

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsAndrey Dotsenko
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 

Presentation of Understanding and Surpassing Dropbox Globecom 2015

  • 1. Understanding and Surpassing Dropbox: Efficient Incremental Synchronization in Cloud Storage Services Shenglong Li 1 Quanlu Zhang 1 Zhi Yang 1 Yafei Dai 1 1Peking University (lishenglong, zql, yangzhi, dyf)@net.pku.edu.cn June 18, 2016 Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 1 / 29
  • 2. Outline 1 Introduction Background Objective Contribution 2 Related Work Measurement of cloud storage services Similarity detection technique State of The Art 3 Understanding Incremental Sync Of Cloud Storage Services Rsync Algorithm Sync Mechanism on Dropbox Detail Measurement and Analysis 4 System Design and Implementation System Architecture Delta Sharing Chunk-Based Rsync with Similarity Detection Efficient conflict resolution 5 Evaluation Modification Benchmark File Conflict Comparison with other cloud services Evaluation of Additional Overhead 6 Conclusion Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 2 / 29
  • 3. Introduction Background Cloud Storage Services With increasing demand of users for high data reliability and convenient data access, cloud storage services have become extremely prevalent and reached phenomenal levels of success. These are famous for file sharing scenarios. Sea File Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 3 / 29
  • 4. Introduction Objective Understanding and Surpassing Dropbox Data synchronization is the heart of cloud storage services with incremental data synchronization is applied to minimize network traffic. Whether the ”modified data = uploaded data” for active client. Whether the ”downloaded data (passive client) = uploaded data (active client)”. Whether both active and passive client still presents efficiency during file conflict. Create an improved prototype based on findings. Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 4 / 29
  • 5. Introduction Contribution 3 Measurement on Dropbox Conduct intensive measurements on Dropbox in file sharing scenarios. Mechanism on Dropbox Unravel the sync mechanisms employed in Dropbox on both active and passive clients. Minbox Design several novel mechanisms, which resolve the traffic problems, and apply them in an efficient incremental synchronization system Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 5 / 29
  • 6. Related Work Measurement of cloud storage services Measurement of cloud storage services Drago first uncovers the Dropbox system architecture and data sync mechanism through an ISP-level large-scale measurement. Li reveals the traffic overuse problem in Dropbox when user frequently modifies the files in synced folder and he proposes an efficient batched sync mechanism to avoid massive metadata interaction. Li focuses on quantifying and understanding traffic usage effectiveness through the measurements of several popular cloud storage services on different devices. Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 6 / 29
  • 7. Related Work Similarity detection technique Similarity detection technique Xia proposes a new similarity detection algorithm to better exploit similarity with low RAM overhead and high throughput. Google deploys SimHash to improve space efficiency and query quality for web crawling. Mark Manasse implements MinHash using shingle sampling technique to extract features. Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 7 / 29
  • 8. Related Work State of The Art 1 While these previous works cover the data sync mechanism as one of the key operations, none of them tries to fully understand the mechanism of incremental sync technique in file sharing scenario, and measure the network traffic with different write behaviours. Moreover, we reveal the network traffic waste problems that are not explored before and design several sync mechanisms to solve them. Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 8 / 29
  • 9. Related Work State of The Art 2 Our system design and implementation are different from these works. Specifically, we design an efficient chunk-based delta encoding mechanism embedding similarity detection technique, which combines locality-sensitive hash and content defined chunking technique to optimize the computation overhead while guaranteeing precision. Moreover, this mechanism can integrate with other deduplication techniques seamlessly. Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 9 / 29
  • 10. Understanding Incremental Sync Of Cloud Storage Services Rsync Algorithm An Incremental Data Sync Algorithm The whole point of rsync is when a file is modified on remote host is not to send the whole file to the client but to send only the modified part. When a file is modified, the client retrieves a signature it which consists of strong checksums (e.g., black2, MD5) and weak checksums (e.g., Adler-32, a type of rolling checksum). The client first computes weak checksums of the blocks in the changed file. If the checksum matches one of the retrieved checksums, the client calculates its strong checksum to verify if the two blocks are indeed the same. While if not match, the client rolls one byte forward and calculates weak checksum again to find the same blocks,vwhich appeals to finding out the skewed content. Finally, all the different parts, called delta, can be found and sent back to the server. The changed file is generated on the server by merging delta and the original file, which is called patch the new file. Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 10 / 29
  • 11. Understanding Incremental Sync Of Cloud Storage Services Rsync Algorithm Illustration of Rsync Old File signature New File Delta (patch) + + ServerClient signature signature Delta (patch) Old File Delta (patch) New File Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 11 / 29
  • 12. Understanding Incremental Sync Of Cloud Storage Services Sync Mechanism on Dropbox Dropbox Index Server and Amazon Data Server Index Server Client Data Server 1. Request file location 2. Sends file location 3. Sync file using rsync on certain file location Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 12 / 29
  • 13. Understanding Incremental Sync Of Cloud Storage Services Detail Measurement and Analysis Active and Passive Client Dropbox Servers P a s s i v e C li e n t s Active Client 10MB + 1B sync sync 10MB Add or Modified = or 10MB1B 1B 10MBor 1B Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 13 / 29
  • 14. Understanding Incremental Sync Of Cloud Storage Services Detail Measurement and Analysis Experiment 1: Replacement at Different Positions 1. Divide both files into 4MB chunks. For example: 4MB 4MB 2MB 4MB 4MB 2MB 2. Check each chunks whether they are identical, if not then execute rsync. 4MB 4MB 2MB Same Same rsync 4MB 4MB 2MB For Figure 1 rsync is executed on: 4MB 4MB 2MB 4MB 4MB 2MB or 4MB 4MB 2MB 4MB 4MB 2MB or 4MB 4MB 2MB 4MB 4MB 2MB Uplink (A) & Downlink (B) (based on delta) should be the same size: Downlink for client A is the same because “active client” already stores signature data. Passive clients have to sent the signature to data server and that’s why there’s uplink. Uplink when “end” modified is smaller because 4KB block for rsync. Librsync uses 256-bit strong checksum and 32- bit weak checksum ((256b+32b)/8)*(4MB/4KB) = 36KB Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 14 / 29
  • 15. Understanding Incremental Sync Of Cloud Storage Services Detail Measurement and Analysis Experiment 1: Insertion at Different Positions 4MB 4MB 2MB 4MB 4MB 2MB Rsync on every block. Signature sent = 36KB + 36 KB + 18KB = 90KB 4MB 4MB 2MB 4MB 4MB 2MB Rsync on block 2 and 3. Signature sent = 36 KB + 18KB = 54KB 4MB 4MB 2MB 4MB 4MB 2MB Rsync on last block only. Signature sent = 18KB Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 15 / 29
  • 16. Understanding Incremental Sync Of Cloud Storage Services Detail Measurement and Analysis Experiment 2: Modification with different amounts of data Replace or insert different amounts of data, ranging from 4KB to 4MB, in the middle of a 4MB file and a 8MB file. When replaced content is larger than 100KB, the amounts of data is less than modified due to data compression in Dropbox. Data insertion may show waste problem on larger data because of the fixed lenght skewing, where rsync should have been able to deal with it normally. Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 16 / 29
  • 17. Understanding Incremental Sync Of Cloud Storage Services Detail Measurement and Analysis Experiment 3: File conflict Figure 3 A and B modifies at the same time and both sync to server. B reaches first and A sync from server. But when A’s modified data reaches the server and sync, B treats it as a new file and redownload whole. For Figure 4 the case is complicated but the case is similar to Figure 3 but with 3 file conflicts. Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 17 / 29
  • 18. System Design and Implementation System Architecture System Architecture Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 18 / 29
  • 19. System Design and Implementation Delta Sharing Delta Sharing Usually passive client always executes repetitive rsync to sync update timely which is a waste. Since passive clients tends to stay online the delta generated by the active client can be reused. In other words passive clients doesn’t have to execute rsync but retrieve delta from delta server. Passive clients doesn’t have to maintain the online state since it can be marked through index server. If passive clients is offline for long, the previous mechanism is used (execute rsync). Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 19 / 29
  • 20. System Design and Implementation Chunk-Based Rsync with Similarity Detection Similarity Detection Mechanism Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 20 / 29
  • 21. System Design and Implementation Chunk-Based Rsync with Similarity Detection Algorithm Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 21 / 29
  • 22. System Design and Implementation Chunk-Based Rsync with Similarity Detection Algorithm Summary Use locality-sensitive hash to detect similar chunks. To reduce computation overhead while guaranteeing detection precision, it is employed ImpMinHash algorithm. The non-deduplicated chunk were sliced into sub-blocks using Rabin fingerprint. Then find smallest cyclic redundant check (CRC) checksums to identify this chunk. Finally used Jaccard Index to compute similarity between chunks. Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 22 / 29
  • 23. System Design and Implementation Efficient conflict resolution Efficient conflict resolution Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 23 / 29
  • 24. Evaluation Modification Benchmark Modification Benchmark Replay experiment 1, and the result is unlike Dropbox, no uplink on Minbox’s passive client. Replay experiment 2 that the results can be seen on Figure 8 and Figure 9 where Minbox implements similarity detection algorithm that outperforms Dropbox. MinboxFD (native) used fixed length chunking 4MB while MinboxVD uses content defined chunking (CDC) with average 4MB chunking. In most cases, MinboxVD performs the best by taking advantage of CDC to avoid the impact of content skewing. However, for large modification workloads in 8MB file, MinboxFD outperforms MinboxVD, because MinboxVD slices the new chunks which are not similar to original chunks. After the matching for these chunks, MinboxVD may generate more redundant delta compared with MinboxFD. Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 24 / 29
  • 25. Evaluation File Conflict File Conflict Dropbox downloads the whole file while Minbox only needs to download the delta. High network efficiency on Minbox. Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 25 / 29
  • 26. Evaluation Comparison with other cloud services Comparison with other cloud services Figure 12 shows comparison between Seafile and Minbox that also uses CDC. Seafile have to send the whole modified chunk, and client download whole file in each case while Minbox only deals with the rsync part. Comparison with others such as Google Drive and One Drive, Minbox took advantage of the incremental sync mechanism. For file conflict others downloads the whole file, while Minbox uses rsync. Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 26 / 29
  • 27. Evaluation Evaluation of Additional Overhead Evaluation of Additional Overhead Finally, it is necessary to discuss the overhead of Imp-MinHash in Minbox. We generate ImpMinHash of a 4MB file and record the signature size and computation time. The result is that ImpMinHash has the same size as MinHash which consumes little bytes compared with Rsync signature. For computation time of signature, ImpMinHash consumes two additional CPU ticks in comparison to Rsync. Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 27 / 29
  • 28. Conclusion Efficient Incremental Synchronization in Cloud Storage Services Understanding Dropbox In this paper, it is conducted comprehensive measurements on Dropbox in file sharing scenario and unravel the incremental sync mechanism inside Dropbox. Surpassing Dropbox Meanwhile, it is revealed the significant network traffic waste existing in Dropbox, then designed and implemented an efficient incremental sync system to solve these problems. In the evaluation, Minbox significantly reduces the network traffic during sync and solves the problem of file conflict with little overhead. Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 28 / 29
  • 29. Conclusion Thank you Any comments or questions? Presented by: Fajar Purnama (HICC LAB) Kumamoto University, GLOBECOM2015 June 18, 2016 29 / 29