bup backup system (2011-04)

1.
bup: the git-basedbackup system Avery Pennarun 2011 04 30

2.
The Challenge Backup entire filesystems (> 1TB)

3.
Including huge VMdisk images (files >100GB)

4.
Lots of separatefiles (500k or more)

5.
Calculate/store incrementals efficiently

6.
Create backups inO(n), where n = number of changed bytes

7.
Incremental backup directto a remote computer (no local copy)

8.
...and don't forgetmetadata 5

9.
The Result bupis extremely fast - ~80 megs/sec in python

10.
Sub-file incrementals arevery space efficient >5x better than rsnapshot in real-life use VMs compress smaller and faster than gzip

11.
Dedupe between different client machines

12.
O(log N) seektimes to any part of any file

13.
You can mountyour backup history as a filesystem or browse it as a web page 8

14.
The Design 8

15.
Why git? Easilyhandles file renames

16.
Easily deduplicates betweenidentical files and trees

17.
Debug my codeusing git commands

18.
Someone already thoughtabout the repo format (packs, idxes)

19.
Three-level “work tree”vs “index” vs “commit” 11

20.
Problem 1: Largefiles 'git gc' explodes badly on large files; totally unusable

21.
git bigfiles fork“solves” the problem by just never deltifying large objects: lame

22.
zlib window sizeis very small: lousy compression on VM images -- 13

23.
Digression: zlib windowsize gzip only does two things: backref: copy some bytes from the preceding 64k window

24.
huffman code: adictionary of common words That 64k window is a serious problem!

25.
Duplicated data >64kapart can't be compressed

26.
cat file.tar file.tar| gzip -c | wc -c surprisingly, twice the size of a single tar.gz 18

27.
bupsplit Uses arolling checksum to -- 18

28.
Digression: rolling checksumsPopularized by rsync (Andrew Tridgell, the Samba guy)

29.
He wrote a(readable) Ph.D. thesis about it

30.
bup uses avariant of the rsync algorithm to -- 19

31.
Double Digression: rsyncalgorithm First player: Divide the file into fixed-size chunks

32.
Send the listof all chunk checksums Second player: Look through existing files for any blocks that have those checksums

33.
But any n-byte subsequence might be the match

34.
Searching naively isabout O(n^2) ... ouch.

35.
So we usea rolling checksum instead 21

36.
Digression: rolling checksumsCalculate the checksum of bytes 0..n

37.
Remove byte 0,add byte n+1, to get the checksum from 1..n+1 And so on Searching is now more like O(n)... vastly faster

38.
Requires a special“rollable” checksum (adler32) 23

39.
Digression: gzip --rsyncableYou can't rsync gzipped files efficiently

40.
Changing a byteearly in a file changes the compression dictionary, so the rest of the file is different

41.
--rsyncable resets thecompression dictionary whenever low bits of adler32 == 0

42.
Fraction of apercent overhead on file size

43.
But now yourgzip files are rsyncable! 26

44.
bupsplit Based ongzip --rsyncable

45.
Instead of acompression dictionary, we break the file into blocks on adler32 boundaries If low 13 checksum bits are 1, end this chunk

46.
Average chunk: 2**13= 8192 Now we have a list of chunks and their sums

47.
Inserting/deleting bytes changesat most two chunks! 28

48.
bupsplit trees Inspiredby “Tiger tree” hashing used in some P2P systems

49.
Arrange chunk listinto trees: If low 17 checksum bits are 1, end a superchunk

50.
If low 21bits are 1, end a superduperchunk

51.
and so on.Superchunk boundary is also a chunk boundary

52.
Inserting/deleting bytes changesat most 2*log(n) chunks! 30

53.
Advantages of bupsplitNever loads the whole file into RAM

54.
Compresses most VMimages more (and faster) than gzip

55.
Works well onbinary and text files

56.
Don't need toteach it about file formats

57.
Diff huge filesin about O(log n) time

58.
Seek to anyoffset in a file in O(log n) time 32

59.
Problem 2: Millionsof objects Plain git format: 1TB of data / 8k per chunk: 122 million chunks

60.
x 20 bytesper SHA1: 2.4GB

61.
Divided into 2GBpacks: 500 .idx files of 5MB each

62.
8-bit prefix lookuptable Adding a new chunk means searching 500 * (log(5MB)-8) hops = 500 * 14 hops = 500 * 7 pages = 3500 pages 34

63.
Millions of objects(cont'd) bup .midx files: merge all .idx into a single .midx

64.
Larger initial lookuptable to immediately narrow search to the last 7 bits

65.
log(2.4GB) = 31bits

66.
24-bit lookup table* 4 bytes per entry: 64 MB

67.
Adding a newchunk always only touches two pages 35

68.
Bloom Filters Problemswith .midx: Have to rewrite the whole file to merge a new .idx

69.
Storing 20 bytesper hash is wasteful; even 48 bits would be unique

70.
Paging in twopages per chunk is maybe too much Solution: Bloom filters Idea borrowed from Plan9 WORM fs

71.
Create a bighash table

72.
Hash each blockseveral ways

73.
Gives false positives,never false negatives

74.
Can update incrementally36

75.
bup command line:indexed mode Saving: bup index -vux /

76.
bup save -vnmybackup / Restoring: bup restore (extract multiple files)

77.
bup ftp (command-linebrowsing)

78.
bup web

79.
bup fuse (mountbackups as a filesystem) 36

80.
Not implemented yetPruning old backups 'git gc' is far too simple minded to handle a 1TB backup set Metadata Almost done: 'bup meta' and the .bupmeta files 37

81.
Crazy Ideas Encryption

82.
'bup restore' thatupdates the index like 'git checkout' does

83.
Distributed filesystem: bittorrentwith a smarter data structure 39

84.
Questions? 40

bup backup system (2011-04)

More Related Content

What's hot

Similar to bup backup system (2011-04)

Recently uploaded

bup backup system (2011-04)