2. 2
Who we are ?
● Joseph Fernandes (Senior Engineer, Red Hat Storage)
● Ewen Pinto (VI Sem MCA, NMAMIT, Nitte)
● Srinivas Billava (VI Sem MCA, NMAMIT, Nitte)
3. 3
Agenda
● What is Dedupe
● Why Dedupe
● Type of Dedupe
● What is Deduped
● Where its Deduped
● When its Deduped
● Challenges in Dedupe
● Current work
5. 5
What is Deduplication?
● Data units are identified by hash index
● Redundant data units replaced by pointers
● Hash algorithm with minimum collision
6. 6
Why dedupe?
● Reduces Total Cost of Ownership (TCO)
● Storage
● Network
● Used in
● Backup/Archive
● Disaster Recovery
● Replication local/remote
Search should be precise and fast
Should have rich metadata filter : Modification Frequency, IO Sizes etc
Should deal with distributed nature of data
Should do load balancing