Data anonymization

Published in: Technology

  1. SATYAM AGARWALA, DEVELOPER
  2. DATA ANONYMIZATION
  3. Why do we need data?
  4. What is data anonymization?
  5. Why anonymize data?
  6. How do we anonymize data?
  7. https://github.com/sunitparekh/data-anonymization (Sunit Parekh, Satyam Agarwala)
  8. BLACKLIST: You choose which attributes to anonymize.
       first name : Satyam
       last name  : Agarwala
       address    : 87B Amoy Street
       zipcode    : 069906
       handphone  : 8765 4321
       birth date : 01/01/1945
  9. WHITELIST: You choose which attributes NOT to anonymize.
       first name : Satyam
       last name  : Woodward
       address    : 10 Downing Street
       zipcode    : 123456
       handphone  : 8765 4321
       birth date : 01/01/1945
  10. Show me!
  11. SUMMARY: Script (DSL, strategies, parallelization); ORM (RDBMS, NoSQL); source DB; destination DB
  12. GOTCHAS
       FK CONSTRAINTS: Disable foreign key checks globally before running the script.
       UNIQUE CONSTRAINTS: Whitelist or ensure a sequential non-random strategy for attributes that need to be unique.
  13. Are there other ways to anonymize data?
  14. FORMAL APPROACH: k-anonymity, l-diversity, t-closeness, δ-presence
  15. ALTERNATIVE TOOLS: Arx, https://github.com/arx-deidentifier/arx
  16. THANK YOU!
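
The blacklist and whitelist slides (8 and 9) describe two ways of choosing what gets anonymized. The Python sketch below illustrates that difference only; it is not the API of the data-anonymization tool linked on slide 7. The function names, the `mask` strategy, and the choice of which fields go on each list are all assumptions made for illustration.

```python
from typing import Callable, Dict, Iterable

Record = Dict[str, str]


def mask(value: str) -> str:
    """Hypothetical strategy: replace every character with '*'."""
    return "*" * len(value)


def anonymize_blacklist(record: Record, blacklist: Iterable[str],
                        strategy: Callable[[str], str] = mask) -> Record:
    """Anonymize ONLY the listed fields; every other field is copied as-is."""
    listed = set(blacklist)
    return {k: strategy(v) if k in listed else v for k, v in record.items()}


def anonymize_whitelist(record: Record, whitelist: Iterable[str],
                        strategy: Callable[[str], str] = mask) -> Record:
    """Copy ONLY the listed fields as-is; every other field is anonymized."""
    listed = set(whitelist)
    return {k: v if k in listed else strategy(v) for k, v in record.items()}


# Sample record taken from the slides.
record = {
    "first name": "Satyam",
    "last name": "Agarwala",
    "address": "87B Amoy Street",
    "zipcode": "069906",
    "handphone": "8765 4321",
    "birth date": "01/01/1945",
}

# Assumed lists, chosen only for this example.
blacklisted = anonymize_blacklist(record, ["last name", "address", "zipcode"])
whitelisted = anonymize_whitelist(record, ["handphone", "birth date"])
```

With a whitelist, a column added to the schema later is anonymized by default, whereas with a blacklist it is copied unchanged until someone lists it.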
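
The UNIQUE CONSTRAINTS gotcha on slide 12 recommends a sequential, non-random strategy for attributes that must stay unique. One possible shape for such a strategy is sketched below in Python; `sequential_strategy` and the sample e-mail addresses are hypothetical and are not taken from the tool itself.

```python
import itertools
from typing import Callable


def sequential_strategy(prefix: str, suffix: str = "") -> Callable[[str], str]:
    """Build a strategy that ignores the original value and returns
    prefix + running number + suffix, so no two outputs collide."""
    counter = itertools.count(1)

    def strategy(_original: str) -> str:
        return f"{prefix}{next(counter)}{suffix}"

    return strategy


# Hypothetical unique column: e-mail addresses.
anonymize_email = sequential_strategy("user", "@example.com")
originals = ["satyam@corp.sg", "sunit@corp.sg", "satyam@corp.sg"]
anonymized = [anonymize_email(value) for value in originals]
```

A random replacement could produce the same value twice and violate the constraint on insert; a counter cannot repeat within one run.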
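
Of the formal approaches named on slide 14, k-anonymity is the simplest to check: a table is k-anonymous if every combination of quasi-identifier values is shared by at least k rows. The Python sketch below measures that k for a small table. The function name, the column names, and the sample rows are assumptions made for illustration and are unrelated to Arx or any other tool.

```python
from collections import Counter
from typing import Dict, List, Sequence


def k_of(records: List[Dict[str, str]], quasi_identifiers: Sequence[str]) -> int:
    """Return the size of the smallest group of rows that share the same
    quasi-identifier values (0 for an empty table)."""
    groups = Counter(
        tuple(row[q] for q in quasi_identifiers) for row in records
    )
    return min(groups.values()) if groups else 0


# Hypothetical, already generalized rows: zipcode truncated, birth year bucketed.
rows = [
    {"zipcode": "0699**", "birth decade": "1940s", "diagnosis": "flu"},
    {"zipcode": "0699**", "birth decade": "1940s", "diagnosis": "asthma"},
    {"zipcode": "1234**", "birth decade": "1980s", "diagnosis": "flu"},
    {"zipcode": "1234**", "birth decade": "1980s", "diagnosis": "flu"},
]

k = k_of(rows, ["zipcode", "birth decade"])
```

Here each (zipcode, birth decade) pair occurs twice, so the table is 2-anonymous with respect to those two columns. The second group has a single diagnosis value, which is the kind of weakness l-diversity is meant to address.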
