Design of a lightweight set of data pipelines for scrubbing personally identifiable information (PII). Scrubbing PII makes data easier to share. It also lets organisations confidently move data outside the organisation for large-scale analytics in the cloud.
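
As a rough illustration of what a single scrubbing step in such a pipeline might look like, here is a minimal sketch. It assumes records arrive as Python dictionaries and that PII is found with simple regular expressions. The names `scrub_text` and `scrub_records`, the two patterns, and the `<EMAIL>` / `<PHONE>` placeholders are all hypothetical and not taken from this project; real PII detection needs far broader coverage than these two patterns.

```python
import re

# Hypothetical, deliberately simple patterns for illustration only.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{7,}\d")


def scrub_text(text: str) -> str:
    """Replace email addresses and phone-like numbers with placeholders."""
    text = EMAIL_RE.sub("<EMAIL>", text)
    text = PHONE_RE.sub("<PHONE>", text)
    return text


def scrub_records(records, fields):
    """Yield copies of each record with the named string fields scrubbed.

    The input records are not modified, so the raw data can stay
    inside the organisation while the scrubbed copies are shared.
    """
    for record in records:
        cleaned = dict(record)
        for field in fields:
            value = cleaned.get(field)
            if isinstance(value, str):
                cleaned[field] = scrub_text(value)
        yield cleaned


if __name__ == "__main__":
    rows = [{"id": 1, "note": "Contact jane.doe@example.com or +1 (555) 123-4567."}]
    for row in scrub_records(rows, fields=["note"]):
        print(row)
```

Writing the step as a generator keeps it lightweight: records stream through one at a time, and further steps (for example, name or address detection) could be chained in the same way.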