This document discusses the need for data unification across enterprises. It notes that while companies have invested trillions in IT and billions in big data and analytics, data remains extremely siloed within organizations. True big data and analytics require clean and unified data sources. The document advocates for a bottom-up, probabilistic and collaborative approach to data unification using machine learning combined with human insight and curation. This approach is needed to tackle the huge challenge of integrating and making sense of the massive variety of siloed data sources within large organizations. The document provides examples of how Tamr's data unification platform has helped large healthcare and biopharma companies achieve a unified view of their extremely diverse and decentralized data.