Online data deduplication for in memory big-data analytic systems