Do you know where your data is? Do you know who is responsible for a specific dataset? Do you know which application or task modified a given entity last Friday? Apache Atlas helps you manage the metadata for your data. With Apache Atlas, you can trace the lineage between your datasets and the processes that use them.