The document outlines an exploratory data analysis agenda that includes gathering data from public and social media websites, cleaning messy data using OpenRefine, analyzing data using classic Unix tools like sed, awk, and parallel, and calculating statistics.