The document discusses using Cassandra and Solr together to build a search engine capable of handling large amounts of unstructured data from the internet. It outlines the goals of collecting and normalizing web data at scale and making it searchable. It then describes the challenges faced with memory, speed, space and data representation when implementing this system and potential solutions and future work.