This document discusses leveraging Apache Solr and Apache Spark together for large-scale data analysis. It begins with an overview of Solr and how it can be used for search and analytics through features like faceting. It then introduces Apache Spark, noting how it can be used to process large amounts of data in parallel. The document demonstrates how Spark can be used to import log file data into Solr in parallel and also to perform distributed analytics on Solr data. It highlights the SolrRDD abstraction for accessing Solr from Spark and shows examples of using SQL and DataFrames with Spark on Solr data.