This document provides steps for setting up Solr on HDFS for indexing data from Hive:

1. Create a Solr instance directory, then create a collection within that instance. This sets up the necessary directories in HDFS for the Solr index and data.
2. Generate CSV data from a Hive database with properly formatted fields, and upload it to HDFS.
3. Run a MapReduce job that uses a Morphline configuration to read the CSV from HDFS, index it into Solr, and go live with the new index data in Solr.
4. Verify that the indexing was successful by searching from the Solr admin interface using primary keys.
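
As a rough illustration of steps 2 and 4, the sketch below serializes rows as CSV with consistent quoting and builds a Solr select URL that looks up a document by its primary key. It is a minimal sketch under stated assumptions, not the procedure this document uses: the field names (`id`, `name`, `description`), the collection name `hive_collection`, and the base URL `http://localhost:8983/solr` are hypothetical placeholders, and the sample rows are made up for the example. It only formats text and builds a URL; it does not contact Hive, HDFS, or Solr.

```python
import csv
import io
from urllib.parse import urlencode

# Hypothetical field names; the real ones come from the Hive table
# and must match the fields expected by the Solr schema.
FIELDS = ["id", "name", "description"]


def rows_to_csv(rows):
    """Serialize rows (dicts keyed by FIELDS) as CSV text.

    Fields containing commas, quotes, or newlines are quoted so that a
    CSV reader downstream splits each record into the same columns.
    Missing or None values are written as empty fields.
    """
    buf = io.StringIO()
    writer = csv.writer(buf, quoting=csv.QUOTE_MINIMAL, lineterminator="\n")
    for row in rows:
        writer.writerow(["" if row.get(f) is None else row[f] for f in FIELDS])
    return buf.getvalue()


def select_url(base_url, collection, key_field, key_value):
    """Build a Solr select URL that queries one document by primary key."""
    # Quote the value as a phrase and escape embedded backslashes and quotes.
    escaped = str(key_value).replace("\\", "\\\\").replace('"', '\\"')
    params = urlencode({"q": f'{key_field}:"{escaped}"', "wt": "json"})
    return f"{base_url}/{collection}/select?{params}"


if __name__ == "__main__":
    # Made-up sample rows standing in for a Hive query result.
    sample = [
        {"id": 1, "name": "widget", "description": 'small, "blue" part'},
        {"id": 2, "name": "gadget", "description": None},
    ]
    print(rows_to_csv(sample), end="")
    # Hypothetical host and collection name.
    print(select_url("http://localhost:8983/solr", "hive_collection", "id", 1))
```

The same query string (`id:"1"` in this example) can be pasted into the query field of the Solr admin interface to confirm that a given record was indexed.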