This document summarizes an implementation that uses Apache Lucene and Solr to build a distributed search engine over large datasets. The implementation uses Hadoop HDFS for distributed storage and integrates Solr instances to index and search more than 500 columns of data from government databases, totaling over 200,000 documents and 15 million data items. A Solr client web application was also developed to demonstrate distributed search over the indexed data.
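
To make the idea of a distributed query concrete, the sketch below builds a Solr select URL that fans one query out to several Solr instances through Solr's `shards` request parameter. The summary does not say which distribution mechanism the implementation used, so this is only one plausible approach. The host names, the `records` core name, and the `agency` field are hypothetical placeholders, not details from the original work.

```python
from urllib.parse import urlencode


def build_distributed_query(base_url, shards, query, rows=10):
    """Return a Solr /select URL that asks one node to query all listed shards."""
    params = {
        "q": query,
        "rows": rows,
        "wt": "json",
        # Solr's `shards` parameter takes a comma-separated list of
        # host:port/path entries; the receiving node merges the results.
        "shards": ",".join(shards),
    }
    return f"{base_url.rstrip('/')}/select?{urlencode(params)}"


# Hypothetical Solr instances and core name, for illustration only.
SHARDS = [
    "solr1.example.org:8983/solr/records",
    "solr2.example.org:8983/solr/records",
]

url = build_distributed_query(
    "http://solr1.example.org:8983/solr/records",
    SHARDS,
    "agency:treasury",  # hypothetical field and value
)
print(url)
```

A client web application could send such a URL to any one Solr instance, which then gathers and merges the results from every shard listed. The function only constructs the URL and makes no network call, so it can be tried without a running Solr server.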