Scaling principles and a case study for building a Solr index of material from a net archive (harvested web resources). The focus is on multiple terabytes of index data, billions of documents, response times under 2 seconds for faceted search, and few concurrent users, all achieved on a small budget. The slides are annotated (turn on "Notes").