This document describes Frontera, an open source framework for large scale web crawling. It discusses the architecture and components of Frontera, which includes Scrapy for network operations, Apache Kafka as a data bus, and Apache HBase for storage. It also outlines some challenges faced during the development of Frontera and solutions implemented, such as handling large websites that flood the queue, optimizing traffic to HBase, and prioritizing URLs. The document provides details on using Frontera to crawl the Spanish (.es) web domain and presents results and future plans.