This document is the dissertation of Gianmarco De Francisci Morales submitted for the PhD program in Computer Science and Engineering at IMT Institute for Advanced Studies in Lucca, Italy. The dissertation addresses challenges in managing and analyzing large datasets, or "big data", and presents algorithms for tasks like document filtering, graph computation and real-time news recommendation. It was approved by the program coordinator and supervisor, and reviewed by two external reviewers. The dissertation contains six chapters, including introductions to the data deluge problem and data-intensive computing, descriptions of three contributed algorithms (SSJ, SCM, and T.Rex), related work sections, and experimental evaluations.