This document discusses network biology and large-scale data and text mining. It describes how Lars Jensen combines computational predictions from over 1100 genomes with experimental data and information extracted from text to build protein-protein association networks in STRING. These networks integrate known and predicted protein-protein interactions, together with functional associations, and are used to study biological systems at the network level.
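
To make the idea of integrating several evidence types into one association network concrete, the following is a minimal sketch. It assumes each evidence channel assigns a probability-like score between 0 and 1 to a protein pair, and it merges channels with a noisy-OR rule (the probability that at least one independent source is correct). The channel names, protein identifiers, scores, the 0.4 cutoff, and the functions `combine_scores` and `build_network` are all hypothetical and chosen for illustration; this is not presented as the scoring scheme actually used in STRING.

```python
from collections import defaultdict


def combine_scores(scores):
    """Noisy-OR combination: 1 minus the probability that every source is wrong.

    Assumes the evidence sources are independent (an illustrative simplification).
    """
    p_all_wrong = 1.0
    for s in scores:
        p_all_wrong *= (1.0 - s)
    return 1.0 - p_all_wrong


def build_network(evidence, threshold=0.4):
    """Build an undirected association network from per-channel evidence.

    evidence: iterable of (protein_a, protein_b, channel, score) tuples.
    Returns a dict mapping a sorted protein pair to its combined score,
    keeping only pairs whose combined score reaches the threshold.
    """
    per_pair = defaultdict(list)
    for a, b, _channel, score in evidence:
        pair = tuple(sorted((a, b)))  # treat (a, b) and (b, a) as the same edge
        per_pair[pair].append(score)

    network = {}
    for pair, scores in per_pair.items():
        combined = combine_scores(scores)
        if combined >= threshold:
            network[pair] = combined
    return network


# Hypothetical evidence from three channels (values are made up).
evidence = [
    ("P1", "P2", "genomic_context", 0.5),
    ("P2", "P1", "experiments", 0.5),
    ("P2", "P3", "text_mining", 0.2),
]
network = build_network(evidence)
```

In this toy input, the P1/P2 pair is supported by two channels and is kept, while the P2/P3 pair has only weak text-mining support and falls below the cutoff. The point of the design is that several moderate pieces of evidence can jointly support an association that no single channel would establish on its own.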