This document discusses large-scale integration of biological data and text mining. It describes three main parts: association networks that connect entities based on "guilt by association", protein interaction networks built using data from STRING and 2000+ genomes, and using genomic context like gene fusion, gene neighborhood, and phylogenetic profiles. It then provides examples of using STRING to query protein networks and discusses challenges of text mining like the exponential growth of literature and limitations of current natural language processing. Finally, it describes the Jensen Lab's approach of integrating curated knowledge, experimental data, predictions, and data from databases like STRING, STITCH, PubChem, COMPARTMENTS, Gene Ontology, UniProtKB, and disease databases into a common framework with