This document summarizes a framework for automatically extracting human protein-protein interaction data from biomedical literature. It describes benchmarking interaction datasets based on shared functional annotations and known physical interactions. It also outlines a method using a conditional random field tagger to identify protein names in text and two approaches for extracting interactions: co-citation analysis and learning interaction extractors from annotated sentences. Evaluation shows the extracted interactions have accuracy comparable to manually curated databases.