This document summarizes Suneel Marthi's presentation on large-scale natural language processing (NLP), the use of computers to process and analyze large amounts of human language data. It gives an overview of two open source projects: Apache OpenNLP, a machine-learning toolkit for natural language processing, and Apache Flink, a distributed stream processing framework. It also discusses how models for tasks such as part-of-speech tagging and named entity recognition can be trained for different languages and integrated into data pipelines for large-scale processing with these frameworks.