This document introduces ELMo, a new type of deep contextualized word representation designed to capture complex word usage characteristics and their variation across contexts. It discusses the method of learning embeddings through bidirectional language models and shows that ELMo enhances various NLP tasks by providing task-specific representations. The authors evaluate ELMo's effectiveness in improving performance in tasks like textual entailment, sentiment analysis, and more.