The document discusses building and deploying scalable NLP model services using Kubernetes and Seldon Core. It provides an overview of Kubernetes and Seldon Core, and how to build and deploy a single Seldon Core model using the Python wrapper. It also discusses testing the model endpoint and creating more complex inference graphs by combining multiple models in series or parallel.