SlideShare a Scribd company logo
1 of 17
© 2023 GitLab Inc.
Ved Prakash
Staff Data Engineer GitLab
Intro to GKE setup of
Airflow with helm and
Terraform
© 2023 GitLab Inc.
GitLab company
Our mission: Everyone Can Contribute
© 2023 GitLab Inc.
GitLab CREDIT values
Collaboration Results Iteration Transparency
Efficiency
inclusion & belonging
Diversity,
© 2023 GitLab Inc.
Plan
Create
Verify
Package
Secure
Deploy
Monitor
Govern
What GitLab is doing?
The DevSecOps
Platform
delivered as a
single application
to help you
iterate faster and
innovate together
© 2023 GitLab Inc.
Introduction to Google Kubernetes
Engine(GKE),Apache Airflow,Helm
and Terraform.
How all Components fit together.
Agenda
Best Practices and Considerations
Wins of Data platform Team.
Installation within Gitlab Data
platform.
© 2023 GitLab Inc.
Introduction to Google Kubernetes Engine (GKE)
Google Kubernetes Engine (GKE) is a
managed Kubernetes service that
simplifies containerized application
deployment, scaling, and
management on Google Cloud,
offering a robust and efficient
platform for container orchestration.
What is GKE? Key Features
● Managed Kubernetes
● Automatic Scaling
● Security and Compliance
● Integrations
Use Cases:
● Microservices Deployment
● Continuous Integration/Continuous Deployment (CI/CD)
● Scalable Applications
© 2023 GitLab Inc.
Overview of Apache Airflow
Apache Airflow is an open-source
platform designed to
programmatically author, schedule,
and monitor workflows.
What is Apache Airflow? Key Features
● Directed Acyclic Graphs (DAGs)
● Extensibility
● Dynamic Workflow Execution
● Rich UI and Logging
● Scalability
Use Cases:
● Data pipeline orchestration
● ETL (Extract, Transform, Load) processes
● Workflow automation in diverse industries
© 2023 GitLab Inc.
Terraform: Infrastructure as Code for GKE
Terraform is an open-source
Infrastructure as Code (IaC) tool that
enables users to define and provision
infrastructure using a declarative
configuration language.
What is Terraform? Key Features
● Infrastructure as Code (IaC)
● Multi-Cloud Provisioning
● Declarative Configuration
Language
● Plan and Apply Workflow
● State Management
Use Cases:
● Provisioning Servers
● Network Infrastructure
● Application Deployments
© 2023 GitLab Inc.
Helm: A Package Manager for Kubernetes
Helm is a package manager for
Kubernetes applications, simplifying
the deployment and management of
containerized applications.
What is Helm? Key Features
● Standardized Packaging
● Simplified Configuration
● Version Control and Rollbacks
● Dependency Management
Use Case
● Microservices Deployment
© 2023 GitLab Inc.
Integration of
Airflow with
GKE using Helm
and Terraform
Why Use GKE for Airflow?
Why Helm Charts for Airflow?
Terraform Modules for GKE
© 2023 GitLab Inc.
Infrastructure Provisioning with Terraform
Connecting the
Dots: GKE + Helm +
Terraform + Airflow
Terraform Sets the Foundation
Kubernetes Cluster Orchestration with GKE
GKE Ensures Seamless Operations
Package Management with Helm
Helm Charts Define Airflow Configurations:
Helm Charts Deployed on GKE
Smooth Deployment on GKE
Integrated Workflows
Orchestration with Airflow
© 2023 GitLab Inc.
GKE cluster provisioned through
Terraform
Installation
within Gitlab
Data Platform
Team
● Two Namespace (Prod and testing)
● Seven Nodepools (Different Machine type for
each type load)
● Remote state file for any changes required for
GKE cluster.
● Gitlab CI/CD pipeline to validate the Changes
done to terraform script.(This ensure the
changes will not break the terraform apply)
Airflow Installed using helm chart
● Airflow Version 2.5.3 using Helm Chart for
Apache Airflow which will bootstrap an
Airflow deployment on a Kubernetes cluster
using the Helm package manager.
● Overridden with Cloud SQL Postgres
instance.
● Git sync with the analytics repository.
● Modified Web server secret key and fernet
key
© 2023 GitLab Inc.
● 88 Active Airflow Dags
● 1200+ Task run every 24 hours
● Empowering Workflows:Task Dynamism with Airflow
● K8s Pods operator to schedule dynamic workload
● Cost effective solution
● On-Demand Node Provisioning with Terraform
● Minimal downtime , typically under 45 min, in the event of a
disaster recovery scenario.
How this benefits the data platform team
managing data pipeline?
© 2023 GitLab Inc.
Security Best Practices
● Private Cluster Configuration
● VPC Peering
● Identity and Access Management (IAM) Controls
● Node Pool Isolation
● Securing Secrets
Best Practices
and
Considerations Scalability Considerations
● Horizontal Pod Autoscaling (HPA)
● Database Scaling
● Task Parallelism
● Resource Requests and Limits
● Persistent Storage Considerations
● GKE Node Pools
Monitoring and Logging Strategies
● Leverage K8s-native monitoring solutions
Prometheus and Grafana
● Alerting and Notification Channels.
● Airflow Metrics.
© 2023 GitLab Inc.
About me
Find me, ping me, ask me
© 2023 GitLab Inc.
Additional Resources
● Gitlab Handbook for information about nodepool and
namespace.
● Airflow Infrastructure
● Gitlab Data Analytics or our Dag bags
© 2023 GitLab Inc.
Thank you

More Related Content

Similar to GKE, Helm, Terraform and Airflow Integration

Continuous Delivery with CloudBees Core
Continuous Delivery with CloudBees CoreContinuous Delivery with CloudBees Core
Continuous Delivery with CloudBees CoreBhavani Rao
 
Containers and Kubernetes
Containers and KubernetesContainers and Kubernetes
Containers and KubernetesAltoros
 
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdfCNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdfLibbySchulze
 
Leveraging HybridMultiCloud for Devops and Automation Platform
Leveraging HybridMultiCloud for Devops and Automation PlatformLeveraging HybridMultiCloud for Devops and Automation Platform
Leveraging HybridMultiCloud for Devops and Automation PlatformDevOps Indonesia
 
Achieve Data & Operational Sovereignty: Managing Hybrid & Edge EKS Deployment...
Achieve Data & Operational Sovereignty: Managing Hybrid & Edge EKS Deployment...Achieve Data & Operational Sovereignty: Managing Hybrid & Edge EKS Deployment...
Achieve Data & Operational Sovereignty: Managing Hybrid & Edge EKS Deployment...Weaveworks
 
Kubernetes best practices with GKE
Kubernetes best practices with GKEKubernetes best practices with GKE
Kubernetes best practices with GKEGDG Cloud Bengaluru
 
Kubernetes Cheatsheet
Kubernetes CheatsheetKubernetes Cheatsheet
Kubernetes CheatsheetAlex Hisaka
 
How Kubernetes helps Devops
How Kubernetes helps DevopsHow Kubernetes helps Devops
How Kubernetes helps DevopsSreenivas Makam
 
Kubernetes and Nested Containers: Enhanced 3 Ps (Performance, Price and Provi...
Kubernetes and Nested Containers: Enhanced 3 Ps (Performance, Price and Provi...Kubernetes and Nested Containers: Enhanced 3 Ps (Performance, Price and Provi...
Kubernetes and Nested Containers: Enhanced 3 Ps (Performance, Price and Provi...Jelastic Multi-Cloud PaaS
 
Comparing three data ingestion approaches where Apache Kafka integrates with ...
Comparing three data ingestion approaches where Apache Kafka integrates with ...Comparing three data ingestion approaches where Apache Kafka integrates with ...
Comparing three data ingestion approaches where Apache Kafka integrates with ...HostedbyConfluent
 
Fabio Ferrari | particles.io | Presentation
Fabio Ferrari | particles.io | PresentationFabio Ferrari | particles.io | Presentation
Fabio Ferrari | particles.io | PresentationFabio Ferrari
 
GDG Cloud Southlake #8 Steve Cravens: Infrastructure as-Code (IaC) in 2022: ...
GDG Cloud Southlake #8  Steve Cravens: Infrastructure as-Code (IaC) in 2022: ...GDG Cloud Southlake #8  Steve Cravens: Infrastructure as-Code (IaC) in 2022: ...
GDG Cloud Southlake #8 Steve Cravens: Infrastructure as-Code (IaC) in 2022: ...James Anderson
 
Xpdays: Kubernetes CI-CD Frameworks Case Study
Xpdays: Kubernetes CI-CD Frameworks Case StudyXpdays: Kubernetes CI-CD Frameworks Case Study
Xpdays: Kubernetes CI-CD Frameworks Case StudyDenys Vasyliev
 
Overcoming Regulatory & Compliance Hurdles with Hybrid Cloud EKS and Weave Gi...
Overcoming Regulatory & Compliance Hurdles with Hybrid Cloud EKS and Weave Gi...Overcoming Regulatory & Compliance Hurdles with Hybrid Cloud EKS and Weave Gi...
Overcoming Regulatory & Compliance Hurdles with Hybrid Cloud EKS and Weave Gi...Weaveworks
 
Deploying WSO2 API Manager in Production-Grade Kubernetes
Deploying WSO2 API Manager in Production-Grade KubernetesDeploying WSO2 API Manager in Production-Grade Kubernetes
Deploying WSO2 API Manager in Production-Grade KubernetesWSO2
 
Ultimate Guide to Microservice Architecture on Kubernetes
Ultimate Guide to Microservice Architecture on KubernetesUltimate Guide to Microservice Architecture on Kubernetes
Ultimate Guide to Microservice Architecture on Kuberneteskloia
 
How To Overcome Day 2 Kubernetes Challenges.pdf
How To Overcome Day 2 Kubernetes Challenges.pdfHow To Overcome Day 2 Kubernetes Challenges.pdf
How To Overcome Day 2 Kubernetes Challenges.pdfArif Khan
 
kreuzwerker AWS Modernizing Legacy Operations with Containerized Solutions 20...
kreuzwerker AWS Modernizing Legacy Operations with Containerized Solutions 20...kreuzwerker AWS Modernizing Legacy Operations with Containerized Solutions 20...
kreuzwerker AWS Modernizing Legacy Operations with Containerized Solutions 20...kreuzwerker GmbH
 
Migrating from Self-Managed Kubernetes on EC2 to a GitOps Enabled EKS
Migrating from Self-Managed Kubernetes on EC2 to a GitOps Enabled EKSMigrating from Self-Managed Kubernetes on EC2 to a GitOps Enabled EKS
Migrating from Self-Managed Kubernetes on EC2 to a GitOps Enabled EKSWeaveworks
 

Similar to GKE, Helm, Terraform and Airflow Integration (20)

Continuous Delivery with CloudBees Core
Continuous Delivery with CloudBees CoreContinuous Delivery with CloudBees Core
Continuous Delivery with CloudBees Core
 
Containers and Kubernetes
Containers and KubernetesContainers and Kubernetes
Containers and Kubernetes
 
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdfCNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
 
Leveraging HybridMultiCloud for Devops and Automation Platform
Leveraging HybridMultiCloud for Devops and Automation PlatformLeveraging HybridMultiCloud for Devops and Automation Platform
Leveraging HybridMultiCloud for Devops and Automation Platform
 
Achieve Data & Operational Sovereignty: Managing Hybrid & Edge EKS Deployment...
Achieve Data & Operational Sovereignty: Managing Hybrid & Edge EKS Deployment...Achieve Data & Operational Sovereignty: Managing Hybrid & Edge EKS Deployment...
Achieve Data & Operational Sovereignty: Managing Hybrid & Edge EKS Deployment...
 
Kubernetes best practices with GKE
Kubernetes best practices with GKEKubernetes best practices with GKE
Kubernetes best practices with GKE
 
Kubernetes Cheatsheet
Kubernetes CheatsheetKubernetes Cheatsheet
Kubernetes Cheatsheet
 
How Kubernetes helps Devops
How Kubernetes helps DevopsHow Kubernetes helps Devops
How Kubernetes helps Devops
 
Kubernetes and Nested Containers: Enhanced 3 Ps (Performance, Price and Provi...
Kubernetes and Nested Containers: Enhanced 3 Ps (Performance, Price and Provi...Kubernetes and Nested Containers: Enhanced 3 Ps (Performance, Price and Provi...
Kubernetes and Nested Containers: Enhanced 3 Ps (Performance, Price and Provi...
 
Gdsc muk - innocent
Gdsc   muk - innocentGdsc   muk - innocent
Gdsc muk - innocent
 
Comparing three data ingestion approaches where Apache Kafka integrates with ...
Comparing three data ingestion approaches where Apache Kafka integrates with ...Comparing three data ingestion approaches where Apache Kafka integrates with ...
Comparing three data ingestion approaches where Apache Kafka integrates with ...
 
Fabio Ferrari | particles.io | Presentation
Fabio Ferrari | particles.io | PresentationFabio Ferrari | particles.io | Presentation
Fabio Ferrari | particles.io | Presentation
 
GDG Cloud Southlake #8 Steve Cravens: Infrastructure as-Code (IaC) in 2022: ...
GDG Cloud Southlake #8  Steve Cravens: Infrastructure as-Code (IaC) in 2022: ...GDG Cloud Southlake #8  Steve Cravens: Infrastructure as-Code (IaC) in 2022: ...
GDG Cloud Southlake #8 Steve Cravens: Infrastructure as-Code (IaC) in 2022: ...
 
Xpdays: Kubernetes CI-CD Frameworks Case Study
Xpdays: Kubernetes CI-CD Frameworks Case StudyXpdays: Kubernetes CI-CD Frameworks Case Study
Xpdays: Kubernetes CI-CD Frameworks Case Study
 
Overcoming Regulatory & Compliance Hurdles with Hybrid Cloud EKS and Weave Gi...
Overcoming Regulatory & Compliance Hurdles with Hybrid Cloud EKS and Weave Gi...Overcoming Regulatory & Compliance Hurdles with Hybrid Cloud EKS and Weave Gi...
Overcoming Regulatory & Compliance Hurdles with Hybrid Cloud EKS and Weave Gi...
 
Deploying WSO2 API Manager in Production-Grade Kubernetes
Deploying WSO2 API Manager in Production-Grade KubernetesDeploying WSO2 API Manager in Production-Grade Kubernetes
Deploying WSO2 API Manager in Production-Grade Kubernetes
 
Ultimate Guide to Microservice Architecture on Kubernetes
Ultimate Guide to Microservice Architecture on KubernetesUltimate Guide to Microservice Architecture on Kubernetes
Ultimate Guide to Microservice Architecture on Kubernetes
 
How To Overcome Day 2 Kubernetes Challenges.pdf
How To Overcome Day 2 Kubernetes Challenges.pdfHow To Overcome Day 2 Kubernetes Challenges.pdf
How To Overcome Day 2 Kubernetes Challenges.pdf
 
kreuzwerker AWS Modernizing Legacy Operations with Containerized Solutions 20...
kreuzwerker AWS Modernizing Legacy Operations with Containerized Solutions 20...kreuzwerker AWS Modernizing Legacy Operations with Containerized Solutions 20...
kreuzwerker AWS Modernizing Legacy Operations with Containerized Solutions 20...
 
Migrating from Self-Managed Kubernetes on EC2 to a GitOps Enabled EKS
Migrating from Self-Managed Kubernetes on EC2 to a GitOps Enabled EKSMigrating from Self-Managed Kubernetes on EC2 to a GitOps Enabled EKS
Migrating from Self-Managed Kubernetes on EC2 to a GitOps Enabled EKS
 

More from DataScienceConferenc1

[DSC Europe 23] Luciano Catani - AI in Diplomacy.PDF
[DSC Europe 23] Luciano Catani - AI in Diplomacy.PDF[DSC Europe 23] Luciano Catani - AI in Diplomacy.PDF
[DSC Europe 23] Luciano Catani - AI in Diplomacy.PDFDataScienceConferenc1
 
[DSC Europe 23] Rania Wazir - Mathematician jokes, cute cat photos, offensiv...
[DSC Europe 23] Rania Wazir -  Mathematician jokes, cute cat photos, offensiv...[DSC Europe 23] Rania Wazir -  Mathematician jokes, cute cat photos, offensiv...
[DSC Europe 23] Rania Wazir - Mathematician jokes, cute cat photos, offensiv...DataScienceConferenc1
 
[DSC Europe 23] Irena Cerovic - AI in International Development.pdf
[DSC Europe 23] Irena Cerovic - AI in International Development.pdf[DSC Europe 23] Irena Cerovic - AI in International Development.pdf
[DSC Europe 23] Irena Cerovic - AI in International Development.pdfDataScienceConferenc1
 
[DSC Europe 23] Ilija Duni - How Foursquare Builds Meaningful Bridges Between...
[DSC Europe 23] Ilija Duni - How Foursquare Builds Meaningful Bridges Between...[DSC Europe 23] Ilija Duni - How Foursquare Builds Meaningful Bridges Between...
[DSC Europe 23] Ilija Duni - How Foursquare Builds Meaningful Bridges Between...DataScienceConferenc1
 
[DSC Europe 23] Branka Panic - Peace in the age of artificial intelligence.pptx
[DSC Europe 23] Branka Panic - Peace in the age of artificial intelligence.pptx[DSC Europe 23] Branka Panic - Peace in the age of artificial intelligence.pptx
[DSC Europe 23] Branka Panic - Peace in the age of artificial intelligence.pptxDataScienceConferenc1
 
[DSC Europe 23][DigiHealth] Goran Dumic - Data-Driven Approach In Treatments
[DSC Europe 23][DigiHealth]  Goran Dumic -  Data-Driven Approach In Treatments[DSC Europe 23][DigiHealth]  Goran Dumic -  Data-Driven Approach In Treatments
[DSC Europe 23][DigiHealth] Goran Dumic - Data-Driven Approach In TreatmentsDataScienceConferenc1
 
[DSC Europe 23][DigiHealth] Milos Todorovic - Bridging the Gap-Innovating Ag...
[DSC Europe 23][DigiHealth]  Milos Todorovic - Bridging the Gap-Innovating Ag...[DSC Europe 23][DigiHealth]  Milos Todorovic - Bridging the Gap-Innovating Ag...
[DSC Europe 23][DigiHealth] Milos Todorovic - Bridging the Gap-Innovating Ag...DataScienceConferenc1
 
[DSC Europe 23][DigiHealth] Urosh VIlimanovich Clinical Data Management and C...
[DSC Europe 23][DigiHealth] Urosh VIlimanovich Clinical Data Management and C...[DSC Europe 23][DigiHealth] Urosh VIlimanovich Clinical Data Management and C...
[DSC Europe 23][DigiHealth] Urosh VIlimanovich Clinical Data Management and C...DataScienceConferenc1
 
[DSC Europe 23][DigiHealth] Vladimir Brusic - SMART HEALTH HOME: Technology,...
[DSC Europe 23][DigiHealth]  Vladimir Brusic - SMART HEALTH HOME: Technology,...[DSC Europe 23][DigiHealth]  Vladimir Brusic - SMART HEALTH HOME: Technology,...
[DSC Europe 23][DigiHealth] Vladimir Brusic - SMART HEALTH HOME: Technology,...DataScienceConferenc1
 
[DSC Europe 23][DigiHealth] Dimitar Penkov Grid Search Optimization of Novel...
[DSC Europe 23][DigiHealth]  Dimitar Penkov Grid Search Optimization of Novel...[DSC Europe 23][DigiHealth]  Dimitar Penkov Grid Search Optimization of Novel...
[DSC Europe 23][DigiHealth] Dimitar Penkov Grid Search Optimization of Novel...DataScienceConferenc1
 
[DSC Europe 23][DigiHealth] Tomislav Krizan - AIMED
[DSC Europe 23][DigiHealth] Tomislav Krizan - AIMED[DSC Europe 23][DigiHealth] Tomislav Krizan - AIMED
[DSC Europe 23][DigiHealth] Tomislav Krizan - AIMEDDataScienceConferenc1
 
[DSC Europe 23][DigiHealth] Katarina Vucicevic - Navigating theKinetics of Dr...
[DSC Europe 23][DigiHealth] Katarina Vucicevic - Navigating theKinetics of Dr...[DSC Europe 23][DigiHealth] Katarina Vucicevic - Navigating theKinetics of Dr...
[DSC Europe 23][DigiHealth] Katarina Vucicevic - Navigating theKinetics of Dr...DataScienceConferenc1
 
[DSC Europe 23][DigiHealth] Anja Baresic 0- Croatian digital Healthcare ecosy...
[DSC Europe 23][DigiHealth] Anja Baresic 0- Croatian digital Healthcare ecosy...[DSC Europe 23][DigiHealth] Anja Baresic 0- Croatian digital Healthcare ecosy...
[DSC Europe 23][DigiHealth] Anja Baresic 0- Croatian digital Healthcare ecosy...DataScienceConferenc1
 
[DSC Europe 23][AI:CSI] Dragan Pleskonjic - AI Impact on Cybersecurity and P...
[DSC Europe 23][AI:CSI]  Dragan Pleskonjic - AI Impact on Cybersecurity and P...[DSC Europe 23][AI:CSI]  Dragan Pleskonjic - AI Impact on Cybersecurity and P...
[DSC Europe 23][AI:CSI] Dragan Pleskonjic - AI Impact on Cybersecurity and P...DataScienceConferenc1
 
[DSC Europe 23][AI:CSI] Uros Arsenijevic Unlocking Cybersecurity with Seif
[DSC Europe 23][AI:CSI] Uros Arsenijevic Unlocking Cybersecurity with Seif[DSC Europe 23][AI:CSI] Uros Arsenijevic Unlocking Cybersecurity with Seif
[DSC Europe 23][AI:CSI] Uros Arsenijevic Unlocking Cybersecurity with SeifDataScienceConferenc1
 
[DSC Europe 23][AI:CSI] Goran Gvozden Improving Cybersecurity Posture with an...
[DSC Europe 23][AI:CSI] Goran Gvozden Improving Cybersecurity Posture with an...[DSC Europe 23][AI:CSI] Goran Gvozden Improving Cybersecurity Posture with an...
[DSC Europe 23][AI:CSI] Goran Gvozden Improving Cybersecurity Posture with an...DataScienceConferenc1
 
[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...
[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...
[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...DataScienceConferenc1
 
[DSC Europe 23][DigiHealth] Muthu Ramachandran AI and Blockchain Framework fo...
[DSC Europe 23][DigiHealth] Muthu Ramachandran AI and Blockchain Framework fo...[DSC Europe 23][DigiHealth] Muthu Ramachandran AI and Blockchain Framework fo...
[DSC Europe 23][DigiHealth] Muthu Ramachandran AI and Blockchain Framework fo...DataScienceConferenc1
 
[DSC Europe 23][DigiHealth] Ligia Kornowska-How_may AI help you
[DSC Europe 23][DigiHealth] Ligia Kornowska-How_may AI help you[DSC Europe 23][DigiHealth] Ligia Kornowska-How_may AI help you
[DSC Europe 23][DigiHealth] Ligia Kornowska-How_may AI help youDataScienceConferenc1
 
[DSC Europe 23][DigiHealth] Ilya Zakharov - NETWORK NEUROSCIENCE WHERE THE BR...
[DSC Europe 23][DigiHealth] Ilya Zakharov - NETWORK NEUROSCIENCE WHERE THE BR...[DSC Europe 23][DigiHealth] Ilya Zakharov - NETWORK NEUROSCIENCE WHERE THE BR...
[DSC Europe 23][DigiHealth] Ilya Zakharov - NETWORK NEUROSCIENCE WHERE THE BR...DataScienceConferenc1
 

More from DataScienceConferenc1 (20)

[DSC Europe 23] Luciano Catani - AI in Diplomacy.PDF
[DSC Europe 23] Luciano Catani - AI in Diplomacy.PDF[DSC Europe 23] Luciano Catani - AI in Diplomacy.PDF
[DSC Europe 23] Luciano Catani - AI in Diplomacy.PDF
 
[DSC Europe 23] Rania Wazir - Mathematician jokes, cute cat photos, offensiv...
[DSC Europe 23] Rania Wazir -  Mathematician jokes, cute cat photos, offensiv...[DSC Europe 23] Rania Wazir -  Mathematician jokes, cute cat photos, offensiv...
[DSC Europe 23] Rania Wazir - Mathematician jokes, cute cat photos, offensiv...
 
[DSC Europe 23] Irena Cerovic - AI in International Development.pdf
[DSC Europe 23] Irena Cerovic - AI in International Development.pdf[DSC Europe 23] Irena Cerovic - AI in International Development.pdf
[DSC Europe 23] Irena Cerovic - AI in International Development.pdf
 
[DSC Europe 23] Ilija Duni - How Foursquare Builds Meaningful Bridges Between...
[DSC Europe 23] Ilija Duni - How Foursquare Builds Meaningful Bridges Between...[DSC Europe 23] Ilija Duni - How Foursquare Builds Meaningful Bridges Between...
[DSC Europe 23] Ilija Duni - How Foursquare Builds Meaningful Bridges Between...
 
[DSC Europe 23] Branka Panic - Peace in the age of artificial intelligence.pptx
[DSC Europe 23] Branka Panic - Peace in the age of artificial intelligence.pptx[DSC Europe 23] Branka Panic - Peace in the age of artificial intelligence.pptx
[DSC Europe 23] Branka Panic - Peace in the age of artificial intelligence.pptx
 
[DSC Europe 23][DigiHealth] Goran Dumic - Data-Driven Approach In Treatments
[DSC Europe 23][DigiHealth]  Goran Dumic -  Data-Driven Approach In Treatments[DSC Europe 23][DigiHealth]  Goran Dumic -  Data-Driven Approach In Treatments
[DSC Europe 23][DigiHealth] Goran Dumic - Data-Driven Approach In Treatments
 
[DSC Europe 23][DigiHealth] Milos Todorovic - Bridging the Gap-Innovating Ag...
[DSC Europe 23][DigiHealth]  Milos Todorovic - Bridging the Gap-Innovating Ag...[DSC Europe 23][DigiHealth]  Milos Todorovic - Bridging the Gap-Innovating Ag...
[DSC Europe 23][DigiHealth] Milos Todorovic - Bridging the Gap-Innovating Ag...
 
[DSC Europe 23][DigiHealth] Urosh VIlimanovich Clinical Data Management and C...
[DSC Europe 23][DigiHealth] Urosh VIlimanovich Clinical Data Management and C...[DSC Europe 23][DigiHealth] Urosh VIlimanovich Clinical Data Management and C...
[DSC Europe 23][DigiHealth] Urosh VIlimanovich Clinical Data Management and C...
 
[DSC Europe 23][DigiHealth] Vladimir Brusic - SMART HEALTH HOME: Technology,...
[DSC Europe 23][DigiHealth]  Vladimir Brusic - SMART HEALTH HOME: Technology,...[DSC Europe 23][DigiHealth]  Vladimir Brusic - SMART HEALTH HOME: Technology,...
[DSC Europe 23][DigiHealth] Vladimir Brusic - SMART HEALTH HOME: Technology,...
 
[DSC Europe 23][DigiHealth] Dimitar Penkov Grid Search Optimization of Novel...
[DSC Europe 23][DigiHealth]  Dimitar Penkov Grid Search Optimization of Novel...[DSC Europe 23][DigiHealth]  Dimitar Penkov Grid Search Optimization of Novel...
[DSC Europe 23][DigiHealth] Dimitar Penkov Grid Search Optimization of Novel...
 
[DSC Europe 23][DigiHealth] Tomislav Krizan - AIMED
[DSC Europe 23][DigiHealth] Tomislav Krizan - AIMED[DSC Europe 23][DigiHealth] Tomislav Krizan - AIMED
[DSC Europe 23][DigiHealth] Tomislav Krizan - AIMED
 
[DSC Europe 23][DigiHealth] Katarina Vucicevic - Navigating theKinetics of Dr...
[DSC Europe 23][DigiHealth] Katarina Vucicevic - Navigating theKinetics of Dr...[DSC Europe 23][DigiHealth] Katarina Vucicevic - Navigating theKinetics of Dr...
[DSC Europe 23][DigiHealth] Katarina Vucicevic - Navigating theKinetics of Dr...
 
[DSC Europe 23][DigiHealth] Anja Baresic 0- Croatian digital Healthcare ecosy...
[DSC Europe 23][DigiHealth] Anja Baresic 0- Croatian digital Healthcare ecosy...[DSC Europe 23][DigiHealth] Anja Baresic 0- Croatian digital Healthcare ecosy...
[DSC Europe 23][DigiHealth] Anja Baresic 0- Croatian digital Healthcare ecosy...
 
[DSC Europe 23][AI:CSI] Dragan Pleskonjic - AI Impact on Cybersecurity and P...
[DSC Europe 23][AI:CSI]  Dragan Pleskonjic - AI Impact on Cybersecurity and P...[DSC Europe 23][AI:CSI]  Dragan Pleskonjic - AI Impact on Cybersecurity and P...
[DSC Europe 23][AI:CSI] Dragan Pleskonjic - AI Impact on Cybersecurity and P...
 
[DSC Europe 23][AI:CSI] Uros Arsenijevic Unlocking Cybersecurity with Seif
[DSC Europe 23][AI:CSI] Uros Arsenijevic Unlocking Cybersecurity with Seif[DSC Europe 23][AI:CSI] Uros Arsenijevic Unlocking Cybersecurity with Seif
[DSC Europe 23][AI:CSI] Uros Arsenijevic Unlocking Cybersecurity with Seif
 
[DSC Europe 23][AI:CSI] Goran Gvozden Improving Cybersecurity Posture with an...
[DSC Europe 23][AI:CSI] Goran Gvozden Improving Cybersecurity Posture with an...[DSC Europe 23][AI:CSI] Goran Gvozden Improving Cybersecurity Posture with an...
[DSC Europe 23][AI:CSI] Goran Gvozden Improving Cybersecurity Posture with an...
 
[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...
[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...
[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...
 
[DSC Europe 23][DigiHealth] Muthu Ramachandran AI and Blockchain Framework fo...
[DSC Europe 23][DigiHealth] Muthu Ramachandran AI and Blockchain Framework fo...[DSC Europe 23][DigiHealth] Muthu Ramachandran AI and Blockchain Framework fo...
[DSC Europe 23][DigiHealth] Muthu Ramachandran AI and Blockchain Framework fo...
 
[DSC Europe 23][DigiHealth] Ligia Kornowska-How_may AI help you
[DSC Europe 23][DigiHealth] Ligia Kornowska-How_may AI help you[DSC Europe 23][DigiHealth] Ligia Kornowska-How_may AI help you
[DSC Europe 23][DigiHealth] Ligia Kornowska-How_may AI help you
 
[DSC Europe 23][DigiHealth] Ilya Zakharov - NETWORK NEUROSCIENCE WHERE THE BR...
[DSC Europe 23][DigiHealth] Ilya Zakharov - NETWORK NEUROSCIENCE WHERE THE BR...[DSC Europe 23][DigiHealth] Ilya Zakharov - NETWORK NEUROSCIENCE WHERE THE BR...
[DSC Europe 23][DigiHealth] Ilya Zakharov - NETWORK NEUROSCIENCE WHERE THE BR...
 

Recently uploaded

办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一F La
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degreeyuu sss
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 

Recently uploaded (20)

办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 

GKE, Helm, Terraform and Airflow Integration

  • 1. © 2023 GitLab Inc. Ved Prakash Staff Data Engineer GitLab Intro to GKE setup of Airflow with helm and Terraform
  • 2. © 2023 GitLab Inc. GitLab company Our mission: Everyone Can Contribute
  • 3. © 2023 GitLab Inc. GitLab CREDIT values Collaboration Results Iteration Transparency Efficiency inclusion & belonging Diversity,
  • 4. © 2023 GitLab Inc. Plan Create Verify Package Secure Deploy Monitor Govern What GitLab is doing? The DevSecOps Platform delivered as a single application to help you iterate faster and innovate together
  • 5. © 2023 GitLab Inc. Introduction to Google Kubernetes Engine(GKE),Apache Airflow,Helm and Terraform. How all Components fit together. Agenda Best Practices and Considerations Wins of Data platform Team. Installation within Gitlab Data platform.
  • 6. © 2023 GitLab Inc. Introduction to Google Kubernetes Engine (GKE) Google Kubernetes Engine (GKE) is a managed Kubernetes service that simplifies containerized application deployment, scaling, and management on Google Cloud, offering a robust and efficient platform for container orchestration. What is GKE? Key Features ● Managed Kubernetes ● Automatic Scaling ● Security and Compliance ● Integrations Use Cases: ● Microservices Deployment ● Continuous Integration/Continuous Deployment (CI/CD) ● Scalable Applications
  • 7. © 2023 GitLab Inc. Overview of Apache Airflow Apache Airflow is an open-source platform designed to programmatically author, schedule, and monitor workflows. What is Apache Airflow? Key Features ● Directed Acyclic Graphs (DAGs) ● Extensibility ● Dynamic Workflow Execution ● Rich UI and Logging ● Scalability Use Cases: ● Data pipeline orchestration ● ETL (Extract, Transform, Load) processes ● Workflow automation in diverse industries
  • 8. © 2023 GitLab Inc. Terraform: Infrastructure as Code for GKE Terraform is an open-source Infrastructure as Code (IaC) tool that enables users to define and provision infrastructure using a declarative configuration language. What is Terraform? Key Features ● Infrastructure as Code (IaC) ● Multi-Cloud Provisioning ● Declarative Configuration Language ● Plan and Apply Workflow ● State Management Use Cases: ● Provisioning Servers ● Network Infrastructure ● Application Deployments
  • 9. © 2023 GitLab Inc. Helm: A Package Manager for Kubernetes Helm is a package manager for Kubernetes applications, simplifying the deployment and management of containerized applications. What is Helm? Key Features ● Standardized Packaging ● Simplified Configuration ● Version Control and Rollbacks ● Dependency Management Use Case ● Microservices Deployment
  • 10. © 2023 GitLab Inc. Integration of Airflow with GKE using Helm and Terraform Why Use GKE for Airflow? Why Helm Charts for Airflow? Terraform Modules for GKE
  • 11. © 2023 GitLab Inc. Infrastructure Provisioning with Terraform Connecting the Dots: GKE + Helm + Terraform + Airflow Terraform Sets the Foundation Kubernetes Cluster Orchestration with GKE GKE Ensures Seamless Operations Package Management with Helm Helm Charts Define Airflow Configurations: Helm Charts Deployed on GKE Smooth Deployment on GKE Integrated Workflows Orchestration with Airflow
  • 12. © 2023 GitLab Inc. GKE cluster provisioned through Terraform Installation within Gitlab Data Platform Team ● Two Namespace (Prod and testing) ● Seven Nodepools (Different Machine type for each type load) ● Remote state file for any changes required for GKE cluster. ● Gitlab CI/CD pipeline to validate the Changes done to terraform script.(This ensure the changes will not break the terraform apply) Airflow Installed using helm chart ● Airflow Version 2.5.3 using Helm Chart for Apache Airflow which will bootstrap an Airflow deployment on a Kubernetes cluster using the Helm package manager. ● Overridden with Cloud SQL Postgres instance. ● Git sync with the analytics repository. ● Modified Web server secret key and fernet key
  • 13. © 2023 GitLab Inc. ● 88 Active Airflow Dags ● 1200+ Task run every 24 hours ● Empowering Workflows:Task Dynamism with Airflow ● K8s Pods operator to schedule dynamic workload ● Cost effective solution ● On-Demand Node Provisioning with Terraform ● Minimal downtime , typically under 45 min, in the event of a disaster recovery scenario. How this benefits the data platform team managing data pipeline?
  • 14. © 2023 GitLab Inc. Security Best Practices ● Private Cluster Configuration ● VPC Peering ● Identity and Access Management (IAM) Controls ● Node Pool Isolation ● Securing Secrets Best Practices and Considerations Scalability Considerations ● Horizontal Pod Autoscaling (HPA) ● Database Scaling ● Task Parallelism ● Resource Requests and Limits ● Persistent Storage Considerations ● GKE Node Pools Monitoring and Logging Strategies ● Leverage K8s-native monitoring solutions Prometheus and Grafana ● Alerting and Notification Channels. ● Airflow Metrics.
  • 15. © 2023 GitLab Inc. About me Find me, ping me, ask me
  • 16. © 2023 GitLab Inc. Additional Resources ● Gitlab Handbook for information about nodepool and namespace. ● Airflow Infrastructure ● Gitlab Data Analytics or our Dag bags
  • 17. © 2023 GitLab Inc. Thank you

Editor's Notes

  1. Introduction 1
  2. Introduction 4
  3. And Off course Q&A
  4. Key Features: Managed Kubernetes: Leverage the power of Kubernetes without the operational overhead. Automatic Scaling: Seamlessly scale your applications with automated load balancing. Security and Compliance: Built-in security features and compliance standards for peace of mind. Integrated Developer Tools: Tight integration with Google Cloud's developer tools and services. Benefits: Efficiency: Simplifies container orchestration, enabling efficient deployment and scaling. Reliability: Google's infrastructure ensures high availability and reliability. Flexibility: Run containerized applications anywhere, on-premises or in the cloud. Use Cases: Microservices Deployment: Ideal for deploying and managing microservices architectures. Continuous Integration/Continuous Deployment (CI/CD): Streamlines CI/CD pipelines with Kubernetes. Scalable Applications: Easily scale applications based on demand. At gitlab within data platform team it is being used for Scalable airflow , Gitlab CI/CD pipeline for our analytics repo.
  5. Key Features: Directed Acyclic Graphs (DAGs): Represent workflows as code, defining the sequence and dependencies of tasks. Extensibility: Easily extend functionality with custom operators, sensors, and hooks. Dynamic Workflow Execution: Dynamically generate workflows based on external parameters. Rich UI and Logging: User-friendly interface for monitoring, logging, and visualizing workflow runs. Scalability: Scales horizontally to handle large-scale data processing and orchestration. Apache Airflow empowers organizations to streamline complex data workflows with flexibility and reliability.
  6. Few Key features for Terraform Infrastructure as Code (IaC): Terraform allows users to define and manage infrastructure using a declarative configuration language, enabling version control, collaboration, and the ability to treat infrastructure as code. Multi-Cloud Provisioning: Terraform supports various cloud providers (AWS, Azure, Google Cloud, etc.) and on-premises environments, providing a consistent approach to provisioning and managing infrastructure across different platforms. Declarative Configuration Language: The HashiCorp Configuration Language (HCL) used by Terraform is designed for readability and ease of use, making it straightforward to express infrastructure configurations. Plan and Apply Workflow: Terraform follows a workflow of planning and applying changes. The terraform plan command previews the changes before execution, and terraform apply implements the changes, ensuring safety and control over infrastructure modifications. State Management: Terraform maintains a state file that records the current state of the infrastructure. This state allows Terraform to determine what changes are necessary and provides a basis for understanding the existing infrastructure. Use Cases: Provisioning Servers: Create and manage virtual machines or containers. Network Infrastructure: Define and configure networks, subnets, and security groups. Application Deployments: Deploy and manage applications and their dependencies.
  7. What is Helm? Helm is a package manager for Kubernetes applications, simplifying the deployment and management of containerized applications. Key Concepts: Charts: Helm packages are called charts, which encapsulate all the resources needed for an application—services, deployments, and more. Values: Parameterized configurations allow customization of charts for different environments. Repositories: Share and discover charts through Helm repositories, fostering a vibrant ecosystem. Benefits of Helm: Reusability: Easily share and reuse application configurations across teams and projects. Versioning: Charts can be versioned, enabling precise control over application deployments. Templating: Helm uses Go templating to generate Kubernetes manifests dynamically. Workflow: helm install: Deploy a chart to a Kubernetes cluster with a single command. helm upgrade: Seamlessly update a deployed application with new configurations or versions. helm rollback: Roll back to a previous version of an application in case of issues. Community and Adoption: Helm has a thriving community and is widely adopted in the Kubernetes ecosystem. Many popular applications and services provide Helm charts for easy integration. Conclusion: Helm simplifies Kubernetes application deployment and management, offering a standardized and efficient way to package, version, and share applications in the Kubernetes environment. Use Case: Microservices Deployment: Step 1: Chart Creation: Package each microservice with its associated Kubernetes resources (Deployments, Services, ConfigMaps) into a Helm chart. Step 2: Chart Sharing: Share Helm charts across your development team or with the broader community via Helm Hub. Step 3: Consistent Deployments: Developers can use the same Helm chart to deploy the microservice consistently across different environments. Step 4: Versioning: Version your Helm charts to track changes, ensuring consistency and repeatability in deployments.
  8. Integration of Airflow with GKE using Helm and Terraform Why Use GKE for Airflow? Lots of advantages and reason but to summary we can call . Managed Kubernetes Service: Effortless Orchestration: GKE provides a fully managed Kubernetes service, eliminating the operational burden of setting up and maintaining Kubernetes clusters. This allows users to focus more on Airflow configurations and workflows. 2. Scalability: Dynamic Scaling: GKE allows for easy horizontal scaling, enabling Airflow to adapt to varying workloads by dynamically adjusting the number of pods based on demand. This ensures optimal resource utilization. 3. Automated Operations: Built-in Automation: GKE automates routine operational tasks like patching, updates, and cluster scaling. This reduces manual intervention and ensures that the Airflow environment is consistently up-to-date and secure. 4. Integrated Developer Tools: Seamless Integration: GKE integrates seamlessly with other Google Cloud services and developer tools. This includes integration with Cloud Monitoring, Logging, and Identity and Access Management (IAM), enhancing the overall management experience. 5. Google Cloud Ecosystem: Interoperability: Leveraging GKE within the broader Google Cloud ecosystem provides opportunities for integration with various services such as BigQuery, Cloud Storage, and Pub/Sub, enhancing the capabilities and data processing options for Airflow workflows. 6. High Availability and Reliability: Built-in Redundancy: GKE ensures high availability and reliability through multi-zone deployments, distributing Airflow components across multiple availability zones to mitigate the risk of single points of failure. Cost Efficiency: Pay-as-You-Go Model: GKE operates on a pay-as-you-go pricing model, providing cost efficiency by dynamically scaling resources based on demand. Users only pay for the resources consumed during active workflows. Helm Charts for Airflow. Due to below reason Standardized Packaging: Consistent Deployment: Helm Charts provide a standardized way to package, version, and deploy applications. Using Helm for Airflow ensures consistency across different environments, making it easier to reproduce deployments. 2. Simplified Configuration: Templating Engine: Helm uses Go templating to parameterize Kubernetes manifests. This allows users to customize Airflow configurations easily, adapting them to specific deployment scenarios without manual editing of YAML files. 3. Version Control and Rollbacks: Built-in Versioning: Helm Charts support versioning, allowing users to roll back to a previous state in case of issues. This ensures that changes to the Airflow deployment can be tracked, managed, and reverted when necessary. 4. Reusability: Shareable Configurations: Helm Charts can be shared and reused across teams and projects. This promotes collaboration and standardizes the deployment process, as the same Helm Chart can be used across different Airflow instances. In conclusion, Helm Charts offer a robust and flexible solution for deploying Apache Airflow by providing a standardized packaging format, streamlined configuration management, and a vibrant community ecosystem. The use of Helm simplifies the deployment and management of Airflow in Kubernetes environments. Terraform Modules for GKE Using Terraform modules for GKE provides a structured, reusable, and scalable approach to managing Kubernetes clusters, promoting consistency and best practices across your infrastructure deployments.
  9. Terraform Sets the Foundation: Initiate the process by using Terraform to provision a robust GKE cluster. Define infrastructure as code to establish the underlying Kubernetes environment for Apache Airflow. GKE Ensures Seamless Operations: Google Kubernetes Engine manages the Kubernetes cluster, providing automated operations, scalability, and integration with Google Cloud services. The GKE cluster becomes the orchestration backbone for deploying and managing applications. The synergy of GKE, Helm, Terraform, and Airflow provides a comprehensive solution for deploying, managing, and orchestrating data workflows in a cloud-native environment. This integrated approach combines infrastructure provisioning, application deployment, and workflow orchestration, offering a scalable, efficient, and maintainable solution for complex data processing scenarios.
  10. The helm chart of airflow creates 4 pods in the cluster for managing airflow, below: airflow-scheduler airflow-webserver airflow-pgbouncer: supplemental DB component which provides additional DB security and connection management. airflow-statsd: enables reading and monitoring of airflow metrics in prometheus (still to be implemented) The scheduler, webserver, and any workers created also include cloud-sql-proxy side car container which connects the containers to the external DB using service account credentials. Additionally, the scheduler and webserver also include: git-sync side car container which updates the DAGs repo with any changes detected in the repository. The install also requires an external postgres DB, which needs to be created manually.
  11. Private Cluster Configuration: Recommendation: Deploy GKE clusters as private clusters to limit exposure to the public internet. Rationale: Private clusters minimize the attack surface by restricting external access to the cluster. VPC Peering or VPN Setup: Recommendation: Establish VPC peering or set up a VPN connection between GKE clusters and other relevant networks. Rationale: Securely connect GKE clusters to other resources while maintaining network isolation and encryption. Identity and Access Management (IAM) Controls: Recommendation: Implement the principle of least privilege by assigning minimal necessary permissions to service accounts and users. Rationale: Reducing unnecessary access minimizes the risk of unauthorized actions. Node Pool Isolation: Recommendation: Utilize separate node pools for Airflow components and user workloads. Rationale: Isolating node pools ensures that Airflow components run independently from user applications, enhancing security and resource management. Securing Secrets: Recommendation: Utilize Kubernetes Secrets or external secret management tools for storing sensitive information such as database credentials and API keys. Rationale: Protecting secrets is crucial for preventing unauthorized access to critical resources. Adhering to these security best practices helps fortify your Apache Airflow installation on Google Kubernetes Engine, fostering a secure and resilient orchestration environment. Scalability Considerations When running Apache Airflow on Google Kubernetes Engine (GKE), several scalability considerations should be taken into account to ensure optimal performance and resource utilization: Horizontal Pod Autoscaling (HPA): Utilize Kubernetes Horizontal Pod Autoscaling to automatically adjust the number of Airflow worker pods based on CPU or memory utilization. This ensures that resources are allocated efficiently to meet the demands of running workflows. Database Scaling: Consider the scalability of the database backend used by Airflow (e.g., PostgreSQL). Ensure that the database is appropriately provisioned and tuned to handle the increasing metadata storage requirements as the number of tasks and workflows grows. Task Parallelism: Design Airflow DAGs with parallelism in mind. Break down workflows into smaller tasks to enable better parallel execution, taking advantage of the scalability features in GKE. Resource Requests and Limits: Set appropriate resource requests and limits for Airflow pods to ensure they receive the necessary resources and prevent resource contention within the cluster. GKE Node Pools: Utilize GKE node pools to segregate workloads with varying resource requirements. This allows for better resource isolation and scaling based on specific task characteristics. By addressing these scalability considerations, you can create a robust and scalable Apache Airflow deployment on GKE, ensuring efficient utilization of resources and accommodating the evolving demands of your data workflows. Monitoring and Logging Strategies When setting up Apache Airflow on Google Kubernetes Engine (GKE) using Terraform, it's crucial to establish effective monitoring and logging strategies to ensure the stability, performance, and security of your deployment. Here are key considerations for monitoring and logging: 1. Kubernetes Monitoring: Leverage Kubernetes-native monitoring solutions like Prometheus and Grafana. Set up Prometheus to collect metrics from the Airflow pods and use Grafana dashboards for visualization. 2. Airflow Metrics:Enable Airflow's built-in metrics exporter to expose key performance metrics. This includes metrics related to DAG execution, task durations, and scheduler performance. 3. Alerting and Notification Channels:- Configure alerting channels such as email, Slack, or PagerDuty to receive notifications when predefined thresholds are breached. Ensure timely responses to critical issues. By incorporating these monitoring and logging strategies into your Airflow deployment on GKE with Terraform, you can create a robust observability framework, allowing for proactive issue detection, efficient debugging, and continuous improvement of your orchestration environment.
  12. Now the favorite part of Q&A/ For any additional questions or info needed, looking forward to hearing from you. Do not hesitate to contact me with any questions.