SlideShare a Scribd company logo
1 of 39
Airflow
a platform to programmatically author, schedule and monitor workflows.
Fact G4roblems
We’re facing with a lots of workflows.
Sale Orders
New
Employee
Onboarding
We’re facing with
a lots of disadvantages with current methods.
Do nothing without saying Good luck!
No notification when finish/fail/retry
No status monitoring for running script
No flexible integration mechanism
No dependencies management
No version controller for workflow’s scripts
No failure-retry and re-run mechanism
No centralize logging mechanism
No backfill mechanism
No scalability
Airflow
comes into the picture
Go deep in
technical perspective
Overview of Airflow
Airflow Operators
Airflow Executors
Demo
Agenda
Airflow is created by Airbnb
Written in Python
Is a workflow management system
Open-source, no cost
“Configuration as code” principal
Airflow is used for:
Run ETL pipelines
Data ingestion pipelines
Machine learning pipelines
Predictive data pipelines
General purpose scheduler
We need to know some concepts.
DAG
Directed Acyclic Graph
DAG
Directed Acyclic Graph
Task
DAG
Directed Acyclic Graph
We need calculate: (a+b)*(d+c)
task_1=a +b
task_2=d +c
task_3=task_1*task_2
We write DAG:
[task_1,task_2]>
>task_3
A DAG can run manually or automatically.
A DAG can be scheduled in the future.
A DAG can run periodically
Three types of Task
Operators
Sensors
TaskFlow
Operators
An Operator is conceptually a template for a predefined Task
BashOperator - executes a bash command
PythonOperator - calls an arbitrary Python function
EmailOperator - sends an email
SimpleHttpOperator
MySqlOperator
PostgresOperator
MsSqlOperator
OracleOperator
JdbcOperator
DockerOperator
KubernetesPodOperator
S3FileTransformOperator
PrestoToMySqlOperator
SlackAPIOperator
Builtin Provider Packages
• The FileSensor: Waits for a file or folder to land in a filesystem.
• The S3KeySensor: Waits for a key to be present in a S3 bucket.
• The SqlSensor: Runs a sql statement repeatedly until a criteria is met.
• The ExternalTaskSensor: Waits for a different DAG or a task in a different DAG to complete for a
specific execution date.
• The DateTimeSensor: Waits until the specified datetime (Useful to add some delay to your DAGs)
• The TimeDeltaSensor: Waits for a timedelta after the task’s execution_date +schedule interval (Looks
similar to the previous one no?)
Sensors
Sensors are a special type of Operator that are designed to do exactly one thing - wait
for something to occur
TaskFlow
Architect
Metadata Database
to store system data
DAG Directory
to store DAG files
Webserver
to monitor executors
Scheduler
engine to fire executors
Executor
execute DAGs
Architect
Webserver
Executors
Debug Executor - debug tool and can be used from IDE
Local Executor - runs tasks by spawning processes in a
controlled fashion in different modes.
Sequential Executor - run one task instance at a time
(for testing purpose)
Celery Executor
CeleryKubernetes Executor
Dask Executor
Kubernetes Executor
LocalKubernetes Executor
Local Remote
Local Executor
one of the ways you can scale out the number of workers.
Celery Executors
one of the ways you can scale out the number of workers.
K8sExecutors
The Kubernetes executor runs each task instance in its own pod on a
Kubernetes cluster.
K8sExecutors
K8sExecutors vs K8sPodOperator
K8s Executor –create a Pod with Airflow image (can not change) and
then run the task inside it. Task in Pod will save result into DB.
K8s Pod Operator –the task run outside of K8s cluster and then create
Pod with any image and run anything inside Pod. After finish, Pod
return the results to task, task save result into DB.
Demo
Case #1
Run simple DAG at Local
Case #2
Run complex DAG on local
Case #3
FileSensor
Case #4
ML training pipeline
Thank you
for your attention!

More Related Content

Similar to adaidoadaoap9dapdadadjoadjoajdoiajodiaoiao

Yaetos Tech Overview
Yaetos Tech OverviewYaetos Tech Overview
Yaetos Tech Overviewprevota
 
Monitoring Kubernetes with Prometheus (Kubernetes Ireland, 2016)
Monitoring Kubernetes with Prometheus (Kubernetes Ireland, 2016)Monitoring Kubernetes with Prometheus (Kubernetes Ireland, 2016)
Monitoring Kubernetes with Prometheus (Kubernetes Ireland, 2016)Brian Brazil
 
Introduce Airflow.ppsx
Introduce Airflow.ppsxIntroduce Airflow.ppsx
Introduce Airflow.ppsxManKD
 
Apache Airflow in the Cloud: Programmatically orchestrating workloads with Py...
Apache Airflow in the Cloud: Programmatically orchestrating workloads with Py...Apache Airflow in the Cloud: Programmatically orchestrating workloads with Py...
Apache Airflow in the Cloud: Programmatically orchestrating workloads with Py...Kaxil Naik
 
Functioning incessantly of Data Science Platform with Kubeflow - Albert Lewan...
Functioning incessantly of Data Science Platform with Kubeflow - Albert Lewan...Functioning incessantly of Data Science Platform with Kubeflow - Albert Lewan...
Functioning incessantly of Data Science Platform with Kubeflow - Albert Lewan...GetInData
 
How I learned to time travel, or, data pipelining and scheduling with Airflow
How I learned to time travel, or, data pipelining and scheduling with AirflowHow I learned to time travel, or, data pipelining and scheduling with Airflow
How I learned to time travel, or, data pipelining and scheduling with AirflowPyData
 
Devops with Python by Yaniv Cohen DevopShift
Devops with Python by Yaniv Cohen DevopShiftDevops with Python by Yaniv Cohen DevopShift
Devops with Python by Yaniv Cohen DevopShiftYaniv cohen
 
Apache Airflow in Production
Apache Airflow in ProductionApache Airflow in Production
Apache Airflow in ProductionRobert Sanders
 
Introduction to Apache Airflow
Introduction to Apache AirflowIntroduction to Apache Airflow
Introduction to Apache Airflowmutt_data
 
Airflow tutorials hands_on
Airflow tutorials hands_onAirflow tutorials hands_on
Airflow tutorials hands_onpko89403
 
airflowpresentation1-180717183432.pptx
airflowpresentation1-180717183432.pptxairflowpresentation1-180717183432.pptx
airflowpresentation1-180717183432.pptxVIJAYAPRABAP
 
Thinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-AriThinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-AriDemi Ben-Ari
 
Prefect Paris Airflow Meetup Jeff Hale April 2023.pdf
Prefect Paris Airflow Meetup Jeff Hale April 2023.pdfPrefect Paris Airflow Meetup Jeff Hale April 2023.pdf
Prefect Paris Airflow Meetup Jeff Hale April 2023.pdfJeff Hale
 
Integration-Monday-Stateful-Programming-Models-Serverless-Functions
Integration-Monday-Stateful-Programming-Models-Serverless-FunctionsIntegration-Monday-Stateful-Programming-Models-Serverless-Functions
Integration-Monday-Stateful-Programming-Models-Serverless-FunctionsBizTalk360
 
Airflow techtonic template
Airflow   techtonic templateAirflow   techtonic template
Airflow techtonic templateSampath Kumar
 
DockerCon Europe 2018 Monitoring & Logging Workshop
DockerCon Europe 2018 Monitoring & Logging WorkshopDockerCon Europe 2018 Monitoring & Logging Workshop
DockerCon Europe 2018 Monitoring & Logging WorkshopBrian Christner
 
Relay: The Next Leg, Eric Sorenson, Puppet
Relay: The Next Leg, Eric Sorenson, PuppetRelay: The Next Leg, Eric Sorenson, Puppet
Relay: The Next Leg, Eric Sorenson, PuppetPuppet
 
Airflow presentation
Airflow presentationAirflow presentation
Airflow presentationIlias Okacha
 

Similar to adaidoadaoap9dapdadadjoadjoajdoiajodiaoiao (20)

Yaetos Tech Overview
Yaetos Tech OverviewYaetos Tech Overview
Yaetos Tech Overview
 
Monitoring Kubernetes with Prometheus (Kubernetes Ireland, 2016)
Monitoring Kubernetes with Prometheus (Kubernetes Ireland, 2016)Monitoring Kubernetes with Prometheus (Kubernetes Ireland, 2016)
Monitoring Kubernetes with Prometheus (Kubernetes Ireland, 2016)
 
Introduce Airflow.ppsx
Introduce Airflow.ppsxIntroduce Airflow.ppsx
Introduce Airflow.ppsx
 
Apache Airflow in the Cloud: Programmatically orchestrating workloads with Py...
Apache Airflow in the Cloud: Programmatically orchestrating workloads with Py...Apache Airflow in the Cloud: Programmatically orchestrating workloads with Py...
Apache Airflow in the Cloud: Programmatically orchestrating workloads with Py...
 
North east user group tour
North east user group tourNorth east user group tour
North east user group tour
 
Functioning incessantly of Data Science Platform with Kubeflow - Albert Lewan...
Functioning incessantly of Data Science Platform with Kubeflow - Albert Lewan...Functioning incessantly of Data Science Platform with Kubeflow - Albert Lewan...
Functioning incessantly of Data Science Platform with Kubeflow - Albert Lewan...
 
Airflow presentation
Airflow presentationAirflow presentation
Airflow presentation
 
How I learned to time travel, or, data pipelining and scheduling with Airflow
How I learned to time travel, or, data pipelining and scheduling with AirflowHow I learned to time travel, or, data pipelining and scheduling with Airflow
How I learned to time travel, or, data pipelining and scheduling with Airflow
 
Devops with Python by Yaniv Cohen DevopShift
Devops with Python by Yaniv Cohen DevopShiftDevops with Python by Yaniv Cohen DevopShift
Devops with Python by Yaniv Cohen DevopShift
 
Apache Airflow in Production
Apache Airflow in ProductionApache Airflow in Production
Apache Airflow in Production
 
Introduction to Apache Airflow
Introduction to Apache AirflowIntroduction to Apache Airflow
Introduction to Apache Airflow
 
Airflow tutorials hands_on
Airflow tutorials hands_onAirflow tutorials hands_on
Airflow tutorials hands_on
 
airflowpresentation1-180717183432.pptx
airflowpresentation1-180717183432.pptxairflowpresentation1-180717183432.pptx
airflowpresentation1-180717183432.pptx
 
Thinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-AriThinking DevOps in the era of the Cloud - Demi Ben-Ari
Thinking DevOps in the era of the Cloud - Demi Ben-Ari
 
Prefect Paris Airflow Meetup Jeff Hale April 2023.pdf
Prefect Paris Airflow Meetup Jeff Hale April 2023.pdfPrefect Paris Airflow Meetup Jeff Hale April 2023.pdf
Prefect Paris Airflow Meetup Jeff Hale April 2023.pdf
 
Integration-Monday-Stateful-Programming-Models-Serverless-Functions
Integration-Monday-Stateful-Programming-Models-Serverless-FunctionsIntegration-Monday-Stateful-Programming-Models-Serverless-Functions
Integration-Monday-Stateful-Programming-Models-Serverless-Functions
 
Airflow techtonic template
Airflow   techtonic templateAirflow   techtonic template
Airflow techtonic template
 
DockerCon Europe 2018 Monitoring & Logging Workshop
DockerCon Europe 2018 Monitoring & Logging WorkshopDockerCon Europe 2018 Monitoring & Logging Workshop
DockerCon Europe 2018 Monitoring & Logging Workshop
 
Relay: The Next Leg, Eric Sorenson, Puppet
Relay: The Next Leg, Eric Sorenson, PuppetRelay: The Next Leg, Eric Sorenson, Puppet
Relay: The Next Leg, Eric Sorenson, Puppet
 
Airflow presentation
Airflow presentationAirflow presentation
Airflow presentation
 

Recently uploaded

一比一原版(uOttawa毕业证书)加拿大渥太华大学毕业证如何办理
一比一原版(uOttawa毕业证书)加拿大渥太华大学毕业证如何办理一比一原版(uOttawa毕业证书)加拿大渥太华大学毕业证如何办理
一比一原版(uOttawa毕业证书)加拿大渥太华大学毕业证如何办理hwoudye
 
Abortion pills in Jeddah +966572737505 <> buy cytotec <> unwanted kit Saudi A...
Abortion pills in Jeddah +966572737505 <> buy cytotec <> unwanted kit Saudi A...Abortion pills in Jeddah +966572737505 <> buy cytotec <> unwanted kit Saudi A...
Abortion pills in Jeddah +966572737505 <> buy cytotec <> unwanted kit Saudi A...samsungultra782445
 
Food Preservatives Market by Product Type, Distribution Channel, End User 202...
Food Preservatives Market by Product Type, Distribution Channel, End User 202...Food Preservatives Market by Product Type, Distribution Channel, End User 202...
Food Preservatives Market by Product Type, Distribution Channel, End User 202...IMARC Group
 
PRESTAIR MANUFACTURER OF DISPLAY COUNTER
PRESTAIR MANUFACTURER OF DISPLAY COUNTERPRESTAIR MANUFACTURER OF DISPLAY COUNTER
PRESTAIR MANUFACTURER OF DISPLAY COUNTERPRESTAIR SYSTEMS LLP
 
Medical Foods final.ppt (Regulatory Aspects of Food & Nutraceiticals)
Medical Foods final.ppt (Regulatory Aspects of Food & Nutraceiticals)Medical Foods final.ppt (Regulatory Aspects of Food & Nutraceiticals)
Medical Foods final.ppt (Regulatory Aspects of Food & Nutraceiticals)Lokesh Kothari
 
一比一原版(LMU毕业证书)英国伦敦都市大学毕业证学位证可查学历认证
一比一原版(LMU毕业证书)英国伦敦都市大学毕业证学位证可查学历认证一比一原版(LMU毕业证书)英国伦敦都市大学毕业证学位证可查学历认证
一比一原版(LMU毕业证书)英国伦敦都市大学毕业证学位证可查学历认证funaxa
 
一比一原版卡尔顿大学毕业证(carleton毕业证)学历认证靠谱办理
一比一原版卡尔顿大学毕业证(carleton毕业证)学历认证靠谱办理一比一原版卡尔顿大学毕业证(carleton毕业证)学历认证靠谱办理
一比一原版卡尔顿大学毕业证(carleton毕业证)学历认证靠谱办理hwoudye
 
CLASSIFICATION AND PROPERTIES OF FATS AND THEIR FUNCTIONS
CLASSIFICATION AND PROPERTIES OF FATS AND THEIR FUNCTIONSCLASSIFICATION AND PROPERTIES OF FATS AND THEIR FUNCTIONS
CLASSIFICATION AND PROPERTIES OF FATS AND THEIR FUNCTIONSDr. TATHAGAT KHOBRAGADE
 
NO1 Top Online Amil Baba in Rawalpindi Contact Number Amil in Rawalpindi Kala...
NO1 Top Online Amil Baba in Rawalpindi Contact Number Amil in Rawalpindi Kala...NO1 Top Online Amil Baba in Rawalpindi Contact Number Amil in Rawalpindi Kala...
NO1 Top Online Amil Baba in Rawalpindi Contact Number Amil in Rawalpindi Kala...Amil baba
 
LESSON 1 PREPARE AND COOKING MEAT GRADE 10
LESSON 1 PREPARE AND COOKING MEAT GRADE 10LESSON 1 PREPARE AND COOKING MEAT GRADE 10
LESSON 1 PREPARE AND COOKING MEAT GRADE 10manwithoutapfp
 
HYDROPONICS cratky method 1234567890qwew
HYDROPONICS  cratky  method 1234567890qwewHYDROPONICS  cratky  method 1234567890qwew
HYDROPONICS cratky method 1234567890qwewRolan Ben Lorono
 
如何办理新加坡管理大学毕业证成绩单留信学历认证
如何办理新加坡管理大学毕业证成绩单留信学历认证如何办理新加坡管理大学毕业证成绩单留信学历认证
如何办理新加坡管理大学毕业证成绩单留信学历认证zg99vwgda
 
一比一原版查尔斯特大学毕业证如何办理
一比一原版查尔斯特大学毕业证如何办理一比一原版查尔斯特大学毕业证如何办理
一比一原版查尔斯特大学毕业证如何办理hwoudye
 
The Codex Alimentarius Commission (CAC).
The Codex Alimentarius Commission (CAC).The Codex Alimentarius Commission (CAC).
The Codex Alimentarius Commission (CAC).Ravikumar Vaniya
 
FSSAI.ppt Food safety standards act in RAFN
FSSAI.ppt Food safety standards act in RAFNFSSAI.ppt Food safety standards act in RAFN
FSSAI.ppt Food safety standards act in RAFNLokesh Kothari
 
原版1:1定制(IC大学毕业证)帝国理工学院大学毕业证国外文凭复刻成绩单#电子版制作#留信入库#多年经营绝对保证质量
原版1:1定制(IC大学毕业证)帝国理工学院大学毕业证国外文凭复刻成绩单#电子版制作#留信入库#多年经营绝对保证质量原版1:1定制(IC大学毕业证)帝国理工学院大学毕业证国外文凭复刻成绩单#电子版制作#留信入库#多年经营绝对保证质量
原版1:1定制(IC大学毕业证)帝国理工学院大学毕业证国外文凭复刻成绩单#电子版制作#留信入库#多年经营绝对保证质量funaxa
 

Recently uploaded (17)

一比一原版(uOttawa毕业证书)加拿大渥太华大学毕业证如何办理
一比一原版(uOttawa毕业证书)加拿大渥太华大学毕业证如何办理一比一原版(uOttawa毕业证书)加拿大渥太华大学毕业证如何办理
一比一原版(uOttawa毕业证书)加拿大渥太华大学毕业证如何办理
 
Abortion pills in Jeddah +966572737505 <> buy cytotec <> unwanted kit Saudi A...
Abortion pills in Jeddah +966572737505 <> buy cytotec <> unwanted kit Saudi A...Abortion pills in Jeddah +966572737505 <> buy cytotec <> unwanted kit Saudi A...
Abortion pills in Jeddah +966572737505 <> buy cytotec <> unwanted kit Saudi A...
 
Food Preservatives Market by Product Type, Distribution Channel, End User 202...
Food Preservatives Market by Product Type, Distribution Channel, End User 202...Food Preservatives Market by Product Type, Distribution Channel, End User 202...
Food Preservatives Market by Product Type, Distribution Channel, End User 202...
 
PRESTAIR MANUFACTURER OF DISPLAY COUNTER
PRESTAIR MANUFACTURER OF DISPLAY COUNTERPRESTAIR MANUFACTURER OF DISPLAY COUNTER
PRESTAIR MANUFACTURER OF DISPLAY COUNTER
 
Medical Foods final.ppt (Regulatory Aspects of Food & Nutraceiticals)
Medical Foods final.ppt (Regulatory Aspects of Food & Nutraceiticals)Medical Foods final.ppt (Regulatory Aspects of Food & Nutraceiticals)
Medical Foods final.ppt (Regulatory Aspects of Food & Nutraceiticals)
 
一比一原版(LMU毕业证书)英国伦敦都市大学毕业证学位证可查学历认证
一比一原版(LMU毕业证书)英国伦敦都市大学毕业证学位证可查学历认证一比一原版(LMU毕业证书)英国伦敦都市大学毕业证学位证可查学历认证
一比一原版(LMU毕业证书)英国伦敦都市大学毕业证学位证可查学历认证
 
一比一原版卡尔顿大学毕业证(carleton毕业证)学历认证靠谱办理
一比一原版卡尔顿大学毕业证(carleton毕业证)学历认证靠谱办理一比一原版卡尔顿大学毕业证(carleton毕业证)学历认证靠谱办理
一比一原版卡尔顿大学毕业证(carleton毕业证)学历认证靠谱办理
 
CLASSIFICATION AND PROPERTIES OF FATS AND THEIR FUNCTIONS
CLASSIFICATION AND PROPERTIES OF FATS AND THEIR FUNCTIONSCLASSIFICATION AND PROPERTIES OF FATS AND THEIR FUNCTIONS
CLASSIFICATION AND PROPERTIES OF FATS AND THEIR FUNCTIONS
 
NO1 Top Online Amil Baba in Rawalpindi Contact Number Amil in Rawalpindi Kala...
NO1 Top Online Amil Baba in Rawalpindi Contact Number Amil in Rawalpindi Kala...NO1 Top Online Amil Baba in Rawalpindi Contact Number Amil in Rawalpindi Kala...
NO1 Top Online Amil Baba in Rawalpindi Contact Number Amil in Rawalpindi Kala...
 
LESSON 1 PREPARE AND COOKING MEAT GRADE 10
LESSON 1 PREPARE AND COOKING MEAT GRADE 10LESSON 1 PREPARE AND COOKING MEAT GRADE 10
LESSON 1 PREPARE AND COOKING MEAT GRADE 10
 
HYDROPONICS cratky method 1234567890qwew
HYDROPONICS  cratky  method 1234567890qwewHYDROPONICS  cratky  method 1234567890qwew
HYDROPONICS cratky method 1234567890qwew
 
如何办理新加坡管理大学毕业证成绩单留信学历认证
如何办理新加坡管理大学毕业证成绩单留信学历认证如何办理新加坡管理大学毕业证成绩单留信学历认证
如何办理新加坡管理大学毕业证成绩单留信学历认证
 
一比一原版查尔斯特大学毕业证如何办理
一比一原版查尔斯特大学毕业证如何办理一比一原版查尔斯特大学毕业证如何办理
一比一原版查尔斯特大学毕业证如何办理
 
The Codex Alimentarius Commission (CAC).
The Codex Alimentarius Commission (CAC).The Codex Alimentarius Commission (CAC).
The Codex Alimentarius Commission (CAC).
 
FSSAI.ppt Food safety standards act in RAFN
FSSAI.ppt Food safety standards act in RAFNFSSAI.ppt Food safety standards act in RAFN
FSSAI.ppt Food safety standards act in RAFN
 
Jual Obat Aborsi Sorong, Wa : 0822/2310/9953 Apotik Jual Obat Cytotec Di Sorong
Jual Obat Aborsi Sorong, Wa : 0822/2310/9953 Apotik Jual Obat Cytotec Di SorongJual Obat Aborsi Sorong, Wa : 0822/2310/9953 Apotik Jual Obat Cytotec Di Sorong
Jual Obat Aborsi Sorong, Wa : 0822/2310/9953 Apotik Jual Obat Cytotec Di Sorong
 
原版1:1定制(IC大学毕业证)帝国理工学院大学毕业证国外文凭复刻成绩单#电子版制作#留信入库#多年经营绝对保证质量
原版1:1定制(IC大学毕业证)帝国理工学院大学毕业证国外文凭复刻成绩单#电子版制作#留信入库#多年经营绝对保证质量原版1:1定制(IC大学毕业证)帝国理工学院大学毕业证国外文凭复刻成绩单#电子版制作#留信入库#多年经营绝对保证质量
原版1:1定制(IC大学毕业证)帝国理工学院大学毕业证国外文凭复刻成绩单#电子版制作#留信入库#多年经营绝对保证质量
 

adaidoadaoap9dapdadadjoadjoajdoiajodiaoiao

  • 1. Airflow a platform to programmatically author, schedule and monitor workflows.
  • 3. We’re facing with a lots of workflows.
  • 6. We’re facing with a lots of disadvantages with current methods.
  • 7.
  • 8. Do nothing without saying Good luck!
  • 9. No notification when finish/fail/retry No status monitoring for running script No flexible integration mechanism No dependencies management No version controller for workflow’s scripts No failure-retry and re-run mechanism No centralize logging mechanism No backfill mechanism No scalability
  • 11. Go deep in technical perspective
  • 12. Overview of Airflow Airflow Operators Airflow Executors Demo Agenda
  • 13. Airflow is created by Airbnb Written in Python Is a workflow management system Open-source, no cost “Configuration as code” principal
  • 14. Airflow is used for: Run ETL pipelines Data ingestion pipelines Machine learning pipelines Predictive data pipelines General purpose scheduler
  • 15. We need to know some concepts.
  • 18. DAG Directed Acyclic Graph We need calculate: (a+b)*(d+c) task_1=a +b task_2=d +c task_3=task_1*task_2 We write DAG: [task_1,task_2]> >task_3
  • 19.
  • 20. A DAG can run manually or automatically. A DAG can be scheduled in the future. A DAG can run periodically
  • 21. Three types of Task Operators Sensors TaskFlow
  • 22. Operators An Operator is conceptually a template for a predefined Task BashOperator - executes a bash command PythonOperator - calls an arbitrary Python function EmailOperator - sends an email SimpleHttpOperator MySqlOperator PostgresOperator MsSqlOperator OracleOperator JdbcOperator DockerOperator KubernetesPodOperator S3FileTransformOperator PrestoToMySqlOperator SlackAPIOperator Builtin Provider Packages
  • 23. • The FileSensor: Waits for a file or folder to land in a filesystem. • The S3KeySensor: Waits for a key to be present in a S3 bucket. • The SqlSensor: Runs a sql statement repeatedly until a criteria is met. • The ExternalTaskSensor: Waits for a different DAG or a task in a different DAG to complete for a specific execution date. • The DateTimeSensor: Waits until the specified datetime (Useful to add some delay to your DAGs) • The TimeDeltaSensor: Waits for a timedelta after the task’s execution_date +schedule interval (Looks similar to the previous one no?) Sensors Sensors are a special type of Operator that are designed to do exactly one thing - wait for something to occur
  • 25. Architect Metadata Database to store system data DAG Directory to store DAG files Webserver to monitor executors Scheduler engine to fire executors Executor execute DAGs
  • 28. Executors Debug Executor - debug tool and can be used from IDE Local Executor - runs tasks by spawning processes in a controlled fashion in different modes. Sequential Executor - run one task instance at a time (for testing purpose) Celery Executor CeleryKubernetes Executor Dask Executor Kubernetes Executor LocalKubernetes Executor Local Remote
  • 29. Local Executor one of the ways you can scale out the number of workers.
  • 30. Celery Executors one of the ways you can scale out the number of workers.
  • 31. K8sExecutors The Kubernetes executor runs each task instance in its own pod on a Kubernetes cluster.
  • 33. K8sExecutors vs K8sPodOperator K8s Executor –create a Pod with Airflow image (can not change) and then run the task inside it. Task in Pod will save result into DB. K8s Pod Operator –the task run outside of K8s cluster and then create Pod with any image and run anything inside Pod. After finish, Pod return the results to task, task save result into DB.
  • 34. Demo
  • 35. Case #1 Run simple DAG at Local
  • 36. Case #2 Run complex DAG on local
  • 39. Thank you for your attention!