3. Agenda
Components of a modern data Engineering & Warehousing
High-level design of ETL in Azure
Overview of Data Factory & Components
Demo - Create ADF Pipeline to Copy data into Synapse
Analytics
4. Components of a modern data Engineering &
Warehousing
Azure Data
Factory
Azure Data
Lake
Azure
Databricks
Azure
HDInsight
Azure Synapse
Analytics
6. Azure Data Factory
• Code free ETL/ELT orchestrator for hybrid
data ingestion.
• Currently, It has more than 85 pre-built
connectors, which helps user to establish a
connection with source and destination from
various platforms like other cloud & on-
premise.
• Work is defined as a pipeline operation – runs
continuously as CD.
• Can either run in cloud or self-hosted network.
• You can perform data
processing/transformation using ADF though
recommended only for small/medium size
data.
7. Data Ingestion using Azure Data Factory
Types of data ingestion in Azure
ADF
Heterogenous
PolyBase
70
100
80
File based
SSIS
Heterogenous
8. Data Ingestion using Azure Data Factory (Cont…)
components of Azure Data Factory
Linked Service
Data
Lake Store
Azure
Databricks
Activities
Pipeline
Triggers
Dataset
@ Parameters
IR Integration
Runtime
CF Control
Flow