Data classification - Confidential Content
Azure Data Factory
Data classification - Confidential Content
Azure Data Factory is a cloud based integration service that allows to create
data driven workflows in the cloud for orchestrating and automating data
movement and data transformation
The aim of Azure Data Factory is to fetch data from one or more sources and
convert them into format that we process . It ingests data and load the data from
different sources .
It is a cloud based ETL that allow us to create data-driven pipelines for orchestrating
data movement and transforming data
Data classification - Confidential Content
Top Level Concepts of Azure Data Factory
• Pipeline
• Activity
• Datasets
• Linked Services
• Triggers
Data classification - Confidential Content
DATASETS
Data classification - Confidential Content
LINKED SERVICES
Linked Services can be found in Manage section of ADF section
Data classification - Confidential Content
ACTIVITY
There are 3 types of activities
Data Movement Activities – This is basically only the Copy Activity
Data Transformation Activity
Control Flow Activity :
•to deal with variables (Append Variable, Set Variable and Filter)
•looping (ForEach and Until)
•branching (the If Condition activity)
•executing other pipelines with Execute Pipeline activity, or SSIS packages with Execute SSIS
Package activity.
•Handling metadata or reference data with the Get Metadata activity or the Lookup activity.
Data classification - Confidential Content
Building a data flows- How it looks
Data classification - Confidential Content
Pipeline
Data classification - Confidential Content
Self hosted integration runtime as the name suggested, is the IR managed by you itself rather than azure. This will make
you responsible for the installation, configuration, maintenance, installing updates and scaling. Now as you host the IR , it
can access the on premises network as well.
Azure-SSIS Integration Runtimes
As the name suggested the azure-SSIS integration runtimes are actually the set of vm running the SQL Server Integration Services (SSIS
by Microsoft. Again the responsibility of the installation, maintenance, are of azure only. Azure Data Factory uses azure-SSIS integration ru
executing SSIS packages.

Azure PPT.pptx

  • 1.
    Data classification -Confidential Content Azure Data Factory
  • 2.
    Data classification -Confidential Content Azure Data Factory is a cloud based integration service that allows to create data driven workflows in the cloud for orchestrating and automating data movement and data transformation The aim of Azure Data Factory is to fetch data from one or more sources and convert them into format that we process . It ingests data and load the data from different sources . It is a cloud based ETL that allow us to create data-driven pipelines for orchestrating data movement and transforming data
  • 3.
    Data classification -Confidential Content Top Level Concepts of Azure Data Factory • Pipeline • Activity • Datasets • Linked Services • Triggers
  • 4.
    Data classification -Confidential Content DATASETS
  • 5.
    Data classification -Confidential Content LINKED SERVICES Linked Services can be found in Manage section of ADF section
  • 6.
    Data classification -Confidential Content ACTIVITY There are 3 types of activities Data Movement Activities – This is basically only the Copy Activity Data Transformation Activity Control Flow Activity : •to deal with variables (Append Variable, Set Variable and Filter) •looping (ForEach and Until) •branching (the If Condition activity) •executing other pipelines with Execute Pipeline activity, or SSIS packages with Execute SSIS Package activity. •Handling metadata or reference data with the Get Metadata activity or the Lookup activity.
  • 7.
    Data classification -Confidential Content Building a data flows- How it looks
  • 8.
    Data classification -Confidential Content Pipeline
  • 9.
    Data classification -Confidential Content Self hosted integration runtime as the name suggested, is the IR managed by you itself rather than azure. This will make you responsible for the installation, configuration, maintenance, installing updates and scaling. Now as you host the IR , it can access the on premises network as well. Azure-SSIS Integration Runtimes As the name suggested the azure-SSIS integration runtimes are actually the set of vm running the SQL Server Integration Services (SSIS by Microsoft. Again the responsibility of the installation, maintenance, are of azure only. Azure Data Factory uses azure-SSIS integration ru executing SSIS packages.