Boston Azure Cloud User Group
a journey of a thousand miles begins with a single step
Solution Architect at Slalom
Boston Business Intelligence User Group Leader
I am a bit shy but passionate.
BI Architect Speaker Mentor
Business Intelligence Architect, Mentor and Speaker. I specialize in the design and development of enterprise business intelligence
systems, intelligent applications, and real-time and big data analytics solutions.
For the past 8 years I have served as the Boston Business Intelligence User Group Leader.
http://www.meetup.com/Boston-Business-Intelligence/
https://www.linkedin.com/groups/2405400
www.bostonbi.org/blog.aspx
@SlavaKokaev
vkokaev@gmail.com
https://www.linkedin.com/in/kokaev
1. Key capabilities
2. Performing transformations and computations
3. How to create one or more data pipelines
4. Usage
5. Brief introduction to event hubs
6. Configure, Develop and Deploy Solution
1
Introduction to Azure Data Factory Service, a data integration service in the cloud
With the Data Factory service you can create data integration solutions that ingest data
from various data stores, transform or process that data, and publish the result data to
data stores.
Use it to ingest data from multiple on-premises and cloud sources.
Schedule, orchestrate, and manage the data transformation and analysis process.
Transform raw data into finished or shaped data that's ready for consumption by BI tools or by your on-premises or cloud applications and services.
Manage your entire network of data pipelines at a glance to identify issues and take action.
[Diagram: ADF in Azure, with labels Office 365, On-premise Server, DLS, DLA, CS, DSL, AAS, PBI, DW]
2
A pipeline is a logical grouping of activities: a unit whose
activities together perform a task.
To understand pipelines better, you need to understand an
activity first.
Activities define the actions to perform on your data. For
example, you may use a Copy activity to copy data from one
data store to another data store. Similarly, you may use a Hive
activity, which runs a Hive query on an Azure HDInsight cluster
to transform or analyze your data. You may also choose to
create a custom .NET activity to run your own code.
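The relationship between activities and a pipeline can be sketched in a few lines of Python. This is only a toy in-memory model, not the Data Factory SDK: the `Activity` and `Pipeline` classes, the `copy_rows` and `uppercase_names` helpers, and the store names are all hypothetical, chosen to mirror the Copy and transformation examples above.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical toy model: none of these names come from the Data Factory SDK.
Stores = Dict[str, List[dict]]  # store name -> rows held in that store


@dataclass
class Activity:
    name: str
    run: Callable[[Stores], None]  # the action this activity performs on the data


@dataclass
class Pipeline:
    name: str
    activities: List[Activity] = field(default_factory=list)

    def execute(self, stores: Stores) -> List[str]:
        """Run every activity in order and return the names of those that ran."""
        ran = []
        for activity in self.activities:
            activity.run(stores)
            ran.append(activity.name)
        return ran


def copy_rows(source: str, sink: str) -> Callable[[Stores], None]:
    """Build an action that copies all rows from one store to another."""
    def action(stores: Stores) -> None:
        stores[sink] = [dict(row) for row in stores[source]]
    return action


def uppercase_names(store: str) -> Callable[[Stores], None]:
    """Build an action that reshapes the 'name' column of a store in place."""
    def action(stores: Stores) -> None:
        for row in stores[store]:
            row["name"] = row["name"].upper()
    return action


stores = {"raw": [{"name": "ada"}, {"name": "grace"}], "staged": []}
pipeline = Pipeline(
    "demo",
    [
        Activity("CopyRawToStaged", copy_rows("raw", "staged")),
        Activity("ShapeNames", uppercase_names("staged")),
    ],
)
ran = pipeline.execute(stores)
```

The pipeline itself does no work; it only groups the activities and decides when each one runs, which is the sense of "logical grouping" used here.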
[Diagram: a PIPELINE containing Activity 1, Activity 2 and Activity 3]
Data Factory supports two types of activities: data movement activities and data transformation activities.
Copy Activity in Data Factory copies data from a source data
store to a sink data store. Data from any source can be
written to any sink.
Data Transformation Activity transforms data to the desired
format and shape. Transformation activities can be
added to pipelines either individually or chained with
another activity.
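One common way to express chaining is to make the output dataset of one activity the input dataset of the next, so the run order follows from the data dependencies. The sketch below illustrates that idea with Python's standard `graphlib` module (Python 3.9 or later). The activity and dataset names are hypothetical, and `run_order` is an illustrative helper, not a Data Factory API.

```python
from graphlib import TopologicalSorter

# Hypothetical activities: name -> (input dataset names, output dataset names).
# They are listed out of order on purpose; the order is derived from the datasets.
activities = {
    "CopyToSql": ({"ShapedBlob"}, {"AzureSqlTable"}),
    "CopyToBlob": ({"OnPremSqlTable"}, {"RawBlob"}),
    "HiveTransform": ({"RawBlob"}, {"ShapedBlob"}),
}


def run_order(acts):
    """Order activities so each runs after the activities producing its inputs."""
    # Map every dataset to the activity that produces it.
    producers = {ds: name for name, (_, outs) in acts.items() for ds in outs}
    # For each activity, collect the upstream activities it depends on.
    # Inputs with no producer (external data) add no dependency.
    graph = {
        name: {producers[ds] for ds in ins if ds in producers}
        for name, (ins, _) in acts.items()
    }
    return list(TopologicalSorter(graph).static_order())


order = run_order(activities)
```

Here the copy activity feeds the transformation, which in turn feeds the second copy, so a data movement activity and a data transformation activity end up in one chain.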
Data Movement Activities
Azure Blob storage, Azure Data Lake Store
Azure SQL Database, Azure SQL Data Warehouse
Azure Table storage, Azure DocumentDB
Azure Search Index
File System
HDFS
Amazon S3
FTP
SQL Server, Oracle, MySQL,
DB2, Teradata, PostgreSQL,
Sybase, Cassandra, MongoDB, Amazon Redshift
Salesforce
Generic ODBC
Generic OData
Web Table (table from HTML)
GE Historian
Data Transformation Activities
[Two slides of numbered items, 01 to 10; the item labels are not recoverable]
3
Linked services define the information needed for Data
Factory to connect to external resources (Examples: Azure
Storage, on-premises SQL Server, Azure HDInsight).
Linked services are used for two purposes in Data Factory:
To represent a data store including, but not limited to, an on-premises SQL
Server, Oracle database, file share, or an Azure
Blob Storage account.
To represent a compute resource that can host the execution of an activity. For
example, the HDInsight Hive activity runs on an
HDInsight Hadoop cluster.
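The two purposes can be made concrete with a small Python sketch that builds linked service definitions as dictionaries and classifies them. The `name` / `properties` / `type` / `typeProperties` layout is an assumption modeled on the general shape of Data Factory JSON definitions, and the type strings and property keys used here are illustrative placeholders that should be checked against the official documentation.

```python
import json

# Assumed type names, for illustration only; verify against the Data Factory docs.
DATA_STORE_TYPES = {"AzureStorage", "OnPremisesSqlServer"}
COMPUTE_TYPES = {"HDInsight"}


def make_linked_service(name, service_type, type_properties):
    """Build a dictionary shaped like a linked service definition (assumed layout)."""
    return {
        "name": name,
        "properties": {"type": service_type, "typeProperties": type_properties},
    }


def purpose(linked_service):
    """Classify a linked service as a data store or a compute resource."""
    service_type = linked_service["properties"]["type"]
    if service_type in DATA_STORE_TYPES:
        return "data store"
    if service_type in COMPUTE_TYPES:
        return "compute"
    return "unknown"


# Hypothetical names and placeholder connection details.
storage = make_linked_service(
    "StorageLinkedService", "AzureStorage", {"connectionString": "<placeholder>"}
)
cluster = make_linked_service(
    "HiveClusterLinkedService", "HDInsight", {"clusterUri": "<placeholder>"}
)

print(json.dumps(storage, indent=2))
print(purpose(storage), "/", purpose(cluster))
```

Keeping connection details in the linked service means an activity only needs to name the service it uses.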
[Diagram: Linked Services. ADF connects to AB through an AB Linked Service and to DLS through a DLS Linked Service]
3
Linked services link data stores to an Azure data factory.
Datasets represent data structures within the data stores. For
example, an Azure Storage linked service provides connection
information for Data Factory to connect to an Azure Storage
account. An Azure Blob dataset specifies the blob container
and folder in the Azure Blob Storage from which the pipeline
should read the data. Similarly, an Azure SQL linked service
provides connection information for an Azure SQL database
and an Azure SQL dataset specifies the table that contains the
data.
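The division of labor between the two can be sketched as follows: the linked service holds the connection information, while the dataset names a linked service and a location inside that store. The classes, the `resolve` helper, and every name and path below are hypothetical and only illustrate the relationship; they are not part of any Data Factory SDK.

```python
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass(frozen=True)
class LinkedService:
    name: str
    connection: str  # connection information for the data store


@dataclass(frozen=True)
class Dataset:
    name: str
    linked_service_name: str  # which store the data lives in
    location: str             # container/folder or table inside that store


def resolve(dataset: Dataset, services: Dict[str, LinkedService]) -> Tuple[str, str]:
    """Return (connection, location) for a dataset, or fail on a dangling reference."""
    try:
        service = services[dataset.linked_service_name]
    except KeyError:
        raise ValueError(
            f"dataset {dataset.name!r} references unknown linked service "
            f"{dataset.linked_service_name!r}"
        ) from None
    return service.connection, dataset.location


# Hypothetical services and datasets with placeholder connection strings.
services = {
    "StorageLinkedService": LinkedService(
        "StorageLinkedService", "<storage-connection-string>"
    ),
    "AzureSqlLinkedService": LinkedService(
        "AzureSqlLinkedService", "<sql-connection-string>"
    ),
}
blob_input = Dataset("BlobInput", "StorageLinkedService", "mycontainer/input/")
sql_output = Dataset("SqlOutput", "AzureSqlLinkedService", "dbo.Results")

blob_target = resolve(blob_input, services)
sql_target = resolve(sql_output, services)
```

Several datasets can point at the same linked service, so a connection string is defined once and reused for every container, folder, or table in that store.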
Introduction to Azure Data Factory
