Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse Analytics (Global Azure Norway 2021)

Cathrine Wilhelmsen
Cathrine WilhelmsenData & Analytics Solutions Architect in Evidi | Microsoft Data Platform MVP
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
Pipelines & Data Flows:
Introduction to Data Integration
in Azure Synapse Analytics
Cathrine Wilhelmsen
Global Azure Norway · April 16th, 2021
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
Session Abstract
Do you regularly need to get data for your projects?
Yep! 🙋‍♀️
Data is at the core of every Business Intelligence, Data Science, and Machine Learning project.
You need data to understand what has happened in the past, to predict what may happen in
the future, to discover patterns and anomalies, and to gain the insight necessary for making
faster and better decisions.
But before you can do any of those things, you need to ingest, store, transform, integrate, and
prepare your data. Guess what? You can do all of those things in Azure Synapse Analytics –
without having to write any code!
In this session, we will cover the fundamentals of data integration in Azure Synapse Analytics.
First, we will discuss when Azure Synapse Analytics is the right tool of choice. Then, we will go
through what Pipelines and Data Flows are, and when to use them. Finally, we will see how
easy it is to ingest and transform data both on-premises and in the cloud.
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
@cathrinew
cathrinew.net
© 2020 Cathrine Wilhelmsen (hi@cathrinew.net)
Data Warehousing Business Intelligence
Artificial Intelligence
Big Data and Analytics
Machine Learning
Data Science
© 2020 Cathrine Wilhelmsen (hi@cathrinew.net)
Data Warehousing Business Intelligence
Artificial Intelligence
Big Data and Analytics
Machine Learning
Data Science
© 2020 Cathrine Wilhelmsen (hi@cathrinew.net)
What?
When?
Why?
© 2020 Cathrine Wilhelmsen (hi@cathrinew.net)
Ingest
Store
Transform
Integrate
Prepare
© 2020 Cathrine Wilhelmsen (hi@cathrinew.net)
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
Azure Synapse Analytics
What?
Pipelines & Data Flows
How?
Data Integration
When?
…the next 45 minutes…
Azure Synapse
Analytics
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
What is Azure Synapse Analytics?
All-in-one platform for analytical projects:
• Data Lake (All Data)
• Data Warehouse (Relational Data)
• Data Analytics (SQL, Spark)
• Data Integration (Ingest, Transform)
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
What is Azure Synapse Analytics?
Deeply integrated with other services:
• Azure Purview
• Azure Machine Learning
• Azure Cosmos DB
• Power BI
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
Who can use Azure Synapse Analytics?
Built for collaboration between:
• Data Engineers
• Data Analysts
• Data Scientists
• Data Consumers
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
Integration
Data
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
Data Integration in Azure Synapse Analytics
Code-First
Scripts, Notebooks
Designer-First
Pipelines, Data Flows
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
Data Integration in Azure Synapse Analytics
Copy Data Transform Data
Pipelines &
Data Flows
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
What are Pipelines?
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
What are Activities?
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
What are Datasets?
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
What are Linked Services?
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
What are Data Flows?
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
What are Triggers?
DEMO
DEMO
Pipelines &
Data Flows
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
Code-First
Scripts, Notebooks
Designer-First
Pipelines, Data Flows
?
?
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
Azure Synapse or Azure Data Factory?
© 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
@cathrinew
cathrinew.net
hi@cathrinew.net
cathrinew.net/adf
1 of 29

Recommended

Data Lakehouse Symposium | Day 4 by
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Databricks
1.8K views74 slides
Introduction SQL Analytics on Lakehouse Architecture by
Introduction SQL Analytics on Lakehouse ArchitectureIntroduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureDatabricks
5.8K views52 slides
Modernizing to a Cloud Data Architecture by
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureDatabricks
654 views22 slides
Databricks Fundamentals by
Databricks FundamentalsDatabricks Fundamentals
Databricks FundamentalsDalibor Wijas
635 views16 slides
Learn to Use Databricks for Data Science by
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceDatabricks
1.6K views12 slides
DW Migration Webinar-March 2022.pptx by
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDatabricks
4.3K views25 slides

More Related Content

What's hot

[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga... by
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...DataScienceConferenc1
151 views23 slides
Lakehouse in Azure by
Lakehouse in AzureLakehouse in Azure
Lakehouse in AzureSergio Zenatti Filho
247 views16 slides
Modern Data Warehousing with the Microsoft Analytics Platform System by
Modern Data Warehousing with the Microsoft Analytics Platform SystemModern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform SystemJames Serra
21.3K views42 slides
Databricks Platform.pptx by
Databricks Platform.pptxDatabricks Platform.pptx
Databricks Platform.pptxAlex Ivy
3.4K views46 slides
Introduction to Azure Databricks by
Introduction to Azure DatabricksIntroduction to Azure Databricks
Introduction to Azure DatabricksJames Serra
27.3K views53 slides
Free Training: How to Build a Lakehouse by
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseDatabricks
3.4K views42 slides

What's hot(20)

[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga... by DataScienceConferenc1
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
Modern Data Warehousing with the Microsoft Analytics Platform System by James Serra
Modern Data Warehousing with the Microsoft Analytics Platform SystemModern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform System
James Serra21.3K views
Databricks Platform.pptx by Alex Ivy
Databricks Platform.pptxDatabricks Platform.pptx
Databricks Platform.pptx
Alex Ivy3.4K views
Introduction to Azure Databricks by James Serra
Introduction to Azure DatabricksIntroduction to Azure Databricks
Introduction to Azure Databricks
James Serra27.3K views
Free Training: How to Build a Lakehouse by Databricks
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a Lakehouse
Databricks3.4K views
Achieving Lakehouse Models with Spark 3.0 by Databricks
Achieving Lakehouse Models with Spark 3.0Achieving Lakehouse Models with Spark 3.0
Achieving Lakehouse Models with Spark 3.0
Databricks622 views
Azure Synapse Analytics Overview (r1) by James Serra
Azure Synapse Analytics Overview (r1)Azure Synapse Analytics Overview (r1)
Azure Synapse Analytics Overview (r1)
James Serra24.8K views
Delta Lake OSS: Create reliable and performant Data Lake by Quentin Ambard by Paris Data Engineers !
Delta Lake OSS: Create reliable and performant Data Lake by Quentin AmbardDelta Lake OSS: Create reliable and performant Data Lake by Quentin Ambard
Delta Lake OSS: Create reliable and performant Data Lake by Quentin Ambard
Accelerate and modernize your data pipelines by Paul Van Siclen
Accelerate and modernize your data pipelinesAccelerate and modernize your data pipelines
Accelerate and modernize your data pipelines
Paul Van Siclen112 views
Improving Data Literacy Around Data Architecture by DATAVERSITY
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data Architecture
DATAVERSITY973 views
Large Scale Lakehouse Implementation Using Structured Streaming by Databricks
Large Scale Lakehouse Implementation Using Structured StreamingLarge Scale Lakehouse Implementation Using Structured Streaming
Large Scale Lakehouse Implementation Using Structured Streaming
Databricks490 views
Azure Data Factory by HARIHARAN R
Azure Data FactoryAzure Data Factory
Azure Data Factory
HARIHARAN R816 views
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A... by Cathrine Wilhelmsen
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Data Lake Overview by James Serra
Data Lake OverviewData Lake Overview
Data Lake Overview
James Serra19.9K views
Intro to Delta Lake by Databricks
Intro to Delta LakeIntro to Delta Lake
Intro to Delta Lake
Databricks1.5K views
Architect’s Open-Source Guide for a Data Mesh Architecture by Databricks
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
Databricks3.1K views

Similar to Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse Analytics (Global Azure Norway 2021)

Analytics in a Day Virtual Workshop by
Analytics in a Day Virtual WorkshopAnalytics in a Day Virtual Workshop
Analytics in a Day Virtual WorkshopCCG
264 views174 slides
Azure synapse by usama whaba khan by
Azure synapse by usama whaba khanAzure synapse by usama whaba khan
Azure synapse by usama whaba khanUsama Wahab Khan Cloud, Data and AI
101 views24 slides
Analytics in a Day Ft. Synapse Virtual Workshop by
Analytics in a Day Ft. Synapse Virtual WorkshopAnalytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual WorkshopCCG
129 views116 slides
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac... by
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...Cathrine Wilhelmsen
337 views43 slides
Analytics in a Day Ft. Synapse Virtual Workshop by
Analytics in a Day Ft. Synapse Virtual WorkshopAnalytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual WorkshopCCG
186 views160 slides
New ways to apply infrastructure data for better business outcomes by
New ways to apply infrastructure data for better business outcomesNew ways to apply infrastructure data for better business outcomes
New ways to apply infrastructure data for better business outcomesaccenture
1.5K views8 slides

Similar to Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse Analytics (Global Azure Norway 2021)(20)

Analytics in a Day Virtual Workshop by CCG
Analytics in a Day Virtual WorkshopAnalytics in a Day Virtual Workshop
Analytics in a Day Virtual Workshop
CCG264 views
Analytics in a Day Ft. Synapse Virtual Workshop by CCG
Analytics in a Day Ft. Synapse Virtual WorkshopAnalytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual Workshop
CCG129 views
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac... by Cathrine Wilhelmsen
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Analytics in a Day Ft. Synapse Virtual Workshop by CCG
Analytics in a Day Ft. Synapse Virtual WorkshopAnalytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual Workshop
CCG186 views
New ways to apply infrastructure data for better business outcomes by accenture
New ways to apply infrastructure data for better business outcomesNew ways to apply infrastructure data for better business outcomes
New ways to apply infrastructure data for better business outcomes
accenture1.5K views
Understanding Azure Data Factory: The What, When, and Why (NIC 2020) by Cathrine Wilhelmsen
Understanding Azure Data Factory: The What, When, and Why (NIC 2020)Understanding Azure Data Factory: The What, When, and Why (NIC 2020)
Understanding Azure Data Factory: The What, When, and Why (NIC 2020)
Cathrine Wilhelmsen1.8K views
Ingesting Click Data for Analytics by ClickMeter
Ingesting Click Data for AnalyticsIngesting Click Data for Analytics
Ingesting Click Data for Analytics
ClickMeter1.5K views
1 Introduction to Microsoft data platform analytics for release by Jen Stirrup
1 Introduction to Microsoft data platform analytics for release1 Introduction to Microsoft data platform analytics for release
1 Introduction to Microsoft data platform analytics for release
Jen Stirrup567 views
Pipelines and Packages: Introduction to Azure Data Factory (24HOP) by Cathrine Wilhelmsen
Pipelines and Packages: Introduction to Azure Data Factory (24HOP)Pipelines and Packages: Introduction to Azure Data Factory (24HOP)
Pipelines and Packages: Introduction to Azure Data Factory (24HOP)
Cathrine Wilhelmsen5.6K views
Smarter Analytics: Supporting the Enterprise with Automation by Inside Analysis
Smarter Analytics: Supporting the Enterprise with AutomationSmarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with Automation
Inside Analysis631 views
Pipelines and Packages: Introduction to Azure Data Factory (Techorama NL 2019) by Cathrine Wilhelmsen
Pipelines and Packages: Introduction to Azure Data Factory (Techorama NL 2019)Pipelines and Packages: Introduction to Azure Data Factory (Techorama NL 2019)
Pipelines and Packages: Introduction to Azure Data Factory (Techorama NL 2019)
Cathrine Wilhelmsen1.4K views
Analytics in a Day Ft. Synapse Virtual Workshop by CCG
Analytics in a Day Ft. Synapse Virtual WorkshopAnalytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual Workshop
CCG176 views
2022 Trends in Enterprise Analytics by DATAVERSITY
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
DATAVERSITY511 views
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai) by Denodo
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Denodo 112 views
Databricks on AWS.pptx by Wasm1953
Databricks on AWS.pptxDatabricks on AWS.pptx
Databricks on AWS.pptx
Wasm1953166 views
Trivadis Azure Data Lake by Trivadis
Trivadis Azure Data LakeTrivadis Azure Data Lake
Trivadis Azure Data Lake
Trivadis193 views

More from Cathrine Wilhelmsen

Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (D... by
Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (D...Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (D...
Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (D...Cathrine Wilhelmsen
47 views53 slides
Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (S... by
Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (S...Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (S...
Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (S...Cathrine Wilhelmsen
94 views51 slides
"I can't keep up!" - Turning Discomfort into Personal Growth in a Fast-Paced ... by
"I can't keep up!" - Turning Discomfort into Personal Growth in a Fast-Paced ..."I can't keep up!" - Turning Discomfort into Personal Growth in a Fast-Paced ...
"I can't keep up!" - Turning Discomfort into Personal Growth in a Fast-Paced ...Cathrine Wilhelmsen
97 views62 slides
Lessons Learned: Implementing Azure Synapse Analytics in a Rapidly-Changing S... by
Lessons Learned: Implementing Azure Synapse Analytics in a Rapidly-Changing S...Lessons Learned: Implementing Azure Synapse Analytics in a Rapidly-Changing S...
Lessons Learned: Implementing Azure Synapse Analytics in a Rapidly-Changing S...Cathrine Wilhelmsen
687 views49 slides
6 Tips for Building Confidence as a Public Speaker (SQLBits 2022) by
6 Tips for Building Confidence as a Public Speaker (SQLBits 2022)6 Tips for Building Confidence as a Public Speaker (SQLBits 2022)
6 Tips for Building Confidence as a Public Speaker (SQLBits 2022)Cathrine Wilhelmsen
119 views30 slides
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu... by
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...Cathrine Wilhelmsen
3.8K views53 slides

More from Cathrine Wilhelmsen(20)

Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (D... by Cathrine Wilhelmsen
Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (D...Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (D...
Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (D...
Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (S... by Cathrine Wilhelmsen
Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (S...Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (S...
Stressed, Depressed, or Burned Out? The Warning Signs You Shouldn't Ignore (S...
"I can't keep up!" - Turning Discomfort into Personal Growth in a Fast-Paced ... by Cathrine Wilhelmsen
"I can't keep up!" - Turning Discomfort into Personal Growth in a Fast-Paced ..."I can't keep up!" - Turning Discomfort into Personal Growth in a Fast-Paced ...
"I can't keep up!" - Turning Discomfort into Personal Growth in a Fast-Paced ...
Lessons Learned: Implementing Azure Synapse Analytics in a Rapidly-Changing S... by Cathrine Wilhelmsen
Lessons Learned: Implementing Azure Synapse Analytics in a Rapidly-Changing S...Lessons Learned: Implementing Azure Synapse Analytics in a Rapidly-Changing S...
Lessons Learned: Implementing Azure Synapse Analytics in a Rapidly-Changing S...
6 Tips for Building Confidence as a Public Speaker (SQLBits 2022) by Cathrine Wilhelmsen
6 Tips for Building Confidence as a Public Speaker (SQLBits 2022)6 Tips for Building Confidence as a Public Speaker (SQLBits 2022)
6 Tips for Building Confidence as a Public Speaker (SQLBits 2022)
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu... by Cathrine Wilhelmsen
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...
Cathrine Wilhelmsen3.8K views
Azure Data Factory for the SSIS Developer (SentryOne Webinar) by Cathrine Wilhelmsen
Azure Data Factory for the SSIS Developer (SentryOne Webinar)Azure Data Factory for the SSIS Developer (SentryOne Webinar)
Azure Data Factory for the SSIS Developer (SentryOne Webinar)
Cathrine Wilhelmsen1.1K views
Azure Synapse Analytics Teaser (Microsoft TechX Oslo 2019) by Cathrine Wilhelmsen
Azure Synapse Analytics Teaser (Microsoft TechX Oslo 2019)Azure Synapse Analytics Teaser (Microsoft TechX Oslo 2019)
Azure Synapse Analytics Teaser (Microsoft TechX Oslo 2019)
Cathrine Wilhelmsen1.1K views
Lessons Learned: Understanding Azure Data Factory Pricing (Microsoft Ignite 2... by Cathrine Wilhelmsen
Lessons Learned: Understanding Azure Data Factory Pricing (Microsoft Ignite 2...Lessons Learned: Understanding Azure Data Factory Pricing (Microsoft Ignite 2...
Lessons Learned: Understanding Azure Data Factory Pricing (Microsoft Ignite 2...
Cathrine Wilhelmsen15.8K views
Building Dynamic Data Pipelines in Azure Data Factory (Microsoft Ignite 2019) by Cathrine Wilhelmsen
Building Dynamic Data Pipelines in Azure Data Factory (Microsoft Ignite 2019)Building Dynamic Data Pipelines in Azure Data Factory (Microsoft Ignite 2019)
Building Dynamic Data Pipelines in Azure Data Factory (Microsoft Ignite 2019)
Cathrine Wilhelmsen4.1K views
Creating Visual Transformations in Azure Data Factory (dataMinds Connect) by Cathrine Wilhelmsen
Creating Visual Transformations in Azure Data Factory (dataMinds Connect)Creating Visual Transformations in Azure Data Factory (dataMinds Connect)
Creating Visual Transformations in Azure Data Factory (dataMinds Connect)
Cathrine Wilhelmsen1.2K views
Building Dynamic Pipelines in Azure Data Factory (Data Saturday Holland) by Cathrine Wilhelmsen
Building Dynamic Pipelines in Azure Data Factory (Data Saturday Holland)Building Dynamic Pipelines in Azure Data Factory (Data Saturday Holland)
Building Dynamic Pipelines in Azure Data Factory (Data Saturday Holland)
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019) by Cathrine Wilhelmsen
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Cathrine Wilhelmsen4.1K views
Building Dynamic Pipelines in Azure Data Factory (SQLSaturday Oslo) by Cathrine Wilhelmsen
Building Dynamic Pipelines in Azure Data Factory (SQLSaturday Oslo)Building Dynamic Pipelines in Azure Data Factory (SQLSaturday Oslo)
Building Dynamic Pipelines in Azure Data Factory (SQLSaturday Oslo)
Cathrine Wilhelmsen2.7K views
Uhms and Bunny Hands: Tips for Improving Your Presentation Skills (DataGrille... by Cathrine Wilhelmsen
Uhms and Bunny Hands: Tips for Improving Your Presentation Skills (DataGrille...Uhms and Bunny Hands: Tips for Improving Your Presentation Skills (DataGrille...
Uhms and Bunny Hands: Tips for Improving Your Presentation Skills (DataGrille...
Cathrine Wilhelmsen1.7K views
Biml Tips and Tricks: Not Just for SSIS Packages! (SQLBits 2019) by Cathrine Wilhelmsen
Biml Tips and Tricks: Not Just for SSIS Packages! (SQLBits 2019)Biml Tips and Tricks: Not Just for SSIS Packages! (SQLBits 2019)
Biml Tips and Tricks: Not Just for SSIS Packages! (SQLBits 2019)
Cathrine Wilhelmsen17.6K views
Data Integration through Data Virtualization (SQL Server Konferenz 2019) by Cathrine Wilhelmsen
Data Integration through Data Virtualization (SQL Server Konferenz 2019)Data Integration through Data Virtualization (SQL Server Konferenz 2019)
Data Integration through Data Virtualization (SQL Server Konferenz 2019)
Cathrine Wilhelmsen3.1K views
Deliver Your Modern Data Warehouse (Microsoft Tech Summit Oslo 2018) by Cathrine Wilhelmsen
Deliver Your Modern Data Warehouse (Microsoft Tech Summit Oslo 2018)Deliver Your Modern Data Warehouse (Microsoft Tech Summit Oslo 2018)
Deliver Your Modern Data Warehouse (Microsoft Tech Summit Oslo 2018)
Cathrine Wilhelmsen1.2K views
Level Up Your Biml: Best Practices and Coding Techniques (PASS Summit 2018) by Cathrine Wilhelmsen
Level Up Your Biml: Best Practices and Coding Techniques (PASS Summit 2018)Level Up Your Biml: Best Practices and Coding Techniques (PASS Summit 2018)
Level Up Your Biml: Best Practices and Coding Techniques (PASS Summit 2018)
Cathrine Wilhelmsen19.5K views
Uhms and Bunny Hands: Tips for Improving Your Presentation Skills (SQLSaturda... by Cathrine Wilhelmsen
Uhms and Bunny Hands: Tips for Improving Your Presentation Skills (SQLSaturda...Uhms and Bunny Hands: Tips for Improving Your Presentation Skills (SQLSaturda...
Uhms and Bunny Hands: Tips for Improving Your Presentation Skills (SQLSaturda...
Cathrine Wilhelmsen1.1K views

Recently uploaded

Employees attrition by
Employees attritionEmployees attrition
Employees attritionMaryAlejandraDiaz
7 views5 slides
Inawsidom - Data Journey by
Inawsidom - Data JourneyInawsidom - Data Journey
Inawsidom - Data JourneyPhilipBasford
9 views38 slides
Report on OSINT by
Report on OSINTReport on OSINT
Report on OSINTAyonDebnathCertified
6 views15 slides
AZConf 2023 - Considerations for LLMOps: Running LLMs in production by
AZConf 2023 - Considerations for LLMOps: Running LLMs in productionAZConf 2023 - Considerations for LLMOps: Running LLMs in production
AZConf 2023 - Considerations for LLMOps: Running LLMs in productionSARADINDU SENGUPTA
9 views16 slides
VoxelNet by
VoxelNetVoxelNet
VoxelNettaeseon ryu
20 views21 slides
Dr. Ousmane Badiane-2023 ReSAKSS Conference by
Dr. Ousmane Badiane-2023 ReSAKSS ConferenceDr. Ousmane Badiane-2023 ReSAKSS Conference
Dr. Ousmane Badiane-2023 ReSAKSS ConferenceAKADEMIYA2063
5 views34 slides

Recently uploaded(20)

AZConf 2023 - Considerations for LLMOps: Running LLMs in production by SARADINDU SENGUPTA
AZConf 2023 - Considerations for LLMOps: Running LLMs in productionAZConf 2023 - Considerations for LLMOps: Running LLMs in production
AZConf 2023 - Considerations for LLMOps: Running LLMs in production
Dr. Ousmane Badiane-2023 ReSAKSS Conference by AKADEMIYA2063
Dr. Ousmane Badiane-2023 ReSAKSS ConferenceDr. Ousmane Badiane-2023 ReSAKSS Conference
Dr. Ousmane Badiane-2023 ReSAKSS Conference
AKADEMIYA20635 views
DGIQ East 2023 AI Ethics SIG by Karen Lopez
DGIQ East 2023 AI Ethics SIGDGIQ East 2023 AI Ethics SIG
DGIQ East 2023 AI Ethics SIG
Karen Lopez5 views
Customer Data Cleansing Project.pptx by Nat O
Customer Data Cleansing Project.pptxCustomer Data Cleansing Project.pptx
Customer Data Cleansing Project.pptx
Nat O6 views
Analytics Center of Excellence | Data CoE |Analytics CoE| WNS Triange by RNayak3
Analytics Center of Excellence | Data CoE |Analytics CoE| WNS TriangeAnalytics Center of Excellence | Data CoE |Analytics CoE| WNS Triange
Analytics Center of Excellence | Data CoE |Analytics CoE| WNS Triange
RNayak35 views
4_4_WP_4_06_ND_Model.pptx by d6fmc6kwd4
4_4_WP_4_06_ND_Model.pptx4_4_WP_4_06_ND_Model.pptx
4_4_WP_4_06_ND_Model.pptx
d6fmc6kwd47 views
Data Journeys Hard Talk workshop final.pptx by info828217
Data Journeys Hard Talk workshop final.pptxData Journeys Hard Talk workshop final.pptx
Data Journeys Hard Talk workshop final.pptx
info82821711 views
Product Research sample.pdf by AllenSingson
Product Research sample.pdfProduct Research sample.pdf
Product Research sample.pdf
AllenSingson35 views
K-Drama Recommendation Using Python by FridaPutriassa
K-Drama Recommendation Using PythonK-Drama Recommendation Using Python
K-Drama Recommendation Using Python
FridaPutriassa7 views
GDG Cloud Community Day 2022 - Managing data quality in Machine Learning by SARADINDU SENGUPTA
GDG Cloud Community Day 2022 -  Managing data quality in Machine LearningGDG Cloud Community Day 2022 -  Managing data quality in Machine Learning
GDG Cloud Community Day 2022 - Managing data quality in Machine Learning
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion by Bertram Ludäscher
Games, Queries, and Argumentation Frameworks: Time for a Family ReunionGames, Queries, and Argumentation Frameworks: Time for a Family Reunion
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion
CRM stick or twist workshop by info828217
CRM stick or twist workshopCRM stick or twist workshop
CRM stick or twist workshop
info82821714 views

Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse Analytics (Global Azure Norway 2021)

  • 1. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) Pipelines & Data Flows: Introduction to Data Integration in Azure Synapse Analytics Cathrine Wilhelmsen Global Azure Norway · April 16th, 2021
  • 2. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) Session Abstract Do you regularly need to get data for your projects? Yep! 🙋‍♀️ Data is at the core of every Business Intelligence, Data Science, and Machine Learning project. You need data to understand what has happened in the past, to predict what may happen in the future, to discover patterns and anomalies, and to gain the insight necessary for making faster and better decisions. But before you can do any of those things, you need to ingest, store, transform, integrate, and prepare your data. Guess what? You can do all of those things in Azure Synapse Analytics – without having to write any code! In this session, we will cover the fundamentals of data integration in Azure Synapse Analytics. First, we will discuss when Azure Synapse Analytics is the right tool of choice. Then, we will go through what Pipelines and Data Flows are, and when to use them. Finally, we will see how easy it is to ingest and transform data both on-premises and in the cloud.
  • 3. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) @cathrinew cathrinew.net
  • 4. © 2020 Cathrine Wilhelmsen (hi@cathrinew.net) Data Warehousing Business Intelligence Artificial Intelligence Big Data and Analytics Machine Learning Data Science
  • 5. © 2020 Cathrine Wilhelmsen (hi@cathrinew.net) Data Warehousing Business Intelligence Artificial Intelligence Big Data and Analytics Machine Learning Data Science
  • 6. © 2020 Cathrine Wilhelmsen (hi@cathrinew.net) What? When? Why?
  • 7. © 2020 Cathrine Wilhelmsen (hi@cathrinew.net) Ingest Store Transform Integrate Prepare
  • 8. © 2020 Cathrine Wilhelmsen (hi@cathrinew.net)
  • 9. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) Azure Synapse Analytics What? Pipelines & Data Flows How? Data Integration When? …the next 45 minutes…
  • 11. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) What is Azure Synapse Analytics? All-in-one platform for analytical projects: • Data Lake (All Data) • Data Warehouse (Relational Data) • Data Analytics (SQL, Spark) • Data Integration (Ingest, Transform)
  • 12. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) What is Azure Synapse Analytics? Deeply integrated with other services: • Azure Purview • Azure Machine Learning • Azure Cosmos DB • Power BI
  • 13. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) Who can use Azure Synapse Analytics? Built for collaboration between: • Data Engineers • Data Analysts • Data Scientists • Data Consumers
  • 14. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) Integration Data
  • 15. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) Data Integration in Azure Synapse Analytics Code-First Scripts, Notebooks Designer-First Pipelines, Data Flows
  • 16. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) Data Integration in Azure Synapse Analytics Copy Data Transform Data
  • 18. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) What are Pipelines?
  • 19. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) What are Activities?
  • 20. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) What are Datasets?
  • 21. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) What are Linked Services?
  • 22. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) What are Data Flows?
  • 23. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) What are Triggers?
  • 25. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net)
  • 26. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) Code-First Scripts, Notebooks Designer-First Pipelines, Data Flows
  • 27. ? ?
  • 28. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) Azure Synapse or Azure Data Factory?
  • 29. © 2021 Cathrine Wilhelmsen (hi@cathrinew.net) @cathrinew cathrinew.net hi@cathrinew.net cathrinew.net/adf