Data Warehouse with Fabric on data lakehouse

Marco Pozzan
Marco PozzanMicrosoft MVP 2015 for Data Platform
Sabato 30 settembre 2023
Marco Pozzan
• Consulente e formatore in ambito business intelligence
• Nel 2020 Fondatore e CTO start-up per analisi dati e per
insurance data analytics per le compagnie assicurative
Dal 2017 mi occupo di architetture big Data e in generale di tutta
la proposizione data platform di Microsoft.
• Docente all'Università di Pordenone per i corsi IFTS di analisi
Big Data
• Community Lead di 1nn0va (www.innovazionefvg.net)
• MCP,MCSA,MCSE dal 2017 MCT e dal 2014 MVP per SQL Server e
relatore in diverse conferenze sul tema.
• marco.pozzan@regolofarm.com
• @marcopozzan.it
• www.marcopozzan.it
• http://www.scoop.it/u/marco-pozzan
• http://paper.li/marcopozzan/1422524394
Data Warehouse with Fabric
on data lakehouse
Data Warehouse with Fabric on data lakehouse
Data Warehouse with Fabric on data lakehouse
Data Warehouse with Fabric on data lakehouse
Data Warehouse with Fabric on data lakehouse
Data Warehouse with Fabric on data lakehouse
Data Warehouse with Fabric on data lakehouse
Data Warehouse with Fabric on data lakehouse
Data Warehouse with Fabric on data lakehouse
Databricks recommends taking a layered approach to creating a single
source of truth for enterprise data products.
This architecture ensures atomicity, consistency, isolation, and durability
as data passes through multiple levels of validations and transformations
before being stored in a layout optimized for efficient analysis.
It is important to note that this medallion architecture is not a
substitute for other dimensional modeling techniques.The schemas and
tables within each tier can take a variety of forms and degrees of
normalization depending on the frequency and nature of data updates and
downstream use cases for the data.
Medallion Architecture
Medallion Architecture
I termini bronzo (grezzo), argento (convalidato) e oro (arricchito) descrivono la qualità
dei dati in ciascuno di questi livelli.
Demo
Demo – token SAS generation
?sv=2020-02-10&st=2023-09-29T04%3A46%3A14Z&se=2023-10-
29T05%3A46%3A00Z&sr=c&sp=rle&sig=uds%2BXKBHBBkelOkOFjeOblWlxQ5vPWBk0unBtysXgeY%3D
Demo – Shortcut configuration for bronze layer (part 1)
https://dataengineerrowdata.dfs.core.windows.net
/1nn0vasat
Demo – Shortcut configuration for bronze layer (part 2)
• https://dataengineerrowdata.dfs.core.windows.net/1nn0vasat
• LinkEsternoWWI
Demo – Shortcut configuration for bronze layer (part 3)
• Ext_raw_data_WWI
• /1nn0vasat/Files/
Demo – Load silver layer with Notebook
Demo – Load silver layer with Notebook
Demo – Test table
SELECT YEAR(SO.OrderDate) AS OrderDateYear,
COUNT(SO.OrderDate) AS TotalOrderCount
FROM [DWH].[dbo].[silver_salesorder] SO
GROUP BY YEAR(SO.OrderDate);
SELECT ISNULL(C.ColorName,'No Colour') AS ColourName,
SUM(cast(SOL.Quantity as int)) AS TotalOrderLineQuantity,
SUM(cast(SOL.UnitPrice as numeric)) AS TotalOrderLineUnitPrice
FROM [DWH].[dbo].[silver_salesorderline] SOL
INNER JOIN [DWH].[dbo].[silver_stockitems] SI ON SI.StockItemID = SOL.StockItemID
LEFT JOIN [DWH].[dbo].[silver_colors] C ON C.ColorID = SI.ColorID
GROUP BY ISNULL(C.ColorName,'No Colour’);
SELECT
YEAR(SO.OrderDate) AS OrderDateYear,
SC.SupplierCategoryName,
SUM(cast(SOL.Quantity as int)) AS TotalOrderLineQuantity,
SUM(cast(SOL.UnitPrice as numeric)) AS TotalOrderLineUnitPrice
FROM [DWH].[dbo].[silver_salesorderline] SOL
INNER JOIN [DWH].[dbo].[silver_salesorder] SO ON SO.OrderID = SOL.OrderID
INNER JOIN [DWH].[dbo].[silver_stockitems] SI ON SI.StockItemID = SOL.StockItemID
INNER JOIN [DWH].[dbo].[silver_suppliers] S ON SI.SupplierID = S.SupplierID
INNER JOIN [DWH].[dbo].[silver_categories] SC ON SC.SupplierCategoryID = S.SupplierCategoryID
GROUP BY YEAR(SO.OrderDate),
SC.SupplierCategoryName;
Demo – Load silver layer with DataFlow (Part 1)
Demo – Load silver layer with DataFlow (Part 1)
For each table i can set sink into deltalake
Demo – Load silver layer with Pipeline (Part 1 define variables)
["Countries","DeliveryMethods","People","StateProvince
s","SupplierCategories","Suppliers","BuyingGroups","Cu
stomerCategories","Customers","Colors","PackageTypes",
"StockItems","Cities","Date"]
Set variables with this array
Demo – Load silver layer with Pipeline (Part 2 copy activity source)
@variables('FileName')
@concat(item(),'.csv')
Demo – Load silver layer with Pipeline (Part 3 copy activity sink)
@concat('silver_' ,item())
Demo – Load silver layer with Pipeline (Part 4 copy activity source)
Ext_raw_data_WWI/SalesOrder/*
Demo – Load silver layer with Pipeline (Part 5 copy activity sink)
Demo – Load silver layer with Pipeline (Part 6 copy activity source)
Ext_raw_data_WWI/SalesOrderLine/*
Demo – Load silver layer with Pipeline (Part 7 copy activity sink)
Demo – Load gold layer with sql queries
Demo – Load gold layer with sql queries
Demo – Create Serving Layer
1 of 33

Recommended

コンテナとimmutableとわたし。あとセキュリティ。(Kubernetes Novice Tokyo #15 発表資料) by
コンテナとimmutableとわたし。あとセキュリティ。(Kubernetes Novice Tokyo #15 発表資料)コンテナとimmutableとわたし。あとセキュリティ。(Kubernetes Novice Tokyo #15 発表資料)
コンテナとimmutableとわたし。あとセキュリティ。(Kubernetes Novice Tokyo #15 発表資料)NTT DATA Technology & Innovation
964 views73 slides
VMware HCI solutions - 2020-01-16 by
VMware HCI solutions - 2020-01-16VMware HCI solutions - 2020-01-16
VMware HCI solutions - 2020-01-16David Pasek
1.5K views60 slides
HA/DR options with SQL Server in Azure and hybrid by
HA/DR options with SQL Server in Azure and hybridHA/DR options with SQL Server in Azure and hybrid
HA/DR options with SQL Server in Azure and hybridJames Serra
13.8K views30 slides
Vitualization - Công nghệ ảo hóa. by
Vitualization - Công nghệ ảo hóa.Vitualization - Công nghệ ảo hóa.
Vitualization - Công nghệ ảo hóa.ThaiLeHuy1
123 views21 slides
Clean architecture by
Clean architectureClean architecture
Clean architectureLieven Doclo
413 views38 slides
Verrazzanoご紹介 by
Verrazzanoご紹介Verrazzanoご紹介
Verrazzanoご紹介オラクルエンジニア通信
916 views24 slides

More Related Content

What's hot

Percona Live 2022 - MySQL Shell for Visual Studio Code by
Percona Live 2022 - MySQL Shell for Visual Studio CodePercona Live 2022 - MySQL Shell for Visual Studio Code
Percona Live 2022 - MySQL Shell for Visual Studio CodeFrederic Descamps
432 views75 slides
YugabyteDBを使ってみよう(NewSQL/分散SQLデータベースよろず勉強会 #1 発表資料) by
YugabyteDBを使ってみよう(NewSQL/分散SQLデータベースよろず勉強会 #1 発表資料)YugabyteDBを使ってみよう(NewSQL/分散SQLデータベースよろず勉強会 #1 発表資料)
YugabyteDBを使ってみよう(NewSQL/分散SQLデータベースよろず勉強会 #1 発表資料)NTT DATA Technology & Innovation
3K views27 slides
【ことはじめ】 はじめてのコンテナ活用をクラウドで by
【ことはじめ】 はじめてのコンテナ活用をクラウドで【ことはじめ】 はじめてのコンテナ活用をクラウドで
【ことはじめ】 はじめてのコンテナ活用をクラウドでオラクルエンジニア通信
460 views43 slides
L2延伸を利用したクラウド移行とクラウド活用術 by
L2延伸を利用したクラウド移行とクラウド活用術L2延伸を利用したクラウド移行とクラウド活用術
L2延伸を利用したクラウド移行とクラウド活用術富士通クラウドテクノロジーズ株式会社
3.2K views18 slides
Achieve Extreme Simplicity and Superior Price/Performance with Greenplum Buil... by
Achieve Extreme Simplicity and Superior Price/Performance with Greenplum Buil...Achieve Extreme Simplicity and Superior Price/Performance with Greenplum Buil...
Achieve Extreme Simplicity and Superior Price/Performance with Greenplum Buil...VMware Tanzu
742 views18 slides
Wars of MySQL Cluster ( InnoDB Cluster VS Galera ) by
Wars of MySQL Cluster ( InnoDB Cluster VS Galera ) Wars of MySQL Cluster ( InnoDB Cluster VS Galera )
Wars of MySQL Cluster ( InnoDB Cluster VS Galera ) Mydbops
2.5K views42 slides

What's hot(20)

Percona Live 2022 - MySQL Shell for Visual Studio Code by Frederic Descamps
Percona Live 2022 - MySQL Shell for Visual Studio CodePercona Live 2022 - MySQL Shell for Visual Studio Code
Percona Live 2022 - MySQL Shell for Visual Studio Code
Frederic Descamps432 views
YugabyteDBを使ってみよう(NewSQL/分散SQLデータベースよろず勉強会 #1 発表資料) by NTT DATA Technology & Innovation
YugabyteDBを使ってみよう(NewSQL/分散SQLデータベースよろず勉強会 #1 発表資料)YugabyteDBを使ってみよう(NewSQL/分散SQLデータベースよろず勉強会 #1 発表資料)
YugabyteDBを使ってみよう(NewSQL/分散SQLデータベースよろず勉強会 #1 発表資料)
Achieve Extreme Simplicity and Superior Price/Performance with Greenplum Buil... by VMware Tanzu
Achieve Extreme Simplicity and Superior Price/Performance with Greenplum Buil...Achieve Extreme Simplicity and Superior Price/Performance with Greenplum Buil...
Achieve Extreme Simplicity and Superior Price/Performance with Greenplum Buil...
VMware Tanzu742 views
Wars of MySQL Cluster ( InnoDB Cluster VS Galera ) by Mydbops
Wars of MySQL Cluster ( InnoDB Cluster VS Galera ) Wars of MySQL Cluster ( InnoDB Cluster VS Galera )
Wars of MySQL Cluster ( InnoDB Cluster VS Galera )
Mydbops2.5K views
Oracle RAC 19c with Standard Edition (SE) 2 - Support Update by Markus Michalewicz
Oracle RAC 19c with Standard Edition (SE) 2 - Support UpdateOracle RAC 19c with Standard Edition (SE) 2 - Support Update
Oracle RAC 19c with Standard Edition (SE) 2 - Support Update
Markus Michalewicz6.3K views
Introduction to Docker Containers - Docker Captain by Ajeet Singh Raina
Introduction to Docker Containers - Docker CaptainIntroduction to Docker Containers - Docker Captain
Introduction to Docker Containers - Docker Captain
Ajeet Singh Raina3.8K views
Creating AWS infrastructure using Terraform by Knoldus Inc.
Creating AWS infrastructure using TerraformCreating AWS infrastructure using Terraform
Creating AWS infrastructure using Terraform
Knoldus Inc.288 views
Oracle databáze – Konsolidovaná Data Management Platforma by MarketingArrowECS_CZ
Oracle databáze – Konsolidovaná Data Management PlatformaOracle databáze – Konsolidovaná Data Management Platforma
Oracle databáze – Konsolidovaná Data Management Platforma
TFE - Sécurité des architectures en conteneurs Docker orchestrés par Kubernetes by ManuelMareschal
TFE -  Sécurité des architectures en conteneurs Docker orchestrés par KubernetesTFE -  Sécurité des architectures en conteneurs Docker orchestrés par Kubernetes
TFE - Sécurité des architectures en conteneurs Docker orchestrés par Kubernetes
ManuelMareschal542 views
[JJUG CCC 2021 Spring]Eclipse ユーザのための VSCode のススメ by Satoshi Takami
[JJUG CCC 2021 Spring]Eclipse ユーザのための VSCode のススメ[JJUG CCC 2021 Spring]Eclipse ユーザのための VSCode のススメ
[JJUG CCC 2021 Spring]Eclipse ユーザのための VSCode のススメ
Satoshi Takami847 views
MySQL InnoDB Cluster and NDB Cluster by Mario Beck
MySQL InnoDB Cluster and NDB ClusterMySQL InnoDB Cluster and NDB Cluster
MySQL InnoDB Cluster and NDB Cluster
Mario Beck3.8K views
MySQL Connectors 8.0.19 & DNS SRV by Kenny Gryp
MySQL Connectors 8.0.19 & DNS SRVMySQL Connectors 8.0.19 & DNS SRV
MySQL Connectors 8.0.19 & DNS SRV
Kenny Gryp1.4K views
(本当に)一般的なご自宅向けのNutanix CE by AkiraMasago
(本当に)一般的なご自宅向けのNutanix CE(本当に)一般的なご自宅向けのNutanix CE
(本当に)一般的なご自宅向けのNutanix CE
AkiraMasago806 views
Clean architecture by .NET Crowd
Clean architectureClean architecture
Clean architecture
.NET Crowd3.5K views

Similar to Data Warehouse with Fabric on data lakehouse

LoQutus: A deep-dive into Microsoft Power BI by
LoQutus: A deep-dive into Microsoft Power BILoQutus: A deep-dive into Microsoft Power BI
LoQutus: A deep-dive into Microsoft Power BILoQutus
831 views43 slides
SQL Server Reporting Services Training by
SQL Server Reporting Services TrainingSQL Server Reporting Services Training
SQL Server Reporting Services TrainingRainer Stropek
2K views92 slides
Uncovering SQL Server query problems with execution plans - Tony Davis by
Uncovering SQL Server query problems with execution plans - Tony DavisUncovering SQL Server query problems with execution plans - Tony Davis
Uncovering SQL Server query problems with execution plans - Tony DavisRed Gate Software
961 views63 slides
Date dimension table - part II by
Date dimension table - part IIDate dimension table - part II
Date dimension table - part IIDirk Cludts
667 views15 slides
Freeing Yourself from an RDBMS Architecture by
Freeing Yourself from an RDBMS ArchitectureFreeing Yourself from an RDBMS Architecture
Freeing Yourself from an RDBMS ArchitectureDavid Hoerster
998 views30 slides
Introduction to PolyBase by
Introduction to PolyBaseIntroduction to PolyBase
Introduction to PolyBaseJames Serra
5K views24 slides

Similar to Data Warehouse with Fabric on data lakehouse(20)

LoQutus: A deep-dive into Microsoft Power BI by LoQutus
LoQutus: A deep-dive into Microsoft Power BILoQutus: A deep-dive into Microsoft Power BI
LoQutus: A deep-dive into Microsoft Power BI
LoQutus831 views
SQL Server Reporting Services Training by Rainer Stropek
SQL Server Reporting Services TrainingSQL Server Reporting Services Training
SQL Server Reporting Services Training
Rainer Stropek2K views
Uncovering SQL Server query problems with execution plans - Tony Davis by Red Gate Software
Uncovering SQL Server query problems with execution plans - Tony DavisUncovering SQL Server query problems with execution plans - Tony Davis
Uncovering SQL Server query problems with execution plans - Tony Davis
Red Gate Software961 views
Date dimension table - part II by Dirk Cludts
Date dimension table - part IIDate dimension table - part II
Date dimension table - part II
Dirk Cludts667 views
Freeing Yourself from an RDBMS Architecture by David Hoerster
Freeing Yourself from an RDBMS ArchitectureFreeing Yourself from an RDBMS Architecture
Freeing Yourself from an RDBMS Architecture
David Hoerster998 views
Introduction to PolyBase by James Serra
Introduction to PolyBaseIntroduction to PolyBase
Introduction to PolyBase
James Serra5K views
Trivadis TechEvent 2016 Introduction to DataStax Enterprise (DSE) Graph by Gu... by Trivadis
Trivadis TechEvent 2016 Introduction to DataStax Enterprise (DSE) Graph by Gu...Trivadis TechEvent 2016 Introduction to DataStax Enterprise (DSE) Graph by Gu...
Trivadis TechEvent 2016 Introduction to DataStax Enterprise (DSE) Graph by Gu...
Trivadis644 views
Extreme Salesforce Data Volumes Webinar (with Speaker Notes) by Salesforce Developers
Extreme Salesforce Data Volumes Webinar (with Speaker Notes)Extreme Salesforce Data Volumes Webinar (with Speaker Notes)
Extreme Salesforce Data Volumes Webinar (with Speaker Notes)
[MongoDB.local Bengaluru 2018] Keynote by MongoDB
[MongoDB.local Bengaluru 2018] Keynote[MongoDB.local Bengaluru 2018] Keynote
[MongoDB.local Bengaluru 2018] Keynote
MongoDB347 views
Moving from SQL Server to MongoDB by Nick Court
Moving from SQL Server to MongoDBMoving from SQL Server to MongoDB
Moving from SQL Server to MongoDB
Nick Court14.4K views
Neo4j GraphTour New York_EY Presentation_Michael Moore by Neo4j
Neo4j GraphTour New York_EY Presentation_Michael MooreNeo4j GraphTour New York_EY Presentation_Michael Moore
Neo4j GraphTour New York_EY Presentation_Michael Moore
Neo4j348 views
Kevin Bengtson Portfolio by Kbengt521
Kevin Bengtson PortfolioKevin Bengtson Portfolio
Kevin Bengtson Portfolio
Kbengt521466 views
Your Roadmap for An Enterprise Graph Strategy by Neo4j
Your Roadmap for An Enterprise Graph StrategyYour Roadmap for An Enterprise Graph Strategy
Your Roadmap for An Enterprise Graph Strategy
Neo4j1.2K views
Microsoft SQL Server Analysis Services (SSAS) - A Practical Introduction by Mark Ginnebaugh
Microsoft SQL Server Analysis Services (SSAS) - A Practical Introduction Microsoft SQL Server Analysis Services (SSAS) - A Practical Introduction
Microsoft SQL Server Analysis Services (SSAS) - A Practical Introduction
Mark Ginnebaugh4.3K views
Data science guide for PASS Summit 2014 by Mark Tabladillo
Data science guide for PASS Summit 2014Data science guide for PASS Summit 2014
Data science guide for PASS Summit 2014
Mark Tabladillo1.6K views
Creating a Single View Part 1: Overview and Data Analysis by MongoDB
Creating a Single View Part 1: Overview and Data AnalysisCreating a Single View Part 1: Overview and Data Analysis
Creating a Single View Part 1: Overview and Data Analysis
MongoDB2.2K views
Creating a Single View: Overview and Analysis by MongoDB
Creating a Single View: Overview and AnalysisCreating a Single View: Overview and Analysis
Creating a Single View: Overview and Analysis
MongoDB1.4K views
ADL/U-SQL Introduction (SQLBits 2016) by Michael Rys
ADL/U-SQL Introduction (SQLBits 2016)ADL/U-SQL Introduction (SQLBits 2016)
ADL/U-SQL Introduction (SQLBits 2016)
Michael Rys1.2K views

More from Marco Pozzan

Data modelling for Power BI by
Data modelling for Power BIData modelling for Power BI
Data modelling for Power BIMarco Pozzan
16 views53 slides
SlideModellingDataSat.pdf by
SlideModellingDataSat.pdfSlideModellingDataSat.pdf
SlideModellingDataSat.pdfMarco Pozzan
93 views51 slides
Datamart.pdf by
Datamart.pdfDatamart.pdf
Datamart.pdfMarco Pozzan
23 views16 slides
Datamart.pptx by
Datamart.pptxDatamart.pptx
Datamart.pptxMarco Pozzan
268 views15 slides
Quanto mi costa SQL Pool Serverless Synapse by
Quanto mi costa SQL Pool Serverless SynapseQuanto mi costa SQL Pool Serverless Synapse
Quanto mi costa SQL Pool Serverless SynapseMarco Pozzan
110 views21 slides
Power BI: Introduzione ai dataflow e alla preparazione dei dati self-service by
Power BI: Introduzione ai dataflow e alla preparazione dei dati self-servicePower BI: Introduzione ai dataflow e alla preparazione dei dati self-service
Power BI: Introduzione ai dataflow e alla preparazione dei dati self-serviceMarco Pozzan
160 views42 slides

More from Marco Pozzan(20)

Data modelling for Power BI by Marco Pozzan
Data modelling for Power BIData modelling for Power BI
Data modelling for Power BI
Marco Pozzan16 views
SlideModellingDataSat.pdf by Marco Pozzan
SlideModellingDataSat.pdfSlideModellingDataSat.pdf
SlideModellingDataSat.pdf
Marco Pozzan93 views
Quanto mi costa SQL Pool Serverless Synapse by Marco Pozzan
Quanto mi costa SQL Pool Serverless SynapseQuanto mi costa SQL Pool Serverless Synapse
Quanto mi costa SQL Pool Serverless Synapse
Marco Pozzan110 views
Power BI: Introduzione ai dataflow e alla preparazione dei dati self-service by Marco Pozzan
Power BI: Introduzione ai dataflow e alla preparazione dei dati self-servicePower BI: Introduzione ai dataflow e alla preparazione dei dati self-service
Power BI: Introduzione ai dataflow e alla preparazione dei dati self-service
Marco Pozzan160 views
Microsoft Power BI fast with aggregation and composite model by Marco Pozzan
Microsoft Power BI fast with aggregation and composite modelMicrosoft Power BI fast with aggregation and composite model
Microsoft Power BI fast with aggregation and composite model
Marco Pozzan118 views
REAL TIME ANALYTICS INFRASTRUCTURE WITH AZURE by Marco Pozzan
REAL TIME ANALYTICS INFRASTRUCTURE WITH AZUREREAL TIME ANALYTICS INFRASTRUCTURE WITH AZURE
REAL TIME ANALYTICS INFRASTRUCTURE WITH AZURE
Marco Pozzan153 views
Big data analytics quanto vale e come sfruttarlo con stream analytics e power bi by Marco Pozzan
Big data analytics quanto vale e come sfruttarlo con stream analytics e power biBig data analytics quanto vale e come sfruttarlo con stream analytics e power bi
Big data analytics quanto vale e come sfruttarlo con stream analytics e power bi
Marco Pozzan182 views
What is in reality a DAX filter context by Marco Pozzan
What is in reality a DAX filter contextWhat is in reality a DAX filter context
What is in reality a DAX filter context
Marco Pozzan1.8K views
Azure saturday pn 2018 by Marco Pozzan
Azure saturday pn 2018Azure saturday pn 2018
Azure saturday pn 2018
Marco Pozzan122 views
Power bi Clean and Modelling (SQL Saturday #675) by Marco Pozzan
Power bi Clean and Modelling  (SQL Saturday #675)Power bi Clean and Modelling  (SQL Saturday #675)
Power bi Clean and Modelling (SQL Saturday #675)
Marco Pozzan123 views
Power BI and business application platform by Marco Pozzan
Power BI and business application platformPower BI and business application platform
Power BI and business application platform
Marco Pozzan231 views

Recently uploaded

Ukraine Infographic_22NOV2023_v2.pdf by
Ukraine Infographic_22NOV2023_v2.pdfUkraine Infographic_22NOV2023_v2.pdf
Ukraine Infographic_22NOV2023_v2.pdfAnastosiyaGurin
1.4K views3 slides
Data about the sector workshop by
Data about the sector workshopData about the sector workshop
Data about the sector workshopinfo828217
16 views27 slides
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation by
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented GenerationDataScienceConferenc1
17 views29 slides
[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ... by
[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...
[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...DataScienceConferenc1
6 views15 slides
Infomatica-MDM.pptx by
Infomatica-MDM.pptxInfomatica-MDM.pptx
Infomatica-MDM.pptxKapil Rangwani
11 views16 slides
Short Story Assignment by Kelly Nguyen by
Short Story Assignment by Kelly NguyenShort Story Assignment by Kelly Nguyen
Short Story Assignment by Kelly Nguyenkellynguyen01
19 views17 slides

Recently uploaded(20)

Ukraine Infographic_22NOV2023_v2.pdf by AnastosiyaGurin
Ukraine Infographic_22NOV2023_v2.pdfUkraine Infographic_22NOV2023_v2.pdf
Ukraine Infographic_22NOV2023_v2.pdf
AnastosiyaGurin1.4K views
Data about the sector workshop by info828217
Data about the sector workshopData about the sector workshop
Data about the sector workshop
info82821716 views
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation by DataScienceConferenc1
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation
[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ... by DataScienceConferenc1
[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...
[DSC Europe 23][AI:CSI] Aleksa Stojanovic - Applying AI for Threat Detection ...
Short Story Assignment by Kelly Nguyen by kellynguyen01
Short Story Assignment by Kelly NguyenShort Story Assignment by Kelly Nguyen
Short Story Assignment by Kelly Nguyen
kellynguyen0119 views
SUPER STORE SQL PROJECT.pptx by khan888620
SUPER STORE SQL PROJECT.pptxSUPER STORE SQL PROJECT.pptx
SUPER STORE SQL PROJECT.pptx
khan88862013 views
Chapter 3b- Process Communication (1) (1)(1) (1).pptx by ayeshabaig2004
Chapter 3b- Process Communication (1) (1)(1) (1).pptxChapter 3b- Process Communication (1) (1)(1) (1).pptx
Chapter 3b- Process Communication (1) (1)(1) (1).pptx
ayeshabaig20047 views
CRM stick or twist workshop by info828217
CRM stick or twist workshopCRM stick or twist workshop
CRM stick or twist workshop
info82821712 views
Cross-network in Google Analytics 4.pdf by GA4 Tutorials
Cross-network in Google Analytics 4.pdfCross-network in Google Analytics 4.pdf
Cross-network in Google Analytics 4.pdf
GA4 Tutorials6 views
[DSC Europe 23] Luca Morena - From Psychohistory to Curious Machines by DataScienceConferenc1
[DSC Europe 23] Luca Morena - From Psychohistory to Curious Machines[DSC Europe 23] Luca Morena - From Psychohistory to Curious Machines
[DSC Europe 23] Luca Morena - From Psychohistory to Curious Machines
PRIVACY AWRE PERSONAL DATA STORAGE by antony420421
PRIVACY AWRE PERSONAL DATA STORAGEPRIVACY AWRE PERSONAL DATA STORAGE
PRIVACY AWRE PERSONAL DATA STORAGE
antony4204217 views
[DSC Europe 23] Rania Wazir - Opening up the box: the complexity of human int... by DataScienceConferenc1
[DSC Europe 23] Rania Wazir - Opening up the box: the complexity of human int...[DSC Europe 23] Rania Wazir - Opening up the box: the complexity of human int...
[DSC Europe 23] Rania Wazir - Opening up the box: the complexity of human int...
4_4_WP_4_06_ND_Model.pptx by d6fmc6kwd4
4_4_WP_4_06_ND_Model.pptx4_4_WP_4_06_ND_Model.pptx
4_4_WP_4_06_ND_Model.pptx
d6fmc6kwd47 views
OECD-Persol Holdings Workshop on Advancing Employee Well-being in Business an... by StatsCommunications
OECD-Persol Holdings Workshop on Advancing Employee Well-being in Business an...OECD-Persol Holdings Workshop on Advancing Employee Well-being in Business an...
OECD-Persol Holdings Workshop on Advancing Employee Well-being in Business an...
UNEP FI CRS Climate Risk Results.pptx by pekka28
UNEP FI CRS Climate Risk Results.pptxUNEP FI CRS Climate Risk Results.pptx
UNEP FI CRS Climate Risk Results.pptx
pekka2811 views

Data Warehouse with Fabric on data lakehouse

  • 2. Marco Pozzan • Consulente e formatore in ambito business intelligence • Nel 2020 Fondatore e CTO start-up per analisi dati e per insurance data analytics per le compagnie assicurative Dal 2017 mi occupo di architetture big Data e in generale di tutta la proposizione data platform di Microsoft. • Docente all'Università di Pordenone per i corsi IFTS di analisi Big Data • Community Lead di 1nn0va (www.innovazionefvg.net) • MCP,MCSA,MCSE dal 2017 MCT e dal 2014 MVP per SQL Server e relatore in diverse conferenze sul tema. • marco.pozzan@regolofarm.com • @marcopozzan.it • www.marcopozzan.it • http://www.scoop.it/u/marco-pozzan • http://paper.li/marcopozzan/1422524394
  • 3. Data Warehouse with Fabric on data lakehouse
  • 12. Databricks recommends taking a layered approach to creating a single source of truth for enterprise data products. This architecture ensures atomicity, consistency, isolation, and durability as data passes through multiple levels of validations and transformations before being stored in a layout optimized for efficient analysis. It is important to note that this medallion architecture is not a substitute for other dimensional modeling techniques.The schemas and tables within each tier can take a variety of forms and degrees of normalization depending on the frequency and nature of data updates and downstream use cases for the data. Medallion Architecture
  • 13. Medallion Architecture I termini bronzo (grezzo), argento (convalidato) e oro (arricchito) descrivono la qualità dei dati in ciascuno di questi livelli.
  • 14. Demo
  • 15. Demo – token SAS generation ?sv=2020-02-10&st=2023-09-29T04%3A46%3A14Z&se=2023-10- 29T05%3A46%3A00Z&sr=c&sp=rle&sig=uds%2BXKBHBBkelOkOFjeOblWlxQ5vPWBk0unBtysXgeY%3D
  • 16. Demo – Shortcut configuration for bronze layer (part 1) https://dataengineerrowdata.dfs.core.windows.net /1nn0vasat
  • 17. Demo – Shortcut configuration for bronze layer (part 2) • https://dataengineerrowdata.dfs.core.windows.net/1nn0vasat • LinkEsternoWWI
  • 18. Demo – Shortcut configuration for bronze layer (part 3) • Ext_raw_data_WWI • /1nn0vasat/Files/
  • 19. Demo – Load silver layer with Notebook
  • 20. Demo – Load silver layer with Notebook
  • 21. Demo – Test table SELECT YEAR(SO.OrderDate) AS OrderDateYear, COUNT(SO.OrderDate) AS TotalOrderCount FROM [DWH].[dbo].[silver_salesorder] SO GROUP BY YEAR(SO.OrderDate); SELECT ISNULL(C.ColorName,'No Colour') AS ColourName, SUM(cast(SOL.Quantity as int)) AS TotalOrderLineQuantity, SUM(cast(SOL.UnitPrice as numeric)) AS TotalOrderLineUnitPrice FROM [DWH].[dbo].[silver_salesorderline] SOL INNER JOIN [DWH].[dbo].[silver_stockitems] SI ON SI.StockItemID = SOL.StockItemID LEFT JOIN [DWH].[dbo].[silver_colors] C ON C.ColorID = SI.ColorID GROUP BY ISNULL(C.ColorName,'No Colour’); SELECT YEAR(SO.OrderDate) AS OrderDateYear, SC.SupplierCategoryName, SUM(cast(SOL.Quantity as int)) AS TotalOrderLineQuantity, SUM(cast(SOL.UnitPrice as numeric)) AS TotalOrderLineUnitPrice FROM [DWH].[dbo].[silver_salesorderline] SOL INNER JOIN [DWH].[dbo].[silver_salesorder] SO ON SO.OrderID = SOL.OrderID INNER JOIN [DWH].[dbo].[silver_stockitems] SI ON SI.StockItemID = SOL.StockItemID INNER JOIN [DWH].[dbo].[silver_suppliers] S ON SI.SupplierID = S.SupplierID INNER JOIN [DWH].[dbo].[silver_categories] SC ON SC.SupplierCategoryID = S.SupplierCategoryID GROUP BY YEAR(SO.OrderDate), SC.SupplierCategoryName;
  • 22. Demo – Load silver layer with DataFlow (Part 1)
  • 23. Demo – Load silver layer with DataFlow (Part 1) For each table i can set sink into deltalake
  • 24. Demo – Load silver layer with Pipeline (Part 1 define variables) ["Countries","DeliveryMethods","People","StateProvince s","SupplierCategories","Suppliers","BuyingGroups","Cu stomerCategories","Customers","Colors","PackageTypes", "StockItems","Cities","Date"] Set variables with this array
  • 25. Demo – Load silver layer with Pipeline (Part 2 copy activity source) @variables('FileName') @concat(item(),'.csv')
  • 26. Demo – Load silver layer with Pipeline (Part 3 copy activity sink) @concat('silver_' ,item())
  • 27. Demo – Load silver layer with Pipeline (Part 4 copy activity source) Ext_raw_data_WWI/SalesOrder/*
  • 28. Demo – Load silver layer with Pipeline (Part 5 copy activity sink)
  • 29. Demo – Load silver layer with Pipeline (Part 6 copy activity source) Ext_raw_data_WWI/SalesOrderLine/*
  • 30. Demo – Load silver layer with Pipeline (Part 7 copy activity sink)
  • 31. Demo – Load gold layer with sql queries
  • 32. Demo – Load gold layer with sql queries
  • 33. Demo – Create Serving Layer