Data Quality Patterns in the Cloud
with Azure Data Factory
Mark Kromer
Sr. Program Manager
Azure Data Factory
ADF: Simple and Productive ETL in the Cloud at any Scale
Modern Data Warehouse Pattern with ADF Mapping Data Flows
[Diagram: Modern Data Warehouse pattern]
Sources: Business/custom apps (structured); Logs, files, and media (unstructured); Databases
Azure Data Factory: Load files into data lake on a schedule
Azure Storage / Data Lake Store
Azure Databricks: Clean and join disparate data
Azure Data Factory: Extract and transform relational data
Azure SQL DW: Load processed data into tables optimized for analytics
Consumers: Applications; Dashboards
list of connectors
full list of compute environments
Data Flow
Data Quality Patterns
• iif(length(title) == 0, toString(null()), title)
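The expression above replaces a zero-length title with NULL and leaves every other value unchanged. As a rough illustration of that rule outside ADF, here is a minimal Python sketch. The function name `empty_to_null` and the sample rows are made up for this example, and the handling of an already-NULL input is an assumption, not a statement about how the Data Flow expression engine evaluates `length(null())`.

```python
from typing import Optional


def empty_to_null(title: Optional[str]) -> Optional[str]:
    """Return None for an empty string; pass any other value through."""
    # Analogue of iif(length(title) == 0, toString(null()), title):
    # a zero-length title becomes NULL, everything else is unchanged.
    # Assumption: an input that is already None simply stays None.
    if title is not None and len(title) == 0:
        return None
    return title


# Hypothetical sample rows, invented for illustration only.
rows = [{"title": "Sample Movie"}, {"title": ""}, {"title": None}]
cleaned = [{**row, "title": empty_to_null(row["title"])} for row in rows]
```

Normalizing empty strings to NULL early means later steps (null counts, coalescing, joins) only have to deal with one representation of "missing".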
http://aka.ms/dataflowvideos
http://aka.ms/dataflowpatterns
https://docs.microsoft.com/en-us/azure/data-factory/concepts-data-flow-overview
http://aka.ms/dataflowexpressions
https://aka.ms/dfperf
https://aka.ms/dflinks
https://github.com/kromerm/adfdataflowdocs/tree/master/sampledata
http://slideshare.net/kromerm

Editor's Notes

  • #3 Let’s take a look at the process through the visual user interface. Orchestration: Directs other services to execute actions as part of the transformation process. Mapping Data Flows: Develop graphical data transformation logic at scale without writing code using Mapping Data Flows (preview). Monitoring: Monitor pipeline and activity runs with a simple list view interface.