Data saturday Oslo Azure Purview Erwin de Kreuk

Sep. 4, 2021
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
1 of 52

More Related Content

What's hot

Building Data Lakes for Analytics on AWSBuilding Data Lakes for Analytics on AWS
Building Data Lakes for Analytics on AWSAmazon Web Services
Building a Data Lake on AWSBuilding a Data Lake on AWS
Building a Data Lake on AWSAmazon Web Services
Azure Security Center-  Zero to HeroAzure Security Center-  Zero to Hero
Azure Security Center- Zero to HeroKasun Rajapakse
Introduction to AWS GlueIntroduction to AWS Glue
Introduction to AWS GlueAmazon Web Services
Power of the cloud - Introduction to azure securityPower of the cloud - Introduction to azure security
Power of the cloud - Introduction to azure securityBruno Capuano
Microsoft Cloud Adoption Framework for Azure: Thru Partner Governance WorkshopMicrosoft Cloud Adoption Framework for Azure: Thru Partner Governance Workshop
Microsoft Cloud Adoption Framework for Azure: Thru Partner Governance WorkshopNicholas Vossburg

Similar to Data saturday Oslo Azure Purview Erwin de Kreuk

Data weekender4.2  azure purview erwin de kreukData weekender4.2  azure purview erwin de kreuk
Data weekender4.2 azure purview erwin de kreukErwin de Kreuk
Azure Days 2019: Business Intelligence auf Azure (Marco Amhof & Yves Mauron)Azure Days 2019: Business Intelligence auf Azure (Marco Amhof & Yves Mauron)
Azure Days 2019: Business Intelligence auf Azure (Marco Amhof & Yves Mauron)Trivadis
CC -Unit4.pptxCC -Unit4.pptx
CC -Unit4.pptxRevathiparamanathan
TechEvent Databricks on AzureTechEvent Databricks on Azure
TechEvent Databricks on AzureTrivadis
Introduction to Azure monitorIntroduction to Azure monitor
Introduction to Azure monitorPraveen Nair
Azure Data Platform Overview.pdfAzure Data Platform Overview.pdf
Azure Data Platform Overview.pdfDustin Vannoy

More from Erwin de Kreuk

Azure Key Vault, Azure Dev Ops and Azure Synapse - how these services work pe...Azure Key Vault, Azure Dev Ops and Azure Synapse - how these services work pe...
Azure Key Vault, Azure Dev Ops and Azure Synapse - how these services work pe...Erwin de Kreuk
Lake Database  Database Template  Map Data in Azure Synapse AnalyticsLake Database  Database Template  Map Data in Azure Synapse Analytics
Lake Database Database Template Map Data in Azure Synapse AnalyticsErwin de Kreuk
Dealing with different Synapse Roles in Azure Synapse Analytics Erwin de KreukDealing with different Synapse Roles in Azure Synapse Analytics Erwin de Kreuk
Dealing with different Synapse Roles in Azure Synapse Analytics Erwin de KreukErwin de Kreuk
Is there a way that we can build our Azure Synapse Pipelines all with paramet...Is there a way that we can build our Azure Synapse Pipelines all with paramet...
Is there a way that we can build our Azure Synapse Pipelines all with paramet...Erwin de Kreuk
Is there a way that we can build our Azure Data Factory all with parameters b...Is there a way that we can build our Azure Data Factory all with parameters b...
Is there a way that we can build our Azure Data Factory all with parameters b...Erwin de Kreuk
SQL KONFERENZ 2020  Azure Key Vault, Azure Dev Ops and Azure Data Factory how...SQL KONFERENZ 2020  Azure Key Vault, Azure Dev Ops and Azure Data Factory how...
SQL KONFERENZ 2020 Azure Key Vault, Azure Dev Ops and Azure Data Factory how...Erwin de Kreuk

Recently uploaded

FavorIndexReport_R8.pdfFavorIndexReport_R8.pdf
FavorIndexReport_R8.pdfFavor Delivery
All-sql-cheat-sheet-a4.pdfAll-sql-cheat-sheet-a4.pdf
All-sql-cheat-sheet-a4.pdfssuser8392a0
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...Cathrine Wilhelmsen
BUSINESS ANALYTICS FOR DATA-DRIVEN DECISION MAKING - DUBAI.pdfBUSINESS ANALYTICS FOR DATA-DRIVEN DECISION MAKING - DUBAI.pdf
BUSINESS ANALYTICS FOR DATA-DRIVEN DECISION MAKING - DUBAI.pdfMAWAEVENTS1
apidays London 2023 - Why and how to apply DDD to APIs, Radhouane Jrad, QBE E...apidays London 2023 - Why and how to apply DDD to APIs, Radhouane Jrad, QBE E...
apidays London 2023 - Why and how to apply DDD to APIs, Radhouane Jrad, QBE E...apidays
HR ANALYSIS pdf.pdfHR ANALYSIS pdf.pdf
HR ANALYSIS pdf.pdfMehakSethi19

Recently uploaded(20)

Data saturday Oslo Azure Purview Erwin de Kreuk

Editor's Notes

  1. Hallo and Welcome to my session about Azure Purview My name is Erwin de Kreuk and I’m working as a Lead Data and AI for InSpark a Microsoft Partner in the Netherlands
  2. Ciao e benvenuto alla mia sessione su Azure Purview Hallo and Welcome to my session about Azure Purview My name is Erwin de Kreuk and I’m working as a Lead Data and AI for InSpark a Microsoft Partner in the Netherlands
  3. Azure Purview is a unified data governance service. During this session I will explain what Azure Purview is. The position of Azure Purview within your Data Estate And how it works with some practical examples If you have questions, please feel free to ask them
  4. History Blue Talon June 2019 With Azure Purview Microsoft has now his own Cloud Native Service for Data Governance and Data Lineage. I'm curious what the future will bring, but also which position it will take compared to Colibra / Informatica / AWS Glue Data Catalog or other Data Governance products
  5. As we all know Data Governance is becoming more and more becoming increasingly interdisciplinary. A chief data officer (CDO) is a corporate officer who is responsible for enterprise-wide governance and utilization of information as an asset, via data processing, analysis, data mining, information trading and other means. He will be one of the users who will use Azure Purview to get answers On what kind of do I have within my Data Estate Where is the data coming from but also I can trust the data. But also compliance is getting more and more important with all the required regulations from the local government or industries. F.E ISO and NEN certifications. Besided these questions the CDO wants to have also answers based On what are the risk to exposure mu data How can we control the access and use of data and compliant is our data.
  6. The following elements can lead to a successful data governance which is one of the key components in a modern Data Estate: You need to have control on your growing data landscape You want to Overcome operational silos A data silo is a collection of data held by one group that is not easily or fully accessible by other groups. ... Finance, administration, HR, and other departments need different information to do their work, and those individual collections of often overlapping-but-inconsistent data are in separate silos You want Increase the flexibility/agility of your data And You want make sure you comply with all different industry regulations and local government regulations. Azure Purview can help you with these elements
  7. Azure Purview organizes metadata that enables your organization to break down silos and derive meaning from data. Once data can be understood and annotated, it then lends itself to several applications – During the public we can use the data map where automate and manage metadata at scale Data catalog to Discover and search for data Data insights. To get an overview of the data in our Data Estate This’s what Azure Purview currently has to offer In the future, privacy, quality and master data management will follow.
  8. There are 4 pilars which helps you to maximize the business value of data in your organization Data Governance Set the Foundation Create Business Value for the consumers And of course, insights should not be missing
  9. Key features of Reimagine data governance in the cloud Cloud Native Managed Serverless PaaS
  10. Key features for the foundation are Automate and Discover data of different sources Classify data to specify sensitivity Know where your data is coming from
  11. Key features to maximize the business values Connect the different roles within your organization to a trusted data catalog Enable them to quickly find this data
  12. Key features to gain insights Understand at a glance how data is being created and used across your data estate Visually the state of data assets, scans, business glossary and sensitive data
  13. Datasource Power BI, SQL Sever on-prem, Azure Data Services including Synapse, Cosmos DB & Storage, Non-Microsoft systems including SAP ECC, SAP S4 HANA & Teradata, Multi-cloud systems including AWS S3 With Purview Platform: Automate scanning and classification of multicloud, SaaS, on-prem data. 25 plus out of box connectors and file formats supported Modernize homegrown catalogs built on opensource technology with Purview using Apache Atlas APIs supported out-of-the-box Get catalog features (C0 Tier) for FREE included with Purview platform: Search and browse Empower business and technical data analysts via a catalog to find and interpret data. Power data scientists and engineers with business context to drive BI, Analytics, AI and ML initiatives Automated metadata and lineage extraction Enrich the business value of data with technical, business and semantic metadata Scale understanding of data with automated, fully managed, serverless metadata management capability Leverage support of Apache Atlas’s open-source Lineage APIs to push lineage information into the Purview Data Map. Analyze impact of changes to data and understand dependencies visually.
  14. Azure Purview Catalog (C1 Tier) includes the following in addition to the free features included with the platform: Business Glossary Deliver a curated and consistent understanding of business terms and definitions. Import existing glossary terms from existing data dictionaries easily.  Also add ability to define custom attributes for the glossary terms and create templates for different domains like ‘Finance’, ‘Sales’ etc. Lineage views Ensure data provenance with a visual representation of owners, sources, transformation, and lifecycle Built-in integrations with solutions to automatically extract lineage such as Synapse Analytics, Azure Data Factory, Azure Data Share etc.
  15. Data Insights (D1 Tier) provides a bird’s eye view of your data landscape intended to help users such as Chief Data Officers quickly understand their data estate at large and gain key insights such as where sensitive data resides. It includes: Catalog insights: Asset Insights: Quickly see where all your data resides across a range of data sources Scan Insights: Success/failures/cancellations over a period Glossary Insights: Quickly understand changes made to the glossary over time and assess how much coverage glossary has over your data map. Sensitive data insights Simplify compliance risk assessment across all your operational and transactional data sources. Assess risk and derive audit trails of data qualified by sensitivity and business relevance.
  16. Purview Data Source Administrator Role Does not have access to the Purview Portal (the user needs to also be in the Data Reader or Data Curator roles) and can manage all aspects of scanning data into Azure Purview but does not have read or write access to content in Azure Purview beyond those related to scanning. programmatic processes, such as service principals, that need to be able to set up and monitor scans but should not have access to any of the catalog's data.
  17. Purview Data Reader Role Has access to the Purview portal and can read all content in Azure Purview except for scan bindings
  18. Purview Data Curator Role  Has access to the Purview portal and can read all content in Azure Purview except for scan bindings, can edit information about assets, can edit classification definitions and glossary terms, and can apply classifications and glossary terms to assets.
  19. Purview Data Source Administrator Role Does not have access to the Purview Portal (the user needs to also be in the Data Reader or Data Curator roles) and can manage all aspects of scanning data into Azure Purview but does not have read or write access to content in Azure Purview beyond those related to scanning. programmatic processes, such as service principals, that need to be able to set up and monitor scans but should not have access to any of the catalog's data. Purview Data Reader Role Has access to the Purview portal and can read all content in Azure Purview except for scan bindings Purview Data Curator Role  Has access to the Purview portal and can read all content in Azure Purview except for scan bindings, can edit information about assets, can edit classification definitions and glossary terms, and can apply classifications and glossary terms to assets.
  20. When deploying an Azure Purview Account on or after August 18th, 2021 you now can also assign roles bases on Collection So as you can see in the Example you can restricted people to see data in the Collection Assets Revenue. How this all works, I will show that I a later demo
  21. 4 capacity units are only for some subscriptions types
  22. Charging will now start as of 1 Capacity unit, for all Azure Purview accounts created on or after Augusts 18, 2021. Existing Purview accounts will be migrated starting September/October. Currently the Elastic Data Map is free
  23. Purview Data Map can automatically scale up and down within the elasticity window To get the next level of the elasticity window, a support ticket needs to be created.
  24. A single, centralized place that provides unified experience for data producers, data consumers, data & security officers
  25. Home Quick Actions, recently accessed items, owned Items, search bar and Documentation Sources Create collections, register data sources, setup Scans, Integration runtimeGlossary Manage Glossary Items, search, manage terms templates and custom attributes, import and export Terms using csv Insights Insights on your data Management Center Meta Data Management Security, ADF and data share Connections
  26. Demo Activity Hubs Home Page Tabs
  27. Table view-Map View
  28. Scan ADLS Define Scope
  29. All Source are categorized Pay Attention when you have enabled Private endpoint that you can access selected networks/sources
  30. Intended to help users such as Chief Data Officers quickly understand their data estate at large and gain key insights such as where sensitive data resides Asset Insight Understand distribution of data assets across a range of data sources & environments Scan Insight Number of successful, failed and cancelled scans over time Glossary Insights Understand changes made to business terms and assess how much coverage glossary has over the data map Classifications Insights Understand what sensitive data exists across the data estate from various lens Sensitivity Labels Insights Understand what sensitivity labels have been applied across the data estate File Extensions Insights Recently scanned files based on their extensions Reports on Assets, Scans, Glossary, Classification, and Labeling
  31. You need make sure that your Azure Purview Account as permission to read the PowerBI Tenant. You need to be a Power BI Admin to see the tenant settings page. First of all create a Security Group and add your Purview Account as a Member Then you need to add this Security Group to the tenant setting Allow service principals to use read-only Power BI admin APIs to allow Purview to scan your PowerBI Metadata you need to enable Enhance admin APIs responses with detailed metadata
  32. Make sure that before you start scanning your Power BI Dataset and to get the metadata, you must schedule a refresh in the powerbi service.
  33. I immediately thought back to a keynote from Pass Summit 2015, in which , Microsoft's new vision immediately became clear Walk with your head in the Cloud and your feet on the ground. I don’t why but it just came up. But it makes it clear that Microsoft is now busy to create a Unified experience for his customers. Where Azure Synapse is the heart and with the link to Azure Purview and Azure Cosmos DB/
  34. I immediately thought back to a keynote from Pass Summit 2015, in which , Microsoft's new vision immediately became clear Walk with your head in the Cloud and your feet on the ground. I don’t why but it just came up. But it makes it clear that Microsoft is now busy to create a Unified experience for his customers. Where Azure Synapse is the heart and with the link to Azure Purview and Azure Cosmos DB is getting even more simple.
  35. Once you created this connection you directly search with the Azure Purview catalog And for 2 weeks your Data Lineage will be enabled also when connecting your Purview Account Azure Purview drops lineage if the source or destination uses an unsupported data storage system.
  36. Once you created this connection you directly search with the Azure Purview catalog And for 2 weeks your Data Lineage will be enabled also when connecting your Purview Account Azure Purview drops lineage if the source or destination uses an unsupported data storage system.
  37. You may see below warning if you have the privilege to read Purview role assignment information and the needed role is not granted. To make sure the connection is properly set for the pipeline lineage push, go to your Purview account and check if Purview Data Curator role is granted to the Synapse workspace's managed identity. If not, manually add the role assignment.
  38. Source Collection Scan + Scan Rule set + Custom File Type Schedule Search catalog cities Lineage Browse Assets Edit/Overview/Lineage/Contacts Show Insights Show Synapse Integration
  39. https://azuredatagovernance.eventcore.com/