SlideShare a Scribd company logo
Azure Synapse
Udaiappa Ramachandran ( Udai )
https://udai.io
About me
• Udaiappa Ramachandran ( Udai )
• CTO/CSO-Akumina, Inc.
• Microsoft Azure MVP
• Cloud Expert
• Microsoft Azure, Amazon Web Services, and Google
• New Hampshire Cloud User Group (http://www.meetup.com/nashuaug )
• https://udai.io
Agenda
• Quick review on Azure Data Factory, Azure Databricks
• Azure Synapse Analytics
• Aggregating data from multiple data sources
• Exploring processed data
• Azure Synapse Security
• Demo…Demo…Demo…
Azure Datafactory
• Easy to use
• Wide range of connectors and features (90+)
• Powerful data integration capabilities (ingestion and transformation)
• GUI – Pipelines, data flows, power query
Azure Databricks
• Powerful data processing capabilities
• Machine learning and real-time analytics capabilities
• Managed service
• Notebooks
• Steeper learning curve
• Can be more expensive
What is Azure Synapse Analytics?
Azure Synapse Analytics - Components
• Data Warehouse
• SQL Pool
• Dedicated
• Serverless
• Spark Pool
• Python, SQL and C#
• Big Data Engine
• Serverless Engine
• Data Flows
• Ecosystem- PowerBI+Azure Machine Learning
What is Azure Synapse Analytics?
Source: https://learn.microsoft.com/en-us/azure/synapse-analytics/overview-what-is
Azure Synapse Analytics - Capabilities
• Unified analytics platform
• Serverless and dedicated options
• Enterprise data warehouse
• Data lake exploration
• Code-free hybrid data integration
• Deeply integrated Apache Spark and SQL engines
• Cloud-native HTAP
• Choice of language (T-SQL, Python, Scala, SparkSQL, and .NET)
• Integrated AI and BI
• Data Security
Synapse Analytics – SQL Pools
• Serverless SQL
• Query data from ADLS Gen2 directly
• Using T-SQL to query CSV, Parquet, JSON, etc.,
• No infrastructure needed
• Stand-alone polybase service
• Pay-per query model
• No charges for metadata queries (ex., select * from sys.objects)
• When to use?
• Quick ad-hoc queries
• Logical data warehouse
• Transform data in lake
• Dedicated SQL
• Provisioned Resource: Setup infrastructure in advance
• Massively Parallel Processing (MPP) Engine
Synapse SQL Architecture
Source: https://learn.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/massively-parallel-
processing-mpp-architecture
Synapse Analytics - Spark Pool
• Provisioned Resource: Setup infrastructure in advance
• Machine learning with MLib
• Data Engineering/Data Preparation with C#, Scala, Spark SQL, Python
• Streaming Data
• Spark notebooks
Synapse SPARK Overview
Source: https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/apache-spark-overview
Data Explorer Pool
• Unified experience
• Real-time insights
• Scalability
• Security
• High performance
• Real-time ingestion
• Time series analysis
• Machine learning
Data Explorer Pool
Source: https://learn.microsoft.com/en-us/azure/synapse-analytics/data-explorer/data-explorer-overview
When to use Azure Synapse Analytics?
• Large-scale data warehousing
• Advanced analytics
• Data exploration and discovery
• Real time analytics
• Data integration
• Integrated analytics
Synapse Analytics Vs. Synapse Private Hub
Feature Azure Synapse Analytics Azyre Synapse Analytics Private
Hub
Access Public access over the internet Private access over a private
connection
Security Data is encrypted at rest and in
transit
Data never leaves your network
Compliance Complies with a variety of data
regulations
Can be used to comply with sticker
data privacy regulations
Use cases General-purpose data analytics Secure access to Azure synapse
Analytics from on-premises network
or another virtual network
Azure Synapse – Use Case
• Propose a solution for ABC company to build real-time analytics using various data
sources such as Cosmos DB, Log Analytics, and SharePoint List Items. How can we
achieve this?
Demo
• Create Azure Synapse
• Walkthrough Azure Synapse properties
• Create Pools
• Run Samples
• Link Cosmos DB
• Create External table
• Data Explorer --Add Table and export data / Data explorer ingest data
• PowerBI
Azure Synapse – Use Case
• Aggregation
• Azure Cosmos DB – Synapse Link, then external view
• Azure Log Analytics Workspace – Continuous Export then Parquet transformer using Spark and
then external table
• SharePoint Lists – Continuous export then parquet transformer using spark and then external
table
• Presentation
• PowerBI – Direct Access
• HTML controls – DW Queries
• Cost
• SQL Server – Serverless/Dedicated
• Spark Nodes
• https://azure.com/e/6233ac854ace4eddb06d15b8b056df21
Diagrams:
Diagrams:
Security on Azure Synapse
• Data at REST encryption using TDE (Transparent Data Encryption)
• In-Transit (in motion) Encryption using TLS
• Key Management
• Customer Managed
• Bring your own key (BYOK)
• Must enabled when creating Azure Synapse
• TDE Protector (key to encrypt DEK)
• Data Masking – Dynamic and Static
• Row-Level and Column-Level Security
Reference
• https://learn.microsoft.com/en-us/azure/synapse-analytics/?WT.mc_id=AZ-MVP-
5004665
• https://techcommunity.microsoft.com/t5/azure-observability-blog/how-to-analyze-
data-exported-from-log-analytics-data-using/ba-p/2547888?WT.mc_id=AZ-MVP-
5004665
• https://www.youtube.com/watch?v=o2iFdU0EBLg&list=FLg-vqK9bYhHNecF-p-
ZftLQ&index=1
• https://www.youtube.com/watch?v=lLrjaVdBuM0&list=FLg-vqK9bYhHNecF-p-
ZftLQ&index=2&t=4712s
Thanks for your time and trust!
New Hampshire CLOUD .NET User Group

More Related Content

Similar to AzureSynapse.pptx

Azure Data Platform Overview.pdf
Azure Data Platform Overview.pdfAzure Data Platform Overview.pdf
Azure Data Platform Overview.pdf
Dustin Vannoy
 
Serverless SQL
Serverless SQLServerless SQL
Serverless SQL
Torsten Steinbach
 
Overview of data analytics service: Treasure Data Service
Overview of data analytics service: Treasure Data ServiceOverview of data analytics service: Treasure Data Service
Overview of data analytics service: Treasure Data Service
SATOSHI TAGOMORI
 
Apache Spark in Industry
Apache Spark in IndustryApache Spark in Industry
Apache Spark in Industry
Dorian Beganovic
 
Analytics in the Cloud
Analytics in the CloudAnalytics in the Cloud
Analytics in the Cloud
Ross McNeely
 
AWS Česko-Slovenský Webinár 03: Vývoj v AWS
AWS Česko-Slovenský Webinár 03: Vývoj v AWSAWS Česko-Slovenský Webinár 03: Vývoj v AWS
AWS Česko-Slovenský Webinár 03: Vývoj v AWS
Vladimir Simek
 
IBM Cloud Day January 2021 Data Lake Deep Dive
IBM Cloud Day January 2021 Data Lake Deep DiveIBM Cloud Day January 2021 Data Lake Deep Dive
IBM Cloud Day January 2021 Data Lake Deep Dive
Torsten Steinbach
 
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de KreukData saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
Erwin de Kreuk
 
Serverless spark
Serverless sparkServerless spark
Serverless spark
MamathaBusi
 
Survey of the Microsoft Azure Data Landscape
Survey of the Microsoft Azure Data LandscapeSurvey of the Microsoft Azure Data Landscape
Survey of the Microsoft Azure Data Landscape
Ike Ellis
 
Azure Synapse Analytics Overview (r2)
Azure Synapse Analytics Overview (r2)Azure Synapse Analytics Overview (r2)
Azure Synapse Analytics Overview (r2)
James Serra
 
DBP-010_Using Azure Data Services for Modern Data Applications
DBP-010_Using Azure Data Services for Modern Data ApplicationsDBP-010_Using Azure Data Services for Modern Data Applications
DBP-010_Using Azure Data Services for Modern Data Applications
decode2016
 
Drupal performance
Drupal performanceDrupal performance
Drupal performance
Gabi Lee
 
Accelerating Business Intelligence Solutions with Microsoft Azure pass
Accelerating Business Intelligence Solutions with Microsoft Azure   passAccelerating Business Intelligence Solutions with Microsoft Azure   pass
Accelerating Business Intelligence Solutions with Microsoft Azure pass
Jason Strate
 
REDSHIFT - Amazon
REDSHIFT - AmazonREDSHIFT - Amazon
REDSHIFT - Amazon
Douglas Bernardini
 
Tokyo azure meetup #2 big data made easy
Tokyo azure meetup #2   big data made easyTokyo azure meetup #2   big data made easy
Tokyo azure meetup #2 big data made easy
Tokyo Azure Meetup
 
4Developers 2018: Przetwarzanie Big Data w oparciu o architekturę Lambda na p...
4Developers 2018: Przetwarzanie Big Data w oparciu o architekturę Lambda na p...4Developers 2018: Przetwarzanie Big Data w oparciu o architekturę Lambda na p...
4Developers 2018: Przetwarzanie Big Data w oparciu o architekturę Lambda na p...
PROIDEA
 
Introduction to Promitor
Introduction to PromitorIntroduction to Promitor
Introduction to Promitor
Tom Kerkhove
 
Lakehouse in Azure
Lakehouse in AzureLakehouse in Azure
Lakehouse in Azure
Sergio Zenatti Filho
 
使用 Amazon Athena 直接分析儲存於 S3 的巨量資料
使用 Amazon Athena 直接分析儲存於 S3 的巨量資料使用 Amazon Athena 直接分析儲存於 S3 的巨量資料
使用 Amazon Athena 直接分析儲存於 S3 的巨量資料
Amazon Web Services
 

Similar to AzureSynapse.pptx (20)

Azure Data Platform Overview.pdf
Azure Data Platform Overview.pdfAzure Data Platform Overview.pdf
Azure Data Platform Overview.pdf
 
Serverless SQL
Serverless SQLServerless SQL
Serverless SQL
 
Overview of data analytics service: Treasure Data Service
Overview of data analytics service: Treasure Data ServiceOverview of data analytics service: Treasure Data Service
Overview of data analytics service: Treasure Data Service
 
Apache Spark in Industry
Apache Spark in IndustryApache Spark in Industry
Apache Spark in Industry
 
Analytics in the Cloud
Analytics in the CloudAnalytics in the Cloud
Analytics in the Cloud
 
AWS Česko-Slovenský Webinár 03: Vývoj v AWS
AWS Česko-Slovenský Webinár 03: Vývoj v AWSAWS Česko-Slovenský Webinár 03: Vývoj v AWS
AWS Česko-Slovenský Webinár 03: Vývoj v AWS
 
IBM Cloud Day January 2021 Data Lake Deep Dive
IBM Cloud Day January 2021 Data Lake Deep DiveIBM Cloud Day January 2021 Data Lake Deep Dive
IBM Cloud Day January 2021 Data Lake Deep Dive
 
Data saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de KreukData saturday Oslo Azure Purview Erwin de Kreuk
Data saturday Oslo Azure Purview Erwin de Kreuk
 
Serverless spark
Serverless sparkServerless spark
Serverless spark
 
Survey of the Microsoft Azure Data Landscape
Survey of the Microsoft Azure Data LandscapeSurvey of the Microsoft Azure Data Landscape
Survey of the Microsoft Azure Data Landscape
 
Azure Synapse Analytics Overview (r2)
Azure Synapse Analytics Overview (r2)Azure Synapse Analytics Overview (r2)
Azure Synapse Analytics Overview (r2)
 
DBP-010_Using Azure Data Services for Modern Data Applications
DBP-010_Using Azure Data Services for Modern Data ApplicationsDBP-010_Using Azure Data Services for Modern Data Applications
DBP-010_Using Azure Data Services for Modern Data Applications
 
Drupal performance
Drupal performanceDrupal performance
Drupal performance
 
Accelerating Business Intelligence Solutions with Microsoft Azure pass
Accelerating Business Intelligence Solutions with Microsoft Azure   passAccelerating Business Intelligence Solutions with Microsoft Azure   pass
Accelerating Business Intelligence Solutions with Microsoft Azure pass
 
REDSHIFT - Amazon
REDSHIFT - AmazonREDSHIFT - Amazon
REDSHIFT - Amazon
 
Tokyo azure meetup #2 big data made easy
Tokyo azure meetup #2   big data made easyTokyo azure meetup #2   big data made easy
Tokyo azure meetup #2 big data made easy
 
4Developers 2018: Przetwarzanie Big Data w oparciu o architekturę Lambda na p...
4Developers 2018: Przetwarzanie Big Data w oparciu o architekturę Lambda na p...4Developers 2018: Przetwarzanie Big Data w oparciu o architekturę Lambda na p...
4Developers 2018: Przetwarzanie Big Data w oparciu o architekturę Lambda na p...
 
Introduction to Promitor
Introduction to PromitorIntroduction to Promitor
Introduction to Promitor
 
Lakehouse in Azure
Lakehouse in AzureLakehouse in Azure
Lakehouse in Azure
 
使用 Amazon Athena 直接分析儲存於 S3 的巨量資料
使用 Amazon Athena 直接分析儲存於 S3 的巨量資料使用 Amazon Athena 直接分析儲存於 S3 的巨量資料
使用 Amazon Athena 直接分析儲存於 S3 的巨量資料
 

More from Udaiappa Ramachandran

RAG Patterns and Vector Search in Generative AI
RAG Patterns and Vector Search in Generative AIRAG Patterns and Vector Search in Generative AI
RAG Patterns and Vector Search in Generative AI
Udaiappa Ramachandran
 
Level up your security using Intune.pptx
Level up your security using Intune.pptxLevel up your security using Intune.pptx
Level up your security using Intune.pptx
Udaiappa Ramachandran
 
Building AI-Driven Apps Using Semantic Kernel.pptx
Building AI-Driven Apps Using Semantic Kernel.pptxBuilding AI-Driven Apps Using Semantic Kernel.pptx
Building AI-Driven Apps Using Semantic Kernel.pptx
Udaiappa Ramachandran
 
AI-Plugins-Planners-Persona-SemanticKernel.pptx
AI-Plugins-Planners-Persona-SemanticKernel.pptxAI-Plugins-Planners-Persona-SemanticKernel.pptx
AI-Plugins-Planners-Persona-SemanticKernel.pptx
Udaiappa Ramachandran
 
DOTNET8.pptx
DOTNET8.pptxDOTNET8.pptx
DOTNET8.pptx
Udaiappa Ramachandran
 
Vector Search using OpenAI in Azure Cognitive Search.pptx
Vector Search using OpenAI in Azure Cognitive Search.pptxVector Search using OpenAI in Azure Cognitive Search.pptx
Vector Search using OpenAI in Azure Cognitive Search.pptx
Udaiappa Ramachandran
 
SecureAzureServicesUsingADAuthentication.pptx
SecureAzureServicesUsingADAuthentication.pptxSecureAzureServicesUsingADAuthentication.pptx
SecureAzureServicesUsingADAuthentication.pptx
Udaiappa Ramachandran
 
AzureOpenAI.pptx
AzureOpenAI.pptxAzureOpenAI.pptx
AzureOpenAI.pptx
Udaiappa Ramachandran
 
OpenAI-Copilot-ChatGPT.pptx
OpenAI-Copilot-ChatGPT.pptxOpenAI-Copilot-ChatGPT.pptx
OpenAI-Copilot-ChatGPT.pptx
Udaiappa Ramachandran
 
DiagnoseAndSolveproblems.pptx
DiagnoseAndSolveproblems.pptxDiagnoseAndSolveproblems.pptx
DiagnoseAndSolveproblems.pptx
Udaiappa Ramachandran
 
MAUI.pptx
MAUI.pptxMAUI.pptx
.NET7.pptx
.NET7.pptx.NET7.pptx
AzureDevOps
AzureDevOpsAzureDevOps
AzureCostManagementAndBilling
AzureCostManagementAndBillingAzureCostManagementAndBilling
AzureCostManagementAndBilling
Udaiappa Ramachandran
 
.NET6.pptx
.NET6.pptx.NET6.pptx
Azure Automation and Update Management
Azure Automation and Update ManagementAzure Automation and Update Management
Azure Automation and Update Management
Udaiappa Ramachandran
 
Azure staticwebapps
Azure staticwebappsAzure staticwebapps
Azure staticwebapps
Udaiappa Ramachandran
 
Azure privatelink
Azure privatelinkAzure privatelink
Azure privatelink
Udaiappa Ramachandran
 
Azure Security Center
Azure Security CenterAzure Security Center
Azure Security Center
Udaiappa Ramachandran
 
Azure signalr service
Azure signalr serviceAzure signalr service
Azure signalr service
Udaiappa Ramachandran
 

More from Udaiappa Ramachandran (20)

RAG Patterns and Vector Search in Generative AI
RAG Patterns and Vector Search in Generative AIRAG Patterns and Vector Search in Generative AI
RAG Patterns and Vector Search in Generative AI
 
Level up your security using Intune.pptx
Level up your security using Intune.pptxLevel up your security using Intune.pptx
Level up your security using Intune.pptx
 
Building AI-Driven Apps Using Semantic Kernel.pptx
Building AI-Driven Apps Using Semantic Kernel.pptxBuilding AI-Driven Apps Using Semantic Kernel.pptx
Building AI-Driven Apps Using Semantic Kernel.pptx
 
AI-Plugins-Planners-Persona-SemanticKernel.pptx
AI-Plugins-Planners-Persona-SemanticKernel.pptxAI-Plugins-Planners-Persona-SemanticKernel.pptx
AI-Plugins-Planners-Persona-SemanticKernel.pptx
 
DOTNET8.pptx
DOTNET8.pptxDOTNET8.pptx
DOTNET8.pptx
 
Vector Search using OpenAI in Azure Cognitive Search.pptx
Vector Search using OpenAI in Azure Cognitive Search.pptxVector Search using OpenAI in Azure Cognitive Search.pptx
Vector Search using OpenAI in Azure Cognitive Search.pptx
 
SecureAzureServicesUsingADAuthentication.pptx
SecureAzureServicesUsingADAuthentication.pptxSecureAzureServicesUsingADAuthentication.pptx
SecureAzureServicesUsingADAuthentication.pptx
 
AzureOpenAI.pptx
AzureOpenAI.pptxAzureOpenAI.pptx
AzureOpenAI.pptx
 
OpenAI-Copilot-ChatGPT.pptx
OpenAI-Copilot-ChatGPT.pptxOpenAI-Copilot-ChatGPT.pptx
OpenAI-Copilot-ChatGPT.pptx
 
DiagnoseAndSolveproblems.pptx
DiagnoseAndSolveproblems.pptxDiagnoseAndSolveproblems.pptx
DiagnoseAndSolveproblems.pptx
 
MAUI.pptx
MAUI.pptxMAUI.pptx
MAUI.pptx
 
.NET7.pptx
.NET7.pptx.NET7.pptx
.NET7.pptx
 
AzureDevOps
AzureDevOpsAzureDevOps
AzureDevOps
 
AzureCostManagementAndBilling
AzureCostManagementAndBillingAzureCostManagementAndBilling
AzureCostManagementAndBilling
 
.NET6.pptx
.NET6.pptx.NET6.pptx
.NET6.pptx
 
Azure Automation and Update Management
Azure Automation and Update ManagementAzure Automation and Update Management
Azure Automation and Update Management
 
Azure staticwebapps
Azure staticwebappsAzure staticwebapps
Azure staticwebapps
 
Azure privatelink
Azure privatelinkAzure privatelink
Azure privatelink
 
Azure Security Center
Azure Security CenterAzure Security Center
Azure Security Center
 
Azure signalr service
Azure signalr serviceAzure signalr service
Azure signalr service
 

Recently uploaded

Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
g2nightmarescribd
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 

Recently uploaded (20)

Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 

AzureSynapse.pptx

  • 1. Azure Synapse Udaiappa Ramachandran ( Udai ) https://udai.io
  • 2. About me • Udaiappa Ramachandran ( Udai ) • CTO/CSO-Akumina, Inc. • Microsoft Azure MVP • Cloud Expert • Microsoft Azure, Amazon Web Services, and Google • New Hampshire Cloud User Group (http://www.meetup.com/nashuaug ) • https://udai.io
  • 3. Agenda • Quick review on Azure Data Factory, Azure Databricks • Azure Synapse Analytics • Aggregating data from multiple data sources • Exploring processed data • Azure Synapse Security • Demo…Demo…Demo…
  • 4. Azure Datafactory • Easy to use • Wide range of connectors and features (90+) • Powerful data integration capabilities (ingestion and transformation) • GUI – Pipelines, data flows, power query
  • 5. Azure Databricks • Powerful data processing capabilities • Machine learning and real-time analytics capabilities • Managed service • Notebooks • Steeper learning curve • Can be more expensive
  • 6. What is Azure Synapse Analytics?
  • 7. Azure Synapse Analytics - Components • Data Warehouse • SQL Pool • Dedicated • Serverless • Spark Pool • Python, SQL and C# • Big Data Engine • Serverless Engine • Data Flows • Ecosystem- PowerBI+Azure Machine Learning
  • 8. What is Azure Synapse Analytics? Source: https://learn.microsoft.com/en-us/azure/synapse-analytics/overview-what-is
  • 9. Azure Synapse Analytics - Capabilities • Unified analytics platform • Serverless and dedicated options • Enterprise data warehouse • Data lake exploration • Code-free hybrid data integration • Deeply integrated Apache Spark and SQL engines • Cloud-native HTAP • Choice of language (T-SQL, Python, Scala, SparkSQL, and .NET) • Integrated AI and BI • Data Security
  • 10. Synapse Analytics – SQL Pools • Serverless SQL • Query data from ADLS Gen2 directly • Using T-SQL to query CSV, Parquet, JSON, etc., • No infrastructure needed • Stand-alone polybase service • Pay-per query model • No charges for metadata queries (ex., select * from sys.objects) • When to use? • Quick ad-hoc queries • Logical data warehouse • Transform data in lake • Dedicated SQL • Provisioned Resource: Setup infrastructure in advance • Massively Parallel Processing (MPP) Engine
  • 11. Synapse SQL Architecture Source: https://learn.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/massively-parallel- processing-mpp-architecture
  • 12. Synapse Analytics - Spark Pool • Provisioned Resource: Setup infrastructure in advance • Machine learning with MLib • Data Engineering/Data Preparation with C#, Scala, Spark SQL, Python • Streaming Data • Spark notebooks
  • 13. Synapse SPARK Overview Source: https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/apache-spark-overview
  • 14. Data Explorer Pool • Unified experience • Real-time insights • Scalability • Security • High performance • Real-time ingestion • Time series analysis • Machine learning
  • 15. Data Explorer Pool Source: https://learn.microsoft.com/en-us/azure/synapse-analytics/data-explorer/data-explorer-overview
  • 16. When to use Azure Synapse Analytics? • Large-scale data warehousing • Advanced analytics • Data exploration and discovery • Real time analytics • Data integration • Integrated analytics
  • 17. Synapse Analytics Vs. Synapse Private Hub Feature Azure Synapse Analytics Azyre Synapse Analytics Private Hub Access Public access over the internet Private access over a private connection Security Data is encrypted at rest and in transit Data never leaves your network Compliance Complies with a variety of data regulations Can be used to comply with sticker data privacy regulations Use cases General-purpose data analytics Secure access to Azure synapse Analytics from on-premises network or another virtual network
  • 18. Azure Synapse – Use Case • Propose a solution for ABC company to build real-time analytics using various data sources such as Cosmos DB, Log Analytics, and SharePoint List Items. How can we achieve this?
  • 19. Demo • Create Azure Synapse • Walkthrough Azure Synapse properties • Create Pools • Run Samples • Link Cosmos DB • Create External table • Data Explorer --Add Table and export data / Data explorer ingest data • PowerBI
  • 20. Azure Synapse – Use Case • Aggregation • Azure Cosmos DB – Synapse Link, then external view • Azure Log Analytics Workspace – Continuous Export then Parquet transformer using Spark and then external table • SharePoint Lists – Continuous export then parquet transformer using spark and then external table • Presentation • PowerBI – Direct Access • HTML controls – DW Queries • Cost • SQL Server – Serverless/Dedicated • Spark Nodes • https://azure.com/e/6233ac854ace4eddb06d15b8b056df21
  • 23. Security on Azure Synapse • Data at REST encryption using TDE (Transparent Data Encryption) • In-Transit (in motion) Encryption using TLS • Key Management • Customer Managed • Bring your own key (BYOK) • Must enabled when creating Azure Synapse • TDE Protector (key to encrypt DEK) • Data Masking – Dynamic and Static • Row-Level and Column-Level Security
  • 24. Reference • https://learn.microsoft.com/en-us/azure/synapse-analytics/?WT.mc_id=AZ-MVP- 5004665 • https://techcommunity.microsoft.com/t5/azure-observability-blog/how-to-analyze- data-exported-from-log-analytics-data-using/ba-p/2547888?WT.mc_id=AZ-MVP- 5004665 • https://www.youtube.com/watch?v=o2iFdU0EBLg&list=FLg-vqK9bYhHNecF-p- ZftLQ&index=1 • https://www.youtube.com/watch?v=lLrjaVdBuM0&list=FLg-vqK9bYhHNecF-p- ZftLQ&index=2&t=4712s
  • 25. Thanks for your time and trust! New Hampshire CLOUD .NET User Group

Editor's Notes

  1. Azure SQL Data Warehouse – a cloud-based enterprise data warehouse (EDW) that uses massively parallel processing (MPP) to reun complex queries across petabytes of data quickly.
  2. Azure SQL Data Warehouse – a cloud-based enterprise data warehouse (EDW) that uses massively parallel processing (MPP) to reun complex queries across petabytes of data quickly.
  3. Descriptive analytics, which answers the question “What is happening in my business?”. The data to answer this question is typically answered through the creation of a data warehouse in which historical data is persisted in relational tables for multidimensional modeling and reporting. Diagnostic analytics, which deals with answering the question “Why is it happening?”. This may involve exploring information that already exists in a data warehouse, but typically involves a wider search of your data estate to find more data to support this type of analysis. Predictive analytics, which enables you to answer the question “What is likely to happen in the future based on previous trends and patterns?” Prescriptive analytics, which enables autonomous decision making based on real-time or near real-time analysis of data, using predictive analytics.
  4. Data Warehouse: The already popular Azure Data Warehouse technology for storing and managing data for analysis and decision making, now through SQL pools. Big Data engine: With Spark pools, engineers can now run scalable analytics with Spark languages to do Big Data processing with them . Serverless engine: Query Data Lakes directly using SQL statements in a simple way. Data flows: To Develop ETL flows that consume or receive data in your Data Warehouse or Data Lake with the same engine used with Azure Data Factory. Azure Data Lake Storage+Azure SQL Data Warehouse+Azure Analytics=Azure Synapse Analytics
  5. Data Warehouse: The already popular Azure Data Warehouse technology for storing and managing data for analysis and decision making, now through SQL pools. Big Data engine: With Spark pools, engineers can now run scalable analytics with Spark languages to do Big Data processing with them . Serverless engine: Query Data Lakes directly using SQL statements in a simple way. Data flows: To Develop ETL flows that consume or receive data in your Data Warehouse or Data Lake with the same engine used with Azure Data Factory. Azure Data Lake Storage+Azure SQL Data Warehouse+Azure Analytics=Azure Synapse Analytics
  6. Azure SQL Data Warehouse – a cloud-based enterprise data warehouse (EDW) that uses massively parallel processing (MPP) to reun complex queries across petabytes of data quickly.
  7. Quick ad-hoc queries – before you decide how to proceed Logical Data warehouse- abstract layer on top of raw data Transform data in lake-consume it directly using powerBI
  8. The number of compute nodes ranges from 1 to 60, and is determined by the service level for Synapse SQL.
  9. Spark notebooks- combine code, text, markdown and data visualization
  10. YARN (Yet Another Resource Negotiator) https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/apache-spark-machine-learning-mllib-notebook
  11. Spark notebooks- combine code, text, markdown and data visualization
  12. https://learn.microsoft.com/en-us/training/modules/introduction-azure-synapse-analytics/4-when-use
  13. Azure SQL Data Warehouse – a cloud-based enterprise data warehouse (EDW) that uses massively parallel processing (MPP) to reun complex queries across petabytes of data quickly.
  14. df_nyc.write.mode("overwrite").saveAsTable("dbo.PassengerData")
  15. Double Encryption on top of Microsoft managed keys TDE using Az Key vault--Get/Wrap/Unwrap DEK key length 2048 or 3072 supported formats for imported key .pfx, .byok, .backup backup your keys before using it create a new backup when changes are made to the key Dynamic data masking mask data to non-privileged users ability to specify how much is revealed configured on specific databse fields can be used alongside encrytion, auditing, row-level-security etc., can be enabled via as portal or t-sql statements types of data masking full xxxx partial uxxx@xxx.com random salary=10000;FUNCTION='random(1,8)';Masked=6 custom string ex., name=Udai; FUNCTION='partial(1,'XXXX',1);masked=UxxxxI create user testuser without login grant select on sales.customer to testuser execute as user='testuser' select.... revert go grant unmask to test user revoke unmask to testuser select c.name,tbl.name as table_name,c.is_masked,c.masking_function from sys.masked_columns as c join sys.tables as tbl on c.[object_it]=tbl.[object_id] where is_masked=1 how does row level sec works not permissin based but predicate based security policy security predicate is an inline table-valued function (iTVF) filter predicate creating rls create table, insert rows, create users, create a schema(create schema),create security redicate(create function),create securith policy RLS best practices crate a separate chcema for the securit predicate function alter any security permission is required drop components in the following order: security policy, Table, function, schemas avoid excessive table joins in the predicate function CLS control access to specific column based on users context grant access to -sql user and azure ad