AzureSynapse.pptx

Azure Synapse
Udaiappa Ramachandran ( Udai )
https://udai.io

About me
• Udaiappa Ramachandran ( Udai )
• CTO/CSO-Akumina, Inc.
• Microsoft Azure MVP
• Cloud Expert
• Microsoft Azure, Amazon Web Services, and Google
• New Hampshire Cloud User Group (http://www.meetup.com/nashuaug )
• https://udai.io

Agenda
• Quick review on Azure Data Factory, Azure Databricks
• Azure Synapse Analytics
• Aggregating data from multiple data sources
• Exploring processed data
• Azure Synapse Security
• Demo…Demo…Demo…

Azure Datafactory
• Easy to use
• Wide range of connectors and features (90+)
• Powerful data integration capabilities (ingestion and transformation)
• GUI – Pipelines, data flows, power query

Azure Databricks
• Powerful data processing capabilities
• Machine learning and real-time analytics capabilities
• Managed service
• Notebooks
• Steeper learning curve
• Can be more expensive

What is Azure Synapse Analytics?

Azure Synapse Analytics - Components
• Data Warehouse
• SQL Pool
• Dedicated
• Serverless
• Spark Pool
• Python, SQL and C#
• Big Data Engine
• Serverless Engine
• Data Flows
• Ecosystem- PowerBI+Azure Machine Learning

What is Azure Synapse Analytics?
Source: https://learn.microsoft.com/en-us/azure/synapse-analytics/overview-what-is

Azure Synapse Analytics - Capabilities
• Unified analytics platform
• Serverless and dedicated options
• Enterprise data warehouse
• Data lake exploration
• Code-free hybrid data integration
• Deeply integrated Apache Spark and SQL engines
• Cloud-native HTAP
• Choice of language (T-SQL, Python, Scala, SparkSQL, and .NET)
• Integrated AI and BI
• Data Security

Synapse Analytics – SQL Pools
• Serverless SQL
• Query data from ADLS Gen2 directly
• Using T-SQL to query CSV, Parquet, JSON, etc.,
• No infrastructure needed
• Stand-alone polybase service
• Pay-per query model
• No charges for metadata queries (ex., select * from sys.objects)
• When to use?
• Quick ad-hoc queries
• Logical data warehouse
• Transform data in lake
• Dedicated SQL
• Provisioned Resource: Setup infrastructure in advance
• Massively Parallel Processing (MPP) Engine

Synapse SQL Architecture
Source: https://learn.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/massively-parallel-
processing-mpp-architecture

Synapse Analytics - Spark Pool
• Provisioned Resource: Setup infrastructure in advance
• Machine learning with MLib
• Data Engineering/Data Preparation with C#, Scala, Spark SQL, Python
• Streaming Data
• Spark notebooks

Synapse SPARK Overview
Source: https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/apache-spark-overview

Data Explorer Pool
• Unified experience
• Real-time insights
• Scalability
• Security
• High performance
• Real-time ingestion
• Time series analysis
• Machine learning

Data Explorer Pool
Source: https://learn.microsoft.com/en-us/azure/synapse-analytics/data-explorer/data-explorer-overview

When to use Azure Synapse Analytics?
• Large-scale data warehousing
• Advanced analytics
• Data exploration and discovery
• Real time analytics
• Data integration
• Integrated analytics

Synapse Analytics Vs. Synapse Private Hub
Feature Azure Synapse Analytics Azyre Synapse Analytics Private
Hub
Access Public access over the internet Private access over a private
connection
Security Data is encrypted at rest and in
transit
Data never leaves your network
Compliance Complies with a variety of data
regulations
Can be used to comply with sticker
data privacy regulations
Use cases General-purpose data analytics Secure access to Azure synapse
Analytics from on-premises network
or another virtual network

Azure Synapse – Use Case
• Propose a solution for ABC company to build real-time analytics using various data
sources such as Cosmos DB, Log Analytics, and SharePoint List Items. How can we
achieve this?

Demo
• Create Azure Synapse
• Walkthrough Azure Synapse properties
• Create Pools
• Run Samples
• Link Cosmos DB
• Create External table
• Data Explorer --Add Table and export data / Data explorer ingest data
• PowerBI

Azure Synapse – Use Case
• Aggregation
• Azure Cosmos DB – Synapse Link, then external view
• Azure Log Analytics Workspace – Continuous Export then Parquet transformer using Spark and
then external table
• SharePoint Lists – Continuous export then parquet transformer using spark and then external
table
• Presentation
• PowerBI – Direct Access
• HTML controls – DW Queries
• Cost
• SQL Server – Serverless/Dedicated
• Spark Nodes
• https://azure.com/e/6233ac854ace4eddb06d15b8b056df21

Security on Azure Synapse
• Data at REST encryption using TDE (Transparent Data Encryption)
• In-Transit (in motion) Encryption using TLS
• Key Management
• Customer Managed
• Bring your own key (BYOK)
• Must enabled when creating Azure Synapse
• TDE Protector (key to encrypt DEK)
• Data Masking – Dynamic and Static
• Row-Level and Column-Level Security

Reference
• https://learn.microsoft.com/en-us/azure/synapse-analytics/?WT.mc_id=AZ-MVP-
5004665
• https://techcommunity.microsoft.com/t5/azure-observability-blog/how-to-analyze-
data-exported-from-log-analytics-data-using/ba-p/2547888?WT.mc_id=AZ-MVP-
5004665
• https://www.youtube.com/watch?v=o2iFdU0EBLg&list=FLg-vqK9bYhHNecF-p-
ZftLQ&index=1
• https://www.youtube.com/watch?v=lLrjaVdBuM0&list=FLg-vqK9bYhHNecF-p-
ZftLQ&index=2&t=4712s

Thanks for your time and trust!
New Hampshire CLOUD .NET User Group

AzureSynapse.pptx

Recommended

Recommended

More Related Content

Similar to AzureSynapse.pptx

Similar to AzureSynapse.pptx (20)

More from Udaiappa Ramachandran

More from Udaiappa Ramachandran (20)

Recently uploaded

Recently uploaded (20)

AzureSynapse.pptx

Editor's Notes