Azure Synapse
Udaiappa Ramachandran ( Udai )
https://udai.io
About me
• Udaiappa Ramachandran ( Udai )
• CTO/CSO-Akumina, Inc.
• Microsoft Azure MVP
• Cloud Expert
• Microsoft Azure, Amazon Web Services, and Google
• New Hampshire Cloud User Group (http://www.meetup.com/nashuaug)
• https://udai.io
Agenda
• Quick review on Azure Data Factory, Azure Databricks
• Azure Synapse Analytics
• Aggregating data from multiple data sources
• Exploring processed data
• Azure Synapse Security
• Demo…Demo…Demo…
Azure Data Factory
• Easy to use
• Wide range of connectors and features (90+)
• Powerful data integration capabilities (ingestion and transformation)
• GUI – Pipelines, data flows, Power Query
Azure Databricks
• Powerful data processing capabilities
• Machine learning and real-time analytics capabilities
• Managed service
• Notebooks
• Steeper learning curve
• Can be more expensive
What is Azure Synapse Analytics?
Azure Synapse Analytics - Components
• Data Warehouse
• SQL Pool
• Dedicated
• Serverless
• Spark Pool
• Python, SQL and C#
• Big Data Engine
• Serverless Engine
• Data Flows
• Ecosystem – Power BI + Azure Machine Learning
What is Azure Synapse Analytics?
Source: https://learn.microsoft.com/en-us/azure/synapse-analytics/overview-what-is
Azure Synapse Analytics - Capabilities
• Unified analytics platform
• Serverless and dedicated options
• Enterprise data warehouse
• Data lake exploration
• Code-free hybrid data integration
• Deeply integrated Apache Spark and SQL engines
• Cloud-native HTAP
• Choice of language (T-SQL, Python, Scala, SparkSQL, and .NET)
• Integrated AI and BI
• Data Security
Synapse Analytics – SQL Pools
• Serverless SQL
• Query data from ADLS Gen2 directly
• Use T-SQL to query CSV, Parquet, JSON, etc.
• No infrastructure needed
• Stand-alone PolyBase service
• Pay-per-query model (billed by the amount of data processed)
• No charges for metadata queries (e.g., SELECT * FROM sys.objects)
• When to use?
• Quick ad-hoc queries
• Logical data warehouse
• Transform data in lake
• Dedicated SQL
• Provisioned resource: set up infrastructure in advance
• Massively Parallel Processing (MPP) Engine
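The serverless bullets above (querying ADLS Gen2 files directly with T-SQL) can be sketched as a small helper that builds an OPENROWSET statement. This is a minimal sketch, not the presenter's demo code: the function name and the storage URL are hypothetical, and only the PARQUET, CSV, and DELTA formats are handled.

```python
ALLOWED_FORMATS = {"PARQUET", "CSV", "DELTA"}


def build_openrowset_query(storage_url: str, file_format: str = "PARQUET", top: int = 10) -> str:
    """Build a T-SQL SELECT that reads lake files through OPENROWSET."""
    fmt = file_format.upper()
    if fmt not in ALLOWED_FORMATS:
        raise ValueError(f"unsupported format: {file_format}")
    if top <= 0:
        raise ValueError("top must be a positive integer")

    # Double any single quote so the URL stays inside its T-SQL string literal.
    safe_url = storage_url.replace("'", "''")
    options = [f"BULK '{safe_url}'", f"FORMAT = '{fmt}'"]
    if fmt == "CSV":
        # HEADER_ROW is only available with parser version 2.0.
        options += ["PARSER_VERSION = '2.0'", "HEADER_ROW = TRUE"]

    body = ",\n    ".join(options)
    return f"SELECT TOP {top} *\nFROM OPENROWSET(\n    {body}\n) AS [result];"


if __name__ == "__main__":
    # Hypothetical ADLS Gen2 path, used only for illustration.
    print(build_openrowset_query("https://contosolake.dfs.core.windows.net/sales/2024/*.parquet"))
```

The generated statement would be run against the workspace's serverless SQL endpoint. JSON is left out of this sketch because it needs additional reader options.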
Synapse SQL Architecture
Source: https://learn.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/massively-parallel-processing-mpp-architecture
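To make the MPP idea concrete: a dedicated SQL pool spreads each table over 60 distributions, and the compute nodes share those distributions between them. The sketch below imitates hash distribution in plain Python. SHA-256 is only a stand-in (the service's real hash function is not something this deck documents), and the function names are hypothetical.

```python
import hashlib
from collections import Counter

NUM_DISTRIBUTIONS = 60  # a dedicated SQL pool spreads each table over 60 distributions


def distribution_for(key: str) -> int:
    """Map a distribution-column value to one of the 60 distributions.

    SHA-256 is a stand-in for the service's internal hash function.
    """
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % NUM_DISTRIBUTIONS


def distributions_per_node(compute_nodes: int) -> int:
    """Return how many distributions each compute node handles."""
    if not 1 <= compute_nodes <= NUM_DISTRIBUTIONS:
        raise ValueError("compute_nodes must be between 1 and 60")
    if NUM_DISTRIBUTIONS % compute_nodes != 0:
        # Assumption of this sketch: distributions divide evenly across nodes.
        raise ValueError("this sketch assumes distributions divide evenly across nodes")
    return NUM_DISTRIBUTIONS // compute_nodes


if __name__ == "__main__":
    keys = [f"customer-{i}" for i in range(10_000)]
    load = Counter(distribution_for(k) for k in keys)
    print("distributions used:", len(load))
    print("rows in busiest distribution:", max(load.values()))
    print("distributions per node at 10 nodes:", distributions_per_node(10))
```

The same key always lands in the same distribution, which is why a well-chosen distribution column lets joins on that column avoid moving data between nodes.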
Synapse Analytics - Spark Pool
• Provisioned resource: set up infrastructure in advance
• Machine learning with MLlib
• Data Engineering/Data Preparation with C#, Scala, Spark SQL, Python
• Streaming Data
• Spark notebooks
Synapse SPARK Overview
Source: https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/apache-spark-overview
Data Explorer Pool
• Unified experience
• Real-time insights
• Scalability
• Security
• High performance
• Real-time ingestion
• Time series analysis
• Machine learning
Data Explorer Pool
Source: https://learn.microsoft.com/en-us/azure/synapse-analytics/data-explorer/data-explorer-overview
When to use Azure Synapse Analytics?
• Large-scale data warehousing
• Advanced analytics
• Data exploration and discovery
• Real-time analytics
• Data integration
• Integrated analytics
Synapse Analytics Vs. Synapse Private Hub
Feature | Azure Synapse Analytics | Azure Synapse Analytics Private Hub
Access | Public access over the internet | Private access over a private connection
Security | Data is encrypted at rest and in transit | Data never leaves your network
Compliance | Complies with a variety of data regulations | Can be used to comply with stricter data privacy regulations
Use cases | General-purpose data analytics | Secure access to Azure Synapse Analytics from an on-premises network or another virtual network
Azure Synapse – Use Case
• Propose a solution for ABC company to build real-time analytics using various data
sources such as Cosmos DB, Log Analytics, and SharePoint List Items. How can we
achieve this?
Demo
• Create Azure Synapse
• Walkthrough Azure Synapse properties
• Create Pools
• Run Samples
• Link Cosmos DB
• Create External table
• Data Explorer – Add table and export data / ingest data
• Power BI
Azure Synapse – Use Case
• Aggregation
• Azure Cosmos DB – Synapse Link, then external view
• Azure Log Analytics Workspace – Continuous export, then Parquet transformer using Spark, then external table
• SharePoint Lists – Continuous export, then Parquet transformer using Spark, then external table
• Presentation
• Power BI – Direct Access
• HTML controls – DW Queries
• Cost
• SQL pools – Serverless/Dedicated
• Spark Nodes
• https://azure.com/e/6233ac854ace4eddb06d15b8b056df21
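The aggregation step above ends with each source exposed as an external table or view that can be queried together. A rough sketch of that final "one view over three sources" idea in plain Python follows. It is illustrative only: `id` and `_ts` (Cosmos DB), `TimeGenerated` (Log Analytics), and `ID`, `Title`, `Modified` (SharePoint) follow the usual naming in those services, while `RecordId`, `Message`, the `title` field on Cosmos documents, and the `UnifiedRow` shape are assumptions made for the example. Timestamps are assumed to be ISO 8601 with second precision.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class UnifiedRow:
    source: str
    record_id: str
    timestamp: datetime  # always timezone-aware, UTC
    title: str


def parse_utc(text: str) -> datetime:
    """Parse an ISO 8601 string, treating a trailing 'Z' or a missing offset as UTC."""
    parsed = datetime.fromisoformat(text.replace("Z", "+00:00"))
    return parsed if parsed.tzinfo else parsed.replace(tzinfo=timezone.utc)


def from_cosmos(doc: dict) -> UnifiedRow:
    # Cosmos DB keeps the last-modified time in "_ts" as epoch seconds.
    ts = datetime.fromtimestamp(doc["_ts"], tz=timezone.utc)
    return UnifiedRow("cosmos", str(doc["id"]), ts, doc.get("title", ""))


def from_log_analytics(record: dict) -> UnifiedRow:
    # "RecordId" and "Message" are assumed column names for this sketch.
    return UnifiedRow("log_analytics", str(record["RecordId"]),
                      parse_utc(record["TimeGenerated"]), record.get("Message", ""))


def from_sharepoint(item: dict) -> UnifiedRow:
    return UnifiedRow("sharepoint", str(item["ID"]),
                      parse_utc(item["Modified"]), item.get("Title", ""))


def unified_view(cosmos_docs, log_records, sharepoint_items) -> list:
    """Union the three sources into one list ordered by timestamp."""
    rows = [from_cosmos(d) for d in cosmos_docs]
    rows += [from_log_analytics(r) for r in log_records]
    rows += [from_sharepoint(i) for i in sharepoint_items]
    return sorted(rows, key=lambda row: row.timestamp)
```

In Synapse itself the equivalent would be a SQL view that unions the external tables. The Python version only shows the normalization each source needs before it can share one schema.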
Diagrams:
Security on Azure Synapse
• Encryption of data at rest using TDE (Transparent Data Encryption)
• In-Transit (in motion) Encryption using TLS
• Key Management
• Customer Managed
• Bring your own key (BYOK)
• Must be enabled when creating the Azure Synapse workspace
• TDE Protector (key to encrypt DEK)
• Data Masking – Dynamic and Static
• Row-Level and Column-Level Security
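Dynamic data masking is easier to picture with a small simulation. The sketch below imitates the behavior of the masking styles in plain Python. It is not how the engine implements masking: the function names are hypothetical, and the handling of values too short for a partial mask is a simplification (only the padding is returned).

```python
import random


def mask_default(value: str) -> str:
    """Full masking: hide the whole string value."""
    return "xxxx"


def mask_email(value: str) -> str:
    """Expose only the first character, in the style of an email mask."""
    return value[:1] + "XXX@XXXX.com"


def mask_partial(value: str, prefix: int, padding: str, suffix: int) -> str:
    """Keep `prefix` leading and `suffix` trailing characters, pad the middle."""
    if prefix < 0 or suffix < 0:
        raise ValueError("prefix and suffix must be non-negative")
    if prefix + suffix >= len(value):
        # Simplification: reveal nothing when the value is too short.
        return padding
    return value[:prefix] + padding + value[len(value) - suffix:]


def mask_random(low: int, high: int, rng=None) -> int:
    """Replace a number with a random value in [low, high]."""
    rng = rng or random.Random()
    return rng.randint(low, high)


if __name__ == "__main__":
    print(mask_partial("Udai", 1, "XXXX", 1))
    print(mask_email("udai@example.com"))
    print(mask_random(1, 8))
```

In each case the stored value is untouched. Only what a non-privileged user sees in query results changes, which is why masking is usually combined with encryption and row-level or column-level security rather than used alone.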
Reference
• https://learn.microsoft.com/en-us/azure/synapse-analytics/?WT.mc_id=AZ-MVP-5004665
• https://techcommunity.microsoft.com/t5/azure-observability-blog/how-to-analyze-data-exported-from-log-analytics-data-using/ba-p/2547888?WT.mc_id=AZ-MVP-5004665
• https://www.youtube.com/watch?v=o2iFdU0EBLg&list=FLg-vqK9bYhHNecF-p-ZftLQ&index=1
• https://www.youtube.com/watch?v=lLrjaVdBuM0&list=FLg-vqK9bYhHNecF-p-ZftLQ&index=2&t=4712s
Thanks for your time and trust!
New Hampshire CLOUD .NET User Group


Editor's Notes

  1. Azure SQL Data Warehouse – a cloud-based enterprise data warehouse (EDW) that uses massively parallel processing (MPP) to run complex queries across petabytes of data quickly.
  2. Azure SQL Data Warehouse – a cloud-based enterprise data warehouse (EDW) that uses massively parallel processing (MPP) to run complex queries across petabytes of data quickly.
  3. Descriptive analytics answers the question “What is happening in my business?”. It is typically addressed by building a data warehouse in which historical data is persisted in relational tables for multidimensional modeling and reporting. Diagnostic analytics answers the question “Why is it happening?”. This may involve exploring information that already exists in a data warehouse, but typically involves a wider search of your data estate to find more data to support this type of analysis. Predictive analytics answers the question “What is likely to happen in the future based on previous trends and patterns?”. Prescriptive analytics enables autonomous decision making based on real-time or near real-time analysis of data, using predictive analytics.
  4. Data warehouse: the already popular Azure SQL Data Warehouse technology for storing and managing data for analysis and decision making, now delivered through SQL pools. Big Data engine: with Spark pools, engineers can run scalable analytics and Big Data processing in Spark languages. Serverless engine: query data lakes directly using SQL statements. Data flows: develop ETL flows that consume or receive data in your data warehouse or data lake with the same engine used by Azure Data Factory. Azure Data Lake Storage + Azure SQL Data Warehouse + Azure Analytics = Azure Synapse Analytics
  5. Data warehouse: the already popular Azure SQL Data Warehouse technology for storing and managing data for analysis and decision making, now delivered through SQL pools. Big Data engine: with Spark pools, engineers can run scalable analytics and Big Data processing in Spark languages. Serverless engine: query data lakes directly using SQL statements. Data flows: develop ETL flows that consume or receive data in your data warehouse or data lake with the same engine used by Azure Data Factory. Azure Data Lake Storage + Azure SQL Data Warehouse + Azure Analytics = Azure Synapse Analytics
  6. Azure SQL Data Warehouse – a cloud-based enterprise data warehouse (EDW) that uses massively parallel processing (MPP) to run complex queries across petabytes of data quickly.
  7. Quick ad-hoc queries: explore the data before you decide how to proceed. Logical data warehouse: an abstraction layer on top of raw data. Transform data in the lake: consume it directly using Power BI.
  8. The number of compute nodes ranges from 1 to 60, and is determined by the service level for Synapse SQL.
  9. Spark notebooks combine code, text, Markdown, and data visualization
  10. YARN (Yet Another Resource Negotiator) https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/apache-spark-machine-learning-mllib-notebook
  11. Spark notebooks combine code, text, Markdown, and data visualization
  12. https://learn.microsoft.com/en-us/training/modules/introduction-azure-synapse-analytics/4-when-use
  13. Azure SQL Data Warehouse – a cloud-based enterprise data warehouse (EDW) that uses massively parallel processing (MPP) to run complex queries across petabytes of data quickly.
  14. df_nyc.write.mode("overwrite").saveAsTable("dbo.PassengerData")
  15. Security notes
      Encryption
      • Double encryption on top of Microsoft-managed keys
      • TDE using Azure Key Vault (Get/Wrap/Unwrap DEK)
      • Key length: 2048 or 3072
      • Supported formats for an imported key: .pfx, .byok, .backup
      • Back up your keys before using them
      • Create a new backup whenever changes are made to the key
      Dynamic data masking
      • Masks data for non-privileged users
      • Ability to specify how much is revealed
      • Configured on specific database fields
      • Can be used alongside encryption, auditing, row-level security, etc.
      • Can be enabled via the Azure portal or T-SQL statements
      • Types of data masking
        full: xxxx
        partial: uxxx@xxx.com
        random: salary=10000; FUNCTION='random(1,8)'; masked=6
        custom string: name=Udai; FUNCTION='partial(1,"XXXX",1)'; masked=UXXXXi
      • Testing masking with T-SQL
        CREATE USER testuser WITHOUT LOGIN
        GRANT SELECT ON sales.customer TO testuser
        EXECUTE AS USER = 'testuser'
        SELECT ....
        REVERT
        GO
        GRANT UNMASK TO testuser
        REVOKE UNMASK FROM testuser
        SELECT c.name, tbl.name AS table_name, c.is_masked, c.masking_function FROM sys.masked_columns AS c JOIN sys.tables AS tbl ON c.[object_id] = tbl.[object_id] WHERE is_masked = 1
      How row-level security (RLS) works
      • Not permission based but predicate based
      • Security policy
      • Security predicate is an inline table-valued function (iTVF)
      • Filter predicate
      Creating RLS
      • Create table, insert rows, create users, create a schema (CREATE SCHEMA), create security predicate (CREATE FUNCTION), create security policy
      RLS best practices
      • Create a separate schema for the security predicate function
      • ALTER ANY SECURITY POLICY permission is required
      • Drop components in the following order: security policy, table, function, schemas
      • Avoid excessive table joins in the predicate function
      Column-level security (CLS)
      • Controls access to specific columns based on the user's context
      • Grant access to SQL users and Azure AD users
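
The predicate-based model in the security notes above (a filter predicate decides, row by row, what each user can see) can be sketched in a few lines of Python. This is an analogy rather than engine code: in Synapse the predicate is an inline table-valued function bound to a table by a security policy, and the table, column, and user names below are hypothetical.

```python
from typing import Callable

Row = dict
Predicate = Callable[[Row, str], bool]


def sales_rep_predicate(row: Row, user: str) -> bool:
    """Filter predicate: a rep sees only their own rows, the manager sees all."""
    return user == "manager" or row["sales_rep"] == user


def select_with_policy(rows: list, user: str, predicate: Predicate) -> list:
    """Return only the rows the predicate lets this user see."""
    return [row for row in rows if predicate(row, user)]


if __name__ == "__main__":
    # Hypothetical sales table used only for illustration.
    sales = [
        {"order_id": 1, "sales_rep": "alice", "amount": 120},
        {"order_id": 2, "sales_rep": "bob", "amount": 75},
        {"order_id": 3, "sales_rep": "alice", "amount": 300},
    ]
    for user in ("alice", "bob", "manager"):
        visible = select_with_policy(sales, user, sales_rep_predicate)
        print(user, [row["order_id"] for row in visible])
```

The query itself never changes between users. The policy silently filters the result, which is what distinguishes predicate-based security from permission-based grants on whole tables.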