SlideShare a Scribd company logo
1 of 72
Designing Big Data Analytics
Solutions on Azure
Mohamed Tawfik
Cloud Solutions Architect
Azure CoE - EMEA
The 4 Industrial Revolutions (by Christoph Roser at AllAboutLean.com)
Azure Data Landscape
Source: Mastering Azure Analytics, 1st Edition - Zoiner
Tejada, O'Reilly Media, Inc., April 2017
Architecting Big Data Solutions on Azure:
Custom Scenarios & Patterns
AZURE SQL DATA WAREHOUSE
AZURE SQL DATABASE
DATA MIGRATION SERVICE
DATA MIGRATION SERVICE
DATA MIGRATION SERVICE
DATA MIGRATION SERVICE
AZURE ANALYSIS SERVICES
BUSINESS APPS
CUSTOM APPS
CUSTOM APPS
BUSINESS APPS
ANALYTICAL DASHBOARDS
Scenario 1
SQL Data Warehouse
An illustration
Relational Data
. . . Blobs, Azure
Data Lake Store
Binary
Data
10001110110101111011
1101010101010111100
000101010101010110
0000111100111
Poly
Base
Clients
Excel
Power
BI
Tableau
. . .
Transact-SQL Query
. . .ComputeComputeCompute
AZURE CLI, AZURE DATA FACTORY
DATA MIGRATION SERVICE
AZURE SQL DATA WAREHOUSE ANALYTICAL DASHBOARDSAZURE ANALYSIS SERVICES
Scenario 2
New Pipeline Model
Rich pipeline orchestration
Triggers – ondemand, schedule, event
Data Movement as a
Service
Cloud, Hybrid
30 connectors provided
SSIS Package Execution
In a managed cloud environment
Use familiar tools, SSMS & SSDT
Author & Monitor
Programmability (Python, .NET, Powershell, etc)
Visual Tools (coming soon)
Stored Procedures
Hadoop on Azure
Trusted data
BI & analyticsData Lake Analytics
Custom Code
Machine Learning
Category Data store Supported as source Supported as sink
Azure
Azure Data Lake Store
Azure Blob storage
Azure SQL Database
Azure SQL Data Warehouse
Azure Table storage
Azure DocumentDB
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
Databases
SQL Server*
Oracle*
MySQL*
DB2*
Teradata*
PostgreSQL*
Sybase*
Cassandra*
MongoDB*
Amazon Redshift
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
✓
File
File System*
HDFS*
Amazon S3
✓
✓
✓
✓
Others
Salesforce
Generic ODBC*
Generic OData
Web Table (table from HTML)
GE Historian*
✓
✓
✓
✓
✓
AZURE CLI, AZURE DATA FACTORY
DATA MIGRATION SERVICE
AZURE SQL DATA WAREHOUSE ANALYTICAL DASHBOARDSAZURE ANALYSIS SERVICES
ExpressRoute
Scenario 3
AZURE STORAGE
Polybase
ANALYTICAL DASHBOARDS
AZURE SQL DATA WAREHOUSE
DATA FACTORY
DATA FACTORY
AZURE FUNCTIONS
Scenario 4
ANALYTICAL DASHBOARDS
AZURE SQL DATA WAREHOUSE
DATA FACTORY
DATA FACTORY
AZURE DATA LAKE STORE
Scenario 5
AZURE FUNCTIONS
No limits to SCALE
Store ANY DATA in its native format
HADOOP FILE SYSTEM (HDFS) for the
cloud
Optimized for analytics workload
PERFORMANCE
ENTERPRISE GRADE access control,
encryption at rest
A hyper scale repository for big
data analytics workloads
Map reduce
HBase
transactions
Any HDFS applicationHive query
Azure HDInsight
Hadoop WebHDFS client
Azure Data Lake Store
WebHDFS-compatible REST API
Spark queries
Enterprise grade security
ADL .NET SDK
ADL
PowerShell
ADL
XPlat CLI
ADL Node.js SDK ADL Java SDK ADL Python*
Your application
Azure and ADL Store REST APIs
Capability ADLS Azure Blob
Purpose Optimized for Analytics
Analysis using Batch, Interactive, Streaming, ML
General purpose storage scenarios
App backend, backup data, media storage for
streaming, log files, IoT telemetry, Big Data
analytics
Geographic Availability East US 2, Central US, North Europe All Data Centers
HDFS Yes (Web HDFS) No
Scale No Limit on Bandwidth or Storage size Limits
-5PB Storage (announced)
-50GBps Bandwidth
Authentication & Authorization Azure Active Directory
POSIX ACLs on Files and Folders
Access keys & SAS tokens
Structure Accounts / Folders / Files (with Hierarchical
folders)
Accounts / Containers / Blobs (flat namespace)
Encryption Yes Yes
Geo- Replication No Yes [LRS, GRS, RA-GRS]
Cost [1PB] $40K
Coming soon
HOT $20K
COOL $16K
LOB
Applications
SocialDevices
Clickstream
Sensors
Video
Web
Relational
A highly scalable, distributed, parallel file system in the cloud specifically designed to work
with a variety of big data analytics workloads
Azure Data Lake Store
Batch
Map
Reduce
Script
Pig
SQL
Hive
NoSQL
HBase
In-Memory
Spark
Predictive
R Server
Batch
U-SQL
HDInsight
ADL
Analytics
ANALYTICAL DASHBOARDS
AZURE SQL DATA WAREHOUSE
DATA FACTORY
DATA FACTORY
AZURE DATA LAKE STORE
Scenario 6
Azure Batch
Enable applications and algorithms
to easily and efficiently run in
parallel at scale
Rendering
Media transcoding & pre-/post-
processing
Test execution
Monte Carlo simulations
Genomics
Deep Learning
OCR
Data ingestion, processing, ETL
R at scale
Compiled MATLAB
Engineering simulations
Image analysis & processing
INPUT OUTPUT
Azure Batch
Concepts
Applications /
Algorithms
Queue
Pool of VMs
Jobs &
Tasks
Azure Batch Rendering GA
Queue
Upload assets
Submit job
Return outputs
Pay-per-minute
licensing
Windows and Linux VMs
Autodesk Maya
Plug-in
Batch Labs
x-plat client
Azure CLI /
PowerShell APIs
Monitor job
ANALYTICAL DASHBOARDS
AZURE SQL DATA WAREHOUSE
DATA FACTORY
DATA FACTORY
AZURE DATA LAKE STORE AZURE DATA LAKE ANALYTICS
Scenario 7
Data Lake Analytics Workloads
With BATCH workload, Data Lake Analytics is ideal for
• The transformation and preparation of data for use in other systems
• Analytics on VERY LARGE amounts of data
• Massively Parallel programs written in .NET, Python and R, scaled out with U-
SQL
• Performing Cognition at Scale on large collections
Data Lake Analytics
Data Lake Store
An illustration
U-SQL Query
. . .ComputeComputeCompute
Unstructured Data
. . .
U-SQL
Query
Query
Azure
Storage Blobs
Azure SQL
in VMs
Azure
SQL DB
Azure Data
Lake Analytics
Azure
SQL Data Warehouse
Azure
Data Lake Storage
Easily query data in multiple Azure data stores
without moving it to a single store
Embedded Artificial Intelligence
Host Deep Neural Networks (DNNs)
6 Built-in Cognitive Functions
– Face API
– Image Tagging
– Emotion analysis
– OCR
– Text Key Phrase Extraction
– Text Sentiment Analysis
Extract
Process
Output
User CodeUser Code
User Code
User Code
Declarative Framework
User Extensions
U-SQL Example
Extract
User Code
User Code
U-SQL
Declarative
+
Imperative
Structured
+
Semi-structured
+
Unstructured
Batch
+
Interactive
+
Streaming
+
Machine Learning
Programming models Data Workloads
a language that unifies
ANALYTICAL DASHBOARDS
AZURE SQL DATA WAREHOUSE
DATA FACTORY
DATA FACTORY
AZURE DATA LAKE STORE AZURE DATA LAKE ANALYTICS
Scenario 8
AZURE DATA LAKE ANALYTICS
Cleansing Analysis
Orchestration Key ManagementPrivate Connections Monitoring
AZURE EXPRESSROUTE AZURE DATA FACTORY AZURE KEY VAULT OPERATIONS MANAGEMENT SUITE
ANALYTICAL DASHBOARDS
AZURE SQL DATA WAREHOUSE
DATA FACTORY
DATA FACTORY
AZURE DATA LAKE STORE AZURE DATA LAKE ANALYTICS WEB & MOBILE APPS
AZURE STREAM ANALYTICS
Scenario 9
Azure Time Series Insights
Store and manage terabytes of time-series data
Explore and visualize billions of events simultaneously
Conduct root-cause analysis, and to compare multiple sites and assets
Illustrating an application
Stream Analytics
Time
Window
SELECT …
Written in Stream Analytics
Query Language, a subset
of T-SQL
Stream
A standing
query
Orchestration Key ManagementPrivate Connections Monitoring
AZURE EXPRESSROUTE AZURE DATA FACTORY AZURE KEY VAULT OPERATIONS MANAGEMENT SUITE
ANALYTICAL DASHBOARDS
AZURE SQL DATA WAREHOUSE
DATA FACTORY
DATA FACTORY
AZURE DATA LAKE STORE AZURE DATA LAKE ANALYTICS COSMOS DB WEB & MOBILE APPS
AZURE STREAM ANALYTICS
Scenario 10
Orchestration Key ManagementPrivate Connections Monitoring
AZURE EXPRESSROUTE AZURE DATA FACTORY AZURE KEY VAULT OPERATIONS MANAGEMENT SUITE
ANALYTICAL DASHBOARDS
AZURE SQL DATA WAREHOUSE
DATA FACTORY
DATA FACTORY
AZURE MACHINE LEARNING & MACHINE LEARNING SERVER
AZURE DATA LAKE STORE AZURE DATA LAKE ANALYTICS COSMOS DB WEB & MOBILE APPS
AZURE STREAM ANALYTICS
Scenario 11
$2,600.45
$2,294.58
$1,003.30
$8,488.32
Name Amount Fraudulent
Smith
Janet
John
Adams
No
Yes
Yes
No
What’s the pattern for
fraudulent
transactions?
$2,600.45
$2,294.58
$1,003.30
$8,488.32
Name Amount Fraudulent
Smith
Janet
John
Adams
No
Yes
Yes
No
Where
Issued
Where
Used
Age of
Cardholder
$200.12
$3,250.11
$8,156.20
$7,475.11
Pali
Jones
Hanford
Marx
USA
USA
USA
FRA
AUS
USA
USA
UK
22
29
25
64
58
43
27
32
No
No
Yes
No
USA
RUS
RUS
USA
JAP
RUS
RUS
GER
$540.00
$7,475.11
Norse
Edson
USA
USA
27
20
No
Yes
RUS
RUS
What’s the pattern
for fraudulent
transactions?
Illustrating the process
MICROSOFTAZURE
Model
Call Center Staff
Call Center
ApplicationBlobsDetailed
Call Data
ONPREMISES
CRM
Data
Data
for ML
Aggregated
Call Data
ADLA Azure ML
Azure Data Factory
Need a real-time
prediction of each caller’s
propensity to churn
Model is rebuil
and redeployed
regularly
Orchestration Key ManagementPrivate Connections Monitoring
AZURE EXPRESSROUTE AZURE DATA FACTORY AZURE KEY VAULT OPERATIONS MANAGEMENT SUITE
AZURE SQL DATA WAREHOUSE
DATA FACTORY
DATA FACTORY
AZURE MACHINE LEARNING & MACHINE LEARNING SERVER
AZURE DATA LAKE STORE AZURE DATA LAKE ANALYTICS COSMOS DB WEB & MOBILE APPS
AZURE STREAM ANALYTICS
Scenario 12
Power BI
Power BI
Embedded
http://bit.ly/pbie
Microsoft Azure
subscription
Embed
End users
Workspace
Workspace collection
1,N
Developer
Name
Admin Users
Endpoints
Keys
Gateways
Credentials
Geo Location
Tags
Name
Reports
Datasets
Tags
Your app
Azure SQL
Data Warehouse
Azure SQL Database
1,N
1,N
Power BI
Users
Permissions
Auth. providers
API keys
Token
+ Claim: Can view Report 1
+ Expiration: 5 minutes
User requests to view
Report 1
Validate token
API keys
Report 2
Workspace
Report 1
Application
Provide seamless authentication experiences
Provide seamless authentication experiences
Power BI
Users
Permissions
Auth. providers
API keys API keys
Report 2
Workspace
Report 1Report 1
Application
Row Level Security
Users
Application
Permissions
Auth. providers
Power BI
API keys
Report 2
Workspace
Report 1
Token
+ Claim: Can view Report 1
+ Expiration: 5 minutes
+ username: “user1”
+ roles: “sales”
API keys
Copy API keys to your application
Sign token
Provide seamless authentication experiences
Power BI REST API
Authentication flow: Web application
FAQ
• What is a report session and how is it billed?
• A session is a set of interactions between an end user and a Power BI Embedded report.
Each time a Power BI Embedded report is displayed to a user, a session is initiated and the
subscription holder will be charged for a session. Sessions are billed at a flat rate,
independent of the number of visual elements in a report or how frequently the report
content is refreshed. A session ends when either the user closes the report, or the session
times out after one hour.
• Do you offer any tools or guidance to help me estimate how many renders/session I
should expect? How will I know how many renders have been completed?
• The Azure Portal will provide billing details on how many renders / report sessions have
been performed against your subscription.
• Do I need a Power BI subscription in order to develop applications with Power BI
Embedded? How do I get started?
• As the application developer, you do not need to have a Power BI subscription in order to
create the reports and visualizations you wish to use in your application. You will need a
Microsoft Azure subscription and the free Power BI Desktop application.
Orchestration Key ManagementPrivate Connections Monitoring
AZURE EXPRESSROUTE AZURE DATA FACTORY AZURE KEY VAULT OPERATIONS MANAGEMENT SUITE
AZURE SQL DATA WAREHOUSE
DATA FACTORY
DATA FACTORY
AZURE MACHINE LEARNING & MACHINE LEARNING SERVER
AZURE DATA LAKE STORE AZURE DATA LAKE ANALYTICS COSMOS DB WEB & MOBILE APPS
AZURE STREAM ANALYTICS
Scenario 13
Power BI
COGNITIVE SERVICESBOT SERVICE Logic App
Orchestration Key ManagementPrivate Connections Monitoring
AZURE EXPRESSROUTE AZURE DATA FACTORY AZURE KEY VAULT OPERATIONS MANAGEMENT SUITE
AZURE SQL DATA WAREHOUSE
DATA FACTORY
DATA FACTORY
AZURE MACHINE LEARNING & MACHINE LEARNING SERVER
AZURE DATA LAKE STORE WEB & MOBILE APPS
Scenario 14
ANALYTICAL DASHBOARDS
AZURE HDINSIGHT
(Hadoop/Hive)
AZURE HDINSIGHT
(Hadoop/Storm)
AZURE HDINSIGHT
(Hadoop/Kafka)
Kafka
AZURE HDINSIGHT
(Hadoop/HBase)
COGNITIVE SERVICESBOT SERVICE Logic App
Clusters
Microsoft Azure Datacenter
HDInsight Cluster
VMVMVMVMVMVMVMVMVMVMVMVM
Created through the
Azure portal
Microsoft Hadoop Stack
Azure HDInsight
Machine
Learning
Local (HDFS) or Cloud (Azure Blob/Azure Data Lake Store)
Open source analytics
service for the Enterprise
Multi Region Availability
Available in >25 regions world-wide
Launched most recently in US West 2, and UK
regions
Available in China, Europe and US
Government clouds
IaaS Clusters Managed Clusters Big Data as-a-service
Best for…
Workloads
Administrative
Developer
Control &
configuration
Service Level
Agreement
TCO
CONTROL EASE OF USE AND ADOPTION
Orchestration Key ManagementPrivate Connections Monitoring
AZURE EXPRESSROUTE AZURE DATA FACTORY AZURE KEY VAULT OPERATIONS MANAGEMENT SUITE
AZURE SQL DATA WAREHOUSE
DATA FACTORY
DATA FACTORY
AZURE DATA LAKE STORE
Scenario 14
ANALYTICAL DASHBOARDS
AZURE HDINSIGHT
(Hadoop/Hive)
AZURE HDINSIGHT
(Hadoop/Storm)
AZURE HDINSIGHT
(Hadoop/Kafka)
Kafka
AZURE HDINSIGHT
(Hadoop/R)
Jupyter
Data Science
Notebooks
AZURE HDINSIGHT
(Hadoop/Spark)
Community Algorithms
Spark ML (PySpark, SparkR)
Caffe on Spark
BigDL on HDInsight
SparklyR
XGBoost
Supported by community
ISV Applications
H2O
Dataiku
Supported by ISV
Orchestration Key ManagementPrivate Connections Monitoring
AZURE EXPRESSROUTE AZURE DATA FACTORY AZURE KEY VAULT OPERATIONS MANAGEMENT SUITE
AZURE SQL DATA WAREHOUSE
DATA FACTORY
DATA FACTORY
AZURE DATA LAKE STORE
Scenario 15
ANALYTICAL DASHBOARDS
AZURE HDINSIGHT
(Hadoop/Hive)
AZURE HDINSIGHT
(Hadoop/Storm)
AZURE HDINSIGHT
(Hadoop/Kafka)
Kafka
AZURE HDINSIGHT
(Hadoop/R)
Jupyter
Data Science
Notebooks
AZURE HDINSIGHT
(Hadoop/Spark)
DATA CATALOG
Analyze
Enabling the Entire Enterprise Data Ecosystem
• Search
• Browse
• Filter
Discover
• Metadata
• Experts
• Context
Understand
• Your data
• Your tools
• Your way
Consume
• Tag
• Document
• Publish
Contribute
Source: Mastering Azure Analytics, 1st Edition - Zoiner
Tejada, O'Reilly Media, Inc., April 2017
Thank You
Mohamed Tawfik
Cloud Solutions Architect
Azure CoE - EMEA

More Related Content

What's hot

AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWSAWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWSAmazon Web Services
 
Microsoft cloud big data strategy
Microsoft cloud big data strategyMicrosoft cloud big data strategy
Microsoft cloud big data strategyJames Serra
 
Azure SQL Database Managed Instance
Azure SQL Database Managed InstanceAzure SQL Database Managed Instance
Azure SQL Database Managed InstanceJames Serra
 
Modern Data Warehouse Overview
Modern Data Warehouse OverviewModern Data Warehouse Overview
Modern Data Warehouse OverviewJohn Chang
 
Overview of Microsoft Appliances: Scaling SQL Server to Hundreds of Terabytes
Overview of Microsoft Appliances: Scaling SQL Server to Hundreds of TerabytesOverview of Microsoft Appliances: Scaling SQL Server to Hundreds of Terabytes
Overview of Microsoft Appliances: Scaling SQL Server to Hundreds of TerabytesJames Serra
 
AI for Intelligent Cloud and Intelligent Edge: Discover, Deploy, and Manage w...
AI for Intelligent Cloud and Intelligent Edge:Discover, Deploy, and Manage w...AI for Intelligent Cloud and Intelligent Edge:Discover, Deploy, and Manage w...
AI for Intelligent Cloud and Intelligent Edge: Discover, Deploy, and Manage w...John Chang
 
Azure Lowlands: An intro to Azure Data Lake
Azure Lowlands: An intro to Azure Data LakeAzure Lowlands: An intro to Azure Data Lake
Azure Lowlands: An intro to Azure Data LakeRick van den Bosch
 
Microsoft Azure Big Data Analytics
Microsoft Azure Big Data AnalyticsMicrosoft Azure Big Data Analytics
Microsoft Azure Big Data AnalyticsMark Kromer
 
Azure SQL Database & Azure SQL Data Warehouse
Azure SQL Database & Azure SQL Data WarehouseAzure SQL Database & Azure SQL Data Warehouse
Azure SQL Database & Azure SQL Data WarehouseMohamed Tawfik
 
Data warehouse con azure synapse analytics
Data warehouse con azure synapse analyticsData warehouse con azure synapse analytics
Data warehouse con azure synapse analyticsEduardo Castro
 
Designing a modern data warehouse in azure
Designing a modern data warehouse in azure   Designing a modern data warehouse in azure
Designing a modern data warehouse in azure Antonios Chatzipavlis
 
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...James Serra
 
Ibm big data ibm marriage of hadoop and data warehousing
Ibm big dataibm marriage of hadoop and data warehousingIbm big dataibm marriage of hadoop and data warehousing
Ibm big data ibm marriage of hadoop and data warehousing DataWorks Summit
 
Ai & Data Analytics 2018 - Azure Databricks for data scientist
Ai & Data Analytics 2018 - Azure Databricks for data scientistAi & Data Analytics 2018 - Azure Databricks for data scientist
Ai & Data Analytics 2018 - Azure Databricks for data scientistAlberto Diaz Martin
 
Customer migration to azure sql database from on-premises SQL, for a SaaS app...
Customer migration to azure sql database from on-premises SQL, for a SaaS app...Customer migration to azure sql database from on-premises SQL, for a SaaS app...
Customer migration to azure sql database from on-premises SQL, for a SaaS app...George Walters
 
Azure Purview Data Toboggan Erwin de Kreuk
Azure Purview Data Toboggan Erwin de KreukAzure Purview Data Toboggan Erwin de Kreuk
Azure Purview Data Toboggan Erwin de KreukErwin de Kreuk
 
Azure data platform overview
Azure data platform overviewAzure data platform overview
Azure data platform overviewJames Serra
 
Big Data Analytics in the Cloud with Microsoft Azure
Big Data Analytics in the Cloud with Microsoft AzureBig Data Analytics in the Cloud with Microsoft Azure
Big Data Analytics in the Cloud with Microsoft AzureMark Kromer
 
DataMinds 2022 Azure Purview Erwin de Kreuk
DataMinds 2022 Azure Purview Erwin de KreukDataMinds 2022 Azure Purview Erwin de Kreuk
DataMinds 2022 Azure Purview Erwin de KreukErwin de Kreuk
 

What's hot (20)

AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWSAWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
 
Microsoft cloud big data strategy
Microsoft cloud big data strategyMicrosoft cloud big data strategy
Microsoft cloud big data strategy
 
Azure SQL Database Managed Instance
Azure SQL Database Managed InstanceAzure SQL Database Managed Instance
Azure SQL Database Managed Instance
 
Modern Data Warehouse Overview
Modern Data Warehouse OverviewModern Data Warehouse Overview
Modern Data Warehouse Overview
 
Overview of Microsoft Appliances: Scaling SQL Server to Hundreds of Terabytes
Overview of Microsoft Appliances: Scaling SQL Server to Hundreds of TerabytesOverview of Microsoft Appliances: Scaling SQL Server to Hundreds of Terabytes
Overview of Microsoft Appliances: Scaling SQL Server to Hundreds of Terabytes
 
AI for Intelligent Cloud and Intelligent Edge: Discover, Deploy, and Manage w...
AI for Intelligent Cloud and Intelligent Edge:Discover, Deploy, and Manage w...AI for Intelligent Cloud and Intelligent Edge:Discover, Deploy, and Manage w...
AI for Intelligent Cloud and Intelligent Edge: Discover, Deploy, and Manage w...
 
Azure HDInsight
Azure HDInsightAzure HDInsight
Azure HDInsight
 
Azure Lowlands: An intro to Azure Data Lake
Azure Lowlands: An intro to Azure Data LakeAzure Lowlands: An intro to Azure Data Lake
Azure Lowlands: An intro to Azure Data Lake
 
Microsoft Azure Big Data Analytics
Microsoft Azure Big Data AnalyticsMicrosoft Azure Big Data Analytics
Microsoft Azure Big Data Analytics
 
Azure SQL Database & Azure SQL Data Warehouse
Azure SQL Database & Azure SQL Data WarehouseAzure SQL Database & Azure SQL Data Warehouse
Azure SQL Database & Azure SQL Data Warehouse
 
Data warehouse con azure synapse analytics
Data warehouse con azure synapse analyticsData warehouse con azure synapse analytics
Data warehouse con azure synapse analytics
 
Designing a modern data warehouse in azure
Designing a modern data warehouse in azure   Designing a modern data warehouse in azure
Designing a modern data warehouse in azure
 
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
 
Ibm big data ibm marriage of hadoop and data warehousing
Ibm big dataibm marriage of hadoop and data warehousingIbm big dataibm marriage of hadoop and data warehousing
Ibm big data ibm marriage of hadoop and data warehousing
 
Ai & Data Analytics 2018 - Azure Databricks for data scientist
Ai & Data Analytics 2018 - Azure Databricks for data scientistAi & Data Analytics 2018 - Azure Databricks for data scientist
Ai & Data Analytics 2018 - Azure Databricks for data scientist
 
Customer migration to azure sql database from on-premises SQL, for a SaaS app...
Customer migration to azure sql database from on-premises SQL, for a SaaS app...Customer migration to azure sql database from on-premises SQL, for a SaaS app...
Customer migration to azure sql database from on-premises SQL, for a SaaS app...
 
Azure Purview Data Toboggan Erwin de Kreuk
Azure Purview Data Toboggan Erwin de KreukAzure Purview Data Toboggan Erwin de Kreuk
Azure Purview Data Toboggan Erwin de Kreuk
 
Azure data platform overview
Azure data platform overviewAzure data platform overview
Azure data platform overview
 
Big Data Analytics in the Cloud with Microsoft Azure
Big Data Analytics in the Cloud with Microsoft AzureBig Data Analytics in the Cloud with Microsoft Azure
Big Data Analytics in the Cloud with Microsoft Azure
 
DataMinds 2022 Azure Purview Erwin de Kreuk
DataMinds 2022 Azure Purview Erwin de KreukDataMinds 2022 Azure Purview Erwin de Kreuk
DataMinds 2022 Azure Purview Erwin de Kreuk
 

Similar to Designing big data analytics solutions on azure

IoT & Azure, the field of possibilities
IoT & Azure, the field of possibilitiesIoT & Azure, the field of possibilities
IoT & Azure, the field of possibilitiesAlex Danvy
 
Microsoft Ignite AU 2017 - Orchestrating Big Data Pipelines with Azure Data F...
Microsoft Ignite AU 2017 - Orchestrating Big Data Pipelines with Azure Data F...Microsoft Ignite AU 2017 - Orchestrating Big Data Pipelines with Azure Data F...
Microsoft Ignite AU 2017 - Orchestrating Big Data Pipelines with Azure Data F...Lace Lofranco
 
Big Data Analytics from Azure Cloud to Power BI Mobile
Big Data Analytics from Azure Cloud to Power BI MobileBig Data Analytics from Azure Cloud to Power BI Mobile
Big Data Analytics from Azure Cloud to Power BI MobileRoy Kim
 
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Amazon Web Services
 
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Amazon Web Services
 
Azure Synapse 101 Webinar Presentation
Azure Synapse 101 Webinar PresentationAzure Synapse 101 Webinar Presentation
Azure Synapse 101 Webinar PresentationMatthew W. Bowers
 
Azure Data Lake Intro (SQLBits 2016)
Azure Data Lake Intro (SQLBits 2016)Azure Data Lake Intro (SQLBits 2016)
Azure Data Lake Intro (SQLBits 2016)Michael Rys
 
The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...
The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...
The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...The Hive
 
Building Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSBuilding Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSAmazon Web Services
 
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2Amazon Web Services
 
Trivadis Azure Data Lake
Trivadis Azure Data LakeTrivadis Azure Data Lake
Trivadis Azure Data LakeTrivadis
 
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solutionDifferentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solutionJames Serra
 
Azure Databricks & Spark @ Techorama 2018
Azure Databricks & Spark @ Techorama 2018Azure Databricks & Spark @ Techorama 2018
Azure Databricks & Spark @ Techorama 2018Nathan Bijnens
 
Using Data Lakes: Data Analytics Week SF
Using Data Lakes: Data Analytics Week SFUsing Data Lakes: Data Analytics Week SF
Using Data Lakes: Data Analytics Week SFAmazon Web Services
 
AWS re:Invent 2016 Day 1 Keynote re:Cap
AWS re:Invent 2016 Day 1 Keynote re:CapAWS re:Invent 2016 Day 1 Keynote re:Cap
AWS re:Invent 2016 Day 1 Keynote re:CapIan Massingham
 
AWS re:Invent 2016 Day 1 Keynote re:Cap
AWS re:Invent 2016 Day 1 Keynote re:CapAWS re:Invent 2016 Day 1 Keynote re:Cap
AWS re:Invent 2016 Day 1 Keynote re:CapAdrian Hornsby
 
Building IoT and Big Data Solutions on Azure
Building IoT and Big Data Solutions on AzureBuilding IoT and Big Data Solutions on Azure
Building IoT and Big Data Solutions on AzureIdo Flatow
 
Build Data Lakes & Analytics on AWS: Patterns & Best Practices - BDA305 - Ana...
Build Data Lakes & Analytics on AWS: Patterns & Best Practices - BDA305 - Ana...Build Data Lakes & Analytics on AWS: Patterns & Best Practices - BDA305 - Ana...
Build Data Lakes & Analytics on AWS: Patterns & Best Practices - BDA305 - Ana...Amazon Web Services
 
Azure databricks c sharp corner toronto feb 2019 heather grandy
Azure databricks c sharp corner toronto feb 2019 heather grandyAzure databricks c sharp corner toronto feb 2019 heather grandy
Azure databricks c sharp corner toronto feb 2019 heather grandyNilesh Shah
 

Similar to Designing big data analytics solutions on azure (20)

IoT & Azure, the field of possibilities
IoT & Azure, the field of possibilitiesIoT & Azure, the field of possibilities
IoT & Azure, the field of possibilities
 
Microsoft Ignite AU 2017 - Orchestrating Big Data Pipelines with Azure Data F...
Microsoft Ignite AU 2017 - Orchestrating Big Data Pipelines with Azure Data F...Microsoft Ignite AU 2017 - Orchestrating Big Data Pipelines with Azure Data F...
Microsoft Ignite AU 2017 - Orchestrating Big Data Pipelines with Azure Data F...
 
Using Data Lakes
Using Data LakesUsing Data Lakes
Using Data Lakes
 
Big Data Analytics from Azure Cloud to Power BI Mobile
Big Data Analytics from Azure Cloud to Power BI MobileBig Data Analytics from Azure Cloud to Power BI Mobile
Big Data Analytics from Azure Cloud to Power BI Mobile
 
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
 
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
 
Azure Synapse 101 Webinar Presentation
Azure Synapse 101 Webinar PresentationAzure Synapse 101 Webinar Presentation
Azure Synapse 101 Webinar Presentation
 
Azure Data Lake Intro (SQLBits 2016)
Azure Data Lake Intro (SQLBits 2016)Azure Data Lake Intro (SQLBits 2016)
Azure Data Lake Intro (SQLBits 2016)
 
The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...
The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...
The Hive Think Tank - The Microsoft Big Data Stack by Raghu Ramakrishnan, CTO...
 
Building Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSBuilding Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWS
 
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
 
Trivadis Azure Data Lake
Trivadis Azure Data LakeTrivadis Azure Data Lake
Trivadis Azure Data Lake
 
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solutionDifferentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
 
Azure Databricks & Spark @ Techorama 2018
Azure Databricks & Spark @ Techorama 2018Azure Databricks & Spark @ Techorama 2018
Azure Databricks & Spark @ Techorama 2018
 
Using Data Lakes: Data Analytics Week SF
Using Data Lakes: Data Analytics Week SFUsing Data Lakes: Data Analytics Week SF
Using Data Lakes: Data Analytics Week SF
 
AWS re:Invent 2016 Day 1 Keynote re:Cap
AWS re:Invent 2016 Day 1 Keynote re:CapAWS re:Invent 2016 Day 1 Keynote re:Cap
AWS re:Invent 2016 Day 1 Keynote re:Cap
 
AWS re:Invent 2016 Day 1 Keynote re:Cap
AWS re:Invent 2016 Day 1 Keynote re:CapAWS re:Invent 2016 Day 1 Keynote re:Cap
AWS re:Invent 2016 Day 1 Keynote re:Cap
 
Building IoT and Big Data Solutions on Azure
Building IoT and Big Data Solutions on AzureBuilding IoT and Big Data Solutions on Azure
Building IoT and Big Data Solutions on Azure
 
Build Data Lakes & Analytics on AWS: Patterns & Best Practices - BDA305 - Ana...
Build Data Lakes & Analytics on AWS: Patterns & Best Practices - BDA305 - Ana...Build Data Lakes & Analytics on AWS: Patterns & Best Practices - BDA305 - Ana...
Build Data Lakes & Analytics on AWS: Patterns & Best Practices - BDA305 - Ana...
 
Azure databricks c sharp corner toronto feb 2019 heather grandy
Azure databricks c sharp corner toronto feb 2019 heather grandyAzure databricks c sharp corner toronto feb 2019 heather grandy
Azure databricks c sharp corner toronto feb 2019 heather grandy
 

More from Mohamed Tawfik

Microsoft Azure Offerings and New Services
Microsoft Azure Offerings and New Services Microsoft Azure Offerings and New Services
Microsoft Azure Offerings and New Services Mohamed Tawfik
 
SCCM on Microsoft Azure
SCCM on Microsoft AzureSCCM on Microsoft Azure
SCCM on Microsoft AzureMohamed Tawfik
 
Upcoming Challenges in E-Learning & Online Learning Environments
Upcoming Challenges in E-Learning & Online Learning EnvironmentsUpcoming Challenges in E-Learning & Online Learning Environments
Upcoming Challenges in E-Learning & Online Learning EnvironmentsMohamed Tawfik
 
FINTDI 2011 - Remote Laboratories for Electrical & Electronic Subjects in New...
FINTDI 2011 - Remote Laboratories for Electrical & Electronic Subjects in New...FINTDI 2011 - Remote Laboratories for Electrical & Electronic Subjects in New...
FINTDI 2011 - Remote Laboratories for Electrical & Electronic Subjects in New...Mohamed Tawfik
 
UNED MURE Project Amman
UNED MURE Project AmmanUNED MURE Project Amman
UNED MURE Project AmmanMohamed Tawfik
 
VISIR INSTALLATION & START-UP GUIDE V.1
VISIR INSTALLATION & START-UP GUIDE V.1VISIR INSTALLATION & START-UP GUIDE V.1
VISIR INSTALLATION & START-UP GUIDE V.1Mohamed Tawfik
 
GOLC 2012 - On Standardizing the Management of LabVIEW-based Remote Laborator...
GOLC 2012 - On Standardizing the Management of LabVIEW-based Remote Laborator...GOLC 2012 - On Standardizing the Management of LabVIEW-based Remote Laborator...
GOLC 2012 - On Standardizing the Management of LabVIEW-based Remote Laborator...Mohamed Tawfik
 
REV 2011 - A New Node in the VISIR Community
REV 2011 - A New Node in the VISIR CommunityREV 2011 - A New Node in the VISIR Community
REV 2011 - A New Node in the VISIR CommunityMohamed Tawfik
 
REV 2013 - Grid Remote Laboratory Management System: Sahara Reaches Europe
REV 2013 - Grid Remote Laboratory Management System: Sahara Reaches EuropeREV 2013 - Grid Remote Laboratory Management System: Sahara Reaches Europe
REV 2013 - Grid Remote Laboratory Management System: Sahara Reaches EuropeMohamed Tawfik
 
TAEE 2011- State-of-the-Art Remote Laboratories for Industrial Electronics Ap...
TAEE 2011- State-of-the-Art Remote Laboratories for Industrial Electronics Ap...TAEE 2011- State-of-the-Art Remote Laboratories for Industrial Electronics Ap...
TAEE 2011- State-of-the-Art Remote Laboratories for Industrial Electronics Ap...Mohamed Tawfik
 
Copec ICECE 2011- DESIGN OF PRACTICAL ACTIVITIES IN ELECTRONICS
Copec ICECE 2011- DESIGN OF PRACTICAL ACTIVITIES IN ELECTRONICSCopec ICECE 2011- DESIGN OF PRACTICAL ACTIVITIES IN ELECTRONICS
Copec ICECE 2011- DESIGN OF PRACTICAL ACTIVITIES IN ELECTRONICSMohamed Tawfik
 
TAEE 2012- Shareable Educational Architectures for Remote Laboratories
TAEE 2012- Shareable Educational Architectures for Remote LaboratoriesTAEE 2012- Shareable Educational Architectures for Remote Laboratories
TAEE 2012- Shareable Educational Architectures for Remote LaboratoriesMohamed Tawfik
 
TAEE 2012- PAC - Performance-Centered Adaptive Curriculum for Employment Needs
TAEE 2012- PAC - Performance-Centered Adaptive Curriculum for Employment NeedsTAEE 2012- PAC - Performance-Centered Adaptive Curriculum for Employment Needs
TAEE 2012- PAC - Performance-Centered Adaptive Curriculum for Employment NeedsMohamed Tawfik
 
Educon 2012- On the Design of Remote Laboratories
Educon 2012- On the Design of Remote LaboratoriesEducon 2012- On the Design of Remote Laboratories
Educon 2012- On the Design of Remote LaboratoriesMohamed Tawfik
 
ASEE 2012 - Common Multidisciplinary Prototypes of Remote Laboratories in the...
ASEE 2012 - Common Multidisciplinary Prototypes of Remote Laboratories in the...ASEE 2012 - Common Multidisciplinary Prototypes of Remote Laboratories in the...
ASEE 2012 - Common Multidisciplinary Prototypes of Remote Laboratories in the...Mohamed Tawfik
 
TAEE2012-Putting Fundmentals of Electronic Circuits Practices online
TAEE2012-Putting Fundmentals of Electronic Circuits Practices onlineTAEE2012-Putting Fundmentals of Electronic Circuits Practices online
TAEE2012-Putting Fundmentals of Electronic Circuits Practices onlineMohamed Tawfik
 
Visir- Practicas Electronica Remotas Orientadas a la Industria
Visir- Practicas Electronica Remotas Orientadas a la IndustriaVisir- Practicas Electronica Remotas Orientadas a la Industria
Visir- Practicas Electronica Remotas Orientadas a la IndustriaMohamed Tawfik
 

More from Mohamed Tawfik (20)

Azure Cosmos DB
Azure Cosmos DBAzure Cosmos DB
Azure Cosmos DB
 
Microsoft Azure Offerings and New Services
Microsoft Azure Offerings and New Services Microsoft Azure Offerings and New Services
Microsoft Azure Offerings and New Services
 
SCCM on Microsoft Azure
SCCM on Microsoft AzureSCCM on Microsoft Azure
SCCM on Microsoft Azure
 
IBM Watson
IBM WatsonIBM Watson
IBM Watson
 
Upcoming Challenges in E-Learning & Online Learning Environments
Upcoming Challenges in E-Learning & Online Learning EnvironmentsUpcoming Challenges in E-Learning & Online Learning Environments
Upcoming Challenges in E-Learning & Online Learning Environments
 
FINTDI 2011 - Remote Laboratories for Electrical & Electronic Subjects in New...
FINTDI 2011 - Remote Laboratories for Electrical & Electronic Subjects in New...FINTDI 2011 - Remote Laboratories for Electrical & Electronic Subjects in New...
FINTDI 2011 - Remote Laboratories for Electrical & Electronic Subjects in New...
 
UNED MURE Project Amman
UNED MURE Project AmmanUNED MURE Project Amman
UNED MURE Project Amman
 
VISIR INSTALLATION & START-UP GUIDE V.1
VISIR INSTALLATION & START-UP GUIDE V.1VISIR INSTALLATION & START-UP GUIDE V.1
VISIR INSTALLATION & START-UP GUIDE V.1
 
GOLC 2012 - On Standardizing the Management of LabVIEW-based Remote Laborator...
GOLC 2012 - On Standardizing the Management of LabVIEW-based Remote Laborator...GOLC 2012 - On Standardizing the Management of LabVIEW-based Remote Laborator...
GOLC 2012 - On Standardizing the Management of LabVIEW-based Remote Laborator...
 
REV 2011 - A New Node in the VISIR Community
REV 2011 - A New Node in the VISIR CommunityREV 2011 - A New Node in the VISIR Community
REV 2011 - A New Node in the VISIR Community
 
REV 2013 - Grid Remote Laboratory Management System: Sahara Reaches Europe
REV 2013 - Grid Remote Laboratory Management System: Sahara Reaches EuropeREV 2013 - Grid Remote Laboratory Management System: Sahara Reaches Europe
REV 2013 - Grid Remote Laboratory Management System: Sahara Reaches Europe
 
TAEE 2011- State-of-the-Art Remote Laboratories for Industrial Electronics Ap...
TAEE 2011- State-of-the-Art Remote Laboratories for Industrial Electronics Ap...TAEE 2011- State-of-the-Art Remote Laboratories for Industrial Electronics Ap...
TAEE 2011- State-of-the-Art Remote Laboratories for Industrial Electronics Ap...
 
IEEE Presentation
IEEE PresentationIEEE Presentation
IEEE Presentation
 
Copec ICECE 2011- DESIGN OF PRACTICAL ACTIVITIES IN ELECTRONICS
Copec ICECE 2011- DESIGN OF PRACTICAL ACTIVITIES IN ELECTRONICSCopec ICECE 2011- DESIGN OF PRACTICAL ACTIVITIES IN ELECTRONICS
Copec ICECE 2011- DESIGN OF PRACTICAL ACTIVITIES IN ELECTRONICS
 
TAEE 2012- Shareable Educational Architectures for Remote Laboratories
TAEE 2012- Shareable Educational Architectures for Remote LaboratoriesTAEE 2012- Shareable Educational Architectures for Remote Laboratories
TAEE 2012- Shareable Educational Architectures for Remote Laboratories
 
TAEE 2012- PAC - Performance-Centered Adaptive Curriculum for Employment Needs
TAEE 2012- PAC - Performance-Centered Adaptive Curriculum for Employment NeedsTAEE 2012- PAC - Performance-Centered Adaptive Curriculum for Employment Needs
TAEE 2012- PAC - Performance-Centered Adaptive Curriculum for Employment Needs
 
Educon 2012- On the Design of Remote Laboratories
Educon 2012- On the Design of Remote LaboratoriesEducon 2012- On the Design of Remote Laboratories
Educon 2012- On the Design of Remote Laboratories
 
ASEE 2012 - Common Multidisciplinary Prototypes of Remote Laboratories in the...
ASEE 2012 - Common Multidisciplinary Prototypes of Remote Laboratories in the...ASEE 2012 - Common Multidisciplinary Prototypes of Remote Laboratories in the...
ASEE 2012 - Common Multidisciplinary Prototypes of Remote Laboratories in the...
 
TAEE2012-Putting Fundmentals of Electronic Circuits Practices online
TAEE2012-Putting Fundmentals of Electronic Circuits Practices onlineTAEE2012-Putting Fundmentals of Electronic Circuits Practices online
TAEE2012-Putting Fundmentals of Electronic Circuits Practices online
 
Visir- Practicas Electronica Remotas Orientadas a la Industria
Visir- Practicas Electronica Remotas Orientadas a la IndustriaVisir- Practicas Electronica Remotas Orientadas a la Industria
Visir- Practicas Electronica Remotas Orientadas a la Industria
 

Recently uploaded

SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 

Recently uploaded (20)

SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 

Designing big data analytics solutions on azure

  • 1. Designing Big Data Analytics Solutions on Azure Mohamed Tawfik Cloud Solutions Architect Azure CoE - EMEA
  • 2.
  • 3. The 4 Industrial Revolutions (by Christoph Roser at AllAboutLean.com)
  • 4.
  • 6. Source: Mastering Azure Analytics, 1st Edition - Zoiner Tejada, O'Reilly Media, Inc., April 2017
  • 7. Architecting Big Data Solutions on Azure: Custom Scenarios & Patterns
  • 8. AZURE SQL DATA WAREHOUSE AZURE SQL DATABASE DATA MIGRATION SERVICE DATA MIGRATION SERVICE DATA MIGRATION SERVICE DATA MIGRATION SERVICE AZURE ANALYSIS SERVICES BUSINESS APPS CUSTOM APPS CUSTOM APPS BUSINESS APPS ANALYTICAL DASHBOARDS Scenario 1
  • 9. SQL Data Warehouse An illustration Relational Data . . . Blobs, Azure Data Lake Store Binary Data 10001110110101111011 1101010101010111100 000101010101010110 0000111100111 Poly Base Clients Excel Power BI Tableau . . . Transact-SQL Query . . .ComputeComputeCompute
  • 10. AZURE CLI, AZURE DATA FACTORY DATA MIGRATION SERVICE AZURE SQL DATA WAREHOUSE ANALYTICAL DASHBOARDSAZURE ANALYSIS SERVICES Scenario 2
  • 11. New Pipeline Model Rich pipeline orchestration Triggers – ondemand, schedule, event Data Movement as a Service Cloud, Hybrid 30 connectors provided SSIS Package Execution In a managed cloud environment Use familiar tools, SSMS & SSDT Author & Monitor Programmability (Python, .NET, Powershell, etc) Visual Tools (coming soon) Stored Procedures Hadoop on Azure Trusted data BI & analyticsData Lake Analytics Custom Code Machine Learning
  • 12. Category Data store Supported as source Supported as sink Azure Azure Data Lake Store Azure Blob storage Azure SQL Database Azure SQL Data Warehouse Azure Table storage Azure DocumentDB ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ Databases SQL Server* Oracle* MySQL* DB2* Teradata* PostgreSQL* Sybase* Cassandra* MongoDB* Amazon Redshift ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ File File System* HDFS* Amazon S3 ✓ ✓ ✓ ✓ Others Salesforce Generic ODBC* Generic OData Web Table (table from HTML) GE Historian* ✓ ✓ ✓ ✓ ✓
  • 13.
  • 14. AZURE CLI, AZURE DATA FACTORY DATA MIGRATION SERVICE AZURE SQL DATA WAREHOUSE ANALYTICAL DASHBOARDSAZURE ANALYSIS SERVICES ExpressRoute Scenario 3
  • 15. AZURE STORAGE Polybase ANALYTICAL DASHBOARDS AZURE SQL DATA WAREHOUSE DATA FACTORY DATA FACTORY AZURE FUNCTIONS Scenario 4
  • 16. ANALYTICAL DASHBOARDS AZURE SQL DATA WAREHOUSE DATA FACTORY DATA FACTORY AZURE DATA LAKE STORE Scenario 5 AZURE FUNCTIONS
  • 17. No limits to SCALE Store ANY DATA in its native format HADOOP FILE SYSTEM (HDFS) for the cloud Optimized for analytics workload PERFORMANCE ENTERPRISE GRADE access control, encryption at rest A hyper scale repository for big data analytics workloads
  • 18. Map reduce HBase transactions Any HDFS applicationHive query Azure HDInsight Hadoop WebHDFS client Azure Data Lake Store WebHDFS-compatible REST API Spark queries
  • 20. ADL .NET SDK ADL PowerShell ADL XPlat CLI ADL Node.js SDK ADL Java SDK ADL Python* Your application Azure and ADL Store REST APIs
  • 21. Capability ADLS Azure Blob Purpose Optimized for Analytics Analysis using Batch, Interactive, Streaming, ML General purpose storage scenarios App backend, backup data, media storage for streaming, log files, IoT telemetry, Big Data analytics Geographic Availability East US 2, Central US, North Europe All Data Centers HDFS Yes (Web HDFS) No Scale No Limit on Bandwidth or Storage size Limits -5PB Storage (announced) -50GBps Bandwidth Authentication & Authorization Azure Active Directory POSIX ACLs on Files and Folders Access keys & SAS tokens Structure Accounts / Folders / Files (with Hierarchical folders) Accounts / Containers / Blobs (flat namespace) Encryption Yes Yes Geo- Replication No Yes [LRS, GRS, RA-GRS] Cost [1PB] $40K Coming soon HOT $20K COOL $16K
  • 22. LOB Applications SocialDevices Clickstream Sensors Video Web Relational A highly scalable, distributed, parallel file system in the cloud specifically designed to work with a variety of big data analytics workloads Azure Data Lake Store Batch Map Reduce Script Pig SQL Hive NoSQL HBase In-Memory Spark Predictive R Server Batch U-SQL HDInsight ADL Analytics
  • 23. ANALYTICAL DASHBOARDS AZURE SQL DATA WAREHOUSE DATA FACTORY DATA FACTORY AZURE DATA LAKE STORE Scenario 6
  • 24. Azure Batch Enable applications and algorithms to easily and efficiently run in parallel at scale Rendering Media transcoding & pre-/post- processing Test execution Monte Carlo simulations Genomics Deep Learning OCR Data ingestion, processing, ETL R at scale Compiled MATLAB Engineering simulations Image analysis & processing
  • 25. INPUT OUTPUT Azure Batch Concepts Applications / Algorithms Queue Pool of VMs Jobs & Tasks
  • 26. Azure Batch Rendering GA Queue Upload assets Submit job Return outputs Pay-per-minute licensing Windows and Linux VMs Autodesk Maya Plug-in Batch Labs x-plat client Azure CLI / PowerShell APIs Monitor job
  • 27. ANALYTICAL DASHBOARDS AZURE SQL DATA WAREHOUSE DATA FACTORY DATA FACTORY AZURE DATA LAKE STORE AZURE DATA LAKE ANALYTICS Scenario 7
  • 28. Data Lake Analytics Workloads With BATCH workload, Data Lake Analytics is ideal for • The transformation and preparation of data for use in other systems • Analytics on VERY LARGE amounts of data • Massively Parallel programs written in .NET, Python and R, scaled out with U- SQL • Performing Cognition at Scale on large collections
  • 29. Data Lake Analytics Data Lake Store An illustration U-SQL Query . . .ComputeComputeCompute Unstructured Data . . .
  • 30. U-SQL Query Query Azure Storage Blobs Azure SQL in VMs Azure SQL DB Azure Data Lake Analytics Azure SQL Data Warehouse Azure Data Lake Storage Easily query data in multiple Azure data stores without moving it to a single store
  • 31. Embedded Artificial Intelligence Host Deep Neural Networks (DNNs) 6 Built-in Cognitive Functions – Face API – Image Tagging – Emotion analysis – OCR – Text Key Phrase Extraction – Text Sentiment Analysis
  • 32. Extract Process Output User CodeUser Code User Code User Code Declarative Framework User Extensions U-SQL Example Extract User Code User Code
  • 34. ANALYTICAL DASHBOARDS AZURE SQL DATA WAREHOUSE DATA FACTORY DATA FACTORY AZURE DATA LAKE STORE AZURE DATA LAKE ANALYTICS Scenario 8 AZURE DATA LAKE ANALYTICS Cleansing Analysis
  • 35. Orchestration Key ManagementPrivate Connections Monitoring AZURE EXPRESSROUTE AZURE DATA FACTORY AZURE KEY VAULT OPERATIONS MANAGEMENT SUITE ANALYTICAL DASHBOARDS AZURE SQL DATA WAREHOUSE DATA FACTORY DATA FACTORY AZURE DATA LAKE STORE AZURE DATA LAKE ANALYTICS WEB & MOBILE APPS AZURE STREAM ANALYTICS Scenario 9 Azure Time Series Insights
  • 36. Store and manage terabytes of time-series data Explore and visualize billions of events simultaneously Conduct root-cause analysis, and to compare multiple sites and assets
  • 37. Illustrating an application Stream Analytics Time Window SELECT … Written in Stream Analytics Query Language, a subset of T-SQL Stream A standing query
  • 38. Orchestration Key ManagementPrivate Connections Monitoring AZURE EXPRESSROUTE AZURE DATA FACTORY AZURE KEY VAULT OPERATIONS MANAGEMENT SUITE ANALYTICAL DASHBOARDS AZURE SQL DATA WAREHOUSE DATA FACTORY DATA FACTORY AZURE DATA LAKE STORE AZURE DATA LAKE ANALYTICS COSMOS DB WEB & MOBILE APPS AZURE STREAM ANALYTICS Scenario 10
  • 39.
  • 40. Orchestration Key ManagementPrivate Connections Monitoring AZURE EXPRESSROUTE AZURE DATA FACTORY AZURE KEY VAULT OPERATIONS MANAGEMENT SUITE ANALYTICAL DASHBOARDS AZURE SQL DATA WAREHOUSE DATA FACTORY DATA FACTORY AZURE MACHINE LEARNING & MACHINE LEARNING SERVER AZURE DATA LAKE STORE AZURE DATA LAKE ANALYTICS COSMOS DB WEB & MOBILE APPS AZURE STREAM ANALYTICS Scenario 11
  • 42. $2,600.45 $2,294.58 $1,003.30 $8,488.32 Name Amount Fraudulent Smith Janet John Adams No Yes Yes No Where Issued Where Used Age of Cardholder $200.12 $3,250.11 $8,156.20 $7,475.11 Pali Jones Hanford Marx USA USA USA FRA AUS USA USA UK 22 29 25 64 58 43 27 32 No No Yes No USA RUS RUS USA JAP RUS RUS GER $540.00 $7,475.11 Norse Edson USA USA 27 20 No Yes RUS RUS What’s the pattern for fraudulent transactions?
  • 44. MICROSOFTAZURE Model Call Center Staff Call Center ApplicationBlobsDetailed Call Data ONPREMISES CRM Data Data for ML Aggregated Call Data ADLA Azure ML Azure Data Factory Need a real-time prediction of each caller’s propensity to churn Model is rebuil and redeployed regularly
  • 45. Orchestration Key ManagementPrivate Connections Monitoring AZURE EXPRESSROUTE AZURE DATA FACTORY AZURE KEY VAULT OPERATIONS MANAGEMENT SUITE AZURE SQL DATA WAREHOUSE DATA FACTORY DATA FACTORY AZURE MACHINE LEARNING & MACHINE LEARNING SERVER AZURE DATA LAKE STORE AZURE DATA LAKE ANALYTICS COSMOS DB WEB & MOBILE APPS AZURE STREAM ANALYTICS Scenario 12 Power BI Power BI Embedded
  • 47. Microsoft Azure subscription Embed End users Workspace Workspace collection 1,N Developer Name Admin Users Endpoints Keys Gateways Credentials Geo Location Tags Name Reports Datasets Tags Your app Azure SQL Data Warehouse Azure SQL Database 1,N 1,N
  • 48. Power BI Users Permissions Auth. providers API keys Token + Claim: Can view Report 1 + Expiration: 5 minutes User requests to view Report 1 Validate token API keys Report 2 Workspace Report 1 Application Provide seamless authentication experiences
  • 49. Provide seamless authentication experiences Power BI Users Permissions Auth. providers API keys API keys Report 2 Workspace Report 1Report 1 Application
  • 51. Users Application Permissions Auth. providers Power BI API keys Report 2 Workspace Report 1 Token + Claim: Can view Report 1 + Expiration: 5 minutes + username: “user1” + roles: “sales” API keys Copy API keys to your application Sign token Provide seamless authentication experiences
  • 52. Power BI REST API Authentication flow: Web application
  • 53.
  • 54. FAQ • What is a report session and how is it billed? • A session is a set of interactions between an end user and a Power BI Embedded report. Each time a Power BI Embedded report is displayed to a user, a session is initiated and the subscription holder will be charged for a session. Sessions are billed at a flat rate, independent of the number of visual elements in a report or how frequently the report content is refreshed. A session ends when either the user closes the report, or the session times out after one hour. • Do you offer any tools or guidance to help me estimate how many renders/session I should expect? How will I know how many renders have been completed? • The Azure Portal will provide billing details on how many renders / report sessions have been performed against your subscription. • Do I need a Power BI subscription in order to develop applications with Power BI Embedded? How do I get started? • As the application developer, you do not need to have a Power BI subscription in order to create the reports and visualizations you wish to use in your application. You will need a Microsoft Azure subscription and the free Power BI Desktop application.
  • 55. Orchestration Key ManagementPrivate Connections Monitoring AZURE EXPRESSROUTE AZURE DATA FACTORY AZURE KEY VAULT OPERATIONS MANAGEMENT SUITE AZURE SQL DATA WAREHOUSE DATA FACTORY DATA FACTORY AZURE MACHINE LEARNING & MACHINE LEARNING SERVER AZURE DATA LAKE STORE AZURE DATA LAKE ANALYTICS COSMOS DB WEB & MOBILE APPS AZURE STREAM ANALYTICS Scenario 13 Power BI COGNITIVE SERVICESBOT SERVICE Logic App
  • 56. Orchestration Key ManagementPrivate Connections Monitoring AZURE EXPRESSROUTE AZURE DATA FACTORY AZURE KEY VAULT OPERATIONS MANAGEMENT SUITE AZURE SQL DATA WAREHOUSE DATA FACTORY DATA FACTORY AZURE MACHINE LEARNING & MACHINE LEARNING SERVER AZURE DATA LAKE STORE WEB & MOBILE APPS Scenario 14 ANALYTICAL DASHBOARDS AZURE HDINSIGHT (Hadoop/Hive) AZURE HDINSIGHT (Hadoop/Storm) AZURE HDINSIGHT (Hadoop/Kafka) Kafka AZURE HDINSIGHT (Hadoop/HBase) COGNITIVE SERVICESBOT SERVICE Logic App
  • 57. Clusters Microsoft Azure Datacenter HDInsight Cluster VMVMVMVMVMVMVMVMVMVMVMVM Created through the Azure portal
  • 58.
  • 59. Microsoft Hadoop Stack Azure HDInsight Machine Learning Local (HDFS) or Cloud (Azure Blob/Azure Data Lake Store)
  • 60. Open source analytics service for the Enterprise
  • 61. Multi Region Availability Available in >25 regions world-wide Launched most recently in US West 2, and UK regions Available in China, Europe and US Government clouds
  • 62. IaaS Clusters Managed Clusters Big Data as-a-service Best for… Workloads Administrative Developer Control & configuration Service Level Agreement TCO CONTROL EASE OF USE AND ADOPTION
  • 63. Orchestration Key ManagementPrivate Connections Monitoring AZURE EXPRESSROUTE AZURE DATA FACTORY AZURE KEY VAULT OPERATIONS MANAGEMENT SUITE AZURE SQL DATA WAREHOUSE DATA FACTORY DATA FACTORY AZURE DATA LAKE STORE Scenario 14 ANALYTICAL DASHBOARDS AZURE HDINSIGHT (Hadoop/Hive) AZURE HDINSIGHT (Hadoop/Storm) AZURE HDINSIGHT (Hadoop/Kafka) Kafka AZURE HDINSIGHT (Hadoop/R) Jupyter Data Science Notebooks AZURE HDINSIGHT (Hadoop/Spark)
  • 64. Community Algorithms Spark ML (PySpark, SparkR) Caffe on Spark BigDL on HDInsight SparklyR XGBoost Supported by community ISV Applications H2O Dataiku Supported by ISV
  • 65.
  • 66. Orchestration Key ManagementPrivate Connections Monitoring AZURE EXPRESSROUTE AZURE DATA FACTORY AZURE KEY VAULT OPERATIONS MANAGEMENT SUITE AZURE SQL DATA WAREHOUSE DATA FACTORY DATA FACTORY AZURE DATA LAKE STORE Scenario 15 ANALYTICAL DASHBOARDS AZURE HDINSIGHT (Hadoop/Hive) AZURE HDINSIGHT (Hadoop/Storm) AZURE HDINSIGHT (Hadoop/Kafka) Kafka AZURE HDINSIGHT (Hadoop/R) Jupyter Data Science Notebooks AZURE HDINSIGHT (Hadoop/Spark) DATA CATALOG
  • 67. Analyze Enabling the Entire Enterprise Data Ecosystem • Search • Browse • Filter Discover • Metadata • Experts • Context Understand • Your data • Your tools • Your way Consume • Tag • Document • Publish Contribute
  • 68.
  • 69.
  • 70.
  • 71. Source: Mastering Azure Analytics, 1st Edition - Zoiner Tejada, O'Reilly Media, Inc., April 2017
  • 72. Thank You Mohamed Tawfik Cloud Solutions Architect Azure CoE - EMEA

Editor's Notes

  1. Add key for the coluors
  2. Add key for the coluors
  3. Add key for the coluors
  4. Notes: Web jobs can be used for streaming processing when set to continuous, functions can only be triggered or scheduled so they are not suitable. In some cases logic apps might fit for orchestrating specific tasks Azure Data Factor and Oozie are the main orchestrators offered in Azure Apache Oozie is a Java web application that does workflow coordination for Hadoop jobs. In Oozie, a workflow is defined as directed acyclic graphs (DAGs) of actions. It supports different types of Hadoop jobs, such as MapReduce, Streaming, Pig, Hive, Sqoop, and more. Not only these, but also system-specific jobs, such as shell scripts and Java programs. Apache Sqoop is a tool to transfer bulk data to and from Hadoop and relational databases as efficiently as possible. It is used to import data from relational database management systems (RDBMS)— such as Oracle, MySQL, SQL Server, or any other structured relational database—and into the HDFS. It then does processing and/or transformation on the data using Hive or MapReduce, and then exports the data back to the RDBMS.
  5. Add key for the coluors
  6. Add key for the coluors
  7. Add key for the coluors
  8. Add key for the coluors
  9. Add key for the coluors
  10. Add key for the coluors
  11. Add key for the coluors
  12. Add key for the coluors
  13. Add key for the coluors
  14. Add key for the coluors
  15. Add key for the coluors
  16. Add key for the coluors
  17. Add key for the coluors
  18. Add key for the coluors
  19. Add key for the coluors
  20. Add key for the coluors
  21. Add key for the coluors
  22. Add key for the coluors
  23. Add key for the coluors
  24. Add key for the coluors
  25. Add key for the coluors
  26. Add key for the coluors
  27. Add key for the coluors
  28. Add key for the coluors
  29. Add key for the coluors
  30. Add key for the coluors
  31. Add key for the coluors
  32. Add key for the coluors
  33. Add key for the coluors
  34. Add key for the coluors
  35. Add key for the coluors
  36. Add key for the coluors
  37. Add key for the coluors
  38. Add key for the coluors
  39. Add key for the coluors
  40. Add key for the coluors
  41. Add key for the coluors
  42. Add key for the coluors
  43. Add key for the coluors
  44. Add key for the coluors
  45. Add key for the coluors
  46. Add key for the coluors
  47. Add key for the coluors
  48. Add key for the coluors
  49. Add key for the coluors
  50. Add key for the coluors
  51. Add key for the coluors
  52. Add key for the coluors
  53. Add key for the coluors
  54. Add key for the coluors
  55. Add key for the coluors
  56. Add key for the coluors
  57. Add key for the coluors
  58. Add key for the coluors
  59. Add key for the coluors
  60. Add key for the coluors
  61. Add key for the coluors
  62. Add key for the coluors
  63. Add key for the coluors
  64. Add key for the coluors
  65. Notes: Web jobs can be used for streaming processing when set to continuous, functions can only be triggered or scheduled so they are not suitable. In some cases logic apps might fit for orchestrating specific tasks Azure Data Factor and Oozie are the main orchestrators offered in Azure Apache Oozie is a Java web application that does workflow coordination for Hadoop jobs. In Oozie, a workflow is defined as directed acyclic graphs (DAGs) of actions. It supports different types of Hadoop jobs, such as MapReduce, Streaming, Pig, Hive, Sqoop, and more. Not only these, but also system-specific jobs, such as shell scripts and Java programs. Apache Sqoop is a tool to transfer bulk data to and from Hadoop and relational databases as efficiently as possible. It is used to import data from relational database management systems (RDBMS)— such as Oracle, MySQL, SQL Server, or any other structured relational database—and into the HDFS. It then does processing and/or transformation on the data using Hive or MapReduce, and then exports the data back to the RDBMS.