SlideShare a Scribd company logo
© Copyright Microsoft Corporation. All rights reserved.
FOR USE ONLY AS PART OF MICROSOFT VIRTUAL TRAINING DAYS PROGRAM. THESE MATERIALS ARE NOT AUTHORIZED
FOR DISTRIBUTION, REPRODUCTION OR OTHER USE BY NON-MICROSOFT PARTIES.
We will be starting shortly
Microsoft Azure Virtual
Training Day: Data
Fundamentals
About
this course
Course objectives:
 Describe core data concepts
 Identify services for relational data
 Identify services for non-relational data
 Identify services for data analytics
This course is supplemented by online training at
https://aka.ms/AzureLearn_DataFundamentals
Course Agenda Module 1: Explore fundamentals of data
 Core data concepts
 Data roles and services
Module 2: Explore fundamentals of relational data in Azure
 Explore relational data concepts
 Explore Azure services for relational data
Module 3: Explore fundamentals of non-relational data in Azure
 Fundamentals of Azure Storage
 Fundamentals of Azure Cosmos DB
Module 4: Explore fundamentals of large-scale data warehousing
 Large-scale data warehousing
Module 5: Explore fundamentals of real-time analytics
 Streaming and real-time analytics
Module 6: Explore fundamentals of data visualization
 Data visualization
Demos
• Demos in this course are based
on exercises in Microsoft Learn
Explore Fundamentals of Data
Module 1:
• Lesson 1: Core data concepts
• Lesson 2: Data roles and services
Lesson 1: Core Data Concepts
What is data?
Values used to record information – often representing entities that have one or more attributes
Structured
Customer
ID FirstName LastName Email Address
1 Joe Jones
joe@litware.c
om
1 Main St.
2 Samir Nadoy
samir@north
wind.com
123 Elm Pl.
Product
ID Name Price
123 Hammer 2.99
162 Screwdriver 3.49
201 Wrench 4.25
Semi-structured Unstructured
{
"firstName": "Joe",
"lastName": "Jones",
"address":
{
"streetAddress": "1 Main
St.",
"city": "New York",
"state": "NY",
"postalCode": "10099"
},
"contact":
[
{
"type": "home",
"number": "555 123-1234"
},
{
"type": "email",
"address":
"joe@litware.com"
}
]
}
{
"firstName": "Samir",
"lastName": "Nadoy",
"address":
{
"streetAddress": "123 Elm
Pl.",
"unit": "500",
"city": "Seattle",
"state": "WA",
"postalCode": "98999"
},
"contact":
[
{
"type": "email",
"address":
"samir@northwind.com"
}
]
}
How is data stored?
Files
{
"customers":
[
{ "firstName": "Joe", "lastName": "Jones"},
{ "firstName": "Samir", "lastName": "Nadoy"}
]
}
Databases
Graph
Transactional data workloads
Order
… … …
… … …
* * *
Analytical data workloads
▲
▼
▲
DW
DL
Lesson 2: Data Roles and Services
Data professional roles
Database Administrator
Database provisioning,
configuration and management
Database security and user access
Database backups and resiliency
Database performance monitoring
and optimization
Data Engineer
Data integration pipelines and ETL
processes
Data cleansing and transformation
Analytical data store schemas and
data loads
Data Analyst
Analytical modeling
Data reporting and summarization
Data visualization
Microsoft cloud services for data
Data stores Data engineering and analytics
Explore Fundamentals of Relational Data in Azure
Module 2:
• Lesson 1: Explore relational data concepts
• Lesson 2: Explore Azure services for relational data
Lesson 1: Explore Relational Data
Concepts
Relational tables
 Data is stored in tables
 Tables consists of rows and columns
 All rows have the same columns
 Each column is assigned a datatype
Customer
ID FirstName MiddleName LastName Email Address City
1 Joe David Jones
joe@litware.co
m
1 Main St. Seattle
2 Samir Nadoy
samir@northwi
nd.com
123 Elm Pl. New York
Product
ID Name Price
123 Hammer 2.99
162 Screwdriver 3.49
201 Wrench 4.25
Order
OrderNo OrderDate Customer
1000 1/1/2022 1
1001 1/1/2022 2
LineItem
OrderNo ItemNo ProductID Quantity
1000 1 123 1
1000 2 201 2
1001 1 123 2
Normalization
 Separate each entity into its own table
 Separate each discrete attribute into its own
column
 Uniquely identify each entity instance (row)
using a primary key
 Use foreign key columns to link related entities
Customer
ID FirstName LastName Address City
1 Joe Jones 1 Main St. Seattle
2 Samir Nadoy 123 Elm Pl. New York
Product
ID Name Price
123 Hammer 2.99
162 Screwdriver 3.49
201 Wrench 4.25
Order
OrderNo OrderDate Customer
1000 1/1/2022 1
1001 1/1/2022 2
LineItem
OrderNo ItemNo ProductID Quantity
1000 1 123 1
1000 2 201 2
1001 1 123 2
Sales Data
OrderNo OrderDate Customer Product Quantity
1000 1/1/2022 Joe Jones, 1 Main St, Seattle Hammer ($2.99) 1
1000 1/1/2022 Joe Jones- 1 Main St, Seattle Screwdriver ($3.49) 2
1001 1/1/2022 Samir Nadoy, 123 Elm Pl, New York Hammer ($2.99) 2
… … … … …
Structured Query Language (SQL)
 SQL is a standard language for use with relational databases
 Standards are maintained by ANSI and ISO
 Most RDBMS systems support proprietary extensions of standard SQL
Data Definition Language (DDL) Data Control Language (DCL) Data Manipulation Language (DML)
CREATE, ALTER, DROP, RENAME GRANT, DENY, REVOKE INSERT, UPDATE, DELETE, SELECT
CREATE TABLE Product
(
ProductID INT PRIMARY KEY,
Name VARCHAR(20) NOT NULL,
Price DECIMAL NULL
);
GRANT SELECT, INSERT, UPDATE
ON Product
TO user1;
SELECT Name, Price
FROM Product
WHERE Price > 2.50
ORDER BY Price;
Product
ID Name Price
Product
ID Name Price
123 Hammer 2.99
162 Screwdriver 3.49
201 Wrench 4.25
Results
Name Price
Hammer 2.99
Screwdriver 3.49
Wrench 4.25
Other common database objects
Views
Pre-defined SQL queries that behave as
virtual tables
Stored Procedures
Pre-defined SQL statements that can
include parameters
Indexes
Tree-based structures that improve query
performance
Customer
… … …
… … …
Deliveries
OrderNo OrderDate Address City
1000 1/1/2022 1 Main St. Seattle
1001 1/1/2022 123 Elm Pl. New York
Order
… … …
… … …
Product
ID Name Price
201 Wrench Spanner 4.25
Product
ID Name Price
123 Hammer 2.99
162 Screwdriver 3.49
201 Wrench 4.25
Lesson 2: Explore Azure Services for
Relational Data
Azure SQL
Family of SQL Server based cloud database services
Azure Database services for open-source
Azure managed solutions for common open-source RDBMSs
Demo Provision Azure relational database services
Explore Fundamentals of Non-relational Data in Azure
Module 3:
• Lesson 1: Fundamentals of Azure Storage
• Lesson 2: Fundamentals of Azure Cosmos DB
Lesson 1: Fundamentals of Azure Storage
Azure Blob Storage
Storage for data as binary large objects (BLOBs)
• Block blobs
o Large, discrete, binary objects that change infrequently
o Blobs can be up to 4.7 TB, composed of blocks of up to 100 MB
- A blob can contain up to 50,000 blocks
• Page blobs
o Used as virtual disk storage for VMs
o Blobs can be up to 8 TB, composed of fixed sized-512 byte pages
• Append blobs
o Block blobs that are used to optimize append operations
o Maximum size just over 195 GB - each block can be up to 4 MB
Per-blob storage tiers
 Hot – Highest cost, lowest latency
 Cool – Lower cost, higher latency
 Archive – Lowest cost, highest latency
Azure Data Lake Store Gen 2
Distributed file system built on Blob Storage
• Combines Azure Data Lake Store Gen 1 with Azure Blob
Storage for large-scale file storage and analytics
• Enables file and directory level access control and
management
• Compatible with common large scale analytical systems
Enabled in an Azure Storage account through the
Hierarchical Namespace option
• Set during account creation
• Upgrade existing storage account
o One-way upgrade process
Azure Files
Files shares in the cloud that can be
accessed from anywhere with an internet
connection
• Support for common file sharing protocols:
o Server Message Block (SMB)
o Network File System (NFS) – requires premium tier
• Data is replicated for redundancy and encrypted at
rest
Azure Table Storage
Key-Value storage for application data
• Tables consist of key and value columns
o Partition and row keys
o Custom property columns for data values
- A Timestamp column is added automatically to log
data changes
• Rows are grouped into partitions to improve
performance
• Property columns are assigned a data type, and
can contain any value of that type
• Rows do not need to include the same
property columns
PartitionKey RowKey Timestamp Property1 Property2
1 123 2022/1/1 A value Another value
1 124 2022/1/1 This value
2 125 2022/1/1 That value
Demo Explore Azure Storage
Lesson 2: Fundamentals of Azure
Cosmos DB
What is Azure Cosmos DB?
A multi-model, global-scale NoSQL database
management system
{
"x":[…]
}
Key Value Col1 Col2 Col3
Azure Cosmos DB APIs
Azure Cosmos DB for MongoDB
PartitionKey RowKey Name
1 123 Joe Jones
1 124 Samir Nadoy
id name dept manager
1 Sue Smith Hardware
2 Ben Chan Hardware Sue Smith
Azure Cosmos DB for PostgreSQL
id name dept manager
1 Sue Smith Hardware Joe Jones
2 Ben Chan Hardware Sue Smith
Demo Explore Azure Cosmos DB
Explore Fundamentals of Large-scale data
warehousing
Module 4:
• Lesson 1: Large-scale data warehousing
Lesson 1: Large-scale data warehousing
What is large-scale data warehousing?
Data ingestion and processing Analytical data store Analytical data model Data visualization
▲
▼
▲
Data ingestion and processing pipelines
Activities
Analytical data stores
Choose an analytical data store service
Azure Synapse Analytics Azure Databricks Azure HDInsight
Use to leverage Databricks skills and
for cloud portability
Use when you need to support
multiple open-source platforms
Demo Explore Azure Synapse Analytics
Explore Fundamentals of real-time analytics
Module 5:
• Lesson 1: Streaming and real-time analytics
Lesson 1: Streaming and Real-time
Analytics
Batch vs stream processing
Batch processing Stream processing
Real-time data processing with Azure Stream Analytics
• Ingest data from an input, such as:
o Azure Event Hubs
o Azure IoT Hub
o Azure Blob Storage
o …
• Process data with a perpetual query
• Send results to an output, such as:
o Azure Blob Storage
o Azure SQL Database
o Azure Synapse Analytics
o Azure Function
o Azure Event Hubs
o Power BI
o …
SELECT …
Real-time log and telemetry analysis with Azure Data Explorer
• Data is ingested from streaming and
batch sources into tables in a database
• Tables can be queried using Kusto Query
Language (KQL):
o Intuitive syntax for read-only queries
o Optimized for raw telemetry and time-
series data
Demo Explore Azure Stream Analytics
Explore Fundamentals of Data Visualization
Module 6:
• Lesson 1: Data visualization
Lesson 1: Data Visualization
Introduction to data visualization with Power BI
Analytical data modeling
Customer (dimension)
Key Name Address City
1 Joe 1 Main St. Seattle
2 Samir 123 Elm Pl. New York
3 Alice 2 High St. Seattle
Product (dimension)
Key Name Category
1 Hammer Tools
2 Screwdriver Tools
3 Wrench Tools
4 Bolts Hardware
Time (dimension)
Key Year Month Day WeekDay
01012022 2022 Jan 1 Sat
02012022 2022 Jan 2 Sun
Sales (fact)
Key TimeKey ProductKey CustomerKey Quantity Revenue
1 01012022 1 1 1 2.99
2 01012022 2 1 2 6.98
3 02012022 1 2 2 5.98
Total revenue for wrenches
sold to Samir in January
∑
Model aggregates measures
at each hierarchy level
Year Month Day Revenue
2022 8221.48
Jan 574.86
1 9.97
2 5.98
… …
Common data visualizations in reports
Tables and text Bar or column chart Line chart
Pie chart Scatter plot Map
Demo Visualize data with Power BI
Further learning
https://aka.ms/ExploreDataConcepts
https://aka.ms/ExploreRelationalData
https://aka.ms/ExploreNonRelationalData
https://aka.ms/ExploreDataAnalytics
Thank you

More Related Content

What's hot

Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...
Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...
Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...
Neo4j
 
Dbms and rdbms
Dbms and rdbmsDbms and rdbms
Dbms and rdbms
Umair Shakir
 
Data Modeling Basics
Data Modeling BasicsData Modeling Basics
Data Modeling Basics
renuindia
 
Data model
Data modelData model
Data model
Syed Zaid Irshad
 
Neo4j Presentation
Neo4j PresentationNeo4j Presentation
Neo4j Presentation
Max De Marzi
 
Intro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesIntro to Neo4j and Graph Databases
Intro to Neo4j and Graph Databases
Neo4j
 
Introduction to Graph Databases
Introduction to Graph DatabasesIntroduction to Graph Databases
Introduction to Graph Databases
Max De Marzi
 
AI Science
AI Science AI Science
AI Science
Melanie Swan
 
Data mining with big data implementation
Data mining with big data implementationData mining with big data implementation
Data mining with big data implementation
Sandip Tipayle Patil
 
Deductive databases
Deductive databasesDeductive databases
Deductive databases
John Popoola
 
Using Databricks as an Analysis Platform
Using Databricks as an Analysis PlatformUsing Databricks as an Analysis Platform
Using Databricks as an Analysis Platform
Databricks
 
Smarter Fraud Detection With Graph Data Science
Smarter Fraud Detection With Graph Data ScienceSmarter Fraud Detection With Graph Data Science
Smarter Fraud Detection With Graph Data Science
Neo4j
 
Big Data and Hadoop
Big Data and HadoopBig Data and Hadoop
Big Data and Hadoop
Flavio Vit
 
Knowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based SearchKnowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based Search
Neo4j
 
Graph database
Graph database Graph database
Graph database
Shruti Arya
 
security and privacy in dbms and in sql database
security and privacy in dbms and in sql databasesecurity and privacy in dbms and in sql database
security and privacy in dbms and in sql database
gourav kottawar
 
Big Data technology Landscape
Big Data technology LandscapeBig Data technology Landscape
Big Data technology Landscape
ShivanandaVSeeri
 
Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...
Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...
Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...
Neo4j
 
Introduction to MongoDB
Introduction to MongoDBIntroduction to MongoDB
Introduction to MongoDB
NodeXperts
 
Introduction to the world of Cloud Computing & Microsoft Azure.pptx
Introduction to the world of Cloud Computing & Microsoft Azure.pptxIntroduction to the world of Cloud Computing & Microsoft Azure.pptx
Introduction to the world of Cloud Computing & Microsoft Azure.pptx
PrazolBista
 

What's hot (20)

Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...
Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...
Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...
 
Dbms and rdbms
Dbms and rdbmsDbms and rdbms
Dbms and rdbms
 
Data Modeling Basics
Data Modeling BasicsData Modeling Basics
Data Modeling Basics
 
Data model
Data modelData model
Data model
 
Neo4j Presentation
Neo4j PresentationNeo4j Presentation
Neo4j Presentation
 
Intro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesIntro to Neo4j and Graph Databases
Intro to Neo4j and Graph Databases
 
Introduction to Graph Databases
Introduction to Graph DatabasesIntroduction to Graph Databases
Introduction to Graph Databases
 
AI Science
AI Science AI Science
AI Science
 
Data mining with big data implementation
Data mining with big data implementationData mining with big data implementation
Data mining with big data implementation
 
Deductive databases
Deductive databasesDeductive databases
Deductive databases
 
Using Databricks as an Analysis Platform
Using Databricks as an Analysis PlatformUsing Databricks as an Analysis Platform
Using Databricks as an Analysis Platform
 
Smarter Fraud Detection With Graph Data Science
Smarter Fraud Detection With Graph Data ScienceSmarter Fraud Detection With Graph Data Science
Smarter Fraud Detection With Graph Data Science
 
Big Data and Hadoop
Big Data and HadoopBig Data and Hadoop
Big Data and Hadoop
 
Knowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based SearchKnowledge Graphs - The Power of Graph-Based Search
Knowledge Graphs - The Power of Graph-Based Search
 
Graph database
Graph database Graph database
Graph database
 
security and privacy in dbms and in sql database
security and privacy in dbms and in sql databasesecurity and privacy in dbms and in sql database
security and privacy in dbms and in sql database
 
Big Data technology Landscape
Big Data technology LandscapeBig Data technology Landscape
Big Data technology Landscape
 
Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...
Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...
Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...
 
Introduction to MongoDB
Introduction to MongoDBIntroduction to MongoDB
Introduction to MongoDB
 
Introduction to the world of Cloud Computing & Microsoft Azure.pptx
Introduction to the world of Cloud Computing & Microsoft Azure.pptxIntroduction to the world of Cloud Computing & Microsoft Azure.pptx
Introduction to the world of Cloud Computing & Microsoft Azure.pptx
 

Similar to Azure Fundamentals.pdf

Cloud architectural patterns and Microsoft Azure tools
Cloud architectural patterns and Microsoft Azure toolsCloud architectural patterns and Microsoft Azure tools
Cloud architectural patterns and Microsoft Azure tools
Pushkar Chivate
 
PASS_Summit_2019_Azure_Storage_Options_for_Analytics
PASS_Summit_2019_Azure_Storage_Options_for_AnalyticsPASS_Summit_2019_Azure_Storage_Options_for_Analytics
PASS_Summit_2019_Azure_Storage_Options_for_Analytics
Dustin Vannoy
 
Azure - Data Platform
Azure - Data PlatformAzure - Data Platform
Azure - Data Platform
giventocode
 
Day2
Day2Day2
SQL or NoSQL, is this the question? - George Grammatikos
SQL or NoSQL, is this the question? - George GrammatikosSQL or NoSQL, is this the question? - George Grammatikos
SQL or NoSQL, is this the question? - George Grammatikos
George Grammatikos
 
Modern Analytics Academy - Data Modeling (1).pptx
Modern Analytics Academy - Data Modeling (1).pptxModern Analytics Academy - Data Modeling (1).pptx
Modern Analytics Academy - Data Modeling (1).pptx
ssuser290967
 
Understanding The Azure Platform Jan
Understanding The Azure Platform   JanUnderstanding The Azure Platform   Jan
Understanding The Azure Platform Jan
DavidGristwood
 
Ms sql server architecture
Ms sql server architectureMs sql server architecture
Ms sql server architecture
Ajeet Singh
 
NoSQL for SQL Users
NoSQL for SQL UsersNoSQL for SQL Users
NoSQL for SQL Users
IBM Cloud Data Services
 
Microsoft Azure Big Data Analytics
Microsoft Azure Big Data AnalyticsMicrosoft Azure Big Data Analytics
Microsoft Azure Big Data Analytics
Mark Kromer
 
Scalable relational database with SQL Azure
Scalable relational database with SQL AzureScalable relational database with SQL Azure
Scalable relational database with SQL Azure
Shy Engelberg
 
DBMS.pptx
DBMS.pptxDBMS.pptx
DBMS.pptx
SwapnamaySamui
 
Steps towards business intelligence
Steps towards business intelligenceSteps towards business intelligence
Steps towards business intelligence
Ahsan Kabir
 
Cloud Architecture Patterns for Mere Mortals - Bill Wilder - Vermont Code Cam...
Cloud Architecture Patterns for Mere Mortals - Bill Wilder - Vermont Code Cam...Cloud Architecture Patterns for Mere Mortals - Bill Wilder - Vermont Code Cam...
Cloud Architecture Patterns for Mere Mortals - Bill Wilder - Vermont Code Cam...
Bill Wilder
 
Building Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon RedshiftBuilding Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon Redshift
Amazon Web Services
 
Introducing Azure SQL Data Warehouse
Introducing Azure SQL Data WarehouseIntroducing Azure SQL Data Warehouse
Introducing Azure SQL Data Warehouse
James Serra
 
Sql server basics
Sql server basicsSql server basics
Sql server basics
Dilfaroz Khan
 
Sterling for Windows Phone 7
Sterling for Windows Phone 7Sterling for Windows Phone 7
Sterling for Windows Phone 7
Jeremy Likness
 
20210427 azure lille_meetup_azure_data_stack
20210427 azure lille_meetup_azure_data_stack20210427 azure lille_meetup_azure_data_stack
20210427 azure lille_meetup_azure_data_stack
Alexandre BERGERE
 
Active directory interview_questions
Active directory interview_questionsActive directory interview_questions
Active directory interview_questions
subhashmr
 

Similar to Azure Fundamentals.pdf (20)

Cloud architectural patterns and Microsoft Azure tools
Cloud architectural patterns and Microsoft Azure toolsCloud architectural patterns and Microsoft Azure tools
Cloud architectural patterns and Microsoft Azure tools
 
PASS_Summit_2019_Azure_Storage_Options_for_Analytics
PASS_Summit_2019_Azure_Storage_Options_for_AnalyticsPASS_Summit_2019_Azure_Storage_Options_for_Analytics
PASS_Summit_2019_Azure_Storage_Options_for_Analytics
 
Azure - Data Platform
Azure - Data PlatformAzure - Data Platform
Azure - Data Platform
 
Day2
Day2Day2
Day2
 
SQL or NoSQL, is this the question? - George Grammatikos
SQL or NoSQL, is this the question? - George GrammatikosSQL or NoSQL, is this the question? - George Grammatikos
SQL or NoSQL, is this the question? - George Grammatikos
 
Modern Analytics Academy - Data Modeling (1).pptx
Modern Analytics Academy - Data Modeling (1).pptxModern Analytics Academy - Data Modeling (1).pptx
Modern Analytics Academy - Data Modeling (1).pptx
 
Understanding The Azure Platform Jan
Understanding The Azure Platform   JanUnderstanding The Azure Platform   Jan
Understanding The Azure Platform Jan
 
Ms sql server architecture
Ms sql server architectureMs sql server architecture
Ms sql server architecture
 
NoSQL for SQL Users
NoSQL for SQL UsersNoSQL for SQL Users
NoSQL for SQL Users
 
Microsoft Azure Big Data Analytics
Microsoft Azure Big Data AnalyticsMicrosoft Azure Big Data Analytics
Microsoft Azure Big Data Analytics
 
Scalable relational database with SQL Azure
Scalable relational database with SQL AzureScalable relational database with SQL Azure
Scalable relational database with SQL Azure
 
DBMS.pptx
DBMS.pptxDBMS.pptx
DBMS.pptx
 
Steps towards business intelligence
Steps towards business intelligenceSteps towards business intelligence
Steps towards business intelligence
 
Cloud Architecture Patterns for Mere Mortals - Bill Wilder - Vermont Code Cam...
Cloud Architecture Patterns for Mere Mortals - Bill Wilder - Vermont Code Cam...Cloud Architecture Patterns for Mere Mortals - Bill Wilder - Vermont Code Cam...
Cloud Architecture Patterns for Mere Mortals - Bill Wilder - Vermont Code Cam...
 
Building Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon RedshiftBuilding Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon Redshift
 
Introducing Azure SQL Data Warehouse
Introducing Azure SQL Data WarehouseIntroducing Azure SQL Data Warehouse
Introducing Azure SQL Data Warehouse
 
Sql server basics
Sql server basicsSql server basics
Sql server basics
 
Sterling for Windows Phone 7
Sterling for Windows Phone 7Sterling for Windows Phone 7
Sterling for Windows Phone 7
 
20210427 azure lille_meetup_azure_data_stack
20210427 azure lille_meetup_azure_data_stack20210427 azure lille_meetup_azure_data_stack
20210427 azure lille_meetup_azure_data_stack
 
Active directory interview_questions
Active directory interview_questionsActive directory interview_questions
Active directory interview_questions
 

Recently uploaded

Null Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAMNull Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAM
Divyanshu
 
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by AnantLLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
Anant Corporation
 
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Sinan KOZAK
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
bijceesjournal
 
Data Driven Maintenance | UReason Webinar
Data Driven Maintenance | UReason WebinarData Driven Maintenance | UReason Webinar
Data Driven Maintenance | UReason Webinar
UReason
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
IJECEIAES
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
Madan Karki
 
AI assisted telemedicine KIOSK for Rural India.pptx
AI assisted telemedicine KIOSK for Rural India.pptxAI assisted telemedicine KIOSK for Rural India.pptx
AI assisted telemedicine KIOSK for Rural India.pptx
architagupta876
 
cnn.pptx Convolutional neural network used for image classication
cnn.pptx Convolutional neural network used for image classicationcnn.pptx Convolutional neural network used for image classication
cnn.pptx Convolutional neural network used for image classication
SakkaravarthiShanmug
 
Data Control Language.pptx Data Control Language.pptx
Data Control Language.pptx Data Control Language.pptxData Control Language.pptx Data Control Language.pptx
Data Control Language.pptx Data Control Language.pptx
ramrag33
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
KrishnaveniKrishnara1
 
Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...
bijceesjournal
 
ITSM Integration with MuleSoft.pptx
ITSM  Integration with MuleSoft.pptxITSM  Integration with MuleSoft.pptx
ITSM Integration with MuleSoft.pptx
VANDANAMOHANGOUDA
 
Seminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptxSeminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptx
Madan Karki
 
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
IJECEIAES
 
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
shadow0702a
 
Design and optimization of ion propulsion drone
Design and optimization of ion propulsion droneDesign and optimization of ion propulsion drone
Design and optimization of ion propulsion drone
bjmsejournal
 
Software Quality Assurance-se412-v11.ppt
Software Quality Assurance-se412-v11.pptSoftware Quality Assurance-se412-v11.ppt
Software Quality Assurance-se412-v11.ppt
TaghreedAltamimi
 
Material for memory and display system h
Material for memory and display system hMaterial for memory and display system h
Material for memory and display system h
gowrishankartb2005
 
CEC 352 - SATELLITE COMMUNICATION UNIT 1
CEC 352 - SATELLITE COMMUNICATION UNIT 1CEC 352 - SATELLITE COMMUNICATION UNIT 1
CEC 352 - SATELLITE COMMUNICATION UNIT 1
PKavitha10
 

Recently uploaded (20)

Null Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAMNull Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAM
 
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by AnantLLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
 
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
 
Data Driven Maintenance | UReason Webinar
Data Driven Maintenance | UReason WebinarData Driven Maintenance | UReason Webinar
Data Driven Maintenance | UReason Webinar
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
 
AI assisted telemedicine KIOSK for Rural India.pptx
AI assisted telemedicine KIOSK for Rural India.pptxAI assisted telemedicine KIOSK for Rural India.pptx
AI assisted telemedicine KIOSK for Rural India.pptx
 
cnn.pptx Convolutional neural network used for image classication
cnn.pptx Convolutional neural network used for image classicationcnn.pptx Convolutional neural network used for image classication
cnn.pptx Convolutional neural network used for image classication
 
Data Control Language.pptx Data Control Language.pptx
Data Control Language.pptx Data Control Language.pptxData Control Language.pptx Data Control Language.pptx
Data Control Language.pptx Data Control Language.pptx
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
 
Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...
 
ITSM Integration with MuleSoft.pptx
ITSM  Integration with MuleSoft.pptxITSM  Integration with MuleSoft.pptx
ITSM Integration with MuleSoft.pptx
 
Seminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptxSeminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptx
 
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
 
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
 
Design and optimization of ion propulsion drone
Design and optimization of ion propulsion droneDesign and optimization of ion propulsion drone
Design and optimization of ion propulsion drone
 
Software Quality Assurance-se412-v11.ppt
Software Quality Assurance-se412-v11.pptSoftware Quality Assurance-se412-v11.ppt
Software Quality Assurance-se412-v11.ppt
 
Material for memory and display system h
Material for memory and display system hMaterial for memory and display system h
Material for memory and display system h
 
CEC 352 - SATELLITE COMMUNICATION UNIT 1
CEC 352 - SATELLITE COMMUNICATION UNIT 1CEC 352 - SATELLITE COMMUNICATION UNIT 1
CEC 352 - SATELLITE COMMUNICATION UNIT 1
 

Azure Fundamentals.pdf

  • 1. © Copyright Microsoft Corporation. All rights reserved. FOR USE ONLY AS PART OF MICROSOFT VIRTUAL TRAINING DAYS PROGRAM. THESE MATERIALS ARE NOT AUTHORIZED FOR DISTRIBUTION, REPRODUCTION OR OTHER USE BY NON-MICROSOFT PARTIES.
  • 2. We will be starting shortly Microsoft Azure Virtual Training Day: Data Fundamentals
  • 3. About this course Course objectives:  Describe core data concepts  Identify services for relational data  Identify services for non-relational data  Identify services for data analytics This course is supplemented by online training at https://aka.ms/AzureLearn_DataFundamentals
  • 4. Course Agenda Module 1: Explore fundamentals of data  Core data concepts  Data roles and services Module 2: Explore fundamentals of relational data in Azure  Explore relational data concepts  Explore Azure services for relational data Module 3: Explore fundamentals of non-relational data in Azure  Fundamentals of Azure Storage  Fundamentals of Azure Cosmos DB Module 4: Explore fundamentals of large-scale data warehousing  Large-scale data warehousing Module 5: Explore fundamentals of real-time analytics  Streaming and real-time analytics Module 6: Explore fundamentals of data visualization  Data visualization
  • 5. Demos • Demos in this course are based on exercises in Microsoft Learn
  • 6. Explore Fundamentals of Data Module 1: • Lesson 1: Core data concepts • Lesson 2: Data roles and services
  • 7. Lesson 1: Core Data Concepts
  • 8. What is data? Values used to record information – often representing entities that have one or more attributes Structured Customer ID FirstName LastName Email Address 1 Joe Jones joe@litware.c om 1 Main St. 2 Samir Nadoy samir@north wind.com 123 Elm Pl. Product ID Name Price 123 Hammer 2.99 162 Screwdriver 3.49 201 Wrench 4.25 Semi-structured Unstructured { "firstName": "Joe", "lastName": "Jones", "address": { "streetAddress": "1 Main St.", "city": "New York", "state": "NY", "postalCode": "10099" }, "contact": [ { "type": "home", "number": "555 123-1234" }, { "type": "email", "address": "joe@litware.com" } ] } { "firstName": "Samir", "lastName": "Nadoy", "address": { "streetAddress": "123 Elm Pl.", "unit": "500", "city": "Seattle", "state": "WA", "postalCode": "98999" }, "contact": [ { "type": "email", "address": "samir@northwind.com" } ] }
  • 9. How is data stored? Files { "customers": [ { "firstName": "Joe", "lastName": "Jones"}, { "firstName": "Samir", "lastName": "Nadoy"} ] } Databases Graph
  • 10. Transactional data workloads Order … … … … … … * * *
  • 12. Lesson 2: Data Roles and Services
  • 13. Data professional roles Database Administrator Database provisioning, configuration and management Database security and user access Database backups and resiliency Database performance monitoring and optimization Data Engineer Data integration pipelines and ETL processes Data cleansing and transformation Analytical data store schemas and data loads Data Analyst Analytical modeling Data reporting and summarization Data visualization
  • 14. Microsoft cloud services for data Data stores Data engineering and analytics
  • 15. Explore Fundamentals of Relational Data in Azure Module 2: • Lesson 1: Explore relational data concepts • Lesson 2: Explore Azure services for relational data
  • 16. Lesson 1: Explore Relational Data Concepts
  • 17. Relational tables  Data is stored in tables  Tables consists of rows and columns  All rows have the same columns  Each column is assigned a datatype Customer ID FirstName MiddleName LastName Email Address City 1 Joe David Jones joe@litware.co m 1 Main St. Seattle 2 Samir Nadoy samir@northwi nd.com 123 Elm Pl. New York Product ID Name Price 123 Hammer 2.99 162 Screwdriver 3.49 201 Wrench 4.25 Order OrderNo OrderDate Customer 1000 1/1/2022 1 1001 1/1/2022 2 LineItem OrderNo ItemNo ProductID Quantity 1000 1 123 1 1000 2 201 2 1001 1 123 2
  • 18. Normalization  Separate each entity into its own table  Separate each discrete attribute into its own column  Uniquely identify each entity instance (row) using a primary key  Use foreign key columns to link related entities Customer ID FirstName LastName Address City 1 Joe Jones 1 Main St. Seattle 2 Samir Nadoy 123 Elm Pl. New York Product ID Name Price 123 Hammer 2.99 162 Screwdriver 3.49 201 Wrench 4.25 Order OrderNo OrderDate Customer 1000 1/1/2022 1 1001 1/1/2022 2 LineItem OrderNo ItemNo ProductID Quantity 1000 1 123 1 1000 2 201 2 1001 1 123 2 Sales Data OrderNo OrderDate Customer Product Quantity 1000 1/1/2022 Joe Jones, 1 Main St, Seattle Hammer ($2.99) 1 1000 1/1/2022 Joe Jones- 1 Main St, Seattle Screwdriver ($3.49) 2 1001 1/1/2022 Samir Nadoy, 123 Elm Pl, New York Hammer ($2.99) 2 … … … … …
  • 19. Structured Query Language (SQL)  SQL is a standard language for use with relational databases  Standards are maintained by ANSI and ISO  Most RDBMS systems support proprietary extensions of standard SQL Data Definition Language (DDL) Data Control Language (DCL) Data Manipulation Language (DML) CREATE, ALTER, DROP, RENAME GRANT, DENY, REVOKE INSERT, UPDATE, DELETE, SELECT CREATE TABLE Product ( ProductID INT PRIMARY KEY, Name VARCHAR(20) NOT NULL, Price DECIMAL NULL ); GRANT SELECT, INSERT, UPDATE ON Product TO user1; SELECT Name, Price FROM Product WHERE Price > 2.50 ORDER BY Price; Product ID Name Price Product ID Name Price 123 Hammer 2.99 162 Screwdriver 3.49 201 Wrench 4.25 Results Name Price Hammer 2.99 Screwdriver 3.49 Wrench 4.25
  • 20. Other common database objects Views Pre-defined SQL queries that behave as virtual tables Stored Procedures Pre-defined SQL statements that can include parameters Indexes Tree-based structures that improve query performance Customer … … … … … … Deliveries OrderNo OrderDate Address City 1000 1/1/2022 1 Main St. Seattle 1001 1/1/2022 123 Elm Pl. New York Order … … … … … … Product ID Name Price 201 Wrench Spanner 4.25 Product ID Name Price 123 Hammer 2.99 162 Screwdriver 3.49 201 Wrench 4.25
  • 21. Lesson 2: Explore Azure Services for Relational Data
  • 22. Azure SQL Family of SQL Server based cloud database services
  • 23. Azure Database services for open-source Azure managed solutions for common open-source RDBMSs
  • 24. Demo Provision Azure relational database services
  • 25. Explore Fundamentals of Non-relational Data in Azure Module 3: • Lesson 1: Fundamentals of Azure Storage • Lesson 2: Fundamentals of Azure Cosmos DB
  • 26. Lesson 1: Fundamentals of Azure Storage
  • 27. Azure Blob Storage Storage for data as binary large objects (BLOBs) • Block blobs o Large, discrete, binary objects that change infrequently o Blobs can be up to 4.7 TB, composed of blocks of up to 100 MB - A blob can contain up to 50,000 blocks • Page blobs o Used as virtual disk storage for VMs o Blobs can be up to 8 TB, composed of fixed sized-512 byte pages • Append blobs o Block blobs that are used to optimize append operations o Maximum size just over 195 GB - each block can be up to 4 MB Per-blob storage tiers  Hot – Highest cost, lowest latency  Cool – Lower cost, higher latency  Archive – Lowest cost, highest latency
  • 28. Azure Data Lake Store Gen 2 Distributed file system built on Blob Storage • Combines Azure Data Lake Store Gen 1 with Azure Blob Storage for large-scale file storage and analytics • Enables file and directory level access control and management • Compatible with common large scale analytical systems Enabled in an Azure Storage account through the Hierarchical Namespace option • Set during account creation • Upgrade existing storage account o One-way upgrade process
  • 29. Azure Files Files shares in the cloud that can be accessed from anywhere with an internet connection • Support for common file sharing protocols: o Server Message Block (SMB) o Network File System (NFS) – requires premium tier • Data is replicated for redundancy and encrypted at rest
  • 30. Azure Table Storage Key-Value storage for application data • Tables consist of key and value columns o Partition and row keys o Custom property columns for data values - A Timestamp column is added automatically to log data changes • Rows are grouped into partitions to improve performance • Property columns are assigned a data type, and can contain any value of that type • Rows do not need to include the same property columns PartitionKey RowKey Timestamp Property1 Property2 1 123 2022/1/1 A value Another value 1 124 2022/1/1 This value 2 125 2022/1/1 That value
  • 32. Lesson 2: Fundamentals of Azure Cosmos DB
  • 33. What is Azure Cosmos DB? A multi-model, global-scale NoSQL database management system { "x":[…] } Key Value Col1 Col2 Col3
  • 34. Azure Cosmos DB APIs Azure Cosmos DB for MongoDB PartitionKey RowKey Name 1 123 Joe Jones 1 124 Samir Nadoy id name dept manager 1 Sue Smith Hardware 2 Ben Chan Hardware Sue Smith Azure Cosmos DB for PostgreSQL id name dept manager 1 Sue Smith Hardware Joe Jones 2 Ben Chan Hardware Sue Smith
  • 35. Demo Explore Azure Cosmos DB
  • 36. Explore Fundamentals of Large-scale data warehousing Module 4: • Lesson 1: Large-scale data warehousing
  • 37. Lesson 1: Large-scale data warehousing
  • 38. What is large-scale data warehousing? Data ingestion and processing Analytical data store Analytical data model Data visualization ▲ ▼ ▲
  • 39. Data ingestion and processing pipelines Activities
  • 41. Choose an analytical data store service Azure Synapse Analytics Azure Databricks Azure HDInsight Use to leverage Databricks skills and for cloud portability Use when you need to support multiple open-source platforms
  • 42. Demo Explore Azure Synapse Analytics
  • 43. Explore Fundamentals of real-time analytics Module 5: • Lesson 1: Streaming and real-time analytics
  • 44. Lesson 1: Streaming and Real-time Analytics
  • 45. Batch vs stream processing Batch processing Stream processing
  • 46. Real-time data processing with Azure Stream Analytics • Ingest data from an input, such as: o Azure Event Hubs o Azure IoT Hub o Azure Blob Storage o … • Process data with a perpetual query • Send results to an output, such as: o Azure Blob Storage o Azure SQL Database o Azure Synapse Analytics o Azure Function o Azure Event Hubs o Power BI o … SELECT …
  • 47. Real-time log and telemetry analysis with Azure Data Explorer • Data is ingested from streaming and batch sources into tables in a database • Tables can be queried using Kusto Query Language (KQL): o Intuitive syntax for read-only queries o Optimized for raw telemetry and time- series data
  • 48. Demo Explore Azure Stream Analytics
  • 49. Explore Fundamentals of Data Visualization Module 6: • Lesson 1: Data visualization
  • 50. Lesson 1: Data Visualization
  • 51. Introduction to data visualization with Power BI
  • 52. Analytical data modeling Customer (dimension) Key Name Address City 1 Joe 1 Main St. Seattle 2 Samir 123 Elm Pl. New York 3 Alice 2 High St. Seattle Product (dimension) Key Name Category 1 Hammer Tools 2 Screwdriver Tools 3 Wrench Tools 4 Bolts Hardware Time (dimension) Key Year Month Day WeekDay 01012022 2022 Jan 1 Sat 02012022 2022 Jan 2 Sun Sales (fact) Key TimeKey ProductKey CustomerKey Quantity Revenue 1 01012022 1 1 1 2.99 2 01012022 2 1 2 6.98 3 02012022 1 2 2 5.98 Total revenue for wrenches sold to Samir in January ∑ Model aggregates measures at each hierarchy level Year Month Day Revenue 2022 8221.48 Jan 574.86 1 9.97 2 5.98 … …
  • 53. Common data visualizations in reports Tables and text Bar or column chart Line chart Pie chart Scatter plot Map
  • 54. Demo Visualize data with Power BI