SlideShare a Scribd company logo
1/23/2014 by Shehap El Nagar
Sponsors & Media Partners
Shehap EL-Nagar
I am MVP,MCTS , MCITP SQL Server, I am DB consultant and Architect for lots of Banking, Telecom ,Ministries and
governmental organizations all over Gulf ,also he has deep knowledge about T-SQL performance , HW Performance issues,
Data Warehousing solutions , SQL Server Replication, Clustering solutions and Database Designs for different kinds of
systems ...
The founder of the biggest SQL Server community all over the middle east http://sqlserver-performance-tuning.net/ , you
can watch its success memories at http://www.youtube.com/user/ShehapElNagar
Moderator and author at http://www.sql-server-performance.com ,
, the 1st SQL Server Author at MSDN Arabia http://msdn.microsoft.com/ar-sa/library/jj149119.aspx
, Speaker at SQL Saturday Events worldwide , local events at Saudi Arabia , many online events , more than 90 video
tutorials and also many private sessions for .net developers and Database Administrators
And also influent participator at Microsoft Forums of SQL Server at http://social.technet.microsoft.com.
More about him , you can find him on MVP Microsoft site http://mvp.microsoft.com/en-us/mvp/Shehap%20El-Nagar5000188 .
You can contact him at the below contacts :Mail :idgdirector@yahoo.com ….Cellular phone :00966560700733
Agenda and Overview:
First :Definitions and benefits of DWH
•Definition of Data Warehousing Solutions
•Benefits of DWH Solutions
•Why RTDWH (Real Time Data Warehousing

) is high necessary..?

•Data Warehouse

vs. Data Mart
•Relational DB vs. Dimensional DB
•Dimensional Database vs. Multidimensional Database
•Star Schema vs. Snow flake schema
•Techniques of DWH solutions

Second : RTDWH for online Reporting
•Technique and concepts

•Demo

Third :DWH for online Archiving
•Technique and concepts
•Demo

Fourth :DWH for online ETL
•Technique and concepts
•Demo
First :Definitions and Benefits of DWH
Definition of Data Warehousing
Relational
Database 1

Optimized Loader
Relational
Database 2

Data
Cleansing
Data Warehouse
Engine

Relational
Database 3

Relational
Database 4

De-normalize
Data

Metadata Repository
6
Benefits of Data warehousing:
Data Consolidation & organization
Data standardization for different attributes such as Collation
Support numerous RDBM sources flexibly like SQL Server , Oracle ,

TeraData , Informix , SAP BI , Sybase, Access , CSV files , Excel…etc
Scale up reports either SSRS or SSAS reports (OLAP Reports)
Speed up reports performance
Why Real Time Data
Warehousing..?
 Active

decision support

 Business

activity monitoring (BAM)

 Alerting

 Efficiently

execute business strategy
Relational DB vs. Dimensional DB:
Relational DB represents a normalized DB for OLTP transactions
purposes.
More normalization >>>Less no of columns >>> less possibility of
indexes >>> Less IO cost of cluster indexes while using them for
insert /update /delete of OLTP transactions
Dimensional DB represents a de-normalized DB for OLAP purposes
More number of interrelated columns in one table >>> Less
possibility for joins >>> More covering compound indexes
Data Warehouse vs. Data Mart:
•Data warehouse is a global repository for wide scale of

business
•Data mart is a smaller repository for specific business

scope
Therefore, we could say a Data Mart solution is sub set of
a bigger Data warehousing solution
Data Warehouse vs. Data Mart
Data Sources

Data Marts

Data Warehouse

11
Dimensional Database vs. Multidimensional Database:
Dimensional DBs could be used as staging DBs for SSAS reports or they could
be used directly for SSRS reports
Multidimensional DB represent SSAS DBs composed of cubes which are
formed basically of :
•Facts

tables which Contain business process core where aggregative columns
called measures could be found there.
•Dimension

tables which Contain Lookup details relevant to these aggregative

data
(DWH DB)
Dimensional DB

DB Service

(OLAP DB)
Multi-Dimensional DB

OLAP Service

Decision Support Client

Presentation Layer
Star Schema vs. Snow flake schema :
Snow flake schema close much the design of star schema
design but the first one is trying to break down schema
design more into smaller tables to avoid more redundancy of
columns.
Snow flake schema isn’t recommended for neither OLAP
transaction nor OLTP transaction
Star Schema

Telephone

date, custno, prodno, cityname,Region ...

Name

Fact Table
Gender

Marital status
Snow flake Schema
Telephone

date, custno, prodno, cityname,Region ...

Name

Fact Table
Gender

Marital status

Gender
lookup table

Marital status
lookup table
Data warehousing techniques
•
•Old

2005 codes (Select /insert/Update /Delete)

•New

2008 codes “Merge” which could replace more
efficiently all of above commands in one statement
•DTS

(Data transformation Service) and SSIS Packages

•Enterprise

platform solutions for LDWH(Large DWH)
Enterprise Platform Solutions
Fast Data tracking solution
Sybase IQ
Red Brick Warehouse
IBM
DB2 MVS
Universal Server
IBM Data Warehousing
Teradata
Informix
Online Dynamic Server
XPS --Extended Parallel Server
Universal Server for object relational applications
Second :RTDWH for online Reporting
Technique of DWH Solutions used for Online Reporting
•Creating

2 tables (One Temp table and the second is DWH table itself)

•
•Making
•Then

all DML transactions on a Temp table.

compare Temp table results with DWH table.

•If

not match for any record /column, then Bulk Merge command from
Temp table to DWH Table
•You

can use now this DWH Table for your online Reports
Concepts of DWH Solution used for Online Reporting
1- Set xact_abort on : To ensure the highest transactional status for group of
DML transactions to commit all if all succeeded and rollback all if any of them failed
2- Set nocount on :To speed up queries by avoiding counting no of records each
time of run
3- Set deadlock_priority low; To avoid any impact on end users transactions
while this online data warehousing.

4- Try /Catch commands : To capture any possible errors and report them by
mail.
5- Bulk Logged mode :To save efficiently more storage capacity while bulk
merge
6-Using Read committed snapshot isolation level using row versioning is
recommended to avoid heavy locks/deadlcoksd
Demo
Third :Data warehousing
For online Archiving solution
Techniques of DWH Solutions for Archiving
•Bulk

insert the old data from a Source table to an Archived table

•Bulk

delete from source table after success of 1st step

•Bulk

delete should be split into smaller patches with small
no of records like 1000 delay of 5-30 sec between each patch and
another to avoid any tangible locks or deadlocks
Concepts of DWH Solutions used for Archiving
1 - Bulk Logged mode :To save efficiently more storage capacity while bulk
merge as we are going more to show by next workshops
2- Use WAITFOR DELAY '00:00:30'doesn’t mark for risky waits here, but just a
normal wait command like service broker wait.
3- Bulk Insert and bulk delete phases could be conducted in different
transactions in different time intervals without any risks
4- You could validate that also using output commands
Demo
Fourth :Data warehousing
For online ETL solutions
Technique of DWH Solutions used for ETL

Run your ETL process in parallel with end users activities
but to a different table rather than online tables
Once finish, start to scan all mismatches between the 2
tables through the 3 data warehousing statements
Concepts of DWH Solutions of DWH used for ETL
Scanning any new inserted data entity within the source tables to be inserted to
the target tables
Scanning any updated data entity through scanning any records shared between
the 2 tables for PK values but different for any other columns
.
Scanning any deleted data through using except commands
•The

3 phases could be undertaken asynchronously without any risk at all
Q&A
Post your questions at:

http://www.sqlserver-performance-tuning.net/forums/
Thank you ..See you again

1/23/2014
Sponsors & Media Partners

More Related Content

What's hot

NoSQL - 05March2014 Seminar
NoSQL - 05March2014 SeminarNoSQL - 05March2014 Seminar
NoSQL - 05March2014 Seminar
Jainul Musani
 
Azure SQL DWH
Azure SQL DWHAzure SQL DWH
Azure SQL DWH
Shy Engelberg
 
VoltDB : A Technical Overview
VoltDB : A Technical OverviewVoltDB : A Technical Overview
VoltDB : A Technical Overview
Tim Callaghan
 
Cassandra - A Basic Introduction Guide
Cassandra - A Basic Introduction GuideCassandra - A Basic Introduction Guide
Cassandra - A Basic Introduction Guide
Mohammed Fazuluddin
 
Big Data Technologies and Why They Matter To R Users
Big Data Technologies and Why They Matter To R UsersBig Data Technologies and Why They Matter To R Users
Big Data Technologies and Why They Matter To R Users
Adaryl "Bob" Wakefield, MBA
 
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need BothThe Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
Adaryl "Bob" Wakefield, MBA
 
Gcp data engineer
Gcp data engineerGcp data engineer
Gcp data engineer
Narendranath Reddy T
 
Apache Cassandra introduction
Apache Cassandra introductionApache Cassandra introduction
Apache Cassandra introduction
fardinjamshidi
 
CCV: migrating our payment processing system to MariaDB
CCV: migrating our payment processing system to MariaDBCCV: migrating our payment processing system to MariaDB
CCV: migrating our payment processing system to MariaDB
MariaDB plc
 
ClustrixDB: how distributed databases scale out
ClustrixDB: how distributed databases scale outClustrixDB: how distributed databases scale out
ClustrixDB: how distributed databases scale out
MariaDB plc
 
Snowflake essentials
Snowflake essentialsSnowflake essentials
Snowflake essentials
qureshihamid
 
NoSQL Seminer
NoSQL SeminerNoSQL Seminer
NoSQL Seminer
Partha Das
 
What is Change Data Capture (CDC) and Why is it Important?
What is Change Data Capture (CDC) and Why is it Important?What is Change Data Capture (CDC) and Why is it Important?
What is Change Data Capture (CDC) and Why is it Important?
FlyData Inc.
 
Datastage ppt
Datastage pptDatastage ppt
Datastage ppt
Newyorksys.com
 
SQL Server 2016 - Stretch DB
SQL Server 2016 - Stretch DB SQL Server 2016 - Stretch DB
SQL Server 2016 - Stretch DB
Shy Engelberg
 
Delivering rapid-fire Analytics with Snowflake and Tableau
Delivering rapid-fire Analytics with Snowflake and TableauDelivering rapid-fire Analytics with Snowflake and Tableau
Delivering rapid-fire Analytics with Snowflake and Tableau
Harald Erb
 
VF NZ
VF NZVF NZ
In-memory ColumnStore Index
In-memory ColumnStore IndexIn-memory ColumnStore Index
In-memory ColumnStore Index
SolidQ
 
How Alibaba Cloud scaled ApsaraDB with MariaDB MaxScale
How Alibaba Cloud scaled ApsaraDB with MariaDB MaxScaleHow Alibaba Cloud scaled ApsaraDB with MariaDB MaxScale
How Alibaba Cloud scaled ApsaraDB with MariaDB MaxScale
MariaDB plc
 
Introduction to hd insight
Introduction to hd insightIntroduction to hd insight
Introduction to hd insight
MSDEVMTL
 

What's hot (20)

NoSQL - 05March2014 Seminar
NoSQL - 05March2014 SeminarNoSQL - 05March2014 Seminar
NoSQL - 05March2014 Seminar
 
Azure SQL DWH
Azure SQL DWHAzure SQL DWH
Azure SQL DWH
 
VoltDB : A Technical Overview
VoltDB : A Technical OverviewVoltDB : A Technical Overview
VoltDB : A Technical Overview
 
Cassandra - A Basic Introduction Guide
Cassandra - A Basic Introduction GuideCassandra - A Basic Introduction Guide
Cassandra - A Basic Introduction Guide
 
Big Data Technologies and Why They Matter To R Users
Big Data Technologies and Why They Matter To R UsersBig Data Technologies and Why They Matter To R Users
Big Data Technologies and Why They Matter To R Users
 
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need BothThe Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
 
Gcp data engineer
Gcp data engineerGcp data engineer
Gcp data engineer
 
Apache Cassandra introduction
Apache Cassandra introductionApache Cassandra introduction
Apache Cassandra introduction
 
CCV: migrating our payment processing system to MariaDB
CCV: migrating our payment processing system to MariaDBCCV: migrating our payment processing system to MariaDB
CCV: migrating our payment processing system to MariaDB
 
ClustrixDB: how distributed databases scale out
ClustrixDB: how distributed databases scale outClustrixDB: how distributed databases scale out
ClustrixDB: how distributed databases scale out
 
Snowflake essentials
Snowflake essentialsSnowflake essentials
Snowflake essentials
 
NoSQL Seminer
NoSQL SeminerNoSQL Seminer
NoSQL Seminer
 
What is Change Data Capture (CDC) and Why is it Important?
What is Change Data Capture (CDC) and Why is it Important?What is Change Data Capture (CDC) and Why is it Important?
What is Change Data Capture (CDC) and Why is it Important?
 
Datastage ppt
Datastage pptDatastage ppt
Datastage ppt
 
SQL Server 2016 - Stretch DB
SQL Server 2016 - Stretch DB SQL Server 2016 - Stretch DB
SQL Server 2016 - Stretch DB
 
Delivering rapid-fire Analytics with Snowflake and Tableau
Delivering rapid-fire Analytics with Snowflake and TableauDelivering rapid-fire Analytics with Snowflake and Tableau
Delivering rapid-fire Analytics with Snowflake and Tableau
 
VF NZ
VF NZVF NZ
VF NZ
 
In-memory ColumnStore Index
In-memory ColumnStore IndexIn-memory ColumnStore Index
In-memory ColumnStore Index
 
How Alibaba Cloud scaled ApsaraDB with MariaDB MaxScale
How Alibaba Cloud scaled ApsaraDB with MariaDB MaxScaleHow Alibaba Cloud scaled ApsaraDB with MariaDB MaxScale
How Alibaba Cloud scaled ApsaraDB with MariaDB MaxScale
 
Introduction to hd insight
Introduction to hd insightIntroduction to hd insight
Introduction to hd insight
 

Similar to Data warehousing guidelines for bi and BAM solutions

AWS Webcast - Tableau Big Data Solution Showcase
AWS Webcast - Tableau Big Data Solution ShowcaseAWS Webcast - Tableau Big Data Solution Showcase
AWS Webcast - Tableau Big Data Solution Showcase
Amazon Web Services
 
Pass chapter meeting dec 2013 - compression a hidden gem for io heavy databas...
Pass chapter meeting dec 2013 - compression a hidden gem for io heavy databas...Pass chapter meeting dec 2013 - compression a hidden gem for io heavy databas...
Pass chapter meeting dec 2013 - compression a hidden gem for io heavy databas...
Charley Hanania
 
Webinar How to Achieve True Scalability in SaaS Applications
Webinar How to Achieve True Scalability in SaaS ApplicationsWebinar How to Achieve True Scalability in SaaS Applications
Webinar How to Achieve True Scalability in SaaS Applications
Techcello
 
SQL PPT.pptx
SQL PPT.pptxSQL PPT.pptx
SQL PPT.pptx
Kulbir4
 
1 extreme performance - part i
1   extreme performance - part i1   extreme performance - part i
1 extreme performance - part i
sqlserver.co.il
 
KoprowskiT_SQLSaturday409_MaintenancePlansForBeginners
KoprowskiT_SQLSaturday409_MaintenancePlansForBeginnersKoprowskiT_SQLSaturday409_MaintenancePlansForBeginners
KoprowskiT_SQLSaturday409_MaintenancePlansForBeginners
Tobias Koprowski
 
KoprowskiT_SQLSat409_MaintenancePlansForBeginners
KoprowskiT_SQLSat409_MaintenancePlansForBeginnersKoprowskiT_SQLSat409_MaintenancePlansForBeginners
KoprowskiT_SQLSat409_MaintenancePlansForBeginners
Tobias Koprowski
 
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...
Dean Delamont
 
Senior database administrator
Senior database administratorSenior database administrator
Senior database administrator
Mustafa EL-Masry
 
Best Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon RedshiftBest Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon Redshift
SnapLogic
 
Data warehouse 2.0 and sql server architecture and vision
Data warehouse 2.0 and sql server architecture and visionData warehouse 2.0 and sql server architecture and vision
Data warehouse 2.0 and sql server architecture and vision
Klaudiia Jacome
 
5 Quick Wins for the Cloud
5 Quick Wins for the Cloud5 Quick Wins for the Cloud
5 Quick Wins for the Cloud
RightScale
 
The End of a Myth: Ultra-Scalable Transactional Management
The End of a Myth: Ultra-Scalable Transactional ManagementThe End of a Myth: Ultra-Scalable Transactional Management
The End of a Myth: Ultra-Scalable Transactional Management
Ricardo Jimenez-Peris
 
Data ware house architecture
Data ware house architectureData ware house architecture
Data ware house architecture
Deepak Chaurasia
 
AWS Redshift Introduction - Big Data Analytics
AWS Redshift Introduction - Big Data AnalyticsAWS Redshift Introduction - Big Data Analytics
AWS Redshift Introduction - Big Data Analytics
Keeyong Han
 
Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...
Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...
Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...
Continuent
 
3 OLAP.pptx
3 OLAP.pptx3 OLAP.pptx
3 OLAP.pptx
Priyanshu931034
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
James Serra
 
Choosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloudChoosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloud
James Serra
 
SQL Engines for Hadoop - The case for Impala
SQL Engines for Hadoop - The case for ImpalaSQL Engines for Hadoop - The case for Impala
SQL Engines for Hadoop - The case for Impala
markgrover
 

Similar to Data warehousing guidelines for bi and BAM solutions (20)

AWS Webcast - Tableau Big Data Solution Showcase
AWS Webcast - Tableau Big Data Solution ShowcaseAWS Webcast - Tableau Big Data Solution Showcase
AWS Webcast - Tableau Big Data Solution Showcase
 
Pass chapter meeting dec 2013 - compression a hidden gem for io heavy databas...
Pass chapter meeting dec 2013 - compression a hidden gem for io heavy databas...Pass chapter meeting dec 2013 - compression a hidden gem for io heavy databas...
Pass chapter meeting dec 2013 - compression a hidden gem for io heavy databas...
 
Webinar How to Achieve True Scalability in SaaS Applications
Webinar How to Achieve True Scalability in SaaS ApplicationsWebinar How to Achieve True Scalability in SaaS Applications
Webinar How to Achieve True Scalability in SaaS Applications
 
SQL PPT.pptx
SQL PPT.pptxSQL PPT.pptx
SQL PPT.pptx
 
1 extreme performance - part i
1   extreme performance - part i1   extreme performance - part i
1 extreme performance - part i
 
KoprowskiT_SQLSaturday409_MaintenancePlansForBeginners
KoprowskiT_SQLSaturday409_MaintenancePlansForBeginnersKoprowskiT_SQLSaturday409_MaintenancePlansForBeginners
KoprowskiT_SQLSaturday409_MaintenancePlansForBeginners
 
KoprowskiT_SQLSat409_MaintenancePlansForBeginners
KoprowskiT_SQLSat409_MaintenancePlansForBeginnersKoprowskiT_SQLSat409_MaintenancePlansForBeginners
KoprowskiT_SQLSat409_MaintenancePlansForBeginners
 
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...
 
Senior database administrator
Senior database administratorSenior database administrator
Senior database administrator
 
Best Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon RedshiftBest Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon Redshift
 
Data warehouse 2.0 and sql server architecture and vision
Data warehouse 2.0 and sql server architecture and visionData warehouse 2.0 and sql server architecture and vision
Data warehouse 2.0 and sql server architecture and vision
 
5 Quick Wins for the Cloud
5 Quick Wins for the Cloud5 Quick Wins for the Cloud
5 Quick Wins for the Cloud
 
The End of a Myth: Ultra-Scalable Transactional Management
The End of a Myth: Ultra-Scalable Transactional ManagementThe End of a Myth: Ultra-Scalable Transactional Management
The End of a Myth: Ultra-Scalable Transactional Management
 
Data ware house architecture
Data ware house architectureData ware house architecture
Data ware house architecture
 
AWS Redshift Introduction - Big Data Analytics
AWS Redshift Introduction - Big Data AnalyticsAWS Redshift Introduction - Big Data Analytics
AWS Redshift Introduction - Big Data Analytics
 
Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...
Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...
Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...
 
3 OLAP.pptx
3 OLAP.pptx3 OLAP.pptx
3 OLAP.pptx
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
 
Choosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloudChoosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloud
 
SQL Engines for Hadoop - The case for Impala
SQL Engines for Hadoop - The case for ImpalaSQL Engines for Hadoop - The case for Impala
SQL Engines for Hadoop - The case for Impala
 

Recently uploaded

Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Vladimir Iglovikov, Ph.D.
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
TIPNGVN2
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Zilliz
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website
Pixlogix Infotech
 

Recently uploaded (20)

Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website
 

Data warehousing guidelines for bi and BAM solutions

  • 2. Sponsors & Media Partners
  • 3. Shehap EL-Nagar I am MVP,MCTS , MCITP SQL Server, I am DB consultant and Architect for lots of Banking, Telecom ,Ministries and governmental organizations all over Gulf ,also he has deep knowledge about T-SQL performance , HW Performance issues, Data Warehousing solutions , SQL Server Replication, Clustering solutions and Database Designs for different kinds of systems ... The founder of the biggest SQL Server community all over the middle east http://sqlserver-performance-tuning.net/ , you can watch its success memories at http://www.youtube.com/user/ShehapElNagar Moderator and author at http://www.sql-server-performance.com , , the 1st SQL Server Author at MSDN Arabia http://msdn.microsoft.com/ar-sa/library/jj149119.aspx , Speaker at SQL Saturday Events worldwide , local events at Saudi Arabia , many online events , more than 90 video tutorials and also many private sessions for .net developers and Database Administrators And also influent participator at Microsoft Forums of SQL Server at http://social.technet.microsoft.com. More about him , you can find him on MVP Microsoft site http://mvp.microsoft.com/en-us/mvp/Shehap%20El-Nagar5000188 . You can contact him at the below contacts :Mail :idgdirector@yahoo.com ….Cellular phone :00966560700733
  • 4. Agenda and Overview: First :Definitions and benefits of DWH •Definition of Data Warehousing Solutions •Benefits of DWH Solutions •Why RTDWH (Real Time Data Warehousing ) is high necessary..? •Data Warehouse vs. Data Mart •Relational DB vs. Dimensional DB •Dimensional Database vs. Multidimensional Database •Star Schema vs. Snow flake schema •Techniques of DWH solutions Second : RTDWH for online Reporting •Technique and concepts •Demo Third :DWH for online Archiving •Technique and concepts •Demo Fourth :DWH for online ETL •Technique and concepts •Demo
  • 5. First :Definitions and Benefits of DWH
  • 6. Definition of Data Warehousing Relational Database 1 Optimized Loader Relational Database 2 Data Cleansing Data Warehouse Engine Relational Database 3 Relational Database 4 De-normalize Data Metadata Repository 6
  • 7. Benefits of Data warehousing: Data Consolidation & organization Data standardization for different attributes such as Collation Support numerous RDBM sources flexibly like SQL Server , Oracle , TeraData , Informix , SAP BI , Sybase, Access , CSV files , Excel…etc Scale up reports either SSRS or SSAS reports (OLAP Reports) Speed up reports performance
  • 8. Why Real Time Data Warehousing..?  Active decision support  Business activity monitoring (BAM)  Alerting  Efficiently execute business strategy
  • 9. Relational DB vs. Dimensional DB: Relational DB represents a normalized DB for OLTP transactions purposes. More normalization >>>Less no of columns >>> less possibility of indexes >>> Less IO cost of cluster indexes while using them for insert /update /delete of OLTP transactions Dimensional DB represents a de-normalized DB for OLAP purposes More number of interrelated columns in one table >>> Less possibility for joins >>> More covering compound indexes
  • 10. Data Warehouse vs. Data Mart: •Data warehouse is a global repository for wide scale of business •Data mart is a smaller repository for specific business scope Therefore, we could say a Data Mart solution is sub set of a bigger Data warehousing solution
  • 11. Data Warehouse vs. Data Mart Data Sources Data Marts Data Warehouse 11
  • 12. Dimensional Database vs. Multidimensional Database: Dimensional DBs could be used as staging DBs for SSAS reports or they could be used directly for SSRS reports Multidimensional DB represent SSAS DBs composed of cubes which are formed basically of : •Facts tables which Contain business process core where aggregative columns called measures could be found there. •Dimension tables which Contain Lookup details relevant to these aggregative data (DWH DB) Dimensional DB DB Service (OLAP DB) Multi-Dimensional DB OLAP Service Decision Support Client Presentation Layer
  • 13. Star Schema vs. Snow flake schema : Snow flake schema close much the design of star schema design but the first one is trying to break down schema design more into smaller tables to avoid more redundancy of columns. Snow flake schema isn’t recommended for neither OLAP transaction nor OLTP transaction
  • 14. Star Schema Telephone date, custno, prodno, cityname,Region ... Name Fact Table Gender Marital status
  • 15. Snow flake Schema Telephone date, custno, prodno, cityname,Region ... Name Fact Table Gender Marital status Gender lookup table Marital status lookup table
  • 16. Data warehousing techniques • •Old 2005 codes (Select /insert/Update /Delete) •New 2008 codes “Merge” which could replace more efficiently all of above commands in one statement •DTS (Data transformation Service) and SSIS Packages •Enterprise platform solutions for LDWH(Large DWH)
  • 17. Enterprise Platform Solutions Fast Data tracking solution Sybase IQ Red Brick Warehouse IBM DB2 MVS Universal Server IBM Data Warehousing Teradata Informix Online Dynamic Server XPS --Extended Parallel Server Universal Server for object relational applications
  • 18. Second :RTDWH for online Reporting
  • 19. Technique of DWH Solutions used for Online Reporting •Creating 2 tables (One Temp table and the second is DWH table itself) • •Making •Then all DML transactions on a Temp table. compare Temp table results with DWH table. •If not match for any record /column, then Bulk Merge command from Temp table to DWH Table •You can use now this DWH Table for your online Reports
  • 20. Concepts of DWH Solution used for Online Reporting 1- Set xact_abort on : To ensure the highest transactional status for group of DML transactions to commit all if all succeeded and rollback all if any of them failed 2- Set nocount on :To speed up queries by avoiding counting no of records each time of run 3- Set deadlock_priority low; To avoid any impact on end users transactions while this online data warehousing. 4- Try /Catch commands : To capture any possible errors and report them by mail. 5- Bulk Logged mode :To save efficiently more storage capacity while bulk merge 6-Using Read committed snapshot isolation level using row versioning is recommended to avoid heavy locks/deadlcoksd
  • 21. Demo
  • 22. Third :Data warehousing For online Archiving solution
  • 23. Techniques of DWH Solutions for Archiving •Bulk insert the old data from a Source table to an Archived table •Bulk delete from source table after success of 1st step •Bulk delete should be split into smaller patches with small no of records like 1000 delay of 5-30 sec between each patch and another to avoid any tangible locks or deadlocks
  • 24. Concepts of DWH Solutions used for Archiving 1 - Bulk Logged mode :To save efficiently more storage capacity while bulk merge as we are going more to show by next workshops 2- Use WAITFOR DELAY '00:00:30'doesn’t mark for risky waits here, but just a normal wait command like service broker wait. 3- Bulk Insert and bulk delete phases could be conducted in different transactions in different time intervals without any risks 4- You could validate that also using output commands
  • 25. Demo
  • 26. Fourth :Data warehousing For online ETL solutions
  • 27. Technique of DWH Solutions used for ETL Run your ETL process in parallel with end users activities but to a different table rather than online tables Once finish, start to scan all mismatches between the 2 tables through the 3 data warehousing statements
  • 28. Concepts of DWH Solutions of DWH used for ETL Scanning any new inserted data entity within the source tables to be inserted to the target tables Scanning any updated data entity through scanning any records shared between the 2 tables for PK values but different for any other columns . Scanning any deleted data through using except commands •The 3 phases could be undertaken asynchronously without any risk at all
  • 29. Q&A Post your questions at: http://www.sqlserver-performance-tuning.net/forums/
  • 30.
  • 31. Thank you ..See you again 1/23/2014
  • 32. Sponsors & Media Partners