SlideShare a Scribd company logo
WELCOME TO VIBRANT
TECHNOLOGIES & COMPUTER
Online Informatica Training
Contact Us On : www.vibranttechnologies.co.in
Contact Us On : www.vibranttechnologies.co.in
Data Warehousing - Architecture
Enterprise
Data
Warehouse Data Mart
Data Mart
Execution
Systems
• CRM
• ERP
• Legacy
• e-Commerce
Reporting
Tools
OLAP Tools
Ad Hoc
Query Tools
Data Mining
Tools
External
Data
• Purchased
Market Data
• Spreadsheets
•Oracle
•SQL Server
•Teradata
•DB2
Data and Metadata
Repository Layer
ETL Tools:
•Informatica PowerMart
•ETI
•Oracle Warehouse Builder
•Custom programs
•SQL scripts
Extract,
Transformation, and
Load (ETL) Layer
• Cleanse Data
• Filter Records
• Standardize Values
• Decode Values
• Apply Business Rules
• Householding
• Dedupe Records
• Merge Records
Presentation
Layer
ETL Layer
Metadata
Repository
ODS
•PeopleSoft
•SAP
•Siebel
•Oracle Applications
•Manugistics
•Custom Systems
Data Mart
•Custom Tools
•HTML Reports
•Cognos
•Business Objects
•MicroStrategy
•Oracle Discoverer
•Brio
•Data Mining Tools
•Portals
Source Systems
Sample Technologies:
Contact Us On :
www.vibranttechnologies.co.in
OLTP DW
Data dependencies (E-R)
model
Dimensional model
Microscopic data
consistency
Global data consistency
Millions of transactions
per day
One transaction per day
Mostly does not keep
history
Keeping history is
necessary
Gets loaded in the day Gets loaded in the night
OLTP vs DW
Contact Us On :
www.vibranttechnologies.co.in
Dimensional Data Modeling
E-R model
Symmetric
Divides data into many entities
Describes entities and relationships
Seeks to eliminate data redundancy
Good for high transaction performance
Dimensional model
Asymmetric
Divides data into dimensions and facts
Describes dimensions and measures
Encourages data redundancy
Good for high query performance
Contact Us On :
www.vibranttechnologies.co.in
Fact
Central, dominant table
Multi-part primary key
Holds millions & billions of records
Links directly to dimensions
Stores business measures
Constantly varying data
Contact Us On :
www.vibranttechnologies.co.in
Facts/Dimensions (contd.)
 Dimensions
 Single join to the fact table (single primary key)
 Stores business attributes
 Attributes are textual in nature
 Organized into hierarchies
 More or less constant data
 E.g. Time, Product, Customer, Store, etc.
Contact Us On :
www.vibranttechnologies.co.in
Star/Snowflake schema
 Star schema
 Fact surrounded by 4-15 dimensions
 Dimensions are de-normalized
 Snowflake schema
 Star schema with secondary dimensions
 Don’t snowflake for saving space
 Snowflake if secondary dimensions have many attributes
Contact Us On :
www.vibranttechnologies.co.in
Star schema
Contact Us On :
www.vibranttechnologies.co.in
Star schema example
Contact Us On :
www.vibranttechnologies.co.in
Snowflake schema example
STORE KEY
Store Dimension
Store Description
City
State
District ID
District Desc.
Region_ID
Region Desc.
Regional Mgr.
District_ID
District Desc.
Region_ID
Region_ID
Region Desc.
Regional Mgr.
STORE KEY
PRODUCT KEY
PERIOD KEY
Dollars
Units
Price
Store Fact Table
Contact Us On :
www.vibranttechnologies.co.in
DM , DW & ODS
 DM
 Organized around a single business process
 Represents small part of the organization’s business
 Logical subset of the complete data warehouse
 Faster roll out, but complex integration in the long run
Contact Us On :
www.vibranttechnologies.co.in
DM , DW & ODS (contd.)
 DW
 Union of its constituent data marts
 Queryable source of data in the organization
 Requires extensive business modeling (may take
years to design and build)
 ODS
 Point of integration for operational systems
 Low-level decision support
 Can store integrated data, but at detailed level
Contact Us On :
www.vibranttechnologies.co.in
OLAP
 Element of decision support systems (DSS)
 Support (almost) ad-hoc querying for business analyst
 Helps the knowledge worker (executive, manager,
analyst) make faster & better decisions
 ROLAP - extended RDBMS that maps operations on
multidimensional data to standard relational operators
 MOLAP - Special-purpose server that directly
implements multidimensional data and operations
Contact Us On :
www.vibranttechnologies.co.in
Others
 Additive, semi-additive & non-additive facts
 Factless facts
 Slowly changing dimensions
 Conformed facts and dimensions
 Cubes
 Drill down / Drill up
 Slice and dice
Contact Us On : www.vibranttechnologies.co.in
Thank You

More Related Content

What's hot

Denodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Denodo DataFest 2017: Outpace Your Competition with Real-Time ResponsesDenodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Denodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Denodo
 
Cloud Modernization with Data Virtualization
Cloud Modernization with Data VirtualizationCloud Modernization with Data Virtualization
Cloud Modernization with Data Virtualization
Denodo
 
Ibm machine learning for z os
Ibm machine learning for z osIbm machine learning for z os
Ibm machine learning for z os
Cuneyt Goksu
 
Dynamics 365 for Finance and Operations - Power BI
Dynamics 365 for Finance and Operations - Power BIDynamics 365 for Finance and Operations - Power BI
Dynamics 365 for Finance and Operations - Power BI
Juan Fabian
 
NoSQL, which way to go?
NoSQL, which way to go?NoSQL, which way to go?
NoSQL, which way to go?
Ahmed Elharouny
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
Hiep Luong
 
Databricks Whitelabel: Making Petabyte Scale Data Consumable to All Our Custo...
Databricks Whitelabel: Making Petabyte Scale Data Consumable to All Our Custo...Databricks Whitelabel: Making Petabyte Scale Data Consumable to All Our Custo...
Databricks Whitelabel: Making Petabyte Scale Data Consumable to All Our Custo...
Databricks
 
Top 5 Things to Know About Integrating MongoDB into Your Data Warehouse
Top 5 Things to Know About Integrating MongoDB into Your Data WarehouseTop 5 Things to Know About Integrating MongoDB into Your Data Warehouse
Top 5 Things to Know About Integrating MongoDB into Your Data Warehouse
MongoDB
 
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
Lucas Jellema
 
Modern data warehouse
Modern data warehouseModern data warehouse
Modern data warehouse
Elena Lopez
 
Where does Fast Data Strategy Fit within IT Projects
Where does Fast Data Strategy Fit within IT ProjectsWhere does Fast Data Strategy Fit within IT Projects
Where does Fast Data Strategy Fit within IT Projects
Denodo
 
Columbia Migrates from Legacy Data Warehouse to an Open Data Platform with De...
Columbia Migrates from Legacy Data Warehouse to an Open Data Platform with De...Columbia Migrates from Legacy Data Warehouse to an Open Data Platform with De...
Columbia Migrates from Legacy Data Warehouse to an Open Data Platform with De...
Databricks
 
Building IoT and Big Data Solutions on Azure
Building IoT and Big Data Solutions on AzureBuilding IoT and Big Data Solutions on Azure
Building IoT and Big Data Solutions on Azure
Ido Flatow
 
Designing Fast Data Architecture for Big Data using Logical Data Warehouse a...
Designing Fast Data Architecture for Big Data  using Logical Data Warehouse a...Designing Fast Data Architecture for Big Data  using Logical Data Warehouse a...
Designing Fast Data Architecture for Big Data using Logical Data Warehouse a...
Denodo
 
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsVerizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
DataWorks Summit
 
Tapdata Product Intro
Tapdata Product IntroTapdata Product Intro
Tapdata Product Intro
Tapdata
 
Considerations for Data Access in the Lakehouse
Considerations for Data Access in the LakehouseConsiderations for Data Access in the Lakehouse
Considerations for Data Access in the Lakehouse
Databricks
 
Seeing Redshift: How Amazon Changed Data Warehousing Forever
Seeing Redshift: How Amazon Changed Data Warehousing ForeverSeeing Redshift: How Amazon Changed Data Warehousing Forever
Seeing Redshift: How Amazon Changed Data Warehousing Forever
Inside Analysis
 
Data Integration and Data Warehousing for Cloud, Big Data and IoT: 
What’s Ne...
Data Integration and Data Warehousing for Cloud, Big Data and IoT: 
What’s Ne...Data Integration and Data Warehousing for Cloud, Big Data and IoT: 
What’s Ne...
Data Integration and Data Warehousing for Cloud, Big Data and IoT: 
What’s Ne...
Rittman Analytics
 
Performance Considerations in Logical Data Warehouse
Performance Considerations in Logical Data WarehousePerformance Considerations in Logical Data Warehouse
Performance Considerations in Logical Data Warehouse
Denodo
 

What's hot (20)

Denodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Denodo DataFest 2017: Outpace Your Competition with Real-Time ResponsesDenodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Denodo DataFest 2017: Outpace Your Competition with Real-Time Responses
 
Cloud Modernization with Data Virtualization
Cloud Modernization with Data VirtualizationCloud Modernization with Data Virtualization
Cloud Modernization with Data Virtualization
 
Ibm machine learning for z os
Ibm machine learning for z osIbm machine learning for z os
Ibm machine learning for z os
 
Dynamics 365 for Finance and Operations - Power BI
Dynamics 365 for Finance and Operations - Power BIDynamics 365 for Finance and Operations - Power BI
Dynamics 365 for Finance and Operations - Power BI
 
NoSQL, which way to go?
NoSQL, which way to go?NoSQL, which way to go?
NoSQL, which way to go?
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
 
Databricks Whitelabel: Making Petabyte Scale Data Consumable to All Our Custo...
Databricks Whitelabel: Making Petabyte Scale Data Consumable to All Our Custo...Databricks Whitelabel: Making Petabyte Scale Data Consumable to All Our Custo...
Databricks Whitelabel: Making Petabyte Scale Data Consumable to All Our Custo...
 
Top 5 Things to Know About Integrating MongoDB into Your Data Warehouse
Top 5 Things to Know About Integrating MongoDB into Your Data WarehouseTop 5 Things to Know About Integrating MongoDB into Your Data Warehouse
Top 5 Things to Know About Integrating MongoDB into Your Data Warehouse
 
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
50 Shades of Data - Dutch Oracle Architects Platform (February 2018)
 
Modern data warehouse
Modern data warehouseModern data warehouse
Modern data warehouse
 
Where does Fast Data Strategy Fit within IT Projects
Where does Fast Data Strategy Fit within IT ProjectsWhere does Fast Data Strategy Fit within IT Projects
Where does Fast Data Strategy Fit within IT Projects
 
Columbia Migrates from Legacy Data Warehouse to an Open Data Platform with De...
Columbia Migrates from Legacy Data Warehouse to an Open Data Platform with De...Columbia Migrates from Legacy Data Warehouse to an Open Data Platform with De...
Columbia Migrates from Legacy Data Warehouse to an Open Data Platform with De...
 
Building IoT and Big Data Solutions on Azure
Building IoT and Big Data Solutions on AzureBuilding IoT and Big Data Solutions on Azure
Building IoT and Big Data Solutions on Azure
 
Designing Fast Data Architecture for Big Data using Logical Data Warehouse a...
Designing Fast Data Architecture for Big Data  using Logical Data Warehouse a...Designing Fast Data Architecture for Big Data  using Logical Data Warehouse a...
Designing Fast Data Architecture for Big Data using Logical Data Warehouse a...
 
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsVerizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
 
Tapdata Product Intro
Tapdata Product IntroTapdata Product Intro
Tapdata Product Intro
 
Considerations for Data Access in the Lakehouse
Considerations for Data Access in the LakehouseConsiderations for Data Access in the Lakehouse
Considerations for Data Access in the Lakehouse
 
Seeing Redshift: How Amazon Changed Data Warehousing Forever
Seeing Redshift: How Amazon Changed Data Warehousing ForeverSeeing Redshift: How Amazon Changed Data Warehousing Forever
Seeing Redshift: How Amazon Changed Data Warehousing Forever
 
Data Integration and Data Warehousing for Cloud, Big Data and IoT: 
What’s Ne...
Data Integration and Data Warehousing for Cloud, Big Data and IoT: 
What’s Ne...Data Integration and Data Warehousing for Cloud, Big Data and IoT: 
What’s Ne...
Data Integration and Data Warehousing for Cloud, Big Data and IoT: 
What’s Ne...
 
Performance Considerations in Logical Data Warehouse
Performance Considerations in Logical Data WarehousePerformance Considerations in Logical Data Warehouse
Performance Considerations in Logical Data Warehouse
 

Viewers also liked

множественное число существительных
множественное число существительныхмножественное число существительных
множественное число существительных
5555rok
 
Tony Hudson - Dissertation
Tony Hudson - DissertationTony Hudson - Dissertation
Tony Hudson - Dissertation
Tony Hudson
 
COMPUTER PEOPLE SALESFORCE
COMPUTER PEOPLE SALESFORCECOMPUTER PEOPLE SALESFORCE
COMPUTER PEOPLE SALESFORCE
Salesforce Recruitment Hub
 
Central nervous system introduction
Central nervous system introductionCentral nervous system introduction
Central nervous system introduction
Med Study
 
Inglés primaria-cuarto-1
Inglés primaria-cuarto-1Inglés primaria-cuarto-1
Inglés primaria-cuarto-1
Cecilia Perez
 
Bachelor Thesis final Version
Bachelor Thesis final VersionBachelor Thesis final Version
Bachelor Thesis final Version
Julia Schipperges
 
Trabajo final
Trabajo finalTrabajo final
Trabajo final
Kmilo1620
 
Social Connections - Installing Free Addons to IBM Conenctions
Social Connections - Installing Free Addons to IBM ConenctionsSocial Connections - Installing Free Addons to IBM Conenctions
Social Connections - Installing Free Addons to IBM Conenctions
Victor Toal
 
Jamie Rentoul, Department of Health, CfWI Annual Conference 2013
Jamie Rentoul, Department of Health, CfWI Annual Conference 2013Jamie Rentoul, Department of Health, CfWI Annual Conference 2013
Jamie Rentoul, Department of Health, CfWI Annual Conference 2013
C4WI
 
New screens, new measurements
New screens, new measurementsNew screens, new measurements
New screens, new measurements
iProspect Norge
 
POWERPOINT PRESENTATION
POWERPOINT PRESENTATIONPOWERPOINT PRESENTATION
POWERPOINT PRESENTATION
Rejith Raghavan
 
Nelinkertaisesta kaksinkertaiseen kirjanpitoon
Nelinkertaisesta kaksinkertaiseen kirjanpitoonNelinkertaisesta kaksinkertaiseen kirjanpitoon
Nelinkertaisesta kaksinkertaiseen kirjanpitoon
Flashnode Ltd.
 
Dicas de Presença no Facebook
Dicas de Presença no FacebookDicas de Presença no Facebook
Dicas de Presença no Facebook
Fabiano Santos de Oliveira
 

Viewers also liked (14)

множественное число существительных
множественное число существительныхмножественное число существительных
множественное число существительных
 
Tony Hudson - Dissertation
Tony Hudson - DissertationTony Hudson - Dissertation
Tony Hudson - Dissertation
 
COMPUTER PEOPLE SALESFORCE
COMPUTER PEOPLE SALESFORCECOMPUTER PEOPLE SALESFORCE
COMPUTER PEOPLE SALESFORCE
 
Central nervous system introduction
Central nervous system introductionCentral nervous system introduction
Central nervous system introduction
 
Marcos
MarcosMarcos
Marcos
 
Inglés primaria-cuarto-1
Inglés primaria-cuarto-1Inglés primaria-cuarto-1
Inglés primaria-cuarto-1
 
Bachelor Thesis final Version
Bachelor Thesis final VersionBachelor Thesis final Version
Bachelor Thesis final Version
 
Trabajo final
Trabajo finalTrabajo final
Trabajo final
 
Social Connections - Installing Free Addons to IBM Conenctions
Social Connections - Installing Free Addons to IBM ConenctionsSocial Connections - Installing Free Addons to IBM Conenctions
Social Connections - Installing Free Addons to IBM Conenctions
 
Jamie Rentoul, Department of Health, CfWI Annual Conference 2013
Jamie Rentoul, Department of Health, CfWI Annual Conference 2013Jamie Rentoul, Department of Health, CfWI Annual Conference 2013
Jamie Rentoul, Department of Health, CfWI Annual Conference 2013
 
New screens, new measurements
New screens, new measurementsNew screens, new measurements
New screens, new measurements
 
POWERPOINT PRESENTATION
POWERPOINT PRESENTATIONPOWERPOINT PRESENTATION
POWERPOINT PRESENTATION
 
Nelinkertaisesta kaksinkertaiseen kirjanpitoon
Nelinkertaisesta kaksinkertaiseen kirjanpitoonNelinkertaisesta kaksinkertaiseen kirjanpitoon
Nelinkertaisesta kaksinkertaiseen kirjanpitoon
 
Dicas de Presença no Facebook
Dicas de Presença no FacebookDicas de Presença no Facebook
Dicas de Presença no Facebook
 

Similar to professional informatica trainer

Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Denodo
 
Big Data Analytics in the Cloud with Microsoft Azure
Big Data Analytics in the Cloud with Microsoft AzureBig Data Analytics in the Cloud with Microsoft Azure
Big Data Analytics in the Cloud with Microsoft Azure
Mark Kromer
 
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Hortonworks
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Denodo
 
The Value of Customer Insights & Analytics in a Modern Retail Environment
The Value of Customer Insights & Analytics in a Modern Retail EnvironmentThe Value of Customer Insights & Analytics in a Modern Retail Environment
The Value of Customer Insights & Analytics in a Modern Retail Environment
Denodo
 
Logical Data Fabric and Data Mesh – Driving Business Outcomes
Logical Data Fabric and Data Mesh – Driving Business OutcomesLogical Data Fabric and Data Mesh – Driving Business Outcomes
Logical Data Fabric and Data Mesh – Driving Business Outcomes
Denodo
 
Kaizentric Presentation
Kaizentric PresentationKaizentric Presentation
Kaizentric Presentation
Azhagarasan Annadorai
 
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
The Practice of Big Data - The Hadoop ecosystem explained with usage scenariosThe Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
kcmallu
 
Overview of business intelligence
Overview of business intelligenceOverview of business intelligence
Overview of business intelligence
Ahsan Kabir
 
Cerebro: Bringing together data scientists and bi users - Royal Caribbean - S...
Cerebro: Bringing together data scientists and bi users - Royal Caribbean - S...Cerebro: Bringing together data scientists and bi users - Royal Caribbean - S...
Cerebro: Bringing together data scientists and bi users - Royal Caribbean - S...
Thomas W. Fry
 
the Data World Distilled
the Data World Distilledthe Data World Distilled
the Data World Distilled
RTTS
 
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
Enable Better Decision Making with Power BI Visualizations & Modern Data EstateEnable Better Decision Making with Power BI Visualizations & Modern Data Estate
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
CCG
 
Logical Data Warehouse and Data Lakes
Logical Data Warehouse and Data Lakes Logical Data Warehouse and Data Lakes
Logical Data Warehouse and Data Lakes
Denodo
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
DATAVERSITY
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Denodo
 
Data Virtualization for Data Architects (Australia)
Data Virtualization for Data Architects (Australia)Data Virtualization for Data Architects (Australia)
Data Virtualization for Data Architects (Australia)
Denodo
 
How to Place Data at the Center of Digital Transformation in BFSI
How to Place Data at the Center of Digital Transformation in BFSIHow to Place Data at the Center of Digital Transformation in BFSI
How to Place Data at the Center of Digital Transformation in BFSI
Denodo
 
Pitfalls of Data Warehousing_2019-04-24
Pitfalls of Data Warehousing_2019-04-24Pitfalls of Data Warehousing_2019-04-24
Pitfalls of Data Warehousing_2019-04-24
Martin Bém
 
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
Amazon Web Services Korea
 
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Denodo
 

Similar to professional informatica trainer (20)

Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
 
Big Data Analytics in the Cloud with Microsoft Azure
Big Data Analytics in the Cloud with Microsoft AzureBig Data Analytics in the Cloud with Microsoft Azure
Big Data Analytics in the Cloud with Microsoft Azure
 
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
 
The Value of Customer Insights & Analytics in a Modern Retail Environment
The Value of Customer Insights & Analytics in a Modern Retail EnvironmentThe Value of Customer Insights & Analytics in a Modern Retail Environment
The Value of Customer Insights & Analytics in a Modern Retail Environment
 
Logical Data Fabric and Data Mesh – Driving Business Outcomes
Logical Data Fabric and Data Mesh – Driving Business OutcomesLogical Data Fabric and Data Mesh – Driving Business Outcomes
Logical Data Fabric and Data Mesh – Driving Business Outcomes
 
Kaizentric Presentation
Kaizentric PresentationKaizentric Presentation
Kaizentric Presentation
 
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
The Practice of Big Data - The Hadoop ecosystem explained with usage scenariosThe Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
 
Overview of business intelligence
Overview of business intelligenceOverview of business intelligence
Overview of business intelligence
 
Cerebro: Bringing together data scientists and bi users - Royal Caribbean - S...
Cerebro: Bringing together data scientists and bi users - Royal Caribbean - S...Cerebro: Bringing together data scientists and bi users - Royal Caribbean - S...
Cerebro: Bringing together data scientists and bi users - Royal Caribbean - S...
 
the Data World Distilled
the Data World Distilledthe Data World Distilled
the Data World Distilled
 
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
Enable Better Decision Making with Power BI Visualizations & Modern Data EstateEnable Better Decision Making with Power BI Visualizations & Modern Data Estate
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
 
Logical Data Warehouse and Data Lakes
Logical Data Warehouse and Data Lakes Logical Data Warehouse and Data Lakes
Logical Data Warehouse and Data Lakes
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
 
Data Virtualization for Data Architects (Australia)
Data Virtualization for Data Architects (Australia)Data Virtualization for Data Architects (Australia)
Data Virtualization for Data Architects (Australia)
 
How to Place Data at the Center of Digital Transformation in BFSI
How to Place Data at the Center of Digital Transformation in BFSIHow to Place Data at the Center of Digital Transformation in BFSI
How to Place Data at the Center of Digital Transformation in BFSI
 
Pitfalls of Data Warehousing_2019-04-24
Pitfalls of Data Warehousing_2019-04-24Pitfalls of Data Warehousing_2019-04-24
Pitfalls of Data Warehousing_2019-04-24
 
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
 
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
 

Recently uploaded

Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
Things to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUUThings to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUU
FODUU
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
David Brossard
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 

Recently uploaded (20)

Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
Things to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUUThings to Consider When Choosing a Website Developer for your Website | FODUU
Things to Consider When Choosing a Website Developer for your Website | FODUU
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 

professional informatica trainer

  • 1. WELCOME TO VIBRANT TECHNOLOGIES & COMPUTER Online Informatica Training Contact Us On : www.vibranttechnologies.co.in
  • 2. Contact Us On : www.vibranttechnologies.co.in
  • 3. Data Warehousing - Architecture Enterprise Data Warehouse Data Mart Data Mart Execution Systems • CRM • ERP • Legacy • e-Commerce Reporting Tools OLAP Tools Ad Hoc Query Tools Data Mining Tools External Data • Purchased Market Data • Spreadsheets •Oracle •SQL Server •Teradata •DB2 Data and Metadata Repository Layer ETL Tools: •Informatica PowerMart •ETI •Oracle Warehouse Builder •Custom programs •SQL scripts Extract, Transformation, and Load (ETL) Layer • Cleanse Data • Filter Records • Standardize Values • Decode Values • Apply Business Rules • Householding • Dedupe Records • Merge Records Presentation Layer ETL Layer Metadata Repository ODS •PeopleSoft •SAP •Siebel •Oracle Applications •Manugistics •Custom Systems Data Mart •Custom Tools •HTML Reports •Cognos •Business Objects •MicroStrategy •Oracle Discoverer •Brio •Data Mining Tools •Portals Source Systems Sample Technologies:
  • 4. Contact Us On : www.vibranttechnologies.co.in OLTP DW Data dependencies (E-R) model Dimensional model Microscopic data consistency Global data consistency Millions of transactions per day One transaction per day Mostly does not keep history Keeping history is necessary Gets loaded in the day Gets loaded in the night OLTP vs DW
  • 5. Contact Us On : www.vibranttechnologies.co.in Dimensional Data Modeling E-R model Symmetric Divides data into many entities Describes entities and relationships Seeks to eliminate data redundancy Good for high transaction performance Dimensional model Asymmetric Divides data into dimensions and facts Describes dimensions and measures Encourages data redundancy Good for high query performance
  • 6. Contact Us On : www.vibranttechnologies.co.in Fact Central, dominant table Multi-part primary key Holds millions & billions of records Links directly to dimensions Stores business measures Constantly varying data
  • 7. Contact Us On : www.vibranttechnologies.co.in Facts/Dimensions (contd.)  Dimensions  Single join to the fact table (single primary key)  Stores business attributes  Attributes are textual in nature  Organized into hierarchies  More or less constant data  E.g. Time, Product, Customer, Store, etc.
  • 8. Contact Us On : www.vibranttechnologies.co.in Star/Snowflake schema  Star schema  Fact surrounded by 4-15 dimensions  Dimensions are de-normalized  Snowflake schema  Star schema with secondary dimensions  Don’t snowflake for saving space  Snowflake if secondary dimensions have many attributes
  • 9. Contact Us On : www.vibranttechnologies.co.in Star schema
  • 10. Contact Us On : www.vibranttechnologies.co.in Star schema example
  • 11. Contact Us On : www.vibranttechnologies.co.in Snowflake schema example STORE KEY Store Dimension Store Description City State District ID District Desc. Region_ID Region Desc. Regional Mgr. District_ID District Desc. Region_ID Region_ID Region Desc. Regional Mgr. STORE KEY PRODUCT KEY PERIOD KEY Dollars Units Price Store Fact Table
  • 12. Contact Us On : www.vibranttechnologies.co.in DM , DW & ODS  DM  Organized around a single business process  Represents small part of the organization’s business  Logical subset of the complete data warehouse  Faster roll out, but complex integration in the long run
  • 13. Contact Us On : www.vibranttechnologies.co.in DM , DW & ODS (contd.)  DW  Union of its constituent data marts  Queryable source of data in the organization  Requires extensive business modeling (may take years to design and build)  ODS  Point of integration for operational systems  Low-level decision support  Can store integrated data, but at detailed level
  • 14. Contact Us On : www.vibranttechnologies.co.in OLAP  Element of decision support systems (DSS)  Support (almost) ad-hoc querying for business analyst  Helps the knowledge worker (executive, manager, analyst) make faster & better decisions  ROLAP - extended RDBMS that maps operations on multidimensional data to standard relational operators  MOLAP - Special-purpose server that directly implements multidimensional data and operations
  • 15. Contact Us On : www.vibranttechnologies.co.in Others  Additive, semi-additive & non-additive facts  Factless facts  Slowly changing dimensions  Conformed facts and dimensions  Cubes  Drill down / Drill up  Slice and dice
  • 16. Contact Us On : www.vibranttechnologies.co.in Thank You