SlideShare a Scribd company logo
Presented by Gopalakrishnan K
KG Data Solutions
gopalk@kgds.org
 What Is A Data Warehouse?
 History
 Current scenario
 Characteristics
 Operational Database vs. Data Warehouse
 Architecture
 Data Model
Gopal K KGDS
 The term "data warehouse" refers to a special
type of database that acts as the central
repository for company data. It can be thought of
as a database archive that is segregated from the
operational databases, and used primarily for
reporting and data mining purposes.
 The relational database revolution in the early
1980s ushered in an era of improved access to
the valuable information contained deep within
data. Still improvements were needed.
 It was soon discovered that databases modeled
to be efficient at transactional processing were
not always optimized for complex reporting or
analytical needs
 Inmon champions the large centralized Data Warehouse approach
leveraging solid relational design principles. His Corporate
Information Factory remains an example of this "top down"
philosophy.
 Kimball, on the other hand, favors the development of individual
data marts at the departmental level that get integrated together
using the Information Bus architecture. This "bottom up" approach
dovetails nicely with Kimball's preference for star-schema modeling
Many of the current changes in today's data industry also affect Data
Warehousing. Cloud storage and high-velocity, real-time data analysis
being two obvious factors playing a role in the practice's evolution. On
the end-user side, web-based and mobile access to decision support or
reporting data is a major requirement on many projects. Advances in
the practice of ontology have enhanced the capabilities of ETL systems
to parse information out of unstructured as well as structured data
sources
 Subject-oriented
The data in the database is organized so that all the data elements
relating to the same real-world event or object are linked together.
 Time-variant
The changes to the data in the database are tracked and recorded
so that reports can be produced showing changes over time.
 Non-volatile
Data in the database is never over-written or deleted. Once
committed, the data is static, read-only, but retained for future
reporting.
 Integrated
The database contains data from most or all of an organization's
operational applications, and that this data is made consistent.
 The processing load of reporting reduced the
response time of the operational systems.
 The database designs of operational systems
were not optimized for information analysis and
reporting.
 Most organizations had more than one
operational system, so company-wide reporting
could not be supported from a single system.
 Development of reports in operational systems
often required writing specific computer
programs which was slow and expensive.
 Consolidation of data from a wide variety of data
sources.
 Ability to analyze data beyond the level of
standard monitoring reports.
 Operational response time unaffected.
Data warehouse presentation
Data warehouse presentation
Data warehouse presentation

More Related Content

What's hot

Data warehouseconceptsandarchitecture
Data warehouseconceptsandarchitectureData warehouseconceptsandarchitecture
Data warehouseconceptsandarchitecture
samaksh1982
 
Hds ucp sap hana infographic v6[1]
Hds ucp  sap hana infographic v6[1]Hds ucp  sap hana infographic v6[1]
Hds ucp sap hana infographic v6[1]
Barbara Götz
 
Data flow in Extraction of ETL data warehousing
Data flow in Extraction of ETL data warehousingData flow in Extraction of ETL data warehousing
Data flow in Extraction of ETL data warehousing
Dr. Dipti Patil
 

What's hot (20)

data warehousing
data warehousingdata warehousing
data warehousing
 
Managing Data Integration Initiatives
Managing Data Integration InitiativesManaging Data Integration Initiatives
Managing Data Integration Initiatives
 
Datos iO Product Overview
Datos iO Product OverviewDatos iO Product Overview
Datos iO Product Overview
 
Data warehouse architecture
Data warehouse architecture Data warehouse architecture
Data warehouse architecture
 
Data warehouseconceptsandarchitecture
Data warehouseconceptsandarchitectureData warehouseconceptsandarchitecture
Data warehouseconceptsandarchitecture
 
Sridhar-Profile-0117
Sridhar-Profile-0117Sridhar-Profile-0117
Sridhar-Profile-0117
 
7 data warehouse & marts
7 data warehouse & marts7 data warehouse & marts
7 data warehouse & marts
 
How Yellowbrick Data Integrates to Existing Environments Webcast
How Yellowbrick Data Integrates to Existing Environments WebcastHow Yellowbrick Data Integrates to Existing Environments Webcast
How Yellowbrick Data Integrates to Existing Environments Webcast
 
Denodo DataFest 2017: Business Needs for a Fast Data Strategy
Denodo DataFest 2017: Business Needs for a Fast Data StrategyDenodo DataFest 2017: Business Needs for a Fast Data Strategy
Denodo DataFest 2017: Business Needs for a Fast Data Strategy
 
The Big Data Analytics Ecosystem at LinkedIn
The Big Data Analytics Ecosystem at LinkedInThe Big Data Analytics Ecosystem at LinkedIn
The Big Data Analytics Ecosystem at LinkedIn
 
Data mining and data warehousing
Data mining and data warehousingData mining and data warehousing
Data mining and data warehousing
 
Hds ucp sap hana infographic v6[1]
Hds ucp  sap hana infographic v6[1]Hds ucp  sap hana infographic v6[1]
Hds ucp sap hana infographic v6[1]
 
IT Category Purchasing Managers Opportunity for Savings with Non Relational S...
IT Category Purchasing Managers Opportunity for Savings with Non Relational S...IT Category Purchasing Managers Opportunity for Savings with Non Relational S...
IT Category Purchasing Managers Opportunity for Savings with Non Relational S...
 
Gartner Cool Vendor Report 2014
Gartner Cool Vendor Report 2014Gartner Cool Vendor Report 2014
Gartner Cool Vendor Report 2014
 
Etl elt simplified
Etl elt simplifiedEtl elt simplified
Etl elt simplified
 
Data warehouse
Data warehouse Data warehouse
Data warehouse
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
Data flow in Extraction of ETL data warehousing
Data flow in Extraction of ETL data warehousingData flow in Extraction of ETL data warehousing
Data flow in Extraction of ETL data warehousing
 
Taming the ETL beast: How LinkedIn uses metadata to run complex ETL flows rel...
Taming the ETL beast: How LinkedIn uses metadata to run complex ETL flows rel...Taming the ETL beast: How LinkedIn uses metadata to run complex ETL flows rel...
Taming the ETL beast: How LinkedIn uses metadata to run complex ETL flows rel...
 
Why shift from ETL to ELT?
Why shift from ETL to ELT?Why shift from ETL to ELT?
Why shift from ETL to ELT?
 

Viewers also liked

Jennifer_Goldberg_QuittingFeature
Jennifer_Goldberg_QuittingFeatureJennifer_Goldberg_QuittingFeature
Jennifer_Goldberg_QuittingFeature
Jennifer Goldberg
 
Social Media position paper
Social Media position paperSocial Media position paper
Social Media position paper
Shelly Lawrence
 
Managing warehouse operations. How to manage and run warehouse operations by ...
Managing warehouse operations. How to manage and run warehouse operations by ...Managing warehouse operations. How to manage and run warehouse operations by ...
Managing warehouse operations. How to manage and run warehouse operations by ...
Omar Youssef
 

Viewers also liked (20)

Immigrate to canada from hyderabad
Immigrate to canada from hyderabadImmigrate to canada from hyderabad
Immigrate to canada from hyderabad
 
Qué es una webquest
Qué es una webquest Qué es una webquest
Qué es una webquest
 
Jennifer_Goldberg_QuittingFeature
Jennifer_Goldberg_QuittingFeatureJennifer_Goldberg_QuittingFeature
Jennifer_Goldberg_QuittingFeature
 
Gestão empreendedora de mídia social
Gestão empreendedora de mídia socialGestão empreendedora de mídia social
Gestão empreendedora de mídia social
 
Presentation zse drukarska-karolina&dominika
Presentation zse drukarska-karolina&dominikaPresentation zse drukarska-karolina&dominika
Presentation zse drukarska-karolina&dominika
 
Dicas e truques no photoshop
Dicas e truques no photoshopDicas e truques no photoshop
Dicas e truques no photoshop
 
ashutosh_1401
ashutosh_1401ashutosh_1401
ashutosh_1401
 
Syphilis congenital
Syphilis congenital Syphilis congenital
Syphilis congenital
 
Volume benda putar cincin untuk diupload di slide share
Volume benda putar cincin untuk diupload di slide shareVolume benda putar cincin untuk diupload di slide share
Volume benda putar cincin untuk diupload di slide share
 
Творческий проект "Кулинарные истории"Телегиной Е.
Творческий проект "Кулинарные истории"Телегиной Е.Творческий проект "Кулинарные истории"Телегиной Е.
Творческий проект "Кулинарные истории"Телегиной Е.
 
Творческий проект "Хранение информации"
Творческий проект "Хранение информации"Творческий проект "Хранение информации"
Творческий проект "Хранение информации"
 
"Кулайка"
 "Кулайка" "Кулайка"
"Кулайка"
 
Творческая работа "Мы за ЗОЖ!"
Творческая работа "Мы за ЗОЖ!"Творческая работа "Мы за ЗОЖ!"
Творческая работа "Мы за ЗОЖ!"
 
Творческий проект "Кулинарный поединок"
Творческий проект "Кулинарный поединок"Творческий проект "Кулинарный поединок"
Творческий проект "Кулинарный поединок"
 
Творческая работа "Реки на просторах Томской области"
Творческая работа "Реки на просторах Томской области"Творческая работа "Реки на просторах Томской области"
Творческая работа "Реки на просторах Томской области"
 
Sampling - Stratified vs Cluster
Sampling - Stratified vs ClusterSampling - Stratified vs Cluster
Sampling - Stratified vs Cluster
 
Come un territorio diventa creativo. Una lezione veneziana 11 01 17 Andrea Po...
Come un territorio diventa creativo. Una lezione veneziana 11 01 17 Andrea Po...Come un territorio diventa creativo. Una lezione veneziana 11 01 17 Andrea Po...
Come un territorio diventa creativo. Una lezione veneziana 11 01 17 Andrea Po...
 
Parametric vs Non-Parametric
Parametric vs Non-ParametricParametric vs Non-Parametric
Parametric vs Non-Parametric
 
Social Media position paper
Social Media position paperSocial Media position paper
Social Media position paper
 
Managing warehouse operations. How to manage and run warehouse operations by ...
Managing warehouse operations. How to manage and run warehouse operations by ...Managing warehouse operations. How to manage and run warehouse operations by ...
Managing warehouse operations. How to manage and run warehouse operations by ...
 

Similar to Data warehouse presentation

Informatica and datawarehouse Material
Informatica and datawarehouse MaterialInformatica and datawarehouse Material
Informatica and datawarehouse Material
obieefans
 

Similar to Data warehouse presentation (20)

Data warehouse concepts
Data warehouse conceptsData warehouse concepts
Data warehouse concepts
 
Informatica and datawarehouse Material
Informatica and datawarehouse MaterialInformatica and datawarehouse Material
Informatica and datawarehouse Material
 
Modern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | QuboleModern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | Qubole
 
Top 60+ Data Warehouse Interview Questions and Answers.pdf
Top 60+ Data Warehouse Interview Questions and Answers.pdfTop 60+ Data Warehouse Interview Questions and Answers.pdf
Top 60+ Data Warehouse Interview Questions and Answers.pdf
 
Data Warehouse Basic Guide
Data Warehouse Basic GuideData Warehouse Basic Guide
Data Warehouse Basic Guide
 
Benefits of a data lake
Benefits of a data lake Benefits of a data lake
Benefits of a data lake
 
DW 101
DW 101DW 101
DW 101
 
Oracle sql plsql & dw
Oracle sql plsql & dwOracle sql plsql & dw
Oracle sql plsql & dw
 
TOPIC 9 data warehousing and data mining.pdf
TOPIC 9 data warehousing and data mining.pdfTOPIC 9 data warehousing and data mining.pdf
TOPIC 9 data warehousing and data mining.pdf
 
data warehousing and data mining (1).pdf
data warehousing and data mining (1).pdfdata warehousing and data mining (1).pdf
data warehousing and data mining (1).pdf
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A Comparison
 
A Comparitive Study Of ETL Tools
A Comparitive Study Of ETL ToolsA Comparitive Study Of ETL Tools
A Comparitive Study Of ETL Tools
 
Fbdl enabling comprehensive_data_services
Fbdl enabling comprehensive_data_servicesFbdl enabling comprehensive_data_services
Fbdl enabling comprehensive_data_services
 
The technology of the business data lake
The technology of the business data lakeThe technology of the business data lake
The technology of the business data lake
 
Data warehousing
Data warehousingData warehousing
Data warehousing
 
BI Architecture in support of data quality
BI Architecture in support of data qualityBI Architecture in support of data quality
BI Architecture in support of data quality
 
Datawarehouse
DatawarehouseDatawarehouse
Datawarehouse
 
Data processing in Industrial Systems course notes after week 5
Data processing in Industrial Systems course notes after week 5Data processing in Industrial Systems course notes after week 5
Data processing in Industrial Systems course notes after week 5
 
Implementation of Data Marts in Data ware house
Implementation of Data Marts in Data ware houseImplementation of Data Marts in Data ware house
Implementation of Data Marts in Data ware house
 
DATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining forDATAWAREHOUSE MAIn under data mining for
DATAWAREHOUSE MAIn under data mining for
 

Recently uploaded

Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Peter Udo Diehl
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
Bhaskar Mitra
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 

Recently uploaded (20)

When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
 
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 

Data warehouse presentation

  • 1. Presented by Gopalakrishnan K KG Data Solutions gopalk@kgds.org
  • 2.  What Is A Data Warehouse?  History  Current scenario  Characteristics  Operational Database vs. Data Warehouse  Architecture  Data Model Gopal K KGDS
  • 3.  The term "data warehouse" refers to a special type of database that acts as the central repository for company data. It can be thought of as a database archive that is segregated from the operational databases, and used primarily for reporting and data mining purposes.
  • 4.  The relational database revolution in the early 1980s ushered in an era of improved access to the valuable information contained deep within data. Still improvements were needed.  It was soon discovered that databases modeled to be efficient at transactional processing were not always optimized for complex reporting or analytical needs
  • 5.  Inmon champions the large centralized Data Warehouse approach leveraging solid relational design principles. His Corporate Information Factory remains an example of this "top down" philosophy.  Kimball, on the other hand, favors the development of individual data marts at the departmental level that get integrated together using the Information Bus architecture. This "bottom up" approach dovetails nicely with Kimball's preference for star-schema modeling
  • 6. Many of the current changes in today's data industry also affect Data Warehousing. Cloud storage and high-velocity, real-time data analysis being two obvious factors playing a role in the practice's evolution. On the end-user side, web-based and mobile access to decision support or reporting data is a major requirement on many projects. Advances in the practice of ontology have enhanced the capabilities of ETL systems to parse information out of unstructured as well as structured data sources
  • 7.  Subject-oriented The data in the database is organized so that all the data elements relating to the same real-world event or object are linked together.  Time-variant The changes to the data in the database are tracked and recorded so that reports can be produced showing changes over time.
  • 8.  Non-volatile Data in the database is never over-written or deleted. Once committed, the data is static, read-only, but retained for future reporting.  Integrated The database contains data from most or all of an organization's operational applications, and that this data is made consistent.
  • 9.  The processing load of reporting reduced the response time of the operational systems.  The database designs of operational systems were not optimized for information analysis and reporting.
  • 10.  Most organizations had more than one operational system, so company-wide reporting could not be supported from a single system.  Development of reports in operational systems often required writing specific computer programs which was slow and expensive.
  • 11.  Consolidation of data from a wide variety of data sources.  Ability to analyze data beyond the level of standard monitoring reports.  Operational response time unaffected.