SlideShare a Scribd company logo
1 of 20
Download to read offline
How Real Time Data
Requirements Change the Data
Warehouse Environment
Mark Madsen – September 17, 2008
www.ThirdNature.net




                     Attribution-NonCommercial-No Derivative
                     http://creativecommons.org/licenses/by-nc-nd/3.0/us/
Outline
 What’s real-time about?
 Impacts on the data
 warehouse architecture
        Delivering data to users
        Extracting the data
        Storing the data
        Operations
 Getting started


Third Nature, January 2008    Mark Madsen   Slide 2
Speeding Up the Data Warehouse



Why?
Faster reaction time

Reduced decision time

New process capabilities




Third Nature, January 2008   Mark Madsen   Slide 3
Which Decisions Benefit?
                             Strategic                 Operational
    Decision time            flexible, long cycle
                                         constrained, short
                                         cycle
    Decision scope broad, organizational narrow, departmental
                                         or process
    Decision model Complex                             Simple

    Data latency             High, history is core     Low, recent data is
                             to decisions              core to decisions

    Data scope               Many sources, many Few sources,
                             types, aggregated  structured, detailed
      Most real time needs will be driven by operational decision
      making, not strategic decisions.
Third Nature, January 2008               Mark Madsen                         Slide 4
Strategy, Decisions and Data Latency

Goal                          Increase share of low to mid market customers


Strategy               Reduce cost of products sold             Improve promotional performance



Tactics                      Efficient sourcing                     Decrease Out of Stocks




        Consolidate suppliers           Improve delivery compliance          Catch out of stocks
                                                                              before they occur


BI Needs
                    Reports &                Dashboards, alerts               Real time alerts &
                  spreadsheets                 & scorecards                  embedded analytics
Third Nature, January 2008                        Mark Madsen                                 Slide 5
What People Are Doing Today
                              Monthly      W eekly   Daily       Multiple times per day      On demand


  2002                  32%                    34%                               69%                      15%         6%




  2004              27%                  29%                         65%                       30%              19%




  2006       3                24                                  44                                 29



         0%            10%         20%         30%   40%          50%      60%         70%     80%        90%         100%
                                                                                                Sources: TDWI, Gartner


         At the same time, data volumes are rising for most data
         warehouses at 50% to 100% per year.

Third Nature, January 2008                                   Mark Madsen                                              Slide 6
BI Efforts Involving Real Time Data Access
                             Terms you may hear from the
                             BI market that imply real time:
                                   Operational BI
                                   Embedded analytics
                                   Decision automation
                                   Complex event processing
                                   Event-driven BI
                                   Process-driven BI

                             They are all similar in
                             requiring some level of low
                             latency data access.
Third Nature, January 2008   Mark Madsen                      Slide 7
Impacts on the DW Architecture
     Databases Dashboards           OLAP     Productivity   BAM/BPM         Reporting   Analytics Applications




                                                Data Consumers

                                                   Delivery

                             DW Platforms                                                        Adding current
                                                                                                 data to the system
                                     Warehouse                      Mart                         requires effort at all
                                     Database                                                    three layers
                                                                 Content
                                           ODS                    Store


                                  ETL                   EDR                    EII

      Databases              Documents     Flat Files       XML             Queues        ERP      Applications



                                            Source Environments
Third Nature, January 2008                                    Mark Madsen                                         Slide 8
One Architecture or Two?
   In-line with process:
                                                                   RT BI
      • Real time data flows separately
        from the warehouse data
      • May include a low-latency data
        store in the real time environment                         Process

      • This model be needed for
        extremely low latency data
                                                                   BI
      • More applicable for event-driven
                                                      Batch   DW



   Out of band:
      • Data to the consumer first flows                           Process
        through the DW
      • Unified architecture for both low
        and high latency data                                      BI &
                                                                   RT BI
      • More applicable for on-demand                  DW


Third Nature, January 2008              Mark Madsen                     Slide 9
User Interface: Two BI Usage Models
                                Demand driven
                                  • Users ask for current data
                                  • Most BI tools work this way
                                  • Harder to adapt these tools to
                                    event-driven models


                                Event driven
                                  • System takes action based on
                                    data, e.g. alerts, rule engines
                                  • May not have (or need) an end
                                    user interface
                                  • Need understanding of decision
                                    & action process for this model
Third Nature, January 2008   Mark Madsen                          Slide 10
BI Tools Need New Capabilities
    Embedding BI within
    applications
        • UI embedding
        • Full embedding
    Event-based integration
    Feeding BI data to
    applications: services, not
    SQL, may be desired

    Custom UI code may be
    preferable to a BI tool

Third Nature, January 2008    Mark Madsen   Slide 11
The Data Integration Layer
    • Integration is the most complex
      element of adding real time data.
    • Inline vs. out of band, demand vs.
      event-driven BI usage create
      different DI requirements.
    • You may not have exactly the
      same metrics, attributes or data
      extract logic.
    • Don’t count on replacing the ETL
      batch; more likely you are
      augmenting it.
    • You probably need to add new DI
      technologies to your portfolio.
    • Batch performance design isn’t
      like real time design.
Third Nature, January 2008       Mark Madsen   Slide 12
Speeding Up Data Integration Methods


      Single batch

                             Frequent batch

                                       Mini-batch

                                                            Continuous load

                                                                      Streaming



          Hourly+                                                        Immediate


Third Nature, January 2008                    Mark Madsen                         Slide 13
The Platform Layer: Data and Database
                                   • Schemas will need changes.
                                   • You don’t need to convert the
                                     entire database to a real time
                                     schema.
                                   • One schema or two?
                                   • Event-driven BI creates
                                     different query patterns and
                                     workloads.
                                   • Configuration and tuning may
                                     be different than what you are
                                     used to with traditional BI.
                                   • Application developers want
                                     services or ORMs, not SQL.


Third Nature, January 2008   Mark Madsen                       Slide 14
Different Platform Workloads
        Databases Dashboards        OLAP     Productivity   BAM/BPM   Reporting   Analytics Applications




                                                Data Consumers

                                                   Delivery

                             DW Platforms                                                 Three workloads:
                                                                                            Data loading +
                                     Warehouse                    Mart                      Normal BI +
                                     Database                                               Real time BI
                                                               Content                    = complications
                                           ODS                  Store


                                  ETL                   EDR              EII

         Databases           Documents     Flat Files       XML       Queues        ERP      Applications




Third Nature, January 2008
                                            Source Environments
                                                     Mark Madsen                                            Slide 15
Development, Maintenance & Operations
                                    • Real time decisions on real
                                      time data mean data
                                      quality plays a larger role,
                                      and it’s harder to address.
                                    • Warehouse availability
                                      becomes much more
                                      important to the business,
                                      and it isn’t just the
                                      database – it’s everything.
                                    • Performance and meeting
                                      strict BI SLAs will rise in
                                      importance since you are
                                      now tied in to business
                                      operations.

Third Nature, January 2008   Mark Madsen                      Slide 16
A Prescription for Getting Started
    1. Star with a decision
       process
    2. Define data needs for the
       process
    3. Ensure that data is
       available at the right
       latency
    4. Determine appropriate
       data integration
       technologies.
    5. Design and initiate
       upstream work
    6. Build
Third Nature, January 2008   Mark Madsen   Slide 17
Thanks




Third Nature, January 2008   Mark Madsen   Slide 18
CC Image Attributions
    Thanks to the people who supplied the creative commons licensed images used in this presentation:
    • Divers - http://flickr.com/photos/raveller/
    • Fast dog - http://flickr.com/photos/marinacvinhal/379111290/
    • Febo - http://flickr.com/photos/igor/419425754/
    • Subway - http://flickr.com/photos/neilsphotoalbum/504517855/
    • Cadillac ranch - http://flickr.com/photos/whatknot/179655095/




Third Nature, January 2008                          Mark Madsen                                         Slide 19
About the Presenter
                            Mark Madsen is president of Third
                            Nature, a technology research and
                            consulting firm focused on business
                            intelligence, data integration and
                            data management. Mark is an
                            award-winning author, architect and
                            CTO whose work has been featured
                            in numerous industry publications.
                            Over the past ten years Mark
                            received awards for his work from
                            the American Productivity & Quality
                            Center, TDWI, and the Smithsonian
                            Institute. He is an international
                            speaker, a contributing editor at
                            Intelligent Enterprise, and manages
                            the open source channel at the
                            Business Intelligence Network. For
                            more information or to contact Mark,
                            visit http://ThirdNature.net.
                      Page 20

More Related Content

What's hot

White paper making an-operational_data_store_(ods)_the_center_of_your_data_...
White paper   making an-operational_data_store_(ods)_the_center_of_your_data_...White paper   making an-operational_data_store_(ods)_the_center_of_your_data_...
White paper making an-operational_data_store_(ods)_the_center_of_your_data_...Eric Javier Espino Man
 
Data warehousing Demo PPTS | Over View | Introduction
Data warehousing Demo PPTS | Over View | Introduction Data warehousing Demo PPTS | Over View | Introduction
Data warehousing Demo PPTS | Over View | Introduction Kernel Training
 
Data warehouse 101-fundamentals-
Data warehouse 101-fundamentals-Data warehouse 101-fundamentals-
Data warehouse 101-fundamentals-AshishGuleria
 
Enterprise resource planning system & data warehousing implementation
Enterprise resource planning system & data warehousing implementationEnterprise resource planning system & data warehousing implementation
Enterprise resource planning system & data warehousing implementationSumya Abdelrazek
 
Warehousing dimension star-snowflake_schemas
Warehousing dimension star-snowflake_schemasWarehousing dimension star-snowflake_schemas
Warehousing dimension star-snowflake_schemasEric Matthews
 
IOUG93 - Technical Architecture for the Data Warehouse - Presentation
IOUG93 - Technical Architecture for the Data Warehouse - PresentationIOUG93 - Technical Architecture for the Data Warehouse - Presentation
IOUG93 - Technical Architecture for the Data Warehouse - PresentationDavid Walker
 
Data Warehouse Basic Guide
Data Warehouse Basic GuideData Warehouse Basic Guide
Data Warehouse Basic Guidethomasmary607
 
Introduction to data warehousing
Introduction to data warehousingIntroduction to data warehousing
Introduction to data warehousinguncleRhyme
 
DATA Warehousing & Data Mining
DATA Warehousing & Data MiningDATA Warehousing & Data Mining
DATA Warehousing & Data Miningcpjcollege
 
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...Edureka!
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data WarehouseShanthi Mukkavilli
 
Rev_3 Components of a Data Warehouse
Rev_3 Components of a Data WarehouseRev_3 Components of a Data Warehouse
Rev_3 Components of a Data WarehouseRyan Andhavarapu
 
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...Denodo
 
Data warehouse design
Data warehouse designData warehouse design
Data warehouse designines beltaief
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing conceptspcherukumalla
 
Traditional Data-warehousing / BI overview
Traditional Data-warehousing / BI overviewTraditional Data-warehousing / BI overview
Traditional Data-warehousing / BI overviewNagaraj Yerram
 
Bi presentation to bkk
Bi presentation to bkkBi presentation to bkk
Bi presentation to bkkguest4e975e2
 

What's hot (20)

White paper making an-operational_data_store_(ods)_the_center_of_your_data_...
White paper   making an-operational_data_store_(ods)_the_center_of_your_data_...White paper   making an-operational_data_store_(ods)_the_center_of_your_data_...
White paper making an-operational_data_store_(ods)_the_center_of_your_data_...
 
Data warehousing Demo PPTS | Over View | Introduction
Data warehousing Demo PPTS | Over View | Introduction Data warehousing Demo PPTS | Over View | Introduction
Data warehousing Demo PPTS | Over View | Introduction
 
Data warehouse 101-fundamentals-
Data warehouse 101-fundamentals-Data warehouse 101-fundamentals-
Data warehouse 101-fundamentals-
 
Enterprise resource planning system & data warehousing implementation
Enterprise resource planning system & data warehousing implementationEnterprise resource planning system & data warehousing implementation
Enterprise resource planning system & data warehousing implementation
 
Warehousing dimension star-snowflake_schemas
Warehousing dimension star-snowflake_schemasWarehousing dimension star-snowflake_schemas
Warehousing dimension star-snowflake_schemas
 
DW 101
DW 101DW 101
DW 101
 
IOUG93 - Technical Architecture for the Data Warehouse - Presentation
IOUG93 - Technical Architecture for the Data Warehouse - PresentationIOUG93 - Technical Architecture for the Data Warehouse - Presentation
IOUG93 - Technical Architecture for the Data Warehouse - Presentation
 
Data Warehouse Basic Guide
Data Warehouse Basic GuideData Warehouse Basic Guide
Data Warehouse Basic Guide
 
Introduction to data warehousing
Introduction to data warehousingIntroduction to data warehousing
Introduction to data warehousing
 
DATA Warehousing & Data Mining
DATA Warehousing & Data MiningDATA Warehousing & Data Mining
DATA Warehousing & Data Mining
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
Data Warehouse Tutorial For Beginners | Data Warehouse Concepts | Data Wareho...
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
 
Rev_3 Components of a Data Warehouse
Rev_3 Components of a Data WarehouseRev_3 Components of a Data Warehouse
Rev_3 Components of a Data Warehouse
 
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
 
Data warehouse design
Data warehouse designData warehouse design
Data warehouse design
 
Date warehousing concepts
Date warehousing conceptsDate warehousing concepts
Date warehousing concepts
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Traditional Data-warehousing / BI overview
Traditional Data-warehousing / BI overviewTraditional Data-warehousing / BI overview
Traditional Data-warehousing / BI overview
 
Bi presentation to bkk
Bi presentation to bkkBi presentation to bkk
Bi presentation to bkk
 

Viewers also liked

Traditional data warehouse vs data lake
Traditional data warehouse vs data lakeTraditional data warehouse vs data lake
Traditional data warehouse vs data lakeBHASKAR CHAUDHURY
 
Big Data Smart City processes and tools, Real Time data processing tools
Big Data Smart City processes and tools, Real Time data processing toolsBig Data Smart City processes and tools, Real Time data processing tools
Big Data Smart City processes and tools, Real Time data processing toolsPaolo Nesi
 
Smart Cities Datasets and Applications Ecosystem: An Exploratory Analysis
Smart Cities Datasets and Applications Ecosystem: An Exploratory AnalysisSmart Cities Datasets and Applications Ecosystem: An Exploratory Analysis
Smart Cities Datasets and Applications Ecosystem: An Exploratory AnalysisAlberto Abella
 
Realtime streaming architecture in INFINARIO
Realtime streaming architecture in INFINARIORealtime streaming architecture in INFINARIO
Realtime streaming architecture in INFINARIOJozo Kovac
 
Telco Big Data Workshop Sample
Telco Big Data Workshop SampleTelco Big Data Workshop Sample
Telco Big Data Workshop SampleAlan Quayle
 
Sharing & Sustaining Ecosystem Data
Sharing & Sustaining Ecosystem DataSharing & Sustaining Ecosystem Data
Sharing & Sustaining Ecosystem DataTERN Australia
 
Real-time information analysis: social networks and open data
Real-time information analysis: social networks and open dataReal-time information analysis: social networks and open data
Real-time information analysis: social networks and open dataData Science Society
 
Banking & Smart City Ecosystem
Banking & Smart City EcosystemBanking & Smart City Ecosystem
Banking & Smart City EcosystemArki Rifazka
 
Oracle BI publisher intro
Oracle BI publisher introOracle BI publisher intro
Oracle BI publisher introAdil Arshad
 
Big Data for Smart City
Big Data for Smart CityBig Data for Smart City
Big Data for Smart CityKoltiva
 
Km4city Smart City Ecosystem Urban Platform
Km4city Smart City Ecosystem Urban PlatformKm4city Smart City Ecosystem Urban Platform
Km4city Smart City Ecosystem Urban PlatformPaolo Nesi
 
Smart Cities and the Value of Ecosystem Services
Smart Cities and the Value of Ecosystem ServicesSmart Cities and the Value of Ecosystem Services
Smart Cities and the Value of Ecosystem ServicesSylvain Remy
 
Big data real time architectures
Big data real time architecturesBig data real time architectures
Big data real time architecturesDaniel Marcous
 
Oracle XML Publisher / BI Publisher
Oracle XML Publisher / BI PublisherOracle XML Publisher / BI Publisher
Oracle XML Publisher / BI PublisherEdi Yanto
 
Database vs Data Warehouse: A Comparative Review
Database vs Data Warehouse: A Comparative ReviewDatabase vs Data Warehouse: A Comparative Review
Database vs Data Warehouse: A Comparative ReviewHealth Catalyst
 
lecture 1 information systems and business strategy
lecture 1  information systems and business strategylecture 1  information systems and business strategy
lecture 1 information systems and business strategyNorazila Mat
 
Clinical Data Repository vs. A Data Warehouse - Which Do You Need?
Clinical Data Repository vs. A Data Warehouse - Which Do You Need?Clinical Data Repository vs. A Data Warehouse - Which Do You Need?
Clinical Data Repository vs. A Data Warehouse - Which Do You Need?Health Catalyst
 
An Introduction into the design of business using business architecture
An Introduction into the design of business using business architectureAn Introduction into the design of business using business architecture
An Introduction into the design of business using business architectureCraig Martin
 

Viewers also liked (20)

Traditional data warehouse vs data lake
Traditional data warehouse vs data lakeTraditional data warehouse vs data lake
Traditional data warehouse vs data lake
 
Big Data Smart City processes and tools, Real Time data processing tools
Big Data Smart City processes and tools, Real Time data processing toolsBig Data Smart City processes and tools, Real Time data processing tools
Big Data Smart City processes and tools, Real Time data processing tools
 
Smart Cities Datasets and Applications Ecosystem: An Exploratory Analysis
Smart Cities Datasets and Applications Ecosystem: An Exploratory AnalysisSmart Cities Datasets and Applications Ecosystem: An Exploratory Analysis
Smart Cities Datasets and Applications Ecosystem: An Exploratory Analysis
 
Realtime streaming architecture in INFINARIO
Realtime streaming architecture in INFINARIORealtime streaming architecture in INFINARIO
Realtime streaming architecture in INFINARIO
 
Telco Big Data Workshop Sample
Telco Big Data Workshop SampleTelco Big Data Workshop Sample
Telco Big Data Workshop Sample
 
Sharing & Sustaining Ecosystem Data
Sharing & Sustaining Ecosystem DataSharing & Sustaining Ecosystem Data
Sharing & Sustaining Ecosystem Data
 
Real-time information analysis: social networks and open data
Real-time information analysis: social networks and open dataReal-time information analysis: social networks and open data
Real-time information analysis: social networks and open data
 
Banking & Smart City Ecosystem
Banking & Smart City EcosystemBanking & Smart City Ecosystem
Banking & Smart City Ecosystem
 
Oracle BI publisher intro
Oracle BI publisher introOracle BI publisher intro
Oracle BI publisher intro
 
Big Data for Smart City
Big Data for Smart CityBig Data for Smart City
Big Data for Smart City
 
Km4city Smart City Ecosystem Urban Platform
Km4city Smart City Ecosystem Urban PlatformKm4city Smart City Ecosystem Urban Platform
Km4city Smart City Ecosystem Urban Platform
 
Smart Cities and the Value of Ecosystem Services
Smart Cities and the Value of Ecosystem ServicesSmart Cities and the Value of Ecosystem Services
Smart Cities and the Value of Ecosystem Services
 
Big data real time architectures
Big data real time architecturesBig data real time architectures
Big data real time architectures
 
Oracle XML Publisher / BI Publisher
Oracle XML Publisher / BI PublisherOracle XML Publisher / BI Publisher
Oracle XML Publisher / BI Publisher
 
Smart City Framework
Smart City FrameworkSmart City Framework
Smart City Framework
 
Database vs Data Warehouse: A Comparative Review
Database vs Data Warehouse: A Comparative ReviewDatabase vs Data Warehouse: A Comparative Review
Database vs Data Warehouse: A Comparative Review
 
lecture 1 information systems and business strategy
lecture 1  information systems and business strategylecture 1  information systems and business strategy
lecture 1 information systems and business strategy
 
Introduction to ETL and Data Integration
Introduction to ETL and Data IntegrationIntroduction to ETL and Data Integration
Introduction to ETL and Data Integration
 
Clinical Data Repository vs. A Data Warehouse - Which Do You Need?
Clinical Data Repository vs. A Data Warehouse - Which Do You Need?Clinical Data Repository vs. A Data Warehouse - Which Do You Need?
Clinical Data Repository vs. A Data Warehouse - Which Do You Need?
 
An Introduction into the design of business using business architecture
An Introduction into the design of business using business architectureAn Introduction into the design of business using business architecture
An Introduction into the design of business using business architecture
 

Similar to How Real TIme Data Changes the Data Warehouse

Intel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick KnupfferIntel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick KnupfferIntelAPAC
 
Building a business intelligence architecture fit for the 21st century by Jon...
Building a business intelligence architecture fit for the 21st century by Jon...Building a business intelligence architecture fit for the 21st century by Jon...
Building a business intelligence architecture fit for the 21st century by Jon...Mark Tapley
 
White Paper Data Quality Process Design For Ad Hoc Reporting
White Paper   Data Quality Process Design For Ad Hoc ReportingWhite Paper   Data Quality Process Design For Ad Hoc Reporting
White Paper Data Quality Process Design For Ad Hoc Reportingmacrochaotic
 
Big data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You WantBig data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You WantStuart Miniman
 
Customer summit - big data (final)
Customer summit  - big data (final)Customer summit  - big data (final)
Customer summit - big data (final)Anand Deshpande
 
Intel Cloud Summit: Big Data
Intel Cloud Summit: Big DataIntel Cloud Summit: Big Data
Intel Cloud Summit: Big DataIntelAPAC
 
IBM Smarter Business 2012 - PureSystems - PureData
IBM Smarter Business 2012 - PureSystems - PureDataIBM Smarter Business 2012 - PureSystems - PureData
IBM Smarter Business 2012 - PureSystems - PureDataIBM Sverige
 
Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Empowered Holdings, LLC
 
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)Denodo
 
Outboard Feel Good NLS
Outboard Feel Good NLSOutboard Feel Good NLS
Outboard Feel Good NLSDave Fox
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data SolutionsMark Kromer
 
Big Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureBig Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureOdinot Stanislas
 
Introduction to Modern Data Virtualization 2021 (APAC)
Introduction to Modern Data Virtualization 2021 (APAC)Introduction to Modern Data Virtualization 2021 (APAC)
Introduction to Modern Data Virtualization 2021 (APAC)Denodo
 
Bridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItBridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItDenodo
 
OSC2012: Big Data Using Open Source: Netapp Project - Technical
OSC2012: Big Data Using Open Source: Netapp Project - TechnicalOSC2012: Big Data Using Open Source: Netapp Project - Technical
OSC2012: Big Data Using Open Source: Netapp Project - TechnicalAccenture the Netherlands
 
Big data appliances for BI on Cloud
Big data appliances for BI on CloudBig data appliances for BI on Cloud
Big data appliances for BI on Cloudtdwiindia
 
Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Cana Ko
 
Analyze This! Best Practices For Big And Fast Data
Analyze This! Best Practices For Big And Fast DataAnalyze This! Best Practices For Big And Fast Data
Analyze This! Best Practices For Big And Fast DataEMC
 

Similar to How Real TIme Data Changes the Data Warehouse (20)

Intel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick KnupfferIntel Cloud summit: Big Data by Nick Knupffer
Intel Cloud summit: Big Data by Nick Knupffer
 
Building a business intelligence architecture fit for the 21st century by Jon...
Building a business intelligence architecture fit for the 21st century by Jon...Building a business intelligence architecture fit for the 21st century by Jon...
Building a business intelligence architecture fit for the 21st century by Jon...
 
White Paper Data Quality Process Design For Ad Hoc Reporting
White Paper   Data Quality Process Design For Ad Hoc ReportingWhite Paper   Data Quality Process Design For Ad Hoc Reporting
White Paper Data Quality Process Design For Ad Hoc Reporting
 
Big data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You WantBig data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You Want
 
Customer summit - big data (final)
Customer summit  - big data (final)Customer summit  - big data (final)
Customer summit - big data (final)
 
Intel Cloud Summit: Big Data
Intel Cloud Summit: Big DataIntel Cloud Summit: Big Data
Intel Cloud Summit: Big Data
 
IBM Smarter Business 2012 - PureSystems - PureData
IBM Smarter Business 2012 - PureSystems - PureDataIBM Smarter Business 2012 - PureSystems - PureData
IBM Smarter Business 2012 - PureSystems - PureData
 
Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012
 
The New Enterprise Data Platform
The New Enterprise Data PlatformThe New Enterprise Data Platform
The New Enterprise Data Platform
 
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
 
Outboard Feel Good NLS
Outboard Feel Good NLSOutboard Feel Good NLS
Outboard Feel Good NLS
 
Anexinet Big Data Solutions
Anexinet Big Data SolutionsAnexinet Big Data Solutions
Anexinet Big Data Solutions
 
Big Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureBig Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the Future
 
Introduction to Modern Data Virtualization 2021 (APAC)
Introduction to Modern Data Virtualization 2021 (APAC)Introduction to Modern Data Virtualization 2021 (APAC)
Introduction to Modern Data Virtualization 2021 (APAC)
 
Bridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItBridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need It
 
OSC2012: Big Data Using Open Source: Netapp Project - Technical
OSC2012: Big Data Using Open Source: Netapp Project - TechnicalOSC2012: Big Data Using Open Source: Netapp Project - Technical
OSC2012: Big Data Using Open Source: Netapp Project - Technical
 
Big data appliances for BI on Cloud
Big data appliances for BI on CloudBig data appliances for BI on Cloud
Big data appliances for BI on Cloud
 
Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831Talk IT_ Oracle_김태완_110831
Talk IT_ Oracle_김태완_110831
 
Barak regev
Barak regevBarak regev
Barak regev
 
Analyze This! Best Practices For Big And Fast Data
Analyze This! Best Practices For Big And Fast DataAnalyze This! Best Practices For Big And Fast Data
Analyze This! Best Practices For Big And Fast Data
 

More from mark madsen

Data Architecture: OMG It’s Made of People
Data Architecture: OMG It’s Made of PeopleData Architecture: OMG It’s Made of People
Data Architecture: OMG It’s Made of Peoplemark madsen
 
Solve User Problems: Data Architecture for Humans
Solve User Problems: Data Architecture for HumansSolve User Problems: Data Architecture for Humans
Solve User Problems: Data Architecture for Humansmark madsen
 
The Black Box: Interpretability, Reproducibility, and Data Management
The Black Box: Interpretability, Reproducibility, and Data ManagementThe Black Box: Interpretability, Reproducibility, and Data Management
The Black Box: Interpretability, Reproducibility, and Data Managementmark madsen
 
Operationalizing Machine Learning in the Enterprise
Operationalizing Machine Learning in the EnterpriseOperationalizing Machine Learning in the Enterprise
Operationalizing Machine Learning in the Enterprisemark madsen
 
Building a Data Platform Strata SF 2019
Building a Data Platform Strata SF 2019Building a Data Platform Strata SF 2019
Building a Data Platform Strata SF 2019mark madsen
 
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)mark madsen
 
Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018mark madsen
 
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Range
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou RangeA Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Range
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Rangemark madsen
 
How to understand trends in the data & software market
How to understand trends in the data & software marketHow to understand trends in the data & software market
How to understand trends in the data & software marketmark madsen
 
Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...mark madsen
 
Assumptions about Data and Analysis: Briefing room webcast slides
Assumptions about Data and Analysis: Briefing room webcast slidesAssumptions about Data and Analysis: Briefing room webcast slides
Assumptions about Data and Analysis: Briefing room webcast slidesmark madsen
 
Everything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data WarehouseEverything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data Warehousemark madsen
 
A Pragmatic Approach to Analyzing Customers
A Pragmatic Approach to Analyzing CustomersA Pragmatic Approach to Analyzing Customers
A Pragmatic Approach to Analyzing Customersmark madsen
 
Disruptive Innovation: how do you use these theories to manage your IT?
Disruptive Innovation: how do you use these theories to manage your IT?Disruptive Innovation: how do you use these theories to manage your IT?
Disruptive Innovation: how do you use these theories to manage your IT?mark madsen
 
Briefing room: An alternative for streaming data collection
Briefing room: An alternative for streaming data collectionBriefing room: An alternative for streaming data collection
Briefing room: An alternative for streaming data collectionmark madsen
 
Building the Enterprise Data Lake: A look at architecture
Building the Enterprise Data Lake: A look at architectureBuilding the Enterprise Data Lake: A look at architecture
Building the Enterprise Data Lake: A look at architecturemark madsen
 
Briefing Room analyst comments - streaming analytics
Briefing Room analyst comments - streaming analyticsBriefing Room analyst comments - streaming analytics
Briefing Room analyst comments - streaming analyticsmark madsen
 
Everything has changed except us
Everything has changed except usEverything has changed except us
Everything has changed except usmark madsen
 
Bi isn't big data and big data isn't BI (updated)
Bi isn't big data and big data isn't BI (updated)Bi isn't big data and big data isn't BI (updated)
Bi isn't big data and big data isn't BI (updated)mark madsen
 
On the edge: analytics for the modern enterprise (analyst comments)
On the edge: analytics for the modern enterprise (analyst comments)On the edge: analytics for the modern enterprise (analyst comments)
On the edge: analytics for the modern enterprise (analyst comments)mark madsen
 

More from mark madsen (20)

Data Architecture: OMG It’s Made of People
Data Architecture: OMG It’s Made of PeopleData Architecture: OMG It’s Made of People
Data Architecture: OMG It’s Made of People
 
Solve User Problems: Data Architecture for Humans
Solve User Problems: Data Architecture for HumansSolve User Problems: Data Architecture for Humans
Solve User Problems: Data Architecture for Humans
 
The Black Box: Interpretability, Reproducibility, and Data Management
The Black Box: Interpretability, Reproducibility, and Data ManagementThe Black Box: Interpretability, Reproducibility, and Data Management
The Black Box: Interpretability, Reproducibility, and Data Management
 
Operationalizing Machine Learning in the Enterprise
Operationalizing Machine Learning in the EnterpriseOperationalizing Machine Learning in the Enterprise
Operationalizing Machine Learning in the Enterprise
 
Building a Data Platform Strata SF 2019
Building a Data Platform Strata SF 2019Building a Data Platform Strata SF 2019
Building a Data Platform Strata SF 2019
 
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
 
Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018
 
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Range
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou RangeA Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Range
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Range
 
How to understand trends in the data & software market
How to understand trends in the data & software marketHow to understand trends in the data & software market
How to understand trends in the data & software market
 
Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...
 
Assumptions about Data and Analysis: Briefing room webcast slides
Assumptions about Data and Analysis: Briefing room webcast slidesAssumptions about Data and Analysis: Briefing room webcast slides
Assumptions about Data and Analysis: Briefing room webcast slides
 
Everything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data WarehouseEverything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data Warehouse
 
A Pragmatic Approach to Analyzing Customers
A Pragmatic Approach to Analyzing CustomersA Pragmatic Approach to Analyzing Customers
A Pragmatic Approach to Analyzing Customers
 
Disruptive Innovation: how do you use these theories to manage your IT?
Disruptive Innovation: how do you use these theories to manage your IT?Disruptive Innovation: how do you use these theories to manage your IT?
Disruptive Innovation: how do you use these theories to manage your IT?
 
Briefing room: An alternative for streaming data collection
Briefing room: An alternative for streaming data collectionBriefing room: An alternative for streaming data collection
Briefing room: An alternative for streaming data collection
 
Building the Enterprise Data Lake: A look at architecture
Building the Enterprise Data Lake: A look at architectureBuilding the Enterprise Data Lake: A look at architecture
Building the Enterprise Data Lake: A look at architecture
 
Briefing Room analyst comments - streaming analytics
Briefing Room analyst comments - streaming analyticsBriefing Room analyst comments - streaming analytics
Briefing Room analyst comments - streaming analytics
 
Everything has changed except us
Everything has changed except usEverything has changed except us
Everything has changed except us
 
Bi isn't big data and big data isn't BI (updated)
Bi isn't big data and big data isn't BI (updated)Bi isn't big data and big data isn't BI (updated)
Bi isn't big data and big data isn't BI (updated)
 
On the edge: analytics for the modern enterprise (analyst comments)
On the edge: analytics for the modern enterprise (analyst comments)On the edge: analytics for the modern enterprise (analyst comments)
On the edge: analytics for the modern enterprise (analyst comments)
 

Recently uploaded

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 

Recently uploaded (20)

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 

How Real TIme Data Changes the Data Warehouse

  • 1. How Real Time Data Requirements Change the Data Warehouse Environment Mark Madsen – September 17, 2008 www.ThirdNature.net Attribution-NonCommercial-No Derivative http://creativecommons.org/licenses/by-nc-nd/3.0/us/
  • 2. Outline What’s real-time about? Impacts on the data warehouse architecture Delivering data to users Extracting the data Storing the data Operations Getting started Third Nature, January 2008 Mark Madsen Slide 2
  • 3. Speeding Up the Data Warehouse Why? Faster reaction time Reduced decision time New process capabilities Third Nature, January 2008 Mark Madsen Slide 3
  • 4. Which Decisions Benefit? Strategic Operational Decision time flexible, long cycle constrained, short cycle Decision scope broad, organizational narrow, departmental or process Decision model Complex Simple Data latency High, history is core Low, recent data is to decisions core to decisions Data scope Many sources, many Few sources, types, aggregated structured, detailed Most real time needs will be driven by operational decision making, not strategic decisions. Third Nature, January 2008 Mark Madsen Slide 4
  • 5. Strategy, Decisions and Data Latency Goal Increase share of low to mid market customers Strategy Reduce cost of products sold Improve promotional performance Tactics Efficient sourcing Decrease Out of Stocks Consolidate suppliers Improve delivery compliance Catch out of stocks before they occur BI Needs Reports & Dashboards, alerts Real time alerts & spreadsheets & scorecards embedded analytics Third Nature, January 2008 Mark Madsen Slide 5
  • 6. What People Are Doing Today Monthly W eekly Daily Multiple times per day On demand 2002 32% 34% 69% 15% 6% 2004 27% 29% 65% 30% 19% 2006 3 24 44 29 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% Sources: TDWI, Gartner At the same time, data volumes are rising for most data warehouses at 50% to 100% per year. Third Nature, January 2008 Mark Madsen Slide 6
  • 7. BI Efforts Involving Real Time Data Access Terms you may hear from the BI market that imply real time: Operational BI Embedded analytics Decision automation Complex event processing Event-driven BI Process-driven BI They are all similar in requiring some level of low latency data access. Third Nature, January 2008 Mark Madsen Slide 7
  • 8. Impacts on the DW Architecture Databases Dashboards OLAP Productivity BAM/BPM Reporting Analytics Applications Data Consumers Delivery DW Platforms Adding current data to the system Warehouse Mart requires effort at all Database three layers Content ODS Store ETL EDR EII Databases Documents Flat Files XML Queues ERP Applications Source Environments Third Nature, January 2008 Mark Madsen Slide 8
  • 9. One Architecture or Two? In-line with process: RT BI • Real time data flows separately from the warehouse data • May include a low-latency data store in the real time environment Process • This model be needed for extremely low latency data BI • More applicable for event-driven Batch DW Out of band: • Data to the consumer first flows Process through the DW • Unified architecture for both low and high latency data BI & RT BI • More applicable for on-demand DW Third Nature, January 2008 Mark Madsen Slide 9
  • 10. User Interface: Two BI Usage Models Demand driven • Users ask for current data • Most BI tools work this way • Harder to adapt these tools to event-driven models Event driven • System takes action based on data, e.g. alerts, rule engines • May not have (or need) an end user interface • Need understanding of decision & action process for this model Third Nature, January 2008 Mark Madsen Slide 10
  • 11. BI Tools Need New Capabilities Embedding BI within applications • UI embedding • Full embedding Event-based integration Feeding BI data to applications: services, not SQL, may be desired Custom UI code may be preferable to a BI tool Third Nature, January 2008 Mark Madsen Slide 11
  • 12. The Data Integration Layer • Integration is the most complex element of adding real time data. • Inline vs. out of band, demand vs. event-driven BI usage create different DI requirements. • You may not have exactly the same metrics, attributes or data extract logic. • Don’t count on replacing the ETL batch; more likely you are augmenting it. • You probably need to add new DI technologies to your portfolio. • Batch performance design isn’t like real time design. Third Nature, January 2008 Mark Madsen Slide 12
  • 13. Speeding Up Data Integration Methods Single batch Frequent batch Mini-batch Continuous load Streaming Hourly+ Immediate Third Nature, January 2008 Mark Madsen Slide 13
  • 14. The Platform Layer: Data and Database • Schemas will need changes. • You don’t need to convert the entire database to a real time schema. • One schema or two? • Event-driven BI creates different query patterns and workloads. • Configuration and tuning may be different than what you are used to with traditional BI. • Application developers want services or ORMs, not SQL. Third Nature, January 2008 Mark Madsen Slide 14
  • 15. Different Platform Workloads Databases Dashboards OLAP Productivity BAM/BPM Reporting Analytics Applications Data Consumers Delivery DW Platforms Three workloads: Data loading + Warehouse Mart Normal BI + Database Real time BI Content = complications ODS Store ETL EDR EII Databases Documents Flat Files XML Queues ERP Applications Third Nature, January 2008 Source Environments Mark Madsen Slide 15
  • 16. Development, Maintenance & Operations • Real time decisions on real time data mean data quality plays a larger role, and it’s harder to address. • Warehouse availability becomes much more important to the business, and it isn’t just the database – it’s everything. • Performance and meeting strict BI SLAs will rise in importance since you are now tied in to business operations. Third Nature, January 2008 Mark Madsen Slide 16
  • 17. A Prescription for Getting Started 1. Star with a decision process 2. Define data needs for the process 3. Ensure that data is available at the right latency 4. Determine appropriate data integration technologies. 5. Design and initiate upstream work 6. Build Third Nature, January 2008 Mark Madsen Slide 17
  • 18. Thanks Third Nature, January 2008 Mark Madsen Slide 18
  • 19. CC Image Attributions Thanks to the people who supplied the creative commons licensed images used in this presentation: • Divers - http://flickr.com/photos/raveller/ • Fast dog - http://flickr.com/photos/marinacvinhal/379111290/ • Febo - http://flickr.com/photos/igor/419425754/ • Subway - http://flickr.com/photos/neilsphotoalbum/504517855/ • Cadillac ranch - http://flickr.com/photos/whatknot/179655095/ Third Nature, January 2008 Mark Madsen Slide 19
  • 20. About the Presenter Mark Madsen is president of Third Nature, a technology research and consulting firm focused on business intelligence, data integration and data management. Mark is an award-winning author, architect and CTO whose work has been featured in numerous industry publications. Over the past ten years Mark received awards for his work from the American Productivity & Quality Center, TDWI, and the Smithsonian Institute. He is an international speaker, a contributing editor at Intelligent Enterprise, and manages the open source channel at the Business Intelligence Network. For more information or to contact Mark, visit http://ThirdNature.net. Page 20