SlideShare a Scribd company logo
1 of 12
Download to read offline
“BIG DATA” Appliances



                               R Sathyanarayana


                                 TDWI Bangalore
                                   5 Feb 2010

           Enterprise Information Management & Analytics practice, EMC Consulting




2/9/2011                                                                            1
Starter Kit : Why Analytics on the Cloud?


 Internet scale analytics capability
     Analyze enormous volumes of data with cost efficiency
      and response time unimagined a few years ago
     Integrate cross platform data to derive meaningful
      insights : Ingest not gigas but teras in a day
 Transform fundamentally what is currently possible
     Can we promote offerings/products to individuals?
     Can we respond quickly to emergencies, frauds?
 Problem statement
     What capabilities are required and how enterprises
      would use such capability?



 2/9/2011                                                  2
Picture This….




                                         Rich
                                     Visualization
                                      + Analytics


             DATA WAREHOUSE
                 (ENTERPRISE/CROSS
                    ENTERPRISE)




2/9/2011                                             3
“Big Data”
   Big data are datasets that grow so large that they become awkward to work
    with using on-hand database management tools. Difficulties include capture,
    storage, search, sharing, analytics, and visualizing. This trend continues
    because of the benefits of working with larger and larger datasets allowing
    analysts to "spot business trends, prevent diseases, combat crime.“

   One current feature of Big data is the difficulty working with it using relational
    databases and desktop statistics/visualization packages, requiring instead
    "massively parallel software running on tens, hundreds, or even thousands of
    servers."

   Big data sizes are a constantly moving target currently ranging from a few
    dozen terabytes to many petabytes of data in a single data set.

   Sample This : web logs, RFID, sensor networks, social networks, Internet text
    and documents, Internet search indexing, call detail records, genomics,
    astronomy, biological research, military surveillance, medical records,
    photography archives, video archives, and large scale eCommerce


2/9/2011                                                                            4
Data Warehouse Appliances : Setting
the context

 As IT organizations build up massive numbers of
  databases to deal with the explosion of data, the
  ability to make real-time decisions on new questions
  (BI) that involve enormous amounts of information
  (DW) will need to be a core competency for many
  organizations.
  Due to this shift, DW/BI customers need a solution
  that can provide extreme predictable performance,
  scale-out architecture for ‘Big Data’ analytics and an
  enterprise-proven feature set all at the lowest TCO.




 2/9/2011                                             5
UNDER WORKS: Definition of
DataWarehouse Appliances
 Original Definition : Hardware + Software – built
  and supported by a single vendor

 Partial Technology Stack : May or may not bundle
  with other vendors’ hardware and / or operating
  system




2/9/2011                                              6
Architectural considerations are evolving
CONFIDENTIAL - REMOVED




2/9/2011                                7
Trends: Consolidation in the industry : Small
 Vendors Vs Infrastructure Providers


 Focus is shifting in multiple areas:
  1.From whole technology stack to pieces of it
    2.From hardware to software
    3.From proprietary to commodity hardware

    4.From new vendors to infrastructure providers
    5.From single to mixed workloads
    6.From data marts to enterprise data warehouses

• These trends affect the content & capabilities of DWAs, where to get
    them, how to define them, how to use them.
 2/9/2011                                                            8
Major differences between the DW
appliances

   Column vs. Row Storage
   Polymorphic Storage (Both Column and Row)
   Proprietary and Commodity Hardware
   In-Memory Processing
   Relationship with Existing Architecture
   Shared Nothing Architecture




2/9/2011                                        9
The Challenges in Today’s Data
Warehousing Environments

 Sources of data and the amount of data to analyze is
  growing exponentially
 Stale data exists because DW solutions cannot ingest
  the vast amounts of data fast enough
 Lack of performance for advanced analytics and
  complex queries
 The number of users and the concurrency of users is
  increasing rapidly




 2/9/2011                                           10
Considerations of DW Solution for the
  Big Data
These are the characteristics you want in your DW
  solution:

 Easily scales to analyze the growing amounts of
  data
 Rapidly ingests large amounts of data from
  sources
 Provides high performance in database analytics
 Supports high user concurrency securely, reliably
 Handle multiple workloads
  2/9/2011                                          11
THANK YOU
 Sathyanarayana.Ranganatha@EMC.com




2/9/2011                              12

More Related Content

What's hot

Data Services and the Modern Data Ecosystem (Middle East)
Data Services and the Modern Data Ecosystem (Middle East)Data Services and the Modern Data Ecosystem (Middle East)
Data Services and the Modern Data Ecosystem (Middle East)
Denodo
 

What's hot (20)

Role of Unified AI and ML in Cloud Technologies. Which Cloud Service Provider...
Role of Unified AI and ML in Cloud Technologies. Which Cloud Service Provider...Role of Unified AI and ML in Cloud Technologies. Which Cloud Service Provider...
Role of Unified AI and ML in Cloud Technologies. Which Cloud Service Provider...
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Introduction to Modern Data Virtualization (US)
Introduction to Modern Data Virtualization (US)Introduction to Modern Data Virtualization (US)
Introduction to Modern Data Virtualization (US)
 
Building a Consistent Hybrid Cloud Semantic Model In Denodo
Building a Consistent Hybrid Cloud Semantic Model In DenodoBuilding a Consistent Hybrid Cloud Semantic Model In Denodo
Building a Consistent Hybrid Cloud Semantic Model In Denodo
 
Solution Centric Architectural Presentation - Implementing a Logical Data War...
Solution Centric Architectural Presentation - Implementing a Logical Data War...Solution Centric Architectural Presentation - Implementing a Logical Data War...
Solution Centric Architectural Presentation - Implementing a Logical Data War...
 
The Virtualization of Clouds - The New Enterprise Data Architecture Opportunity
The Virtualization of Clouds - The New Enterprise Data Architecture OpportunityThe Virtualization of Clouds - The New Enterprise Data Architecture Opportunity
The Virtualization of Clouds - The New Enterprise Data Architecture Opportunity
 
Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...
Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...
Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...
 
Logical Data Fabric: Architectural Components
Logical Data Fabric: Architectural ComponentsLogical Data Fabric: Architectural Components
Logical Data Fabric: Architectural Components
 
Creating a Healthcare Data Fabric, and Providing a Single, Unified, and Curat...
Creating a Healthcare Data Fabric, and Providing a Single, Unified, and Curat...Creating a Healthcare Data Fabric, and Providing a Single, Unified, and Curat...
Creating a Healthcare Data Fabric, and Providing a Single, Unified, and Curat...
 
Self Service Analytics enabled by Data Virtualization from Denodo
Self Service Analytics enabled by Data Virtualization from DenodoSelf Service Analytics enabled by Data Virtualization from Denodo
Self Service Analytics enabled by Data Virtualization from Denodo
 
Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)
 
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)Logical Data Fabric: Maturing Implementation from Small to Big (APAC)
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)
 
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
 
Logical Data Fabric: An Introduction
Logical Data Fabric: An IntroductionLogical Data Fabric: An Introduction
Logical Data Fabric: An Introduction
 
Data Services and the Modern Data Ecosystem (Middle East)
Data Services and the Modern Data Ecosystem (Middle East)Data Services and the Modern Data Ecosystem (Middle East)
Data Services and the Modern Data Ecosystem (Middle East)
 
Cisco Data Center Fabric
Cisco Data Center FabricCisco Data Center Fabric
Cisco Data Center Fabric
 
Introduction to Modern Data Virtualization 2021 (APAC)
Introduction to Modern Data Virtualization 2021 (APAC)Introduction to Modern Data Virtualization 2021 (APAC)
Introduction to Modern Data Virtualization 2021 (APAC)
 
Cloud Migration headache? Ease the pain with Data Virtualization! (EMEA)
Cloud Migration headache? Ease the pain with Data Virtualization! (EMEA)Cloud Migration headache? Ease the pain with Data Virtualization! (EMEA)
Cloud Migration headache? Ease the pain with Data Virtualization! (EMEA)
 
Data Democratization for Faster Decision-making and Business Agility (ASEAN)
Data Democratization for Faster Decision-making and Business Agility (ASEAN)Data Democratization for Faster Decision-making and Business Agility (ASEAN)
Data Democratization for Faster Decision-making and Business Agility (ASEAN)
 
Denodo Design Studio: Modeling and Creation of Data Services
Denodo Design Studio: Modeling and Creation of Data ServicesDenodo Design Studio: Modeling and Creation of Data Services
Denodo Design Studio: Modeling and Creation of Data Services
 

Viewers also liked

The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
Cloudera, Inc.
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
Srinath Perera
 

Viewers also liked (6)

Concur to Sage Integration
Concur to Sage IntegrationConcur to Sage Integration
Concur to Sage Integration
 
Cloudera Director: Unlock the Full Potential of Hadoop in the Cloud
Cloudera Director: Unlock the Full Potential of Hadoop in the CloudCloudera Director: Unlock the Full Potential of Hadoop in the Cloud
Cloudera Director: Unlock the Full Potential of Hadoop in the Cloud
 
Rethink Analytics with an Enterprise Data Hub
Rethink Analytics with an Enterprise Data HubRethink Analytics with an Enterprise Data Hub
Rethink Analytics with an Enterprise Data Hub
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
 
Enterprise Data Hub: The Next Big Thing in Big Data
Enterprise Data Hub: The Next Big Thing in Big DataEnterprise Data Hub: The Next Big Thing in Big Data
Enterprise Data Hub: The Next Big Thing in Big Data
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
 

Similar to Big data appliances for BI on Cloud

big data Big Things
big data Big Thingsbig data Big Things
big data Big Things
pateelhs
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Nathan Bijnens
 

Similar to Big data appliances for BI on Cloud (20)

Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
1
11
1
 
Big Data World Forum
Big Data World ForumBig Data World Forum
Big Data World Forum
 
Ab cs of big data
Ab cs of big dataAb cs of big data
Ab cs of big data
 
big data Big Things
big data Big Thingsbig data Big Things
big data Big Things
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)
 
Bridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItBridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need It
 
Big data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You WantBig data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You Want
 
The ABCs of Big Data
The ABCs of Big DataThe ABCs of Big Data
The ABCs of Big Data
 
Big data - what, why, where, when and how
Big data - what, why, where, when and howBig data - what, why, where, when and how
Big data - what, why, where, when and how
 
Big data data lake and beyond
Big data data lake and beyond Big data data lake and beyond
Big data data lake and beyond
 
Modern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationModern Data Management for Federal Modernization
Modern Data Management for Federal Modernization
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)
 
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User GroupBig Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
 
Cloud Migration Strategies that Ensure Greater Value for the Business
Cloud Migration Strategies that Ensure Greater Value for the BusinessCloud Migration Strategies that Ensure Greater Value for the Business
Cloud Migration Strategies that Ensure Greater Value for the Business
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data Management
 
An Overview of BigData
An Overview of BigDataAn Overview of BigData
An Overview of BigData
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
 
Delivering Analytics at The Speed of Transactions with Data Fabric
Delivering Analytics at The Speed of Transactions with Data FabricDelivering Analytics at The Speed of Transactions with Data Fabric
Delivering Analytics at The Speed of Transactions with Data Fabric
 
big_data.ppt
big_data.pptbig_data.ppt
big_data.ppt
 

More from tdwiindia (9)

TDWI Speaker profiles
TDWI Speaker profilesTDWI Speaker profiles
TDWI Speaker profiles
 
TDWI Inda BI on Cloud Future State Vision
TDWI Inda BI on Cloud Future State VisionTDWI Inda BI on Cloud Future State Vision
TDWI Inda BI on Cloud Future State Vision
 
Tdwi event summary
Tdwi event summaryTdwi event summary
Tdwi event summary
 
Intel IT Cloud Strategy
Intel IT Cloud StrategyIntel IT Cloud Strategy
Intel IT Cloud Strategy
 
BI on Cloud - Perspective from SAP
BI on Cloud - Perspective from SAPBI on Cloud - Perspective from SAP
BI on Cloud - Perspective from SAP
 
Data Warehousing Infrastructure on Cloud
Data Warehousing Infrastructure on CloudData Warehousing Infrastructure on Cloud
Data Warehousing Infrastructure on Cloud
 
BI on Cloud Computing
BI on Cloud ComputingBI on Cloud Computing
BI on Cloud Computing
 
What is BI on Cloud
What is BI on CloudWhat is BI on Cloud
What is BI on Cloud
 
Business Intelligence on Cloud: A Business Perspective
Business Intelligence on Cloud: A Business PerspectiveBusiness Intelligence on Cloud: A Business Perspective
Business Intelligence on Cloud: A Business Perspective
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Recently uploaded (20)

A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 

Big data appliances for BI on Cloud

  • 1. “BIG DATA” Appliances R Sathyanarayana TDWI Bangalore 5 Feb 2010 Enterprise Information Management & Analytics practice, EMC Consulting 2/9/2011 1
  • 2. Starter Kit : Why Analytics on the Cloud?  Internet scale analytics capability  Analyze enormous volumes of data with cost efficiency and response time unimagined a few years ago  Integrate cross platform data to derive meaningful insights : Ingest not gigas but teras in a day  Transform fundamentally what is currently possible  Can we promote offerings/products to individuals?  Can we respond quickly to emergencies, frauds?  Problem statement  What capabilities are required and how enterprises would use such capability? 2/9/2011 2
  • 3. Picture This…. Rich Visualization + Analytics DATA WAREHOUSE (ENTERPRISE/CROSS ENTERPRISE) 2/9/2011 3
  • 4. “Big Data”  Big data are datasets that grow so large that they become awkward to work with using on-hand database management tools. Difficulties include capture, storage, search, sharing, analytics, and visualizing. This trend continues because of the benefits of working with larger and larger datasets allowing analysts to "spot business trends, prevent diseases, combat crime.“  One current feature of Big data is the difficulty working with it using relational databases and desktop statistics/visualization packages, requiring instead "massively parallel software running on tens, hundreds, or even thousands of servers."  Big data sizes are a constantly moving target currently ranging from a few dozen terabytes to many petabytes of data in a single data set.  Sample This : web logs, RFID, sensor networks, social networks, Internet text and documents, Internet search indexing, call detail records, genomics, astronomy, biological research, military surveillance, medical records, photography archives, video archives, and large scale eCommerce 2/9/2011 4
  • 5. Data Warehouse Appliances : Setting the context  As IT organizations build up massive numbers of databases to deal with the explosion of data, the ability to make real-time decisions on new questions (BI) that involve enormous amounts of information (DW) will need to be a core competency for many organizations. Due to this shift, DW/BI customers need a solution that can provide extreme predictable performance, scale-out architecture for ‘Big Data’ analytics and an enterprise-proven feature set all at the lowest TCO. 2/9/2011 5
  • 6. UNDER WORKS: Definition of DataWarehouse Appliances  Original Definition : Hardware + Software – built and supported by a single vendor  Partial Technology Stack : May or may not bundle with other vendors’ hardware and / or operating system 2/9/2011 6
  • 7. Architectural considerations are evolving CONFIDENTIAL - REMOVED 2/9/2011 7
  • 8. Trends: Consolidation in the industry : Small Vendors Vs Infrastructure Providers  Focus is shifting in multiple areas: 1.From whole technology stack to pieces of it 2.From hardware to software 3.From proprietary to commodity hardware 4.From new vendors to infrastructure providers 5.From single to mixed workloads 6.From data marts to enterprise data warehouses • These trends affect the content & capabilities of DWAs, where to get them, how to define them, how to use them. 2/9/2011 8
  • 9. Major differences between the DW appliances  Column vs. Row Storage  Polymorphic Storage (Both Column and Row)  Proprietary and Commodity Hardware  In-Memory Processing  Relationship with Existing Architecture  Shared Nothing Architecture 2/9/2011 9
  • 10. The Challenges in Today’s Data Warehousing Environments  Sources of data and the amount of data to analyze is growing exponentially  Stale data exists because DW solutions cannot ingest the vast amounts of data fast enough  Lack of performance for advanced analytics and complex queries  The number of users and the concurrency of users is increasing rapidly 2/9/2011 10
  • 11. Considerations of DW Solution for the Big Data These are the characteristics you want in your DW solution:  Easily scales to analyze the growing amounts of data  Rapidly ingests large amounts of data from sources  Provides high performance in database analytics  Supports high user concurrency securely, reliably  Handle multiple workloads 2/9/2011 11