Strategies for Integrating with Hadoop

Based on a survey of BI Leadership Forum members, April 2012. Focused on Hadoop adoption.

Published in: Technology, Business

  1. Strategies for Integrating with Hadoop. Survey Results: May 2012. www.bileadership.com
  2. Hadoop adoption rates: No plans 38%; Considering 32%; Experimenting 20%; Implementing 5%; In production 4%. Based on 158 respondents, BI Leadership Forum, April 2012.
  3. Hadoop workloads today: Staging area 92%; Online archive 92%; Transformation engine 83%; Ad hoc queries 58%; Scheduled reports 42%; Visual exploration 25%; Data mining 58%. Based on respondents that have implemented Hadoop. BI Leadership Forum, April 2012.
  4. Hadoop workloads in 18 months (today / in 18 months): Staging area 92% / 92%; Online archive 92% / 92%; Transformation engine 83% / 92%; Ad hoc queries 58% / 67%; Scheduled reports 42% / 67%; Visual exploration 25% / 67%; Data mining 58% / 83%. Based on respondents that have implemented Hadoop. BI Leadership Forum, April 2012.
  5. Hadoop's impact on the data warehouse: Replaces it 0%; Offloads existing workloads 50%; Handles new workloads 67%; Shares existing workloads 33%; Shares new workloads 25%; Don't know 8%. Based on respondents that have implemented Hadoop. BI Leadership Forum, April 2012.
  6. What data does Hadoop support? (today / in 18 months): Web logs 67% / 75%; System logs 67% / 67%; Social media 58% / 75%; Transaction data 92% / 100%; Semi-structured data 58% / 67%; Sensor data 17% / 42%; Audio or video 0% / 42%; Email 25% / 42%; Documents 33% / 50%. Based on respondents that have implemented Hadoop. BI Leadership Forum, April 2012.
  7. Hadoop integration options (diagram; only its labels survive in this transcript). Options: Import/Export; Interoperability; Hybrid; Native. Other labels: Hadoop; Relational; API request; Data set; Data dump; SQL query (ODBC, Hive, HCatalog); SQL input; Tables; MR output; MapReduce (MR query); SQL response; MR app.
  8. Adoption Rate of Hadoop by Non-Implementers: Within 12 months 40%; Within 24 months 22%; Within 36 months 5%; In 3+ years 3%; Not sure 30%; Never 0%. Based on 76 respondents that have not yet implemented Hadoop. BI Leadership Forum, April 2012.
  9. Expected Use of Hadoop by Non-Implementers: Staging area 37%; Online archive 23%; Transformation engine 39%; Ad hoc queries 45%; Scheduled reports 5%; Visual exploration 27%; Data mining 57%; Not sure 23%; Other 5%. Based on respondents that have not yet implemented Hadoop. BI Leadership Forum, April 2012.
  10. Data that Non-Implementers Will Store in Hadoop: Web logs 53%; System logs 33%; Social media 47%; Transaction data 44%; Semi-structured data 50%; Sensor data 24%; Audio or video 8%; Email 18%; Documents 18%; Not sure 11%. Based on respondents that have not yet implemented Hadoop. BI Leadership Forum, April 2012.